Pinned Loading
-
harbor-framework/terminal-bench-3
harbor-framework/terminal-bench-3 PublicMeasuring agents' ability to get work done on a computer
-
agentic-labs/lsproxy
agentic-labs/lsproxy PublicMulti-language code navigation API in a container
-
agentic-labs/erp-bench
agentic-labs/erp-bench PublicERP-Bench: benchmark tasks and agent harness for ERP workflows
Python
-
agentic-labs/pi-mono
agentic-labs/pi-mono PublicForked from earendil-works/pi
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
TypeScript
-
agentic-labs/terminal-bench-3
agentic-labs/terminal-bench-3 PublicForked from harbor-framework/terminal-bench-3
🚧 Accepting Task Submissions 🚧
Python 1
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



