THUDM/AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

last commit: Feb 8, 2026
stars: 3,447 (7d: +15, 30d: +86, 90d: +276)