THUDM/AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

[view on github]last commit: Feb 8, 2026
stars
3,327
7d
+17
30d
-
90d
-
## star history
## found in