awesomer
_
>
home
/
THUDM/AgentBench
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
[view on github]
last commit: Feb 8, 2026
stars
3,327
7d
+17
30d
-
90d
-
## star history
## found in
Awesome Open Source AI/Benchmark Suites