openai/mle-bench
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering.
last commit: Apr 24, 2026
stars: 1,539 (7d: +6; 30d: -; 90d: -)