openai/mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

[view on github]last commit: Apr 24, 2026
stars
1,539
7d
+6
30d
-
90d
-
## star history
## found in