openai/mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering.

last commit: Apr 24, 2026

stars: 1,588 (7d: +10, 30d: +54, 90d: -)

## found in

Awesome Open Source AI/Benchmark Suites