THUDM/slime

slime is an LLM post-training framework for RL Scaling.

[view on github]last commit: May 23, 2026
stars
5,769
7d
+44
30d
+299
90d
+1,406
## star history
## found in