huggingface/trl

Train transformer language models with reinforcement learning.

last commit: Apr 15, 2026
stars: 18,058 (7d: +80; 30d: -; 90d: -)
## found in