huggingface/trl

Train transformer language models with reinforcement learning.

[view on github]last commit: May 22, 2026
stars
18,457
7d
+45
30d
+292
90d
+1,056
## star history
## found in