OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Last commit: Apr 14, 2026
Stars: 9,354 (7d: +28, 30d: -, 90d: -)