NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

[view on github]last commit: Apr 15, 2026
stars
16,053
7d
+79
30d
-
90d
-
## star history
## found in