NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

[view on github]last commit: May 24, 2026
stars
16,434
7d
+48
30d
+277
90d
+1,147
## star history
## found in