huggingface/nanotron

Minimalistic large language model 3D-parallelism training

Last commit: Apr 7, 2026
Stars: 2,699 (7d: +5, 30d: +33, 90d: +131)
## found in