huggingface/nanotron

Minimalistic large language model 3D-parallelism training

[view on github]last commit: Apr 7, 2026
stars
2,649
7d
+7
30d
-
90d
-
## star history
## found in