gpustack/gpustack
A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.
[view on github]last commit: Apr 15, 2026
stars
4,846
7d
+48
30d
-
90d
-
## star history
A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.