xorbitsai/inference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

[view on github]last commit: Apr 15, 2026
stars
9,230
7d
+18
30d
-
90d
-
## star history
## found in