xorbitsai/inference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

[view on github]last commit: May 23, 2026
stars
9,309
7d
+3
30d
+48
90d
+289
## star history
## found in