michaelfeil/infinity
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
[view on github]last commit: Mar 23, 2026
stars
2,805
7d
+4
30d
+32
90d
+135
## star history
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali