michaelfeil/infinity
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali
[view on github]last commit: Mar 23, 2026
stars
2,757
7d
+7
30d
-
90d
-
## star history
Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali