jd-opensource/xllm

A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.

last commit: May 23, 2026
stars: 1,298
7d: +21
30d: -
90d: -
## found in