turboderp/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
last commit: Mar 4, 2026
stars: 4,532 (7d: +9, 30d: +28, 90d: +106)
## star history