turboderp/exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Last commit: Mar 4, 2026
Stars: 4,532 (7d: +9, 30d: +28, 90d: +106)
## found in