turboderp/exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

last commit: Mar 4, 2026
stars: 4,483 (7d: -, 30d: -, 90d: -)
## found in