casper-hansen/AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

[view on github]last commit: May 11, 2025
stars
2,320
7d
-
30d
-
90d
-
## star history
## found in