casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
[view on github]last commit: May 11, 2025
stars
2,320
7d
-
30d
-
90d
-
## star history
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation: