ISTA-DASLab/Llama-2-7b-AQLM-1Bit-1x8-hf
1B
•
Updated
•
33
•
1
None defined yet.
WUSH: Near-Optimal Adaptive Transforms for LLM Quantization
CAGE: Curvature-Aware Gradient Estimation For Accurate Quantization-Aware Training