Commit 17d1c72

Update spqr.md
1 parent 96b2613 commit 17d1c72

File tree

1 file changed: +2 -2 lines changed


docs/source/en/quantization/spqr.md

Lines changed: 2 additions & 2 deletions
@@ -16,7 +16,7 @@ rendered properly in your Markdown viewer.
 
 # SpQR
 
-[SpQR](https://github.com/Vahe1994/SpQR) quantization algorithm involves a 16x16 tile, 3-bit configuration, and unstructured sparsity as detailed in [SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression](https://arxiv.org/abs/2306.03078).
+[SpQR](https://github.com/Vahe1994/SpQR) quantization algorithm involves a 16x16 tiled bi-level group 3-bit quantization structure, with sparse outliers as detailed in [SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression](https://arxiv.org/abs/2306.03078).
 
 To SpQR-quantize a model, refer to the [Vahe1994/SpQR](https://github.com/Vahe1994/SpQR) repository.
 
@@ -32,4 +32,4 @@ quantized_model = AutoModelForCausalLM.from_pretrained(
     device_map="auto"
 )
 tokenizer = AutoTokenizer.from_pretrained("elvircrn/Llama-2-7b-SPQR-3Bit-16x16-red_pajama-hf")
-```
+```
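For reference, a minimal sketch of how the snippet quoted in the second hunk would be used end to end. The model ID comes from the diff itself; the imports, the `torch_dtype=torch.half` argument, and the generation step are assumptions added here for illustration and are not part of the commit.

```python
# Sketch (not part of the commit): load and run the SpQR-quantized checkpoint
# referenced in the diff above.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "elvircrn/Llama-2-7b-SPQR-3Bit-16x16-red_pajama-hf"

# Load the pre-quantized SpQR model; device_map="auto" places it on available devices.
quantized_model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.half,  # assumption: half precision for the non-quantized parts
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Short generation to check that the quantized weights load and run.
inputs = tokenizer("SpQR compresses LLM weights by", return_tensors="pt").to(quantized_model.device)
outputs = quantized_model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```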
