SDXL Clip doesn't get quantized and is set to FP32 #605

Closed
rmatif opened this issue Feb 27, 2025 · 3 comments · Fixed by #607

Comments

@rmatif

rmatif commented Feb 27, 2025

I've noticed that with the release master-9578fdc, and specifically with this commit, the ability to quantize CLIP on SDXL models has been removed, causing it to always load as FP32. This is still the case in the current release.

Log for 9578fdc: log_9578fdc.txt

Log for b5f4932 (the previous release): log_b5f4932.txt

I'm wondering if this change was intentional. If not, I'm surprised that no one has noticed it yet.

@stduhpf
Contributor

stduhpf commented Feb 28, 2025

That's definitely not intentional. Only the VAE is supposed to be forced to f32 for SDXL. I'll look into it.
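
A minimal sketch of that intended behavior (illustrative only, not the actual stable-diffusion.cpp loader code; the `WType` enum, the `pick_tensor_type` name, and the `first_stage_model.` prefix check are assumptions here):

```cpp
#include <string>

// Hypothetical weight types (stand-in for ggml's type enum).
enum class WType { F32, F16, Q8_0, Q4_0 };

// Pick the storage type for a tensor by name.
// `requested` is the type the user asked for (e.g. Q8_0).
WType pick_tensor_type(const std::string& name, WType requested, bool is_sdxl) {
    // Tensors whose names start with "first_stage_model." belong to the VAE
    // in Stable Diffusion checkpoints.
    bool is_vae = name.rfind("first_stage_model.", 0) == 0;
    if (is_sdxl && is_vae) {
        return WType::F32;  // only the VAE is pinned to f32 for SDXL
    }
    return requested;       // CLIP and UNet keep the requested quantization
}
```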

@rmatif
Author

rmatif commented Feb 28, 2025

@stduhpf Thanks for taking a look!

@stduhpf
Contributor

stduhpf commented Feb 28, 2025

Interesting, it only seems to happen when using a .safetensors file. With a .gguf, the clip models are quantized as expected, which might explain why it went unnoticed before...
