Skip to content

CUDA: quantized KV support for FA vec#7527

Merged
JohannesGaessler merged 14 commits intoggml-org:masterfrom
JohannesGaessler:cuda-fattn-vec-quant-3
Jun 1, 2024

Commits

Commits on May 27, 2024

Commits on May 29, 2024

Commits on May 31, 2024