Releases: JamePeng/llama-cpp-python
Releases · JamePeng/llama-cpp-python
v0.3.8-cu126-win-20250424
Sync mtmd : Support Pixtral 12B (#13065)
Sync quantize: Handle user-defined quantization levels for additional tensors (#12511)
Sync llama : Support llama 4 text-only
Update llama : add option to override model tensor buffers
Sync llama-vocab : add SuperBPE pre-tokenizer
class LlamaSampler: append add_xtc(), add_top_n_sigma() and add_dry()
v0.3.8-cu124-win-20250424
Sync mtmd : Support Pixtral 12B (#13065)
Sync quantize: Handle user-defined quantization levels for additional tensors (#12511)
Sync llama : Support llama 4 text-only
Update llama : add option to override model tensor buffers
Sync llama-vocab : add SuperBPE pre-tokenizer
class LlamaSampler: append add_xtc(), add_top_n_sigma() and add_dry()