Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

musa: enable freediskspace for docker image build devops improvements to build systems and github actions
#12839 opened Apr 9, 2025 by yeahdongcn Loading…
convert : write tensors in parallel performance Speed related topics python python script changes
#12837 opened Apr 8, 2025 by compilade Loading…
1 of 5 tasks
vulkan: In coopmat2 mmq, load q4_k/q5_k scales through shared memory ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12833 opened Apr 8, 2025 by jeffbolznv Loading…
clip : do not print ftype examples
#12832 opened Apr 8, 2025 by ngxson Loading…
Fixes #12823 ggml changes relating to the ggml tensor library for machine learning
#12830 opened Apr 8, 2025 by mehendarkarprajwal Loading…
Add AVX512 implementation of GEMM - q4kx8 ggml changes relating to the ggml tensor library for machine learning
#12829 opened Apr 8, 2025 by Srihari-mcw Loading…
Support Qwen3 and Qwen3MoE python python script changes
#12828 opened Apr 8, 2025 by bozheng-hit Loading…
ci: detach common from the library examples server testing Everything test related
#12827 opened Apr 8, 2025 by pminev Loading…
common: add partial regex support examples server testing Everything test related
#12808 opened Apr 7, 2025 by ochafik Draft
ci: fix cross-compile sync issues devops improvements to build systems and github actions
#12804 opened Apr 7, 2025 by bandoti Loading…
server: inject date_string in llama 3.x template + fix date for firefunction v2 examples python python script changes server testing Everything test related
#12802 opened Apr 7, 2025 by ochafik Loading…
DeepSeek V2/V3 MLA implementation python python script changes
#12801 opened Apr 7, 2025 by jukofyork Loading…
opencl: fix couple crashes ggml changes relating to the ggml tensor library for machine learning
#12795 opened Apr 7, 2025 by linehill Loading…
Support for OuteTTS 1.0 examples python python script changes
#12794 opened Apr 7, 2025 by edwko Draft
SYCL: Add fp16 type support to unary op kernels ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12788 opened Apr 7, 2025 by qnixsynapse Loading…
[CANN]Support Opt CONV_TRANSPOSE_1D and ELU Ascend NPU issues specific to Ascend NPUs devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning
#12786 opened Apr 7, 2025 by noemotiovon Loading…
vulkan: Use fp16 for the flash attention P*V multiplication ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#12783 opened Apr 6, 2025 by jeffbolznv Loading…
ggml: use _mm[512/256]_dpbusd[_avx]_epi32 to directly accumulate into the result register ggml changes relating to the ggml tensor library for machine learning
#12773 opened Apr 5, 2025 by SongXiaoXi Loading…
Added all CPU to Docker GPU images for 'token_embd.weight' compatibility devops improvements to build systems and github actions
#12749 opened Apr 4, 2025 by rudiservo Loading…
(wip) support ultravox audio input examples python python script changes
#12745 opened Apr 3, 2025 by ngxson Draft
Update llama-quant.cpp llama_tensor_get_type with DeepSeek friendly modifications ggml changes relating to the ggml tensor library for machine learning
#12727 opened Apr 3, 2025 by bartowski1182 Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.