Skip to content

Releases: ggml-org/llama.cpp

b5124

12 Apr 16:17
bc091a4
Compare
Choose a tag to compare
common : Define cache directory on AIX (#12915)

b5123

12 Apr 09:31
a483757
Compare
Choose a tag to compare
vulkan: use aligned loads for flash attention mask (#12853)

Rewrite the stride logic for the mask tensor in the FA shader to force the
stride to be aligned, to allow using more efficient loads.

b5122

12 Apr 06:14
e59ea53
Compare
Choose a tag to compare
llava: Fix cpu-only clip image encoding sefault (#12907)

* llava: Fix cpu-only clip image encoding

* clip : no smart ptr for ggml_backend_t

* Fix for backend_ptr push_back

---------

Co-authored-by: Xuan Son Nguyen <[email protected]>

b5121

11 Apr 22:22
c94085d
Compare
Choose a tag to compare
server : add VSCode's Github Copilot Chat support (#12896)

* server : add VSCode's Github Copilot Chat support

* cont : update handler name

b5120

11 Apr 22:08
e8a6263
Compare
Choose a tag to compare
rpc : Set cache directory in rpc-server.cpp on FreeBSD (#12903)

b5119

11 Apr 21:57
b6930eb
Compare
Choose a tag to compare
`tool-call`: fix non-tool-calling grammar crashes w/ Qwen / Hermes 2 …

b5118

11 Apr 21:14
68b08f3
Compare
Choose a tag to compare
common : Define cache directory on FreeBSD (#12892)

b5117

11 Apr 14:30
578754b
Compare
Choose a tag to compare
sycl: Support sycl_ext_oneapi_limited_graph (#12873)

The current usage of the SYCL-Graph extension checks for
the `sycl_ext_oneapi_graph` device aspect. However, it is also
possible to support `sycl_ext_oneapi_limied_graph` devices that
don't support update

b5116

11 Apr 13:57
b2034c2
Compare
Choose a tag to compare
contrib: support modelscope community (#12664)

* support download from modelscope

* support login

* remove comments

* add arguments

* fix code

* fix win32

* test passed

* fix readme

* revert readme

* change to MODEL_ENDPOINT

* revert tail line

* fix readme

* refactor model endpoint

* remove blank line

* fix header

* fix as comments

* update comment

* update readme

---------

Co-authored-by: tastelikefeet <yuze.zyz@alibaba-inc/com>

b5115

11 Apr 13:55
06bb53a
Compare
Choose a tag to compare
llama-model : add Glm4Model implementation for GLM-4-0414 (#12867)

* GLM-4-0414

* use original one

* Using with tensor map

* fix bug

* change order

* change order

* format with flask8