forked from abetlen/llama-cpp-python
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit bf766bd
merge test (#2)
* feat: add support for KV cache quantization options (abetlen#1307)
* add KV cache quantization options
abetlen#1220
abetlen#1305
* Add ggml_type
* Use ggml_type instead of string for quantization
* Add server support
---------
Co-authored-by: Andrei Betlen <[email protected]>
* fix: Changed local API doc references to hosted (abetlen#1317)
* chore: Bump version
* fix: last tokens passing to sample_repetition_penalties function (abetlen#1295)
Co-authored-by: ymikhaylov <[email protected]>
Co-authored-by: Andrei <[email protected]>
* feat: Update llama.cpp
* fix: segfault when logits_all=False. Closes abetlen#1319
* feat: Binary wheels for CPU, CUDA (12.1 - 12.3), Metal (abetlen#1247)
* Generate binary wheel index on release
* Add total release downloads badge
* Update download label
* Use official cibuildwheel action
* Add workflows to build CUDA and Metal wheels
* Update generate index workflow
* Update workflow name
* feat: Update llama.cpp
* chore: Bump version
* fix(ci): use correct script name
* docs: LLAMA_CUBLAS -> LLAMA_CUDA
* docs: Add docs explaining how to install pre-built wheels.
* docs: Rename cuBLAS section to CUDA
* fix(docs): incorrect tool_choice example (abetlen#1330)
* feat: Update llama.cpp
* fix: missing logprobs in response, incorrect response type for functionary, minor type issues. Closes abetlen#1328 abetlen#1314
* fix: missing logprobs in response, incorrect response type for functionary, minor type issues. Closes abetlen#1328 Closes abetlen#1314
* feat: Update llama.cpp
* fix: Always embed metal library. Closes abetlen#1332
* feat: Update llama.cpp
* chore: Bump version
---------
Co-authored-by: Limour <[email protected]>
Co-authored-by: Andrei Betlen <[email protected]>
Co-authored-by: lawfordp2017 <[email protected]>
Co-authored-by: Yuri Mikhailov <[email protected]>
Co-authored-by: ymikhaylov <[email protected]>
Co-authored-by: Sigbjørn Skjæret <[email protected]>1 parent 76b51c3 commit bf766bdCopy full SHA for bf766bd
File tree
0 file changed
+0
-0
lines changedFilter options
0 file changed
+0
-0
lines changed
0 commit comments