You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[auto] Sync version 2404081813.0.0+llamacpp-release.b2632
== Relevant log messages from source repo:
commit b73e564b16086845a8b4fffd26e22685d3e0c3db
Author: Georgi Gerganov <[email protected]>
Date: Mon Apr 8 16:23:01 2024 +0300
quantize : fix precedence of cli args (#6541)
commit e3c337d87ca650972105a51c6ce302dd236c07ad
Author: Rick G <[email protected]>
Date: Mon Apr 8 06:02:30 2024 -0700
llama : support negative ith in llama_get_ API (#6519)
* llama_sampling_sample with default args is more naively usable
* Batches populated by either llama_batch_get_one or llama_batch_add work with default args
* Previously get_one could use the default argument
* Previously add should usually have used the last index where logits[idx] == true
* This hopefully encourages the use of llama_batch_add
* By giving expected results when using default arguments.
* Adds "negative indexing" feature to llama_get_logits_ith and llama_get_embeddings_ith
* Believed to work with any currently well behaved program
* Default arg now works for both cases (previously would give strange results for add case)
* Any non-negative number is unaffected and behaves as previously
* Negative arguments were previously invalid.
* Implemented as a special case of indexing as suggested by @compilade in ggml-org/llama.cpp#6519
* Fixed mismatch type errors
* cited in macOS CI tests
* Missed in original updates based on PR feedback in ggml-org/llama.cpp#6519
commit beea6e1b16e783a0886e78dec01002a8c00db24d
Author: Jan Boon <[email protected]>
Date: Mon Apr 8 20:43:30 2024 +0800
llama : save and restore kv cache for single seq id (#6341)
* llama : save and restore kv cache for single seq id
* remove trailing whitespace
* respond error in case there's no space in the kv cache
* add kv seq save restore to test case
* add --slot-save-path arg to enable save restore and restrict save location
* Returning 0 for some cases, instead of asserting.
* cleanup error cases
* rename sequence state functions
* rename state get set functions
* add previous function names back in with DEPRECATED notice
* update doc
* adjust endpoints to preferred style
* fix restoring zero cell count
* handle seq rm return value
* unused param
* keep in the size check
* fix return types
* add server test case for slot save restore
* cleanup
* add cake
* cleanup style
* add special
* removing a whole sequence never fails
* move sequence state file functionality from server to llama to match session api and add version tags
* catch exceptions on save as well
* error log messages
* check types for stricter restore
* update server doc
* readme : update API changes date
* strict filename validation
* move include, reject bom as well
* also reject empty filename
* reject whitespace and trailing dot
---------
Co-authored-by: Martin Evans <[email protected]>
Co-authored-by: Georgi Gerganov <[email protected]>
0 commit comments