You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[Text Generation] Support for causal masks, internal KV cache, and initial testing framework (#1172)
* initial commit
* improved logic
* additional improvements
* Update src/deepsparse/transformers/pipelines/text_generation.py
* Update src/deepsparse/utils/onnx.py
Co-authored-by: Benjamin Fineran <[email protected]>
* Update src/deepsparse/utils/onnx.py
Co-authored-by: Benjamin Fineran <[email protected]>
* response to Ben's comments
* finish rebasing
* update user messages + add assertion for safety
* minor improvements before landing
* Fix the helper function that has been broken after a merge
* [Text Generation] Internal KV Cache Support + Initial Testing Framework (#1163)
* Create test_nl_decoder_engine.py
* [Text Generation][Tests] DecoderKVCache (#1154)
* [Text Generation][Tests] NLDecoderEngine (#1155)
* initial commit
* initial commit
* [Text Generation][Tests] Text Generation Pipeline (#1162)
* initial implementation
* problems with multitoken prefill
* almost there...
* finally all tests pass
* just need to change to stub
* fix bad merge
* Make tests work with stub (as much as possible), cleanup test names, disable heavy tests, include patch for running without causal mask
* use patch from unittest library - remove additional dependency
* Update tests/deepsparse/transformers/pipelines/test_text_generation.py
* clarify todo comment
* [Text Generation] KV Cache internal Deepsparse support (#1135)
* fix kv cache
* refactor
* add validation pathway
* avx2 support
* initial commit
* initial commit
* initial implementation
* problems with multitoken prefill
* its working
* almost there...
* finally all tests pass
* just need to change to stub
* fix bad merge
* added some tests
* ready for review
* full support
---------
Co-authored-by: dbogunowicz <[email protected]>
Co-authored-by: Damian <[email protected]>
* incomplete string in parametrize
* few nits before the merge
---------
Co-authored-by: Benjamin Fineran <[email protected]>
Co-authored-by: Sage Moore <[email protected]>
---------
Co-authored-by: Benjamin Fineran <[email protected]>
Co-authored-by: Sage Moore <[email protected]>
0 commit comments