Update base to transformers v4.30.2 #81

bfineran · 2023-06-15T19:52:59Z

rebases NM changes from base version v4.23.1 to v4.30.2

* Update trainer and model flows to accommodate sparseml Disable FP16 on QAT start (#12) * Override LRScheduler when using LRModifiers * Disable FP16 on QAT start * keep wrapped scaler object for training after disabling Using QATMatMul in DistilBERT model class (#41) Removed double quantization of output of context layer. (#45) Fix DataParallel validation forward signatures (#47) * Fix: DataParallel validation forward signatures * Update: generalize forward_fn selection Best model after epoch (#46) fix sclaer check for non fp16 mode in trainer (#38) Mobilebert QAT (#55) * Remove duplicate quantization of vocabulary. enable a QATWrapper for non-parameterized matmuls in BERT self attention (#9) * Utils and auxillary changes update Zoo stub loading for SparseZoo 1.1 refactor (#54) add flag to signal NM integration is active (#32) Add recipe_name to file names * Fix errors introduced in manual cherry-pick upgrade Co-authored-by: Benjamin Fineran <[email protected]>

* add GHA workflow files to build nightly and release packages * fix name --------- Co-authored-by: dhuang <[email protected]>

Co-authored-by: dhuang <[email protected]>

dbogunowicz · 2023-06-16T12:47:35Z

Tested with the current main of deepsparse

make test green
testing the commands from https://github.com/neuralmagic/deepsparse/blob/main/src/deepsparse/transformers/README.md

KSGulin · 2023-06-16T13:14:40Z

Tested with:

Sparse transfer learn commands from Add accelerate package dep for transformers sparseml#1633
make testinteg TARGETS=transformers

* Add recipe_name to default file names * Upgrade to transformers release V4.30.2 (#62) * Update trainer and model flows to accommodate sparseml Disable FP16 on QAT start (#12) * Override LRScheduler when using LRModifiers * Disable FP16 on QAT start * keep wrapped scaler object for training after disabling Using QATMatMul in DistilBERT model class (#41) Removed double quantization of output of context layer. (#45) Fix DataParallel validation forward signatures (#47) * Fix: DataParallel validation forward signatures * Update: generalize forward_fn selection Best model after epoch (#46) fix sclaer check for non fp16 mode in trainer (#38) Mobilebert QAT (#55) * Remove duplicate quantization of vocabulary. enable a QATWrapper for non-parameterized matmuls in BERT self attention (#9) * Utils and auxillary changes update Zoo stub loading for SparseZoo 1.1 refactor (#54) add flag to signal NM integration is active (#32) Add recipe_name to file names * Fix errors introduced in manual cherry-pick upgrade Co-authored-by: Benjamin Fineran <[email protected]> * update build versions for NM fork pypi push (#74) * fix nightly package name (#75) * add make build command (#76) * add GHA workflow files to build nightly and release packages (#77) * add GHA workflow files to build nightly and release packages * fix name --------- Co-authored-by: dhuang <[email protected]> * bump up version to 1.6.0 (#79) Co-authored-by: dhuang <[email protected]> --------- Co-authored-by: Konstantin <[email protected]> Co-authored-by: Konstantin Gulin <[email protected]> Co-authored-by: dhuangnm <[email protected]> Co-authored-by: dhuang <[email protected]>

KSGulin and others added 7 commits June 13, 2023 16:50

Add recipe_name to default file names

ccfa243

update build versions for NM fork pypi push (#74)

f767a5f

fix nightly package name (#75)

e7dca16

add make build command (#76)

61c3aae

add GHA workflow files to build nightly and release packages (#77)

790a385

* add GHA workflow files to build nightly and release packages * fix name --------- Co-authored-by: dhuang <[email protected]>

bump up version to 1.6.0 (#79)

4054a5b

Co-authored-by: dhuang <[email protected]>

bfineran requested review from rahul-tuli, KSGulin and dbogunowicz June 15, 2023 19:52

bfineran self-assigned this Jun 15, 2023

KSGulin changed the base branch from main to upstream-v4.30.2-release-copy June 16, 2023 11:49

KSGulin mentioned this pull request Jun 16, 2023

Add accelerate package dep for transformers neuralmagic/sparseml#1633

Merged

KSGulin approved these changes Jun 16, 2023

View reviewed changes

dbogunowicz approved these changes Jun 16, 2023

View reviewed changes

KSGulin merged commit 0798c9e into upstream-v4.30.2-release-copy Jun 19, 2023

dbogunowicz deleted the rebase-upstream-4.30.2 branch December 5, 2023 10:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update base to transformers v4.30.2 #81

Update base to transformers v4.30.2 #81

Uh oh!

bfineran commented Jun 15, 2023

Uh oh!

dbogunowicz commented Jun 16, 2023 •

edited

Loading

Uh oh!

KSGulin commented Jun 16, 2023 •

edited

Loading

Uh oh!

Uh oh!

Update base to transformers v4.30.2 #81

Update base to transformers v4.30.2 #81

Uh oh!

Conversation

bfineran commented Jun 15, 2023

Uh oh!

dbogunowicz commented Jun 16, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

KSGulin commented Jun 16, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

dbogunowicz commented Jun 16, 2023 •

edited

Loading

KSGulin commented Jun 16, 2023 •

edited

Loading