Update 4.7.0 -> 4.18.0 #37

KSGulin · 2022-03-10T17:54:51Z

No description provided.

* Make sure custom configs work with Transformers * Apply code review suggestions

* Add Wav2Vec2 Adapter Weights to Flax * Suggested changes

* typical decoding * changing arg name * add test config params * forgotten arg rename * fix edge case where scores are same * test for typical logits warper * code quality fixes

…5416) * added classes to get started with constrained beam search * in progress, think i can directly force tokens now but not yet with the round robin * think now i have total control, now need to code the bank selection * technically works as desired, need to optimize and fix design choices leading to undersirable outputs * complete PR #1 without disjunctive decoding * removed incorrect tests * Delete k.txt * Delete test.py * Delete test.sh * revert changes to test scripts * genutils * full implementation with testing, no disjunctive yet * shifted docs * passing all tests realistically ran locally * removing accidentally included print statements * fixed source of error in initial PR test * fixing the get_device() vs device trap * fixed documentation docstrings about constrained_beam_search * fixed tests having failing for Speech2TextModel's floating point inputs * fix cuda long tensor * added examples and testing for them and founx & fixed a bug in beam_search and constrained_beam_search * deleted accidentally added test halting code with assert False * code reformat * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <[email protected]> * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <[email protected]> * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <[email protected]> * Update tests/test_generation_utils.py Co-authored-by: Patrick von Platen <[email protected]> * Update tests/test_generation_utils.py * fixing based on comments on PR * took out the testing code that should but work fails without the beam search moditification ; style changes * fixing comments issues * docstrings for ConstraintListState * typo in PhrsalConstraint docstring * docstrings improvements Co-authored-by: Patrick von Platen <[email protected]>

* Expose hub test problem * Fix tests

Co-authored-by: ydshieh <[email protected]>

* [trainer docs] document how to select specific gpus * expand * add urls * add accelerate launcher

Co-authored-by: Niels Rogge <[email protected]>

* Expand tutorial for custom models * Style * Apply suggestions from code review Co-authored-by: Lysandre Debut <[email protected]> Co-authored-by: Lysandre Debut <[email protected]>

* Add TensorFlow support for ONNX export * Change documentation to mention conversion with Tensorflow * Refactor export into export_pytorch and export_tensorflow * Check model's type instead of framework installation to choose between TF and Pytorch Co-authored-by: Lysandre Debut <[email protected]> Co-authored-by: Alberto Bégué <[email protected]> Co-authored-by: lewtun <[email protected]>

…gface#14139) (huggingface#15175) * Compute loss independent from decoder (as 14139) * fix expected seq_len + style * Apply the same change to TFVisionEncoderDecoderModel * fix style * Add case with labels in equivalence test * uncomment * Add case with labels in equivalence test * add decoder_token_labels * use hf_compute_loss * Apply suggestions from code review Co-authored-by: NielsRogge <[email protected]> * Add copied from Co-authored-by: ydshieh <[email protected]> Co-authored-by: NielsRogge <[email protected]>

Co-authored-by: Niels Rogge <[email protected]>

) * Add local and TensorFlow ONNX export examples to docs * Use PyTorch - TensorFlow split

…environment (huggingface#15625)

…ace#15612) * Add informative warning

* Fix TF MT5 vocab resize * more assertive testing

…#15629)

* [deepspeed docs] round_robin_gradients * training and/or eval/predict loss is * Update docs/source/main_classes/deepspeed.mdx Co-authored-by: Sylvain Gugger <[email protected]> Co-authored-by: Sylvain Gugger <[email protected]>

* Move generate docs * up * Update docs/source/_toctree.yml * correct * correct some stuff * correct tests * more fixes * finish generate * add to doc stest * finish * finalize * add warning to generate method

* Add attentions_option to common tester * Fix tests, apply suggestion * Apply suggestion from code review Co-authored-by: Niels Rogge <[email protected]>

* Fix Bug in Flax-Speech-Encoder-Decoder Test * change thresholds for CPU precision

* fix Co-authored-by: ydshieh <[email protected]>

* Build the doc in a seperate folder then move it * Allow job * Is this it? * Dislike comments? * Copy instead of move * Removing version built * Typos * No variable * Take _versions.yml into account * Finish main job and add dev job * Forgot the run * Fix syntax error * Execute builder from the repo * Typo

* MVP * apply decorator to TFBertModel * finish updating bert * update rembert (copy-linked to bert) * update roberta (copy-linked to bert); Fix args * Now working for non-text modalities

* Fix Bug in Flax Seq2Seq Models * incorporate suggested changes

* Support for torch 1.11 * Address Sylvain's comment

review-notebook-app · 2022-03-10T17:55:03Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

* support not sharing embeddings * update modeling * update tokenizer * fix conversion script * always use self.shared * boom boom * begin tests * update tests * fix resize_decoder_token_embeddings * address Patrick's comments * style * update conversion script * fix conversion script * fix tokenizer * better name target vocab * add integration test for tokenizer with two vocabs * style * address Patrick's comments * add integration test for model

…gface#16045) * Fix duplicate arguments passed to dummy inputs in ONNX export * Fix M2M100 ONNX config * Ensure we check PreTrained model only if torch is available * Remove TensorFlow tests for models without PyTorch parity

sgugger and others added 30 commits February 9, 2022 10:04

Make sure custom configs work with Transformers (huggingface#15569)

1f60bc4

* Make sure custom configs work with Transformers * Apply code review suggestions

Add Wav2Vec2 Adapter Weights to Flax (huggingface#15566)

9e00566

* Add Wav2Vec2 Adapter Weights to Flax * Suggested changes

Upgrade click version (huggingface#15579)

7029240

[Flax tests/FlaxBert] make from_pretrained test faster (huggingface#1…

f588cf4

…5561)

Add implementation of typical sampling (huggingface#15504)

0113aae

* typical decoding * changing arg name * add test config params * forgotten arg rename * fix edge case where scores are same * test for typical logits warper * code quality fixes

Trigger doc build

eed3186

Fix quality

b1ba03e

Fix tests hub failure (huggingface#15580)

315e674

* Expose hub test problem * Fix tests

update serving_output for some TF models (huggingface#15568)

2584808

Co-authored-by: ydshieh <[email protected]>

[trainer docs] document how to select specific gpus (huggingface#15551)

dee17d5

* [trainer docs] document how to select specific gpus * expand * add urls * add accelerate launcher

Add link (huggingface#15588)

a86ee22

Co-authored-by: Niels Rogge <[email protected]>

Expand tutorial for custom models (huggingface#15587)

c722753

* Expand tutorial for custom models * Style * Apply suggestions from code review Co-authored-by: Lysandre Debut <[email protected]> Co-authored-by: Lysandre Debut <[email protected]>

Make slow tests slow

644ec05

Reformat tokenization_fnet

e923917

Add example batch size to all commands (huggingface#15596)

3d5dea9

Fix Seq2SeqTrainer (huggingface#15603)

3a2ed96

Co-authored-by: Niels Rogge <[email protected]>

Add local and TensorFlow ONNX export examples to docs (huggingface#15604

2e8b85f

) * Add local and TensorFlow ONNX export examples to docs * Use PyTorch - TensorFlow split

Correct JSON format (huggingface#15600)

c0864d9

[Generate] Small refactor (huggingface#15611)

45c7b5b

Mark "code in the Hub" API as experimental (huggingface#15624)

6cf06d1

Enable ONNX export when PyTorch and TensorFlow installed in the same …

7e4844f

…environment (huggingface#15625)

TF: Add informative warning for inexistent CPU backprop ops (huggingf…

3fae83d

…ace#15612) * Add informative warning

Rebase (huggingface#15606)

8c03df1

TF MT5 embeddings resize (huggingface#15567)

2f40c72

* Fix TF MT5 vocab resize * more assertive testing

🖍 remove broken link (huggingface#15615)

85aee09

Fix _configuration_file argument getting passed to model (huggingface…

2dce350

…#15629)

[deepspeed docs] misc additions (huggingface#15585)

f15c99f

* [deepspeed docs] round_robin_gradients * training and/or eval/predict loss is * Update docs/source/main_classes/deepspeed.mdx Co-authored-by: Sylvain Gugger <[email protected]> Co-authored-by: Sylvain Gugger <[email protected]>

patrickvonplaten and others added 10 commits March 10, 2022 11:54

[Docs] Improve PyTorch, Flax generate API (huggingface#15988)

6ce11c2

* Move generate docs * up * Update docs/source/_toctree.yml * correct * correct some stuff * correct tests * more fixes * finish generate * add to doc stest * finish * finalize * add warning to generate method

[Tests] Add attentions_option to ModelTesterMixin (huggingface#15909)

8d83ebd

* Add attentions_option to common tester * Fix tests, apply suggestion * Apply suggestion from code review Co-authored-by: Niels Rogge <[email protected]>

[README] fix url for Preprocessing tutorial (huggingface#16042)

b2a1c99

Fix Bug in Flax-Speech-Encoder-Decoder Test (huggingface#16041)

1da84ae

* Fix Bug in Flax-Speech-Encoder-Decoder Test * change thresholds for CPU precision

Fix TFDebertaV2ConvLayer in TFDebertaV2Model (huggingface#16031)

2f463ef

* fix Co-authored-by: ydshieh <[email protected]>

Don't compute metrics in LM examples on TPU (huggingface#16029)

1959799

TF: Unpack model inputs through a decorator (huggingface#15907)

b7018ab

* MVP * apply decorator to TFBertModel * finish updating bert * update rembert (copy-linked to bert) * update roberta (copy-linked to bert); Fix args * Now working for non-text modalities

Fix Bug in Flax Seq2Seq Models (huggingface#16021)

741e493

* Fix Bug in Flax Seq2Seq Models * incorporate suggested changes

DeBERTa/DeBERTa-v2/SEW Support for torch 1.11 (huggingface#16043)

e66743e

* Support for torch 1.11 * Address Sylvain's comment

patil-suraj and others added 2 commits March 10, 2022 19:41

KSGulin force-pushed the update-4.18.0-refactor branch from 0647141 to 8cac95f Compare March 10, 2022 19:51

KSGulin changed the title ~~[Test] different update approach~~ Update 4.7.0 -> 4.18.0d Mar 10, 2022

KSGulin changed the title ~~Update 4.7.0 -> 4.18.0d~~ Update 4.7.0 -> 4.18.0 Mar 10, 2022

Update: Add QatMatMul and sync with update

7d1bb5f

KSGulin force-pushed the update-4.18.0-refactor branch from 2d6d005 to 7d1bb5f Compare March 10, 2022 20:28

Merge remote-tracking branch 'origin/master' into update-4.18.0-refactor

454ea60

KSGulin requested review from a team, natuan, markurtz and horheynm and removed request for a team March 11, 2022 12:10

KSGulin self-assigned this Mar 11, 2022

natuan approved these changes Mar 14, 2022

View reviewed changes

natuan requested a review from spacemanidol March 14, 2022 17:51

spacemanidol approved these changes Mar 15, 2022

View reviewed changes

spacemanidol merged commit 5d1246c into master Mar 15, 2022

spacemanidol deleted the update-4.18.0-refactor branch March 15, 2022 01:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update 4.7.0 -> 4.18.0 #37

Update 4.7.0 -> 4.18.0 #37

Uh oh!

KSGulin commented Mar 10, 2022

Uh oh!

review-notebook-app bot commented Mar 10, 2022

Uh oh!

Uh oh!

Update 4.7.0 -> 4.18.0 #37

Update 4.7.0 -> 4.18.0 #37

Uh oh!

Conversation

KSGulin commented Mar 10, 2022

Uh oh!

review-notebook-app bot commented Mar 10, 2022

Uh oh!

Uh oh!