This repository was archived by the owner on Jul 7, 2023. It is now read-only.

v1.3.1 #451

Merged
merged 11 commits into from
Dec 1, 2017

Conversation

@rsepassi (Contributor) commented Dec 1, 2017

No description provided.

T2T Team and others added 11 commits December 1, 2017 09:17
…PU training. Modify transformer to keep the packed-together examples from attending to one another.

PiperOrigin-RevId: 177481956
…int compatibility bug.

PiperOrigin-RevId: 177487398
PiperOrigin-RevId: 177505082
…mprovements/fixes

PiperOrigin-RevId: 177538074
PiperOrigin-RevId: 177540047
PiperOrigin-RevId: 177547599
PiperOrigin-RevId: 177554962
PiperOrigin-RevId: 177641254
@googlebot

We found a Contributor License Agreement for you (the sender of this pull request), but were unable to find agreements for the commit author(s). If you authored these, maybe you used a different email address in the git commits than was used to sign the CLA (login here to double check)? If these were authored by someone else, then they will need to sign a CLA as well, and confirm that they're okay with these being contributed to Google.
In order to pass this check, please resolve this problem and have the pull request author add another comment and the bot will run again.

@@ -24,7 +24,6 @@
     'tensor2tensor/bin/t2t-datagen',
     'tensor2tensor/bin/t2t-decoder',
     'tensor2tensor/bin/t2t-make-tf-configs',
-    'tensor2tensor/bin/t2t-bleu',
Contributor

Is the deletion of t2t-bleu intentional?
(I can understand it, if yes.)

Contributor Author

Thanks for noticing that. No, it was not intentional and likely due to a bad internal merge of the PR.

@lukaszkaiser
Contributor

Sorry guys, it was me, and it was halfway intentional. There was a Python 2 vs 3 problem, and I'm not sure we want to maintain another binary now that SacreBLEU exists. But we should certainly add the new BLEU as a metric to TensorBoard. In any case: can we get this in and redo the BLEU binary in another PR? Would that be OK, Martin? Sorry for the issue!

@martinpopel
Contributor

Yes, we can redo t2t-bleu.py in another PR; I don't want to slow down your work.
If really needed, I can look at the Python 2 compatibility (by the way, SacreBLEU is Python 3 only).
Note that t2t-bleu.py does not really overlap with SacreBLEU's functionality (actually, it could use SacreBLEU internally): it focuses on evaluating all checkpoints and storing the scores to a TensorBoard event file (see the #436 summary).
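The "evaluate all checkpoints" part of that workflow can be sketched roughly as follows. This is a hypothetical, minimal illustration, not the actual t2t-bleu.py code: the helper name `list_checkpoints` and the reliance on the `model.ckpt-<step>.index` filename pattern are assumptions about how one might enumerate TensorFlow checkpoints in a model directory.

```python
import os
import re

def list_checkpoints(model_dir):
    """Return checkpoint step numbers found in model_dir, sorted ascending.

    Hypothetical helper: it scans for files named like
    'model.ckpt-<step>.index' (one .index file per checkpoint) and
    extracts the global step from each name. A real script could then
    loop over these steps, decode with each checkpoint, compute BLEU,
    and append the score to a TensorBoard event file.
    """
    steps = set()
    for name in os.listdir(model_dir):
        match = re.match(r"model\.ckpt-(\d+)\.index$", name)
        if match:
            steps.add(int(match.group(1)))
    return sorted(steps)
```

Sorting by step matters here: the external evaluation job can remember the last step it scored and only process newer checkpoints, which is what lets it run on a separate CPU or older GPU without touching the training job.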

@lukaszkaiser
Contributor

Let's do that. The problem with Python 2 is that Google uses it internally, so if the code isn't compatible, all our internal tests start failing. So yes, let's get this in and then redo the BLEU. Could we have your BLEU score added to metrics, or maybe replace the current approx-BLEU entirely? It's much more useful, as it's much closer to the real BLEU than the current approximation. What do you think?

@lukaszkaiser merged commit 970dac9 into tensorflow:master Dec 1, 2017
@martinpopel
Contributor

Could we have your BLEU score added to metrics, or maybe replace the current approx-BLEU entirely?

"my" BLEU actually uses "your" bleu_hook.py compute_bleu(), i.e. the same code as is used for computing approx_bleu (I just added smoothing, but this is already merged).
The important difference is in the input: for approx_bleu, the reference_corpus and translation_corpus are tensors of subword IDs, whereas t2t-bleu passes lists of words (tokenized from plaintext using bleu_tokenize()). I am not sure whether it is wise to use this instead of approx_bleu: it may be too slow, and I am not sure the detokenization+tokenization can be converted to static graph ops. To get close to the real BLEU, we would also need eval_run_autoregressive and beam search. Currently, internal evaluation does not work for multi-GPU training, so we need an external script/job anyway. I like it this way because I can run it on a CPU (or an older GPU) and not slow down the training on GPUs.
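For readers unfamiliar with the metric being discussed, here is a self-contained sketch of a smoothed sentence-level BLEU over word-token lists (clipped n-gram precisions, add-one smoothing, brevity penalty). It is an illustration only, not the actual bleu_hook.compute_bleu, which operates on whole corpora and differs in detail; the function names `ngrams` and `smoothed_bleu` are invented for this example.

```python
import collections
import math

def ngrams(tokens, n):
    """Count the n-grams (as tuples) in a token list."""
    return collections.Counter(
        tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def smoothed_bleu(reference, translation, max_order=4):
    """Smoothed sentence-level BLEU between two token lists (sketch)."""
    precisions = []
    for n in range(1, max_order + 1):
        ref_counts = ngrams(reference, n)
        trans_counts = ngrams(translation, n)
        # Counter intersection gives clipped n-gram matches.
        overlap = sum((trans_counts & ref_counts).values())
        total = max(len(translation) - n + 1, 0)
        # Add-one smoothing keeps every precision nonzero.
        precisions.append((overlap + 1.0) / (total + 1.0))
    geo_mean = math.exp(sum(math.log(p) for p in precisions) / max_order)
    if not translation:
        return 0.0
    # Brevity penalty: punish translations shorter than the reference.
    ratio = len(reference) / len(translation)
    brevity_penalty = 1.0 if ratio < 1.0 else math.exp(1.0 - ratio)
    return geo_mean * brevity_penalty
```

The point of the discussion above is only which tokens are fed into such a function: subword IDs straight from the model (approx_bleu) versus words recovered from detokenized plaintext (t2t-bleu), with the latter tracking the "real" BLEU much more closely.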

@martinpopel
Contributor

What about the failing Travis build for this PR?
I've noticed the tests sometimes fail non-deterministically, e.g. here.

@rsepassi
Contributor Author

rsepassi commented Dec 2, 2017

Yes, that Algorithmic Algebra test is flaky. You can see that master builds fine (no change from this PR). We'll likely end up removing those problems as nobody is using them.
