This repository was archived by the owner on Oct 13, 2021. It is now read-only.

Masking RNN with zeros input #386

Merged: 11 commits, Feb 27, 2020

Conversation

cjermain
Contributor

This PR reproduces a discrepancy between the ONNX and Keras models in handling inputs that are fully masked. It occurs for RNN models that contain both a Masking layer and a non-zero bias. When an input entry is all zeros (i.e. fully masked by the Masking layer), the ONNX model's RNN output is non-zero, whereas Keras outputs zeros. Turning off the bias, or using the default (zero) bias, produces behavior consistent with Keras.

(Pdb) up
> /home/user/keras-onnx/tests/test_layers.py(1817)test_masking_bias()
-> self.assertTrue(run_onnx_runtime(onnx_model.graph.name, onnx_model, x, expected, self.model_files))
(Pdb) x
array([[[365.8103 , 107.39152, 701.68   , 615.55023, 468.88876],
        [831.7788 , 680.243  , 890.5216 , 550.38116, 970.3993 ],
        [134.01295, 771.0398 , 594.99915, 315.76196, 753.1429 ]],

       [[  0.     ,   0.     ,   0.     ,   0.     ,   0.     ],
        [  0.     ,   0.     ,   0.     ,   0.     ,   0.     ],
        [  0.     ,   0.     ,   0.     ,   0.     ,   0.     ]]],
      dtype=float32)
(Pdb) down
> /home/user/keras-onnx/tests/test_utils.py(153)run_onnx_runtime()
-> return res
(Pdb) expected
[array([[ 0.7615942,  0.       , -0.7615942,  0.       ,  0.       ,
         0.       ,  0.7615942,  0.       ],
       [ 0.       ,  0.       ,  0.       ,  0.       ,  0.       ,
         0.       ,  0.       ,  0.       ]], dtype=float32)]
(Pdb) actual
[array([[ 0.76159436,  0.        , -0.76159436,  0.        ,  0.        ,
         0.        ,  0.76159436,  0.        ],
       [ 0.2423832 ,  0.26090473,  0.08070117,  0.30603507,  0.13741694,
         0.24133024,  0.513518  ,  0.37147206]], dtype=float32)]

This is a problem for architectures using multiple RNN layers with masking. Any suggestions or thoughts are appreciated!
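A minimal sketch of the setup described above (layer sizes, the mask value, and the `bias_initializer` choice are illustrative, not taken from the PR's actual test):

```python
import numpy as np
import tensorflow as tf

# A Masking layer followed by an LSTM with a non-zero bias, mirroring
# the failing configuration described in this PR.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(3, 5)),
    tf.keras.layers.Masking(mask_value=0.0),
    tf.keras.layers.LSTM(8, bias_initializer="ones"),
])

x = np.random.rand(2, 3, 5).astype(np.float32)
x[1, :, :] = 0.0  # second sample is all zeros, so fully masked

# Keras skips every timestep of the masked sample and returns the
# initial (zero) state for it; the pre-fix ONNX conversion did not.
y = model.predict(x, verbose=0)
print(y[1])  # all zeros in Keras
```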

@claassistantio

claassistantio commented Feb 17, 2020

CLA assistant check
All committers have signed the CLA.

@cjermain
Contributor Author

The following is the ONNX graph in Netron. It seems like any masked rows are set to zero in the input to the LSTM (at the Mul operator). I would have expected an operator after the LSTM to set any masked values to their corresponding value (i.e. zero in this case).
(screenshot: temp.onnx graph in Netron)
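Zeroing the input rows is not enough, because the bias term still feeds the gates. A schematic single LSTM step in numpy (zero weights for clarity, generic i/f/g/o gate order; all names here are illustrative) shows how an all-zero timestep still yields a non-zero hidden state when the bias is non-zero:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

hidden = 4
h = np.zeros(hidden)           # initial hidden state
c = np.zeros(hidden)           # initial cell state
x = np.zeros(3)                # a "masked" timestep: all-zero input

# Zero weights so only the (non-zero) bias contributes to the gates.
W = np.zeros((4 * hidden, 3))      # input weights
R = np.zeros((4 * hidden, hidden)) # recurrent weights
b = np.ones(4 * hidden)            # non-zero bias

gates = W @ x + R @ h + b
i, f, g, o = np.split(gates, 4)
c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
h = sigmoid(o) * np.tanh(c)

print(h)  # non-zero, even though the input timestep was all zeros
```

This matches the symptom in the transcript above: the fully masked sample produces non-zero values in the ONNX output while Keras, which skips masked timesteps entirely, returns zeros.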

@cjermain
Contributor Author

I've added a section that generates the sequence_lens input for the LSTM operator from the output mask of the Masking layer. This replicates the Keras behavior and fixes the blocking test I added earlier.

(screenshot: temp_sequential.onnx graph in Netron)
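Conceptually, the sequence_lens computation amounts to counting the unmasked timesteps per sample. A numpy sketch of the idea (the actual PR builds this with ONNX operators inside the graph; the array values here are made up):

```python
import numpy as np

# Boolean mask as produced by the Masking layer: shape (batch, timesteps),
# True where the timestep is kept.
mask = np.array([
    [True,  True,  True],
    [False, False, False],   # fully masked sample
])

# sequence_lens for the ONNX LSTM: number of valid steps per sample.
# The LSTM then stops updating state past each sample's length.
sequence_lens = mask.sum(axis=1).astype(np.int32)
print(sequence_lens)  # [3 0]
```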

@cjermain cjermain requested a review from wenbingl February 24, 2020 03:15
@cjermain
Contributor Author

The same strategy should also work for bidirectional.py. I will try that next.

@wenbingl
Member

Ready for review, or still WIP?

@cjermain cjermain changed the title [WIP] Masking RNN with zeros input Masking RNN with zeros input Feb 25, 2020
@cjermain
Contributor Author

@wenbingl it would be great to get a review. I'm planning to hold off on the bidirectional implementation and submit it as a separate PR.

Member

@wenbingl wenbingl left a comment


Thanks for the fix.
