Use the fallback method for GRU and LSTM on ROCm if padded I/O is needed by ekuznetsov139 · Pull Request #17111 · keras-team/keras

ekuznetsov139 · 2022-10-04T16:21:39Z

On ROCm, the low-level library that implements RNNs (MIOpen) does not support inputs with variable sequence lengths. At present, an attempt to pass such inputs to an RNN results in an exception. Because of this limitation, several Keras unit tests are disabled for ROCm and several others currently fail.

This change switches from the GPU-optimized RNN op (CudnnRNNV3) to the fallback implementation whenever the inputs require padded I/O.
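The selection rule described above can be sketched roughly as follows. This is a simplified NumPy illustration, not the code in this PR: the helper names `needs_padded_io` and `select_rnn_impl` are hypothetical, the mask is assumed to be a boolean `(batch, time)` array, and the other conditions the real Keras code checks before choosing the GPU kernel are left out.

```python
import numpy as np


def needs_padded_io(mask):
    """Return True if at least one timestep in the batch is masked out."""
    if mask is None:
        return False  # no mask: every sequence spans the full time axis
    return not bool(np.all(mask))


def select_rnn_impl(mask, gpu_available, is_rocm):
    """Pick "gpu" (fused kernel) or "fallback" (generic implementation)."""
    if not gpu_available:
        return "fallback"
    if is_rocm and needs_padded_io(mask):
        # Assumption for this sketch: the ROCm backend (MIOpen) cannot
        # consume batches whose sequences have different lengths.
        return "fallback"
    return "gpu"


# Hypothetical inputs: one batch with no padding, one with a padded step.
full = np.ones((2, 4), dtype=bool)
padded = np.array([[True, True, True, False],
                   [True, True, True, True]])

print(select_rnn_impl(full, gpu_available=True, is_rocm=True))
print(select_rnn_impl(padded, gpu_available=True, is_rocm=True))
```

In the sketch, only the combination of a ROCm build and a mask with at least one padded step forces the fallback; an unmasked or fully populated batch still takes the GPU path.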

fchollet

Thanks for the PR! It makes sense.

keras/layers/rnn/gru_lstm_utils.py

keras/layers/rnn/lstm.py

fchollet · 2022-10-06T04:45:19Z

keras/layers/rnn/lstm.py

                    # Under eager context, check the device placement and prefer
                    # the GPU implementation when GPU is available.
                    if can_use_gpu:
+                        print("Accepted GPU version: ", mask, row_lengths)


Remove debug line

Please push your changes

gbaned · 2022-10-14T12:05:32Z

@ekuznetsov139 Can you please check @fchollet's comments and keep us posted? Thank you!

fchollet · 2022-10-25T16:50:36Z

keras/layers/rnn/lstm.py

                    # Under eager context, check the device placement and prefer
                    # the GPU implementation when GPU is available.
                    if can_use_gpu:
+                        print("Accepted GPU version: ", mask, row_lengths)


Please remove debug line

Reformatting

Disabling a test that fails on fallback path

fchollet

Thanks, LGTM!

Imported from GitHub PR #17587

A previous PR #17111 added some logic to use fallback implementations of GRU and LSTM on ROCm in situations where padded i/o is needed (since ROCm does not support padded i/o). That logic turns out to be too restrictive - it chooses the fallback path in cases where it is not really needed, which may result in significant performance degradations. This PR resolves the problem.

Copybara import of the project:

-- e78b4ab by Eugene Kuznetsov <eugene.kuznetsov@amd.com>:

Less restrictive fallback logic

Merging this change closes #17587

FUTURE_COPYBARA_INTEGRATE_REVIEW=#17587 from ekuznetsov139:rocm_rnn_fallback_v2 e78b4ab
PiperOrigin-RevId: 511817599

google-ml-butler bot added the size:M label Oct 4, 2022

google-ml-butler bot assigned gbaned Oct 4, 2022

ekuznetsov139 marked this pull request as draft October 4, 2022 16:36

Use the fallback method for GRU and LSTM on ROCm if padded I/O is needed

dd588d3

ekuznetsov139 force-pushed the rocm_rnn_fallback branch from 593323a to dd588d3 October 5, 2022 01:01

ekuznetsov139 marked this pull request as ready for review October 5, 2022 01:02

fchollet reviewed Oct 5, 2022

keras/layers/rnn/gru_lstm_utils.py (outdated)

keras/layers/rnn/lstm.py

fchollet reviewed Oct 6, 2022

ekuznetsov139 force-pushed the rocm_rnn_fallback branch from fab214f to 81790c0 October 21, 2022 09:01

fchollet reviewed Oct 25, 2022

ekuznetsov139 force-pushed the rocm_rnn_fallback branch from 81790c0 to a3b3924 October 25, 2022 17:07

tf.cond optimization

6fed911

Reformatting

Disabling a test that fails on fallback path

ekuznetsov139 force-pushed the rocm_rnn_fallback branch from a3b3924 to 6fed911 October 25, 2022 17:08

fchollet approved these changes Oct 26, 2022

google-ml-butler bot added the kokoro:force-run and ready to pull labels Oct 26, 2022

kokoro-team removed the kokoro:force-run label Oct 26, 2022

copybara-service bot merged commit add6c1d into keras-team:master Oct 28, 2022

ekuznetsov139 mentioned this pull request Feb 21, 2023

Less restrictive ROCm+GRU/LSTM fallback logic #17587

Merged

This was referenced Feb 22, 2023

Less restrictive fallback logic #17590

Closed

Less restrictive fallback logic #17591

Merged

copybara-service bot mentioned this pull request Feb 23, 2023

PR #17587: Less restrictive ROCm+GRU/LSTM fallback logic #17594

Closed

renovate bot mentioned this pull request Aug 28, 2024

chore(deps): Update dependency keras to v3 KindaiCVLAB/machinelearning-images#178

Open

copybara-service[bot] merged 2 commits into keras-team:master from ekuznetsov139:rocm_rnn_fallback
