MT5 onnx conversion for beam search #11958
Merged
Conversation
This pull request introduces 1 alert when merging fe6605d into e24349b (view new alerts on LGTM.com).
wangyems
reviewed
Jun 23, 2022
@@ -217,6 +228,7 @@ def export_onnx_models(
def main():
Contributor
maybe comment/assert somewhere that onnx>=1.12 is needed for subgraph proto > 2G case?
Contributor
Author
Sure. Let me add a check of the ONNX version in the next PR.
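The version check discussed above could look something like the following. This is a hedged sketch, not code from the PR: the helper names (`version_tuple`, `check_onnx_version`) are illustrative, and the version string is parsed with the standard library only, since `packaging` may not be installed.

```python
def version_tuple(version):
    """Parse a dotted version string like "1.12.0" into a comparable tuple,
    ignoring any non-numeric suffix (e.g. "1.12.0rc1" -> (1, 12, 0))."""
    parts = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits) if digits else 0)
    return tuple(parts)


def check_onnx_version(min_version="1.12.0"):
    """Raise if the installed onnx package is older than min_version.
    onnx >= 1.12 is needed to save models whose subgraph protos exceed 2GB."""
    import onnx  # imported lazily so the helper itself has no hard dependency

    if version_tuple(onnx.__version__) < version_tuple(min_version):
        raise RuntimeError(
            f"onnx>={min_version} is required for subgraphs larger than 2GB; "
            f"found onnx {onnx.__version__}"
        )
```

Calling `check_onnx_version()` near the top of `main()` would fail fast with a clear message instead of the protobuf size error surfacing mid-export.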
Description:
Support conversion of mT5 models to ONNX for beam search.
The output will have two files (e.g. mt5-large-beamsearch.onnx and mt5-large-beamsearch.onnx.data) when the external data format (-e) is used.
Note: please install the ONNX 1.12 package for this. Otherwise, you might encounter an error like 'Message onnx.ModelProto exceeds maximum protobuf size of 2GB:' when saving the output model.
Some intermediate encoder and decoder ONNX models can be found in ./google; those files are not needed (they are preserved for debugging purposes).
Right now the model can run, but the max diff can be around 8e-3 for the encoder or decoder, which could cause beam search results to differ between PyTorch and ORT. That is a separate issue for the PyTorch exporter.
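The "max diff" above refers to the largest elementwise absolute difference between PyTorch and ORT outputs. A minimal, stdlib-only sketch of such a parity check follows; the function names and the 1e-2 tolerance are illustrative assumptions, not taken from the PR's verification code.

```python
def max_abs_diff(a, b):
    """Recursively walk two (possibly nested) lists of numbers and
    return the largest elementwise absolute difference."""
    if isinstance(a, (int, float)):
        return abs(a - b)
    return max(max_abs_diff(x, y) for x, y in zip(a, b))


def outputs_close(torch_out, ort_out, tol=1e-2):
    """True if the two outputs agree within tol. Note that a max diff
    around 8e-3 passes a 1e-2 tolerance yet can still flip beam search
    results, since beam scoring is sensitive to small logit changes."""
    return max_abs_diff(torch_out, ort_out) <= tol
```

This illustrates why a passing numeric parity check does not guarantee identical beam search output between the two runtimes.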
Motivation and Context
Related issues: #11813, #11848