-
Notifications
You must be signed in to change notification settings - Fork 5.9k
[Paddle Inference] Add float_to_half_pass to support inference with mixed precision #47993
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Paddle Inference] Add float_to_half_pass to support inference with mixed precision #47993
Conversation
|
你的PR提交成功,感谢你对开源项目的贡献! |
a116b94 to
60d4d72
Compare
b593f0c to
686c441
Compare
c286881 to
d0db320
Compare
XieYunshen
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM for
set_tests_properties(gpu_ernie_half_test PROPERTIES TIMEOUT 40)
jiweibo
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
XiaoguangHu01
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
From00
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM for mutable_data in float_to_half_pass
* [Release2.4] Revert python link prs (#48573) * Revert "Fix mac link python (#48017)" This reverts commit 3fa7a73. * Revert "[Cherry-pick] Fix python link error (#47811)" This reverts commit ff642c6. * Update config.go * [Paddle Inference] Add float_to_half_pass to support inference with mixed precision (#47993) * [Inference] optimize some code and fix some bug (#48780) * clean ir_pass_manager and fix map_depthwise_conv_to_conv_pass * fix unitest timeout * [Paddle Inference] clean unused code (#48392) * fix * update * update Co-authored-by: Chen Weihang <[email protected]>
PR types
Others
PR changes
Others
Describe
文档API更新PR1、PR2、PR3、PR4
PR工作:
使用方式:
python ---> config.enable_use_gpu(512, 0, PrecisionType::kHalf);
c++ ---> config.EnableUseGpu(512, 0, PrecisionType::kHalf);
TODO: