[Feat] ernie4_5_vl_moe support CudaGraph
#3226
Thanks for your contribution!
Please resolve the merge conflicts. Also, when this PR is merged, could CUDAGraph monitoring be enabled in the multimodal CI pipeline?
ernie4_5_vl_moe support CudaGraph
    ids_remove_padding=ids_remove_padding,
    image_features=image_features,
    forward_meta=forward_meta,
    vl_moe_meta=vl_moe_meta,
Are there any performance test results? After moving this out, is the performance of the P (prefill) node affected?
The PD-disaggregated (prefill/decode separation) scenario has not been tested yet.
    if (num_rows == 0){
        return;
    }
It would be best to format the C++ code as well.
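For context on the `num_rows == 0` guard quoted above: a CUDA kernel launched with a grid dimension of zero fails with an invalid-configuration error, so host code commonly returns before computing launch dimensions when there is no work. The following is a minimal Python sketch of that host-side logic, not the PR's code; `launch_config` and `threads_per_block` are hypothetical names introduced for illustration.

```python
def launch_config(num_rows, threads_per_block=256):
    """Return (blocks, threads) for a 1-D launch, or None when there is no work.

    Hypothetical helper: it mirrors the early return in the reviewed C++ snippet,
    where an empty input skips the kernel launch entirely.
    """
    if num_rows == 0:
        # A grid of zero blocks is not a valid launch configuration, so skip it.
        return None
    # Ceiling division: enough blocks to cover every row.
    blocks = (num_rows + threads_per_block - 1) // threads_per_block
    return blocks, threads_per_block


for rows in (0, 1, 257):
    print(rows, launch_config(rows))
```

Without the guard, the ceiling division would yield zero blocks for an empty batch, which is the case the early return avoids.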
gongshaotian left a comment
LGTM
    dtype = meta["dtype"]
    if "." in meta["dtype"]:
        dtype = _resolve_path(fd_config, meta["dtype"])
    self._mm_buffers[name] = paddle.full(
Please rename the `_mm_buffers` variable as well.
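The snippet under review treats a `dtype` string containing a dot as a path into the config object and resolves it before allocating the buffer. The body of `_resolve_path` is not shown in this conversation, so the sketch below is only one plausible reading: it assumes the path is a chain of attribute names, and `resolve_path`, `resolve_dtype`, `config`, and `buffer_specs` are hypothetical stand-ins. The `paddle.full` allocation is left out so the example runs with the standard library alone.

```python
from functools import reduce
from types import SimpleNamespace


def resolve_path(root, dotted):
    """Walk a dotted attribute path such as 'model_config.dtype' starting at root."""
    return reduce(getattr, dotted.split("."), root)


def resolve_dtype(config, meta):
    """Return meta['dtype'] unchanged, or look it up in config if it is a dotted path."""
    dtype = meta["dtype"]
    if "." in dtype:
        dtype = resolve_path(config, dtype)
    return dtype


# Hypothetical config and buffer specs, for illustration only.
config = SimpleNamespace(model_config=SimpleNamespace(dtype="bfloat16"))
buffer_specs = {
    "image_features": {"dtype": "model_config.dtype"},  # resolved from config
    "token_type_ids": {"dtype": "int32"},               # literal dtype name
}

resolved = {name: resolve_dtype(config, meta) for name, meta in buffer_specs.items()}
print(resolved)
```

Under this reading, a buffer spec can either name a dtype directly or defer to whatever the model config specifies, which keeps preallocated buffers consistent with the model's precision.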
Adapt ernie4_5_vl_moe to CudaGraph.