Conversation

@yuanlehome (Collaborator) commented Jul 3, 2025

  1. Currently, this backend is only supported in PD mode; enable it with `export FD_ATTENTION_BACKEND=FLASH_ATTN`.
  2. Currently, profiling must be skipped by specifying `--num-gpu-blocks-override`.
  3. Additionally, this PR deletes the `use_fast_ffn` parameter.
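The two usage steps above can be sketched as a launch script. A minimal sketch: `FD_ATTENTION_BACKEND` and `--num-gpu-blocks-override` come from the PR description; the serving entrypoint, model path, and block-count value shown here are illustrative assumptions, not taken from this PR.

```shell
# Select the FlashAttention backend (currently PD mode only), per the PR text.
export FD_ATTENTION_BACKEND=FLASH_ATTN

# Skip the automatic memory profile by overriding the KV-cache block count.
# NOTE: the entrypoint module, model path, and the value 2048 are hypothetical
# placeholders for illustration; consult the FastDeploy docs for real values.
python -m fastdeploy.entrypoints.openai.api_server \
    --model ./my_model \
    --num-gpu-blocks-override 2048
```

Overriding the block count sidesteps the profiling pass that would otherwise estimate available KV-cache memory at startup.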

@paddle-bot (bot) commented Jul 3, 2025

Thanks for your contribution!

@yuanlehome force-pushed the add_fa3_backend_v2 branch from 0f83e81 to f8157ef on July 3, 2025, 12:14
@EmmonsCurse (Collaborator) left a comment
LGTM

@yuanlehome yuanlehome merged commit 240bdac into PaddlePaddle:develop Jul 3, 2025
2 of 3 checks passed