
Conversation

@RichardWooSJTU
Collaborator

This PR introduces two key improvements to the ep.py DeepEP engine implementation:

  1. Unified Buffer Initialization in Mixed Mode
  • Reorganized the DeepEP engine so that a single deepep.Buffer instance is initialized and shared between the prefill and decode phases when running in mixed mode.
  • The change affects all workflows that use mixed-mode execution.
  2. Explicit Buffer Cleanup for MoE Workspaces
  • Added mandatory calls to clean_low_latency_buffer() at the end of each Mixture-of-Experts (MoE) processing stage.
  • This ensures the workspace is properly cleaned across prefill/decode transitions, preventing potential memory corruption from stale tensor references.
  • The change is critical for maintaining stability in long-running inference sessions with variable sequence lengths.
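The two changes above can be sketched roughly as follows. This is an illustrative outline only, not the actual FastDeploy or DeepEP code: the engine class, the stub Buffer, and every method except clean_low_latency_buffer() (which is a real DeepEP Buffer method) are hypothetical stand-ins.

```python
# Hypothetical sketch of the PR's two changes; the classes below are
# illustrative stand-ins, not the real FastDeploy/DeepEP implementation.

class Buffer:
    """Stand-in for deepep.Buffer (hypothetical)."""

    def __init__(self):
        # Tracks whether the low-latency workspace holds stale state.
        self.dirty = False

    def low_latency_dispatch(self, tokens):
        # Pretend to dispatch tokens to experts; marks the workspace in use.
        self.dirty = True
        return tokens

    def clean_low_latency_buffer(self):
        # Reset the low-latency workspace so stale tensor references from
        # the previous phase cannot leak into the next one.
        self.dirty = False


class MixedEPEngine:
    """Hypothetical engine sharing one Buffer across prefill and decode."""

    def __init__(self):
        # Change 1: a single Buffer instance serves both phases in mixed
        # mode, instead of one Buffer per phase.
        self.buffer = Buffer()

    def _moe_stage(self, tokens):
        out = self.buffer.low_latency_dispatch(tokens)
        # Change 2: explicit cleanup at the end of every MoE stage.
        self.buffer.clean_low_latency_buffer()
        return out

    def prefill(self, tokens):
        return self._moe_stage(tokens)

    def decode(self, tokens):
        return self._moe_stage(tokens)
```

With this shape, alternating prefill and decode calls reuse one buffer, and the workspace is clean after every stage regardless of which phase ran last.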

@paddle-bot

paddle-bot bot commented Aug 4, 2025

Thanks for your contribution!

@Jiang-Jia-Jun Jiang-Jia-Jun merged commit f5c64a0 into PaddlePaddle:develop Aug 5, 2025
16 of 21 checks passed