[pipelining] try not to dry run module when creating PipelineStage?

### 🚀 The feature, motivation and pitch

Today, `PipelineStage`'s init method dry-runs the module with the example input:
https://github.com/pytorch/pytorch/blob/48d18fbd4cf785e1f69a6555d97a39023a5d199e/torch/distributed/pipelining/stage.py#L1270

This demands extra memory and may OOM for large models that additionally require TP/FSDP or activation checkpointing to keep the memory envelope low. (But those might not have been applied yet at the point of pipeline stage creation.)

### Alternatives

The dry run is for generating `output_args`, whose shape we rely on to create gradient recv buffers during backward.

A workaround would be for the user to provide `output_args` to `PipelineStage`'s init, but that is not ergonomic.

Also, inference runs have no backward pass, so they do not need these gradient recv buffers.
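
One possible way to get the output shape without a real forward pass is shape inference on the meta device, where tensors carry shape and dtype but no storage. The following is only a minimal sketch of that idea, not how `PipelineStage` works today: `infer_output_meta` and `build_stage_module` are hypothetical names, the toy `nn.Sequential` stands in for a real stage submodule, and whether `PipelineStage` accepts a meta tensor as `output_args` is an assumption that would need checking.

```python
import torch
import torch.nn as nn


def infer_output_meta(build_module, example_input):
    """Return a meta tensor describing the module's output (hypothetical helper).

    `build_module` is a zero-argument factory; calling it under the meta
    device creates parameters with no storage, so no real memory is used.
    """
    with torch.device("meta"):
        module = build_module()
    # Same shape/dtype as the example input, but without any data.
    meta_input = torch.empty_like(example_input, device="meta")
    with torch.no_grad():
        return module(meta_input)


def build_stage_module():
    # Toy stand-in for one pipeline stage's submodule (assumption).
    return nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 8))


example_input = torch.randn(4, 16)
meta_out = infer_output_meta(build_stage_module, example_input)

# The shape/dtype is all that is needed to size a gradient recv buffer.
grad_recv_buffer = torch.empty(meta_out.shape, dtype=meta_out.dtype)
```

This only works when every op in the module supports meta tensors and output shapes do not depend on input values, so it would likely need a fallback to the current dry run.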

### Additional context

cc: @H-Huang @wconstab 

cc @XilunWu @H-Huang @awgu @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k @c-p-i-o
