Labels: module: cuda (Related to torch.cuda, and CUDA support in general), triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
Description
🚀 The feature, motivation and pitch
Especially during hyperparameter optimization (HPO), exceptions like CUDA out-of-memory (OOM) errors can occur.
I'm looking for a way to recover from OOM exceptions and would like to propose an additional force parameter for torch.cuda.empty_cache() that forces PyTorch to release all cached memory, even if, due to a memory leak, some allocations remain.
Optionally, a function like torch.cuda.reset() would obviously work as well.
The currently suggested combination of gc.collect() and torch.cuda.empty_cache() is not reliable enough to restore the initial state.
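For reference, this is roughly the recovery pattern I've been trying; the handler structure and names like run_trial are illustrative, not PyTorch APIs:

```python
import gc
import torch

def run_trial(trial_fn, *args, **kwargs):
    # Typical HPO recovery attempt: catch the OOM, drop references, collect
    # garbage, and release the caching allocator's cached blocks. In practice
    # this does not reliably restore the initial memory state if anything
    # (e.g. a tensor held by a traceback or a logging list) still leaks.
    try:
        return trial_fn(*args, **kwargs)
    except RuntimeError as e:
        if "out of memory" not in str(e):
            raise
        gc.collect()
        torch.cuda.empty_cache()
        return None  # tell the HPO loop to skip or retry this trial
```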
Alternatives
Completely restarting the Python kernel releases all CUDA memory, but is not practical during an HPO run. Running each trial in its own subprocess can approximate this, as sketched below.
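A sketch of that per-trial-subprocess workaround, under the assumption that trial_fn is a picklable top-level function (trial_fn and params are placeholder names):

```python
import concurrent.futures
import multiprocessing as mp

def run_trial_isolated(trial_fn, params):
    # Run the trial in a fresh "spawn" subprocess so all of its CUDA memory
    # is released when the process exits -- effectively a per-trial kernel
    # restart without stopping the HPO driver. "spawn" (rather than the
    # default "fork" on Linux) avoids forking a CUDA-initialized parent,
    # which CUDA does not support.
    ctx = mp.get_context("spawn")
    with concurrent.futures.ProcessPoolExecutor(max_workers=1,
                                                mp_context=ctx) as pool:
        return pool.submit(trial_fn, params).result()
```

The downside is the per-trial startup cost (process spawn, CUDA context creation, module imports), which is why an in-process torch.cuda.reset() would still be preferable.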
Additional context
Suggestions on how to properly track down memory leaks and solve my core problem are appreciated. The sketch below shows the kind of diagnostics I've been using so far.
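These helpers rely only on existing APIs (torch.cuda.memory_allocated(), torch.cuda.memory_reserved(), gc.get_objects()); the helper names themselves are ad hoc, not part of PyTorch:

```python
import gc
import torch

def report_cuda_memory(tag=""):
    # allocated = memory in use by live tensors; reserved = memory held by
    # the caching allocator (the part empty_cache() can return to the driver).
    print(f"[{tag}] allocated: {torch.cuda.memory_allocated() / 2**20:.1f} MiB, "
          f"reserved: {torch.cuda.memory_reserved() / 2**20:.1f} MiB")

def live_cuda_tensors():
    # Enumerate CUDA tensors the garbage collector still tracks; anything
    # listed here after a trial has finished is a candidate leak.
    for obj in gc.get_objects():
        try:
            if torch.is_tensor(obj) and obj.is_cuda:
                yield type(obj), tuple(obj.shape)
        except Exception:
            # Some tracked objects raise on attribute access; skip them.
            pass
```

torch.cuda.memory_summary() is also useful for a more detailed allocator breakdown, but none of these tell me how to force the allocator back to its initial state.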
cc @ngimel