Skip to content

Comments

Fix DML regression from allocator refactor and enable unrounded weight allocation in ORT API#17030

Merged
jeffbloo merged 2 commits intomainfrom
user/jeffbloo/DmlAllocFix
Aug 11, 2023
Merged

Fix DML regression from allocator refactor and enable unrounded weight allocation in ORT API#17030
jeffbloo merged 2 commits intomainfrom
user/jeffbloo/DmlAllocFix

Conversation

@jeffbloo
Copy link
Contributor

@jeffbloo jeffbloo commented Aug 7, 2023

This addresses a DML performance regression from the following PR resulting in allocations not being rounded and pooled in the DML execution provider.

#15833

This also fixes a pre-existing limitation that allocations during session initialization (primarily large weights and persistent resources) only bypassed rounding and pooling while using the Winml API. The allocator now also respects a caller's rounding mode parameter when provided.

PatriceVignola
PatriceVignola previously approved these changes Aug 7, 2023
smk2007
smk2007 previously approved these changes Aug 7, 2023
@jeffbloo jeffbloo dismissed stale reviews from smk2007 and PatriceVignola via 9eddebb August 7, 2023 21:11
@justinchuby
Copy link
Contributor

To merge this PR I suggest reverting the "apply lintrunner" commit, then manually remove the trailing spaces on this line:

image

@jeffbloo jeffbloo force-pushed the user/jeffbloo/DmlAllocFix branch from 9eddebb to 1bee33d Compare August 10, 2023 21:05
@jeffbloo jeffbloo merged commit 0180c04 into main Aug 11, 2023
@jeffbloo jeffbloo deleted the user/jeffbloo/DmlAllocFix branch August 11, 2023 00:02
jchen351 pushed a commit that referenced this pull request Aug 12, 2023
…t allocation in ORT API (#17030)

This addresses a DML performance regression from the following PR
resulting in allocations not being rounded and pooled in the DML
execution provider.

#15833

This also fixes a pre-existing limitation that allocations during
session initialization (primarily large weights and persistent
resources) only bypassed rounding and pooling while using the Winml API.
The allocator now also respects a caller's rounding mode parameter when
provided.
@justinchuby
Copy link
Contributor

kleiti pushed a commit to kleiti/onnxruntime that referenced this pull request Mar 22, 2024
…t allocation in ORT API (microsoft#17030)

This addresses a DML performance regression from the following PR
resulting in allocations not being rounded and pooled in the DML
execution provider.

microsoft#15833

This also fixes a pre-existing limitation that allocations during
session initialization (primarily large weights and persistent
resources) only bypassed rounding and pooling while using the Winml API.
The allocator now also respects a caller's rounding mode parameter when
provided.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants