WIP: Add support for AMX instructions #5780

jwlawson · 2021-03-02T16:49:47Z

Currently done as a pass before the 2d ramps are flattened. After that the loads are converted to a concat of multiple loads and it is hard to get the strides from the index expressions, mainly because Halide is cautious about overflows so can't simplify the expressions back to simple base + i * stride indexes it started with.

Introduces an AMXTile type, so that the tiles that have to be stored by Halide are of the right type, and so that LLVM can alloca the right thing. Trying to use <i32 x 256> causes problems as Halide tries to break the loads and stores into multiple vector loads. This type doesn't really need to be externally available, but I'm not sure if there's a way to have an internal only type. For the tiles that do not need to be stored (ie used directly in a tile matmul intrinsic) we don't need to use the AMX type, which should allow us to use the overloaded intrinsics to select the right instruction for the datatypes.

Currently (ab)uses the way Halide calls intrinsics, but these tile intrinsics are needed to load and store from memory, so the default call->setDoesNotAccessMemory(); is not valid. I'm not too sure how to handle this in better. There are also some hacks to get the tile_store intrinsic to work, as it really should return void, but Halide makes some assumptions about the return types of Call Exprs which caused problems.

steven-johnson requested a review from halidebuildbots March 3, 2021 19:16

jwlawson mentioned this pull request Mar 9, 2021

Move where intrinsic function attributes are set #5795

Merged

steven-johnson mentioned this pull request Mar 9, 2021

Fix out of bounds reads in strided ARM loads #5784

Merged

jwlawson added 7 commits March 11, 2021 10:50

Add support for AMX tile instructions

05d2751

Make AMX transform opt-in with memory type

758ab07

Clean up tiled_matmul test

0f30587

Handle AMX intrinsic attributes better

89b5c9e

Format

1d6e94e

Fix test to behave like other tests

427ff33

Add doc and missing load check

644af10

jwlawson force-pushed the tile_matmul branch from 1898720 to 644af10 Compare March 11, 2021 14:33

jwlawson and others added 5 commits March 11, 2021 14:50

Format

c131586

Throw error if user requests AMX for invalid operation

d954996

Add Tile lowering pass to makefile

06714a5

Use spaces in Makefile

55db851

Merge branch 'master' into pr/5780

bd091c0

mcleary mentioned this pull request Mar 17, 2021

Add support for AMX instructions #5818

Merged

jwlawson closed this Mar 18, 2021

alexreinking modified the milestone: v12.0.0 May 19, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

WIP: Add support for AMX instructions #5780

WIP: Add support for AMX instructions #5780

Uh oh!

jwlawson commented Mar 2, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

WIP: Add support for AMX instructions #5780

WIP: Add support for AMX instructions #5780

Uh oh!

Conversation

jwlawson commented Mar 2, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants