`MLActivation` currently has two uses:
1. With operators like `batchNormalization` and `conv2d`, as a hint that this activation may be fusable with the given operator
2. With recurrent operators like `lstm` and `gru`, as activations applied to gates within the given recurrent unit
| Operator | Recurrent? | Purpose |
|---|---|---|
| `batchNormalization` | No | Maybe fusable |
| `conv2d` | No | Maybe fusable |
| `convTranspose2d` | No | Maybe fusable |
| `gru` | Yes | Applied to gate |
| `gruCell` | Yes | Applied to gate |
| `lstm` | Yes | Applied to gate |
| `lstmCell` | Yes | Applied to gate |
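For reference, the two uses look something like this in script. This is a sketch against the WebNN API shape at the time of this issue — the `activation`/`activations` option names and the `gru()` signature are taken from the then-current spec, the parameter names are mine, and it only runs in a WebNN-enabled browser:

```javascript
// Sketch only: `builder` is an MLGraphBuilder from a WebNN-enabled browser.
function buildBoth(builder, convInput, filter, gruInput, weight, recurrentWeight, steps, hiddenSize) {
  // Use (1): an MLActivation passed to conv2d as a "maybe fusable" hint.
  const conv = builder.conv2d(convInput, filter, { activation: builder.relu() });
  // Use (2): MLActivations applied to the gates inside each GRU step
  // (sigmoid/tanh are also the spec defaults).
  const outputs = builder.gru(gruInput, weight, recurrentWeight, steps, hiddenSize, {
    activations: [builder.sigmoid(), builder.tanh()],
  });
  return { conv, outputs };
}
```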
I have some thoughts on (2), but I'll leave that to another issue. Let's focus on (1) here :)
My claim is that there's no benefit to passing `MLActivation`s as parameters to `MLOperand`s for the purpose of op fusion. Here's why:
- False positives: For any given permutation of operator, activation, and backend, op fusion may not be possible (in fact, fusion is quite unlikely for most combinations)
  - CoreML does not natively support fusing activations with any of these operators
  - DirectML only supports fusing some activations with some operators. There's an existing Chromium bug to un-fuse `MLActivation`s from their respective `MLOperand` if the combo is not supported by the given version of DML
- False negatives: For any given operator which does not currently take an `MLActivation` in the WebNN spec, it may in fact be fusable with its input or output
  - DML claims to support op fusion for `DML_OPERATOR_ELEMENT_WISE_ADD1` and `DML_OPERATOR_GEMM`, which roughly map to WebNN's `add` and `gemm` operators
  - TFLite supports fusing various operators as well
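In other words, a backend that can fuse will do so in its own lowering pass, regardless of what WebNN's operator signatures say. A minimal sketch of such a pass over a made-up graph representation — the node shape, the IDs, and the `canFuse` predicate are all hypothetical, not Chromium's actual code:

```javascript
// Hypothetical graph IR: each node is { id, op, inputs }, and a fused node
// additionally carries an `activation` field. Illustrative only.
function fuseActivations(nodes, canFuse) {
  // Collect the consumers of each node's output.
  const consumers = new Map();
  for (const node of nodes) {
    for (const input of node.inputs) {
      consumers.set(input, (consumers.get(input) ?? []).concat([node]));
    }
  }
  const removed = new Set();
  for (const node of nodes) {
    const users = consumers.get(node.id) ?? [];
    // Fuse only when the activation is the node's sole consumer and the
    // backend reports this (operator, activation) combo as supported.
    if (users.length === 1 && canFuse(node.op, users[0].op)) {
      const act = users[0];
      node.activation = act.op;
      node.id = act.id; // fused node now produces the activation's output
      removed.add(act);
    }
  }
  return nodes.filter((n) => !removed.has(n));
}

// Example: fuse a WebNN-style `add` with a following `relu`, even though
// WebNN's `add` takes no MLActivation parameter.
const fused = fuseActivations(
  [
    { id: 'c', op: 'add', inputs: ['a', 'b'] },
    { id: 'd', op: 'relu', inputs: ['c'] },
  ],
  (op, activation) => op === 'add' && activation === 'relu'
);
// fused: [{ id: 'd', op: 'add', inputs: ['a', 'b'], activation: 'relu' }]
```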
What this means in practice is:
- Implementations wrapping backends which can't fuse operators, or can't fuse some operators with some activations, must trivially break apart the `MLOperand` and its `MLActivation` into what's effectively just two `MLOperand`s connected to each other (as Chromium's CoreML backend currently does): input → operator → activation → output
- Implementations wrapping backends which can fuse operators must do an optimization pass anyways to fuse operators which do not have `MLActivation` parameters in WebNN (as Chromium's DML backend currently does)
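The trivial break-apart step in the first case could look like this sketch, using a made-up graph representation (hypothetical, not Chromium's actual CoreML code): each node carrying a fused activation is rewritten into two plain nodes chained as input → operator → activation → output.

```javascript
// Hypothetical inverse pass for backends that can't fuse: split each node
// carrying an `activation` field back into two chained nodes.
function unfuseActivations(nodes) {
  const result = [];
  for (const node of nodes) {
    if (node.activation === undefined) {
      result.push(node);
      continue;
    }
    const { activation, ...rest } = node;
    const innerId = `${node.id}_prefused`; // fresh edge between the two nodes
    result.push({ ...rest, id: innerId }); // the original operator
    result.push({ id: node.id, op: activation, inputs: [innerId] }); // the activation
  }
  return result;
}

const split = unfuseActivations([
  { id: 'd', op: 'conv', inputs: ['x', 'w'], activation: 'relu' },
]);
// split: [{ id: 'd_prefused', op: 'conv', inputs: ['x', 'w'] },
//         { id: 'd', op: 'relu', inputs: ['d_prefused'] }]
```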
Whether a given operator can be fused with a given activation is a very backend-specific quirk. Presumably we don't want to plumb through a new `MLActivation` parameter to the web for every operator which any backend decides it can now fuse with some activation! This seems best left as an implementation detail, handled either by the user agent (as described above) or by the framework (who knows how much op fusion is happening in Core ML under the hood?! 🤔)
Thoughts?