Skip to content
This repository was archived by the owner on Nov 17, 2023. It is now read-only.
This repository was archived by the owner on Nov 17, 2023. It is now read-only.

cuDNN 7.2 Tensor Core support #12463

@sbodenstein

Description

@sbodenstein

cuDNN 7.2 has simplified the usage of Tensor Cores for convolutions and RNNs, and explicit casts to float16 can be avoided: https://devblogs.nvidia.com/tensor-ops-made-easier-in-cudnn/

Are there plans to expose this in MXNet already? If not, lets discuss a design in this issue.

@szha

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions