Accelerate AI with Intel® AMX
For the latest version of this guide, see Intel® Advanced Matrix Extensions Overview.
Post your questions to the Intel DevHub Discord or the AI Tools forum.
Intel® Advanced Matrix Extensions (Intel® AMX) accelerates deep learning fine-tuning
and inference on Intel® Xeon® Scalable processors. Intel AMX is built into every core of
4th and 5th Gen Intel Xeon Scalable processors (formerly codenamed Sapphire Rapids and
Emerald Rapids) and accelerates the bfloat16 (BF16) and INT8 data types.
Get started with Intel AMX
Intel AMX can deliver up to 10x generational performance gains¹ for AI workloads. It is
enabled in 4th Gen Intel Xeon Scalable processors available through OEMs, partners, or
hosted on cloud service providers.
Cloud instances with Intel AMX include GCP C3 and the C7i, M7i, and R7i instance
families, with more to be announced.
To learn more, see the Tuning Guide for AI on 4th Gen Intel Xeon Scalable Processors.
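Before tuning anything, you may want to confirm that the host actually exposes Intel AMX. A minimal sketch, assuming a Linux system whose kernel reports the amx_tile, amx_bf16, and amx_int8 CPU flags in /proc/cpuinfo:

# Check /proc/cpuinfo for the Intel AMX feature flags (Linux only).
with open("/proc/cpuinfo") as f:
    flags = f.read()

for feature in ("amx_tile", "amx_bf16", "amx_int8"):
    status = "available" if feature in flags else "not found"
    print(f"{feature}: {status}")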
Preparing the model for Intel AMX
For Intel AMX to accelerate your deep learning model, the model needs to be in BF16 or
INT8 format. You can convert it using auto-mixed precision for BF16 or quantization for
INT8, either natively in your framework (e.g., PyTorch* or TensorFlow*) or with
open-source tools from Intel that offer additional features.
BF16 conversion is straightforward and generally preserves accuracy. INT8 is a more
efficient data type, and Intel's open-source compression tools help preserve accuracy
during quantization.
BF16 on PyTorch
Example & PyTorch documentation. For LLMs, see this example.
Install the extension, then optimize the model for BF16 inference:

pip install intel-extension-for-pytorch

import intel_extension_for_pytorch as ipex

model = ipex.optimize(model, dtype=torch.bfloat16)
with torch.no_grad():
    with torch.cpu.amp.autocast():
        model(data)
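For context, here is a self-contained version of the recipe above; the toy SmallNet model and the random input are illustrative placeholders, not part of the guide:

import torch
import torch.nn as nn
import intel_extension_for_pytorch as ipex

# Illustrative toy model (not from the guide); any eager-mode nn.Module
# follows the same optimize/autocast pattern.
class SmallNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 16, kernel_size=3, padding=1)
        self.fc = nn.Linear(16, 10)

    def forward(self, x):
        x = torch.relu(self.conv(x)).mean(dim=(2, 3))  # global average pool
        return self.fc(x)

model = SmallNet().eval()
data = torch.randn(1, 3, 224, 224)

# Rewrite weights and fuse ops for BF16 inference; on 4th/5th Gen Xeon the
# BF16 kernels dispatch to Intel AMX automatically.
model = ipex.optimize(model, dtype=torch.bfloat16)

with torch.no_grad():
    with torch.cpu.amp.autocast():
        output = model(data)

print(output.shape)  # torch.Size([1, 10])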
BF16 on TensorFlow
Get Started Guide & TensorFlow documentation
Convert by setting an environment variable (TensorFlow v2.13+):

export TF_SET_ONEDNN_FPMATH_MODE=BF16
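The same switch can be set from Python; a minimal sketch, assuming the variable is exported before TensorFlow is imported and using a throwaway Keras model for illustration:

import os

# Set before importing TensorFlow so oneDNN reads it at initialization (TF 2.13+).
os.environ["TF_SET_ONEDNN_FPMATH_MODE"] = "BF16"

import numpy as np
import tensorflow as tf

# Illustrative FP32 Keras model (not from the guide); existing models need
# no code changes to benefit from BF16 execution on Intel AMX.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(64,)),
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dense(10),
])

out = model(np.random.rand(1, 64).astype("float32"))
print(out.shape)  # (1, 10)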
Automatic BF16 with OpenVINO Runtime
Intel® Distribution of OpenVINO™ toolkit is an open-source AI deployment library that
will automatically convert eligible models to BF16 when Intel AMX is present (v2023+).
OpenVINO can take in TensorFlow, PyTorch, and ONNX models and add optimizations
for accelerated, centralized deployment. See examples here.
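As a rough sketch of the deployment flow, assuming the OpenVINO Runtime Python API (2023+) and a placeholder model path:

from openvino.runtime import Core

core = Core()

# "model.xml" is a placeholder path to an OpenVINO IR; ONNX models can be
# read directly, and TensorFlow/PyTorch models can be converted beforehand.
model = core.read_model("model.xml")

# With OpenVINO 2023+ the CPU plugin selects BF16 execution for eligible
# layers automatically when Intel AMX is present; no explicit cast is needed.
compiled = core.compile_model(model, "CPU")

print(compiled.inputs, compiled.outputs)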
INT8 Quantization
You can convert your model to the optimized INT8 format within its native framework
(PyTorch, TensorFlow, ONNX Runtime*, etc.). Intel also provides open-source tools
(Intel Neural Compressor, Hugging Face* Optimum, and OpenVINO NNCF) for
quantization with additional features such as accuracy-aware tuning, which keeps the
accuracy loss within a threshold you set.
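A minimal sketch of post-training INT8 quantization, assuming the Intel Neural Compressor 2.x Python API; model, calib_dataloader, and eval_func are placeholders for your FP32 model, a calibration data loader, and an accuracy-evaluation callable:

from neural_compressor import PostTrainingQuantConfig, quantization

# Accuracy-aware post-training quantization: the tuner searches for an INT8
# configuration whose accuracy stays within the configured tolerance.
conf = PostTrainingQuantConfig()
q_model = quantization.fit(
    model,                              # FP32 model (placeholder)
    conf=conf,
    calib_dataloader=calib_dataloader,  # calibration data (placeholder)
    eval_func=eval_func,                # returns an accuracy metric (placeholder)
)
q_model.save("./int8_model")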
Notices & Disclaimers: Performance varies by use, configuration, and other factors. Learn more at [Link]/PerformanceIndex.
Performance results are based on testing as of dates shown in configurations and may not reflect all publicly available updates. See
backup for configuration details. No product or component can be absolutely secure. Your costs and results may vary. Intel
technologies may require enabled hardware, software, or service activation. © Intel Corporation. Intel, the Intel logo, and other Intel
marks are trademarks of Intel Corporation or its subsidiaries. *Other names and brands may be claimed as the property of others.
Performance Claims
¹ [Link]