Figure from:Neural Network Quantization with AI Model Efficiency Toolkit (AIMET)

Figure 1.1: High-level view of AIMET architecture.

Figure 3.2: AIMET quantization simulation workflow.  Workflow Figure 3.2 shows the workflow for using AIMET quantization simulation to simulate on-target quantized accuracy.

Figure 4.5: PTQ debugging flow chart. Error is the difference between floating-point and quantized model accuracy.

Figure 5.2: Quantization-aware training pipeline. The blue boxes represent the steps and the turquoise boxes recommended choices and grey box is an optional step.