Code for the paper "CalibQuant: 1-Bit KV Cache Quantization for Multimodal LLMs"
Authors: Insu Han, Zeliang Zhang, Zhiyuan Wang, Yifan Zhu, Susan Liang, Jiani Liu, Haiting Lin, Mingjie Zhao, Chenliang Xu, Kun Wan, Wentian Zhao
This repository provides a guide for setting up and running InternVL with KVcacheQuant for efficient inference.
- Install the required packages (e.g., InternVL, Triton): `pip install internvl triton==3.2.0`
- Download the InternVL2.5-26B/8B model from HuggingFace.
- Change the batch size (line 104 in `infer.py`).
- Set the bit number (line 13 in `calibquant.py`).
- Run inference: `python infer.py`
- Ensure all dependencies are installed before running the script.
- Adjust parameters to suit your hardware for the best performance.
- If you encounter issues, refer to the official documentation or repository.
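For intuition about what the bit number controls, here is a minimal sketch of uniform min-max KV cache quantization in NumPy. This is only an illustration of b-bit round-trip quantization of a key/value tensor; it is not the paper's calibrated scheme, whose details live in `calibquant.py` (function names and tensor shapes below are made up for the example):

```python
import numpy as np

def quantize_kv(x: np.ndarray, bits: int = 1):
    """Uniform per-token min-max quantization (illustrative sketch only).

    `x` has shape (..., head_dim); with bits=1, each entry is mapped to
    one of two levels {0, 1} per token.
    """
    levels = 2 ** bits - 1                       # number of steps in the grid
    xmin = x.min(axis=-1, keepdims=True)         # per-token range
    xmax = x.max(axis=-1, keepdims=True)
    scale = np.maximum(xmax - xmin, 1e-8) / levels
    q = np.clip(np.round((x - xmin) / scale), 0, levels).astype(np.uint8)
    return q, scale, xmin

def dequantize_kv(q: np.ndarray, scale: np.ndarray, xmin: np.ndarray):
    """Map the integer codes back to (approximate) float values."""
    return q.astype(scale.dtype) * scale + xmin

# Example: quantize a fake key cache to 1 bit and reconstruct it.
k = np.random.randn(2, 8, 64)                    # (heads, tokens, head_dim)
q, scale, xmin = quantize_kv(k, bits=1)
k_hat = dequantize_kv(q, scale, xmin)
```

With `bits=1` each token's entries collapse to its per-token minimum or maximum; the reconstruction error is bounded by half the quantization step, which is what the calibration in the paper is designed to reduce further.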
@article{han2025calibquant,
title={CalibQuant: 1-Bit KV Cache Quantization via Calibration for Multimodal LLMs},
author={Han, Insu and Zhang, Zeliang and Zhu, Yifan and Liang, Susan and Wang, Zhiyuan and Liu, Jiani and Lin, Haiting and Zhao, Mingjie and Xu, Chenliang and Wan, Kun and Zhao, Wentian},
journal={arXiv preprint arXiv:2502.14882},
year={2025}
}