Skip to content

IVGSZ/Flash-VStream

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Flash-VStream Logo

[ICCV 2025] Flash-VStream: Efficient Real-Time Understanding for Long Video Streams

Haoji Zhang*, Yiqin Wang*, Yansong Tang, Yong Liu, Jiashi Feng, Xiaojie Jin✉†

*Equally contributing first authors, Correspondence, Project Leader

Work done when interning at Bytedance.

We proposed Flash-VStream, an efficient VLM with a novel Flash Memory mechanism that enables real-time understanding and Q&A of extremely long video streams. Our model achieves outstanding accuracy and efficiency on EgoSchema, MLVU, LVBench, MVBench and Video-MME Benchmarks.

News

Contents

Flash-VStream-Qwen

See Flash-VStream-Qwen/README.md.

Flash-VStream-LLaVA

See Flash-VStream-LLaVA/README.md.

Citation

If you find this project useful in your research, please consider citing:

@article{zhang2025flashvstream,
    title={Flash-VStream: Efficient Real-Time Understanding for Long Video Streams}, 
    author={Haoji Zhang and Yiqin Wang and Yansong Tang and Yong Liu and Jiashi Feng and Xiaojie Jin},
    journal={arXiv preprint arXiv:2506.23825},
    year={2025},
}
@article{zhang2024flashvstream,
    title={Flash-vstream: Memory-based real-time understanding for long video streams},
    author={Zhang, Haoji and Wang, Yiqin and Tang, Yansong and Liu, Yong and Feng, Jiashi and Dai, Jifeng and Jin, Xiaojie},
    journal={arXiv preprint arXiv:2406.08085},
    year={2024}
}

Acknowledgement

We would like to thank the following repos for their great work:

  • This work is built upon the LLaVA.
  • This work utilizes LLMs from Vicuna.
  • Some code is borrowed from LLaMA-VID.
  • We perform video-based evaluation from Video-ChatGPT.

License

Code License

This project is licensed under the Apache-2.0 License.

About

This is the official implementation of ICCV 2025 "Flash-VStream: Efficient Real-Time Understanding for Long Video Streams"

Resources

License

Stars

Watchers

Forks