About Me
I am a first-year M.S. student in Computer Science at UIUC. I obtained my bachelor's degree from the ZJU-UIUC Institute. My research interests lie mainly in computer vision, especially video action recognition and vision-language models (VLMs).
List of Projects
TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models
We propose TextRegion, a simple, effective, and training-free framework that combines the strengths of image-text models and SAM2 to generate powerful text-aligned region tokens. These tokens enable detailed visual understanding while preserving open-vocabulary capabilities. They can be directly applied to various downstream tasks, including open-world semantic segmentation, referring expression comprehension, and grounding. We conduct extensive evaluations and consistently achieve superior or competitive performance compared to state-of-the-art training-free methods.
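As a rough illustration of what a text-aligned region token could look like, the toy sketch below averages patch features inside a region mask and matches the pooled vector against text embeddings by cosine similarity. This is not the TextRegion implementation: the pooling rule, the function names (`pool_region_token`, `classify_region`), and the hand-written 2-D features are all assumptions made for the example. In practice the masks would come from a segmenter such as SAM2 and the features from a frozen image-text model.

```python
import math

def pool_region_token(patch_features, mask):
    """Average the patch features selected by a binary region mask (assumed pooling rule)."""
    selected = [f for f, m in zip(patch_features, mask) if m]
    if not selected:
        raise ValueError("mask selects no patches")
    dim = len(selected[0])
    return [sum(f[i] for f in selected) / len(selected) for i in range(dim)]

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def classify_region(region_token, text_embeddings):
    """Return the label whose text embedding is most similar to the region token."""
    return max(text_embeddings,
               key=lambda label: cosine(region_token, text_embeddings[label]))

# Toy data (hypothetical): four patches with 2-D features,
# two region masks, and two label embeddings.
patches = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
masks = {"region_a": [1, 1, 0, 0], "region_b": [0, 0, 1, 1]}
text_embeddings = {"cat": [1.0, 0.0], "grass": [0.0, 1.0]}

labels = {name: classify_region(pool_region_token(patches, m), text_embeddings)
          for name, m in masks.items()}
print(labels)  # {'region_a': 'cat', 'region_b': 'grass'}
```

Because the labels are just entries in a dictionary of text embeddings, new categories can be added without retraining, which is the sense in which such tokens stay open-vocabulary.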