A Ph.D. student at The University of Hong Kong.
Pinned Loading
-
Visual-AI/3DRS
Visual-AI/3DRS Public[NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding
Python 138
-
Visual-AI/FROSTER
Visual-AI/FROSTER Public[ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition
-
Visual-AI/PruneVid
Visual-AI/PruneVid Public[ACL 2025] PruneVid: Visual Token Pruning for Efficient Video Large Language Models
Python 63
-
SkeletonGCL
SkeletonGCL Public[ICLR 2023] Graph Contrastive Learning for Skeleton-based Action Recognition.
-
Visual-AI/JoVA
Visual-AI/JoVA PublicJoVA: Unified Multimodal Learning for Joint Video-Audio Generation
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



