PhD student @dvlab-research, CSE@CUHK. Multimodal Large Language Models
- The Chinese University of Hong Kong
- Hong Kong SAR
- https://wcy1122.github.io/
Pinned
- dvlab-research/MGM (Public): Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
- dvlab-research/LLaMA-VID (Public): LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
- dvlab-research/GroupContrast (Public): [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
- dvlab-research/Lyra (Public): Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition
Python 11