About Me
Hi! I am a first-year PhD student at MMLab, The Chinese University of Hong Kong, supervised by Prof. Hongsheng Li and Prof. Xiaogang Wang. Prior to this, I received my Master's degree from the AAIS, Peking University in 2025, where I am a member of Lanco Lab, Institute of Computational Linguistics (ICL), supervised by Prof. Xu Sun. I obtained my Bachelor's degree from the School of Data Science, Fudan University in 2022.
My research interests lie in multimodal learning, like visual understanding, visual representation, image captioning and so on. I also have interest in some multimodal generation tasks, like diffusion models, and their applications in text-guided generation.
News
- [2025.10] Release Techinical report for SAIL-Embedding, tailored for Douyin Recommendation.
- [2025.08] Joining MMLab@CUHK as a PhD student.
- [2025.08] One paper accepted by EMNLP 2025 – RICO.
- [2025.07] One paper accepted by ACM MM 2025.
- [2025.06] One paper accepted by ACL 2025.
- [2025.02] One paper accepted by CVPR 2025 – VidTwin.
- [2024.12] One paper accepted by AAAI 2025.
- [2024.12] Two papers accepted by FinNLP@COLING 2025.
- [2024.06] We release the InstructAvatar project page!
- [2024.05] One paper accepted by ACL 2024 (Findings).
- [2024.03] One paper accepted by NAACL 2024 – LaDiC.
- [2024.01] One paper accepted by ICLR 2024 – GAIA.
- [2023.10] We release the GAIA demo!
- [2023.10] One paper accepted by FMDM@NeurIPS 2023.
- [2023.05] Starting internship at Microsoft Research Asia ML Group.
- [2022.09] Joining Lanco Lab, Peking University.
Education
CUHK
PKU
Fudan
Selected Publications Full List →
Internship
ByteDance
Kling (可灵) Kuaishou Technology
MSRA
Academic Service
- PKU Class 2024 Spring: Introduction to Large Language Models
- ELEG5760: Machine Learning for Multimedia Applications
- AIMS5710: Deep Learning Fundamentals and Theories