Technology
I'm interested in computer vision and graphics, generative AI, and multimodal machine learning.
Recent News
- 9 papers accepted to ICML 2026
- 8 papers accepted to CVPR 2026
- 14 papers accepted to ICLR 2026
- 8 papers accepted to NeurIPS 2025
- 8 papers accepted to ICCV 2025
- 7 papers accepted to CVPR 2025
- 6 papers accepted to SIGGRAPH & SIG Asia 2025
Selected Publications
UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation
Jiehui Huang, Yuechen Zhang, Xu He, Yuan Gao, Zhi Cen, Bin Xia, Yan Zhou, Xin Tao, Pengfei Wan, Jiaya Jia
CVPR, 2026
AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration
Xinlong Chen, Yue Ding, Weihong Lin, Jingyun Hua, Linli Yao, Yang Shi, Bozhou Li, Yuanxing Zhang, Qiang Liu, Pengfei Wan, Liang Wang
ICLR, 2026
ReCamMaster: Camera-Controlled Generative Rendering from a Single Video
Jianhong Bai, Menghan Xia, Xiao Fu, Xintao Wang, Lianrui Mu, Jinwen Cao, Zuozhu Liu, Haoji Hu, Xiang Bai, Pengfei Wan, Di Zhang
ICCV Oral,
Best Paper Award Finalist, 2025
MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding
Zhicheng Zhang, Wuyou Xia, Chenxi Zhao, Yan Zhou, Xiaoqiang Liu, Yongjie Zhu, Wenyu Qin, Pengfei Wan, Di Zhang, Jufeng Yang
ICML Spotlight, 2025
DVIS++: Improved Decoupled Framework for Universal Video Segmentation
Tao Zhang, Xingye Tian, Yikang Zhou, Shunping Ji, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Yu Wu
TPAMI, 2025
I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models
Xun Guo, Mingwu Zheng, Liang Hou, Yuan Gao, Yufan Deng, Pengfei Wan, Di Zhang, Yufan Liu, Weiming Hu, Zhengjun Zha, Haibin Huang, Chongyang Ma
SIGGRAPH, 2024
Miscellanea
Talks
-
Presentation on "An Introduction to Kling and Our Research towards More Powerful Video Generation Models",
Tutorial Session: From Video Generation to World Model, CVPR, Nashville, 2025
- Virtual panel discussion on "Video Generation Models", Project Odyssey AI Film Gala, San Francisco, 2024
-
Roundtable forum on "The Innovations and Challenges of the Next-generation Artificial Intelligence Architecture",
Plenary Session: Scientific Frontier, World Artificial Intelligence Conference (WAIC), Shanghai, 2024
-
Presentation on "Kling Video Generation Models" & roundtable forum on "Multimodality, AGI, On-device AI",
BAAI Conference, Beijing, 2024
-
Presentation on "Multimodal Digital Human: Technological Innovations and Industrial Applications",
Opening Plenary, China Digital Human Conference, Beijing, 2024
Services
Reviewer/Program Committee Member of CVPR, ICCV, NeurIPS, ICLR, AAAI, ACL, etc.