Pengfei Wan


I'm the head of the Visual Generation and Interaction Center (aka the Kling Team) at Kuaishou Technology. I used to be the director of AI department (MT Lab) at Meitu, Inc. I did my Ph.D. at the ECE Department of HKUST, and B.E. at the EEIS Department of USTC.

My long-term passion is building intelligent multimodal content creation algorithms and systems, both for people (to express and live better) and for machines (to simulate and evolve better). My recent focus is on video generation, multimodal interaction, and related topics. I am continuously seeking outstanding talents to join our team. Drop me an email if you are interested.

Kling AI  /  Google Scholar  /  GitHub  /  Email

profile photo

Technology

I'm interested in computer vision and graphics, generative AI, and multimodal machine learning. Below is a list of selected publications in recent years.

Context as Memory: Scene-consistent Interactive Long Video Generation with Memory Retrieval
Jiwen Yu, Jianhong Bai, Yiran Qin, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu
SIGGRAPH Asia, 2025
ReCamMaster: Camera-Controlled Generative Rendering from a Single Video
Jianhong Bai, Menghan Xia, Xiao Fu, Xintao Wang, Lianrui Mu, Jinwen Cao, Zuozhu Liu, Haoji Hu, Xiang Bai, Pengfei Wan, Di Zhang
ICCV Oral, 2025
Towards Precise Scaling Laws for Video Diffusion Transformers
Yuanyang Yin, Yaqi Zhao, Mingwu Zheng, Ke Lin, Jiarong Ou, Rui Chen, Victor Shea-Jay Huang, Jiahao Wang, Xin Tao, Pengfei Wan, Di Zhang, Baoqun Yin, Wentao Zhang, Kun Gai
CVPR, 2025
MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding
Zhicheng Zhang, Wuyou Xia, Chenxi Zhao, Yan Zhou, Xiaoqiang Liu, Yongjie Zhu, Wenyu Qin, Pengfei Wan, Di Zhang, Jufeng Yang
ICML Spotlight, 2025
3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation
Xiao Fu, Xian Liu, Xintao Wang, Sida Peng, Menghan Xia, Xiaoyu Shi, Ziyang Yuan, Pengfei Wan, Di Zhang, Dahua Lin
ICLR, 2025
DVIS++: Improved Decoupled Framework for Universal Video Segmentation
Tao Zhang, Xingye Tian, Yikang Zhou, Shunping Ji, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Yu Wu
TPAMI, 2025
Agent Attention: On the Integration of Softmax and Linear Attention
Dongchen Han, Tianzhu Ye, Yizeng Han, Zhuofan Xia, Siyuan Pan, Pengfei Wan, Shiji Song, Gao Huang
ECCV, 2024
VideoTetris: Towards Compositional Text-to-Video Generation
Ye Tian, Ling Yang, Haotian Yang, Yuan Gao, Yufan Deng, Jingmin Chen, Xintao Wang, Zhaochen Yu, Xin Tao, Pengfei Wan, Di Zhang, Bin Cui
NeurIPS, 2024
I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models
Xun Guo, Mingwu Zheng, Liang Hou, Yuan Gao, Yufan Deng, Pengfei Wan, Di Zhang, Yufan Liu, Weiming Hu, Zhengjun Zha, Haibin Huang, Chongyang Ma
SIGGRAPH, 2024
Augmentation-Aware Self-Supervision for Data-Efficient GAN Training
Liang Hou, Qi Cao, Yige Yuan, Songtao Zhao, Chongyang Ma, Siyuan Pan, Pengfei Wan, Zhongyuan Wang, Huawei Shen, Xueqi Cheng
NeurIPS, 2023
Towards Practical Capture of High-fidelity Relightable Avatars
Haotian Yang, Mingwu Zheng, Wanquan Feng, Haibin Huang, Yu-Kun Lai, Pengfei Wan, Zhongyuan Wang, Chongyang Ma
SIGGRAPH Asia, 2023
FEditNet: Few-Shot Editing of Latent Semantics in GAN Spaces
Mengfei Xia, Yezhi Shu, Yuji Wang, Yu-Kun Lai, Qiang Li, Pengfei Wan, Zhongyuan Wang, Yong-Jin Liu
AAAI Oral, 2023
PMP-Net++: Point Cloud Completion by Transformer-Enhanced Multi-Step Point Moving Paths
Xin Wen, Peng Xiang, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu
TPAMI, 2022
Debiased Self-Training for Semi-Supervised Learning
Baixu Chen, Junguang Jiang, Ximei Wang, Pengfei Wan, Jianmin Wang, Mingsheng Long
NeurIPS Oral, 2022
Snowflake Point Deconvolution for Point Cloud Completion and Generation with Skip-Transformer
Peng Xiang, Xin Wen, Yu-Shen Liu, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Zhizhong Han
TPAMI, 2022

Miscellanea

Talks

Presentation on "An Introduction to Kling and Our Research towards More Powerful Video Generation Models",
Tutorial Session: From Video Generation to World Model, CVPR, Nashville, 2025

Virtual panel discussion on "Video Generation Models", Project Odyssey AI Film Gala, San Francisco, 2024

Roundtable forum on "The Innovations and Challenges of the Next-generation Artificial Intelligence Architecture",
Plenary Session: Scientific Frontier, World Artificial Intelligence Conference (WAIC), Shanghai, 2024

Presentation on "Kling Video Generation Models" & roundtable forum on "Multimodality, AGI, On-device AI",
BAAI Conference, Beijing, 2024

Presentation on "Multimodal Digital Human: Technological Innovations and Industrial Applications",
Opening Plenary, China Digital Human Conference, Beijing, 2024

Services

Reviewer/Program Committee Member of CVPR, ICCV, NeurIPS, ICLR, AAAI, TIP, etc.

This template is a modification of Jon Barron's website