About
I am currently a Senior Algorithm Expert at Alibaba, ATH, Qwen Business Unit. We focus on cutting-edge technologies in image/video generation and editing, multimodal generation and unified multimodal models. Our algorithm has been integrated into the Qwen App, serving millions of users.
Before joining Alibaba, I was a Senior Researcher at Tencent AI Lab/ARC Lab. I obtained my Ph.D. in Electronic and Computer Engineering from the Hong Kong University of Science and Technology (HKUST) in 2021, supervised by Prof. Pedro V. Sander. Prior to that, I received my B.E. degree from the Huazhong University of Science and Technology (HUST) in 2017.
π We are actively seeking highly motivated research interns, algorithm engineers, and researchers to join our team working on related topics. Please feel free to reach out via email (xliea@connect.ust.hk) with your CV.
π News
- Mar 2026: I joined Alibaba.
- Mar 2026: 2 papers were accepted to SIGGRAPH 2026.
- Mar 2026: 1 paper was accepted to IJCV.
- Feb 2026: 2 papers were accepted to CVPR 2026.
- Jan 2026: 3 papers were accepted to ICLR 2026.
- Dec 2025: 1 paper was accepted to AAAI 2025.
- Sep 2025: 1 paper was accepted to PAMI.
- Aug 2025: 1 paper was accepted to SIGGRAPH Asia 2025.
- Jun 2025: 1 paper was accepted to ICCV 2025.
- Feb 2025: 4 papers were accepted to CVPR 2025.
Education
- Ph.D, Hong Kong University of Science and Technology, 2021
- B.E., Huazhong University of Science and Technology, 2017
Work Experience
- [2026-now] Senior Algorithm Expert, Qwen Business Unit, Alibaba
- [2021-2025] Senior Researcher, AI Lab/ARC Lab, Tencent
Publications
* equal contribution β # corresponding author / project lead
2026:
ViewWeaver: Geometry-Grounded Neural Generative Rendering for 3D-Aware Image Customization
Yaowei Li, Xiaoyu Li#, Zhaoyang Zhang, Hongxiang Li, Long Chen, Ying Shan, and Yuexian Zou
SIGGRAPH 2026.BoxCtrl: 3D-Aware Visual Prompting for Geometric Image Editing
Feifei Wang, Shiyuan Yang, Xiaoyu Li, Jing Liao
SIGGRAPH 2026.4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation
Shuzhou Yang, Xiaodong Cun, Xiaoyu Li#, Yaowei Li, Jian Zhang
IJCV. [Page] [Paper]IC-Custom: Diverse Image Customization via In-Context Learning
Yaowei Li, Xiaoyu Li#, Zhaoyang Zhang, Yuxuan Bian, Gan Liu, Xinyuan Li, Jiale Xu, Wenbo Hu, Yating Liu, Lingen Li, Jing Cai, Yuexian Zou, Yancheng He, Ying Shan
ICLR 2026. [Page] [Paper] [Code]GenCompositor: Generative Video Compositing with Diffusion Transformer
Shuzhou Yang, Xiaoyu Li#, Xiaodong Cun, Guangzhi Wang, Lingen Li, Ying Shan, Jian Zhang
ICLR 2026. [Page] [Paper] [Code]ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing
Lingen Li, Guangzhi Wang, Zhaoyang Zhang, Yaowei Li, Xiaoyu Li, Qi Dou, Jinwei Gu, Tianfan Xue, Ying Shan
ICLR 2026. [Page] [Paper] [Code]CubeComposer: Spatio-Temporal Autoregressive 4K 360Β° Video Generation from Perspective Video
Lingen Li, Guangzhi Wang, Xiaoyu Li, Zhaoyang Zhang, Qi Dou, Jinwei Gu, Tianfan Xue, Ying Shan
CVPR 2026. [Page] [Paper] [Code]VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
Sixiao Zheng, Minghao Yin, Wenbo Hu, Xiaoyu Li, Ying Shan, Yanwei Fu
CVPR 2026. [Page] [Paper] [Code]
2025:
UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling
Yujiao Jiang, Qingmin Liao, Xiaoyu Li#, Li Ma, Qi Zhang, Chaopeng Zhang, Zongqing Lu, Ying Shan
Knowledge-Based Systems. [Page] [Paper]MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement
Xu He, Xiaoyu Li, Di Kang, Jiangnan Ye, Chaopeng Zhang, Liyang Chen, Xiangjun Gao, Han Zhang, Zhiyong Wu, Haolin Zhuang
AAAI 2025. [Page] [Paper] [Code]ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis
Wangbo Yu, Jinbo Xing, Li Yuan, Wenbo Hu, Xiaoyu Li, Zhipeng Huang, Xiangjun Gao, Tien-Tsin Wong, Ying Shan, Yonghong Tian
PAMI. [Page] [Paper] [Code]BlobCtrl: Taming Controllable Blob for Element-level Image Editing
Yaowei Li, Lingen Li, Zhaoyang Zhang, Xiaoyu Li, Guangzhi Wang, Hongxiang Li, Xiaodong Cun, Ying Shan, Yuexian Zou
SIGGRAPH Asia 2025. [Page] [Paper] [Code]HumanRef-GS: Image-to-3D Human Generation With Reference-Guided Diffusion and 3D Gaussian Splatting
Jingbo Zhang, Xiaoyu Li#, Hongliang Zhong, Qi Zhang, Yanpei Cao, Ying Shan, Jing Liao
TCSVT. [Paper]GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Tian-Xing Xu, Xiangjun Gao, Wenbo Hu, Xiaoyu Li, Song-Hai Zhang, Ying Shan
ICCV 2025. [Page] [Paper] [Code]DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Wenbo Hu, Xiangjun Gao, Xiaoyu Li#, Sijie Zhao, Xiaodong Cun, Yong Zhang, Long Quan, Ying Shan
CVPR 2025. [Page] [Paper] [Code]DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation
Minghong Cai, Xiaodong Cun, Xiaoyu Li#, Wenze Liu, Zhaoyang Zhang, Yong Zhang, Ying Shan, Xiangyu Yue
CVPR 2025. [Page] [Paper] [Code]Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh
Xiangjun Gao, Xiaoyu Li#, Yiyu Zhuang, Qi Zhang, Wenbo Hu, Chaopeng Zhang, Yao Yao, Ying Shan, Long Quan
CVPR 2025. [Page] [Paper] [Code]NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images
Lingen Li, Zhaoyang Zhang, Yaowei Li, Jiale Xu, Wenbo Hu, Xiaoyu Li, Weihao Cheng, Jinwei Gu, Tianfan Xue, Ying Shan
CVPR 2025. [Page] [Paper] [Code]
