About

I am currently a Senior Algorithm Expert at Alibaba, ATH, Qwen Business Unit. We focus on cutting-edge technologies in image/video generation and editing, multimodal generation, and unified multimodal models. Our algorithm has been integrated into the Qwen App, serving millions of users.

Before joining Alibaba, I was a Senior Researcher at Tencent AI Lab/ARC Lab. I obtained my Ph.D. in Electronic and Computer Engineering from the Hong Kong University of Science and Technology (HKUST) in 2021, supervised by Prof. Pedro V. Sander. Prior to that, I received my B.E. degree from the Huazhong University of Science and Technology (HUST) in 2017.

👋 We are actively seeking highly motivated research interns, algorithm engineers, and researchers to join our team working on related topics. Please feel free to reach out via email (xliea@connect.ust.hk) with your CV.


📌 News

  • Mar 2026: I joined Alibaba.
  • Mar 2026: 2 papers were accepted to SIGGRAPH 2026.
  • Mar 2026: 1 paper was accepted to IJCV.
  • Feb 2026: 2 papers were accepted to CVPR 2026.
  • Jan 2026: 3 papers were accepted to ICLR 2026.
  • Dec 2025: 1 paper was accepted to AAAI 2025.
  • Sep 2025: 1 paper was accepted to PAMI.
  • Aug 2025: 1 paper was accepted to SIGGRAPH Asia 2025.
  • Jun 2025: 1 paper was accepted to ICCV 2025.
  • Feb 2025: 4 papers were accepted to CVPR 2025.


Education

  • Ph.D., Hong Kong University of Science and Technology, 2021
  • B.E., Huazhong University of Science and Technology, 2017


Work Experience

  • [2026-now] Senior Algorithm Expert, Qwen Business Unit, Alibaba
  • [2021-2025] Senior Researcher, AI Lab/ARC Lab, Tencent


Publications

* equal contribution   # corresponding author / project lead

2026:

  • ViewWeaver: Geometry-Grounded Neural Generative Rendering for 3D-Aware Image Customization
    Yaowei Li, Xiaoyu Li#, Zhaoyang Zhang, Hongxiang Li, Long Chen, Ying Shan, Yuexian Zou
    SIGGRAPH 2026.

  • BoxCtrl: 3D-Aware Visual Prompting for Geometric Image Editing
    Feifei Wang, Shiyuan Yang, Xiaoyu Li, Jing Liao
    SIGGRAPH 2026.

  • 4DVD: Cascaded Dense-view Video Diffusion Model for High-quality 4D Content Generation
    Shuzhou Yang, Xiaodong Cun, Xiaoyu Li#, Yaowei Li, Jian Zhang
    IJCV. [Page] [Paper]

  • IC-Custom: Diverse Image Customization via In-Context Learning
    Yaowei Li, Xiaoyu Li#, Zhaoyang Zhang, Yuxuan Bian, Gan Liu, Xinyuan Li, Jiale Xu, Wenbo Hu, Yating Liu, Lingen Li, Jing Cai, Yuexian Zou, Yancheng He, Ying Shan
    ICLR 2026. [Page] [Paper] [Code]

  • GenCompositor: Generative Video Compositing with Diffusion Transformer
    Shuzhou Yang, Xiaoyu Li#, Xiaodong Cun, Guangzhi Wang, Lingen Li, Ying Shan, Jian Zhang
    ICLR 2026. [Page] [Paper] [Code]

  • ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing
    Lingen Li, Guangzhi Wang, Zhaoyang Zhang, Yaowei Li, Xiaoyu Li, Qi Dou, Jinwei Gu, Tianfan Xue, Ying Shan
    ICLR 2026. [Page] [Paper] [Code]

  • CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video
    Lingen Li, Guangzhi Wang, Xiaoyu Li, Zhaoyang Zhang, Qi Dou, Jinwei Gu, Tianfan Xue, Ying Shan
    CVPR 2026. [Page] [Paper] [Code]

  • VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
    Sixiao Zheng, Minghao Yin, Wenbo Hu, Xiaoyu Li, Ying Shan, Yanwei Fu
    CVPR 2026. [Page] [Paper] [Code]

2025:

  • UV Gaussians: Joint Learning of Mesh Deformation and Gaussian Textures for Human Avatar Modeling
    Yujiao Jiang, Qingmin Liao, Xiaoyu Li#, Li Ma, Qi Zhang, Chaopeng Zhang, Zongqing Lu, Ying Shan
    Knowledge-Based Systems. [Page] [Paper]

  • MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement
    Xu He, Xiaoyu Li, Di Kang, Jiangnan Ye, Chaopeng Zhang, Liyang Chen, Xiangjun Gao, Han Zhang, Zhiyong Wu, Haolin Zhuang
    AAAI 2025. [Page] [Paper] [Code]

  • ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis
    Wangbo Yu, Jinbo Xing, Li Yuan, Wenbo Hu, Xiaoyu Li, Zhipeng Huang, Xiangjun Gao, Tien-Tsin Wong, Ying Shan, Yonghong Tian
    PAMI. [Page] [Paper] [Code]

  • BlobCtrl: Taming Controllable Blob for Element-level Image Editing
    Yaowei Li, Lingen Li, Zhaoyang Zhang, Xiaoyu Li, Guangzhi Wang, Hongxiang Li, Xiaodong Cun, Ying Shan, Yuexian Zou
    SIGGRAPH Asia 2025. [Page] [Paper] [Code]

  • HumanRef-GS: Image-to-3D Human Generation With Reference-Guided Diffusion and 3D Gaussian Splatting
    Jingbo Zhang, Xiaoyu Li#, Hongliang Zhong, Qi Zhang, Yanpei Cao, Ying Shan, Jing Liao
    TCSVT. [Paper]

  • GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
    Tian-Xing Xu, Xiangjun Gao, Wenbo Hu, Xiaoyu Li, Song-Hai Zhang, Ying Shan
    ICCV 2025. [Page] [Paper] [Code]

  • DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
    Wenbo Hu, Xiangjun Gao, Xiaoyu Li#, Sijie Zhao, Xiaodong Cun, Yong Zhang, Long Quan, Ying Shan
    CVPR 2025. [Page] [Paper] [Code]

  • DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation
    Minghong Cai, Xiaodong Cun, Xiaoyu Li#, Wenze Liu, Zhaoyang Zhang, Yong Zhang, Ying Shan, Xiangyu Yue
    CVPR 2025. [Page] [Paper] [Code]

  • Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh
    Xiangjun Gao, Xiaoyu Li#, Yiyu Zhuang, Qi Zhang, Wenbo Hu, Chaopeng Zhang, Yao Yao, Ying Shan, Long Quan
    CVPR 2025. [Page] [Paper] [Code]

  • NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images
    Lingen Li, Zhaoyang Zhang, Yaowei Li, Jiale Xu, Wenbo Hu, Xiaoyu Li, Weihao Cheng, Jinwei Gu, Tianfan Xue, Ying Shan
    CVPR 2025. [Page] [Paper] [Code]