publication

Conference Proceedings

Conference Articles

  1. FoleyDesigner: Immersive Stereo Foley Generation with Precise Spatio-Temporal Alignment for Film Clips
    Mengtian Li ,  Kunyan Dai ,  Yi Ding ,  Ruobing Ni ,  Ying Zhang ,  Wenwu Wang ,  and  Zhifeng Xie
    In CVPR , 2026
  2. EditMaster: Bridging Text instruction and Visual Example for Multimodal guided Image Editing
    Jiahui Zhang ,  Mengtian Li* ,  Jiewei Tang ,  Junyu Deng ,  Siyu Tian ,  Xiang Liu ,  Meng Zhang ,  Guangnan Ye* ,  and  Yu-Gang Jiang
    In ACMMM , 2025
  3. FilmSceneDesigner: Chaining Set Design for Procedural Film Scene Generation
    Zhifeng Xie ,  Keyi Zhang ,  Yiye Yan ,  Yuling Guo ,  Fan Yang ,  Jiting Zhou ,  and  Mengtian Li*
    In AAAI , 2026
  4. GTAD: Global Temporal Aggregation Denoising Learning for 3D Semantic Occupancy Prediction
    Tianhao Li ,  Yang Li ,  Mengtian Li ,  Yisheng Deng ,  and  Weifeng Ge
    In IROS , 2025
  5. CustAny: Customizing Anything from A Single Example
    Lingjie Kong ,  Kai Wu ,  Chengming Xu ,  Xiaobin Hu ,  Wenhui Han ,  Jinlong Peng ,  Donghao Luo ,  Mengtian Li ,  Jiangning Zhang ,  Chengjie Wang ,  and  others
    In CVPR (Oral) , 2025
  6. FilmComposer: LLM-Driven Music Production for Silent Film Clips
    Zhifeng Xie ,  Qile He ,  Youjia Zhu ,  Qiwei He ,  and  Mengtian Li*
    In CVPR , 2025
  7. StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts
    Zhaoxing Gan ,  Mengtian Li* ,  Ruhua Chen ,  Zhongxia Ji ,  Sichen Guo ,  Huanling Hu ,  Guangnan Ye* ,  and  Zuo Hu
    In CVPR , 2025
  8. LMTalker: Sparse Landmark-guided Gaussian Splatting for High-fidelity Talking Head Synthesis
    Zhifeng Xie ,  Zhiwen Jiang ,  Xuemin Lei ,  and  Mengtian Li*
    In ICASSP , 2025
  9. Knowledge Transfer Across Modalities for Weakly Supervised Point Cloud Semantic Segmentation
    Zihan Wang ,  Yunhang Shen ,  Mengtian Li ,  Ke Li ,  Xing Sun ,  Shaohui Lin ,  and  Lizhuang Ma
    In ICASSP , 2025
  10. HieraFashDiff: Hierarchical Fashion Design with Multi-stage Diffusion Models
    Zhifeng Xie ,  Hao Li ,  Huiming Ding ,  Mengtian Li ,  Xinhan Di ,  and  Ying Cao
    In AAAI , 2024
  11. SonicVisionLM: Playing Sound with Vision Language Models
    Zhifeng Xie ,  Shengye Yu ,  Qile He ,  and  Mengtian Li*
    In CVPR , 2024
  12. Hyperspherical learning in multi-label classification
    Bo Ke ,  Yunquan Zhu ,  Mengtian Li ,  Xiujun Shu ,  Ruizhi Qiao ,  and  Bo Ren
    In ECCV , 2022
  13. Hybridcr: Weakly-supervised 3d point cloud semantic segmentation via hybrid contrastive regularization
    Mengtian Li ,  Yuan Xie ,  Yunhang Shen ,  Bo Ke ,  Ruizhi Qiao ,  Bo Ren ,  Shaohui Lin ,  and  Lizhuang Ma
    In CVPR , 2022

Journal Articles

Journal Articles

  1. AniGaussian: Animatable Gaussian Avatar With Pose-Guided Deformation
    Mengtian Li ,  Shengxiang Yao ,  Kai Chen ,  Zhifeng Xie ,  and  Keyu Chen
    Computer Graphics Forum, Mar 2026
  2. Class-imbalanced semi-supervised learning for large-scale point cloud semantic segmentation via decoupling optimization
    Mengtian Li ,  Shaohui Lin ,  Zihan Wang ,  Yunhang Shen ,  Baochang Zhang ,  and  Lizhuang Ma
    Pattern Recognition, Mar 2024
  3. A fine-grained vision and language representation framework with graph-based fashion semantic knowledge
    Huiming Ding ,  Sen Wang ,  Zhifeng Xie ,  Mengtian Li* ,  and  Lizhuang Ma
    Computers & Graphics, Mar 2023
  4. Paying attention for adjacent areas: Learning discriminative features for large-scale 3D scene segmentation
    Mengtian Li ,  Yuan Xie ,  and  Lizhuang Ma
    Pattern Recognition, Mar 2022

Preprints & Others

Miscellaneous

  1. GaussianMorphing: Mesh-Guided 3D Gaussians for Semantic-Aware Object Morphing
    Mengtian Li ,  Yunshu Bai ,  Yimin Chu ,  Yijun Shen ,  Zhongmei Li ,  Weifeng Ge ,  Zhifeng Xie ,  and  Chaofeng Chen
    2025
  2. AvatarBrush: Monocular Reconstruction of Gaussian Avatars with Intuitive Local Editing
    Mengtian Li ,  Shengxiang Yao ,  Yichen Pan ,  Haiyao Xiao ,  Zhongmei Li ,  Zhifeng Xie ,  and  Keyu Chen
    2025
  3. Infinite Motion: Extended Motion Generation via Long Text Instructions
    Mengtian Li ,  Chengshuo Zhai ,  Shengxiang Yao ,  Zhifeng Xie ,  and  Keyu Chen
    2024
  4. GaussianBody: Clothed Human Reconstruction via 3d Gaussian Splatting
    Mengtian Li ,  Shengxiang Yao ,  Zhifeng Xie ,  and  Keyu Chen
    2024