publications

2025

  1. FilmComposer: LLM-Driven Music Production for Silent Film Clips
    Zhifeng Xie, Qile He, Youjia Zhu, Qiwei He, and Mengtian Li*
    In CVPR, 2025
  2. StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts
    Zhaoxing Gan, Mengtian Li*, Ruhua Chen, Zhongxia Ji, Sichen Guo, Huanling Hu, Guangnan Ye, and Zuo Hu
    In CVPR, 2025
  3. AniGaussian: Animatable Gaussian Avatar with Pose-guided Deformation
    Mengtian Li, Shengxiang Yao, Chen Kai, Zhifeng Xie, Keyu Chen, and Yu-Gang Jiang
    2025
  4. LMTalker: Sparse Landmark-guided Gaussian Splatting for High-fidelity Talking Head Synthesis
    Zhifeng Xie, Zhiwen Jiang, Xuemin Lei, and Mengtian Li*
    In ICASSP, 2025
  5. Knowledge Transfer Across Modalities for Weakly Supervised Point Cloud Semantic Segmentation
    Zihan Wang, Yunhang Shen, Mengtian Li, Ke Li, Xing Sun, Shaohui Lin, and Lizhuang Ma
    In ICASSP, 2025

2024

  1. ArtNVG: Content-Style Separated Artistic Neighboring-View Gaussian Stylization
    Zixiao Gu, Mengtian Li*, Ruhua Chen, Zhongxia Ji, Sichen Guo, Zhenye Zhang, Guangnan Ye, and Zuo Hu
    2024
  2. HieraFashDiff: Hierarchical Fashion Design with Multi-stage Diffusion Models
    Zhifeng Xie, Hao Li, Huiming Ding, Mengtian Li, Xinhan Di, and Ying Cao
    In AAAI, 2024
  3. Infinite Motion: Extended Motion Generation via Long Text Instructions
    Mengtian Li, Chengshuo Zhai, Shengxiang Yao, Zhifeng Xie, Keyu Chen, and Yu-Gang Jiang
    2024
  4. SonicVisionLM: Playing Sound with Vision Language Models
    Zhifeng Xie, Shengye Yu, Qile He, and Mengtian Li*
    In CVPR, 2024
  5. GaussianBody: Clothed Human Reconstruction via 3D Gaussian Splatting
    Mengtian Li, Shengxiang Yao, Zhifeng Xie, and Keyu Chen
    2024
  6. Class-imbalanced semi-supervised learning for large-scale point cloud semantic segmentation via decoupling optimization
    Mengtian Li, Shaohui Lin, Zihan Wang, Yunhang Shen, Baochang Zhang, and Lizhuang Ma
    Pattern Recognition, 2024

2023

  1. A fine-grained vision and language representation framework with graph-based fashion semantic knowledge
    Huiming Ding, Sen Wang, Zhifeng Xie, Mengtian Li*, and Lizhuang Ma
    Computers & Graphics, 2023

2022

  1. Hyperspherical learning in multi-label classification
    Bo Ke, Yunquan Zhu, Mengtian Li, Xiujun Shu, Ruizhi Qiao, and Bo Ren
    In ECCV, 2022
  2. HybridCR: Weakly-supervised 3D point cloud semantic segmentation via hybrid contrastive regularization
    Mengtian Li, Yuan Xie, Yunhang Shen, Bo Ke, Ruizhi Qiao, Bo Ren, Shaohui Lin, and Lizhuang Ma
    In CVPR, 2022
  3. Paying attention for adjacent areas: Learning discriminative features for large-scale 3D scene segmentation
    Mengtian Li, Yuan Xie, and Lizhuang Ma
    Pattern Recognition, 2022