Chengming Xu

Computer Vision · Multimodal Learning

Chengming Xu

Researcher at Youtu Lab, Tencent · Ph.D. in Data Science, Fudan University

I work on deep learning for computer vision with limited supervision, with recent interests in visual in-context learning, multimodal reasoning, and controllable generation.

Research interests: few-shot learning, visual in-context learning, vision-language models, and video generation/editing.

Recent Publications

View full list →

FFP-300K: Scaling First-Frame Propagation for Generalizable Video Editing

CVPR 2026

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models

CVPR 2026

Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow

ICLR 2026

Towards Reliable and Holistic Visual In-Context Learning Prompt Selection

NeurIPS 2025