FFP-300K: Scaling First-Frame Propagation for Generalizable Video Editing
Computer Vision · Multimodal Learning
Chengming Xu
Researcher at Youtu Lab, Tencent · Ph.D. in Data Science, Fudan University
I work on deep learning for computer vision under limited supervision. My recent interests include visual in-context learning, multimodal reasoning, and controllable generation.
Research interests: few-shot learning, visual in-context learning, vision-language models, and video generation/editing.