3D scene understanding, reconstruction, and rendering from images and videos.
Animatable reconstruction of clothed humans and realistic digital human creation.
Computer vision and machine learning for pose estimation, tracking, and recognition.
Diffusion-based and GAN-based methods for 3D content generation and editing.
Vision-language models and multimodal large language models for visual understanding.
Full list of publications available on Google Scholar.
CVPR 2018
ACM MM 2014
Meta Superintelligence Labs, Meta Platforms, Inc.
Reality Labs Research, Meta Platforms, Inc.
University of California, Los Angeles (UCLA)