About
I am a fourth-year Ph.D. candidate at The Hong Kong Polytechnic University (PolyU), supervised by Prof. Jiannong Cao and by Prof. Yuhui Shi of Southern University of Science and Technology (SUSTech). During my Ph.D. studies, I have collaborated closely with Dr. Pichao Wang from NVIDIA.
I received my B.E. in Computer Science and Technology from SUSTech in 2021, where I graduated in the top 10% of my class. I was also an exchange student at the University of Wisconsin-Madison (2020).
My current research focuses on Video Generation, Video Editing, World Models, and 3D Vision, with a particular emphasis on diffusion models and their applications in visual content generation and manipulation.
I am actively seeking internship and full-time positions starting in 2026.
Selected Publications
Refacade: Editing Object with Given Reference Texture
Youze Huang†, Penghui Ruan†, Bojia Zi†, Xianbiao Qi, Jianan Wang, Rong Xiao
Under Review, Equal contribution
Ctrl&Shift: High-Quality Geometry-Aware Object Manipulation in Visual Generation
Penghui Ruan, Bojia Zi, Youze Huang, Pichao Wang, Xianbiao Qi, Rong Xiao, Jiannong Cao, Yuhui Shi
Under Review
JDM: Joint Distribution Modeling for Fine-Grained Text-to-Video Generation
Penghui Ruan†, Bojia Zi†, Youze Huang, Pichao Wang, Xianbiao Qi, Rong Xiao, Jiannong Cao, Yuhui Shi
Under Review, Equal contribution
Señorita-2M: A High-Quality Instruction-Based Dataset for General Video Editing by Video Specialists
Bojia Zi†, Penghui Ruan†, Xianbiao Qi, Shaozhe Hao, Shihao Zhao, Youze Huang, Bin Liang, Rong Xiao, Kam-Fai Wong
Advances in Neural Information Processing Systems (NeurIPS) 2025
Accepted, Equal contribution
Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning
Penghui Ruan, Pichao Wang, Divya Saxena, Jiannong Cao, Yuhui Shi
Advances in Neural Information Processing Systems (NeurIPS) 2024
Accepted
Professional Experience
Applied Scientist Intern
Oct 2025 - Present, Amazon
Researching and developing foundational models for high-quality movie dubbing with precise lip synchronization and video preservation.
AIGC Research Intern
Dec 2024 - Oct 2025, IntelliFusion Inc.
Built and trained large-scale T2V and video-editing models with fine-grained control over content via textual conditioning and explicit 3D camera control. Spearheaded the creation of a large-scale, multi-task, instruction-based video editing dataset.
Software Engineer Intern
Apr 2021 - Jun 2021, Tencent
Migrated the sensitive-word filtering service from HTTPS to TRPC, improving efficiency and security. Contributed to the Routing Service for video search.
Education
Ph.D. in Computer Science
Sep 2022 - Aug 2026 (Expected), The Hong Kong Polytechnic University
Specializing in Text-to-Video Generation, Video Editing, and 3D Vision. Advisors: Prof. Jiannong Cao and Prof. Yuhui Shi.
B.E. in Computer Science and Technology
Sep 2017 - Jun 2021, Southern University of Science and Technology
GPA: 3.74/4.00 (Top 10%)
Exchange Student
Jan 2020 - Sep 2020, University of Wisconsin-Madison
GPA: 3.83/4.00 (Top 5%)
