I am currently a Ph.D. candidate in the Computer Vision and Robotic Perception (CVRP) Laboratory at the National University of Singapore, under the supervision of A/P Gim Hee Lee.
Currently, I am interning at GenAI, Microsoft. I received my B.E. degree from Tianjin University in 2020.
My research interests lie in AIGC and generalizable computer vision systems. My current focus is controllable 3D and video generation.
I'm on the job market and looking for a Research Scientist/Engineer position starting in the summer of 2025. Feel free to reach out if you have any openings!
[November 2024] GenXD is released! A joint framework for general 3D and 4D generation!
[September 2024] I received the award for Outstanding Self-financed Students Abroad!
[September 2024] X-Ray is accepted to NeurIPS 2024 as a Spotlight!
[July 2024] TreeSBA is accepted to ECCV 2024!
[November 2023] Animate124 is released!
[September 2023] Two papers on visual domain generalization and parameter-efficient fine-tuning are accepted to IJCV!
[May 2023] Make-A-Protagonist is released! This is my first step into AIGC.
[January 2023] I received the Research Achievement Award from NUS!
[September 2022] Our AdvStyle is accepted to NeurIPS 2022!
The first framework for generic video editing with both visual and textual clues. Make-A-Protagonist can achieve background editing, protagonist editing, and text-to-video editing with the protagonist.
Extension of our ECCV 2022 paper (SHADE).
This paper applies SHADE to visual domain generalization tasks, including semantic segmentation with a Transformer backbone, image classification, and object detection.
We introduce a dual consistency learning framework for domain-generalized semantic segmentation and propose a style hallucination module to generate pairwise stylized samples.