Yuyang Zhao
I am currently a Ph.D. candidate in the Computer Vision and Robotic Perception (CVRP) Laboratory at the National University of Singapore, under the supervision of A/P Gim Hee Lee. I received my B.E. degree from Tianjin University in 2020.
My research interests lie in AIGC and generalizable computer vision systems. Currently, I am working on controllable 3D and video generation.
[November 2023] Animate124 is released!
[September 2023] Two papers about visual domain generalization and parameter-efficient fine-tuning are accepted to IJCV!
[May 2023] Make-A-Protagonist is released! This is my first step into AIGC.
[January 2023] I received the Research Achievement Award from NUS!
[September 2022] Our AdvStyle is accepted to NeurIPS 2022!
[July 2022] I received the Outstanding Reviewer Award at ICML 2022 (Top 10%)!
[July 2022] One paper about domain generalized semantic segmentation is accepted to ECCV 2022!
[May 2022] One paper about open compound domain adaptation is accepted to IEEE TCSVT!
[March 2022] One paper about novel class discovery is accepted to CVPR 2022!
[November 2021] One paper about optical flow estimation is accepted to Neurocomputing.
[September 2021] One paper about optical flow estimation is accepted to Signal Processing: Image Communication.
[March 2021] One paper about domain generalized person re-identification is accepted to CVPR 2021!
Animate124: Animating One Image to 4D Dynamic Scene
Yuyang Zhao,
Zhiwen Yan,
Enze Xie,
Lanqing Hong,
Zhenguo Li,
Gim Hee Lee
arXiv
The first work to animate a single in-the-wild image into 3D video through textual motion descriptions.
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
Yuyang Zhao,
Enze Xie,
Lanqing Hong,
Zhenguo Li,
Gim Hee Lee
arXiv
The first framework for generic video editing with both visual and textual clues. Make-A-Protagonist can achieve background editing, protagonist editing, and text-to-video editing with the protagonist.
Synthetic-to-Real Domain Generalized Semantic Segmentation for 3D Indoor Point Clouds
Yuyang Zhao,
Na Zhao,
Gim Hee Lee
arXiv
PDF /
Code (Coming Soon!)
The first work on domain generalized semantic segmentation in 3D indoor scenes.
Revisit Parameter-Efficient Transfer Learning: A Two-Stage Paradigm
Henry Hengyuan Zhao,
Hao Luo,
Yuyang Zhao,
Pichao Wang,
Fan Wang,
Mike Zheng Shou
arXiv, 2023
Style-Hallucinated Dual Consistency Learning: A Unified Framework for Visual Domain Generalization
Yuyang Zhao,
Zhun Zhong,
Na Zhao,
Nicu Sebe,
Gim Hee Lee
IJCV, 2023
Extension of our ECCV 2022 paper (SHADE).
This paper applies SHADE to visual domain generalization tasks, including semantic segmentation with Transformer backbone, image classification, and object detection.
SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels
Henry Hengyuan Zhao,
Pichao Wang,
Yuyang Zhao,
Hao Luo,
Fan Wang,
Mike Zheng Shou
IJCV, 2023
Two Heads Are Better Than One: Improving Fake News Video Detection by Correlating with Neighbors
Peng Qi,
Yuyang Zhao,
Yufeng Shen,
Wei Ji,
Juan Cao,
Tat-Seng Chua
ACL Findings, 2023
Adversarial Style Augmentation for Domain Generalized Urban-Scene Segmentation
Zhun Zhong*,
Yuyang Zhao*,
Gim Hee Lee,
Nicu Sebe
(* Equal contribution)
NeurIPS, 2022
PDF /
Code
AdvStyle adversarially changes the channel-wise mean and standard deviation to diversify source samples.
Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation
Yuyang Zhao,
Zhun Zhong,
Na Zhao,
Nicu Sebe,
Gim Hee Lee
ECCV, 2022
We introduce a dual consistency learning framework for domain generalized semantic segmentation, and propose a style hallucination module to generate pair-wise stylized samples.
Novel Class Discovery in Semantic Segmentation
Yuyang Zhao,
Zhun Zhong,
Nicu Sebe,
Gim Hee Lee
CVPR, 2022
The first work focusing on novel class discovery in semantic segmentation. This work addresses the co-occurrence of base, novel, and background classes.
Source-Free Open Compound Domain Adaptation in Semantic Segmentation
Yuyang Zhao*,
Zhun Zhong*,
Zhiming Luo,
Gim Hee Lee,
Nicu Sebe
(* Equal contribution)
IEEE Transactions on Circuits and Systems for Video Technology, 2022
PDF /
Code
Learning to Generalize Unseen Domains via Memory-based Multi-Source Meta-Learning for Person Re-Identification
Yuyang Zhao*,
Zhun Zhong*,
Fengxiang Yang,
Zhiming Luo,
Shaozi Li,
Nicu Sebe
(* Equal contribution)
CVPR, 2021
PDF /
Code
FPCR-Net: Feature Pyramidal Correlation and Residual Reconstruction for Optical Flow Estimation
Xiaolin Song,
Yuyang Zhao,
Jingyu Yang,
Cuiling Lan,
Wenjun Zeng
Neurocomputing, 2021
PDF
Spatio-temporal Context-aware Optical Flow Estimation
Xiaolin Song,
Yuyang Zhao,
Jingyu Yang
Signal Processing: Image Communication, 2021
PDF
Program Committee / Conference Reviewer: CVPR, ICCV, ECCV, ICML, NeurIPS, ICLR
Journal Reviewer: IJCV, IEEE TPAMI, IEEE TMM, IEEE TCSVT
Research Achievement Award, National University of Singapore, 2023
Outstanding Reviewer Award, ICML, 2022
Research Scholarship, National University of Singapore, 2021