Yue Wu(吴玥)

Email: wu.kathrina[at]gmail.com / Google Scholar / CV / Github /Linkedin /OpenReview

I am currently a researcher at AI Theory Lab of Huawei Noah's Ark Lab (Hong Kong), working closely with Dr. Zhenguo Li and Dr. Enze Xie . I obtained my Ph.D. degree at Computer Science and Engineering department of Hong Kong University of Science and Technology (HKUST) at June 2023 , supervised by Prof. Qifeng Chen . Prior to this, I got my Bachelor's degree from Wuhan University in 2018.

I have research experience in computational photography, image/video synthesis, 3D generative models, and neural rendering. And I'm conducting research in AIGC in 2D/3D/Video.

Always looking for research interns with strong CV/ML background, feel free to shoot an email if interested.



  • News (Aug 2023): A paper is accepted to SIGGRAPH Asia 2023.
  • News (Aug 2023): I joined the AI Theory Lab of Huawei Noah's Ark Lab (Hong Kong).
  • News (June 2023): I passed my Ph.D. defense!

profile photo
Work Experience

Researcher, Huawei Noah's Ark Lab, Aug.2023 - now.
Working closely with Dr. Zhenguo Li and Dr. Enze Xie .

Research Intern, SenseTime, Jul.2017 - Dec.2017.
Working with Dr. Wentao Liu and Dr. Chen Qian .

Research Intern, MSRA, Jan.2022 - May.2023.
Working with Dr. Jiaolong Yang , Dr. Fangyun Wei and Dr. Xin Tong in vision computing group.

Publications

* indicates joint authors


PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation.
Junsong Chen*, Chongjian Ge*, Enze Xie*†, Yue Wu*, Lewei Yao, Xiaozhe Ren, Zhongdao Wang, Ping Luo, Huchuan Lu, Zhenguo Li
Arxiv, 2024
[PDF] [Code] [机器之心]


PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models.
Junsong Chen, Yue Wu, Simian Luo, Enze Xie, Sayak Paul, Ping Luo, Hang Zhao, Zhenguo Li
Tech Report, 2024
[PDF]


PIXART-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis.
Junsong Chen*, Jincheng Yu*, Chongjian Ge*, Lewei Yao*, Enze Xie, Yue Wu, Zhongdao Wang, James Kwok, Ping Luo, Huchuan Lu, Zhenguo Li
ICLR Spotlight, 2024
[PDF] [Arxiv] [Project] [Code] [机器之心]

Editing Massive Concepts in Text-to-Image Diffusion Models.
Tianwei Xiong, Yue Wu, Enze Xie, Yue Wu, Zhenguo Li, Xihui Liu
Preprint, 2024
[Project]

Automatic Controllable Colorization by Imagination.
Xiaoyan Cong, Yue Wu, Qifeng Chen, Chenyang Lei
CVPR, 2024

Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation.
Shentong Mo, Enze Xie, Yue Wu, Junsong Chen, Matthias Nießner, Zhenguo Li
Preprint, 2023
[Project] [Arxiv]

Reasoning with Foundation Models: Concepts, Methodologies, and Outlook.
Jiankai Sun, Chuanyang Zheng, Enze Xie, Zhengying Liu, Ruihang Chu, Jianing Qiu, Jiaqi Xu, Mingyu Ding, Hongyang Li, Mengzhe Geng, Yue Wu, Wenhai Wang, Junsong Chen, Xiaozhe Ren, Jie Fu, Junxian He, Wu Yuan, Qi Liu, Xihui Liu, Yu Li, Hao Dong, Yu Cheng, Ming Zhang, Pheng Ann Heng, Jifeng Dai, Ping Luo, Jingdong Wang, Jirong Wen, Xipeng Qiu, Yike Guo, Hui Xiong, Qun Liu, and Zhenguo Li
Preprint, 2023
[Project] [Arxiv]

Drag-A-Video: Non-rigid Video Editing with Point-based Interaction.
Yao Teng, Enze Xie, Yue Wu , Haoyu Han, Zhenguo Li, Xihui Liu
Preprint, 2023
[Arxiv] [Project]

AniPortraitGAN: Animatable 3D Portrait Generation from 2D Image Collections.
Yue Wu*, Sicheng Xu*, Jianfeng Xiang, Fangyun Wei, Qifeng Chen, Jiaolong Yang, Xin Tong
SIGGRAPH Asia, 2023
[PDF] [Project] [Code] [新智元]

AniFaceGAN: Animatable 3D-Aware Face Image Generation for Video Avatars
Yue Wu, Yu Deng, Jiaolong Yang, Fangyun Wei, Qifeng Chen, Xin Tong
Neural Information Processing Systems (NeurIPS) (Spotlight), 2022
[PDF] [Project] [BibTeX] [Code]
Video Waterdrop Removal via Spatio-Temporal Fusion in Driving Scenes
Qiang Wen, Yue Wu, Qifeng Chen
IEEE International Conference on Robotics and Automation (ICRA), 2023
[PDF] [Code] [极市平台]
Improving Video Super-Resolution with Long-Term Self-Exemplars
Guotao Meng*, Yue Wu*, Qifeng Chen
IEEE International Conference on Robotics and Automation (ICRA), 2023
[arXiv]
Optimizing Video Prediction via Video Frame Interpolation
Yue Wu, Qiang Wen, Qifeng Chen
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
[PDF] [Project] [Code and Results] [BibTeX]
Embedding Novel Views in a Single JPEG Image
Yue Wu*, Guotao Meng*, Qifeng Chen
International Conference on Computer Vision (ICCV), 2021
[PDF] [arXiv] [Project] [BibTeX]
Towards Photorealistic Colorization by Imagination
Chenyang Lei *, Yue Wu*, Qifeng Chen
Preprint 2021,
[arXiv]
Future Video Synthesis with Object Motion Prediction
Yue Wu, Rongrong Gao, Jaesik Park, Qifeng Chen
Computer Vision and Pattern Recognition Conference (CVPR), 2020
[PDF] [arXiv] [Code and Result] [BibTeX]
Towards Multi-Person Pose Tracking: Bottom-up and Top-down Methods
Sheng Jin, Xujie Ma, Zhipeng Han, Yue Wu, Wei Yang, Wentao Liu, Chen Qian, Wanli Ouyang
International Conference on Computer Vision (ICCV Workshops), 2017
[PDF]

Ranked 2nd Places in ICCV Posetrack Challenge

Saliency map generation based on saccade target theory
Yue Wu, Zhenzhong Chen
ICME, 2017
[PDF]
Teaching

COMP5411: Computer Graphics, 2019
COMP2711H: Honors Discrete Mathematical Tools for Computer Science, 2020
COMP3511: Operating System, 2021
COMP5214: Advanced Deep Learning Architectures, 2022

Honors and Awards

HKUST RedBird Academic Excellence Award, HKUST
Postgraduate Scholarship, HKUST
National scholarship, Wuhan University
First-Class scholarship, Wuhan University
Best Head Movement Prediction Student Prize ICME Grand Challenge Salient360! 2017
Meritorious Winner Interdisciplinary Contest In Modeling (ICM) 2017

Academic Service

Reviewer for ECCV 2022, CVPR 2022/2023/2024, ICRA 2023, JMLR

Flag Counter