Biography
I obtained my Ph.D. degree in the Department of Computer Science and Engineering at the Chinese University of Hong Kong, supervised by
Prof. Jiaya Jia and
Prof. Bei Yu. During my Ph.D., I enjoyed collaborating with academics at universities and research institutes (e.g.,
Prof. Chi-Wing Fu at CUHK,
Prof. Yingcong Chen at HKUST,
Prof. Hengshuang Zhao at HKU,
Prof. Philip H. S. Torr at Oxford,
Prof. Antonio Torralba at MIT,
Prof. Christian Theobalt and
Dr. Thomas Leimkuehler at the Max Planck Institute for Informatics), and with researchers in industry (
Dr. Jiangbo Lu and
Dr. Nianjuan Jiang at SmartMore,
Dr. Ning Xu at Adobe Research, and
Dr. Vibhav Vineet at Microsoft Research).
Before that, I obtained my B.E. degree in Information Engineering from the College of Information Science and Electronic Engineering, Zhejiang University.
Our current research focuses on the following topics:
1. Generative Computational Photography: large models and efficiency optimization;
2. Multi-modality data (image, video, 3D, etc.) generation & manipulation via AIGC;
3. Large multi-modality understanding models;
4. Security and alignment for large models/AGI.
If you are interested in working with me, please feel free to contact me by email.
News
- [9/2024]Two papers are accepted by NeurIPS2024
- [9/2024]One paper is accepted by EMNLP2024
- [9/2024]One paper is accepted by TVCG
- [7/2024]Three papers are accepted by ECCV2024
- [6/2024]We release the demo and code of Sagiri, a representative model that combines restoration and AIGC, especially for HDR, Project page.
- [6/2024]We release the demo and code of DepthAnything V2, a stronger open-world depth estimation model, Project page.
- [5/2024]We release several works about large models (Model Safety, Embodied AI, Federated LLMs, VLM)
- [4/2024]Two papers are accepted by ICML2024
- [3/2024]One paper is accepted by Transactions on Information Forensics & Security (CCF-A)
- [3/2024]In the 2024 NTIRE competition, we won fourth place in the Image Super Resolution (x4) track.
- [3/2024]Our work of virtual is incorporated into DAMO AI Platform of Alibaba
- [2/2024]Five papers are accepted by CVPR2024 (two papers are selected as Highlight)
- [1/2024]We release the demo and code of DepthAnything, Project page, Huggingface
- [12/2023]One paper is accepted by AAAI2024
- [12/2023]We release the demo and code of LucidDreamer, an excellent text-to-3D generation framework, Project page, Gradio Demo
- [09/2023]Two papers are accepted by NeurIPS2023
- [07/2023]Two papers are accepted by ICCV2023
- [05/2023]One paper is accepted by IJCV
- [04/2023]One paper is accepted by IJCAI2023
- [03/2023]Three papers are accepted by CVPR2023
- [11/2022]One paper is accepted by AAAI2023
- [07/2022]Two papers are accepted by ECCV2022
- [03/2022]One paper is accepted by CVPR2022
- [11/2021]One paper is accepted by AAAI2022
- [07/2021]Two papers are accepted by ICCV2021
- [03/2021]One paper is accepted by CVPR2021
Talks & Presentations
-
Gave a talk at Southeast University on "AIGC-based Image Restoration".
Oct. 2024.
-
Gave a talk at Zhejiang Lab on "Improve Model Robustness under Extreme Dark Environments".
Sep. 2024.
-
Co-hosted a workshop at ChinaSys on "AI System Building for Large Models".
June 2024.
-
Gave a talk at Nanjing University of Aeronautics and Astronautics on "Transferrable Adversarial Attacks".
June 2024.
-
Oral presentation about future media technology at Huawei STW conference (Shenzhen).
May. 2024.
-
Invited poster presentation at VALSE2024 on "Boosting Image Restoration via Priors from Pre-trained Models".
May. 2024.
-
Invited talk at China3DV with the topic of "Efficient 3D Modeling for Data with Real-world Degradations".
Apr. 2024.
-
Gave a talk at Nankai University on "AIGC for Computational Photography in the RAW Domain".
Apr. 2024.
-
Invited talk to Responsible AI team at ByteDance, on "Responsible LLM and AIGC".
Apr. 2024.
-
Selected for the "Young Talent Nurturing Project at Zhejiang Lab (之江青年人才托举)" for Large Models (大模型).
Mar. 2024.
-
Organizer at GAMES Webinar on "Multi-view Synthesis and 3D Shape Completion via Diffusion Models", [
News].
Mar. 2024.
-
Presentation at [
Shining 3D] on "High-quality 3D Reconstruction and Generation".
Jan. 2024.
-
Gave a talk at Alibaba International Digital Commerce (AIDC), "Intelligent Generation and Restoration".
Dec. 2023.
-
Presentation for Huawei Central Media Research Institute, "Multi-Modality Low-Light Data Enhancement".
Oct. 2023.
-
Invited talk at VALSE Webinar on "LLIE via Structure Modeling and Guidance", [
News].
Sep. 2023.
-
Organized a nationwide academic meeting in Hangzhou on "Intelligent Computing and Security".
Sep. 2023.
-
Gave a talk at Zhejiang University, "Reliable Artificial Intelligence for Image Generation and Manipulation".
July 2023.
-
Presentation for CVLab at ETH, "Multi-Modality Restoration".
July 2023.
-
Invited to give a talk at HKUST, "Effective Generative Models for Real-World Manipulation and Restoration".
July 2023.
-
Invitation from JiQiZhiXin (机器之心): "White Paper for Security and Privacy of Large Generative Model", [
News]
June 2023.
-
Gave a talk at Alibaba DAMO Academy on "Real-world Generation for 2D and 3D Data".
May. 2023.
-
Awarded the "Science Fund Program for Excellent Young Scientists at Zhejiang Lab (之江优秀青年科学基金)".
Mar. 2023.
-
AI TIME Personal Talk: "Deep Parametric 3D Filters for Multiple Degradations Restoration".
Mar. 2023.
-
AI TIME ECCV 2022: "Multi‑Task Learning via Transformer and Cross‑Task Reasoning".
Dec. 2022.
Research Summary
Multi-modality Generation
Multi-modality Restoration
Multi-modality Understanding
Image Generation
Video Generation
3D Generation
Image Restoration
Video Restoration
3D Restoration/Reconstruction
Understanding Accuracy
Understanding Robustness
Understanding for Anomaly
Technical Report
*: equal contribution, #: corresponding author
Full Reports in 2024
Full Reports in 2023
Full Reports in 2022
Selected Publications
-
Hawk: Learning to Understand Open-World Video Anomalies
Jiaqi Tang, Hao Lu, Ruizheng Wu, Xiaogang Xu, Ke Ma, Cheng Fang, Bin Guo, Jiangbo Lu, Qifeng Chen, Ying-Cong Chen.
Conference on Neural Information Processing Systems (NeurIPS), 2024, (acceptance rate 25.8%).
[Paper]
[Code]
-
Depth Anything V2
Lihe Yang, Bingyi Kang, Zilong Huang, Zhen Zhao, Xiaogang Xu, Jiashi Feng, Hengshuang Zhao.
Conference on Neural Information Processing Systems (NeurIPS), 2024, (acceptance rate 25.8%).
[Paper]
[Code]
-
An Incremental Unified Framework for Small Defect Inspection
Jiaqi Tang, Hao Lu, Xiaogang Xu, Ruizheng Wu, Sixing Hu, Tong Zhang, Tsz Wa Cheng, Ming Ge, Yingcong Chen, Fugee Tsung.
European Conference on Computer Vision (ECCV), 2024, (acceptance rate 18% (2395/12600)).
[Paper]
[Code]
-
Refine, Discriminate and Align: Stealing Encoders via Sample-Wise Prototypes and Multi-Relational Extraction
Shuchi Wu, Chuan Ma, Kang Wei, Xiaogang Xu, Ming Ding, Yuwen Qian, Di Xiao, Tao Xiang.
European Conference on Computer Vision (ECCV), 2024, (acceptance rate 18% (2395/12600)).
[Paper]
[Code]
Full Publications
Publications in 2024
-
Hawk: Learning to Understand Open-World Video Anomalies
Jiaqi Tang, Hao Lu, Ruizheng Wu, Xiaogang Xu, Ke Ma, Cheng Fang, Bin Guo, Jiangbo Lu, Qifeng Chen, Ying-Cong Chen.
Conference on Neural Information Processing Systems (NeurIPS), 2024, (acceptance rate 25.8%).
[Paper]
[Code]
-
Depth Anything V2
Lihe Yang, Bingyi Kang, Zilong Huang, Zhen Zhao, Xiaogang Xu, Jiashi Feng, Hengshuang Zhao.
Conference on Neural Information Processing Systems (NeurIPS), 2024, (acceptance rate 25.8%).
[Paper]
[Code]
-
An Incremental Unified Framework for Small Defect Inspection
Jiaqi Tang, Hao Lu, Xiaogang Xu, Ruizheng Wu, Sixing Hu, Tong Zhang, Tsz Wa Cheng, Ming Ge, Yingcong Chen, Fugee Tsung.
European Conference on Computer Vision (ECCV), 2024, (acceptance rate 18% (2395/12600)).
[Paper]
[Code]
-
Refine, Discriminate and Align: Stealing Encoders via Sample-Wise Prototypes and Multi-Relational Extraction
Shuchi Wu, Chuan Ma, Kang Wei, Xiaogang Xu, Ming Ding, Yuwen Qian, Di Xiao, Tao Xiang.
European Conference on Computer Vision (ECCV), 2024, (acceptance rate 18% (2395/12600)).
[Paper]
[Code]
Publications in 2023
Publications in 2022
Publications in 2021
Publications in 2020
Publications in 2019
Publications in 2018
Publications in 2017
Intern Experiences
-
Feb. 2020 – June 2022
Research Intern
Advisor: Jiangbo Lu and Nianjuan Jiang
Topic: low-light image/video enhancement and image/video denoising
-
2021 – 2022
Research Intern
Advisor: Vibhav Vineet
Topic: Universal Vision System
-
2020 – 2021
Research Intern
Advisor: Ning Xu
Topic: Scene‑graph‑based Image Creation
-
2021
Visiting Researcher
Advisor: Philip Torr
Topic: Universal Adaptive Data Augmentation and Generative Models for Adversarial Robustness
-
Dec. 2018 – Feb. 2020
Research Intern
Advisor: Xin Tao and Xiaoyong Shen
Topic: deep learning for image manipulation
-
June 2018 – Sep. 2018
Research Intern
Advisor: Yilun Wang
Topic: high-accuracy segmentation model for open-world roads
-
Apr. 2018 – June 2018
Research Intern
Advisor: Mingyang Li
Topic: Multi-modality Retrieval for the Tmall Genie System
-
Sep. 2017 – Apr. 2018
Undergraduate Research Assistant
Advisor: Shouling Ji
Topic: adversarial CAPTCHAs
-
Oct. 2017 – Mar. 2018
Research Intern
Advisor: Hanqing Jiang and Guofeng Zhang
Topic: video depth estimation
-
July 2017 – Oct. 2017
Visiting Researcher
Advisor: Yangqiu Song and Huan Zhao
Topic: machine learning for complex graph algorithms
Professional Activities
- Conference Reviewer:
IEEE Conference on Computer Vision and Pattern Recognition (CVPR'18-24, CCF-A).
IEEE International Conference on Computer Vision (ICCV'19-23, CCF-A).
European Conference on Computer Vision (ECCV'20-24, CCF-B).
SIGGRAPH and SIGGRAPH Asia (23-24, CCF-A).
Neural Information Processing Systems (NeurIPS'19-23, CCF-A).
International Conference on Learning Representations (ICLR'20-24).
AAAI Conference on Artificial Intelligence (AAAI'20-24, CCF-A).
International Conference on Machine Learning (ICML'22-24, CCF-A).
IEEE Winter Conference on Applications of Computer Vision (WACV'21-24).
Asian Conference on Computer Vision (ACCV'22, CCF-C).
European Conference on Artificial Intelligence (ECAI'24, CCF-B).
Chinese Conference on Pattern Recognition and Computer Vision (PRCV'24, CCF-C).
- Journal Reviewer:
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, CCF-A).
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT, CCF-B).
IEEE Transactions on Visualization and Computer Graphics (TVCG, CCF-A).
IEEE Transactions on Multimedia (TMM, CCF-B).
IEEE Transactions on Instrumentation and Measurement (TIM, JCR-Q1).
IEEE Transactions on Neural Networks and Learning Systems (TNNLS, CCF-B).
IEEE Signal Processing Letters (SPL, CCF-C).
Computer Vision and Image Understanding (CVIU, CCF-B).
Neural Processing Letters (CCF-C).
International Journal of Computer Vision (IJCV, CCF-A).
International Journal of Human-Computer Interaction (IJHC, CCF-B).
Neurocomputing (CCF-C).
Pattern Recognition (PR, CCF-B).
Knowledge-Based Systems (KBS, CCF-C).
Neural Networks (CCF-B).
IET Image Processing (CCF-C).
Journal of Computer-Aided Design & Computer Graphics (计算机辅助设计与图形学学报, CCF Chinese-language Class A).
ACM Transactions on Multimedia Computing Communications and Applications (TOMM, CCF-B).
- Program Committee:
AAAI Conference on Artificial Intelligence (AAAI'23-24).
Honors & Awards
-
2024
-
Large Model Safety Risk Guardrail Theory and Key Technologies (Major Project of the Zhejiang Provincial Natural Science Foundation, 浙江省自然科学基金重大项目, 1,000,000 CNY)
2024
-
Core member of the Zhejiang KunPeng Project (鲲鹏计划, the most prestigious research funding program in Zhejiang Province)
2023
-
Science Fund Program for Excellent Young Scientists at Zhejiang Lab (之江优秀青年科学基金, 1,000,000 CNY)
2023
-
2021
-
2018
-
Hong Kong PhD Fellowship
2018
-
Outstanding Final-Year Project, ZJU
2018
-
Outstanding Graduate, ZJU
2018
-
National Scholarship, Ministry of Education of P.R. China
2015
-
Outstanding Student title, ZJU
2015/16/17
-
Scholarship for Excellence in Research and Innovation, ZJU
2016/17
-
Zhejiang Provincial Government Scholarship
2016
-
China Undergraduate Mathematical Contest in Modeling, National Second Prize
2016
-
Mathematical Contest in Modeling (Honorable Mention), COMAP (U.S.A)
2016
Patents
- CN, "A method and system for generating text-based adversarial captchas via adding noise in the frequency domain" (一种基于频域加噪的字符对抗验证码生成方法和系统).
- CN, "A method and system for generating image-based adversarial captchas via adversarial learning" (一种基于对抗学习的图像对抗验证码生成方法和系统).
- CN, "An efficient inverse pipeline from Log video to RAW video".
- CN, "A noise-consistency-based collection strategy for video denoising pairs".
- CN, "A text-guided image manipulation system via feature alignment".
Teaching
-
ENGG1110: Problem Solving by Programming
Fall, 2018-2019 in CUHK
-
CSCI4190: Introduction to Social Networks
Spring, 2018-2019 in CUHK
-
ENGG1110: Problem Solving by Programming
Fall, 2019-2020 in CUHK
-
CSCI3310: Mobile Computing & Application Development
Spring, 2019-2020 in CUHK
-
ENGG1110: Problem Solving by Programming
Fall, 2020-2021 in CUHK
© Xiaogang Xu | Last updated: 1/3/2024