I am currently a third-year Ph.D. student in the School of Computing, National University of Singapore, advised by Prof. Yang You. Before that, I obtained my bachelor’s and master’s degrees from Northwestern Polytechnical University, China, in 2019 and 2022, respectively. During my master’s study, I was fortunate to collaborate with Dr. Nian Liu, under the supervision of Prof. Junwei Han.

My research interests include efficient deep learning, dynamic neural networks, and multi-modal models. I have published more than 10 papers at top international AI conferences and journals.

If you are interested in collaborating on projects related to efficient deep learning or other promising research directions, you are welcome to email me (wangbo.zhao96@gmail.com).

Apart from research, I am an amateur track and field athlete, specializing in the 400 meters (PB 53.40) and 400-meter hurdles (PB 1:01.78).

🔥 News

  • 2024.09:  🎉🎉 One paper accepted to NeurIPS 2024.
  • 2024.07:  🎉🎉 One paper accepted to ECCV 2024.

📝 Publications

arXiv 2024

Dynamic diffusion transformer

Code

Wangbo Zhao, Yizeng Han, Jiasheng Tang, Kai Wang, Yibing Song, Gao Huang, Fan Wang, Yang You

  • We propose to dynamically adjust the computation of DiT across different timesteps and spatial locations of images. The computation of DiT-XL can be reduced by 50% without sacrificing generation quality.
NeurIPS 2024

Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation

Code

Wangbo Zhao, Jiasheng Tang, Yizeng Han, Yibing Song, Kai Wang, Gao Huang, Fan Wang, Yang You

  • We propose to adapt static ViT to dynamic ViT via parameter-efficient fine-tuning without full-parameter tuning.
ECCV 2024 (Oral)

MMBench: Is your multi-modal model an all-around player?

Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang, Wangbo Zhao, Yike Yuan, Jiaqi Wang, Conghui He, Ziwei Liu, Kai Chen, Dahua Lin

  • We propose MMBench, a bilingual benchmark for assessing the multi-modal capabilities of VLMs.
CVPR 2024

VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning

Ziyang Luo, Nian Liu, Wangbo Zhao, Xuguang Yang, Dingwen Zhang, Deng-Ping Fan, Fahad Khan, Junwei Han

  • We introduce VSCode, a generalist model with novel 2D prompt learning to jointly address four SOD tasks and three COD tasks.
ICCV 2023

Multi-grained temporal prototype learning for few-shot video object segmentation

Nian Liu, Kepan Nan, Wangbo Zhao, Yuanwei Liu, Xiwen Yao, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Junwei Han, Fahad Shahbaz Khan

  • We propose to leverage multi-grained temporal guidance information to handle the temporal correlation of video data in few-shot video object segmentation.
CVPR 2022

Modeling motion with multi-modal features for text-based video segmentation

Wangbo Zhao, Kai Wang, Xiangxiang Chu, Fuzhao Xue, Xinchao Wang, Yang You

  • We design a method to fuse and align appearance, motion, and linguistic features to achieve accurate text-based video segmentation.
ICCV 2021

Light field saliency detection with dual local graph learning and reciprocative guidance

Nian Liu, Wangbo Zhao, Dingwen Zhang, Junwei Han, Ling Shao

  • We introduce a reciprocative guidance scheme for light field saliency detection.
CVPR 2021

Weakly supervised video salient object detection

Wangbo Zhao, Jing Zhang, Long Li, Nick Barnes, Nian Liu, Junwei Han

  • We present the first weakly supervised video salient object detection model based on relabeled fixation-guided scribble annotations.

📖 Education

  • 2022.08 - 2026.06, Ph.D., School of Computing, National University of Singapore, Singapore.
  • 2019.09 - 2022.04, Master, School of Automation, Northwestern Polytechnical University, China.
  • 2017.07 - 2019.01, Undergraduate, Université de technologie de Troyes, France.
  • 2015.09 - 2019.06, Undergraduate, Honors College, Northwestern Polytechnical University, China.