Zhihao Zhang (张志浩)

I am a first-year CS Ph.D. student focusing on 3D computer vision at Michigan State University (MSU), where my advisor is Prof. Xiaoming Liu. Before that, I graduated from Xi’an Jiaotong University (XJTU) with a Bachelor & Master degree in Computer Science, advised by Weizhan Zhang.

Prior to joining MSU, I interned at University of Illinois Urabana-Champaign (UIUC) with Prof. Yu-Xiong Wang.

Email  /  CV  /  Google Scholar  /  Twitter (X)  /  Github  / 

profile photo
Research

My research interests lie at the intersection of multi-modal learning and computer vision with the long-term goal of empowering computational models to better perceive and interact with the 3D visual world. Currently, I'm working on:

  • 3D shape understanding, 3D object detection for autonomous driving
  • Multi-modal learning for 3D perception

Publications
TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding
Zhihao Zhang*, Shengcao Cao* Yu-Xiong Wang
CVPR , 2024
Project Page / Code / arXiv

Introduce TriAdapter Multi-Modal Learning (TAMM), a novel two-stage learning approach based on three synergistic adapters to different modalities in the pre-training.

Tile Classification Based Viewport Prediction with Multi-modal Fusion Transformer
Zhihao Zhang*, Yiwei Chen*, Weizhan Zhang, Caixia Yan, Qinghua Zheng, Qi Wang, Wangdu Chen
ACM MM , 2023
Code / arXiv

Propose a tile classification based viewport prediction method with Multi-modal Fusion Transformer to improve the robustness of viewport prediction.

(* means equal contribution)

The website template was adapted from Ziqi Pang.