Zhihao Zhang 张志浩

Ph.D. Student · UNC-Chapel Hill

I am a Ph.D. student in Computer Science at the University of North Carolina at Chapel Hill (UNC), advised by Prof. Xiaoming Liu. I spent the first two years of my Ph.D. at Michigan State University (MSU). Before my Ph.D., I received my B.Eng. and M.S. in Computer Science and Technology from Xi'an Jiaotong University (XJTU), under the supervision of Prof. Weizhan Zhang. I was also a research intern at the University of Illinois Urbana-Champaign (UIUC), hosted by Prof. Yu-Xiong Wang.

Research

My long-term goal is to build computational models that perceive, reason about, and predict the physical 3D world. I am broadly interested in 3D vision, spatial intelligence, and world models. Concretely, I work on:

3D Vision. Monocular 3D object detection, 3D shape understanding, and multi-modal 3D perception for autonomous driving and beyond.
Spatial Intelligence. Spatial reasoning, 3D grounding, and geometry-aware representations in vision-language models.
World Models. Predictive, action-conditioned modeling of 3D scenes toward embodied perception and interaction.

News

2026.04
Our paper MonoCoP was selected as a CVPR 2026 Highlight.
2026.02
Two first-author papers accepted to CVPR 2026.
2026.01
I will join TikTok as a Research Intern in summer 2026 — see you in San Jose!
2025.07
One paper accepted to ICCV 2025.
2024.08
Joined MSU as a Ph.D. student.
2024.06
Honored with the Outstanding Graduate Thesis Award at XJTU.
2024.02
One first-author paper accepted to CVPR 2024.
2023.11
Honored with the National Scholarship.

Selected Publications

Unleashing the Power of Chain-of-Prediction for Monocular 3D Object Detection

Zhihao Zhang, Abhinav Kumar, Girish Chandar Ganesan, Xiaoming Liu

Project / Code / Paper

CVPR 2026 (Highlight)

Towards Intrinsic-Aware Monocular 3D Object Detection

Zhihao Zhang, Abhinav Kumar, Xiaoming Liu

Project / Code / Paper

CVPR 2026

CHARM3R: Towards Unseen Camera Height Robust Monocular 3D Detector

Abhinav Kumar, Yuliang Guo, Zhihao Zhang, Xinyu Huang, Liu Ren, Xiaoming Liu

Project / Code / Paper

ICCV 2025

TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding

Zhihao Zhang*, Shengcao Cao*, Yu-Xiong Wang

Project / Code / Paper

CVPR 2024

Tile Classification Based Viewport Prediction with Multi-modal Fusion Transformer

Zhihao Zhang*, Yiwei Chen*, Weizhan Zhang, Caixia Yan, Qinghua Zheng, Qi Wang, Wangdu Chen

Project / Code / Paper

ACM MM 2023

* denotes equal contribution.

Education

May 2026 – Present Ph.D. in Computer Science, University of North Carolina at Chapel Hill, USA
Aug. 2024 – May 2026 Ph.D. in Computer Science, Michigan State University, USA
Sept. 2021 – Jun. 2024 M.S. in Computer Science and Technology, Xi'an Jiaotong University, China
Sept. 2017 – Jun. 2021 B.Eng. in Computer Science and Technology, Xi'an Jiaotong University, China (Qian Xuesen Honors College)
Aug. 2019 – Jan. 2020 Berkeley International Study Program, UC Berkeley, USA

Experience

May 2026 – Present Research Intern, TikTok, San Jose, CA, USA
Aug. 2024 – May 2026 Research Assistant, Michigan State University, East Lansing, MI, USA
Jul. 2023 – Feb. 2024 Research Intern, University of Illinois Urbana-Champaign, Urbana, IL, USA
Sept. 2021 – Jun. 2024 Research Assistant, Xi'an Jiaotong University, Xi'an, China

Awards & Honors

May 2026 Engineering Co-op Scholarship, Michigan State University
Jun. 2024 Outstanding Graduate Thesis, Xi'an Jiaotong University (Top 1% of all graduates)
Nov. 2023 National Scholarship, Ministry of Education, China (Top 0.1%)
Nov. 2023 Tencent Scholarship (Top 0.1%)
Nov. 2022 First Prize Scholarship, Xi'an Jiaotong University (Top 10%)
Nov. 2021 Outstanding Freshman Scholarship, Xi'an Jiaotong University (Top 10%)