|
TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding
Zhihao Zhang*,
Shengcao Cao*,
Yu-Xiong Wang
CVPR, 2024
Project Page / Code / arXiv
Introduces TriAdapter Multi-Modal Learning (TAMM), a novel
two-stage learning approach based on three synergistic
adapters for the different modalities in pre-training.
|
|
Tile Classification Based Viewport Prediction with Multi-modal Fusion Transformer
Zhihao Zhang*,
Yiwei Chen*,
Weizhan Zhang,
Caixia Yan, Qinghua Zheng, Qi Wang, Wangdu Chen
ACM MM, 2023
Code / arXiv
Proposes a tile-classification-based viewport prediction method with a Multi-modal Fusion Transformer
to improve the robustness of viewport prediction.
|
(* denotes equal contribution)
|
|
The website template was adapted from Ziqi Pang.
|