Junlong Ren* · Hao Wu* · Hui Xiong · Hao Wang✉
The Hong Kong University of Science and Technology (GuangZhou)
(* Equal Contribution)
The point clouds, text captions, and segmentation annotations are provided by Parts2Words. You can download the files here.
We also provide the pre-trained model weights in Google Drive.
# Train
CUDA_VISIBLE_DEVICES=0 python train.py --config config/SCA3D.yaml
# Eval
CUDA_VISIBLE_DEVICES=0 python val.py --config config/SCA3D.yamlIf you feel this project helpful to your research, please cite our work.
@article{ren2025sca3d,
title={SCA3D: Enhancing Cross-modal 3D Retrieval via 3D Shape and Caption Paired Data Augmentation},
author={Ren, Junlong and Wu, Hao and Xiong, Hui and Wang, Hao},
journal={arXiv preprint arXiv:2502.19128},
year={2025}
}- This work is built on Parts2Words and we borrow some codes from CoVR-ECDE. Thanks for these great works.

