
ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo (ICME 2025)



πŸ“Œ Introduction

This repository contains the official implementation of ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo.

πŸš€ Pipeline

(Pipeline overview figure)

πŸ”§ Setup

1.1 Requirements

Use the following commands to build the conda environment.

conda create -n icgmvsnet python=3.10.8
conda activate icgmvsnet
pip install -r requirements.txt
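
As an optional sanity check, you can verify that PyTorch was installed with CUDA support before moving on:

python -c "import torch; print(torch.__version__, torch.cuda.is_available())"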

1.2 Datasets

Download the following datasets and modify the corresponding local paths in scripts/data_path.sh.
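
For reference, a minimal scripts/data_path.sh could look like the sketch below; the variable names are hypothetical placeholders, so match them to whatever the repo's scripts actually read:

# Hypothetical variable names -- adapt to the actual scripts/data_path.sh.
DTU_TRAIN_ROOT="/path/to/dtu_training"
DTU_TEST_ROOT="/path/to/dtu_test"
BLD_ROOT="/path/to/blendedmvs"
TNT_ROOT="/path/to/tanksandtemples"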

DTU Dataset

Training data. We use the same DTU training data as MVSNet and CasMVSNet; please refer to DTU training data and Depth raw for download. You should also download the Rectified raw data if you want to train the model at raw image resolution. Unzip and organize them as:

dtu_training/
β”œβ”€β”€ Cameras
β”œβ”€β”€ Depths
β”œβ”€β”€ Depths_raw
β”œβ”€β”€ Rectified
└── Rectified_raw (optional)

Testing data. Download the DTU testing data and unzip it as:

dtu_test/
β”œβ”€β”€ scan1
β”œβ”€β”€ scan4
β”œβ”€β”€ ...

BlendedMVS Dataset

Download the low-resolution version of the BlendedMVS dataset and unzip it as:

blendedmvs/
└── dataset_low_res
    β”œβ”€β”€ ...
    └── 5c34529873a8df509ae57b58

Tanks and Temples Dataset

Download the intermediate and advanced subsets of the Tanks and Temples dataset. We use the camera parameters of the short depth range version; you can download the processed data here and rename cams_1 to cams (see the sketch after the directory layout below).

tanksandtemples/
β”œβ”€β”€ advanced
β”‚   β”œβ”€β”€ ...
β”‚   └── Temple
β”‚       β”œβ”€β”€ cams
β”‚       β”œβ”€β”€ images
β”‚       β”œβ”€β”€ pair.txt
β”‚       └── Temple.log
└── intermediate
    β”œβ”€β”€ ...
    └── Train
        β”œβ”€β”€ cams
        β”œβ”€β”€ cams_train
        β”œβ”€β”€ images
        β”œβ”€β”€ pair.txt
        └── Train.log
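
A minimal sketch of the cams_1 to cams rename, assuming the processed cameras were unpacked as a cams_1 folder inside each scene directory (cams_long is a hypothetical backup name):

for scene in tanksandtemples/intermediate/* tanksandtemples/advanced/*; do
  if [ -d "$scene/cams_1" ]; then
    # Back up any existing cameras, then promote the short-range set.
    [ -d "$scene/cams" ] && mv "$scene/cams" "$scene/cams_long"
    mv "$scene/cams_1" "$scene/cams"
  fi
done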

🧠 Training

You can train ICG-MVSNet from scratch on the DTU dataset and then fine-tune it on the BlendedMVS dataset. Make sure to set the dataset paths in scripts/data_path.sh before running training or testing.

2.1 DTU

To train ICG-MVSNet on the DTU dataset, refer to scripts/dtu/train_dtu.sh and run:

bash scripts/dtu/train_dtu.sh exp_name
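
To pin training to specific GPUs, you can prefix the launcher with the standard CUDA environment variable (exp_name is your chosen experiment label):

CUDA_VISIBLE_DEVICES=0,1 bash scripts/dtu/train_dtu.sh exp_name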

2.2 BlendedMVS

To fine-tune the model on the BlendedMVS dataset, refer to scripts/blend/train_bld_ft.sh, specify THISNAME and BLD_CKPT_FILE in the script (a hypothetical example follows the command), and run:

bash scripts/blend/train_bld_ft.sh exp_name
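
BLD_CKPT_FILE should point to the checkpoint produced by the DTU training stage. A hypothetical example, with the path depending on where your DTU run stored checkpoints:

THISNAME="bld_ft"                                        # hypothetical experiment name
BLD_CKPT_FILE="/path/to/checkpoints/exp_name/model.ckpt" # hypothetical checkpoint path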

πŸ“Š Testing

3.1 DTU

For DTU testing, we use the model trained on the DTU training set. You can perform depth map estimation, point cloud fusion, and result evaluation with the following steps.

  1. Depth map estimation and point cloud fusion. Run:
bash scripts/dtu/test_dtu.sh exp_name
  2. Download the ObsMask and Points of the DTU ground-truth point clouds from the official website and organize them as:
evaluation/
    β”œβ”€β”€ ObsMask
    └── Points
  3. Result evaluation. Set up Matlab in command-line mode and run bash scripts/dtu/matlab_quan_dtu.sh. You can adjust the num_at_once config according to your machine's CPU and memory limits. After quantitative evaluation, the quantitative results are stored in [FUSION_METHOD]_quantitative/ and [THISNAME].log. A sketch of the headless Matlab invocation follows this list.
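
For reference, a headless Matlab invocation of the official DTU evaluation code usually looks like the sketch below; scripts/dtu/matlab_quan_dtu.sh presumably wraps something similar, and the evaluation-code path is a placeholder:

cd /path/to/dtu_evaluation_code    # official DTU Matlab evaluation code
matlab -nodisplay -nosplash -r "BaseEvalMain_web; exit"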

3.2 Tanks and Temples

For testing on the Tanks and Temples benchmark, you can use any of the following configurations:

  • Train only on the DTU training set.
  • Train only on the BlendedMVS dataset.
  • Pretrain on the DTU training set and fine-tune on BlendedMVS. (Recommended)

After training, please follow these steps:

  1. To generate point cloud results, run:
bash scripts/tnt/test_tnt_inter.sh exp_name
bash scripts/tnt/test_tnt_adv.sh exp_name
  2. Follow the Upload Instructions on the Tanks and Temples official website to make online submissions.

3.3 Custom Data

ICG-MVSNet can also reconstruct scenes from custom data. You can refer to MVSNet to organize your data (a layout sketch follows the command below), and run:

bash scripts/custom/test_custom.sh exp_name
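
For convenience, an MVSNet-style layout looks roughly like this; the file names follow the MVSNet convention, and the annotations summarize the expected contents:

custom_scan/
β”œβ”€β”€ images
β”‚   β”œβ”€β”€ 00000000.jpg
β”‚   └── ...
β”œβ”€β”€ cams
β”‚   β”œβ”€β”€ 00000000_cam.txt   (4x4 extrinsic, 3x3 intrinsic, depth_min and depth_interval)
β”‚   └── ...
└── pair.txt               (for each reference view: its id and the ranked source views with scores)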

🎯 Results

Qualitative Results

(Qualitative results figure)

Quantitative Results

Our results on the DTU and Tanks and Temples (T&T) datasets are listed in the tables below.

| DTU | Acc. ↓ | Comp. ↓ | Overall ↓ |
| --- | --- | --- | --- |
| Ours | 0.327 | 0.251 | 0.289 |

| T&T (Intermediate) | Mean ↑ | Family | Francis | Horse | Lighthouse | M60 | Panther | Playground | Train |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Ours | 65.53 | 81.73 | 68.92 | 56.59 | 66.10 | 64.86 | 64.41 | 62.33 | 59.26 |

You can download point clouds here.

πŸ”— Citation

If you find this work useful in your research, please consider citing:

@inproceedings{hu2025icg,
  title={ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo},
  author={Hu, Yuxi and Zhang, Jun and Zhang, Zhe and Weilharter, Rafael and Rao, Yuchen and Chen, Kuangyi and Yuan, Runze and Fraundorfer, Friedrich},
  booktitle={IEEE International Conference on Multimedia and Expo (ICME)},
  year={2025}
}

❀️ Acknowledgements

This repository builds upon the great work of prior open-source MVS projects.

We sincerely thank the authors for their contributions to the MVS community.
