CrossFuse

About

Codes for CrossFuse: Learning Infrared and Visible Image Fusion by Cross-Sensor Top-K Vision Alignment and Beyond, IEEE Transactions on Circuits and Systems for Video Technology (T-CSVT), 2025.

-[Paper]

-[ArXiv]

Abstract

Infrared and visible image fusion (IVIF) is increasingly applied in critical fields such as video surveillance and autonomous driving systems. Significant progress has been made in deep learning-based fusion methods. However, these models frequently encounter out-of-distribution (OOD) scenes in real-world applications, which severely impact their performance and reliability. Therefore, addressing the challenge of OOD data is crucial for the safe deployment of these models in open-world environments. Unlike existing research, our focus is on the challenges posed by OOD data in real-world applications and on enhancing the robustness and generalization of models. In this paper, we propose an infrared-visible fusion framework based on Multi-View Augmentation. For external data augmentation, Top-k Selective Vision Alignment is employed to mitigate distribution shifts between datasets by performing RGB-wise transformations on visible images. This strategy effectively introduces augmented samples, enhancing the adaptability of the model to complex real-world scenarios. Additionally, for internal data augmentation, self-supervised learning is established using Weak-Aggressive Augmentation. This enables the model to learn more robust and general feature representations during the fusion process, thereby improving robustness and generalization. Extensive experiments demonstrate that the proposed method exhibits superior performance and robustness across various conditions and environments. Our approach significantly enhances the reliability and stability of IVIF tasks in practical applications.

Installation

- python == 3.8.12    torch == 1.9.0+cu111   torchaudio == 0.9.0   torchvision == 0.10.0+cu111
- opencv-python == 4.8.1.78
- scikit-image == 0.21.0
- scikit-learn == 1.3.1
- numpy == 1.24.4   Pillow == 9.5.0   matplotlib == 3.7.3
- scipy == 1.10.1
- tensorboard == 1.13.0
- tqdm == 4.66.1
- pytorch-msssim == 1.0.0

Dataset

Our training dataset can be downloaded from Google drive and placed in the folder './dataset/'. Our test sets can be downloaded from Google drive and placed in the folder './test_image/'.

CrossFuse

Network Architecture

1. Overall Framework

2. Top-k Selective Channel Alignment for External Data Consistency

3. Internal-View Augmentation for Self-supervised Learning

Qualitative results

1. Fusion results on MSRS dataset

2. Fusion results on RoadScene dataset

3. Fusion results on TNO dataset

Name		Name	Last commit message	Last commit date
Latest commit History 36 Commits
Figure		Figure
dataset		dataset
models		models
utils		utils
Comparison_MSRS_new.png		Comparison_MSRS_new.png
Comparison_RoadScene.png		Comparison_RoadScene.png
README.md		README.md
aug_operation.py		aug_operation.py
network.py		network.py
test.py		test.py
top_k_alignment.py		top_k_alignment.py
top_k_patch_dataprocessing.py		top_k_patch_dataprocessing.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CrossFuse

About

Contents

Abstract

Installation

Dataset

CrossFuse

Network Architecture

Qualitative results

Quantitative results

About

Uh oh!

Releases

Packages

Languages

CidanShi/CrossFuse

Folders and files

Latest commit

History

Repository files navigation

CrossFuse

About

Contents

Abstract

Installation

Dataset

CrossFuse

Network Architecture

Qualitative results

Quantitative results

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages