This paper proposes a novel method called Textual-Visual Interaction for Enhanced Single Image Deraining using Adapter-Tuned VLMs (TVI-Derain). By leveraging the extensive textual knowledge in pretrained vision-language models (VLMs), we aim to improve the performance of single image deraining. To address the gap between VLMs and the restoration model, we introduce textual-aware intra-layer (TaIl) adapters that adapt the features of downstream data by capturing task-specific knowledge. Furthermore, a textual-visual feature interaction (TVI) module is designed to bridge the gap between textual and visual features, enabling reliable interaction. The proposed cross-attention feature interaction (CAFI) block within the TVI module effectively represents the interactive features. Semantic and degradation textual prompts are integrated as inputs to the text encoder to mitigate the semantic disconnection arising from degraded samples. Extensive experiments on benchmark datasets demonstrate that our method outperforms competitive methods, showcasing its potential for applications such as automotive vision systems and surveillance.
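The released code contains the full implementation; as a rough illustration of the ideas above only, below is a minimal PyTorch sketch of a bottleneck-style TaIl adapter and a CAFI-style cross-attention block that lets visual tokens attend to textual prompt features. All module names, dimensions, and the exact wiring here are assumptions for illustration, not the paper's actual architecture.

```python
# Minimal sketch (NOT the released implementation). Dimensions, names,
# and wiring are illustrative assumptions.
import torch
import torch.nn as nn

class TaIlAdapter(nn.Module):
    """Bottleneck adapter inserted alongside a frozen VLM layer (assumed design)."""
    def __init__(self, dim: int, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(dim, bottleneck)
        self.act = nn.GELU()
        self.up = nn.Linear(bottleneck, dim)
        nn.init.zeros_(self.up.weight)  # zero-init so the adapter starts as identity
        nn.init.zeros_(self.up.bias)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Residual adaptation: frozen features plus a small task-specific update.
        return x + self.up(self.act(self.down(x)))

class CAFIBlock(nn.Module):
    """Cross-attention feature interaction: visual tokens query textual tokens."""
    def __init__(self, dim: int, num_heads: int = 8):
        super().__init__()
        self.norm_v = nn.LayerNorm(dim)
        self.norm_t = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.ffn = nn.Sequential(
            nn.LayerNorm(dim),
            nn.Linear(dim, dim * 4),
            nn.GELU(),
            nn.Linear(dim * 4, dim),
        )

    def forward(self, vis: torch.Tensor, txt: torch.Tensor) -> torch.Tensor:
        # vis: (B, N, C) visual tokens; txt: (B, L, C) textual prompt features.
        q = self.norm_v(vis)
        kv = self.norm_t(txt)
        out, _ = self.attn(q, kv, kv)  # visual queries attend to textual keys/values
        vis = vis + out                # inject textual knowledge into visual features
        vis = vis + self.ffn(vis)
        return vis

# Toy usage: fuse CLIP-style prompt embeddings with restoration features.
if __name__ == "__main__":
    vis = torch.randn(2, 196, 512)   # visual tokens from the restoration branch
    txt = torch.randn(2, 77, 512)    # semantic + degradation prompt embeddings
    fused = CAFIBlock(512)(vis, txt)
    print(fused.shape)               # torch.Size([2, 196, 512])
```

The zero-initialized up-projection is a common adapter-tuning choice: training starts from the frozen VLM's behavior and gradually learns the task-specific update.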
The training code will be released after the paper is accepted.
You should change the paths to yours in the Train.py file. Then run the following script to train the model:

```
python Train.py
```

You should change the paths to yours in the test.py file. Then run the following script to test the trained model:

```
python test.py
```

- Send an e-mail to [email protected] if you have critical issues to be addressed.
- Please note that there may be a slight gap from the results in the final version of the paper due to differences in testing devices and environments.
If TVI-Derain helps your research or work, please consider citing it:
@InProceedings{
}
This code is based on PromptIR, DA-CLIP, and RLP. Thanks for their awesome work.
