Approximate Nullspace Augmented Finetuning for Robust Vision Transformers

Official codebase for the CPAL 2025 (Oral) paper.

"Approximate Nullspace Augmented Finetuning for Robust Vision Transformers"
Haoyang Liu, Aditya Singh, Yijiang Li, Haohan Wang
[Paper] [Code] [Project Page]

Abstract.

Enhancing the robustness of deep learning models, particularly in the realm of vision transformers (ViTs), is crucial for their real-world deployment. In this work, we provide a finetuning approach to enhance the robustness of vision transformers inspired by the concept of nullspace from linear algebra. Our investigation centers on whether a vision transformer can exhibit resilience to input variations akin to the nullspace property in linear mappings, which would imply that perturbations sampled from this nullspace do not influence the model's output when added to the input. We start from the observation that many existing ViTs satisfy this property because their patch embedding layer has a non-trivial nullspace. Then, we extend the notion of nullspace to nonlinear settings and demonstrate that it is possible to synthesize approximate nullspace elements for ViT's encoder blocks through optimization. Finally, we propose a finetuning strategy for ViTs wherein we augment the training data with synthesized approximate nullspace noise. We find that our finetuning approach significantly improves the models' robustness to both adversarial and natural image perturbations.
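
To make the nullspace idea concrete, here is a minimal NumPy sketch. It is not taken from this repository and is not the paper's exact formulation. It assumes a toy linear "patch embedding" `W` with more input dimensions than output dimensions, uses `tanh(XW^T)` as a hypothetical stand-in for a nonlinear encoder block, and assumes a squared-error objective with a fixed noise norm (so the optimizer cannot collapse to zero noise). All names (`nullspace_basis`, `output_shift`, `synthesize_noise`) and all sizes are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy linear "patch embedding": maps p-dim flattened patches to d-dim tokens.
# Because p > d, the map has a nullspace of dimension at least p - d.
p, d = 12, 4
W = rng.standard_normal((d, p))

def nullspace_basis(M, tol=1e-10):
    """Return an orthonormal basis (as columns) of the nullspace of M via SVD."""
    _, s, vt = np.linalg.svd(M, full_matrices=True)
    rank = int(np.sum(s > tol * s.max()))
    return vt[rank:].T

# Exact case: adding a nullspace element leaves the linear output unchanged.
B = nullspace_basis(W)
x = rng.standard_normal(p)
v_exact = B @ rng.standard_normal(B.shape[1])
exact_gap = np.linalg.norm(W @ (x + v_exact) - W @ x)

# Nonlinear case: a hypothetical stand-in for an encoder block.
def f(X):
    return np.tanh(X @ W.T)

def output_shift(X, v):
    """Mean squared change in f's output when noise v is added to every input."""
    return np.mean(np.sum((f(X + v) - f(X)) ** 2, axis=1))

def synthesize_noise(X, v0, steps=300, lr=0.01, radius=1.0):
    """Projected gradient descent on output_shift, keeping ||v|| equal to radius."""
    v = v0.copy()
    target = f(X)
    for _ in range(steps):
        out = f(X + v)
        # Chain rule through tanh: d tanh(z)/dz = 1 - tanh(z)^2.
        grad = 2.0 * (((out - target) * (1.0 - out ** 2)) @ W).mean(axis=0)
        v -= lr * grad
        v *= radius / np.linalg.norm(v)  # project back onto the sphere
    return v

X = 0.3 * rng.standard_normal((64, p))
v0 = rng.standard_normal(p)
v0 /= np.linalg.norm(v0)

loss_before = output_shift(X, v0)
v_opt = synthesize_noise(X, v0)
loss_after = output_shift(X, v_opt)

# Augmentation step: inputs perturbed by the synthesized noise.
X_aug = X + v_opt

print(f"linear output change with exact nullspace noise: {exact_gap:.2e}")
print(f"nonlinear output shift before / after optimization: "
      f"{loss_before:.4f} / {loss_after:.4f}")
```

In this toy setting the optimized noise keeps a fixed norm while changing the output far less than random noise of the same norm. A finetuning loop along the lines described in the abstract would then train on inputs such as `X_aug` in addition to the clean data.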

Results

Getting Started

Installation

Usage

Citation

If you find this project useful, please consider citing:

@article{liu2024approximate,
  title={Approximate Nullspace Augmented Finetuning for Robust Vision Transformers},
  author={Liu, Haoyang and Singh, Aditya and Li, Yijiang and Wang, Haohan},
  journal={arXiv preprint arXiv:2403.10476},
  year={2024}
}

Folders and files

__pycache__
hal
hfai
nulspace_log
src
.env
.gitignore
GELU.py
ImageNetDG.py
ImageNetDG_10.py
README.md
constants.py
cross_main.py
cross_theta_main.py
evaluate_single.py
main_enc_level.py
main_input_level.py
max_backbone.py
max_noise.py
methods.py
nullcomponent.py
requirements.txt
train_example.py
train_single.py
upload_artifacts.py
utils.py
vision.py
