chenw20/SGPA

Sparse Gaussian Process Attention

This is example code for the paper Calibrating Transformers via Sparse Gaussian Processes (ICLR 2023).

This code implements SGPA on the CIFAR-10 and IMDB datasets.
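
The exact SGPA construction is given in the paper; purely as an illustrative sketch of the general idea — computing attention outputs as the posterior mean of a sparse GP regressor, with the keys playing the role of inducing inputs — here is a minimal NumPy example. The names (rbf_kernel, kernel_attention) and all parameters are hypothetical, not this repository's API:

```python
import numpy as np

def rbf_kernel(a, b, lengthscale=1.0):
    # Squared-exponential kernel between the rows of a and b.
    sq_dists = ((a[:, None, :] - b[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * sq_dists / lengthscale ** 2)

def kernel_attention(queries, keys, values, noise=1e-2):
    # GP-regression posterior mean evaluated at the queries, with the
    # keys acting as inducing/training inputs and the values as targets.
    k_qk = rbf_kernel(queries, keys)                      # (n_q, n_k)
    k_kk = rbf_kernel(keys, keys)                         # (n_k, n_k)
    weights = np.linalg.solve(k_kk + noise * np.eye(len(keys)), values)
    return k_qk @ weights                                 # (n_q, d_v)

rng = np.random.default_rng(0)
q = rng.standard_normal((4, 8))   # 4 queries, dimension 8
k = rng.standard_normal((6, 8))   # 6 keys / inducing inputs
v = rng.standard_normal((6, 3))   # value vectors, dimension 3
out = kernel_attention(q, k, v)
print(out.shape)  # (4, 3)
```

Unlike softmax attention, this produces a full GP posterior in the paper's formulation, which is what enables calibrated predictive uncertainty; the sketch above only shows the posterior mean.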

To use this code, simply run train_cifar.py or train_imdb.py.
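
Since the paper's subject is calibration, a standard metric for evaluating the trained models is the expected calibration error (ECE): bin predictions by confidence and average the gap between accuracy and confidence per bin. The sketch below is a generic ECE implementation for reference, not code from this repository:

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=10):
    # Bin predictions by confidence and average |accuracy - confidence|,
    # weighted by the fraction of samples falling in each bin.
    bins = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(bins[:-1], bins[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            acc = correct[mask].mean()
            conf = confidences[mask].mean()
            ece += mask.mean() * abs(acc - conf)
    return ece

# Toy example: a perfectly calibrated predictor (90% confidence,
# 90% accuracy) has an ECE of zero.
conf = np.full(10, 0.9)
corr = np.array([1, 1, 1, 1, 1, 1, 1, 1, 1, 0], dtype=float)
print(round(expected_calibration_error(conf, corr), 3))  # 0.0
```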

The IMDB dataset can be downloaded here.

Dependencies:

  • Python - 3.8
  • PyTorch - 1.10.2
  • numpy - 1.22.4
  • einops - 0.4.1
  • pandas - 1.4.3
  • transformers - 4.18.0
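
Assuming a pip-based environment, the pinned versions above could be installed in one step; note that the PyPI package name for PyTorch is torch, and the exact torch wheel may vary by platform and CUDA version (this is a sketch, not an official install command from this repository):

```shell
pip install torch==1.10.2 numpy==1.22.4 einops==0.4.1 pandas==1.4.3 transformers==4.18.0
```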

Citing the paper (BibTeX)

@inproceedings{chen2023calibrating,
  title = {Calibrating Transformers via Sparse Gaussian Processes},
  author = {Chen, Wenlong and Li, Yingzhen},
  booktitle = {International Conference on Learning Representations},
  year = {2023}
}
