This repository contains the code for G-Softmax, introduced in the following paper. It is built on the Facebook ResNet Torch project.
Gaussian-based softmax: Improving Intra-class Compactness and Inter-class Separability of Features (paper)
Yan Luo, Yongkang Wong, Mohan S Kankanhalli, and Qi Zhao.
If you find Gaussian-based Softmax useful in your research, please consider citing:
@article{TNNLS_2019_gsoftmax,
title={G-softmax: Improving Intra-class Compactness and Inter-class Separability of Features},
author={Luo, Yan and Wong, Yongkang and Kankanhalli, Mohan S and Zhao, Qi},
journal={IEEE Transactions on Neural Networks and Learning Systems}
}
- Install Torch and required dependencies like cuDNN. See the instructions here for a step-by-step guide.
- Install the probability distributions package for Torch. See the instructions here.
- Install the requirements of the Facebook ResNet Torch project.
- For evaluation of multi-label classification, install matio by

```shell
luarocks install matio
```

src/GaussianSoftMaxCriterion.lua is for single-label classification, while src/MultiCrossEntropyCriterion.lua and src/MltGaussianSoftMaxCriterion.lua are for multi-label classification. Their usage is the same as nn.CrossEntropyCriterion.
The three files in the src folder are the core files of this project. The remaining files come from the ResNet Torch project and were modified accordingly for evaluation purposes.
Step 1: Download Tiny ImageNet
The directory structure of the dataset should be re-organized as
/home/yluo/project/dataset/tinyimagenet
├── images
│ ├── test
│ ├── train
│ └── val
├── wnids.txt
└── words.txt
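Tiny ImageNet ships its validation images in one flat folder with a separate annotation file, so the val split has to be regrouped by class to match the layout above. The sketch below shows that reorganization logic; to stay self-contained it builds a tiny mock `val/val_annotations.txt` (two entries) in a temporary directory rather than touching a real dataset copy, and it assumes the standard tab-separated annotation format (image name, WordNet ID, bounding box).

```shell
# Sketch: regroup Tiny ImageNet val images into per-class folders.
# val/val_annotations.txt (tab-separated: image, WordNet ID, bbox) is part
# of the standard Tiny ImageNet release; a mock copy is built here for demo.
DATA=$(mktemp -d)
mkdir -p "$DATA/val/images"
printf 'val_0.JPEG\tn01443537\t0\t0\t62\t62\n'  > "$DATA/val/val_annotations.txt"
printf 'val_1.JPEG\tn02124075\t0\t0\t62\t62\n' >> "$DATA/val/val_annotations.txt"
touch "$DATA/val/images/val_0.JPEG" "$DATA/val/images/val_1.JPEG"

# Move each image into a folder named after its WordNet ID.
while read -r img wnid _; do
  mkdir -p "$DATA/val/$wnid"
  mv "$DATA/val/images/$img" "$DATA/val/$wnid/"
done < "$DATA/val/val_annotations.txt"

ls "$DATA/val"   # images plus one folder per WordNet ID
```

Point `DATA` at your real dataset root (and drop the mock-setup lines) to apply the same loop to the full val split.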
Step 2: Download the pretrained ResNet-101 model from the ResNet Torch homepage
Step 3: To use softmax as the loss function for training, run
```shell
export save='logs/rsn101_tinyimagenet'
mkdir -p $save
CUDA_VISIBLE_DEVICES=1,2 th main_single.lua \
-data '/home/yluo/project/dataset/tinyimagenet/images' \
-retrain '/home/yluo/project/lua/saliency_torch/pretrained/resnet-101.t7' \
-save $save \
-batchSize 80 \
-nGPU 2 \
-nThreads 4 \
-shareGradInput true \
-dataset 'tinyimagenet' \
-resetClassifier true \
-nClasses 200 \
-LR 1e-03 \
-imgSize 224 \
-featSize 7 \
-nEpochs 30 \
-lFunc 'ce' | tee $save/log.txt
```

Set -data to your local Tiny ImageNet images folder and set -retrain to the local path of the pretrained ResNet-101 model.
To use G-softmax as the loss function for training, run
```shell
export save='logs/rsn101_tinyimagenet_gsm'
mkdir -p $save
CUDA_VISIBLE_DEVICES=1,2 th main_single.lua \
-data '/home/yluo/project/dataset/tinyimagenet/images' \
-retrain '/home/yluo/project/lua/saliency_torch/pretrained/resnet-101.t7' \
-save $save \
-batchSize 80 \
-nGPU 2 \
-nThreads 4 \
-shareGradInput true \
-dataset 'tinyimagenet' \
-resetClassifier true \
-nClasses 200 \
-LR 1e-03 \
-imgSize 224 \
-featSize 7 \
-nEpochs 30 \
-gsm_mu 0 \
-gsm_sigma 1 \
-gsm_scale .1 \
-lFunc 'gsm' | tee $save/log.txt
```

where -gsm_mu and -gsm_sigma are the initial mean and standard deviation of the distribution, respectively.
Step 1: Download MS COCO
The directory structure of the dataset should be re-organized as
/home/yluo/project/dataset/mscoco/images
├── train2014
├── train2014.t7
├── val2014
└── val2014.t7
Step 2: Download the pretrained ResNet-101 model from the ResNet Torch homepage
Step 3: To use softmax as the loss function for training, run
```shell
export save='logs/rsn101_coco'
mkdir -p $save
CUDA_VISIBLE_DEVICES=1,2 th main_multi.lua \
-data '/home/yluo/project/dataset/mscoco/images/' \
-retrain '/home/yluo/project/lua/saliency_torch/pretrained/resnet-101.t7' \
-save $save \
-batchSize 16 \
-nGPU 2 \
-nThreads 4 \
-shareGradInput true \
-dataset 'coco' \
-resetClassifier true \
-nClasses 160 \
-LR 1e-05 \
-imgSize 448 \
-featSize 14 \
-nEpochs 10 \
-lFunc 'mce' | tee $save/log.txt
```

To use G-softmax as the loss function for training, run
```shell
export save='logs/rsn101_coco_gsm'
mkdir -p $save
CUDA_VISIBLE_DEVICES=1,2 th main_multi.lua \
-data '/home/yluo/project/dataset/mscoco/images/' \
-retrain '/home/yluo/project/lua/saliency_torch/pretrained/resnet-101.t7' \
-save $save \
-batchSize 16 \
-nGPU 2 \
-nThreads 4 \
-shareGradInput true \
-dataset 'coco' \
-resetClassifier true \
-nClasses 160 \
-LR 1e-05 \
-imgSize 448 \
-featSize 14 \
-nEpochs 10 \
-gsm_mu 0 \
-gsm_sigma 1 \
-gsm_scale 1 \
-lFunc 'gsm' | tee $save/log.txt
```

Step 1: Download NUS-WIDE
Since NUS-WIDE contains invalid and untagged images, we follow the work Learning Spatial Regularization with Image-level Supervisions for Multi-label Image Classification and remove these images, generating new nus_wide_test_imglist.txt, nus_wide_test_label.txt, nus_wide_train_imglist.txt, and nus_wide_train_label.txt under the clean folder. These files can be downloaded from here.
/home/yluo/project/dataset/nuswide/
├── clean
│ ├── nus_wide_test_imglist.txt
│ ├── nus_wide_test_label.txt
│ ├── nus_wide_train_imglist.txt
│ └── nus_wide_train_label.txt
├── Concepts81.txt
└── images
There are 81 folders under the images folder.
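As a quick sanity check that the download unpacked completely, the first-level concept folders under images can be counted with find. The sketch below demonstrates the check against a mock directory tree built in a temporary location (the paths and concept names are placeholders, not the real dataset).

```shell
# Sketch: count first-level concept folders under images/ (expect 81
# for NUS-WIDE). A mock tree stands in for the real dataset here.
ROOT=$(mktemp -d)
mkdir -p "$ROOT/images"
for i in $(seq 1 81); do mkdir "$ROOT/images/concept_$i"; done

nfolders=$(find "$ROOT/images" -mindepth 1 -maxdepth 1 -type d | wc -l)
echo "found $nfolders concept folders"   # expect 81 on a complete download
```

Set ROOT to your local NUS-WIDE root to run the same count against the real images folder.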
Step 2: Download the pretrained ResNet-101 model from the ResNet Torch homepage
Step 3: To use softmax as the loss function for training, run
```shell
export save='logs/rsn101_nuswide'
mkdir -p $save
CUDA_VISIBLE_DEVICES=0,1 th main_multi.lua \
-data '/home/yluo/project/dataset/nuswide/images/' \
-retrain '/home/yluo/project/lua/saliency_torch/pretrained/resnet-101.t7' \
-save $save \
-batchSize 16 \
-nGPU 2 \
-nThreads 4 \
-shareGradInput true \
-dataset 'nuswideclean' \
-resetClassifier true \
-nClasses 162 \
-LR 1e-05 \
-imgSize 448 \
-featSize 14 \
-nEpochs 10 \
-lFunc 'mce' | tee $save/log.txt
```

To use G-softmax as the loss function for training, run
```shell
export save='logs/rsn101_nuswide_gsm'
mkdir -p $save
CUDA_VISIBLE_DEVICES=0,1 th main_multi.lua \
-data '/home/yluo/project/dataset/nuswide/images/' \
-retrain '/home/yluo/project/lua/saliency_torch/pretrained/resnet-101.t7' \
-save $save \
-batchSize 16 \
-nGPU 2 \
-nThreads 4 \
-shareGradInput true \
-dataset 'nuswideclean' \
-resetClassifier true \
-nClasses 162 \
-LR 1e-05 \
-imgSize 448 \
-featSize 14 \
-nEpochs 10 \
-gsm_mu 0 \
-gsm_sigma 1 \
-gsm_scale .1 \
-gsm_lr_w 100 \
-lFunc 'gsm' | tee $save/log.txt
```

Classification: top-1 error. The process prints the top-1 error on the test set at the end of every epoch.
Multi-label Classification: mean average precision (mAP), Class-Precision (C-P), Class-Recall (C-R), Class-F1 (C-F), Overall-Precision (O-P), Overall-Recall (O-R), and Overall-F1 (O-F). The evaluation code for these metrics is in the eval folder and is imported from the Spatial Regularization Network.
luoxx648 at umn.edu
Any discussions, suggestions, and questions are welcome!