
LearnToPayAttention


PyTorch implementation of the ICLR 2018 paper "Learn To Pay Attention".


Most Recent Updates


My implementation is based on the "(VGG-att3)-concat-pc" configuration from the paper, and the model is trained on the CIFAR-100 dataset.
I implemented two versions of the model; the only difference is whether the attention module is inserted before or after the corresponding max-pooling layer (see the sketch below).
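As an illustration only, the minimal sketch below uses a hypothetical, simplified ToyAttention block (the real (VGG-att3)-concat-pc model computes compatibility scores between local features and the global feature of the final layer); it is meant solely to show attending before versus after max-pooling, not to reproduce this repository's model code.

import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyAttention(nn.Module):
    # Simplified, hypothetical attention estimator: 1x1 conv scores over the
    # local feature map, optionally softmax-normalized over spatial positions.
    def __init__(self, channels, normalize_attn=True):
        super().__init__()
        self.normalize_attn = normalize_attn
        self.score = nn.Conv2d(channels, 1, kernel_size=1, bias=False)

    def forward(self, l):
        n, c, h, w = l.size()
        s = self.score(l)                                    # N x 1 x H x W scores
        if self.normalize_attn:
            a = F.softmax(s.view(n, 1, -1), dim=2).view(n, 1, h, w)
        else:
            a = torch.sigmoid(s)
        g = (a.expand_as(l) * l).view(n, c, -1).sum(dim=2)   # attention-weighted feature
        return a, g

conv = nn.Conv2d(3, 64, kernel_size=3, padding=1)
attn = ToyAttention(64)
x = torch.randn(2, 3, 32, 32)      # CIFAR-sized dummy batch
l = F.relu(conv(x))

a1, g1 = attn(l)                   # 'before': attend to the local features, then pool
x_before = F.max_pool2d(l, 2)

x_after = F.max_pool2d(l, 2)       # 'after': pool first, then attend
a2, g2 = attn(x_after)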

(New!) Pre-trained models

Google drive link
Alternative link (Baidu Cloud Disk)

Dependencies

NOTE: If you are using PyTorch < 0.4.1, replace torch.nn.functional.interpolate with torch.nn.Upsample (modify the code in utilities.py).
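As a minimal sketch of the swap described in the note above; the tensor shape and scale factor are illustrative assumptions, not values taken from utilities.py.

import torch
import torch.nn as nn
import torch.nn.functional as F

a = torch.randn(1, 1, 8, 8)        # dummy attention map, N x 1 x H x W

# PyTorch >= 0.4.1: functional API
a_up = F.interpolate(a, scale_factor=4, mode='bilinear', align_corners=False)

# PyTorch < 0.4.1: module-based equivalent
upsample = nn.Upsample(scale_factor=4, mode='bilinear')
a_up_old = upsample(a)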

Training

  1. Pay attention before max-pooling layers
python train.py --attn_mode before --outf logs_before --normalize_attn --log_images
  2. Pay attention after max-pooling layers
python train.py --attn_mode after --outf logs_after --normalize_attn --log_images
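The flags used above come straight from the commands; the argparse sketch below is hypothetical (the defaults and help strings are assumptions, not copied from train.py) and only shows how such flags are typically wired up.

import argparse

parser = argparse.ArgumentParser(description='Learn To Pay Attention on CIFAR-100')
parser.add_argument('--attn_mode', choices=['before', 'after'], default='before',
                    help='insert the attention module before or after max-pooling')
parser.add_argument('--outf', default='logs',
                    help='output folder for checkpoints and logs (assumed default)')
parser.add_argument('--normalize_attn', action='store_true',
                    help='normalize the attention scores across spatial positions')
parser.add_argument('--log_images', action='store_true',
                    help='log attention-map visualizations during training')
args = parser.parse_args()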

Results

Training curve - loss

The x-axis is the number of training iterations.

  1. Pay attention before max-pooling layers

  2. Pay attention after max-pooling layers

  3. Plot in one figure

Training curve - accuracy on test data

The x-axis is the number of training epochs.

  1. Pay attention before max-pooling layers

  2. Pay attention after max-pooling layers

  3. Plot in one figure

Quantitative results (on test data of CIFAR-100)

Method                              Top-1 error (%)
VGG (Simonyan & Zisserman, 2014)    30.62
(VGG-att3)-concat-pc (ICLR 2018)    22.97
attn-before-pooling (my code)       22.62
attn-after-pooling (my code)        22.92

Attention map visualization (on test data of CIFAR-100)

From left to right: L1, L2, L3, original images

  1. Pay attention before max-pooling layers

  2. Pay attention after max-pooling layers
