This is an official implementation of AutoFormer.
AutoFormer is a new one-shot architecture search framework dedicated to vision transformer search. It entangles the weights of different vision transformer blocks in the same layers during supernet training. Benefiting from this strategy, the trained supernet allows thousands of subnets to be very well-trained. Specifically, the performance of these subnets with weights inherited from the supernet is comparable to that of the same subnets retrained from scratch.
- Once-for-all
AutoFormer is a simple yet effective method to train a once-for-all vision transformer supernet.
- Competitive performance
AutoFormer models consistently outperform their DeiT counterparts.
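To make the weight entanglement idea concrete, here is a minimal, hypothetical PyTorch sketch (the class name, dimensions, and candidate widths are our own illustration, not the repository's actual implementation): candidates of different sizes in the same layer share slices of one weight matrix, so training any subnet also updates the others.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class EntangledLinear(nn.Module):
    """Toy weight-entangled linear layer: every candidate output width
    is a leading slice of one shared weight matrix."""

    def __init__(self, in_features, max_out_features):
        super().__init__()
        self.weight = nn.Parameter(0.02 * torch.randn(max_out_features, in_features))
        self.bias = nn.Parameter(torch.zeros(max_out_features))

    def forward(self, x, out_features):
        # Smaller candidates reuse (and therefore co-train) the same weights.
        return F.linear(x, self.weight[:out_features], self.bias[:out_features])

layer = EntangledLinear(in_features=192, max_out_features=320)
x = torch.randn(4, 192)
for dim in (192, 256, 320):              # assumed candidate widths, for illustration only
    print(dim, layer(x, dim).shape)       # all outputs come from the same shared parameters
```

Because all candidates share the same underlying parameters, many subnets end up well-trained without ever being trained individually.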
To set up the environment, you can simply run the following commands:
```
conda create -n Autoformer python=3.6
conda activate Autoformer
pip install -r requirements.txt
```
You first need to download the ImageNet-2012 dataset to the folder ./data/imagenet
and move the validation set to the subfolder ./data/imagenet/val
. To move the validation set, you could use the following script: https://raw.githubusercontent.com/soumith/imagenetloader.torch/master/valprep.sh
The directory structure should follow the standard layout below:
```
/path/to/imagenet/
  train/
    class1/
      img1.jpeg
    class2/
      img2.jpeg
  val/
    class1/
      img3.jpeg
    class2/
      img4.jpeg
```
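As an optional sanity check that the layout matches what the data loader expects, you could run a quick torchvision snippet such as the following (the paths are assumptions matching the layout above):

```python
from torchvision.datasets import ImageFolder

# Assumed local paths, matching the directory layout above.
train_set = ImageFolder("./data/imagenet/train")
val_set = ImageFolder("./data/imagenet/val")

# ImageNet-2012 should give 1000 classes, ~1.28M training and 50k validation images.
print(len(train_set.classes), len(train_set), len(val_set))
```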
For evaluation, we provide the checkpoints of our models on Google Drive and GitHub.
After downloading the models, you can perform the evaluation following the description in Quick Start - Test.
Model download links:
Model | Params. | Top-1 Acc. % | Top-5 Acc. % | Download link |
---|---|---|---|---|
AutoFormer-T | 5.8M | 75.3 | 92.7 | Google Drive, GitHub |
AutoFormer-S | 22.9M | 81.7 | 95.7 | Google Drive, GitHub |
AutoFormer-B | 53.7M | 82.4 | 95.7 | Google Drive, GitHub |
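If you want to inspect a downloaded checkpoint before running the evaluation command, a plain PyTorch check like the one below can be used (the "model" key is an assumption based on common DeiT-style checkpoints, not a guarantee about these files):

```python
import torch

ckpt = torch.load("/PATH/TO/CHECKPOINT", map_location="cpu")
# DeiT-style checkpoints usually keep the weights under a "model" key (assumption).
state_dict = ckpt.get("model", ckpt) if isinstance(ckpt, dict) else ckpt
num_params = sum(v.numel() for v in state_dict.values())
print(f"{num_params / 1e6:.1f}M parameters in the checkpoint")
```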
We provide the supernet training, search, and test code of AutoFormer as follows.
To train the supernet-T/S/B, we provide the corresponding supernet configuration files in /experiments/supernet/
. For example, to train the supernet-B, you can run the following command. The default output path is ./
; you can specify a different path with the --output
argument.
```
python -m torch.distributed.launch --nproc_per_node=8 --use_env supernet_train.py --data-path /PATH/TO/IMAGENET --gp \
--change_qk --relative_position --mode super --dist-eval --cfg ./experiments/supernet/supernet-B.yaml --epochs 500 --warmup-epochs 20 \
--output /OUTPUT_PATH --batch-size 128
```
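Conceptually, training with --mode super samples a different subnet configuration at each step and updates only the entangled weights that configuration touches. Below is a highly simplified sketch; the search-space values and the commented-out training loop are illustrative assumptions, and the real choices live in the YAML configuration files.

```python
import random

# Toy search space for illustration; the real choices live in the supernet-*.yaml files.
search_space = {
    "depth": [14, 15, 16],
    "embed_dim": [528, 576, 624],
    "mlp_ratio": [3.0, 3.5, 4.0],
    "num_heads": [8, 9, 10],
}

def sample_subnet(space):
    """Pick one value per dimension to define a subnet configuration."""
    return {name: random.choice(choices) for name, choices in space.items()}

print(sample_subnet(search_space))
# In the real training loop, the supernet would be configured with this sample
# for the current step, so only the entangled weights it touches get updated.
```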
We run our evolution search on part of the ImageNet training dataset and use the validation set of ImageNet as the test set for a fair comparison. To generate the subImageNet split in /PATH/TO/IMAGENET
, you can simply run:
```
python ./lib/subImageNet.py --data-path /PATH/TO/IMAGENET
```
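For intuition, the sub-ImageNet split is essentially a small, class-balanced sample of the training images. A rough sketch of such sampling could look like this (the per-class count and function name are assumptions, not the script's exact behavior):

```python
import os
import random

def sample_subimagenet(train_dir, per_class=50, seed=0):
    """Build a class-balanced list of training images (illustrative only;
    the per-class count here is an assumption, not the script's setting)."""
    random.seed(seed)
    subset = []
    for cls in sorted(os.listdir(train_dir)):
        images = sorted(os.listdir(os.path.join(train_dir, cls)))
        picked = random.sample(images, min(per_class, len(images)))
        subset.extend(os.path.join(cls, name) for name in picked)
    return subset

# files = sample_subimagenet("/PATH/TO/IMAGENET/train")
```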
After obtaining the subImageNet split and training the supernet, we can perform the evolution search using the command below. Please remember to configure the specific parameter constraints of the search with --min-param-limits
and --param-limits
:
```
python -m torch.distributed.launch --nproc_per_node=8 --use_env evolution.py --data-path /PATH/TO/IMAGENET --gp \
--change_qk --relative_position --dist-eval --cfg ./experiments/supernet/supernet-B.yaml --resume /PATH/TO/CHECKPOINT \
--min-param-limits YOUR/CONFIG --param-limits YOUR/CONFIG --data-set EVO_IMNET
```
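The two limits act as a hard constraint while candidates are sampled, mutated, and crossed over. The sketch below illustrates such a constrained evolutionary loop with made-up helper names; it is a conceptual outline, not the repository's evolution.py.

```python
import random

def evolve(sample_fn, mutate_fn, crossover_fn, score_fn, count_params,
           min_params, max_params, population=50, generations=20, topk=10):
    """Constrained evolutionary search over subnet configurations: only
    candidates whose parameter count lies in [min_params, max_params] are
    kept, mirroring --min-param-limits / --param-limits."""
    def valid(cfg):
        return min_params <= count_params(cfg) <= max_params

    pool = []
    while len(pool) < population:                 # seed with valid random subnets
        cfg = sample_fn()
        if valid(cfg):
            pool.append(cfg)

    for _ in range(generations):
        pool.sort(key=score_fn, reverse=True)     # rank by accuracy on subImageNet
        parents = pool[:topk]
        children = []
        while len(children) < population - topk:
            if random.random() < 0.5:
                child = mutate_fn(random.choice(parents))
            else:
                child = crossover_fn(*random.sample(parents, 2))
            if valid(child):                      # enforce the parameter constraint
                children.append(child)
        pool = parents + children

    return max(pool, key=score_fn)
```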
To test our trained models, you need to put the downloaded model in /PATH/TO/CHECKPOINT
. After that, you can use the following command to test the model (please change the config file and model checkpoint according to the model you are testing; here we use AutoFormer-B as an example).
```
python -m torch.distributed.launch --nproc_per_node=8 --use_env supernet_train.py --data-path /PATH/TO/IMAGENET --gp \
--change_qk --relative_position --mode retrain --dist-eval --cfg ./experiments/subnet/AutoFormer-B.yaml --resume /PATH/TO/CHECKPOINT --eval
```
Left: Top-1 accuracy on ImageNet. Our method achieves very competitive performance, superior to the recent DeiT and ViT. Right: 1000 randomly sampled good architectures in the supernet-S. The supernet trained under our strategy allows subnets to be well optimized.
If this repo is helpful for you, please consider citing it. Thank you! :)
@InProceedings{AutoFormer,
title = {AutoFormer: Searching Transformers for Visual Recognition},
author = {Chen, Minghao and Peng, Houwen and Fu, Jianlong and Ling, Haibin},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
month = {October},
year = {2021},
pages = {12270-12280}
}