IAA: Intra-class Adaptive Augmentation with Neighbor Correction for Deep Metric Learning

PyTorch code for our paper "Intra-class Adaptive Augmentation with Neighbor Correction for Deep Metric Learning", accepted by IEEE Transactions on Multimedia, 2022 (IEEE Xplore). It is built on top of MDR.

📣 Related work on deep metric learning:

Self-supervised Synthesis Ranking for Deep Metric Learning
Accepted by T-CSVT 2022

Introduction

Deep metric learning aims to learn an embedding space in which semantically similar samples lie close together and dissimilar ones are pushed apart. To obtain harder and more informative training signals for augmentation and generalization, recent methods generate synthetic samples to boost metric learning losses. However, these methods rely on deterministic, class-independent generation (e.g., simple linear interpolation), which covers only a limited part of the distribution space around the original samples. They overlook how widely characteristics vary across classes and cannot model the rich intra-class variations needed for generation. As a result, the generated samples not only lack the rich semantics of their class but may also act as noisy signals that disturb training.

In this paper, we propose a novel intra-class adaptive augmentation (IAA) framework for deep metric learning. We estimate the intra-class variations of every class and generate adaptive synthetic samples to support hard sample mining and boost metric learning losses. Furthermore, because most datasets have only a few samples per class, we propose neighbor correction to revise inaccurate estimates, based on our finding that similar classes generally have similar variation distributions. Extensive experiments on five benchmarks show that our method outperforms state-of-the-art methods, improving retrieval performance by 3%-6%.
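
The idea described above can be illustrated with a small NumPy sketch. It is not the implementation in this repository and may differ from it substantially. It assumes that intra-class variation is summarized by a per-dimension standard deviation, that neighbor correction is a simple blend with the statistics of the nearest class means, and that synthetic samples are drawn from a Gaussian around a real embedding. The function names and the parameters `k` and `weight` are hypothetical.

```python
import numpy as np

def class_statistics(embeddings, labels):
    """Return the class ids, per-class means, and per-dimension standard deviations."""
    classes = np.unique(labels)
    means = np.stack([embeddings[labels == c].mean(axis=0) for c in classes])
    stds = np.stack([embeddings[labels == c].std(axis=0) for c in classes])
    return classes, means, stds

def neighbor_correct(means, stds, k=2, weight=0.5):
    """Blend each class's std with the mean std of its k nearest classes.

    `k` and `weight` are hypothetical knobs for this sketch.
    """
    # Pairwise Euclidean distances between class means.
    dists = np.linalg.norm(means[:, None, :] - means[None, :, :], axis=-1)
    np.fill_diagonal(dists, np.inf)  # a class is not its own neighbor
    neighbors = np.argsort(dists, axis=1)[:, :k]
    neighbor_std = stds[neighbors].mean(axis=1)
    return weight * stds + (1.0 - weight) * neighbor_std

def synthesize(sample, class_std, num, rng):
    """Draw `num` synthetic embeddings around `sample` with class-specific spread."""
    noise = rng.standard_normal((num, sample.shape[0]))
    return sample + noise * class_std

# Toy data: 3 classes with 4 samples each, 4-dimensional embeddings.
rng = np.random.default_rng(0)
emb = rng.standard_normal((12, 4))
lab = np.repeat(np.arange(3), 4)

classes, means, stds = class_statistics(emb, lab)
corrected = neighbor_correct(means, stds, k=2, weight=0.5)
synthetic = synthesize(emb[0], corrected[0], num=3, rng=rng)
```

With only four samples per class, the raw `stds` are noisy; blending them with the neighbors' statistics is one simple way to stabilize them before sampling.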

Requirements

We recommend the following dependencies.

Python 3.8
torch 1.7.0
torchvision 0.8.0
numpy
tqdm
scipy
Pillow
matplotlib

Preparing Datasets

Download these datasets.
- CUB-200-2011
- Cars-196 (Img, Annotation)
- Stanford Online Products
- In-Shop Clothes Retrieval
Extract the compressed files (tgz or zip) into MyDataset/; e.g., for Cars-196, put the files in MyDataset/Cars196. For the other directory names, see the Python files of the related datasets in utils/dataset, or modify these names as you see fit.
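
The extraction step can also be scripted. The helper below is a sketch using only the Python standard library; the function name and the example archive path are hypothetical, and only archives from a trusted source should be unpacked this way.

```python
import shutil
from pathlib import Path

def extract_dataset(archive, root="MyDataset", name="Cars196"):
    """Unpack `archive` (tgz or zip) into root/name and return that directory."""
    target = Path(root) / name
    target.mkdir(parents=True, exist_ok=True)
    # unpack_archive picks the format from the file extension (.zip, .tgz, .tar.gz, ...).
    shutil.unpack_archive(str(archive), str(target))
    return target

# Example call (hypothetical archive path):
# extract_dataset("downloads/cars196_images.tgz", root="MyDataset", name="Cars196")
```

If the archive already contains a top-level folder, the files will end up one level deeper than root/name, so the layout should be checked against the paths expected in utils/dataset.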

Training

Set up the related arguments.
- The folder MyDataset is the root path of the datasets used in this paper (including CUB, CARS, SOP); it can be changed with the argparse parameter --data.
- The folder results is the log path where the models and results of training are recorded; it can be changed with the argparse parameter --save-dir.
- The folder weights_models holds the weights of the pretrained backbone networks; it can be changed with the argparse parameter --weight_path.
Run run.py for the different metric learning losses and datasets.
To use the IAA framework, set --intra 1 (default); to train the baseline model instead, set --intra 0.
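
For orientation, the flags mentioned in this README could be declared with argparse roughly as follows. This is a hypothetical sketch, not the parser in run.py: the folder defaults and the default of --intra come from the text above, while every other default and the `choices` lists are assumptions based only on the example commands in this README.

```python
import argparse

def build_parser():
    """Hypothetical parser mirroring the flags used in this README."""
    p = argparse.ArgumentParser(description="IAA training options (sketch)")
    # Paths described in the list above.
    p.add_argument("--data", default="MyDataset", help="root path of the datasets")
    p.add_argument("--save-dir", default="results", help="where logs and models are written")
    p.add_argument("--weight_path", default="weights_models", help="pretrained backbone weights")
    # Values seen in the example commands; the real script may accept more.
    p.add_argument("--dataset", default="cub200",
                   choices=["cub200", "cars196", "stanford", "inshop"])
    p.add_argument("--backbone", default="googlenet",
                   choices=["googlenet", "bninception", "resnet50"])
    p.add_argument("--loss", default="MS", choices=["MS", "Contrastive"])
    p.add_argument("--intra", type=int, default=1, choices=[0, 1],
                   help="1 enables IAA, 0 trains the baseline")
    # Assumed defaults, chosen only for this sketch.
    p.add_argument("--intra_lamda", type=float, default=0.8)
    p.add_argument("--aug_num", type=int, default=3)
    p.add_argument("--lr", type=float, default=3e-5)
    p.add_argument("--batch", type=int, default=128)
    return p

# Parse an explicit argument list instead of sys.argv for demonstration.
args = build_parser().parse_args(["--dataset", "stanford", "--batch", "180", "--intra", "0"])
```

Note that argparse exposes `--save-dir` as `args.save_dir`, with the hyphen converted to an underscore.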

CUB-200-2011

# googlenet
# Contrastive loss and MS loss
python run.py --dataset cub200 --backbone googlenet --loss MS --intra_lamda 0.8 --aug_num 3
python run.py --dataset cub200 --backbone googlenet --loss Contrastive --lr 3e-5 --intra_lamda 0.8 --aug_num 3

Cars-196

# googlenet
python run.py --dataset cars196 --backbone googlenet --loss MS --intra_lamda 0.8 --aug_num 3
python run.py --dataset cars196 --backbone googlenet --loss Contrastive --intra_lamda 0.8 --aug_num 3

Stanford Online Products

# googlenet
python run.py --dataset stanford --backbone googlenet --batch 180 --lr 1e-4 --loss MS --intra_lamda 0.6 --aug_num 3
python run.py --dataset stanford --backbone googlenet --batch 180 --lr 1e-4 --loss Contrastive --intra_lamda 0.6 --aug_num 2

# bninception
python run.py --dataset stanford --backbone bninception --batch 256 --lr 1e-4 --loss MS --intra_lamda 0.5 --aug_num 3

# resnet50
python run.py --dataset stanford --backbone resnet50 --batch 256 --lr 1e-4 --loss MS --intra_lamda 0.5 --aug_num 3

In-Shop Clothes Retrieval

# googlenet
python run.py --dataset inshop --backbone googlenet --batch 180 --lr 1e-4 --loss MS --intra_lamda 0.6 --aug_num 3
python run.py --dataset inshop --backbone googlenet --batch 180 --lr 1e-4 --loss Contrastive --intra_lamda 0.6 --aug_num 2

# bninception
python run.py --dataset inshop --backbone bninception --batch 256 --lr 1e-4 --loss MS --intra_lamda 0.5 --aug_num 3

# resnet50
python run.py --dataset inshop --backbone resnet50 --batch 256 --lr 1e-4 --loss MS --intra_lamda 0.5 --aug_num 3

Reference

@article{Fu2022IAA,
  author={Fu, Zheren and Mao, Zhendong and Hu, Bo and Liu, An-An and Zhang, Yongdong},
  journal={IEEE Transactions on Multimedia},
  title={Intra-class Adaptive Augmentation with Neighbor Correction for Deep Metric Learning},
  year={2022},
  pages={1-14},
  doi={10.1109/TMM.2022.3227414}
}

@article{Fu2022SSR,
  author={Fu, Zheren and Mao, Zhendong and Yan, Chenggang and Liu, An-An and Xie, Hongtao and Zhang, Yongdong},
  journal={IEEE Transactions on Circuits and Systems for Video Technology},
  title={Self-Supervised Synthesis Ranking for Deep Metric Learning},
  year={2022},
  volume={32},
  number={7},
  pages={4736-4750},
  doi={10.1109/TCSVT.2021.3124908}
}

Contact

If you have any questions, please open a new issue or contact [email protected]. Thanks!

License

Apache License 2.0
