Wang, Haotao and Xiao, Chaowei and Kossaifi, Jean and Yu, Zhiding and Anandkumar, Anima and Wang, Zhangyang (2021) AugMax: Adversarial Composition of Random Augmentations for Robust Training. In: Advances in Neural Information Processing Systems 34. Advances in Neural Information Processing Systems , pp. 237-250. https://resolver.caltech.edu/CaltechAUTHORS:20220714-224700672
Files available for this item:
- PDF (Published Version), 1MB — see Usage Policy.
- PDF (Supplemental Material), 162kB — see Usage Policy.
- PDF (ArXiv discussion paper, Submitted Version), 1MB — see Usage Policy.
Use this Persistent URL to link to this item: https://resolver.caltech.edu/CaltechAUTHORS:20220714-224700672
Abstract
Data augmentation is a simple yet effective way to improve the robustness of deep neural networks (DNNs). Diversity and hardness are two complementary dimensions of data augmentation for achieving robustness. For example, AugMix explores random compositions of a diverse set of augmentations to enhance broader coverage, while adversarial training generates adversarially hard samples to expose weaknesses. Motivated by this, we propose a data augmentation framework, termed AugMax, to unify the two aspects of diversity and hardness. AugMax first randomly samples multiple augmentation operators and then learns an adversarial mixture of the selected operators. Being a stronger form of data augmentation, AugMax leads to a significantly augmented input distribution, which makes model training more challenging. To solve this problem, we further design a disentangled normalization module, termed DuBIN (Dual-Batch-and-Instance Normalization), that disentangles the instance-wise feature heterogeneity arising from AugMax. Experiments show that AugMax-DuBIN leads to significantly improved out-of-distribution robustness, outperforming prior art by 3.03%, 3.49%, 1.82% and 0.71% on CIFAR10-C, CIFAR100-C, Tiny ImageNet-C and ImageNet-C. Code and pretrained models are available at https://github.com/VITA-Group/AugMax.
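For illustration only, the snippet below is a minimal sketch of the adversarial-mixing idea described in the abstract, not the authors' released implementation: a few randomly sampled augmentation operators are applied to an input, and the mixing weights and blending coefficient are then optimized for a handful of steps to maximize the training loss. The names `model`, `augment_ops`, `steps`, and `lr` are assumptions introduced for this example.

```python
# Illustrative sketch (not the authors' code): AugMax-style adversarial
# mixture of randomly sampled augmentations. `model` is a classifier,
# `augment_ops` a list of tensor-to-tensor augmentation ops, `x` a batch
# of images, and `y` the labels -- all assumed to be supplied by the caller.
import torch
import torch.nn.functional as F

def augmax_mix(x, y, model, augment_ops, steps=5, lr=0.1):
    # Apply each sampled augmentation once; the views stay fixed while the
    # mixing parameters are optimized adversarially.
    views = [op(x) for op in augment_ops]
    w = torch.zeros(len(views), requires_grad=True)  # logits over augmented views
    m = torch.zeros(1, requires_grad=True)           # logit for original-vs-mixture blend
    opt = torch.optim.SGD([w, m], lr=lr)

    for _ in range(steps):
        weights = F.softmax(w, dim=0)
        mixed = sum(wi * v for wi, v in zip(weights, views))
        x_adv = torch.sigmoid(m) * x + (1 - torch.sigmoid(m)) * mixed
        loss = F.cross_entropy(model(x_adv), y)
        opt.zero_grad()
        (-loss).backward()  # gradient ascent on the loss (adversarial objective)
        opt.step()

    # Return the final adversarially mixed batch for use in training.
    with torch.no_grad():
        weights = F.softmax(w, dim=0)
        mixed = sum(wi * v for wi, v in zip(weights, views))
        return torch.sigmoid(m) * x + (1 - torch.sigmoid(m)) * mixed
```

In actual robust training one would feed the returned batch to the model alongside the clean batch; the paper's DuBIN module then handles the feature heterogeneity this stronger augmentation induces.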
| Item Type: | Book Section |
|---|---|
| Related URLs: | |
| ORCID: | |
| Additional Information: | Work partially done during an internship at NVIDIA. Z.W. is supported by the U.S. Army Research Laboratory Cooperative Research Agreement W911NF17-2-0196 (IOBT REIGN), and an NVIDIA Applied Research Accelerator Program. |
| Funders: | |
| DOI: | 10.48550/arXiv.2110.13771 |
| Record Number: | CaltechAUTHORS:20220714-224700672 |
| Persistent URL: | https://resolver.caltech.edu/CaltechAUTHORS:20220714-224700672 |
| Usage Policy: | No commercial reproduction, distribution, display or performance rights in this work are provided. |
| ID Code: | 115608 |
| Collection: | CaltechAUTHORS |
| Deposited By: | George Porter |
| Deposited On: | 15 Jul 2022 23:06 |
| Last Modified: | 02 Jun 2023 01:32 |