"Toward Learning Robust and Invariant Representations with Alignment Regularization and Data Augmentation" by Haohan Wang

Selected Works of Eric P. Xing

Article

Toward Learning Robust and Invariant Representations with Alignment Regularization and Data Augmentation

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Haohan Wang, Carnegie Mellon University, Pittsburgh, PA, United States
Zeyi Huang, University of Wisconsin-Madison, Madison, PA, United States
Xindi Wu, Carnegie Mellon University, Pittsburgh, PA, United States
Eric P. Xing, Carnegie Mellon University, Pittsburgh, PA, United States & Mohamed bin Zayed University of Artificial Intelligence

Link

Document Type

Conference Proceeding

Abstract

Data augmentation has been proven to be an effective technique for developing machine learning models that are robust to known classes of distributional shifts (e.g., rotations of images), and alignment regularization is a technique often used together with data augmentation to further help the model learn representations invariant to the shifts used to augment the data. In this paper, motivated by a proliferation of options of alignment regularizations, we seek to evaluate the performances of several popular design choices along the dimensions of robustness and invariance, for which we introduce a new test procedure. Our synthetic experiment results speak to the benefits of squared "2 norm regularization. Further, we also formally analyze the behavior of alignment regularization to complement our empirical study under assumptions we consider realistic. Finally, we test this simple technique we identify (worst-case data augmentation with squared "2 norm alignment regularization) and show that the benefits of this method outrun those of the specially designed methods. We also release a software package in both TensorFlow and PyTorch for users to use the method with a couple of lines at https://github.com/jyanln/AlignReg. © 2022 Owner/Author.

DOI

10.1145/3534678.3539438

Publication Date

8-14-2022

Keywords

data augmentation,
machine learning,
robustness,
trustworthy,
Testing

Disciplines

Comments

IR Deposit conditions: non-described

Citation Information

H. Wang, Z. Huang, X. Wu, and E.P. Xing, "Toward Learning Robust and Invariant Representations with Alignment Regularization and Data Augmentation", in Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), NY, USA, pp. 1846–1856. doi:10.1145/3534678.3539438