"Assessing Gaussian Process Regression and Permutationally Invariant Polynomial Approaches To Represent High-Dimensional Potential Energy Surfaces" by Chen Qu

Article

Assessing Gaussian Process Regression and Permutationally Invariant Polynomial Approaches To Represent High-Dimensional Potential Energy Surfaces

Journal of Chemical Theory and Computation

Chen Qu, Emory University
Brian L. Van Hoozen, Jr., Emory University
Joel M. Bowman, Emory University
Rodrigo A. Vargas-Hernandez, University of British Columbia - Canada

Find in your library

Document Type

Article

Publication Date

7-10-2018

Disciplines

Abstract

The mathematical representation of large data sets of electronic energies has seen substantial progress in the past 10 years. The so-called Permutationally Invariant Polynomial (PIP) representation is one established approach. This approach dates from 2003, when a global potential energy surface (PES) for CH5+ was reported using a basis of polynomials that are invariant with respect to the 120 permutations of the five equivalent H atoms. More recently, several approaches from “machine learning” have been applied to fit these large data sets. Gaussian Process (GP) regression is such an approach. Here, we consider the implementation of the (full) GP due to Krems and co-workers, with a modification that renders it permutationally invariant, which we denote by PIP-GP. This modification uses the approach of Guo and co-workers and later extended by Zhang and co-workers, to achieve permutational invariance for neural-network fits. The PIP, GP, and PIP-GP approaches are applied to four case studies for fitting data sets of electronic energies: H3O+, OCHCO+, and H2CO/cis-HCOH/trans-HCOH with the goal of assessing precision, accuracy in normal-mode analysis and barrier heights, and timings. We also report an application to (HCOOH)2, where the full PIP approach is possible but where the PIP-GP one is not feasible. However, by replicating data, which is feasible in this case, the GP approach is able to represent the data with precision comparable to that of the PIP approach. We examine these assessments for varying sizes of data sets in each case to determine the dependence of properties of the fits on the training data size. We conclude with some comments on the different aspects of computational effort of the PIP, GP, and PIP-GP approaches and also challenges these methods face for more “rugged” PESs, exemplified here by H2CO/cis-HCOH/trans-HCOH.

Comments

Additional Comments

NSF grant #: CHE-1463552

ORCID ID

0000-0002-0343-3181

DOI

10.1021/acs.jctc.8b00298

Citation Information

Chen Qu, Brian L. Van Hoozen, Joel M. Bowman and Rodrigo A. Vargas-Hernandez. "Assessing Gaussian Process Regression and Permutationally Invariant Polynomial Approaches To Represent High-Dimensional Potential Energy Surfaces" Journal of Chemical Theory and Computation Vol. 14 Iss. 7 (2018) p. 3381 - 3396 ISSN: 1549-9618
Available at: http://works.bepress.com/brian-vanhoozen/2/