ArXiv TLDR

Don't Get Your Kroneckers in a Twist: Gaussian Processes on High-Dimensional Incomplete Grids

🐦 Tweet
2605.08036

Mads Greisen Højlund, August Smart Lykke-Møller, Henry Moss, Ove Christiansen

cs.LG

TLDR

CUTS-GPR enables numerically exact Gaussian Process Regression on high-dimensional incomplete grids with unprecedented speed and scalability.

Key contributions

  • Introduces CUTS-GPR for numerically exact GPR on high-dimensional incomplete grids.
  • Achieves extremely fast kernel matrix-vector products with near-linear scaling in data size.
  • Combines additive kernels with incomplete grids to exploit kernel matrix structure.
  • Enables Bayesian modeling of high-dimensional potential energy surfaces in chemistry.

Why it matters

This paper addresses a longstanding challenge in computational chemistry by enabling Bayesian modeling of high-dimensional potential energy surfaces. Its unprecedented scalability for GPR opens new avenues for complex scientific and engineering problems.

Original Abstract

We introduce CUTS-GPR, a new method for performing numerically exact Gaussian process regression (GPR) in high-dimensional settings. The key component of CUTS-GPR is an extremely fast kernel matrix-vector product, which exhibits near-linear or even linear scaling with the amount of training data, $N$, and low-order polynomial scaling with dimensionality, $D$. This is obtained by combining an additive kernel with an incomplete grid and exploiting the resulting structure of the kernel matrix. We demonstrate the scalability of the matrix-vector product by running benchmarks with billions of data points and thousands of dimensions. Full GPR calculations, including hyperparameter optimization, are completed in a matter of hours for $N = 447 265$ and $D = 24$. We demonstrate that our CUTS-GPR enables Bayesian modeling of high-dimensional potential energy surfaces - a longstanding challenge in computational chemistry.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.