ArXiv TLDR

Universality of first-order methods on random and deterministic matrices

🐦 Tweet
2604.11729

Nicola Gorini, Chris Jones, Dmitriy Kunisky, Lucas Pesenti

math.PRcs.DScs.LGmath.ST

TLDR

This paper analyzes first-order method dynamics on both random and deterministic matrices, introducing a new AMP iteration and calculating traffic distributions.

Key contributions

  • Calculates traffic distributions for deterministic matrices (Walsh-Hadamard, DCT), determining GFOM dynamics.
  • Resolves parts of long-standing conjectures on GFOM dynamics for structured inputs.
  • Introduces a new AMP iteration unifying prior variants and generalizing to diverse input types.
  • Provides a simple combinatorial interpretation of the Onsager correction in AMP algorithms.

Why it matters

This work provides a unified framework to understand first-order methods on both random and deterministic matrices, addressing a long-standing gap. It introduces a novel AMP algorithm with broad applicability and offers new insights into dynamics.

Original Abstract

General first-order methods (GFOM) are a flexible class of iterative algorithms which update a state vector by matrix-vector multiplications and entrywise nonlinearities. A long line of work has sought to understand the large-n dynamics of GFOM, mostly focusing on "very random" input matrices and the approximate message passing (AMP) special case of GFOM whose state is asymptotically Gaussian. Yet, it has long remained unknown how to construct iterative algorithms that retain this Gaussianity for more structured inputs, or why existing AMP algorithms can be as effective for some deterministic matrices as they are for random matrices. We analyze diagrammatic expansions of GFOM via the limiting traffic distribution of the input matrix, the collection of all limiting values of permutation-invariant polynomials in the matrix entries, to obtain the following results: 1. We calculate the traffic distribution for the first non-trivial deterministic matrices, including (minor variants of) the Walsh-Hadamard and discrete sine and cosine transform matrices. This determines the limiting dynamics of GFOM on these inputs, resolving parts of longstanding conjectures of Marinari, Parisi, and Ritort (1994). 2. We design a new AMP iteration which unifies several previous AMP variants and generalizes to new input types, whose limiting dynamics are Gaussian conditional on some latent random variables. The asymptotic dynamics hold for a large and natural class of traffic distributions (encompassing both random and deterministic input matrices) and the algorithm's analysis gives a simple combinatorial interpretation of the Onsager correction, answering questions posed recently by Wang, Zhong, and Fan (2022).

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.