Agentic AIs Are the Missing Paradigm for Out-of-Distribution Generalization in Foundation Models

May 7, 20262605.06522

Xin Wang, Haibo Chen, Wenxuan Liu, Wenwu Zhu

cs.LGcs.CV

TLDR

Agentic AIs are the missing paradigm for robust out-of-distribution generalization in foundation models, overcoming limitations of model-centric approaches.

Key contributions

Formalizes OOD for foundation models with partially observed multi-stage training.
Proves a "parameter coverage ceiling" limiting model-centric OOD solutions.
Defines agentic OOD systems by perception, strategy, action, and verification.
Shows agentic systems strictly extend OOD generalization beyond model limits.

Why it matters

Foundation models struggle with out-of-distribution generalization in real-world scenarios, a problem not solvable by current model-centric methods. This paper introduces agentic AIs as a complementary paradigm, fundamentally expanding the capabilities of FMs in complex, shifting environments.

Original Abstract

Foundation models (FMs) are increasingly deployed in open-world settings where distribution shift is the rule rather than the exception. The out-of-distribution (OOD) phenomena they face -- knowledge boundaries, capability ceilings, compositional shifts, and open-ended task variation -- differ in kind from the settings that have shaped prior OOD research, and are further complicated because the pretraining and post-training distributions of modern FMs are often only partially observed. Our position is that OOD for foundation models is a structurally distinct problem that cannot be solved within the prevailing model-centric paradigm, and that agentic systems constitute the missing paradigm required to address it. We defend this claim through four steps. First, we give a stage-aware formalization of OOD that accommodates partially observed multi-stage training distributions. Second, we prove a parameter coverage ceiling: there exist practically relevant inputs that no model-centric method (training-time or test-time) can handle within tolerance $\varepsilon$, for reasons intrinsic to parameter-based representation. Third, we characterize agentic OOD systems by four structural properties -- perception, strategy selection, external action, and closed-loop verification -- and show that they strictly extend the reachable set beyond the ceiling. Fourth, we respond to seven counterarguments, conceding two, and outline a research agenda. We do not claim that agentic methods subsume model-centric ones; we argue that the two are complementary, and that progress on FM-OOD requires explicit recognition of the agentic paradigm as a first-class research direction.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers