Agentic AIs Are the Missing Paradigm for Out-of-Distribution Generalization in Foundation Models
Xin Wang, Haibo Chen, Wenxuan Liu, Wenwu Zhu
TLDR
Agentic AIs are the missing paradigm for robust out-of-distribution generalization in foundation models, overcoming limitations of model-centric approaches.
Key contributions
- Formalizes OOD for foundation models with partially observed multi-stage training.
- Proves a "parameter coverage ceiling" limiting model-centric OOD solutions.
- Defines agentic OOD systems by perception, strategy, action, and verification.
- Shows agentic systems strictly extend OOD generalization beyond model limits.
Why it matters
Foundation models struggle with out-of-distribution generalization in real-world scenarios, a problem not solvable by current model-centric methods. This paper introduces agentic AIs as a complementary paradigm, fundamentally expanding the capabilities of FMs in complex, shifting environments.
Original Abstract
Foundation models (FMs) are increasingly deployed in open-world settings where distribution shift is the rule rather than the exception. The out-of-distribution (OOD) phenomena they face -- knowledge boundaries, capability ceilings, compositional shifts, and open-ended task variation -- differ in kind from the settings that have shaped prior OOD research, and are further complicated because the pretraining and post-training distributions of modern FMs are often only partially observed. Our position is that OOD for foundation models is a structurally distinct problem that cannot be solved within the prevailing model-centric paradigm, and that agentic systems constitute the missing paradigm required to address it. We defend this claim through four steps. First, we give a stage-aware formalization of OOD that accommodates partially observed multi-stage training distributions. Second, we prove a parameter coverage ceiling: there exist practically relevant inputs that no model-centric method (training-time or test-time) can handle within tolerance $\varepsilon$, for reasons intrinsic to parameter-based representation. Third, we characterize agentic OOD systems by four structural properties -- perception, strategy selection, external action, and closed-loop verification -- and show that they strictly extend the reachable set beyond the ceiling. Fourth, we respond to seven counterarguments, conceding two, and outline a research agenda. We do not claim that agentic methods subsume model-centric ones; we argue that the two are complementary, and that progress on FM-OOD requires explicit recognition of the agentic paradigm as a first-class research direction.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.