The illusory simplicity of the feedforward pass: evidence for the dynamical nature of stimulus encoding along the primate ventral stream
Daniel Anthes, Sushrut Thorat, Anna Mitola, Paolo Papale, Peter König + 1 more
TLDR
Early primate vision isn't a simple feedforward sweep; it's a dynamic spatiotemporal process encoding information in neural pattern changes.
Key contributions
- Challenges the prevailing view of a simple, stage-like feedforward pass in primate vision.
- Shows V4-IT exchange varied information within 100ms using time-resolved multivariate analysis.
- RNN decoding reveals neural pattern dynamics encode categorical information beyond static spatial patterns.
Why it matters
This paper redefines our understanding of early visual processing, moving beyond static spatial encoding to emphasize dynamic spatiotemporal information. It suggests that even the initial stages of vision are more complex and interactive than previously thought, opening new avenues for research into neural computation.
Original Abstract
In studying primate vision, a large body of work focuses on the first feedforward sweep. During this initial time window, information is thought to pass through ventral stream regions in a stage-like fashion in an effort to extract high-level information from the retinal input. Consequently, electrophysiological analyses commonly focus on spatial response patterns, either by averaging data in time, or by applying decoders in a temporally local fashion. By analysing data recorded simultaneously across multiple arrays placed along the macaque ventral stream, we here show that this prior approach may be missing key aspects of information encoding. First, time-resolved, multivariate analyses of information transfer between V4 and IT reveal temporally and semantically varied information content as being exchanged within the first 100ms of processing. Second, by employing recurrent neural network (RNN) decoding techniques that extend across the temporal domain, we demonstrate that the neural pattern dynamics themselves carry categorical information far beyond the spatially encoded information available at any given time point. These findings challenge the prevailing view of a single, stage-like feedforward process and suggest that even the earliest parts of visual processing are better characterised as a spatiotemporally evolving process that encodes information in its dynamics rather than purely spatial response patterns.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.