ArXiv TLDR

Vision as looking and seeing through a bottleneck

🐦 Tweet
2604.23030

Li Zhaoping

q-bio.NC

TLDR

This paper proposes vision operates as a bottleneck, where peripheral "looking" selects input for foveal "seeing," guided by V1 and top-down feedback.

Key contributions

  • Vision is a bottleneck process where only a small fraction of retinal input is recognized.
  • "Looking" (peripheral) selects visual info via gaze shifts; "Seeing" (central) recognizes it.
  • V1 initiates the bottleneck and guides saccades via a bottom-up saliency map.
  • Top-down feedback refines "seeing" by targeting central visual field representations.

Why it matters

This paper re-frames vision as a bottleneck process, highlighting the overlooked constraint that only a tiny fraction of retinal input is recognized. It offers a new framework for understanding how "looking" and "seeing" interact, driven by V1 and top-down mechanisms. This perspective could accelerate progress in vision research by guiding new theories and experimental designs.

Original Abstract

Progress in vision research has been slower downstream than upstream of primary visual cortex (V1). Traditional frameworks have largely overlooked a central constraint: only a tiny fraction of retinal input is recognized. Thus, to a first approximation, vision is better formulated as looking and seeing through a bottleneck. Looking, mainly by the peripheral visual field, selects visual information to enter this bottleneck, largely via gaze shifts that center selected contents at fovea. Seeing, mainly by the central visual field, recognizes this content. Converging evidence suggests that V1 initiates the bottleneck and contributes to looking by generating a bottom-up saliency map that guides saccades exogenously, and that top-down feedback along the visual pathway, targeting mainly the representation of the central visual field, refines seeing. Progress will accelerate through falsifiable theories that explicitly link behavior with neural substrates, and by experimental designs that avoid forced fixation and precisely track gaze.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.