Directional Confusions Reveal Divergent Inductive Biases Through Rate-Distortion Geometry in Human and Machine Vision

April 23, 20262604.21909

Leyla Roksan Caglar, Pedro A. M. Mediano, Baihan Lin

cs.CVcs.ITq-bio.NC

TLDR

This paper uses directional confusions and Rate-Distortion geometry to reveal distinct inductive biases in human and machine vision.

Key contributions

Directional confusions reveal distinct inductive biases in human vs. machine vision, beyond accuracy.
Introduces a Rate-Distortion (RD) framework to quantify confusion matrix asymmetry and generalization.
Humans show broad, weak asymmetries; deep models exhibit sparser, stronger directional collapses.
Robustness training doesn't fully replicate human-like graded similarity in models.

Why it matters

This paper introduces novel metrics (directional confusions, RD geometry) to reveal and compare inductive biases in human and machine vision. It offers a deeper understanding of model generalization and errors, crucial for developing more robust and human-aligned AI systems.

Original Abstract

Humans and modern vision models can reach similar classification accuracy while making systematically different kinds of mistakes - differing not in how often they err, but in who gets mistaken for whom, and in which direction. We show that these directional confusions reveal distinct inductive biases that are invisible to accuracy alone. Using matched human and deep vision model responses on a natural-image categorization task under 12 perturbation types, we quantify asymmetry in confusion matrices and link it to generalization geometry through a Rate-Distortion (RD) framework, summarized by three geometric signatures (slope (beta), curvature (kappa)) and efficiency (AUC). We find that humans exhibit broad but weak asymmetries, whereas deep vision models show sparser, stronger directional collapses. Robustness training reduces global asymmetry but fails to recover the human-like breadth-strength profile of graded similarity. Mechanistic simulations further show that different asymmetry organizations shift the RD frontier in opposite directions, even when matched for performance. Together, these results position directional confusions and RD geometry as compact, interpretable signatures of inductive bias under distribution shift.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers