ArXiv TLDR

Task-Aware Scanning Parameter Configuration for Robotic Inspection Using Vision Language Embeddings and Hyperdimensional Computing

🐦 Tweet
2605.03909

Zhiling Chen, David Gorsich, Matthew P. Castanier, Yang Zhang, Jiong Tang + 1 more

cs.ROcs.CV

TLDR

This paper introduces ScanHD, a hyperdimensional computing framework that autonomously configures robotic laser profilers using vision-language embeddings.

Key contributions

  • Developed Instruct-Obs2Param, a multimodal dataset for instruction-conditioned sensor configuration.
  • Introduced ScanHD, a hyperdimensional computing framework for autonomous parameter recommendation.
  • ScanHD binds vision-language embeddings for task-aware sensing parameter inference.
  • Achieved 92.7% accuracy with ScanHD, outperforming baselines and enabling adaptive robot inspection.

Why it matters

Manual sensor tuning in robotic inspection is inefficient and error-prone. This paper introduces ScanHD, an autonomous hyperdimensional computing framework that adaptively configures sensors based on task instructions, eliminating manual tuning and boosting measurement fidelity.

Original Abstract

Robotic laser profiling is widely used for dimensional verification and surface inspection, yet measurement fidelity is often dominated by sensor configuration rather than robot motion. Industrial profilers expose multiple coupled parameters, including sampling frequency, measurement range, exposure time, receiver dynamic range, and illumination, that are still tuned by trial-and-error; mismatches can cause saturation, clipping, or missing returns that cannot be recovered downstream. We formulate instruction-conditioned sensing parameter recommendation; given a pre-scan RGB observation and a natural-language inspection instruction, infer a discrete configuration over key parameters of a robot-mounted profiler. To benchmark this problem, we develop Instruct-Obs2Param, a real-world multimodal dataset linking inspection intents and multi-view pose and illumination variation across 16 objects to canonical parameter regimes. We then propose ScanHD, a hyperdimensional computing framework that binds instruction and observation into a task-aware code and performs parameter-wise associative reasoning with compact memories, matching discrete scanner regimes while yielding stable, interpretable, low-latency decisions. On Instruct-Obs2Param, ScanHD achieves 92.7% average exact accuracy and 98.1% average Win@1 accuracy across the five parameters, with strong cross-split generalization and low-latency inference suitable for deployment, outperforming rule-based heuristics, conventional multimodal models, and multimodal large language models. This work enables autonomous, instruction-conditioned sensing configuration from task intent and scene context, eliminating manual tuning and elevating sensor configuration from a static setting to an adaptive decision variable.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.