ArXiv TLDR
← All categories

Robotics

Research on robot control, manipulation, navigation, and human-robot interaction.

cs.RO · 524 papers

HarmoWAM: Harmonizing Generalizable and Precise Manipulation via Adaptive World Action Models

HarmoWAM unifies predictive and reactive control in robot manipulation, achieving both generalizable transit and precise interaction through adaptive expert coordination.

2605.10942May 11, 2026Qiuxuan Feng, Jiale Yu, Jiaming Liu +8

Variational Inference for Lévy Process-Driven SDEs via Neural Tilting

This paper introduces a neural exponential tilting framework for variational inference in Lévy-driven SDEs, addressing challenges in modeling extreme events.

2605.10934May 11, 2026Yaman Kindap, Manfred Opper, Benjamin Dupuis +2

PriorVLA: Prior-Preserving Adaptation for Vision-Language-Action Models

PriorVLA adapts Vision-Language-Action models by preserving broad pretrained priors, using a frozen expert and an adaptation expert for superior performance.

2605.10925May 11, 2026Xinyu Guo, Bin Xie, Wei Chai +4

RoboMemArena: A Comprehensive and Challenging Robotic Memory Benchmark

RoboMemArena is a new, large-scale robotic memory benchmark with 26 tasks, real-world evaluation, and VLM-generated annotations, alongside the PrediMem VLA.

2605.10921May 11, 2026Huashuo Lei, Wenxuan Song, Huarui Zhang +10

MDrive: Benchmarking Closed-Loop Cooperative Driving for End-to-End Multi-agent Systems

MDrive is a new closed-loop cooperative driving benchmark with 225 diverse scenarios, revealing challenges and benefits of multi-agent systems.

2605.10904May 11, 2026Marco Coscoy, Zewei Zhou, Seth Z. Zhao +9

CapVector: Learning Transferable Capability Vectors in Parametric Space for Vision-Language-Action Models

CapVector learns transferable capability vectors in parametric space for VLA models, enhancing performance and reducing adaptation costs during finetuning.

2605.10903May 11, 2026Wenxuan Song, Han Zhao, Fuhao Li +7

Safe Aerial 3D Path Planning for Autonomous UAVs using Magnetic Potential Fields

This paper introduces 3DMaxConvNet, a magnetic potential field planner for safe, real-time 3D UAV navigation in urban environments.

2605.10880May 11, 2026Haechan Mark Bong, Giovanni Beltrame

Is Your Driving World Model an All-Around Player?

WorldLens is a new benchmark, dataset, and agent for evaluating driving world models beyond visual realism, focusing on physical and behavioral fidelity.

2605.10858May 11, 2026Lingdong Kong, Ao Liang, Tianyi Yan +20

Unified Noise Steering for Efficient Human-Guided VLA Adaptation

UniSteer efficiently adapts VLA models for robotics by unifying human action-space guidance with noise-space RL, boosting success rates quickly.

2605.10821May 11, 2026Junjie Lu, Xinyao Qin, Yuhua Jiang +6

ALAM: Algebraically Consistent Latent Transitions for Vision-Language-Action Models

ALAM learns algebraically consistent latent transitions from action-free videos, significantly boosting VLA policy performance on complex robot manipulation tasks.

2605.10819May 11, 2026Zuojin Tang, Haoyun Liu, Xinyuan Chang +11

MAGS-SLAM: Monocular Multi-Agent Gaussian Splatting SLAM for Geometrically and Photometrically Consistent Reconstruction

MAGS-SLAM is the first RGB-only multi-agent 3D Gaussian Splatting SLAM for collaborative, photorealistic 3D reconstruction without depth sensors.

2605.10760May 11, 2026Zhihao Cao, Qi Shao, Shuhao Zhai +3

C-CoT: Counterfactual Chain-of-Thought with Vision-Language Models for Safe Autonomous Driving

C-CoT uses VLMs and counterfactual chain-of-thought to improve safe autonomous driving decisions, especially in complex, high-risk scenarios.

2605.10744May 11, 2026Kefei Tian, Yuansheng Lian, Kai Yang +2

Decentralized Contingency MPC based on Safe Sets for Nonlinear Multi-agent Collision Avoidance

This paper introduces a decentralized contingency MPC for nonlinear multi-agent collision avoidance, ensuring safety and feasibility without inter-agent communication.

2605.10738May 11, 2026Max Studt, Georg Schildbach

ObjView-Bench: Rethinking Difficulty and Deployment for Object-Centric View Planning

ObjView-Bench is a new framework for evaluating object-centric view planning, disentangling difficulty factors and considering real-world deployment constraints.

2605.10707May 11, 2026Sicong Pan, Hao Hu, Xuying Huang +2

xApp Empowered Resource Management for Non-Terrestrial Users in 5G O-RAN Networks

This paper proposes a DDQN xApp for proactive UAV mobility management in 5G O-RAN, reducing handovers and outages.

2605.10704May 11, 2026Mohammed M. H. Qazzaz, Syed Ali Zaidi, Aubida A. Al-Hameed +2

Embodied AI in Action: Insights from SAE World Congress 2026 on Safety, Trust, Robotics, and Real-World Deployment

This paper summarizes key insights from the SAE World Congress 2026 on safely and trustworthily deploying embodied AI in real-world systems.

2605.10653May 11, 2026Jan-Mou Li, Paul Schmitt, Wei Tong +4

DeepSight: Long-Horizon World Modeling via Latent States Prediction for End-to-End Autonomous Driving

DeepSight improves end-to-end autonomous driving with a world model predicting long-horizon latent states and adaptive text reasoning.

2605.10564May 11, 2026Lingjun Zhang, Changjie Wu, Linzhe Shi +6

VEGA: Visual Encoder Grounding Alignment for Spatially-Aware Vision-Language-Action Models

VEGA enhances VLA models' spatial reasoning by directly aligning their visual encoder outputs with 3D-aware features, improving robotic manipulation.

2605.10485May 11, 2026Hao Wang, Xiaobao Wei, Jingyang He +10

VISOR: A Vision-Language Model-based Test Oracle for Testing Robot

VISOR is a VLM-based test oracle that automates robot task assessment, replacing manual evaluation and quantifying task correctness and quality.

2605.10408May 11, 2026Prasun Saurabh, Pablo Valle, Aitor Arrieta +2

Neuromorphic Reinforcement Learning for Quadruped Locomotion Control on Uneven Terrain

This paper introduces a neuromorphic reinforcement learning framework using equilibrium propagation for quadruped locomotion on uneven terrain, enabling on-robot adaptation.

2605.09595May 10, 2026Zhuangyu Han, Abhronil Sengupta
PreviousPage 3 of 27Next

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.