Sebastian Elbaum
2 papers ยท Latest:
Robotics
SafeManip: A Property-Driven Benchmark for Temporal Safety Evaluation in Robotic Manipulation
SafeManip is a new benchmark using LTLf to evaluate temporal safety in robotic manipulation, revealing strong models often behave unsafely.
2605.12386
Software EngineeringTRACE: Topology-aware Reconstruction of Accidents in CARLA for AV Evaluation
TRACE reconstructs real-world accident reports into high-fidelity CARLA simulations, enabling robust AV evaluation against rare, safety-critical scenarios.
2604.22068
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.