ArXiv TLDR

Search-based Robustness Testing of Laptop Refurbishing Robotic Software

🐦 Tweet
2605.07530

Erblin Isaku, Hassan Sartaj, Shaukat Ali, Malaika Din Hashmi, Francois Picard

cs.ROcs.SE

TLDR

PROBE is a search-based method for robustly testing object detection models in laptop refurbishing robots, significantly outperforming random search.

Key contributions

  • Introduces PROBE, a multi-objective search-based approach for robustness testing of object detection models.
  • Leverages NSGA-II to find minimal, localized perturbations that expose failures in robotic vision systems.
  • Demonstrates PROBE is 3-7x more effective than random search in finding failure-inducing perturbations.
  • Shows generated perturbations are smaller, transfer across models, and metamorphic relations provide further insights.

Why it matters

This paper addresses a critical challenge in robotic software: ensuring the robustness of object detection models used in laptop refurbishment. By preventing model failures, PROBE helps avoid damage to laptops, promoting reuse and supporting the circular economy. Its effectiveness significantly improves the reliability of these eco-friendly robotic systems.

Original Abstract

The Danish Technological Institute (DTI) focuses on transferring advanced technologies (including robots) to the industry and the public sector. One key application is laptop refurbishment using specialized robots, aimed at promoting reuse, reducing electronic waste, and supporting the European Circular Economy Action Plan. The software of such robots often includes features that use object detection models to detect objects for various purposes, such as identifying screws for laptop disassembly or detecting stickers to remove them. Ensuring the robustness of such models to small input variations remains a critical challenge, and addressing it is important to avoid potential damage to laptops during refurbishment. In this paper, we propose PROBE, a search-based robustness testing approach that leverages multi-objective optimization to identify minimal, localized perturbations that expose failures in object detection models used in the software of laptop refurbishing robots. PROBE employs NSGA-II to systematically explore the perturbation space, optimizing for failure induction considering both localization and confidence, and perturbation magnitude, while enabling the discovery of diverse failure cases. Results show that PROBE is 3$\times$ to 7$\times$ more effective than random search in generating failure-inducing perturbations, while requiring smaller perturbation magnitudes, and that the generated perturbations transfer across models. We further show that metamorphic relations provide additional insights into model robustness, enabling the assessment of stability even in non-failing cases.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.