Ethics Testing: Proactive Identification of Generative AI System Harms

April 23, 20262604.22089

cs.SEcs.AI

TLDR

This paper introduces "ethics testing," a novel approach to systematically identify and mitigate software harms in content generated by Generative AI systems.

Key contributions

Introduces "ethics testing" to systematically identify software harms in Generative AI content.
Distinguishes ethics testing from fairness testing, focusing on unethical behaviors like IP violations.
Discusses challenges and demonstrates ethics testing through five practical case studies.

Why it matters

Generative AI systems are widely used but can produce harmful or unethical content. This paper addresses a critical gap by proposing "ethics testing," a systematic method to proactively identify such harms, ensuring safer and more responsible AI deployment.

Original Abstract

Generative Artificial Intelligence (GAI) systems that can automatically generate content in the form of source code or other contents (e.g., images) has seen increasing popularity due to the emergence of tools such as ChatGPT which rely on Large Language Models (LLMs). Misuse of the automatically generated content can incur serious consequences due to potential harms in the generated content. Despite the importance of ensuring the quality of automatically generated content, there is little to no approach that can systematically generate tests for identifying software harms in the content generated by these GAI systems. In this article, we introduce the novel concept of ethics testing which aims to systematically generate tests for identifying software harms. Different from existing testing methodologies (e.g., fairness testing that aims to identifying software discrimination), ethics testing aims to systematically detect software harms that could be induced due to unethical behavior (e.g., harmful behavior or behavior that violates intellectual property rights) in automatically generated content. We introduced the concept of ethics testing, discussed the challenges therewithin, and conducted five case studies to show how ethics testing can be performed for generative AI systems.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers