Are You the A-hole? A Fair, Multi-Perspective Ethical Reasoning Framework
Sheza Munir, Ahanaf Rodoshi, Sumin Lee, Feiran Chang, Xujie Si + 1 more
TLDR
A neuro-symbolic framework leverages MaxSAT and LMs to aggregate conflicting ethical judgments, ensuring logical consistency and explainability.
Key contributions
- Proposes a neuro-symbolic framework for aggregating conflicting ethical judgments.
- Utilizes a language model to map natural language explanations into logical predicates.
- Encodes predicates as soft constraints for MaxSAT optimization using a Z3 solver.
- Achieves 86% agreement with independent human evaluators on ethical dilemmas.
Why it matters
This paper solves the problem of aggregating conflicting human judgments in high-conflict ethical domains, where majority voting fails. Its neuro-symbolic framework ensures logical consistency and explainability, vital for trustworthy AI. This enables robust and fair decision-making in complex moral scenarios.
Original Abstract
Standard methods for aggregating natural language judgments, such as majority voting, often fail to produce logically consistent results when applied to high-conflict domains, treating differing opinions as noise. We propose a neuro-symbolic aggregation framework that formalizes conflict resolution through Weighted Maximum Satisfiability (MaxSAT). Our pipeline utilizes a language model to map unstructured natural language explanations into interpretable logical predicates and confidence weights. These components are then encoded as soft constraints within the Z3 solver, transforming the aggregation problem into an optimization task that seeks the maximum consistency across conflicting testimony. Using the Reddit r/AmItheAsshole forum as a case study in large-scale moral disagreement, our system generates logically coherent verdicts that diverge from popularity-based labels 62% of the time, corroborated by an 86% agreement rate with independent human evaluators. This study demonstrates the efficacy of coupling neural semantic extraction with formal solvers to enforce logical soundness and explainability in the aggregation of noisy human reasoning.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.