Self-Adaptive Multi-Agent LLM-Based Security Pattern Selection for IoT Systems
Saeid Jamshidi, Foutse Khomh, Carol Fung, Kawser Wazed Nafi
TLDR
ASPO uses multi-agent LLMs and deterministic optimization for self-adaptive, resource-efficient security pattern selection in IoT edge systems.
Key contributions
- Introduces ASPO, a self-adaptive multi-agent system for IoT security pattern selection using LLMs and deterministic enforcement.
- Separates LLM-based stochastic decision generation from deterministic execution for conflict-free and resource-feasible mitigations.
- Achieves 100% conflict-free activation and consistent resource feasibility in edge IoT security decisions.
- Reduces extreme-case tail latency by 21.9% and energy overheads by 23.1% without increasing mean energy.
Why it matters
This paper addresses the critical need for adaptive, resource-efficient security in IoT edge systems. ASPO combines LLM-based reasoning with deterministic enforcement, ensuring conflict-free and resource-feasible security actions. It significantly reduces extreme-case latency and energy overheads, vital for practical IoT deployments.
Original Abstract
The adoption of Internet of Things (IoT) systems at the network edge of smart architectures is increasing rapidly, intensifying the need for security mechanisms that are both adaptive and resource-efficient. In such environments, runtime defence mechanisms are no longer limited to detection alone but become a resource-constrained task of selecting mitigation actions. Security controls must be carefully selected, combined, and executed under latency, energy, and computational constraints, while preventing unsafe interactions between controls. Existing approaches predominantly rely on static rule sets and learned policies, which provide limited guarantees of feasibility, conflict safety, and execution correctness in resource-constrained edge settings. To address this limitation, we introduce ASPO, a self-adaptive multi-agent security pattern selection that integrates Large Language Model (LLM)-based reasoning with deterministic enforcement within a MAPE-K control loop. ASPO explicitly separates stochastic decision generation from execution: LLM agents propose candidate mitigation portfolios, while a deterministic optimisation core enforces closed-world action integrity, conflict-free composition, and resource feasibility at every decision epoch. We deploy ASPO on a distributed edge-gateway testbed and evaluate it across two workloads, each comprising 500 and 1000 runtime security decisions, using replayed IoT attack traffic. In addition, the results demonstrate invariant safety properties, including 100% conflict-free activation, consistent resource feasibility across workloads, and stable pattern dominance with perfect rank preservation. Importantly, deeper decision exploration reduces extreme-case execution costs, compressing tail latency and energy overheads by 21.9% and 23.1%, respectively, without increasing mean energy consumption.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.