CLAD: A Clustered Label-Agnostic Federated Learning Framework for Joint Anomaly Detection and Attack Classification
Iason Ofeidis, Nikos Papadis, Randeep Bhatia, Leandros Tassiulas, TV Lakshman
TLDR
CLAD is a federated learning framework for IoT security, combining clustered FL and a dual-mode architecture for anomaly detection and attack classification.
Key contributions
- Addresses IoT device heterogeneity and label scarcity in federated learning.
- Introduces a Dual-Mode Micro-Architecture (DM2A) for joint unsupervised anomaly detection and supervised attack classification.
- Utilizes clustered federated learning to group devices with similar traffic patterns, preventing model divergence.
- Achieves 30% relative improvement in detection performance with 80% unlabeled clients and half the communication cost.
Why it matters
This paper offers a robust solution for securing the rapidly expanding IoT/IIoT landscape. By effectively handling diverse device behaviors and leveraging unlabeled data, CLAD significantly enhances intrusion detection capabilities in real-world edge environments, making FL more practical for IoT security.
Original Abstract
The rapid expansion of the Internet of Things (IoT) and Industrial IoT (IIoT) has created a massive, heterogeneous attack surface that challenges traditional network security mechanisms. While Federated Learning (FL) offers a privacy-preserving alternative to centralized Intrusion Detection Systems (IDS), standard approaches struggle to generalize across diverse device behaviors and typically fail to utilize the vast amounts of unlabeled data present in realistic edge environments. To bridge these gaps, we propose CLAD, a holistic framework that seamlessly incorporates Clustered Federated Learning (CFL) with a novel Dual-Mode Micro-Architecture ($\text{DM}^2\text{A}$). This unified approach simultaneously tackles the two primary bottlenecks of IoT security: device heterogeneity and label scarcity. The $\text{DM}^2\text{A}$ component features a shared encoder followed by two branches, enabling joint unsupervised anomaly detection and supervised attack classification; this allows the framework to harvest intelligence from both labeled and unlabeled clients. Concurrently, the clustering component dynamically groups devices with congruent traffic patterns, preventing global model divergence. By carefully combining these elements, CLAD ensures that no data is discarded and distinct operational patterns are preserved. Extensive evaluations demonstrate that this integrated approach significantly outperforms state-of-the-art baselines, achieving a 30% relative improvement in detection performance in scenarios with 80% unlabeled clients, with only half the communication cost.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.