ArXiv TLDR

BRIDGE and TCH-Net: Heterogeneous Benchmark and Multi-Branch Baseline for Cross-Domain IoT Botnet Detection

🐦 Tweet
2604.11324

Ammar Bhilwarawala, Likhamba Rongmei, Harsh Sharma, Arya Jena, Kaushal Singh + 2 more

cs.CRcs.LGcs.NI

TLDR

BRIDGE is a new heterogeneous benchmark for IoT botnet detection, and TCH-Net is a multi-branch network for superior cross-domain generalization.

Key contributions

  • Introduced BRIDGE, the first formally specified heterogeneous multi-dataset benchmark for IoT botnet detection.
  • Unifies 5 diverse datasets using a 46-feature semantic canonical vocabulary for cross-domain evaluation.
  • Proposed TCH-Net, a novel multi-branch network with Cross-Branch Gated Attention Fusion for robust detection.
  • TCH-Net achieves state-of-the-art F1 (0.8296) and highest LODO F1, outperforming 12 baselines.

Why it matters

IoT botnet detection struggles with generalization across diverse environments. This paper introduces BRIDGE, a standardized benchmark, and TCH-Net, a high-performing model, to address this critical gap. This work shifts focus to robust cross-environment detection, significantly enhancing practical IoT security.

Original Abstract

IoT botnet detection has advanced, yet most published systems are validated on a single dataset and rarely generalise across environments. Heterogeneous feature spaces make multi-dataset training practically impossible without discarding semantic interpretability or introducing data integrity violations. No prior work has addressed both problems with a formally specified, reproducible methodology. This paper does. We introduce BRIDGE (Benchmark Reference for IoT Domain Generalisation Evaluation), the first formally specified heterogeneous multi-dataset benchmark for IoT intrusion detection, unifying CICIDS-2017, CIC-IoT-2023, Bot-IoT, Edge-IIoTset, and N-BaIoT through a 46-feature semantic canonical vocabulary grounded in CICFlowMeter nomenclature, with genuine-equivalence-only feature mapping, explicit zero-filling, and per-dataset coverage from 15% to 93%. A leave-one-dataset-out (LODO) protocol makes the generalisation gap precisely measurable: all five evaluated architectures achieve mean LODO F1 between 0.39 and 0.47, and we establish the first community generalisation baseline at mean LODO F1 = 0.5577, a result that shifts the agenda from single-benchmark optimisation toward cross-environment generalisation. We propose TCH-Net, a multi-branch network fusing a three-path Temporal branch (residual convolutional-BiGRU, stride-downsampled BiGRU, pre-LayerNorm Transformer), a provenance-conditioned Contextual branch, and a Statistical branch via Cross-Branch Gated Attention Fusion (CB-GAF) with learnable sigmoid gates for dynamic feature-wise mixing. Across five random seeds, TCH-Net achieves F1 = 0.8296 +/- 0.0028, AUC = 0.9380 +/- 0.0025, and MCC = 0.6972 +/- 0.0056, outperforming all twelve baselines (p < 0.05, Wilcoxon) and recording the highest LODO F1 overall. BRIDGE and the full pipeline are at https://github.com/Ammar-ss/TCH-Net.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.