ArXiv TLDR

Channel-Free Human Activity Recognition via Inductive-Bias-Aware Fusion Design for Heterogeneous IoT Sensor Environments

🐦 Tweet
2604.21369

Tatsuhito Hasegawa

cs.LGcs.HC

TLDR

This paper proposes a channel-free HAR framework for heterogeneous IoT sensor environments, using inductive-bias-aware fusion to handle varying sensor configurations.

Key contributions

  • Processes each sensor channel independently, adapting to varying numbers, orders, and semantic arrangements.
  • Leverages sensor metadata (body location, modality) to recover structural information for robust fusion.
  • Employs metadata-conditioned late fusion via conditional batch normalization for flexible integration.
  • Jointly optimizes channel-level and fused predictions using a combination loss for improved discriminability.

Why it matters

This work addresses a critical challenge in IoT HAR: adapting models to diverse sensor setups without retraining. By enabling channel-free operation, it significantly improves model reusability and robustness across heterogeneous environments, reducing deployment complexity.

Original Abstract

Human activity recognition (HAR) in Internet of Things (IoT) environments must cope with heterogeneous sensor settings that vary across datasets, devices, body locations, sensing modalities, and channel compositions. This heterogeneity makes conventional channel-fixed models difficult to reuse across sensing environments because their input representations are tightly coupled to predefined channel structures. To address this problem, we investigate strict channel-free HAR, in which a single shared model performs inference without assuming a fixed number, order, or semantic arrangement of input channels, and without relying on sensor-specific input layers or dataset-specific channel templates. We argue that fusion design is the central issue in this setting. Accordingly, we propose a channel-free HAR framework that combines channel-wise encoding with a shared encoder, metadata-conditioned late fusion via conditional batch normalization, and joint optimization of channel-level and fused predictions through a combination loss. The proposed model processes each channel independently to handle varying channel configurations, while sensor metadata such as body location, modality, and axis help recover structural information that channel-independent processing alone cannot retain. In addition, the joint loss encourages both the discriminability of individual channels and the consistency of the final fused prediction. Experiments on PAMAP2, together with robustness analysis on six HAR datasets, ablation studies, sensitivity analysis, efficiency evaluation, and cross-dataset transfer learning, demonstrate three main findings...

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.