Triple Spectral Fusion for Sensor-based Human Activity Recognition
Ye Zhang, Longguang Wang, Qing Gao, Chaocan Xiang, Mohammed Bennamoun + 1 more
TLDR
This paper introduces a triple spectral fusion framework for sensor-based human activity recognition, using adaptive filtering in Fourier, graph Fourier, and wavelet domains.
Key contributions
- Introduces adaptive complementary filtering for noise suppression in IMU posture and motion data.
- Applies adaptive filtering in the graph Fourier domain to fuse heterogeneous IMU sensor nodes.
- Utilizes adaptive wavelet frequency selection to reduce context redundancy and shorten feature length.
- Demonstrates superior performance on ten benchmark human activity recognition datasets.
Why it matters
Current HAR struggles with fusing diverse data and maintaining long-term context. This framework's triple spectral fusion, via adaptive filtering in three domains, significantly improves multi-sensor fusion and context correlation for accurate HAR.
Original Abstract
The field of sensor-based human activity recognition (HAR) mainly uses posture, motion and context data of Inertial Measurement Units (IMUs) to identify daily activities. Despite the advancements in learning-based methods, it is challenging to perform information fusion from the temporal perspective due to the complexities in fusing heterogeneous sensor data and establishing long-term context correlations. This paper proposes a novel triple spectral fusion framework tailored for HAR. First, we develop an adaptive complementary filtering technique for noise suppression and organize each IMU's sensors into posture and motion modality nodes. Given that IMU nodes form a dynamic heterogeneous graph, we then apply adaptive filtering within the graph Fourier domain to merge both homogeneous and heterogeneous node information. Furthermore, an adaptive wavelet frequency selection approach is implemented to suppress context redundancy and shorten the length of features. This approach enhances both timestamp-based graph aggregation and the correlation of long-term contexts. Our framework uses adaptive filtering in the Fourier, graph Fourier, and wavelet domains, enabling effective multi-sensor fusion and context correlation. Extensive experiments on ten benchmark datasets demonstrate the superior performance of our framework. Project page: https://github.com/crocodilegogogo/TSF-TPAMI2026.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.