ArXiv TLDR

TimeMM: Time-as-Operator Spectral Filtering for Dynamic Multimodal Recommendation

🐦 Tweet
2604.26247

Wei Yang, Rui Zhong, Zihan Lin, Xiaodan Wang, Cheng Chen + 2 more

cs.IRcs.AI

TLDR

TimeMM uses time-as-operator spectral filtering for dynamic multimodal recommendation, adapting to evolving user interests and modality dynamics over time.

Key contributions

  • Instantiates "Time-as-Operator" by mapping interaction recency to parametric temporal kernels.
  • Employs Adaptive Spectral Filtering to mix operator banks for non-stationary preference dynamics.
  • Introduces Spectral-Aware Modality Routing to calibrate visual and textual contributions temporally.
  • Uses Spectral Diversity Regularization to encourage complementary expert behaviors and prevent collapse.

Why it matters

Existing multimodal recommenders struggle with evolving user interests and modality-specific temporal dynamics. TimeMM provides fine-grained temporal adaptation, improving recommendation accuracy and maintaining linear scalability. This advances dynamic user modeling in complex multimodal settings.

Original Abstract

Multimodal recommendation improves user modeling by integrating collaborative signals with heterogeneous item content. In real applications, user interests evolve over time and exhibit nonstationary dynamics, where different preference factors change at different rates. This challenge is amplified in multimodal settings because visual and textual cues can dominate decisions under different temporal regimes. Despite strong progress, most multimodal recommenders still rely on static interaction graphs or coarse temporal heuristics, which limits their ability to model continuous preference evolution with fine-grained temporal adaptation. To address these limitations, we propose TimeMM, a time-conditioned spectral filtering framework for dynamic multimodal recommendation. TimeMM instantiates Time-as-Operator by mapping interaction recency to a family of parametric temporal kernels that reweight edges on the user--item graph, producing component-specific representations without explicit eigendecomposition. To capture non-stationary interests, we introduce Adaptive Spectral Filtering that mixes the operator bank according to temporal context, yielding prediction-specific effective spectral responses. To account for modality-specific temporal sensitivity, we further propose Spectral-Aware Modality Routing that calibrates visual and textual contributions conditioned on the same temporal context. Finally, a ranking-space Spectral Diversity Regularization encourages complementary expert behaviors and prevents filter-bank collapse. Extensive experiments on real-world benchmarks demonstrate that TimeMM consistently outperforms state-of-the-art multimodal recommenders while maintaining linear-time scalability.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.