DUET: Joint Exploration of User Item Profiles in Recommendation System
Yue Chen, Yifei Sun, Lu Wang, Fangkai Yang, Pu Zhao + 15 more
TLDR
Duet is a novel LLM-based recommendation system that jointly generates user and item textual profiles, outperforming baselines by aligning them for better recommendations.
Key contributions
- Jointly generates user and item textual profiles, addressing inconsistency issues in LLM-based recommenders.
- Uses a three-stage process: compact cue generation, paired profile prompting, and RL-based optimization.
- Optimizes profile generation policy with reinforcement learning, using recommendation performance as feedback.
- Outperforms strong baselines on three real-world datasets, demonstrating benefits of joint textual alignment.
Why it matters
This paper introduces a significant advancement in LLM-based recommendation systems by tackling the critical challenge of generating consistent and effective user and item textual profiles. By jointly creating these profiles and optimizing them with RL, Duet offers a more robust and interpretable approach than previous methods. This could lead to more accurate and trustworthy recommendations.
Original Abstract
Traditional recommendation systems represent users and items as dense vectors and learn to align them in a shared latent space for relevance estimation. Recent LLM-based recommenders instead leverage natural-language representations that are easier to interpret and integrate with downstream reasoning modules. This paper studies how to construct effective textual profiles for users and items, and how to align them for recommendation. A central difficulty is that the best profile format is not known a priori: manually designed templates can be brittle and misaligned with task objectives. Moreover, generating user and item profiles independently may produce descriptions that are individually plausible yet semantically inconsistent for a specific user--item pair. We propose Duet, an interaction-aware profile generator that jointly produces user and item profiles conditioned on both user history and item evidence. Duet follows a three-stage procedure: it first turns raw histories and metadata into compact cues, then expands these cues into paired profile prompts and then generate profiles, and finally optimizes the generation policy with reinforcement learning using downstream recommendation performance as feedback. Experiments on three real-world datasets show that Duet consistently outperforms strong baselines, demonstrating the benefits of template-free profile exploration and joint user-item textual alignment.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.