ArXiv TLDR
← All categories

Information Retrieval

Papers on search engines, recommendation systems, and information extraction.

cs.IR · 379 papers

Expressiveness Limits of Autoregressive Semantic ID Generation in Generative Recommendation

Generative recommendation models' autoregressive ID generation limits expressiveness due to tree-structured decoding, which Latte mitigates for better performance.

2605.06331May 7, 2026Yupeng Hou, Haven Kim, Clark Mingxuan Ju +3

Addressing Labelled Data Scarcity: Taxonomy-Agnostic Annotation of PII Values in HTTP Traffic using LLMs

This paper introduces an LLM-based pipeline for taxonomy-agnostic PII annotation in HTTP traffic, addressing data scarcity and evolving privacy definitions.

2605.06305May 7, 2026Thomas Cory, Axel Küpper

OBLIQ-Bench: Exposing Overlooked Bottlenecks in Modern Retrievers with Latent and Implicit Queries

OBLIQ-Bench introduces a new benchmark for "oblique" queries, revealing that modern retrievers struggle to find documents with latent patterns, unlike LLMs.

2605.06235May 7, 2026Diane Tchuindjo, Devavrat Shah, Omar Khattab

Revisiting Uncertainty: On Evidential Learning for Partially Relevant Video Retrieval

Holmes introduces a hierarchical evidential learning framework to explicitly model and quantify uncertainty in partially relevant video retrieval, outperforming SOTA.

2605.06083May 7, 2026Jun Li, Peifeng Lai, Xuhang Lou +5

A Case-Driven Multi-Agent Framework for E-Commerce Search Relevance

A multi-agent framework automates e-commerce search relevance optimization by replacing human roles with specialized AI agents for case identification and resolution.

2605.05991May 7, 2026Global E-Commerce Search Relevance Team

Bridging Passive and Active: Enhancing Conversation Starter Recommendation via Active Expression Modeling

PA-Bridge enhances conversation starter recommendations by using active user expressions and an adversarial aligner to overcome feedback loop issues.

2605.05855May 7, 2026Yiqing Wu, Haoming Li, Guanyu Jiang +4

Unified Value Alignment for Generative Recommendation in Industrial Advertising

UniVA enhances generative recommendation for advertising by aligning commercial value signals across tokenization, decoding, and online serving.

2605.05803May 7, 2026Xinxun Zhang, Yuling Xiong, Jiale Zhou +13

Beyond Long Tail POIs: Transition-Centered Generalization for Human Mobility Prediction

RECAP improves human mobility prediction by addressing transition-level sparsity, reconstructing rare POI transitions for better generalization.

2605.05771May 7, 2026Dingyang Lyu, Zhengjia Xu, Jey Han Lau +1

Effective Knowledge Transfer for Multi-Task Recommendation Models

EKTM improves multi-task recommendation by transferring knowledge across CVR tasks, boosting conversion rates and platform effectiveness.

2605.05730May 7, 2026Guohao Cai, Jun Yuan, Zhenhua Dong

Text-Graph Synergy: A Bidirectional Verification and Completion Framework for RAG

TGS-RAG is a novel framework that uses bidirectional text-graph synergy to improve RAG by refining textual evidence and resurrecting pruned graph paths.

2605.05643May 7, 2026Jiarui Zhong, Hong Cai Chen

AgenticRAG: Agentic Retrieval for Enterprise Knowledge Bases

AgenticRAG improves enterprise RAG by using an LLM agent with tools for iterative retrieval and analysis, significantly boosting recall and factuality.

2605.05538May 7, 2026Susheel Suresh, Hazel Mak, Shangpo Chou +2

Open-SAT: LLM-Guided Query Embedding Refinement for Open-Vocabulary Object Retrieval in Satellite Imagery

Open-SAT improves open-vocabulary satellite image retrieval by using LLMs to refine query embeddings at inference time, achieving significant F1 score gains.

2605.05344May 6, 2026Md Adnan Arefeen, Biplob Debnath, Ravi K. Rajendran +2

Securing the Agent: Vendor-Neutral, Multitenant Enterprise Retrieval and Tool Use

This paper introduces a layered isolation architecture to secure multitenant enterprise RAG and agentic AI systems, preventing data leakage.

2605.05287May 6, 2026Francisco Javier Arceo, Varsha Prasad Narsing

Interests Burn-down Diffusion Process for Personalized Collaborative Filtering

A new "interests burn-down diffusion process" is proposed for collaborative filtering, better modeling user interest decay for recommendations.

2605.05165May 6, 2026Yifang Qin, Zhaobin Li, Arisa Watanabe +3

CapsID: Soft-Routed Variable-Length Semantic IDs for Generative Recommendation

CapsID introduces soft-routed, variable-length Semantic IDs for generative recommendation, significantly improving recall and efficiency over existing methods.

2605.05096May 6, 2026Wenzhuo Cheng, Menghang Gong, Qixin Guo +4

Empirical Study of Pop and Jazz Mix Ratios for Genre-Adaptive Chord Generation

This paper investigates optimal data mix ratios for fine-tuning a pop-trained chord generation model to jazz, balancing new genre acquisition with old genre retention.

2605.04998May 6, 2026Jinju Lee

TabEmbed: Benchmarking and Learning Generalist Embeddings for Tabular Understanding

TabEmbed introduces a generalist embedding model for tabular data, unifying classification and retrieval, alongside TabBench for evaluation.

2605.04962May 6, 2026Minjie Qiang, Mingming Zhang, Xiaoyi Bao +5

Storage Is Not Memory: A Retrieval-Centered Architecture for Agent Recall

True Memory introduces a retrieval-centered architecture for agent recall, achieving high accuracy by preserving verbatim events and outperforming existing systems.

2605.04897May 6, 2026Joshua Adler, Guy Zehavi

RecGPT-Mobile: On-Device Large Language Models for User Intent Understanding in Taobao Feed Recommendation

RecGPT-Mobile deploys lightweight LLMs directly on mobile devices to understand user intent in real-time, improving e-commerce recommendations.

2605.04726May 6, 2026Bin Zhang, Weipeng Huang, Dimin Wang +9

Rethinking Convolutional Networks for Attribute-Aware Sequential Recommendation

ConvRec introduces a convolution-based model for attribute-aware sequential recommendation, achieving efficiency and outperforming attention methods.

2605.04723May 6, 2026Shereen Elsayed, Ngoc Son Le, Ahmed Rashed +1
PreviousPage 4 of 19Next

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.