Information Retrieval

Papers on search engines, recommendation systems, and information extraction.

cs.IR · 379 papers

ReCoVR: Closing the Loop in Interactive Composed Video Retrieval

ReCoVR introduces a dual-pathway architecture for interactive composed video retrieval, using reflexive perception to refine search with user feedback and retrieval history.

2605.09836 · May 11, 2026 · Bingqing Zhang, Yi Zhang, Zhuo Cao +4

LLM Agents Enable User-Governed Personalization Beyond Platform Boundaries

LLM agents empower users to integrate and govern their personal data across platforms, moving beyond fragmented, platform-centric personalization.

2605.09794 · May 10, 2026 · Jiacheng Lin, Kun Qian, Arvind Srinivasan +15

FAVOR: Efficient Filter-Agnostic Vector ANNS Based on Selectivity-Aware Exclusion Distances

FAVOR is a new ANNS method that efficiently integrates complex attribute filtering, achieving stable high throughput across varying selectivity levels.

2605.07770 · May 8, 2026 · Junjie Song, Yu Liu, Guoyu Hu +4

TRACE: Tourism Recommendation with Accountable Citation Evidence

TRACE introduces a new dataset and benchmark for conversational tourism recommender systems, focusing on verifiable evidence and rejection recovery.

2605.07677 · May 8, 2026 · Zixu Zhao, Sijin Wang, Yu Hou +6

LARAG: Link-Aware Retrieval Strategy for RAG Systems in Hyperlinked Technical Documentation

LARAG improves RAG systems by leveraging hyperlink structures in technical documentation for more efficient and accurate content retrieval.

2605.07517 · May 8, 2026 · Giorgia Bolognesi, Claudio Estatico, Ulderico Fugacci +3

InterLV-Search: Benchmarking Interleaved Multimodal Agentic Search

InterLV-Search is a new benchmark for interleaved language-vision agentic search, revealing that current multimodal agents struggle to integrate complex visual evidence.

2605.07510 · May 8, 2026 · Bohan Hou, Jiuning Gu, Jiayan Guo +5

TCMIIES: A Browser-Based LLM-Powered Intelligent Information Extraction System for Academic Literature

TCMIIES is a browser-based, zero-installation system leveraging commercial LLMs for privacy-preserving, schema-guided information extraction from academic literature.

2605.07507 · May 8, 2026 · Hanqing Zhao

A Comprehensive Survey on Agent Skills: Taxonomy, Techniques, and Applications

This survey comprehensively reviews agent skills for LLM-based agents, detailing their lifecycle, techniques, and applications to enhance scalability and robustness.

2605.07358 · May 8, 2026 · Yingli Zhou, Wang Shu, Yaodong Su +3

DCGL: Dual-Channel Graph Learning with Large Language Models for Knowledge-Aware Recommendation

DCGL uses dual-channel graph learning with LLMs for knowledge-aware recommendation, improving performance by decoupling semantics and behavior.

2605.07314 · May 8, 2026 · Xinchi Zou, Tongzhenzhi Su, Jianjun Li +4

PRISM: Refracting the Entangled User Behavior Space for E-Commerce Search

PRISM disentangles user preference and item relevance in e-commerce search by explicitly modeling their interaction, improving robustness and semantic consistency.

2605.07296 · May 8, 2026 · Haoqian Zhang, Ziyuan Yang, Yi Zhang

MLAIRE: Multilingual Language-Aware Information Retrieval Evaluation Protocol

MLAIRE introduces a new protocol and metrics to evaluate multilingual information retrieval, focusing on both semantic relevance and user language preference.

2605.07249 · May 8, 2026 · Youngjoon Jang, Seongtae Hong, Hyeonseok Moon +1

DiffRetriever: Parallel Representative Tokens for Retrieval with Diffusion Language Models

DiffRetriever uses diffusion language models to generate multiple representative tokens in parallel, significantly improving retrieval performance over sequential autoregressive methods.

2605.07210 · May 8, 2026 · Shuai Wang, Yin Yu, Shengyao Zhuang +2

Topic Is Not Agenda: A Citation-Community Audit of Text Embeddings

Text embeddings fail to capture fine-grained research agendas, leading to 80% off-agenda retrievals in scientific RAG.

2605.07158 · May 8, 2026 · Junseon Yoo

RRCM: Ranking-Driven Retrieval over Collaborative and Meta Memories for LLM Recommendation

RRCM is a ranking-driven retrieval framework for LLM recommenders that dynamically selects collaborative and metadata evidence to improve recommendation quality.

2605.07129 · May 8, 2026 · Shijun Li, Wooseong Yang, Yu Wang +2

An Embarrassingly Simple Graph Heuristic Reveals Shortcut-Solvable Benchmarks for Sequential Recommendation

A simple graph heuristic outperforms complex generative models on many sequential recommendation benchmarks, revealing them to be "shortcut-solvable."

2605.07125 · May 8, 2026 · Haoyu Han, Li Ma, Hanbing Wang +9

Bridging Textual Profiles and Latent User Embeddings for Personalization

BLUE unifies interpretable textual user profiles with discriminative latent embeddings using reinforcement learning for personalized recommendations.

2605.06981 · May 7, 2026 · Zhaoxuan Tan, Xiang Zhai, Yan Zhu +2

From Surface Learning to Deep Understanding: A Grounded AI Tutoring System for Moodle

A Moodle plugin uses RAG and LLMs for Socratic tutoring and educator content generation, ensuring high-quality, hallucination-free education.

2605.06963 · May 7, 2026 · Anna Ostrowska, Michał Kukla, Gabriela Majstrak +4

Superintelligent Retrieval Agent: The Next Frontier of Information Retrieval

SIRA introduces a superintelligent retrieval agent that uses LLM-guided lexical queries and corpus statistics to achieve superior, efficient, single-round information retrieval.

2605.06647 · May 7, 2026 · Zeyu Yang, Qi Ma, Jason Chen +1

Light-FMP: Lightweight Feature and Model Pruning for Enhanced Deep Recommender Systems

Light-FMP is a lightweight framework for deep recommender systems that prunes features and models to enhance both computational efficiency and accuracy.

2605.06441 · May 7, 2026 · Nghia Bui, Yue Ning, Lijing Wang

GATHER: Convergence-Centric Hyper-Entity Retrieval for Zero-Shot Cell-Type Annotation

GATHER is a convergence-centric hyper-entity retriever for zero-shot cell-type annotation, efficiently identifying topological convergence points for better accuracy.

2605.06403 · May 7, 2026 · Zhonghui Zhang, Feng Jiang, Shaowei Qin +2

