Information Retrieval

Papers on search engines, recommendation systems, and information extraction.

cs.IR · 379 papers

HSUGA: LLM-Enhanced Recommendation with Hierarchical Semantic Understanding and Group-Aware Alignment

HSUGA improves LLM-enhanced recommendations by using hierarchical semantic understanding and group-aware alignment for better user preference modeling.

2605.11662May 12, 2026Guorui Li, Dugang Liu, Lei Li +2

TwiSTAR:Think Fast, Think Slow, Then Act,Generative Recommendation with Adaptive Reasoning

TwiSTAR introduces an adaptive reasoning framework for generative recommendation, balancing speed and accuracy by dynamically selecting inference strategies.

2605.11553May 12, 2026Shiteng Cao, Kaian Jiang, Yunlong Gong +1

Much of Geospatial Web Search Is Beyond Traditional GIS

This paper reveals that geospatial web search is far more prevalent and practically oriented than previously understood, often exceeding traditional GIS capabilities.

2605.11336May 11, 2026Ilya Ilyankou, Stefano Cavazzi, James Haworth

Neural at ArchEHR-QA 2026: One Method Fits All: Unified Prompt Optimization for Clinical QA over EHRs

Neural1.5 uses modular prompt optimization and self-consistency to achieve strong results in clinical QA over EHRs, ranking second overall.

2605.10877May 11, 2026Abrar Majeedi, Viswanatha Reddy Gajjala, Sai Prasanna Teja Reddy Bogireddy +1

Rethinking Agentic Search with Pi-Serini: Is Lexical Retrieval Sufficient?

Pi-Serini demonstrates that well-tuned lexical retrieval with capable LLMs can effectively support deep agentic search, outperforming dense retrievers.

2605.10848May 11, 2026Tz-Huan Hsu, Jheng-Hong Yang, Jimmy Lin

Personalized Deep Research: A User-Centric Framework, Dataset, and Hybrid Evaluation for Knowledge Discovery

PDR is a user-centric framework that personalizes deep research agents by adapting retrieval and synthesis to individual user expertise and interests.

2605.10530May 11, 2026Xiaopeng Li, Wenlin Zhang, Yingyi Zhang +6

UniRank: Unified List-wise Reranking via Confidence-Ordered Denoising

UniRank unifies autoregressive and non-autoregressive reranking using confidence-ordered denoising, improving performance and user engagement.

2605.10527May 11, 2026Pengyue Jia, Hailan Yang, Shuchang Liu +7

AgentGR: Semantic-aware Agentic Group Decision-Making Simulator for Group Recommendation

AgentGR uses LLM-driven agents to simulate complex group decision-making, integrating collaborative and semantic preferences for improved group recommendations.

2605.10367May 11, 2026Yangtao Zhou, Wenhao You, Hua Chu +4

Every Preference Has Its Strength: Injecting Ordinal Semantics into LLM-Based Recommenders

OSA is a new LLM-based recommender framework that injects ordinal preference strength into collaborative filtering signals, improving fine-grained recommendations.

2605.10323May 11, 2026Jiwon Jeong, Donghee Han, Sungrae Hong +2

Qwen Goes Brrr: Off-the-Shelf RAG for Ukrainian Multi-Domain Document Understanding

A Qwen-based RAG system achieves high accuracy in Ukrainian multi-domain document understanding using contextual chunking and question-aware reranking.

2605.10296May 11, 2026Anton Bazdyrev, Ivan Bashtovyi, Ivan Havlytskyi +2

To Redact, or not to Redact? A Local LLM Approach to Deliberative Process Privilege Classification

A local LLM approach effectively classifies deliberative process privilege in government documents, outperforming prior methods securely.

2605.10211May 11, 2026Maik Larooij, David Graus

LASAR: Latent Adaptive Semantic Aligned Reasoning for Generative Recommendation

LASAR enables efficient, high-quality generative recommendation by using latent adaptive semantic aligned reasoning, significantly faster than explicit Chain-of-Thought.

2605.10207May 11, 2026Yiwen Chen, Fuwei Zhang, Zehao Chen +8

ASTRA-QA: A Benchmark for Abstract Question Answering over Documents

ASTRA-QA is a new benchmark for abstract question answering over documents, providing robust evaluation for coverage, hallucination, and retrieval scope.

2605.10168May 11, 2026Shu Wang, Shansong Zhou, Xinyang Wang +3

NumColBERT: Non-Intrusive Numeracy Injection for Late-Interaction Retrieval Models

NumColBERT improves dense retrieval for numerical queries using a non-intrusive method that enhances ColBERT without modifying its core architecture.

2605.10109May 11, 2026Haruki Fujimaki, Makoto P. Kato

H-MAPS: Hierarchical Memory-Augmented Proactive Search Assistant for Scientific Literature

H-MAPS is a proactive search assistant that uses hierarchical memory to provide personalized literature recommendations, reducing cognitive load during scientific reading.

2605.10097May 11, 2026Koji Nishikawa, Makoto P. Kato

CCD-Level and Load-Aware Thread Orchestration for In-Memory Vector ANNS on Multi-Core CPUs

This paper proposes a CCD-level and load-aware thread orchestration framework to boost in-memory vector ANNS performance on multi-core CPUs.

2605.10090May 11, 2026Yuchen Huang, Baiteng Ma, Yiping Sun +6

Enhancing Healthcare Search Intent Recognition with Query Representation Learning and Session Context

Improves healthcare search intent recognition by learning query representations and leveraging session context for better accuracy.

2605.10021May 11, 2026Harshita Jagdish Sahijwani, Madhav Sigdel, Song Aslan +4

Urban-ImageNet: A Large-Scale Multi-Modal Dataset and Evaluation Framework for Urban Space Perception

Urban-ImageNet is a new 2M+ multi-modal dataset and benchmark for evaluating AI's perception of urban spaces using social media imagery.

2605.09936May 11, 2026Yiwei Ou, Chung Ching Cheung, Jun Yang Ang +5

OpenZL: Using Graphs to Compress Smaller and Faster

OpenZL introduces a graph-based compression framework, enabling faster, smaller, and easier-to-develop application-specific compressors.

2605.09928May 11, 2026Yann Collet, Nick Terrell, W. Felix Handte +10

Nautilus Compass: Black-box Persona Drift Detection for Production LLM Agents

Nautilus Compass detects persona drift in black-box LLM agents using prompt-text analysis, offering an efficient and accessible memory solution.

2605.09863May 11, 2026Chunxiao Wang

PreviousPage 2 of 19Next

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.