Information Retrieval

Papers on search engines, recommendation systems, and information extraction.

cs.IR · 379 papers

VectorSmuggle: Steganographic Exfiltration in Embedding Stores and a Cryptographic Provenance Defense

VectorSmuggle reveals steganographic data exfiltration in RAG embedding stores and proposes VectorPin, a cryptographic defense for embedding integrity.

2605.13764May 13, 2026Jascha Wanger

Granite Embedding Multilingual R2 Models

Granite Embedding Multilingual R2 models offer state-of-the-art dense retrieval across 200+ languages with a 32k context window.

2605.13521May 13, 2026Parul Awasthy, Aashka Trivedi, Yushu Yang +15

RAG-Enhanced Large Language Models for Dynamic Content Expiration Prediction in Web Search

This paper introduces an LLM-based framework for dynamic content expiration prediction in web search, improving freshness and user experience.

2605.13052May 13, 2026Tingyu Chen, Wenkai Zhang, Li Gao +4

Same Image, Different Meanings: Toward Retrieval of Context-Dependent Meanings

This paper explores retrieving context-dependent image meanings, observing that semantic abstraction dictates how much context is needed for accurate retrieval.

2605.12905May 13, 2026Ayuto Tsutsumi, Ryosuke Kohita

Task-Adaptive Embedding Refinement via Test-time LLM Guidance

This paper introduces an LLM-guided query refinement method that adapts embedding models in real-time for challenging zero-shot search and classification tasks.

2605.12487May 12, 2026Ariel Gera, Shir Ashury-Tahan, Gal Bloch +2

ORBIT: Preserving Foundational Language Capabilities in GenRetrieval via Origin-Regulated Merging

ORBIT prevents catastrophic forgetting in GenRetrieval LLMs by regulating weight drift, preserving foundational language capabilities.

2605.12419May 12, 2026Neha Verma, Nikhil Mehta, Shao-Chuan Wang +7

Question Difficulty Estimation for Large Language Models via Answer Plausibility Scoring

Q-DAPS estimates LLM question difficulty by analyzing the entropy of answer plausibility scores, outperforming baselines and aligning with human judgment.

2605.12398May 12, 2026Jamshid Mozafari, Bhawna Piryani, Adam Jatowt

Context Convergence Improves Answering Inferential Questions

This paper shows that constructing passages with high "context convergence" significantly improves LLM accuracy on inferential question answering tasks.

2605.12370May 12, 2026Jamshid Mozafari, Bhawna Piryani, Adam Jatowt

MedHopQA: A Disease-Centered Multi-Hop Reasoning Benchmark and Evaluation Framework for LLM-Based Biomedical Question Answering

MedHopQA is a new disease-centered multi-hop reasoning benchmark for evaluating LLMs in biomedical QA, designed to resist saturation and contamination.

2605.12361May 12, 2026Rezarta Islamaj, Robert Leaman, Joey Chan +13

EHR-RAGp: Retrieval-Augmented Prototype-Guided Foundation Model for Electronic Health Records

EHR-RAGp is a retrieval-augmented foundation model for EHRs, dynamically integrating relevant patient history via a prototype-guided module for better clinical predictions.

2605.12335May 12, 2026Saeed Shurrab, Mariam Al-Omari, Dana El Samad +1

Overview of the MedHopQA track at BioCreative IX: track description, participation and evaluation of systems for multi-hop medical question answering

The MedHopQA track benchmarked LLMs on multi-hop medical QA with a new 1,000-pair dataset, highlighting RAG's importance for strong performance.

2605.12313May 12, 2026Rezarta Islamaj, Joey Chan, Robert Leaman +13

BatchBench: Toward a Workload-Aware Benchmark for Autoscaling Policies in Big Data Batch Processing -- A Proposed Framework

The paper proposes BatchBench, an open framework to benchmark diverse autoscaling policies for big data batch processing.

2605.12272May 12, 2026Venkata Krishna Prasanth Budigi, Siri Chandana Sirigiri

Unlocking Crowdsourcing for Ontology Matching Validation

This paper introduces a novel crowdsourcing system with domain-specific mechanisms to validate ontology matching, addressing challenges from LLMs.

2605.12226May 12, 2026Zhangcheng Qiang

Design Your Ad: Personalized Advertising Image and Text Generation with Unified Autoregressive Models

This paper introduces Uni-AdGen, a unified autoregressive model for personalized image and text ad generation, improving realism and user preference.

2605.12138May 12, 2026Yexing Xu, Wei Feng, Shen Zhang +15

Caraman at SemEval-2026 Task 8: Three-Stage Multi-Turn Retrieval with Query Rewriting, Hybrid Search, and Cross-Encoder Reranking

This paper presents a three-stage multi-turn retrieval system using query rewriting, hybrid search, and cross-encoder reranking for SemEval-2026 Task 8.

2605.12028May 12, 2026David-Maximilian Caraman, Gheorghe Cosmin Silaghi

From Trajectories to Phenotypes: Disease Progression as Structural Priors for Multi-organ Imaging Representation Learning

A new framework distills disease trajectory knowledge into imaging models, significantly improving disease prediction, especially for rare conditions.

2605.11958May 12, 2026Zian Wang, Lizhen Lan, Guangming Wang +9

Page 1 of 19Next

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.