Ruiqi Wang

2 papers · Latest: May 1, 2026

PrefMoE: Robust Preference Modeling with Mixture-of-Experts Reward Learning

PrefMoE uses a mixture-of-experts to robustly learn rewards from noisy preference data, improving policy learning in RL.

2605.00384May 1, 2026

Software Engineering

On the Effectiveness of Context Compression for Repository-Level Tasks: An Empirical Investigation

This paper empirically investigates context compression for repository-level code tasks, finding it effective for performance and efficiency.

2604.13725Apr 15, 2026

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.