Ruiqi Wang
2 papers · Latest:
Robotics
PrefMoE: Robust Preference Modeling with Mixture-of-Experts Reward Learning
PrefMoE uses a mixture-of-experts to robustly learn rewards from noisy preference data, improving policy learning in RL.
2605.00384
Software Engineering
On the Effectiveness of Context Compression for Repository-Level Tasks: An Empirical Investigation
This paper empirically investigates context compression for repository-level code tasks, finding it effective for performance and efficiency.
2604.13725