Sihong Xie

3 papers · Latest: May 4, 2026

SCPRM: A Schema-aware Cumulative Process Reward Model for Knowledge Graph Question Answering

SCPRM is a new reward model for Knowledge Graph Question Answering that uses schema-aware cumulative rewards to improve multi-hop reasoning accuracy.

2605.02819May 4, 2026

Machine Learning

A decoupled diffusion planner that adapts to changing cost limits by using cost-conditioned generation for safety and reward gradients for performance

SDGD is a decoupled diffusion planner adapting to varying safety limits via cost-conditioned generation for safety and reward gradients for performance.

2605.02777May 4, 2026

Natural Language Processing

Geometry-Calibrated Conformal Abstention for Language Models

A post-hoc framework, Geometry-Calibrated Conformal Abstention, enables LMs to selectively abstain from answering when uncertain, boosting correctness.

2604.27914Apr 30, 2026

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.