Jin Xie
2 papers · Latest:
Robotics
Toward Visually Realistic Simulation: A Benchmark for Evaluating Robot Manipulation in Simulation
VISER is a new visually realistic benchmark for robot manipulation, bridging the sim-to-real gap with high-fidelity assets and strong real-world correlation.
2605.06311
Software Engineering
SWE-TRACE: Optimizing Long-Horizon SWE Agents Through Rubric Process Reward Models and Heuristic Test-Time Scaling
SWE-TRACE improves long-horizon SWE agents via a unified framework for data curation, process reward RL, and heuristic test-time scaling.
2604.14820