ArXiv TLDR

Yue Wang

8 papers

Natural Language Processing

CoCoReviewBench: A Completeness- and Correctness-Oriented Benchmark for AI Reviewers

CoCoReviewBench is a new benchmark for AI reviewers, focusing on completeness and correctness by curating 3,900 papers with expert annotations.

2605.07905

Robotics

ReflectDrive-2: Reinforcement-Learning-Aligned Self-Editing for Discrete Diffusion Driving

ReflectDrive-2 introduces a discrete diffusion planner for autonomous driving with self-editing capabilities, significantly improved by reinforcement learning.

2605.04647

Computer Vision

Representation Fréchet Loss for Visual Generation

This paper introduces FD-loss, a method to optimize Fréchet Distance in representation space, significantly improving visual generation quality and efficiency.

2604.28190

Computer Vision

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

GLM-5V-Turbo is a new foundation model integrating multimodal perception natively for enhanced agent reasoning, planning, and tool use across diverse contexts.

2604.26752

Robotics

Reference-Augmented Learning for Precise Tracking Policy of Tendon-Driven Continuum Robots

This paper introduces a reference-augmented offline learning framework for precise 6-DOF tracking control of Tendon-Driven Continuum Robots.

2604.25698

Robotics

Learning-Based Dynamics Modeling and Robust Control for Tendon-Driven Continuum Robots

This paper presents a differentiable learning framework for robust control of tendon-driven continuum robots, overcoming complex nonlinearities.

2604.25691

Natural Language Processing

WebGen-R1: Incentivizing Large Language Models to Generate Functional and Aesthetic Websites with Reinforcement Learning

WebGen-R1 uses reinforcement learning to enable small LLMs to generate functional, aesthetic, multi-page websites, outperforming larger models.

2604.20398

Computer Vision

Seedance 2.0: Advancing Video Generation for World Complexity

Seedance 2.0 is a new multi-modal audio-video generation model with a unified architecture, offering advanced capabilities and improved performance.

2604.14148