Weiming Ren

2 papers · Latest: April 27, 2026

Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Tuna-2 is a unified multimodal model that uses pixel embeddings for both understanding and generation, outperforming vision encoders and simplifying the architecture.

2604.24763 · Apr 27, 2026

Artificial Intelligence

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

RationalRewards uses explicit, multi-dimensional critiques to improve visual generation at both training and test time, outperforming scalar rewards.

2604.11626 · Apr 13, 2026
