Shuang Yang
2 papers ยท Latest:
Computer Vision
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
SenseNova-U1 introduces a unified architecture (NEO-unify) that seamlessly integrates multimodal understanding and generation, outperforming specialized VLMs.
2605.12500
Information RetrievalOn the Equivalence Between Auto-Regressive Next Token Prediction and Full-Item-Vocabulary Maximum Likelihood Estimation in Generative Recommendation--A Short Note
This paper proves that auto-regressive next-token prediction in generative recommendation is mathematically equivalent to full-item-vocabulary maximum likelihood estimation.
2604.15739
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.