Lewei Lu
3 papers · Latest:
Computer Vision
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
SenseNova-U1 introduces a unified architecture (NEO-unify) that seamlessly integrates multimodal understanding and generation, outperforming specialized VLMs.
2605.12500
Artificial Intelligence
OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis
OpenMobile is an open-source framework for mobile agents, synthesizing high-quality task instructions and trajectories to achieve competitive results.
2604.15093
Computer Vision
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks
InternVL is a 6-billion-parameter vision-language foundation model that aligns large-scale vision models with LLMs to achieve state-of-the-art results across diverse visual-linguistic tasks.
2312.14238