Dasen Dai
2 papers ยท Latest:
Computer Vision
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents
OpenSearch-VL provides an open-source recipe for training frontier multimodal deep search agents, achieving state-of-the-art performance.
2605.05185
Natural Language ProcessingUIPress: Bringing Optical Token Compression to UI-to-Code Generation
UIPress introduces the first encoder-side learned optical compression for UI-to-Code generation, significantly boosting speed and performance.
2604.09442
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.