Yan Wang
5 papers ยท Latest:
MindMirror: A Local-First Multimodal State-Aware Support System for Digital Workers
MindMirror is a local-first multimodal system that uses AI to support digital workers by monitoring their state and offering personalized help.
RoboMemArena: A Comprehensive and Challenging Robotic Memory Benchmark
RoboMemArena is a new, large-scale robotic memory benchmark with 26 tasks, real-world evaluation, and VLM-generated annotations, alongside the PrediMem VLA.
CapVector: Learning Transferable Capability Vectors in Parametric Space for Vision-Language-Action Models
CapVector learns transferable capability vectors in parametric space for VLA models, enhancing performance and reducing adaptation costs during finetuning.
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents
GLM-5V-Turbo is a new foundation model integrating multimodal perception natively for enhanced agent reasoning, planning, and tool use across diverse contexts.
Intent Propagation Contrastive Collaborative Filtering
IPCCF improves collaborative filtering by using a double helix message propagation and contrastive learning for better intent disentanglement.
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.