Qichao Zhang
2 papers ยท Latest:
Robotics
PokeVLA: Empowering Pocket-Sized Vision-Language-Action Model with Comprehensive World Knowledge Guidance
PokeVLA is a lightweight Vision-Language-Action model that improves robot manipulation by integrating comprehensive world knowledge and spatial awareness.
2604.20834
Machine Learning$ฯ$-Play: Multi-Agent Self-Play via Privileged Self-Distillation without External Data
$ฯ$-Play enhances self-play for search agents by using internally generated "question construction paths" as privileged information for dense self-distillation.
2604.14054
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.