Wenbo Hu
2 papers ยท Latest:
Computer Vision
Pixal3D: Pixel-Aligned 3D Generation from Images
Pixal3D introduces a pixel-aligned 3D generation method that significantly improves fidelity for creating high-quality 3D assets from images.
2605.10922
Computer VisionOpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks
OpenVLThinkerV2 introduces Gaussian GRPO and task-level shaping to create a robust multimodal reasoning model, outperforming existing models.
2604.08539
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.