Wanli Ouyang
3 papers ยท Latest:
Computer Vision
OmniLiDAR: A Unified Diffusion Framework for Multi-Domain 3D LiDAR Generation
OmniLiDAR is a unified diffusion framework that generates 3D LiDAR scans across eight diverse domains using text conditioning, addressing single-domain limitations.
2605.13815
Natural Language ProcessingStraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction
StraTA introduces strategic trajectory abstraction to agentic RL, improving LLM performance in long-horizon tasks by enhancing exploration and credit assignment.
2605.06642
Machine LearningBeyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
MODPO is a novel, RL-free method for aligning language models to multiple human preferences simultaneously, achieving stable and efficient optimization across diverse objectives.
2310.03708
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.