Chu-Cheng Lin
2 papers ยท Latest:
Machine Learning
How Fast Should a Model Commit to Supervision? Training Reasoning Models on the Tsallis Loss Continuum
This paper introduces a Tsallis loss family ($J_Q$) that mitigates cold-start stalling in reasoning models by interpolating between exploitation and density estimation.
2604.25907
Natural Language ProcessingGemini: A Family of Highly Capable Multimodal Models
Gemini is a new family of multimodal AI models excelling in image, audio, video, and text understanding, achieving state-of-the-art results across numerous benchmarks including human-expert level on MMLU.
2312.11805
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.