Min Si
2 papers ยท Latest:
Machine Learning
FreeScale: Distributed Training for Sequence Recommendation Models with Minimal Scaling Cost
FreeScale optimizes distributed training for sequence recommendation models, reducing computational bubbles by up to 90.3% on 256 H100 GPUs.
2604.24073
Artificial IntelligenceThe Llama 3 Herd of Models
Llama 3 is a new family of large multilingual foundation models excelling in language, coding, reasoning, and multimodal tasks, rivaling GPT-4 in quality and offering extensive public releases.
2407.21783
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.