Binyuan Hui
3 papers ยท Latest:
StarCoder 2 and The Stack v2: The Next Generation
StarCoder2 is a next-generation open-source Code LLM trained on a vastly expanded and diverse dataset, achieving state-of-the-art performance on multiple code benchmarks while being more parameter-efficient than larger models.
Qwen Technical Report
Qwen is a versatile large language model series featuring base, chat, coding, and math-specialized models that achieve strong performance across diverse AI tasks, rivaling larger and proprietary models.
OctoPack: Instruction Tuning Code Large Language Models
OctoPack introduces instruction tuning for code LLMs using a massive dataset of Git commits, achieving state-of-the-art results on multi-language coding benchmarks without relying on OpenAI data.
๐ฌ Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week โ summarized, scored, and delivered to your inbox every Monday.