Shen Li
3 papers · Latest:
Machine Learning
LoKA: Low-precision Kernel Applications for Recommendation Models At Scale
LoKA introduces a system-model co-design framework to make FP8 low-precision arithmetic practical and efficient for large recommendation models.
2605.10886
Computer Vision
C-CoT: Counterfactual Chain-of-Thought with Vision-Language Models for Safe Autonomous Driving
C-CoT uses VLMs and counterfactual chain-of-thought to improve safe autonomous driving decisions, especially in complex, high-risk scenarios.
2605.10744
Machine Learning
FreeScale: Distributed Training for Sequence Recommendation Models with Minimal Scaling Cost
FreeScale optimizes distributed training for sequence recommendation models, reducing computational bubbles by up to 90.3% on 256 H100 GPUs.
2604.24073