Shukang Yin

3 papers · Latest: April 22, 2026

SpeechParaling-Bench: A Comprehensive Benchmark for Paralinguistic-Aware Speech Generation

SpeechParaling-Bench is a new benchmark for evaluating paralinguistic-aware speech generation in large audio-language models (LALMs), using fine-grained features and a novel LALM-based judge.

2604.20842 · Apr 22, 2026

Computer Vision

Tango: Taming Visual Signals for Efficient Video Large Language Models

Tango optimizes token pruning in Video LLMs by improving attention selection and similarity clustering, achieving significant speedup with minimal performance loss.

2604.09547 · Apr 10, 2026

Computer Vision

A Survey on Multimodal Large Language Models

This paper surveys recent advances in Multimodal Large Language Models (MLLMs), highlighting their architectures, training, capabilities, and future research directions.

2306.13549 · Jun 23, 2023
