ArXiv TLDR

Hui Liu

9 papers · Latest:

Cryptography & Security

Model-Agnostic Lifelong LLM Safety via Externalized Attack-Defense Co-Evolution

EvoSafety is a framework for lifelong, model-agnostic LLM safety that counters adversarial prompts through externalized attack-defense co-evolution.

2605.13411
Software Engineering

Characterizing the Failure Modes of LLMs in Resolving Real-World GitHub Issues

This paper analyzes LLM failures in resolving GitHub issues, revealing strategy formulation as the most error-prone stage and localization as the least.

2605.12270
Information Retrieval

An Embarrassingly Simple Graph Heuristic Reveals Shortcut-Solvable Benchmarks for Sequential Recommendation

A simple graph heuristic outperforms complex generative models, revealing that many sequential recommendation benchmarks are "shortcut-solvable."

2605.07125
Machine Learning

Crafting Reversible SFT Behaviors in Large Language Models

This paper introduces LCDD to create sparse, controllable "carriers" for SFT behaviors in LLMs, enabling their selective reversal with SFT-Eraser.

2605.06632
Mesoscale & Nanoscale Physics

Genus-protected higher-order topological phases

This paper introduces a new class of higher-order topological phases protected by the system's global topology (genus), independent of lattice symmetries.

2605.06383
Information Retrieval

FedCRF: A Federated Cross-domain Recommendation Method with Semantic-driven Deep Knowledge Fusion

FedCRF enables privacy-preserving cross-domain recommendation in non-overlapping scenarios via federated semantic learning and deep knowledge fusion.

2604.17681
Information Retrieval

Federated User Behavior Modeling for Privacy-Preserving LLM Recommendation

SF-UBM proposes a privacy-preserving federated LLM recommendation system that uses semantic bridging and knowledge distillation for non-overlapping domains.

2604.14833
Natural Language Processing

Task-Aware LLM Routing with Multi-Level Task-Profile-Guided Data Synthesis for Cold-Start Scenarios

This paper introduces a data synthesis framework and TRouter for effective LLM routing in cold-start scenarios, improving performance and cost efficiency.

2604.09377
Natural Language Processing

Baichuan 2: Open Large-scale Language Models

Baichuan 2 is a series of large-scale, open-source multilingual language models that achieve state-of-the-art performance across general and specialized benchmarks.

2309.10305
