Subhabrata Mukherjee

2 papers · Latest: May 7, 2026

Crafting Reversible SFT Behaviors in Large Language Models

This paper introduces LCDD to create sparse, controllable "carriers" for SFT behaviors in LLMs, enabling their selective reversal with SFT-Eraser.

2605.06632 · May 7, 2026

Natural Language Processing

Orca: Progressive Learning from Complex Explanation Traces of GPT-4

Orca is a 13B parameter model that improves small model reasoning by progressively learning from GPT-4's complex explanation traces and step-by-step thought processes, achieving state-of-the-art zero-shot performance on challenging benchmarks.

2306.02707 · Jun 5, 2023
