ArXiv TLDR

Hedwig: Dynamic Autonomy for Coding Agents Under Local Oversight

🐦 Tweet
2605.11495

Tanjal Shukla, K. J. Kevin Feng, Leijie Wang, Mohammad Rostami, Amy X. Zhang

cs.HC

TLDR

Hedwig is a CLI coding agent that dynamically adjusts its autonomy based on developer interactions, adapting to user trust and evolving preferences.

Key contributions

  • Identifies developer frustration with static agent autonomy settings via a formative survey.
  • Introduces Hedwig, a CLI coding agent that dynamically adjusts its autonomy based on developer interactions.
  • Learns evolving behavioral guidelines from developer decisions and feedback across multiple sessions.
  • Reduces friction for trusted tasks and tightens oversight when the agent operates in unfamiliar territory.

Why it matters

This paper addresses a critical challenge in AI-assisted coding: the static nature of agent autonomy. Hedwig offers a novel solution by allowing agents to dynamically adapt their oversight based on developer trust and evolving preferences. This approach promises more effective and less frustrating collaboration between developers and coding agents.

Original Abstract

Despite coding agents' advances in handling increasingly complex tasks, their continued tendency to introduce unintended edits, subtle bugs, and scope drift that slip past code review means developers must still decide how much autonomy to grant them. However, existing approaches for setting an agent's level of autonomy, such as static permission settings or instruction files, cannot account for how developers' preferences for agent autonomy can shift across tasks and over time. We conducted a formative survey with 21 software engineers who use coding agents and found that they experience frustration with calibrating autonomy and have evolving preferences for level of oversight. Building on these insights, we present Hedwig, a CLI coding agent that dynamically adjusts its autonomy level based on developer-agent interactions across sessions. Rather than operating on a global, fixed autonomy configuration, Hedwig learns an evolving set of behavioral guidelines from developer decisions and feedback, reducing friction on work for which the agent has earned trust, while tightening oversight when the agent operates outside familiar territory. Hedwig demonstrates the potential of a new paradigm where agents intelligently adapt their level of autonomy based on user trust through active, longitudinal collaboration.

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.