Equivariant Reinforcement Learning for Clifford Quantum Circuit Synthesis
Richie Yeung, Aleks Kissinger, Rob Cornish
TLDR
Introduces an equivariant RL agent for efficient, scalable Clifford quantum circuit synthesis across varying qubit counts.
Key contributions
- Formulates Clifford circuit synthesis as a reinforcement learning problem using symplectic matrix reduction.
- Develops a size-agnostic, equivariant neural network respecting qubit relabelings for generalization.
- Achieves near-optimal synthesis on 6-qubit circuits with fast inference times.
- Scales to 30 qubits, outperforming existing Clifford synthesizers in two-qubit gate count.
Why it matters
This paper presents a novel RL approach that generalizes across qubit counts and improves Clifford circuit synthesis efficiency. It enables scalable quantum compilation critical for advancing quantum computing.
Original Abstract
We consider the problem of synthesizing Clifford quantum circuits for devices with all-to-all qubit connectivity. We approach this task as a reinforcement learning problem in which an agent learns to discover a sequence of elementary Clifford gates that reduces a given symplectic matrix representation of a Clifford circuit to the identity. This formulation permits a simple learning curriculum based on random walks from the identity. We introduce a novel neural network architecture that is equivariant to qubit relabelings of the symplectic matrix representation, and which is size-agnostic, allowing a single learned policy to be applied across different qubit counts without circuit splicing or network reparameterization. On six-qubit Clifford circuits, the largest regime for which optimal references are available, our agent finds circuits within one two-qubit gate of optimality in milliseconds per instance, and finds optimal circuits in 99.2% of instances within seconds per instance. After continued training on ten-qubit instances, the agent scales to unseen Clifford tableaus with up to thirty qubits, including targets generated from circuits with over a thousand Clifford gates, where it achieves lower average two-qubit gate counts than Qiskit's Aaronson-Gottesman and greedy Clifford synthesizers.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.