RefEvo: Agentic Design with Co-Evolutionary Verification for Agile Reference Model Generation
Yifan Zhang, Jianmin Ye, Jiahao Yang, Xi Wang
TLDR
RefEvo is a multi-agent framework that uses co-evolutionary verification and context compression to generate SystemC hardware reference models quickly and reliably.
Key contributions
- Dynamic Design Planner autonomously tailors workflows based on design complexity.
- Co-Evolutionary Verification rectifies models and testbenches against specifications, mitigating false positives.
- Spec Anchoring Strategy provides lossless context compression, saving over 70,000 tokens per session on complex designs.
- Achieves a 95% pass rate on a benchmark of 20 hardware modules, with an average token reduction of 71.04% and 100% specification recall.
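The Coupled Validation Failure that co-evolutionary verification targets can be illustrated with a toy example. The sketch below is not the paper's implementation: it assumes the specification can be expressed as an executable oracle (a hypothetical 8-bit saturating adder), whereas RefEvo works from a written Spec and an LLM-based Dialectical Arbiter. All names (`spec_oracle`, `flawed_model`, `flawed_tb`, `arbitrate`, `coupled_failures`, `rectify_tb`) are invented for illustration.

```python
from typing import Callable, Dict, List, Tuple

Vector = Tuple[int, int]

def spec_oracle(a: int, b: int) -> int:
    """Assumed spec for illustration: an 8-bit adder that saturates at 255."""
    return min(a + b, 255)

def flawed_model(a: int, b: int) -> int:
    """Generated model with a bug: it wraps around instead of saturating."""
    return (a + b) & 0xFF

# Generated testbench that shares the model's mistake (correlated error):
# it expects the wrapped value 44 for 200 + 100.
flawed_tb: Dict[Vector, int] = {(1, 2): 3, (200, 100): 44}

def arbitrate(
    model: Callable[[int, int], int],
    tb: Dict[Vector, int],
    oracle: Callable[[int, int], int],
) -> List[dict]:
    """Judge the model and the testbench separately against the spec oracle."""
    verdicts = []
    for (a, b), expected in tb.items():
        truth = oracle(a, b)
        actual = model(a, b)
        verdicts.append({
            "vector": (a, b),
            "naive_pass": actual == expected,  # what a TB-only check reports
            "model_ok": actual == truth,       # model vs. spec
            "tb_ok": expected == truth,        # testbench vs. spec
        })
    return verdicts

def coupled_failures(verdicts: List[dict]) -> List[Vector]:
    """Vectors where the TB passes the model although at least one is wrong."""
    return [v["vector"] for v in verdicts
            if v["naive_pass"] and not (v["model_ok"] and v["tb_ok"])]

def rectify_tb(tb: Dict[Vector, int],
               oracle: Callable[[int, int], int]) -> Dict[Vector, int]:
    """Replace every testbench expectation with the spec-derived value."""
    return {vec: oracle(*vec) for vec in tb}

if __name__ == "__main__":
    verdicts = arbitrate(flawed_model, flawed_tb, spec_oracle)
    print(coupled_failures(verdicts))
    print(rectify_tb(flawed_tb, spec_oracle))
```

In this toy setup a testbench-only check passes every vector, so the flaw is visible only when both artifacts are compared against the specification rather than against each other.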
Why it matters
RefEvo addresses critical limitations of LLMs in hardware modeling, such as rigid workflows and context overflow, by introducing a dynamic multi-agent framework. This enables the agile and reliable generation of high-fidelity SystemC reference models, crucial for early SoC architecture exploration and verification.
Original Abstract
As the complexity of System-on-Chip (SoC) designs grows, the shift-left paradigm necessitates the rapid development of high-fidelity reference models (typically written in SystemC) for early architecture exploration and verification. While Large Language Models (LLMs) show promise in code generation, their application to hardware modeling faces unique challenges: (1) Rigid, static workflows fail to adapt to varying design complexity, causing inefficiency; (2) Context window overflow in multi-turn interactions leads to catastrophic forgetting of critical specifications; and (3) the Coupled Validation Failure problem--where generated Testbenches (TBs) incorrectly validate flawed models due to correlated hallucinations--severely undermines reliability. To address these limitations, we introduce RefEvo, a dynamic multi-agent framework designed for agile and reliable reference modeling. RefEvo features three key innovations: (1) A Dynamic Design Planner that autonomously decomposes design specifications and constructs tailored execution workflows based on semantic complexity; (2) A Co-Evolutionary Verification Mechanism, which employs a Dialectical Arbiter to simultaneously rectify the model and verification logic against the specification (Spec) oracle, effectively mitigating false positives; and (3) A Spec Anchoring Strategy for lossless context compression. Evaluated on a diverse benchmark of 20 hardware modules, RefEvo achieves a 95% pass rate, outperforming static baselines by a large margin. Furthermore, our context optimization reduces token consumption by an average of 71.04%, achieving absolute savings of over 70,000 tokens per session for complex designs while maintaining 100% specification recall.
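The abstract does not describe how Spec Anchoring works internally. One plausible reading, sketched below purely as an assumption, is that the specification stays pinned verbatim in the prompt while older dialogue turns are dropped to fit a token budget. The function names (`count_tokens`, `build_context`), the whitespace-based token count, and the drop-oldest policy are all hypothetical and are not taken from the paper.

```python
from typing import List

def count_tokens(text: str) -> int:
    """Crude stand-in for a tokenizer: counts whitespace-separated words."""
    return len(text.split())

def build_context(spec: str, turns: List[str], budget: int) -> List[str]:
    """Keep the spec verbatim, then add the most recent turns that fit.

    The spec is always kept, even if it alone exceeds the budget, so the
    specification text is never lost; only dialogue history is discarded.
    """
    used = count_tokens(spec)
    kept: List[str] = []
    # Walk from newest to oldest and stop at the first turn that overflows.
    for turn in reversed(turns):
        cost = count_tokens(turn)
        if used + cost > budget:
            break
        kept.append(turn)
        used += cost
    return [spec] + list(reversed(kept))

if __name__ == "__main__":
    spec = "ADDER saturates at 255"
    history = ["a b c d e", "f g h", "i j"]
    print(build_context(spec, history, budget=10))
```

Under this policy the token savings come entirely from discarded history, which is one way a scheme could shrink context while keeping specification recall at 100%.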