PPAAS: PVT and Pareto Aware Analog Sizing via Goal-conditioned Reinforcement Learning
PPAAS: PVT and Pareto Aware Analog Sizing via Goal-conditioned Reinforcement Learning
Device sizing is a critical yet challenging step in analog and mixed-signal circuit design, requiring careful optimization to meet diverse performance specifications. This challenge is further amplified under process, voltage, and temperature (PVT) variations, which cause circuit behavior to shift across different corners. While reinforcement learning (RL) has shown promise in automating sizing for fixed targets, training a generalized policy that can adapt to a wide range of design specifications under PVT variations requires much more training samples and resources. To address these challenges, we propose a \textbf{Goal-conditioned RL framework} that enables efficient policy training for analog device sizing across PVT corners, with strong generalization capability. To improve sample efficiency, we introduce Pareto-front Dominance Goal Sampling, which constructs an automatic curriculum by sampling goals from the Pareto frontier of previously achieved goals. This strategy is further enhanced by integrating Conservative Hindsight Experience Replay to stabilize training and accelerate convergence. To reduce simulation overhead, our framework incorporates a Skip-on-Fail simulation strategy. Experiments on benchmark circuits demonstrate $\sim$1.6$\times$ improvement in sample efficiency and $\sim$4.1$\times$ improvement in simulation efficiency compared to existing sizing methods. Code and benchmarks are publicly available at https://github.com/SeunggeunKimkr/PPAAS
Seunggeun Kim、Ziyi Wang、Sungyoung Lee、Youngmin Oh、Hanqing Zhu、Doyun Kim、David Z. Pan
电子电路微电子学、集成电路
Seunggeun Kim,Ziyi Wang,Sungyoung Lee,Youngmin Oh,Hanqing Zhu,Doyun Kim,David Z. Pan.PPAAS: PVT and Pareto Aware Analog Sizing via Goal-conditioned Reinforcement Learning[EB/OL].(2025-08-03)[2025-08-16].https://arxiv.org/abs/2507.17003.点此复制
评论