Chip Placement with Deep Reinforcement Learning

Mirhoseini Azalia,Goldie Anna, Yazgan Mustafa,Jiang Joe,Songhori Ebrahim,Wang Shen,Lee Young-Joon,Johnson Eric,Pathak Omkar, Bae Sungmin,Nazi Azade,Pak Jiwoo, Tong Andy,Srinivasa Kavya,Hang William, Tuncer Emre, Babu Anand,Le Quoc V.,Laudon James,Ho Richard,Carpenter Roger,Dean Jeff

arxiv（2020）

引用 223|浏览508

暂无评分

摘要

In this work, we present a learning-based approach to chip placement, one of the most complex and time-consuming stages of the chip design process. Unlike prior methods, our approach has the ability to learn from past experience and improve over time. In particular, as we train over a greater number of chip blocks, our method becomes better at rapidly generating optimized placements for previously unseen chip blocks. To achieve these results, we pose placement as a Reinforcement Learning (RL) problem and train an agent to place the nodes of a chip netlist onto a chip canvas. To enable our RL policy to generalize to unseen blocks, we ground representation learning in the supervised task of predicting placement quality. By designing a neural architecture that can accurately predict reward across a wide variety of netlists and their placements, we are able to generate rich feature embeddings of the input netlists. We then use this architecture as the encoder of our policy and value networks to enable transfer learning. Our objective is to minimize PPA (power, performance, and area), and we show that, in under 6 hours, our method can generate placements that are superhuman or comparable on modern accelerator netlists, whereas existing baselines require human experts in the loop and take several weeks.

查看译文

关键词

deep reinforcement learning,reinforcement learning,placement

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要