Architecture And Evaluation Of Low Power Many-Core Soc With Two 32-Core Clusters

IEICE TRANSACTIONS ON ELECTRONICS(2014)

引用 0|浏览39
暂无评分
摘要
New media processing applications such as image recognition and AR (Augment Reality) have become into practical on embedded systems for automotive, digital-consumer and mobile products. Many-core processors have been proposed to realize much higher performance than multi-core processors. We have developed a low-power many-core SoC for multimedia applications in 40 nm CMOS technology. Within a 210 mm(2) die, two 32-core clusters are integrated with dynamically reconfigurable processors, hardware accelerators, 2-channel DDR3 I/Fs, and other peripherals. Processor cores in the cluster share a 2MB L2 cache connected through a tree-based Network-on-Chip (NoC). Its total peak performance exceeds 1.5TOPS (Tera Operations Per Second). The high scalability and low power consumption are accomplished by parallelized software for multimedia applications. In case of face detection, the performance scales up to 64 cores and the SoC consumes only 2.21 W. Moreover, it can execute the 1080p 48 fps H.264 decoding about 520 mW by 28 cores and the 4K2K 15 fps super resolution about 770 mW by 32 cores in one cluster. Exploiting parallelism by low power processor cores, the many-core SoC provides several tens of times better energy efficiency than that of a high performance desk-top quad-core processor.
更多
查看译文
关键词
many-core, network-on-chip, VLIW, low power, face detection, H.264, super resolution
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要