FPGA Design of High-Speed Convolutional Neural Network Hardware Accelerator.

Ahmed J. Abd El-Maksoud,Abdallah Mohamed,Ahmed Tarek,Amr Adel, Amr Eid, Farida Khaled, Fatma Khaled,Ziad Ibrahim,Eman El Mandouh,Hassan Mostafa

Novel Intelligent and Leading Emerging Sciences Conference（2021）

引用 2|浏览1

暂无评分

摘要

Convolutional Neural Networks get increasingly importance nowadays as they enable machines to interact with the surrounding environment, which paves the way for computer vision applications. FPGA implementations of CNN architectures have higher speed and lower power consumption compared to GPUs and CPUs. This paper proposes a high-speed hardware accelerator on FPGA for SqueezeNet CNN to accelerate its processing without decreasing the classification accuracy. Several ideas are applied to solve the memory bottleneck issue such as using Ping-Pong memory and deploying several FIFOs in the design. The architecture is built as a pipelined unit to process SqueezeNet CNN layer by layer. Different parallelism techniques are applied while processing the convolution layers to speedup layers processing. Moreover, the proposed accelerator classifies 248.76 fps at a frequency of 100MHz, and 427.4 fps at a frequency of 172 MHz. The proposed accelerator is implemented on Virtex-7 FPGA, and overcomes Geforce RTX 2080Ti GPU and several SqueezeNet FPGA implementations.

查看译文

关键词

Convolutional Neural Networks (CNNs),FPGAs,Hardware Accelerators,SqueezeNet

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要