Design and Implementation of a Fast Convolution Algorithm for Embedded Platform

2022 11th International Conference of Information and Communication Technology (ICTech))(2022)

引用 1|浏览1
暂无评分
摘要
In recent years, deep learning has been gradually applied to the industry with great success. As the demand for the lightweight intelligent devices increases, the deployment of deep learning models on embedded platforms to meet users' needs for real-time performance has become a trend in the development of intelligence. However, due to the pursuit of higher accuracy, existing deep learning frameworks are becoming richer in functionality and more complex in computation. A large amount of memory requirements and computational power demands make it challenging to deploy neural network computing frameworks on embedded platforms with limited resources and computational power. The WPOC algorithm is proposed and integrated into the Darknet framework to address real-time image processing based on the Winograd algorithm. Tested on the ZYNQ-7010 platform was passed. The results show that the WPOC algorithm proposed in this paper can effectively speed up image recognition by about six times under the VGG-16 model while ensuring the same accuracy rate.
更多
查看译文
关键词
embedded devices,neural network acceleration,real-time,winograd algorithm
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要