谷歌浏览器插件
订阅小程序
在清言上使用

An OpenMP-Based Parallel Execution of Neural Networks Specified in NNEF

ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT II(2020)

引用 0|浏览0
暂无评分
摘要
Recently, we have many research works on the neural networks and their related issues. For exchangeability of neural network frameworks, the Neural Network Exchange Format (NNEF) specification is now widely used. Due to very large size of these neural networks, their accelerations are actively explored, and can be achieved through parallel processing techniques. In this work, we present a prototype implementation of C++ code generator with parallel-processing accelerations based on OpenMP, for the NNEF specification files. Our implementation shows remarkable accelerations, in comparison to the original C++ template-based execution. We will tune the prototype acceleration to achieve more remarkable speed ups.
更多
查看译文
关键词
OpenMP,Parallel processing,Neural network,NNEF,Code generation,Prototype,Acceleration
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要