SPynq: Acceleration of machine learning applications over Spark on Pynq

2017 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS)(2017)

引用 4|浏览6
暂无评分
摘要
Spark is one of the most widely used frameworks for data analytics that offers fast development of applications like machine learning and graph computations in distributed systems. In this paper, we present SPynq: A framework for the efficient utilization of hardware accelerators over the Spark framework on heterogeneous MPSoC FPGAs, such as Zynq. Spark has been mapped to the Pynq platform and the proposed framework allows the seamlessly utilization of the programmable logic for the hardware acceleration of computational intensive Spark kernels. We have also developed the required libraries in Spark that hides the accelerator's details to minimize the design effort to utilize the accelerators. A cluster of 4 nodes (workers) based on the all-programmable MPSoCs has been implemented and the proposed platform is evaluated in a typical machine learning application based on logistic regression. The logistic regression kernel has been developed as an accelerator and incorporated to the Spark. The developed system is compared to a high-performance Xeon cluster that is typically used in cloud computing. The performance evaluation shows that the heterogeneous accelerator-based MpSoC can achieve up to 2.3x system speedup compared with a Xeon system (with 90% accuracy) and 20x better energy-efficiency. For embedded application, the proposed system can achieve up to 40x speedup compared to the software only implementation on low-power embedded processors and 30x lower energy consumption.
更多
查看译文
关键词
embedded application,machine learning applications,graph computations,hardware accelerators,Spark framework,heterogeneous MPSoC FPGAs,Pynq platform,hardware acceleration,computational intensive Spark kernels,all-programmable MPSoCs,logistic regression kernel,heterogeneous accelerator,data analytics,distributed systems,SPynq,programmable logic,high-performance Xeon cluster,cloud computing,energy-efficiency,low-power embedded processors,energy consumption
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要