首页 | 官方网站   微博 | 高级检索  
     

基于ARM+FPGA平台的二值神经网络加速方法研究
引用本文:孙孝辉,宋庆增,金光浩,姜文超.基于ARM+FPGA平台的二值神经网络加速方法研究[J].计算机应用研究,2020,37(3):779-783.
作者姓名:孙孝辉  宋庆增  金光浩  姜文超
作者单位:天津工业大学 计算机科学与技术学院,天津300387;广东工业大学 计算机学院,广州510006
基金项目:广东省科技计划;天津市自然科学基金;国家自然科学基金
摘    要:现有的卷积神经网络由于其结构复杂且依赖的数据集庞大,难以满足某些实际应用或者计算平台对运算性能的要求和能耗的限制。针对这些应用或计算平台,对基于ARM+FPGA平台的二值化算法进行了研究,并设计了二值神经网络,该网络减少了数据对存储单元的需求量,也降低了运算的复杂度。在ARM+FPGA平台内部实现时,通过将卷积的乘累加运算转换为XNOR逻辑运算和popcount等操作,提高了整体的运算效率,降低了对能源和资源的消耗。同时,根据二值神经网络中数据存储的特点提出了新的行处理改进算法,提高了网络的吞吐量。该实现方式在GOPS、能源和资源效率方面均优于现有的FPGA神经网络加速方法。

关 键 词:二值神经网络  现场可编程门阵列  异或运算  行处理算法
收稿时间:2018/8/4 0:00:00
修稿时间:2020/1/21 0:00:00

Research of binary neural network acceleration method based on ARM+FPGA platform
Sun Xioahui,Song Qingzeng,Jin Guanghao and Jiang Wenchao.Research of binary neural network acceleration method based on ARM+FPGA platform[J].Application Research of Computers,2020,37(3):779-783.
Authors:Sun Xioahui  Song Qingzeng  Jin Guanghao and Jiang Wenchao
Affiliation:DeptSchool or Collegeof Computer Science Software Engineering,Tianjin Polytechnic University,,,
Abstract:The existing convolutional neural network(CNN) has complicated structure and bases on huge dataset, so it is difficult to meet the requirement of computing performance and limitation of energy consumption in some practical applications or computing platforms. Aiming at these applications or platforms, this paper studied the binary algorithm based on ARM+FPGA platform and designed a binary neural network(BNN). It reduced the demand for data storage units and simplified the computational complexity. When implemented in the ARM+FPGA platform, it converted the convolution multiply-accumulate operation into XNOR logic and popcount operation, which improved the overall operation efficiency and declined the consumption of energy and resources. At the same time, based on the characteristics of data storage in BNN, this paper proposed a new row-processing algorithm to improve the throughput of the network. In a word, this implementation is superior to the existing FPGA neural network acceleration methods in terms of GOPS, energy and resource efficiency.
Keywords:BNN  FPGA  XNOR  row-processing algorithm
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号