首页 | 官方网站   微博 | 高级检索  
     

大规模三角线性方程的高效求解
引用本文:贾迅,邬贵明,钱磊,谢向辉,吴东.大规模三角线性方程的高效求解[J].计算机工程与科学,2019,41(2):240-245.
作者姓名:贾迅  邬贵明  钱磊  谢向辉  吴东
作者单位:数学工程与先进计算国家重点实验室,江苏 无锡,214125;数学工程与先进计算国家重点实验室,江苏 无锡,214125;数学工程与先进计算国家重点实验室,江苏 无锡,214125;数学工程与先进计算国家重点实验室,江苏 无锡,214125;数学工程与先进计算国家重点实验室,江苏 无锡,214125
基金项目:国家自然科学基金(91430214,61732018)
摘    要:大规模三角线性方程求解是科学与工程应用中重要的计算核心,受限于处理器的缓存容量和结构设计,其在CPU和GPU等平台上的计算效率不高。大规模三角线性方程的分块求解中,矩阵乘是主要运算,其计算效率对提升三角线性方程求解的计算效率至关重要。以矩阵乘计算效率较高的矩阵乘协处理器为计算平台,针对其结构特点提出了矩阵乘协处理器上大规模三角线性方程分块求解的实现方法和性能分析模型。实验结果表明,矩阵乘协处理器上大规模三角线性方程求解的计算效率最高可达85.9%,其实际性能和资源利用率分别为同等工艺下GPU的2.42倍和10.72倍。

关 键 词:大规模  三角线性方程  矩阵乘  协处理器
收稿时间:2018-08-05
修稿时间:2019-02-25

An efficient solver for large-scale triangular linear equations
JIA Xun,WU Gui ming,QIAN Lei,XIE Xiang hui,WU Dong.An efficient solver for large-scale triangular linear equations[J].Computer Engineering & Science,2019,41(2):240-245.
Authors:JIA Xun  WU Gui ming  QIAN Lei  XIE Xiang hui  WU Dong
Affiliation:(State Key Laboratory of Mathematical Engineering and Advanced Computing,Wuxi 214125,China)
Abstract:Large scale triangular solver is an important computational kernel in scientific and engineering applications. However, execution of this kernel is not efficient on existing CPU and GPU platforms, due to limited cache capacity and the underlying problems of the architecture design. In the block solving of large-scale triangular linear equations, matrix multiplication is the main operation and its computational efficiency is crucial for improving the computational efficiency of solving triangular linear equations. Taking advantage of the high computation efficiency of the matrix multiplication coprocessor as the computing platform, and according its architectural features, we propose a block solving method and a performance analysis model of large-scale triangular linear equations on the matrix multiplication coprocessor. Experimental results show that a highly-efficient large scale triangular solver can be implemented on the matrix multiplication coprocessor with a computational efficiency up to 85.9%. Compared with the GPUs under the same process technology mode, the proposed triangular solver on the coprocessor can achieve 2.42× actual performance and 10.72× resource utilization.
Keywords:large-scale  triangular linear equation  matrix multiplication  coprocessor  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号