大规模三角线性方程的高效求解 An efficient solver for large-scale triangular linear equations期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

大规模三角线性方程的高效求解

引用本文：	贾迅,邬贵明,钱磊,谢向辉,吴东.大规模三角线性方程的高效求解[J].计算机工程与科学,2019,41(2):240-245.

作者姓名：	贾迅邬贵明钱磊谢向辉吴东

作者单位：	数学工程与先进计算国家重点实验室,江苏无锡,214125;数学工程与先进计算国家重点实验室,江苏无锡,214125;数学工程与先进计算国家重点实验室,江苏无锡,214125;数学工程与先进计算国家重点实验室,江苏无锡,214125;数学工程与先进计算国家重点实验室,江苏无锡,214125

基金项目：	国家自然科学基金（91430214,61732018）

摘要：	大规模三角线性方程求解是科学与工程应用中重要的计算核心,受限于处理器的缓存容量和结构设计,其在CPU和GPU等平台上的计算效率不高。大规模三角线性方程的分块求解中,矩阵乘是主要运算,其计算效率对提升三角线性方程求解的计算效率至关重要。以矩阵乘计算效率较高的矩阵乘协处理器为计算平台,针对其结构特点提出了矩阵乘协处理器上大规模三角线性方程分块求解的实现方法和性能分析模型。实验结果表明,矩阵乘协处理器上大规模三角线性方程求解的计算效率最高可达85.9%,其实际性能和资源利用率分别为同等工艺下GPU的2.42倍和10.72倍。
关键词：	大规模三角线性方程矩阵乘协处理器
收稿时间：	2018-08-05
修稿时间：	2019-02-25
An efficient solver for large-scale triangular linear equations

JIA Xun,WU Gui ming,QIAN Lei,XIE Xiang hui,WU Dong.An efficient solver for large-scale triangular linear equations[J].Computer Engineering & Science,2019,41(2):240-245.

Authors:	JIA Xun WU Gui ming QIAN Lei XIE Xiang hui WU Dong

Affiliation:	（State Key Laboratory of Mathematical Engineering and Advanced Computing,Wuxi 214125,China）

Abstract:	Large scale triangular solver is an important computational kernel in scientific and engineering applications. However, execution of this kernel is not efficient on existing CPU and GPU platforms, due to limited cache capacity and the underlying problems of the architecture design. In the block solving of large-scale triangular linear equations, matrix multiplication is the main operation and its computational efficiency is crucial for improving the computational efficiency of solving triangular linear equations. Taking advantage of the high computation efficiency of the matrix multiplication coprocessor as the computing platform, and according its architectural features, we propose a block solving method and a performance analysis model of large-scale triangular linear equations on the matrix multiplication coprocessor. Experimental results show that a highly-efficient large scale triangular solver can be implemented on the matrix multiplication coprocessor with a computational efficiency up to 85.9%. Compared with the GPUs under the same process technology mode, the proposed triangular solver on the coprocessor can achieve 2.42× actual performance and 10.72× resource utilization.

Keywords:	large-scale triangular linear equation matrix multiplication coprocessor
本文献已被万方数据等数据库收录！
	点击此处可从《计算机工程与科学》浏览原始摘要信息
	点击此处可从《计算机工程与科学》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏