首页 | 官方网站   微博 | 高级检索  
     

一类Toeplitz三对角方程组的一种分布式并行算法
引用本文:骆志刚,李晓梅.一类Toeplitz三对角方程组的一种分布式并行算法[J].计算机学报,2001,24(2):173-178.
作者姓名:骆志刚  李晓梅
作者单位:国防科学技术大学计算机学院
基金项目:国家自然科学基金!重点项目 (6 99330 30 ),国家“八六三”高技术研究发展计划! (86 3-30 6 -ZD-0 1-0 3-4 ),并行与分布处理国家重
摘    要:文中提出一类Toeplitz三对角方程组的一种分布式并行算法。该算法以系数矩阵的分解为基础,充分利用了系数矩阵结构的特殊性,算法因并行化而引入的冗余计算量非常少,算法的通信机制简单,通信量仅与处理 机台数p有关,与方程组规模n无关,算法具有很高的并行效率,理论分析和数值试验表明,其加速比Sp(n)→p(n→ ∞),此为线性加速比的理想情况。文中给出了算法在分布存储多计算机系统上的数值试验结果。

关 键 词:Toeplitz三对角方程组  分布式并行算法  并行计算机  系数矩阵
修稿时间:1999年12月27

A Parallel Solver for Certain Toeplitz Tridiagonal Systems on Distributed-Memory Multicomputers
LUO Zhi-gang,LI Xiao-Mei.A Parallel Solver for Certain Toeplitz Tridiagonal Systems on Distributed-Memory Multicomputers[J].Chinese Journal of Computers,2001,24(2):173-178.
Authors:LUO Zhi-gang  LI Xiao-Mei
Abstract:The tridiagonal Toeplitz linear systems occur repeatedly in the solution of the implicit finite difference equations derived from linear first order hyperbolic equations, i.e. the Transport equation, under a variety of boundary conditions. Interest in fast direct methods for solving these kind of linear systems has long been a hot spot of research. A parallel algorithm for certain tridiagonal Toeplitz linear systems on distributed-memory multicomputers is presented. Derivation of the algorithm is introduced. The algorithm is based on the factorization of the coefficient matrix and the principle of ‘divide-and-conquer’ in designing parallel algorithms. Authors make full use of the special structure of the coefficient matrix. By using the customary nesting procedure, Horner's formula, authors avoid the necessary of quantities αi, (-α)i(i=2,3,…,m) and (αm)i, (-αm)i(i=2,3,…,p-1). This reduces the algorithm's redundancy computation caused by parallelization. The complexity of the algorithm is analyzed using LogP model. Its communication mechanism is very simple. The communication complexity is only related to p, the number of processors, and not related to n, the size of the matrix. The algorithm's parallel efficiency is high. The analysis of complexity and numerical experiments show that the algorithm's speedup satisfy: Sp(n)→p(n→+∞). This is the best a parallel algorithm can reach. The algorithm has been implemented on parallel computers. The results of numerical experiments about the algorithm on a distributed-memory multicomputer are presented in this paper.
Keywords:Toeplitz  tridiagonal system  parallel algorithm  distributed  memory  multicomputer
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号