首页 | 官方网站   微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 328 毫秒
1.
Transputer及多机并行处理系统结构分析   总被引:1,自引:0,他引:1  
分析了Transputer和基于Transputer网络加速板的多机并行处理系统的结构,着重说明了Transputer如何支持并发进程的调度和通信。最后指出为获得Transputer多机系统并行处理的高性能,并行程序设计应避免并发过程死锁和选取最佳任务粒度。  相似文献   

2.
本文基于Multi-Transputer并行处理系统的结构特点,分析了Transputer网的通信通信及任务通信开销对作业运行效率的制约关系。结果表明:并行程序设计的作业分解方式、选取合理的任务粒度和Transputer网络拓扑是其影响通信开销和作业运行效率的主要原因。  相似文献   

3.
本文对基于Transputer的多机并行处理系统,提出在两种情况下任务粒度临界值的计算表达式,并给出一个在Transputer多机系统上的并行C语言编程示例,说明如何平衡任务并行性和通信开销以减少作业运行时间。  相似文献   

4.
Transputer并行处理系统的任务粒度临界值   总被引:1,自引:0,他引:1  
本文对基于Transputer的多机并行处理系统,提出在两种情况下任务粒度临界值的计算表达式,并给出一个在Transputer多机系统上的并行C语言编程示例,说明了如何平衡任务并行性和通信开销以减少作业运行时间。  相似文献   

5.
TDS是借助于微机环境提供给用户的一个用于开发Transputer并发程序的程序开发集成环境,本文介绍了一个笔者在该环境下利用二个Transputer和Occam语言实现的多Transputer的模拟环境。  相似文献   

6.
本文提出了一种容错的多Transputer的体系结构。该体系结构采用并行处理芯片Transputer作为基本处理单元,利用多Transputer并行处理系统的并行性、可拓扑性,用软件实现了带一个后援备份的三倍任务仿作的动态混合冗余,达到了高可靠性的目标。在保证高可靠性的前提下,该体系结构还使资源的消耗最少。该体系结构能连续容忍系统处理单元的单故障,并自动地启动后援备份进行重构,具有很高的可靠性、实  相似文献   

7.
本文首先描述了该分布式并行计算机系统的结构,基于Transputer的PLC是其中的核心单元。由于Transputer处理器的强大功能,该系统具有较强的并行处理能力和通信网络能力,以及较高的稳定性。  相似文献   

8.
本文针对PVM不支持Transputer的不足,介绍了基于TCP/IP的Transputer异构型分布式并行计算系统(T-DPCS)的软硬件架构。从通信协议的选择、Transputer共享支持软件、分布式协同软件架构和通信函数库的实现等4个方面详细地阐述了实现TDPCS的方案,并进行了原型实现。  相似文献   

9.
本文将提出一种用于Transputer多机系统的Hamilton回路连接三元树互连结构,并给出其构造算法。这种结构的Transputer多机系统在其通信服务程序的支持下,可以很好地适应扇型,星型,环和超环等多种并行结构的计算,并且系统的并行处理规模可扩充性强。  相似文献   

10.
本文讨论了三维物体隐面消除的并行处理问题。给出了一类MIMD并行深度缓冲器算法,并在多Transputer系统上实现。文中还对这些算法的效率进行了比较。  相似文献   

11.
针对遥感数据非结构化、有格式、单景数据量大、记录总量小、大部分处理过程后数据量不显著减少的特点和大规模遥感数据处理的迫切需求,基于已有硬件资源,设计和实现了集中式集群计算的多源定量遥感产品生产系统。系统为解决遥感科学工作者设计开发的算法进行并行计算的问题以及大规模数据管理的需求,设计了基于任务并行的处理系统和面向应用的并行文件系统。系统集成了遥感科学工作者开发的30多种原始数据的预处理算法和50多种多源定量遥感产品生产算法,解决了多源定量遥感产品规模化流程化按需生产的难题,并进行了产品生产,证明了系统在管理和处理大规模遥感数据时的可靠性和有效性。  相似文献   

12.
An increasing awareness of the need for high speed parallel processing systems for image analysis has stimulated a great deal of interest in the design and development of such systems. Efficient processing schemes for several specific problems have been developed providing some insight into the general problems encountered in designing efficient image processing algorithms for parallel architectures. However it is still not clear what architecture or architectures are best suited for image processing in general, or how one may go about determining those which are. An approach that would allow application requirements to specify architectural features would be useful in this context. Working towards this goal, general principles are outlined for formulating parallel image processing tasks by exploiting parallelism in the algorithms and data structures employed. A synchronous parallel processing model is proposed which governs the communication and interaction between these tasks. This model presents a uniform framework for comparing and contrasting different formulation strategies. In addition, techniques are developed for analyzing instances of this model to determine a high level specification of a parallel architecture that best ‘matches’ the requirements of the corresponding application. It is also possible to derive initial estimates of the component capabilities that are required to achieve predefined performance levels. Such analysis tools are useful both in the design stage, in the selection of a specific parallel architecture, or in efficiently utilizing an existing one. In addition, the architecture independent specification of application requirements makes it a useful tool for benchmarking applications.  相似文献   

13.
This paper presents an analytical performance prediction model and methodology that can be used to predict the execution time, speedup, scalability and similar performance metrics of a large set of image processing operations running on a p-processor parallel system. The model which requires only a few parameters obtainable on a minimal system can help in the systematic design, evaluation and performance tuning of parallel image processing systems. Using the model one can reason about the performance of a parallel image processing system prior to implementation. The method can also support programmers in detecting critical parts of an implementation and system designers in predicting hardware performance and the effect of hardware parameter changes on performance. The execution of parallel image processing operations was studied and operations were arranged in three main problem classes based on data locality and the communication patterns of the algorithms. The core of the method is the derivation of the overhead function, as it is the overhead that determines the achievable speedup. The overheads were examined and modelled for each class. The use of the method is illustrated by four class-representative image processing algorithms: image-scalar addition, convolution, histogram calculation and the Fast Fourier Transform. The developed performance model has been validated on a 16-node parallel machine and it has been shown that the model is able to predict the parallel run-time and other performance metrics of parallel image processing operations accurately.  相似文献   

14.
A polynomial approach to the representation of gray images for machine vision is described. An algebraic system is developed where a polynomial in two variables with real coefficients represents a gray image and it is shown that most of the standard image processing tasks like smoothing, edge detection, rotation and magnification can be done by operating certain polynomials called template polynomials. This method is also applied to connected component labelling, shape decomposition, template matching, and the skeletonization of a gray image without a priori thresholding. A technique is developed to decompose a template and do parallel processing.  相似文献   

15.
Visualization of large data sets with the Active Data Repository   总被引:1,自引:0,他引:1  
We implement ray-casting-based volume rendering and isosurface rendering methods using the Active Data Repository (ADR) for visualizing out-of-core data sets. We have developed the ADR object-oriented framework to provide support for applications that employ range queries with user-defined mapping and aggregation operations on large-scale multidimensional data. ADR targets distributed-memory parallel machines with one or more disks attached to each node. It is designed as a set of modular services implemented in C++, which can be customized for application-specific processing. The ADR runtime system supports common operations such as memory management, data retrieval, and scheduling of processing across a parallel machine  相似文献   

16.
The newly developed object-oriented database management systems provide rich facilities for the modeling and processing of structural as well as behavioral properties of complex application objects. However, due to their inherent generality, new functionalities to be added to these systems as they continue to evolve, and high performance demand in many application domains, efficient parallel algorithms and architectures would be needed to meet the performance requirement for processing large OODBs. In our previous work, we have shown that processing OODBs can be viewed as the manipulation of patterns of object associations. In this paper, we present several parallel, multiwavefront algorithms based on two approaches, i.e., identification and elimination approaches, to verify association patterns specified in queries. Both approaches allow more processors to operate concurrently on a query than the traditional tree-structured query processing approach, thus introducing a higher degree of parallelism in query processing. We present a graph model to transform the query processing problem into a graph problem. Based on the graph model, proofs of correctness of both approaches for tree-structured queries are given, and a combined approach for solving cyclic queries is also provided. We present a new data structure to represent associations between objects, parallel algorithms based on these approaches, and some evaluation results obtained from an actual implementation of these algorithms on an nCUBE 2 parallel computer.  相似文献   

17.
针对多片TMS320C64 DSP构成的多计算机体系结构的实时图像识别系统,设计并实现了面向并行图像处理的实时分布式操作系统PIPORTDOS(Parallel image processing-oriented real-time distributed operating system).PIPORTDOS基于微内核体系结构,包括硬件抽象层、系统核心层、分布式消息通信机制和系统服务层四个层次.其多任务内核实现了基于优先级的抢先式调度、任务闻的同步和通信原语、实时的中断处理以及面向应用的缓存管理机制.为了实现对分布式并行图像处理的支持,PIPORTDOS采用了基于消息传递(Message Passing)的方式,并在实现中充分考虑了上层图像处理算法的应用需求以及DSP的硬件功能.相关性能指标表明,本文设计实现的PIPORTDOS完全可以满足系统的强实时性要求.在功能上也能适应算法对不同并行结构的需求.  相似文献   

18.
This paper describes a general methodology for developing parallel image processing algorithms based on message passing for high resolution images (on the order of several Gigabytes). These algorithms have been applied to histological images and must be executed on massively parallel processing architectures. Advances in new technologies for complete slide digitalization in pathology have been combined with developments in biomedical informatics. However, the efficient use of these digital slide systems is still a challenge. The image processing that these slides are subject to is still limited both in terms of data processed and processing methods. The work presented here focuses on the need to design and develop parallel image processing tools capable of obtaining and analyzing the entire gamut of information included in digital slides. Tools have been developed to assist pathologists in image analysis and diagnosis, and they cover low and high-level image processing methods applied to histological images. Code portability, reusability and scalability have been tested by using the following parallel computing architectures: distributed memory with massive parallel processors and two networks, INFINIBAND and Myrinet, composed of 17 and 1024 nodes respectively. The parallel framework proposed is flexible, high performance solution and it shows that the efficient processing of digital microscopic images is possible and may offer important benefits to pathology laboratories.  相似文献   

19.
Management of large quantities of complex data is essential in many advanced application areas. Object-oriented (OO) database management system have been developed to effectively model and process the complex domain knowledge. They have been shown to outperform some existing relational systems. The existing implementations of OO database management systems attempt to improve the efficiency of OO queries by explicitly capturing the relationships among objects. However, the execution of complex queries involving the retrieval of objects from many classes and relationships among them causes the existing system to operate inefficiently. In this paper, we present parallel algorithms for the processing of queries against a large OO database. The algorithms are based on a closed model of query processing pattern-based access instead of the conventional value-based access. During processing, the algorithms avoid the execution of time-consuming join operations by making use of the explicitly stored object associations. Generation of large quantities of temporary data is avoided by marking objects using their identifiers and by employing a two-phase query processing strategy. A query is processed by concurrent multiple waves, thereby improving parallelism avoiding the complexities introduced in their sequential implementation. The correctness and the performance of the parallel algorithms have been tested and analyzed by running parallel programs on a 32-node transputer based parallel machine designed and developed at the IBM Research Center at Yorktown Heights, New York. Benchmark queries of different semantic complexities are generated, and their performance is analyzed for various data and query parameters  相似文献   

20.
熊晶  郭磊  高峰 《微机发展》2012,(1):79-82
为解决传统串行检索方式在面对海量数据进行处理和查询时效率低下的问题,提出基于JPPF的分布式并行检索策略。JPPF是一种基于Java的功能强大的并行处理框架,其并行环境易于搭建、简单实用。通过分析JPPF的框架结构和分布式工作流程,利用其在执行队列管理及负载均衡方面的优势,设计和实现了一个基于JPPF的检索系统。采用对比实验的方法,以数据库查询为例,比较了串行检索和JPPF并行检索的效率。实验结果表明在数据规模较大的情况下,JPPF并行方式较之串行方式可以显著提高检索效率。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号