共查询到20条相似文献,搜索用时 15 毫秒
1.
Japanese supercomputer development activities in the industry is reviewed. Architectures and softwares of Japanese supercomputer systems in the last 20 years are described and discussed. Special emphasis is put on the similarities and differences of the systems. 相似文献
2.
《Computer》2004,37(2):10-13
Although grid computing - which links disparate machines so that they can function as a distributed supercomputer - has become an increasingly popular focus of high-performance-computing research, the traditional supercomputing industry has languished until recently. This occurred largely because the easing of the Cold War in the early 1990s reduced government use of and spending on supercomputer technology. However, the industry is now reviving because of the development of low-cost supercomputer clusters that use commodity chips. Clusters of commodity computers linked by highspeed interconnect technologies, such as InfiniBand, have put supercomputers within reach of new users. Meanwhile, longtime supercomputer vendors like Cray are making a comeback with systems based on vector units, powerful CPUs dedicated to floating-point and matrix calculations. Vector machines satisfy the demand by governments and large industries for supercomputers that can conduct complex tasks such as nuclear-weapons simulations, pharmaceutical drug modeling, mining of large data sets, and geological analysis to find oil deposits. 相似文献
3.
4.
基于我国超级计算机平台,开展了大规模并行时域有限差分法(Finite-Difference Time-DomainFDTD)的性能和应用研究。在我国首台百万亿次"魔方"超级计算机、具有国产CPU的"神威蓝光"超级计算机和当前排名世界第一的"天河二号"超级计算机上就并行FDTD方法的并行性能进行了测试,并分别突破了10000 CPU核,100000 CPU核和300000 CPU核的并行规模。在不同测试规模下,该算法的并行效率均达到了50%以上,表明了本文并行算法具有良好的可扩展性。通过仿真分析多个微带天线阵的辐射特性和某大型飞机的散射特性,表明本文方法可以在不同架构的超级计算机上对复杂电磁问题进行精确高效电磁仿真。 相似文献
5.
《Computer》1980,13(11):37-46
Incredibly complex ballistic missile defense computational problems have been a rich source of technological advance. The BMD challenge will stimulate supercomputer development through the 1980's. 相似文献
6.
In this paper, we describe the results of several tests that check the accuracy of numerical computation on the Cray supercomputer in vector and scalar modes. The known tests were modified to identify the critical point where roundings start causing problems. After describing the tests, we present an interval library called libavi.a. It was developed in Fortran 90 on the Cray Y-MP2E supercomputer of UFRGS-Brazil. This library makes interval mathematics accessible to the Cray supercomputers users. It works with real and complex intervals and intervals matrices and vectors. The library allows overloading of operators and functions. It is organized in four modules: real intervals, interval vectors and matrices, complex intervals, and linear algebra applications. 相似文献
7.
HPCG基准测试程序是一种新的超级计算机排名度量标准.该测试基准主要用于衡量超级计算机解决大规模稀疏线性系统的能力,更贴近实际应用,近年来广受关注.基于国产超级计算机研究异构众核并行HPCG软件具有非常重要的意义,其不仅可以提升国产超级计算机HPCG的排名,还对很多应用提供了并行算法、优化技术等方面的参考.本文面向某国产复杂异构超级计算机开展研究,首先采用了分块图着色算法对HPCG进行并行,并提出一种适用于结构化网格的图着色算法,该算法并行性能高于传统的JPL、CC等算法,且着色质量高,运用于HPCG后,迭代次数减少了3次,整体性能提升了6%.本文还分析了复杂异构系统各个部件传输的开销,提出一套更适用于HPCG的任务划分方法,并从稀疏矩阵存储格式、稀疏矩阵重排、访存等角度开展了细粒度的优化.另外在多进程计算时,还采用了内外区划分算法将核心函数SpMV、SymGS中的邻居通信操作进行了隐藏.最终整机测试时,性能达到国产超级计算机峰值性能的1.67%,相比单节点,整机弱可扩展性并行效率达到了92%. 相似文献
8.
HPCG基准测试程序是一种新的超级计算机排名度量标准.该测试基准主要用于衡量超级计算机解决大规模稀疏线性系统的能力,更贴近实际应用,近年来广受关注.基于国产超级计算机研究异构众核并行HPCG软件具有非常重要的意义,其不仅可以提升国产超级计算机HPCG的排名,还对很多应用提供了并行算法、优化技术等方面的参考.面向某国产复杂异构超级计算机开展研究,首先采用了分块图着色算法对HPCG进行并行,并提出一种适用于结构化网格的图着色算法.该算法并行性能高于传统的JPL、CC等算法,且着色质量高,运用于HPCG后,迭代次数减少了3次,整体性能提升了6%.分析了复杂异构系统各个部件传输的开销,提出一套更适用于HPCG的任务划分方法,并从稀疏矩阵存储格式、稀疏矩阵重排、访存等角度开展了细粒度的优化.在多进程计算时,还采用内外区划分算法将核心函数SpMV、SymGS中的邻居通信操作进行了隐藏.最终整机测试时,性能达到了国产超级计算机峰值性能的1.67%,与单节点相比,整机弱可扩展性并行效率达到了92%. 相似文献
9.
《Computer》1980,13(11):71-83
Integral scalar processing and increased parallelism are major advances in supercomputer evolution. Three second-generation machines—BSP, Cyber 205, and Cray-1— are here described and compared. 相似文献
10.
Weixia XU Yutong LU Qiong LI Enqiang ZHOU Zhenlong SONG Yong DONG Wei ZHANG Dengping WEI Xiaoming ZHANG Haitao CHEN Jianying XING Yuan YUAN 《Frontiers of Computer Science》2014,8(3):367-377
With the rapid improvement of computation capability in high performance supercomputer system, the imbalance of performance between computation subsystem and storage subsystem has become more and more serious, especially when various big data are produced ranging from tens of gigabytes up to terabytes. To reduce this gap, large-scale storage systems need to be designed and implemented with high performance and scalability.MilkyWay-2 (TH-2) supercomputer system with peak performance 54.9 Pflops, definitely has this kind of requirement for storage system. This paper mainly introduces the storage system in MilkyWay-2 supercomputer, including the hardware architecture and the parallel file system. The storage system in MilkyWay-2 supercomputer exploits a novel hybrid hierarchy storage architecture to enable high scalability of I/O clients, I/O bandwidth and storage capacity. To fit this architecture, a user level virtualized file system, named H2FS, is designed and implemented which can cooperate local storage and shared storage together into a dynamic single namespace to optimize I/O performance in IO-intensive applications. The evaluation results show that the storage system in MilkyWay-2 supercomputer can satisfy the critical requirements in large scale supercomputer, such as performance and scalability. 相似文献
11.
A multiback-end database supercomputer, or MDBS, that embodies the architecture and performance of an experimental database computer and a number of database processors and their corresponding database stores, is discussed. Two studies are described. One focused on the design and architecture of the microprocessor-based MDBS, and the other on the performance and benchmark results for various loads and configurations 相似文献
12.
13.
14.
The use of a graphic technique to visualize chaos for a dual-parameter one-dimensional map is described. It involves plotting the Lyapunov exponent with both height and color in a three-dimensional map, as a function of the two parameters. Color is determined using a geographic lookup table. A graphics supercomputer can rotate the map in real time. The technique demonstrates graphically interesting behavior in chaotic systems 相似文献
15.
《Journal of Parallel and Distributed Computing》2005,65(1):74-84
The parallelization on a supercomputer of a one list dynamic programming algorithm using dominance technique and processor cooperation for the 0–1 knapsack problem is presented. Such a technique generates irregular data structure, moreover the number of undominated states is unforeseeable. Original and efficient load balancing strategies are proposed. Finally, computational results obtained with an Origin 3800 supercomputer are displayed and analyzed. To the best of our knowledge, this is the first time for which computational experiments on a supercomputer are presented for a parallel dynamic programming algorithm using dominance technique. 相似文献
16.
17.
18.
Tools specifically oriented toward flow analysis problems are described. The approach is oriented toward performing graphics calculations on the supercomputer and using high-end workstations as rendering engines to display flow fields generated by aircraft flight. A suite of software provides both display and interactive analysis capability. Also described is animation hardware featuring Winchester disk technology, which allows the storage of more than one gigabyte of data and supports editing, special effects, and video output 相似文献
19.
Several approaches to visualizing simulated data are described. It is noted that a centralized video animation facility represents the state of the art in scientific visualization, but is too expensive to replicate widely. Methods used at LANL for connecting supercomputers with display units, ranging from dumb terminals to powerful workstations, are also described. Of particular interest is the Scientific Visualization Workbench, an inexpensive and effective way of using `televisualization' to move data from supercomputer to user and to modify the user's display. The implication of higher speed networks are discussed 相似文献
20.
The architecture and design of the μVP, a single-chip vector coprocessor developed to meet the needs of high-performance processors, are described. The μVP is a supercomputer component implemented on a single large-scale-integrated (LSI) CMOS chip. With 206 MFLOPS single-precision and 106-MFLOPS double-precision performance at 50 MHz, the μVP offers a rate almost equivalent to that typical minisupercomputers 相似文献