共查询到17条相似文献,搜索用时 109 毫秒
1.
自Alamouti提出空时分组码以来,空时分组码的构造得到广泛的研究。研究表明,复正交空时分组码当发射天线数大于3时,它的传输码率小于1。准正交空时分组码虽然能达到全速率,却牺牲了一定的分集增益。根据空时编码秩准则,通过对传统准正交空时分组码进行矩阵旋转处理,得到一种全速率满分集的空时分组码。数值仿真显示,这种全速率满分集空时分组码的性能较传统准正交空时分组码至少有4 dB的提高。 相似文献
2.
3.
依据空时分组码的复正交设计理论的推广,推导了基于坐标间交织正交设计(CIOD)的空时分组码设计方法,并根据Alamouti正交接收原理,对接收信号依次经过矢量正交变换、矢量虚部交换和矢量加权旋转处理,进行解调运算。最后在准静态平坦Rayleigh衰落信道中,对802.16d/e中使用的4发射天线基于CIOD的发射分集配置方案A进行了性能仿真,结果表明这些码具有较低的译码复杂性,与基于复数正交设计的空时分组码的译码复杂度相当,但性能要好。 相似文献
4.
5.
当发射天线数大于2时,复信号空时分组码不能实现满速率编码,可以通过星座旋转来设计发送矩阵,使系统同时获得满分集增益和最大的编码速率。通过对传统的旋转星座准正交空时分组码加以改进,使每根天线在不同时隙发射的信号属于不同星座,在保证满分集和最大的编码速率的同时,减小了码间干扰,更有利于译码器译码,提高了系统性能。 相似文献
6.
7.
8.
基于典型的多输入-多输出无线通信系统,研究了一种适用于任意发射天线数的高速率复正交空时分组码设计方法。证明了采用该算法所设计的复正交空时分组码能达到满发送分集度和未对信号进行线性处理时的最大码率(m+1)/2m(m为自然数)。在此基础上,对不同发送天线数(n=2,3,4,5,6,7,8)的复正交空时分组码进行了蒙特卡洛仿真,从而验证了算法的有效性,并得出发射天线数与系统误比特率性能呈正比,且随着发射天线数目的增加,误比特率性能改善的程度越来越小。 相似文献
9.
10.
基于Alamouti提出的BPSK调制下空时分组码在Rayleigh衰落信道中的简单分集方案。推导出多发射和多接收天线系统中正交空时分组码在Rician衰落信道的BPSK调制下的比特差错率的最小距离球界,并推广到在高阶调制下衰落信道中系统符号差错率的性能。仿真分析和比较了空时分组码的多天线系统中发射和接收天线分集增益,以及信道相关参数的变化对系统误比特性能的影响。 相似文献
11.
12.
13.
U. Ziegler 《Computer Physics Communications》2008,179(4):227-244
I report on a new version of the magnetohydrodynamics code NIRVANA1 which is targeted at the study of astrophysical problems. The new version allows for distributed-memory simulations supporting adaptive mesh refinement. Numerical algorithms include dissipative terms (viscosity, Ohmic diffusion, thermal heat conduction) in a conservative form. Domain decomposition is preferably block-wise in case of unigrid applications but adopts space-filling curve techniques for adaptive mesh applications with a hierarchical block-structured mesh. The code architecture facilitates workload balancing among processors for arbitrary mesh refinement depths maintaining intra-level data locality via space-filling curve mappings and, at the same time, ensuring inter-level data locality by applying a novel technique called block sharing. This way, it is demonstrated that comparable performance can be achieved for problems with locally highly refined grid. The data transfer between processors extensively utilizes the coarse-granularity concept of parallel computing and makes use of the MPI library. Conservation properties of the numerical method carry over to the parallel framework. In particular, the solenoidality condition for the magnetic field is preserved to roundoff precision applying the constrained transport machinery. This paper has its focus of discussion on implementation details related to the parallelization and on a code performance analysis. 相似文献
14.
15.
16.
Gabriell Araujo Dalvan Griebler Dinei A. Rockenbach Marco Danelutto Luiz G. Fernandes 《Software》2023,53(1):53-80
NAS Parallel Benchmarks (NPB) is a standard benchmark suite used in the evaluation of parallel hardware and software. Several research efforts from academia have made these benchmarks available with different parallel programming models beyond the original versions with OpenMP and MPI. This work joins these research efforts by providing a new CUDA implementation for NPB. Our contribution covers different aspects beyond the implementation. First, we define design principles based on the best programming practices for GPUs and apply them to each benchmark using CUDA. Second, we provide ease of use parametrization support for configuring the number of threads per block in our version. Third, we conduct a broad study on the impact of the number of threads per block in the benchmarks. Fourth, we propose and evaluate five strategies for helping to find a better number of threads per block configuration. The results have revealed relevant performance improvement solely by changing the number of threads per block, showing performance improvements from 8% up to 717% among the benchmarks. Fifth, we conduct a comparative analysis with the literature, evaluating performance, memory consumption, code refactoring required, and parallelism implementations. The performance results have shown up to 267% improvements over the best benchmarks versions available. We also observe the best and worst design choices, concerning code size and the performance trade-off. Lastly, we highlight the challenges of implementing parallel CFD applications for GPUs and how the computations impact the GPU's behavior. 相似文献
17.
Wang Xinran Pan Chi-Chun Liu Peng Zhu Sencun 《Dependable and Secure Computing, IEEE Transactions on》2010,7(1):65-79
We propose SigFree, an online signature-free out-of-the-box application-layer method for blocking code-injection buffer overflow attack messages targeting at various Internet services such as web service. Motivated by the observation that buffer overflow attacks typically contain executables whereas legitimate client requests never contain executables in most Internet services, SigFree blocks attacks by detecting the presence of code. Unlike the previous code detection algorithms, SigFree uses a new data-flow analysis technique called code abstraction that is generic, fast, and hard for exploit code to evade. SigFree is signature free, thus it can block new and unknown buffer overflow attacks; SigFree is also immunized from most attack-side code obfuscation methods. Since SigFree is a transparent deployment to the servers being protected, it is good for economical Internet-wide deployment with very low deployment and maintenance cost. We implemented and tested SigFree; our experimental study shows that the dependency-degree-based SigFree could block all types of code-injection attack packets (above 750) tested in our experiments with very few false positives. Moreover, SigFree causes very small extra latency to normal client requests when some requests contain exploit code. 相似文献