
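A L2 cache partitioning mechanism for multithreaded array-based many-core processors (translated from the Chinese record)
Cite this article:CHEN Yi-fei, ZHU Lei, LI Hong-liang. A L2 cache partitioning mechanism for multithreaded array-based many-core processors[J]. 计算机工程与科学 (Computer Engineering & Science), 2019, 41(3): 400-408.
Authors:CHEN Yi-fei (陈逸飞)  ZHU Lei (朱蕾)  LI Hong-liang (李宏亮)
Affiliation:Jiangnan Institute of Computing Technology, Wuxi 214083, Jiangsu, China (all three authors)
Abstract:Array-based many-core processors are widely used in high-performance computing because of their high computational performance and energy efficiency. To build future high-performance computing systems, however, processors must overcome the severe "memory wall" challenge and the problem of coordinating cores. Cores in a typical array processor are mostly single-threaded to reduce overhead, which places high demands on memory access. This work introduces hardware simultaneous multithreading and, to address the low L2 cache utilization observed experimentally for a multithreaded single core, proposes a shared L2 cache partitioning mechanism. In simulation experiments, the optimized shared L2 cache partitioning mechanism lowers the L2 instruction cache miss rate by 18.59% and the data cache miss rate by 6.60%, and the overall CPI performance improvement reaches 10.1%.

Keywords:array-based many-core processor  simultaneous multithreading  shared L2 cache partitioning mechanism
Received:2018-08-30
Revised:2019-03-25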

A L2 cache partitioning mechanism for multithreaded array-based many-core processors
CHEN Yi-fei, ZHU Lei, LI Hong-liang. A L2 cache partitioning mechanism for multithreaded array-based many-core processors[J]. Computer Engineering & Science, 2019, 41(3): 400-408.
Authors:CHEN Yi-fei  ZHU Lei  LI Hong-liang
Affiliation:(Jiangnan Institute of Computing Technology,Wuxi 214083,China)
Abstract:Because of their high computational performance and energy efficiency, array-based many-core processors have been widely used in high-performance computing. To build future high-performance computing systems, processors must overcome the severe "memory wall" challenge and the core synergy problem. In a typical array-based many-core processor, each core adopts a single-threaded structure to reduce overhead, which places higher demands on memory access. We introduce hardware simultaneous multithreading into the single-core structure. To address the significantly low L2 cache utilization of a multithreaded single core, we present an L2 cache partitioning mechanism (thread-based cache partitioning) for array-based many-core processors. Experimental results show that, with this L2 cache partitioning mechanism, the L2 instruction cache miss rate decreases by 18.59%, the L2 data cache miss rate decreases by 6.60%, and CPI performance improves by 10.1%.
Keywords:array-based many-core processor  simultaneous multithreading  shared L2 cache partitioning mechanism  
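
The abstract names thread-based cache partitioning but does not say how partitions are sized or enforced. As a rough illustration of the general idea only, the sketch below models static way partitioning of a set-associative LRU cache: each thread may evict lines only from its own share of the ways. The class name, the way quotas, and the synthetic trace are all hypothetical and are not taken from the paper.

```python
from collections import OrderedDict


class PartitionedCache:
    """Toy set-associative LRU cache with ways statically split into partitions.

    Hypothetical model for illustration; not the mechanism from the paper.
    """

    def __init__(self, num_sets, partition_ways, thread_partition):
        self.num_sets = num_sets
        self.partition_ways = dict(partition_ways)      # partition -> number of ways
        self.thread_partition = dict(thread_partition)  # thread id -> partition
        assert all(w > 0 for w in self.partition_ways.values())
        # One LRU-ordered dict of resident tags per (set, partition).
        self.sets = [
            {p: OrderedDict() for p in self.partition_ways}
            for _ in range(num_sets)
        ]
        self.misses = {t: 0 for t in self.thread_partition}

    def access(self, thread, line_addr):
        """Return True on a hit; on a miss, evict only within the thread's partition."""
        part = self.thread_partition[thread]
        index = line_addr % self.num_sets
        tag = line_addr // self.num_sets
        lines = self.sets[index][part]
        if tag in lines:
            lines.move_to_end(tag)  # refresh LRU position
            return True
        self.misses[thread] += 1
        if len(lines) >= self.partition_ways[part]:
            lines.popitem(last=False)  # evict the partition's least recently used line
        lines[tag] = True
        return False


def demo_trace(rounds=10):
    """Thread 0 reuses one line; thread 1 streams four new lines per round."""
    trace = []
    for r in range(rounds):
        trace.append((0, 0))
        trace.extend((1, 100 + 4 * r + k) for k in range(4))
    return trace


def count_misses(cache, trace):
    for thread, addr in trace:
        cache.access(thread, addr)
    return cache.misses


trace = demo_trace()
# Fully shared 4-way cache: both threads compete for the same ways.
shared_misses = count_misses(
    PartitionedCache(1, {"all": 4}, {0: "all", 1: "all"}), trace)
# Same capacity, but thread 0 owns 1 way and thread 1 owns 3 ways.
split_misses = count_misses(
    PartitionedCache(1, {"p0": 1, "p1": 3}, {0: "p0", 1: "p1"}), trace)
print("shared:", shared_misses)
print("split: ", split_misses)
```

In this toy trace the streaming thread pushes the reused line out of the shared cache before every reuse, so thread 0 misses on each access; with the split, thread 0 misses only on its first access while thread 1 is unaffected. Real workloads are far less clear-cut, and the figures reported in the abstract come from the authors' own simulation, not from a model like this one.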
This article is indexed in 万方数据 (Wanfang Data) and other databases.