首页 | 官方网站   微博 | 高级检索  
     

SURF算法在通用GPU和OpenCL的优化与实现
引用本文:王艳梅,史晓华,于湛麟.SURF算法在通用GPU和OpenCL的优化与实现[J].电子测试,2013(12):51-55,42.
作者姓名:王艳梅  史晓华  于湛麟
作者单位:[1]渤海大学高职学院,锦州121013 [2]北京航空航天大学计算机学院,北京100091
基金项目:国家自然科学基金资助项目(No.61073010).
摘    要:Speeded Up Robust Feature(SURF)算法是在计算机视觉领域得到广泛应用的一种图像兴趣点检测和匹配方法。开放计算语言(OpenCL)提供了一个在异构体系结构上,包括GPU,CPU及其他类型处理器,编写并行程序的框架。本文介绍了如何在通用GPU和OpenCL平台上,对SURF算法进行优化与实现。本文对其中一些优化方法,例如kernel线程的配置,局部内存的使用方法等,进行了详细的对比和讨论。最终实现的OpenCL版本的算法在NVidiaGTX260平台上获得了比原始的CPU版本在IntelDual—CoreE54002.7G处理器上至少21倍的加速。

关 键 词:SURF算法  OpenCL语言  GPGPU  kernel线程  匹配  图像

Optimization and Realization of SURF algorithm in GPU and OpenCL
Wang Yanmeil,Shi Xiaohua,Yu Zhanlin.Optimization and Realization of SURF algorithm in GPU and OpenCL[J].Electronic Test,2013(12):51-55,42.
Authors:Wang Yanmeil  Shi Xiaohua  Yu Zhanlin
Affiliation:1. Higher professional technical institute, Bohai university, Liaoning Jinzhou, 121013, China. 2. School of computer science and engineering, Beihang university, Beijing, 100091, China.)
Abstract:Speeded-Up Robust Feature(SURF)algorithm is widely used for image feature detecting and matching in computer vision area. Open Computing Language(OpenCL) is a framework for writing programs that execute across heterogeneous platforms consisting of CPUs, GPUs, and other processors. This paper introduces how to implement and optimize SURF algorithm on General Purpose GPU and OpenCL, and discusses some optimization methods such as configuring the kernel threads, using local memory in details. The final OpenCL version on Nvidia GTX 260 is more than 21 times faster than its original CPU version on Intel Dual-Core E5400 2.7G.
Keywords:SURF  OpenCL  GPGPU  kernel  match  image
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号