A Resource Aware MapReduce Based Parallel SVM for Large Scale Image Classifications
Authors: Wenming Guo, Nasullah Khalid Alham, Yang Liu, Maozhen Li, Man Qi
Affiliation: 1. School of Software Engineering, Beijing University of Post and Telecommunication, Beijing, China; 2. Nuffield Department of Clinical Laboratory Sciences, University of Oxford, Oxford, UK; 3. School of Electrical Engineering and Information, Sichuan University, Chengdu, China; 4. Department of Electronic and Computer Engineering, Brunel University London, Uxbridge, UK; 5. The Key Laboratory of Embedded Systems and Service Computing, Tongji University, Shanghai, China; 6. Department of Computing, Canterbury Christ Church University, Canterbury, UK
Abstract: Machine learning techniques have facilitated image retrieval by automatically classifying and annotating images with keywords. Among them, support vector machines (SVMs) are used extensively because of their generalization properties. However, SVM training is computationally intensive, especially when the training dataset is large. This paper presents RASMO, a resource-aware, MapReduce-based parallel SVM algorithm for large-scale image classification, which partitions the training dataset into smaller subsets and optimizes SVM training in parallel on a cluster of computers. A genetic-algorithm-based load balancing scheme is designed to optimize the performance of RASMO in heterogeneous computing environments. RASMO is evaluated in both experimental and simulation environments. The results show that the parallel SVM algorithm significantly reduces training time compared with the sequential SMO algorithm while maintaining a high level of classification accuracy.
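
The load balancing idea in the abstract can be illustrated with a small sketch. This is not the paper's genetic algorithm. It is a simpler proportional-split heuristic swapped in for illustration, and it assumes that each node's map-task time grows linearly with its chunk size (SVM training cost is generally superlinear, so real gains would differ). The function names, node speeds, and sample count are all hypothetical.

```python
from typing import List


def equal_split(n_samples: int, n_nodes: int) -> List[int]:
    """Split n_samples into n_nodes chunks of (nearly) equal size."""
    base, extra = divmod(n_samples, n_nodes)
    return [base + (1 if i < extra else 0) for i in range(n_nodes)]


def proportional_split(n_samples: int, speeds: List[float]) -> List[int]:
    """Split n_samples so each node's chunk is proportional to its speed."""
    total = sum(speeds)
    sizes = [int(n_samples * s / total) for s in speeds]
    # Hand out samples lost to rounding, fastest nodes first.
    leftover = n_samples - sum(sizes)
    order = sorted(range(len(speeds)), key=lambda i: speeds[i], reverse=True)
    for i in order[:leftover]:
        sizes[i] += 1
    return sizes


def makespan(sizes: List[int], speeds: List[float]) -> float:
    """Time until the slowest map task finishes (assumed linear cost model)."""
    return max(size / speed for size, speed in zip(sizes, speeds))


# Hypothetical heterogeneous cluster: relative processing speeds of 4 nodes.
speeds = [1.0, 2.0, 4.0, 1.0]
n = 8000  # hypothetical number of training samples

even = equal_split(n, len(speeds))
weighted = proportional_split(n, speeds)

print("equal split:       ", even, "makespan:", makespan(even, speeds))
print("proportional split:", weighted, "makespan:", makespan(weighted, speeds))
```

Under these assumptions the equal split is held back by the slowest nodes, while the speed-weighted split lets all map tasks finish at about the same time. A genetic algorithm, as the abstract describes, would search over such partitionings rather than compute one directly.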
This article is indexed in SpringerLink and other databases.