首页 | 官方网站   微博 | 高级检索  
     

基于Scikit Learn的SVM分类器算法优化
引用本文:左一鹏,陈辉.基于Scikit Learn的SVM分类器算法优化[J].上海电力学院学报,2020,36(3):259-264,306.
作者姓名:左一鹏  陈辉
作者单位:上海电力大学 自动化工程学院
基金项目:国家自然科学基金(51705304);上海市自然科学基金(16ZR1413400)。
摘    要:支持向量机(SVM)在高维度数据分类中表现出优异性能,可通过核函数对原始特征进行映射,解决原始空间线性不可分问题。但由于数据特征、维度不同,所以SVM在参数调整时,一般需要手动调整,效率较低且增加工作量。针对该问题,提出了一种基于Scikit Learn的SVM分类器参数调整优化方法。使用网格搜索对最优参数范围进行搜索,利用高斯径向基核函数进行参数调整,基于Python机器学习库Scikit Learn对不同参数、不同核函数的分类结果进行可视化观察,并在网格上显示其最优参数范围,寻找准确率高的参数分布。通过自动迭代的方式对参数进行更精确求解,设定相应值代入迭代计算。同时为防止陷入过拟合,设定最优参数邻域范围直接读取最优参数值。实验结果表明,所提出的方法可大量减少人工调参时间,且可以更精确地获得SVM的最优参数。

关 键 词:SVM分类器  机器学习  径向基核函数  网格搜索
收稿时间:2019/4/2 0:00:00

Improvement of SVM Classifier Algorithm Based on ScikitLearn
ZUO Yipeng,CHEN Hui.Improvement of SVM Classifier Algorithm Based on ScikitLearn[J].Journal of Shanghai University of Electric Power,2020,36(3):259-264,306.
Authors:ZUO Yipeng  CHEN Hui
Affiliation:School of Automation Engineering, Shanghai University of Electric Power, Shanghai 200090, China
Abstract:Support Vector Machine (SVM) classifier algorithm,which can map the original features by the kernel function to solve the linear inseparability problem in the original space,has excellent performance in high-dimensional data classification.However,due to the different data characteristics and dimensions,it is generally necessary to manually test the parameters of SVM with low efficiency and increased workload.To address this problem,this study proposes an optimization method for SVM classifier based on Scikit-learning.The optimal parameter range is searched by grid search method,and the classification results of different parameters and different kernel functions are visualized in Scikit-Learn of Python machine learning library.The optimal parameter range is displayed on the grid to find the parameter distribution with high accuracy.The parameters are solved more accurately by means of automatic iteration,and the corresponding values are set to be brought into the iteration calculation.Moreover,the neighborhood search of the optimal parameters is determined to choose the optimal parameters value directly,in order to avoid falling into over-fitting.Experimental results show that the time of manual parameter adjustment is reduced greatly and the accuracy of the optimal parameters obtained here is improved.
Keywords:SVM classifier  machine learning  radial basis function kernel  grid search
本文献已被 CNKI 等数据库收录!
点击此处可从《上海电力学院学报》浏览原始摘要信息
点击此处可从《上海电力学院学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号