基于K-means的朴素贝叶斯分类算法的研究 Research of Naive Bayesian Classification Based on K-means Method期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于K-means的朴素贝叶斯分类算法的研究

引用本文：	张亚萍,胡学钢.基于K-means的朴素贝叶斯分类算法的研究[J].微机发展,2007,17(11):33-35.

作者姓名：	张亚萍胡学钢

作者单位：	淮北煤炭师范学院物理系安徽淮北235000(张亚萍)，合肥工业大学计算机与信息学院安徽合肥230009(胡学钢)

基金项目：	安徽省自然科学基金资助项目(050420207)

摘要：	将K-means算法引入到朴素贝叶斯分类研究中,提出一种基于K-means的朴素贝叶斯分类算法。首先用K-means算法对原始数据集中的完整数据子集进行聚类,计算缺失数据子集中的每条记录与k个簇重心之间的相似度,把记录赋给距离最近的一个簇,并用该簇相应的属性均值来填充记录的缺失值,然后用朴素贝叶斯分类算法对处理后的数据集进行分类。实验结果表明,与朴素贝叶斯相比,基于K-means思想的朴素贝叶斯算法具有较高的分类准确率。
关键词：	朴素贝叶斯分类 k-means算法缺失数据
文章编号：	1673-629X(2007)11-0033-03
修稿时间：	2007年1月29日
Research of Naive Bayesian Classification Based on K-means Method

ZHANG Ya-ping,HU Xue-gang.Research of Naive Bayesian Classification Based on K-means Method[J].Microcomputer Development,2007,17(11):33-35.

Authors:	ZHANG Ya-ping HU Xue-gang

Affiliation:	ZHANG Ya-ping1,HU Xue-gang2

Abstract:	A naive Bayesian classification based on K-means method by introducing clustering algorithm into naive Bayesian classification.The similarity between every recorder in absent data subsets and the centers k cluster is calculated by clustering complete data subsets of initial data by K-means algorithm,then the recorder is set to the nearest cluster and the absent value of the record is filled by the corresponding attribute of the cluster,finally,the handled data set is clustered by naive Bayesian classification algorithm.The experiments show that naive Bayesian classification based on K-means method has the higher precise of clustering comparing with naive Bayesian classification.

Keywords:	naive Bayesian classification K-means algorithms missing data
本文献已被 CNKI 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏