首页 | 官方网站   微博 | 高级检索  
     

一种基于条件熵的垃圾邮件过滤算法
引用本文:翟军昌,车伟伟.一种基于条件熵的垃圾邮件过滤算法[J].计算机与现代化,2014,0(2):129-132.
作者姓名:翟军昌  车伟伟
作者单位:[1]渤海大学,辽宁锦州121000 [2]沈阳大学,辽宁沈阳110044
基金项目:国家自然科学基金资助项目(61104106)
摘    要:在垃圾邮件过滤中,针对过滤器对合法邮件的误判问题,提出一种改进的垃圾邮件过滤算法。该算法对信息增益的条件熵估计方法作了改进,结合最小风险贝叶斯决策方法,在英文语料库上进行实验,并采用召回率和正确率对算法进行评价分析。实验结果表明,改进后的方法可提高过滤器对合法邮件的识别能力,降低对合法邮件的误判,减少用户的损失。

关 键 词:垃圾邮件  信息增益  条件熵  最小风险

A Spam Filtering Algorithm Based on Conditional Entropy
ZHAI Jun-chang,CHE Wei-wei.A Spam Filtering Algorithm Based on Conditional Entropy[J].Computer and Modernization,2014,0(2):129-132.
Authors:ZHAI Jun-chang  CHE Wei-wei
Affiliation:1. Bohai University, Jinzhou 121000, China; 2. Shenyang University, Shenyang 110044, China)
Abstract:In spare filtering, according to the filter misjudgment for legitimate mails, we put forward an improved spare filtering algorithm, which improves the conditional entropy estimation method of information gain. Combined with the Bayes minimum risk decision method, we analyze the algorithm through the recall and accuracy by carrying out an experiment on the English Corpus. Experimental results show that the improved algorithm can enhance the classification precision and reduce the misjudgment of le- gitimate emails, which can reduce the loss of users.
Keywords:spare  information gain  conditional entropy  minimum risk
本文献已被 维普 等数据库收录!
点击此处可从《计算机与现代化》浏览原始摘要信息
点击此处可从《计算机与现代化》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号