首页 | 官方网站   微博 | 高级检索  
     

基于Bayes参数估计的垃圾邮件过滤算法研究
引用本文:刘震,佘堃,周明天.基于Bayes参数估计的垃圾邮件过滤算法研究[J].计算机科学,2005,32(9):55-57.
作者姓名:刘震  佘堃  周明天
作者单位:电子科技大学计算机学院卫士通安全联合实验室,成都,610054
基金项目:国家863项目863-104-03-01课题支持.
摘    要:朴素Bayes邮件过滤算法由于简单、易于理解,已被人们广泛接受,并应用到一些商用邮件系统当中.但面对目前垃圾邮件问题依然严重的现状,人们逐渐开始认识到采用简单的朴素Bayes邮件过滤算法已不能满足现有邮件过滤的性能要求.Bayes网络一直以来作为知识发现的一个重要分支,是人们研究的热点;邮件过滤问题也可以映射到一个Bayes决策网络模型中.通过构建针对邮件过滤的Bayes决策网络模型,并经过概率学习对关键节点作Bayes参数估计,可以实现邮件的概率分类发现.邮件样本试验结果表明新算法与朴素Bayes邮件过滤算法相比具有更快的收敛速度和更高的稳定性.

关 键 词:Bayes网络  参数估计  垃圾邮件  过滤算法

Research on Advanced Filtering Algorithm for Spam Email Based on Bayes Parameter Estimation
LIU Zhen,SHE Kun,ZHOU Ming-Tian.Research on Advanced Filtering Algorithm for Spam Email Based on Bayes Parameter Estimation[J].Computer Science,2005,32(9):55-57.
Authors:LIU Zhen  SHE Kun  ZHOU Ming-Tian
Affiliation:1.Westone United Lab. of College of Computer Science and Engineering, UESTC, Siehuan, Chengdu 610054
Abstract:Presently, naive Bayes algorithm for Email filtering has been accepted widely for its simplicity and plainness. At the same time, the algorithm has also been applied in many commercial Email system However, facing with the persisted severe situation of spam Email, people are realizing gradually that depending on such a simple algorithm can' t meet the practical needs for anti-spam. On the other hand, Bayes network, as an important branch in knowledge dis- covery area, has been researched and explored for a long time. The question of Email filter can be projected to a Bayes network model too. By doing so and utilizing Bayes parameter estimation for key node in the model, we can accomplish Email classification and discrimination on probabilistic condition. According to the testing results, new algorithm has much higher rate to converge and stabilization than the naive Bayes one.
Keywords:Spam email  Bayes network  Email filtering  Parameter estimation
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机科学》浏览原始摘要信息
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号