首页 | 官方网站   微博 | 高级检索  
     

邻近类别分类在电子邮件过滤系统中的运用
引用本文:顾辉,李翔,薛质,李建华.邻近类别分类在电子邮件过滤系统中的运用[J].微机发展,2008,18(4):202-205.
作者姓名:顾辉  李翔  薛质  李建华
作者单位:上海交通大学 上海200030
摘    要:电子邮件作为互联网技术发展的产物,在给全球网民带来通讯便利的同时,正不可避免地遭遇有悖初衷的运用。最为突出的是随之产生的垃圾邮件像瘟疫一样蔓延,污染网络环境,占用大量传输、存储和运算资源,影响了网络的正常运行。垃圾邮件问题日益严重,受到研究人员的广泛关注。基于内容的过滤是当前解决垃圾邮件问题的主流技术之一。由于常用的特征字串匹配技术对垃圾邮件件的查准率已经不能满足日益提高的过滤系统用户的产品需求,随后引入邻近类别分类的方法,利用基于贝叶斯算法的电子邮件过滤系统,对色情垃圾邮件样本进行分析,可明显提高对垃圾邮件的查准率。

关 键 词:垃圾邮件  文本分类  贝叶斯算法  特征字串匹配  邻近类别分类
文章编号:1673-629X(2008)04-0202-04
修稿时间:2007年7月11日

Vicinity CategOry Classification in Email Filtering System
GU Hui,LI Xiang,XUE Zhi,LI Jian-hua.Vicinity CategOry Classification in Email Filtering System[J].Microcomputer Development,2008,18(4):202-205.
Authors:GU Hui  LI Xiang  XUE Zhi  LI Jian-hua
Affiliation:GU Hui, LI Xiang,XUE Zhi, LI Jian-hua (Shanghai Jiaotong University, Shanghai 200030, China)
Abstract:As the product of Internet technology,Email can provide convenient communication.On the other hand,some applications related to Email cause big trouble to the Internet.For example,the spam spreads like plague,polluting the net environment,occupying resources for transmission,storation and calculation, and influencing the normal operation of network.The volume of junk Email in Internet has grown tremendously in the past few years.And this problem attracts many researchers' attention.Because the finding spam on feature word matching technique can not satisfy the developing requirements of filtering system user.Referred the vicinity category classification to the Email filtering system that based on the Bayesian can filter pornographic counteraction and spam related to advertisement.Till this time,has some test results which showed the high call ration to pornographic spam.
Keywords:junk email  text classification  Bayesian  feature word matching  vicinity category classification
本文献已被 CNKI 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号