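Web Document Classification Method Based on Fuzzy Correlation
Cite this article:LEI Jingsheng. Web Document Classification Method Based on Fuzzy Correlation[J]. Computer Engineering, 2005, 31(24): 13-14,17
Author:LEI Jingsheng
Affiliation:School of Information Science and Technology, Hainan University, Haikou 570228
Funding:Key project funded by the Ministry of Education; project funded by the Natural Science Foundation of Hainan Province
Abstract:Given the huge and ever-growing volume of information on the Internet, helping users obtain interesting and useful information has become an urgent problem in information retrieval. Because Web documents often have uncertain characteristics, fuzzy set theory can be used to model the uncertainty of the information retrieval process. This paper proposes a Web document classification method based on fuzzy correlation. Experimental results show that it achieves higher classification accuracy than a Web classification method based on the vector space model.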

Keywords:Text mining   Document classification   Information filtering
Article ID:1000-3428(2005)24-0013-02
Received:2004-11-07
Revised:2004-11-07

Classification Approach Based on Fuzzy Related Technology for Web Document
LEI Jingsheng. Classification Approach Based on Fuzzy Related Technology for Web Document[J]. Computer Engineering, 2005, 31(24): 13-14,17
Authors:LEI Jingsheng
Affiliation:School of Information Science and Technology, Hainan University, Haikou 570228
Abstract:With the explosive growth of information available on the WWW, users often find themselves overwhelmed by the sheer volume of material that may be interesting or useful to them. To alleviate this problem, an intelligent tool is needed to help users screen and filter for interesting and useful information. Web documents tend to have uncertain characteristics, which makes it possible to model the uncertainty of the retrieval process with fuzzy set theory. Motivated by this, the paper adopts a fuzzy correlation technique for classifying Web documents into a predefined set of categories. Experimental results show that the approach yields higher classification accuracy than classification based on the vector space model.
Keywords:Text mining   Document classification   Information filtering
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.