Web内容挖掘技术研究 Research on Web Content Mining期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Web内容挖掘技术研究

引用本文：	涂承胜,鲁明羽,陆玉昌.Web内容挖掘技术研究[J].计算机应用研究,2003,20(11):5-9,15.

作者姓名：	涂承胜鲁明羽陆玉昌

作者单位：	1. 重庆三峡学院计算机科学系,重庆404000;清华大学计算机科学与技术系智能技术与系统国家重点实验室,北京100084 2. 清华大学计算机科学与技术系智能技术与系统国家重点实验室,北京100084

基金项目：	国家自然科学基金重大项目(79990580),国家"973"重点基础研究发展项目(G1998030414)

摘要：	简要介绍了Web挖掘的概念、分类以及其功能,阐述了Web挖掘与传统数据挖掘以及Web信息检索之间的关系。给出了Web内容挖掘的不同分类方法、文本以及多媒体文本数据挖掘的定义、分类与应用。重点分析了Web文本挖掘的方法,包括文本的特征表示与抽取、文本的分类与聚类等,讨论了多媒体文本分类挖掘方法。
关键词：	Web挖掘 Web内容挖掘文本的分类文本聚类多媒体文本挖掘
文章编号：	1001-3695(2003)11-0005-05
Research on Web Content Mining

Abstract:	This paper briefly introduces the conception of web mining,including the taxonomy and function,and discusses the relationship between information mining and retrieval on the web,and the difference between web mining and data mining.Then definition and classifications and applications of web text data mining are given,including a taxonomy of content mining.The method of text mining on web are discussed in detail,including text categorization and text clustering,etc.It discusses multimedia text data categorization and its alteration

Keywords:	Web Mining Web Content Mining Text Categorization Text Clustering Multimedia Text Mining
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏