
Design and Implementation of a Search Engine Spider Based on In-Depth Mining of Specialized Information

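Cite this article:	ZHAO Heng-yong, SHEN Jian, SHAN Lan. Design and Implementation of a Search Engine Spider Based on In-Depth Mining of Specialized Information[J]. Computer Engineering & Science, 2009, 31(6)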

Authors:	ZHAO Heng-yong, SHEN Jian, SHAN Lan

Affiliation:	School of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029 (all three authors)

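Abstract:	This paper designs and implements a web robot tailored to the characteristics of specialized full-text search engines. A two-dimensional vector workload queue provides site-based depth-first search, and a page-site weighted algorithm dynamically controls the processing time given to each site. The robot collects and processes specialization-related information from the Web in a centralized manner. The paper also discusses the robot's emphasis on a particular specialization and the feasibility of converting it into a web robot for a general-purpose full-text search engine.
Keywords:	search engine; web robot; workload queue; weighted algorithm; task balance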
Design and Implementation of a Full Text Search Engine Spider Based on Specific Information Mining

ZHAO Heng-yong, SHEN Jian, SHAN Lan. Design and Implementation of a Full Text Search Engine Spider Based on Specific Information Mining[J]. Computer Engineering & Science, 2009, 31(6)

Authors:	ZHAO Heng-yong, SHEN Jian, SHAN Lan

Affiliation:	School of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China

Abstract:	This paper designs and implements a spider for a full-text search engine based on specific information mining. The spider performs site-based depth-first search using a two-dimensional vector workload queue and uses a page-site weighted algorithm to dynamically control the time spent processing each site. It collects and processes specialization-related information from the Internet in a centralized manner. The paper also discusses the spider's bias toward particular specializations and the feasibility of converting it into a spider for a general-purpose full-text search engine.

Keywords:	search engine; spider; workload queue; weighted algorithm; task balance
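
The abstract names a two-dimensional workload queue, site-based depth-first search, and a page-site weighting that limits how long a site is processed, but it gives neither the queue layout nor the weighting formula. The following is only a minimal sketch of one way these ideas could fit together, not the authors' implementation. Everything in it is an assumption made for illustration: the names `FAKE_WEB`, `DOMAIN_TERMS`, `page_weight`, `fetch_fake`, and `crawl`, the in-memory stand-in for the network, the term-fraction relevance score, and the budget rule (each page costs 1.0 and earns back `reward` times its weight).

```python
from urllib.parse import urlparse

# Hypothetical in-memory "web" standing in for real HTTP fetches:
# URL -> (page text, outgoing links).
FAKE_WEB = {
    "http://chem.example/": ("polymer catalyst",
                             ["http://chem.example/a", "http://news.example/"]),
    "http://chem.example/a": ("catalyst reaction", ["http://chem.example/b"]),
    "http://chem.example/b": ("polymer synthesis", []),
    "http://news.example/": ("sports weather", ["http://news.example/x"]),
    "http://news.example/x": ("celebrity gossip", ["http://news.example/y"]),
    "http://news.example/y": ("polymer", []),
}
DOMAIN_TERMS = {"polymer", "catalyst", "reaction", "synthesis"}  # assumed vocabulary


def fetch_fake(url):
    """Return (text, links) for a URL; unknown URLs yield an empty page."""
    return FAKE_WEB.get(url, ("", []))


def page_weight(text):
    """Assumed relevance score: fraction of words that are domain terms."""
    words = text.split()
    return sum(w in DOMAIN_TERMS for w in words) / len(words) if words else 0.0


def crawl(seed, fetch, initial_budget=2.0, reward=2.0, cost=1.0):
    """Crawl one site at a time, depth-first, under a weight-driven budget."""
    queue = [[seed]]                       # outer index: site, inner list: its pending URLs
    site_row = {urlparse(seed).netloc: 0}  # site name -> row in the queue
    seen = {seed}
    collected = []
    row = 0
    while row < len(queue):
        budget = initial_budget            # every site starts with the same allowance
        stack = queue[row]                 # LIFO use gives depth-first order within a site
        while stack and budget > 0:
            url = stack.pop()
            text, links = fetch(url)
            weight = page_weight(text)
            budget += reward * weight - cost   # relevant pages extend the site's time
            if weight > 0:
                collected.append(url)
            for link in links:
                if link in seen:
                    continue
                seen.add(link)
                site = urlparse(link).netloc
                if site not in site_row:       # first link to a new site opens a new row
                    site_row[site] = len(queue)
                    queue.append([])
                queue[site_row[site]].append(link)
        row += 1                           # site exhausted or out of budget: move on
    return collected


if __name__ == "__main__":
    for url in crawl("http://chem.example/", fetch_fake):
        print(url)
```

In this sketch the on-topic site keeps earning budget and is crawled completely, while the off-topic site runs out of budget after two irrelevant pages and is abandoned before its deeper page is fetched. Links that point back to a site whose row has already been processed are queued but not revisited; a fuller design would need a policy for that case.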
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.
