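Study and Application of a Vertical Search Engine Based on Lucene/Heritrix
Cite this article:Bai Kun, Geng Guohua. Study and Application of a Vertical Search Engine Based on Lucene/Heritrix[J]. Computer Applications and Software, 2009, 26(1).
Authors:Bai Kun  Geng Guohua
Affiliation:School of Information Science and Technology, Northwest University, Xi'an, Shaanxi 710127
Abstract:Lucene is a full-text indexing engine toolkit written in Java; it offers fast index access, supports multi-user access, and runs across platforms. Heritrix is an open-source web crawler developed in Java that users can use to fetch the resources they want from the web. This paper discusses the application of Lucene and Heritrix in building a vertical search engine.

Keywords:vertical search engine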

STUDY AND APPLICATION OF VERTICAL SEARCH ENGINE BASED ON LUCENE AND HERITRIX
Bai Kun, Geng Guohua. STUDY AND APPLICATION OF VERTICAL SEARCH ENGINE BASED ON LUCENE AND HERITRIX[J]. Computer Applications and Software, 2009, 26(1).
Authors:Bai Kun  Geng Guohua
Affiliation:College of Information Science and Technology, Northwest University, Xi'an 710127, Shaanxi, China
Abstract:Lucene is a full-text indexing engine package written in Java. It offers fast index access, supports multi-user access, and can be used across platforms. Heritrix is an open-source web crawler developed in Java, which users can use to fetch resources from the Internet. This paper studies Lucene and Heritrix and analyzes their application in designing a vertical search engine.
Keywords:Lucene  Heritrix
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.