Construction of a Web Information Database Based on XML

Cite this article:	HUANG Yu-yang, LI Hui-lun. Construction of a Web Information Database Based on XML[J]. Computer and Modernization, 2012(9): 222-224.

Authors:	HUANG Yu-yang, LI Hui-lun

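Affiliations:	[1] School of Bioscience and Bioengineering, South China University of Technology, Guangzhou, Guangdong 510006; [2] School of Life Sciences, Shandong University of Technology, Zibo, Shandong 255012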

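Abstract:	To extract data effectively from Web pages, this paper builds an XML-based database for collecting Web information. The open-source tool JTidy first cleans up each Web page. Taking advantage of XML's well-defined structure, the Dom4j toolkit then parses the resulting XML file. The hierarchy of tags in the XML document serves as the basis for how the data are stored. Finally, Hibernate persists the data in a database, which simplifies storage and querying.
Keywords:	XML; Web; information mining; database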
Construction of Web Database Based on XML

HUANG Yu-yang, LI Hui-lun. Construction of Web Database Based on XML[J]. Computer and Modernization, 2012(9): 222-224.

Authors:	HUANG Yu-yang, LI Hui-lun

Affiliation:	1. School of Bioscience and Bioengineering, South China University of Technology, Guangzhou 510006, China; 2. School of Life Sciences, Shandong University of Technology, Zibo 255012, China

Abstract:	In order to extract information and data from Web pages effectively, this paper constructs an XML-based database for collecting Web data. HTML documents are converted to XHTML with the open-source tool JTidy and then parsed with Dom4j. Data are extracted and stored according to the tag hierarchy of the XML documents. Finally, the data are persisted in the database with the ORM tool Hibernate.

Keywords:	XML; Web; data mining; database
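
The extraction step summarized in the abstract (walking the tag hierarchy of a tidied page and using it to organize the stored data) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the page has already been converted to well-formed XHTML (the job the abstract assigns to JTidy), it uses the JDK's built-in DOM parser in place of Dom4j so that it runs without external libraries, and it leaves out the Hibernate persistence step entirely. The class name `TagPathExtractor` and the `path=text` record format are hypothetical choices made for illustration.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper: turns well-formed XHTML into "tag path = text" records.
public class TagPathExtractor {

    /** Returns one "path=text" record per element that directly contains text. */
    public static List<String> extract(String xhtml) {
        try {
            DocumentBuilder builder =
                    DocumentBuilderFactory.newInstance().newDocumentBuilder();
            Document doc = builder.parse(new InputSource(new StringReader(xhtml)));
            List<String> records = new ArrayList<>();
            walk(doc.getDocumentElement(), "", records);
            return records;
        } catch (Exception e) {
            // Assumption: input that is not well-formed is rejected rather than repaired.
            throw new IllegalArgumentException("Input is not well-formed XML", e);
        }
    }

    // Depth-first walk; the accumulated path encodes the element's place in the hierarchy.
    private static void walk(Element element, String parentPath, List<String> records) {
        String path = parentPath + "/" + element.getTagName();
        NodeList children = element.getChildNodes();

        // Collect only the text that belongs directly to this element.
        StringBuilder ownText = new StringBuilder();
        for (int i = 0; i < children.getLength(); i++) {
            Node child = children.item(i);
            if (child.getNodeType() == Node.TEXT_NODE) {
                ownText.append(child.getNodeValue());
            }
        }
        String text = ownText.toString().trim();
        if (!text.isEmpty()) {
            records.add(path + "=" + text);
        }

        // Recurse into child elements in document order.
        for (int i = 0; i < children.getLength(); i++) {
            Node child = children.item(i);
            if (child instanceof Element) {
                walk((Element) child, path, records);
            }
        }
    }

    public static void main(String[] args) {
        String xhtml =
                "<html><body><table><tr><td>XML</td><td>2012</td></tr></table></body></html>";
        for (String record : extract(xhtml)) {
            System.out.println(record);
        }
    }
}
```

In a fuller pipeline, each record's path could be mapped to a column or entity field before being handed to a persistence layer. How the paper performs that mapping is not stated in the abstract.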
This article is indexed in CNKI, VIP, and other databases.
