共查询到20条相似文献,搜索用时 93 毫秒
1.
2.
CASE系统必须以某种方式将程序代码与各种文档资料有机地联系起来,并能为各个成分保存完整的版本历史。超文本为此提供了良好的解决手段。本文主要描述如何在CASE系统中应用超文本技术,以提高CASE系统的性能。 相似文献
3.
数据库与超文本系统的连接 总被引:2,自引:0,他引:2
超文本/超媒休技术从八十年代以采发展很快,在理论上建立了多种超文本抽象模型,如HAM、Dexter、Trellis等,也先后出现了一批应用的超文木 相似文献
5.
数据库与超文本系统的连接 总被引:8,自引:0,他引:8
本文提出了数据库与超文本/超媒体系统连接的三条途径,比较了它们的优点和不足之处,并且介绍了我们所实现的一个Internet与数据库连接的系统。 相似文献
6.
7.
基于XML的非结构化文本数据转换研究与实现 总被引:1,自引:0,他引:1
采用XML作为存储数据的中间过程,通过两次调用Java程序,使非结构化的数据结构化因为XML作为一种半结构化的语言,适合数据存储与数据转换 而Java程序可以让基于XML的非结构化数据转换成结构化的数据,使其完全的结构化.使用该Java程序,可以完成类似的非结构化数据的转换工作. 相似文献
8.
9.
异构分布式数据库是近年来发展起来的一个研究领域,如何解决它的异构性和自治性仍然是尚未完全解决的难题。提出一种采用超文本概念集成异构分布式数据库的方法,建立了分布式超文本(DHT)的体系结构,并对其优越性和应用范围作了讨论。 相似文献
10.
随着Internet技术的发展,万维网上的文档数目成指数级增长。在如此浩瀚的信息库中,用户很难找到自己所需要的信息,如何自动且高效地处理这些海量文档信息成为了目前重要的研究课题。文章通过对抽取到的数据集文档中的标题,超连接和标记等超文本信息,以及文档内容本身分别建立分类模型。然后根据神经网络集成各个分类模型得出判别结果,提出了一种基于元信息的超文本集成分类算法,该算法能更好的综合利用超文本的多元结构化信息。实验结果表明,相对于单独利用某种超文本结构信息进行分类的方法。基于元信息的超文本集成分类算法具有更好的分类性能。 相似文献
11.
12.
孙斌 《计算机科学技术学报》2003,18(3):0-0
This paper presents an introduction to the initial design of the Structured Hypertext Transfer Protocol(STTP),a compatible extension to the HTTP.It includes a new message set for the control of resuource transmission,and the Structured Hypertext Markup Language(STML) for describing the structural information of Web pages.Experimental tests show that STTP can be significantly faster than HTTP,with the improvement of transmission time being around 70% to 400% and the same magnitude of packet savings,which is among the best performance improvement ever reported.The paper discusses the basic idea and major design considerations of these components,as well as a few important issues in developing STTP servers and clients. 相似文献
13.
A Study of Approaches to Hypertext Categorization 总被引:34,自引:2,他引:34
Yiming Yang Seán Slattery Rayid Ghani 《Journal of Intelligent Information Systems》2002,18(2-3):219-241
Hypertext poses new research challenges for text classification. Hyperlinks, HTML tags, category labels distributed over linked documents, and meta data extracted from related Web sites all provide rich information for classifying hypertext documents. How to appropriately represent that information and automatically learn statistical patterns for solving hypertext classification problems is an open question. This paper seeks a principled approach to providing the answers. Specifically, we define five hypertext regularities which may (or may not) hold in a particular application domain, and whose presence (or absence) may significantly influence the optimal design of a classifier. Using three hypertext datasets and three well-known learning algorithms (Naive Bayes, Nearest Neighbor, and First Order Inductive Learner), we examine these regularities in different domains, and compare alternative ways to exploit them. Our results show that the identification of hypertext regularities in the data and the selection of appropriate representations for hypertext in particular domains are crucial, but seldom obvious, in real-world problems. We find that adding the words in the linked neighborhood to the page having those links (both inlinks and outlinks) were helpful for all our classifiers on one data set, but more harmful than helpful for two out of the three classifiers on the remaining datasets. We also observed that extracting meta data from related Web sites was extremely useful for improving classification accuracy in some of those domains. Finally, the relative performance of the classifiers being tested provided insights into their strengths and limitations for solving classification problems involving diverse and often noisy Web pages. 相似文献
14.
利用数据库技术实现的可扩展的分类算法 总被引:9,自引:0,他引:9
重点研究将数据挖掘中的分类技术与数据库技术紧密结合的高效的可扩展的分类算法.提出一种基于分组记数技术构造分类器的方法,利用数据库系统的结构化查询语言来实现主要计算任务.为了提高算法的执行效率,还提出了优化策略和冗余规则的剪裁策略,并将分类规则的发现过程与相关属性的选择方法有机地结合在一起.使用这些方法和策略,分类算法能够从大规模数据集中快速地发现一组简洁的规则.除了具有与现有分类算法相当的准确度和较高的执行效率以外,该分类算法还具有良好的基于训练集元组个数和属性个数两方面的可扩展性和易于实现的特点. 相似文献
15.
提出了利用遗传算法在关系数据库海量的元组中选择最优元组进行水印嵌入,实现了水印短时间内的优化嵌入,大大提高了水印嵌入的效率。同时,采用纠错编码技术和投票选取机制来增加水印的鲁棒性。实验结果表明,这种关系数据库水印优化方案大大提高了水印嵌入的效率,并可以较好地协调水印的鲁棒性和数据库可用性之间的矛盾,实现了关系数据库水印的优化。 相似文献
16.
17.
18.
19.
In this article, we propose a database-internal representation for SGML-/HyTime-documents based on object-oriented database technology with the following features: documents of arbitrary type can be administered. The semantics of architectural forms is reflected by means of methods that are part of the database schema and by the database-internal representation of HyTime-specific characteristics. The framework includes mechanisms to ensure conformance of documents to the HyTime standard. Measures for improved performance of HyTime operations are also described. The database-internal representation of documents is a hybrid between a completely structured and a flat representation. Namely, the structured representation is better to support the HyTime semantics, and modifications of document components. On the other hand, most operations are faster for the flat representation, as will be shown. 相似文献
20.
Hypertext functionalities represent a form of the distilled wisdom of the hypermedia community. Even if they were introduced and advocated already in the pre-Web era, most of these functionalities are absent in current Web browsers. However, such functionalities can be very useful in some specific applicative fields, like for instance browsing complex software engineering documents, using standard WWW components. We propose to exploit the advent of XML as a basic infrastructure for describing software engineering hypertexts. In fact, we describe XMLC, a prototype of an XML browser that, given its modular architecture and general scope, can be seen as the basis for implementing sophisticated hypertext functionalities for software engineering documentation to be maintained and browsed on the Web. 相似文献