Similar Literature
Found 20 similar documents (search time: 156 ms)
1.
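黄亮, 赵泽茂, 梁兴开. 《计算机应用》 (Journal of Computer Applications), 2012, 32(6): 1662-1665
Div+CSS is widely used for Web page layout. Under this layout, many data records in a page are clustered at a single level as repeated structures. To mine data from Web pages more effectively, a new Web data mining algorithm is proposed: it converts the computation of tree edit distance into a string edit distance computation, improves the string edit distance algorithm, and uses string edit distance to evaluate tree similarity, thereby finding the repeated patterns in a page and extracting the data. Experiments on pages with different repeated-pattern characteristics show that the edit-distance-based algorithm can extract data not only from pages whose records share the root node and the top few levels, but also from pages whose records share bottom-level nodes.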
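The core idea of entry 1, scoring tree similarity through a string edit distance, can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say how trees are serialised or how the edit distance is improved, so the preorder flattening, the `(tag, children)` tuple representation, and the function names `flatten`, `edit_distance`, and `similarity` are all hypothetical, and the distance used is the plain Levenshtein distance.

```python
def edit_distance(a, b):
    """Plain Levenshtein distance between two sequences (strings or tag lists)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            # deletion, insertion, substitution (or match)
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def flatten(node):
    """Serialise a (tag, children) tree into a preorder list of tag names.

    Assumption: preorder serialisation is one simple way to turn a tree
    into a string-like sequence; the paper may use a different encoding.
    """
    tag, children = node
    out = [tag]
    for child in children:
        out.extend(flatten(child))
    return out


def similarity(tree1, tree2):
    """Similarity in [0, 1]; 1.0 means the flattened trees are identical."""
    s1, s2 = flatten(tree1), flatten(tree2)
    return 1 - edit_distance(s1, s2) / max(len(s1), len(s2))


# Two hypothetical data records that differ in one leaf node.
record_a = ("div", [("h2", []), ("p", []), ("span", [])])
record_b = ("div", [("h2", []), ("p", []), ("a", [])])
print(similarity(record_a, record_b))
```

Sibling subtrees whose pairwise similarity exceeds some threshold would then be treated as instances of one repeated pattern; the threshold itself is a tuning choice not given in the abstract.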

2.
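Reusing Web Pages with XML and XSLT   Total citations: 7, self-citations: 0, citations by others: 7
李伟, 郑宁. 《计算机应用》 (Journal of Computer Applications), 2004, 24(3): 103-105
Starting from an analysis of the structural characteristics of pages, this paper uses XML and XSLT to achieve Web page reuse. The method defines the composition of a page in a dynamic XML file and the layout of the page in an XSLT file, and combines the two through an XSLT transformation to produce the final page. Separating page layout from page composition makes both the layout and the component blocks reusable.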

3.
4.
Reaction time, eye movements, and errors were measured during visual search of Web pages to determine age-related differences in performance as a function of link size, link number, link location, and clutter. Participants (15 young adults, M = 23 years; 14 older adults, M = 57 years) searched Web pages for target links that varied from trial to trial. During one half of the trials, links were enlarged from 10-point to 12-point font. Target location was distributed among the left, center, and bottom portions of the screen. Clutter was manipulated according to the percentage of used space, including graphics and text, and the number of potentially distracting nontarget links was varied. Increased link size improved performance, whereas increased clutter and links hampered search, especially for older adults. Results also showed that links located in the left region of the page were found most easily. Actual or potential applications of this research include Web site design to increase usability, particularly for older adults.

5.
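In traditional Web sites, page layout is usually arranged by the page designers and rarely changes. To serve Web users better, this paper proposes cleaning Web log data to identify the pages each user visits within a session and then, based on the logical relationships among page contents and the pages users visit frequently, deriving a matrix of user interest in page content and interest matrices for each sub-item. Users are classified hierarchically according to the interest matrix, yielding for each...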

6.
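Web applications with similar functions often have very similar page styles and layouts. To address the high complexity and low efficiency of current Web page development, a method is proposed that mines the layout structure and style attributes of existing pages to automate Web page design. The method exploits the structural characteristics of Web page layout and works in stages: first, following the idea of page segmentation algorithms, it uses similarity computation to find similar code blocks; next, it parses the style files with the RoSunday method to quickly match the style sheets corresponding to each node set and builds a document model tree. Combining the resulting sub-modules enables automated page design. An application example shows that the method can design and generate pages dynamically and effectively improves Web page development efficiency.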

7.
Adherence to accessibility guidelines for Web pages does not necessarily guarantee a usable or satisfying Web experience for all persons with disabilities. The needs of many of these individuals fall outside the guidelines for accessible content that most Web authors take into consideration. Many of these users wish, for example, that they simply could ‘enlarge’ what is on a Web page. They also express the wish that pages would be ‘less confusing’. To meet these needs, Web browsers and various software applications provide for a variety of ways in which page presentations can be altered. The effects of these alterations often have unexpected consequences. Some designs accommodate these alterations better than others. This article discusses one such application that allows users to control features of Web page presentation and explores design features that facilitate such control.

8.
Computer Networks, 1999, 31(11-16): 1467-1479
When using traditional search engines, users have to formulate queries to describe their information need. This paper discusses a different approach to Web searching where the input to the search process is not a set of query terms, but instead is the URL of a page, and the output is a set of related Web pages. A related Web page is one that addresses the same topic as the original page. For example, www.washingtonpost.com is a page related to www.nytimes.com, since both are online newspapers. We describe two algorithms to identify related Web pages. These algorithms use only the connectivity information in the Web (i.e., the links between pages) and not the content of pages or usage information. We have implemented both algorithms and measured their runtime performance. To evaluate the effectiveness of our algorithms, we performed a user study comparing our algorithms with Netscape's 'What's Related' service (http://home.netscape.com/escapes/related/). Our study showed that the precision at 10 for our two algorithms is 73% better and 51% better than that of Netscape, despite the fact that Netscape uses both content and usage pattern information in addition to connectivity information.
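The abstract of entry 8 does not name its two algorithms, so the sketch below should not be read as either of them. It only illustrates one simple way to find related pages from link structure alone: co-citation counting, where pages that are frequently linked from the same parent pages as the input URL are taken as related. The `links` dictionary, the function name `related_by_cocitation`, and the example hosts other than the two named in the abstract are hypothetical.

```python
from collections import Counter


def related_by_cocitation(links, target, top_k=10):
    """Rank pages by how often they are linked from the same parent as `target`.

    links: dict mapping a page URL to the set of URLs it links to
           (assumed input format, not taken from the paper).
    """
    counts = Counter()
    for parent, children in links.items():
        if target in children:
            for sibling in children:
                if sibling != target:
                    counts[sibling] += 1
    return [page for page, _ in counts.most_common(top_k)]


# Toy link graph: two directory pages list both newspapers together.
links = {
    "news-directory.example": {"www.nytimes.com", "www.washingtonpost.com"},
    "media-list.example": {"www.nytimes.com", "www.washingtonpost.com", "weather.example"},
    "blog.example": {"www.washingtonpost.com", "recipes.example"},
}
print(related_by_cocitation(links, "www.nytimes.com"))
```

Only the link graph is consulted; page content and usage data play no role, which matches the constraint stated in the abstract.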

9.
Web crawlers are essential to many Web applications, such as Web search engines, Web archives, and Web directories, which maintain Web pages in their local repositories. In this paper, we study the problem of crawl scheduling that biases crawl ordering toward important pages. We propose a set of crawling algorithms for effective and efficient crawl ordering by prioritizing important pages with the well-known PageRank as the importance metric. In order to score URLs, the proposed algorithms utilize various features, including partial link structure, inter-host links, page titles, and topic relevance. We conduct a large-scale experiment using publicly available data sets to examine the effect of each feature on crawl ordering and evaluate the performance of many algorithms. The experimental results verify the efficacy of our schemes. In particular, compared with the representative RankMass crawler, the FPR-title-host algorithm reduces computational overhead by a factor as great as three in running time while improving effectiveness by 5% in cumulative PageRank.

10.
This paper presents an approach based on information retrieval and clustering techniques for automatically enhancing the navigation structure of a Web site for improving navigability. The approach increments the set of navigation links provided in each page of the site with a semantic navigation map, i.e., a set of links enabling navigating from a given page to other pages of the site showing similar or related content. The approach uses Latent Semantic Indexing to compute a dissimilarity measure between the pages of the site and a graph-theoretic clustering algorithm to group pages showing similar or related content according to the calculated dissimilarity measure. AJAX code is finally used to extend each Web page with an associated semantic navigation map. The paper also presents a prototype of a tool developed to support the approach and the results from a case study conducted to assess the validity and feasibility of the proposal.

11.
Given a user keyword query, current Web search engines return a list of individual Web pages ranked by their "goodness" with respect to the query. Thus, the basic unit for search and retrieval is an individual page, even though information on a topic is often spread across multiple pages. This degrades the quality of search results, especially for long or uncorrelated (multitopic) queries (in which individual keywords rarely occur together in the same document), where a single page is unlikely to satisfy the user's information need. We propose a technique that, given a keyword query, on the fly generates new pages, called composed pages, which contain all query keywords. The composed pages are generated by extracting and stitching together relevant pieces from hyperlinked Web pages and retaining links to the original Web pages. To rank the composed pages, we consider both the hyperlink structure of the original pages and the associations between the keywords within each page. Furthermore, we present and experimentally evaluate heuristic algorithms to efficiently generate the top composed pages. The quality of our method is compared to current approaches by using user surveys. Finally, we also show how our techniques can be used to perform query-specific summarization of Web pages.

12.
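Feature Extraction in Web Page Content Filtering   Total citations: 2, self-citations: 0, citations by others: 2
Harmful online content is increasingly rampant. To block pornographic Web pages, this paper uses statistics and analysis to extract features of such pages from four main aspects: page layout, Platform for Internet Content Selection (PICS) ratings, suggestive terms, and document content. Features from these four aspects can almost completely distinguish pornographic pages from non-pornographic ones, and the basic framework also applies to filtering other undesirable online content.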

13.
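PAGS: A Model-Based System for Automatic Web Page Generation   Total citations: 2, self-citations: 0, citations by others: 2
How to make page development, and with it Web application development as a whole, more efficient is a concern for Web application developers. Most existing page development tools evolved from page editors and customize pages through manual layout and data binding. They make individual pages easier to build, but the overall efficiency of a project does not improve substantially. PAGS is a model-based automatic page generation system. It builds models by classifying Web pages and page components and combines data dictionary and automatic code generation techniques to generate pages, lay them out, and bind data automatically, while also supporting graphical adjustment and modification of parts of a page. Engineering practice shows that PAGS supports the page development needs of various large Web application systems well and effectively improves Web page development efficiency.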

14.
Magel, K. Computer, 1997, 30(12): 131-132
Originally, HTML was intended to be a simple, rather limited language for describing primitive information layouts in World Wide Web pages. The first version of the HTML specification emphasized simplicity. The browser, not the HTML coder, controlled the actual page appearance, determining the appearance of headers, paragraphs, and other primitive layout elements. In recent years, however, complex features have been added to HTML, including frames, dynamic HTML, and cascading styles. These extensions have served designers well, giving them more control over the appearance of their pages. And today designers have pixel level control over the layout of every element. Unfortunately, these enhancements have ignored the Web user. None of the recent additions to HTML help the user to traverse the Web or to find suitable information quickly. The article describes methods which make the Internet easier to use, including: adding value to links; line thickness control; color usage; icons and pop-ups; and preselection.

15.
Web page design guidelines produce a pressure towards uniformity; excessive uniformity lays a Web page designer open to accusations of plagiarism. In the past, assessment of similarity between visual products such as Web pages has involved an uncomfortably high degree of subjectivity. This paper describes a method for measuring perceived similarity of visual products which avoids previous problems with subjectivity, and which makes it possible to pool results from respondents without the need for intermediate coding. This method is based on co‐occurrence matrices derived from card sorts. It can also be applied to other areas of software development, such as systems analysis and market research.
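As a rough illustration of the co-occurrence matrices mentioned in entry 15: each respondent sorts the items (for example, screenshots of Web pages) into groups, and the matrix counts how many times each pair of items lands in the same group across all respondents. The data layout and the function name `cooccurrence` are assumptions for this sketch; the abstract does not describe the exact procedure.

```python
from itertools import combinations


def cooccurrence(sorts, items):
    """Count, for every pair of items, how often they share a group.

    sorts: list of card sorts, one per respondent; each sort is a list of
           groups, and each group is a list of item names (assumed format).
    items: ordered list of all item names; fixes the row/column order.
    """
    index = {item: i for i, item in enumerate(items)}
    n = len(items)
    matrix = [[0] * n for _ in range(n)]
    for sort in sorts:
        for group in sort:
            for a, b in combinations(group, 2):
                matrix[index[a]][index[b]] += 1
                matrix[index[b]][index[a]] += 1  # keep the matrix symmetric
    return matrix


items = ["pageA", "pageB", "pageC"]
sorts = [
    [["pageA", "pageB"], ["pageC"]],   # respondent 1
    [["pageA", "pageB", "pageC"]],     # respondent 2
]
for row in cooccurrence(sorts, items):
    print(row)
```

Because the counts are simply summed, results from any number of respondents pool directly, which is consistent with the abstract's point about avoiding intermediate coding.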

16.
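Principle and Implementation of a WAP Page Conversion Proxy System   Total citations: 2, self-citations: 0, citations by others: 2
To bridge the language difference between the WAP application layer and the Web application layer, this paper proposes a design for a WAP page conversion proxy system, designs and implements an algorithm for converting HTML pages to WML, and proposes using a knowledge base to build an adaptive page conversion proxy based on Web content. Experiments show that the method resolves the language mismatch that mobile computing devices face when accessing the Web, and that it also handles the layout problems that can arise when ordinary Web pages are displayed on wireless devices.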

17.
Websites do not become usable just because their content is accessible. For people who are blind, the application of the W3C's Web Content Accessibility Guidelines (WCAG) often might not even make a significant difference in terms of efficiency, errors or satisfaction in website usage. This paper documents the development of nine guidelines to construct an enhanced text user interface (ETI) as an alternative to the graphical user interface (GUI). An experimental design with 39 blind participants executing a search and a navigation task on a website showed that with the ETI, blind users executed the search task significantly faster, committing fewer mistakes, rating it significantly better on subjective scales as well as when compared to the GUIs from other websites they had visited. However, performance did not improve with the ETI on the navigation task, the main reason presumed to be labeling problems. We conclude that the ETI is an improvement over the GUI, but that it cannot help in overcoming one major weakness of most websites: If users do not understand navigation labels, even the best user interface cannot help them navigate.

18.
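To address problems in the traditional PageRank algorithm, such as dividing link weight equally and ignoring user interest, a page ranking algorithm based on learning automata and user interest, LUPR, is proposed. In the proposed method, a learning automaton is assigned to each Web page; its function is to determine the weights of the hyperlinks between pages. Through further analysis of user behavior, users' browsing behavior is used to measure their interest in a page, which yields an interest factor. The algorithm computes each page's rank from the hyperlinks between pages and from page weights measured by user interest. Simulation experiments show that, compared with the traditional PageRank and WPR algorithms, the improved LUPR algorithm raises retrieval accuracy and user satisfaction to some extent.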
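The departure from classic PageRank described in entry 18 is that a page's rank is no longer split equally among its out-links. The sketch below shows only that part: a power-iteration PageRank in which each link carries a given weight. It is not the LUPR algorithm itself. How the learning automata set the weights and how the interest factor enters are not specified in the abstract, so the weights here are simply supplied as input, and the function name `weighted_pagerank` and its parameters are hypothetical.

```python
def weighted_pagerank(weights, damping=0.85, iterations=50):
    """PageRank where each out-link receives a share proportional to its weight.

    weights: dict mapping page -> dict of {target page: non-negative weight}.
             In LUPR these weights would come from learning automata and user
             interest; here they are assumed to be given.
    """
    pages = set(weights) | {t for out in weights.values() for t in out}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        new = {p: (1 - damping) / n for p in pages}
        for src in pages:
            out = weights.get(src, {})
            total = sum(out.values())
            if total == 0:
                # Dangling page: spread its rank uniformly over all pages.
                for p in pages:
                    new[p] += damping * rank[src] / n
            else:
                for dst, w in out.items():
                    new[dst] += damping * rank[src] * w / total
        rank = new
    return rank


# Page "a" links to "b" with three times the weight of its link to "c".
weights = {"a": {"b": 3.0, "c": 1.0}, "b": {"a": 1.0}, "c": {"a": 1.0}}
ranks = weighted_pagerank(weights)
print(sorted(ranks.items(), key=lambda kv: -kv[1]))
```

With equal weights on every link this reduces to ordinary PageRank, so the effect of any learned weighting can be compared against that baseline.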

19.
This paper presents a Page rank-based prefetching technique for accesses to Web page clusters. The approach uses the link structure of a requested page to determine the “most important” linked pages and to identify the page(s) to be prefetched. The underlying premise of our approach is that in the case of cluster accesses, the next pages requested by users of the Web server are typically based on the current and previous pages requested. Furthermore, if the requested pages have a lot of links to some “important” page, that page has a higher probability of being the next one requested. An experimental evaluation of the prefetching mechanism is presented using real server logs. The results show that the Page rank-based scheme does better than random prefetching for clustered accesses, with hit rates of 90% in some cases.

20.
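For a long time, book design in China has focused on cover design and neglected the layout of the contents page; contents-page design has been confined to its retrieval function, ignoring its aesthetic function and design substance. In fact, contents-page layout is very important: readers browse the contents page to get an overview of a book and to sense a magazine's character and style. Contents-page design should therefore shift from merely providing information to engaging the reader's visual perception. Drawing on the history of the contents page, this article attempts to identify the layout principles of magazine contents pages and to re-examine the value of contents-page layout for graphic design research.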
