首页 | 官方网站   微博 | 高级检索  
     

基于用户兴趣分析的网页生命周期建模
引用本文:王勇,刘奕群,张敏,马少平,茹立云.基于用户兴趣分析的网页生命周期建模[J].中文信息学报,2008,22(2):76-80.
作者姓名:王勇  刘奕群  张敏  马少平  茹立云
作者单位:1. 智能技术与系统国家重点实验室,清华信息科学与技术国家实验室筹,清华大学 计算机系,北京 100084;
2. 搜狐公司研发中心,北京 100084
基金项目:国家973重点基础研究资助项目(2004CB318108),国家自然科学基金资助项目(60621062,60503064,60736044),国家863高科技计划资助项目(2006AA01Z141)
摘    要:网页在其生命周期内的活跃程度会随时间发生变化。有的网页只在特定的阶段有价值,此后就会过时。从用户的角度对网页的生命周期进行分析可以提高网络爬虫和搜索引擎的性能,改善网络广告的效果。利用一台代理服务器收集的网页访问量信息,我们对网页的生命周期进行了研究,给出了用户兴趣演变的模型。这个模型有助于更好地理解网络的组织与运行机理。

关 键 词:计算机应用  中文信息处理  用户行为分析  网页生命周期  网络日志挖掘  
文章编号:1003-0077(2008)02-0076-05
收稿时间:2007-05-04
修稿时间:2007-12-08

Modeling Lifetime of Web Pages Based on User Interest Analysis
WANG Yong,LIU Yi-qun,ZHANG Min,MA Shao-ping,RU Li-yun.Modeling Lifetime of Web Pages Based on User Interest Analysis[J].Journal of Chinese Information Processing,2008,22(2):76-80.
Authors:WANG Yong  LIU Yi-qun  ZHANG Min  MA Shao-ping  RU Li-yun
Affiliation:1. State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory
for Information Science and Technology, Department of Computer Science and Technology,
Tsinghua University, Beijing 100084, China; 2. Sohu Inc. R&D Center, Beijing 100084, China
Abstract:The activeness of a web page varies during its lifetime. Some pages are valuable only in a specific period, and then become obsolescent. Web page lifetime analysis from users’ perspective is important to enhance the performance of web crawlers and search engines, and to improve the efficiency of web advertising. With page view data collected by a proxy server, we were able to perform large scale analysis in web page lifetime. A model is given to describe user interest evolution based on an experiment conducted with the page view data of more than 36 000 000 web pages for two months. The model is the foundation to better understand how the web is organized and operates.
Keywords:computer application  Chinese information processing  user behavior analysis  web page lifetime  web log mining
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号