首页 | 官方网站   微博 | 高级检索  
     

一种基于数字图书馆的文本信息标引技术的改进研究
引用本文:王兰成,王立双.一种基于数字图书馆的文本信息标引技术的改进研究[J].现代图书情报技术,2006(2):5-9.
作者姓名:王兰成  王立双
作者单位:1. 南京政治学院上海分院军事信息管理系,上海,200433
2. 万方数据股份有限公司,北京,100044
摘    要:研究构建了具有位置信息控制的特义禁用词语义环境,进而运用于中文文献元数据CXMARC文本的自动标引和主题信息的数据挖掘,其中研究设计的预处理特义中文禁用字词切分算法SWF,能有效地减少领域的分词歧义性和缩短标引时间,从而改进了传统最大匹配MM算法的自动标引质量和效率。

关 键 词:自动标引  数字图书馆  中文信息处理  MARC文本
收稿时间:2005-09-13
修稿时间:2005-11-20

Research on a New Text Automatic Indexing Technology Based on Digital Library
Wang Lancheng,Wang Lishuang.Research on a New Text Automatic Indexing Technology Based on Digital Library[J].New Technology of Library and Information Service,2006(2):5-9.
Authors:Wang Lancheng  Wang Lishuang
Affiliation:1.Department of Information Management, Nanjing Political College PIA, Shanghai 200433, China;2 . Wanfang Data Co. , Ltd, Bering 100044, China
Abstract:The semantic environmental with special stop -words location information control has been studied and founded. This technology has been applied to Chinese metadata CXMARC text automatic indexing and the data mining of theme information. The algorithm of SWF that is used in the pretreatment special Chinese text automatic indexing can reduce the participle different meanings of a field efficiently and shorten indexing time. So tradition maximum matching algorithm has been improved of its quality and efficiency.
Keywords:Automatic indexing Digital library Chinese information processing MARC
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号