首页 | 官方网站   微博 | 高级检索  
     

一种松耦合的生物医学命名实体识别算法
引用本文:胡俊锋,陈蓉,陈源,陈浩,于中华.一种松耦合的生物医学命名实体识别算法[J].计算机应用,2007,27(11):2866-2869.
作者姓名:胡俊锋  陈蓉  陈源  陈浩  于中华
作者单位:四川大学,计算机学院,成都,610064
基金项目:国家自然科学基金 , 教育部高等学校博士学科点专项科研基金 , 四川大学校科研和校改项目
摘    要:生物医学命名实体识别(Bio-NER)是生物医学文献挖掘利用的基础工作。针对目前Bio-NER存在的困难和问题,提出了松耦合的Bio NER算法LCA,该算法利用启发规则过滤器、词性模板匹配及改良的隐马尔科夫模型(HMM)识别生物医学命名实体。在GENIA corpus 3.02语料库上进行的实验表明,LCA可以达到80%的准确率和89%的召回率,优于相关工作中的结果。

关 键 词:生物医学命名实体  启发规则过滤器  词性模板匹配  词根匹配  HMM  松耦合算法
文章编号:1001-9081(2007)11-2866-04
收稿时间:2007-05-16
修稿时间:2007年5月15日

Loose coupling algorithm for biomedical named entity recognition
HU Jun-feng,CHEN Rong,CHEN Yuan,CHEN Hao,YU Zhong-hua.Loose coupling algorithm for biomedical named entity recognition[J].journal of Computer Applications,2007,27(11):2866-2869.
Authors:HU Jun-feng  CHEN Rong  CHEN Yuan  CHEN Hao  YU Zhong-hua
Abstract:The rapid development of biology and medicine in recent years leads to speedy accumulation of gigabyte biomedical information. How to use technical methods to mine and utilize the information becomes more and more important. Biomedical Named Entity Recognition (Bio-NER) is a basal work for mining and utilizing biomedical literatures. Concerning the difficulties and problems of the existing Bio-NER algorithms, a loose coupling algorithm named LCA for Bio-NER was proposed. The biomedical named entities were recognized based on heuristic rule filter, POS pattern matching pattern matching and modified Hidden Markov Model (HMM) approaches. The experimental results on GENIA corpus 3.02 show that the precision and recall of LCA are around 80% and 89% respectively, higher than the results of the related works.
Keywords:biomedical named entity  heuristic rule filter  Parts Of Speech (POS) Pattern Matching  etyma matching  Hidden Markov Model (HMM)  loose coupling algorithm
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号