
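A Disambiguation Algorithm for Online Food Safety Information
Cite as: LIU Jin-shuo, DENG Ying-ying, DENG Juan. A Disambiguation Algorithm for Online Food Safety Information[J]. Computer Science, 2015, 42(Z11): 7-9, 26.
Authors: LIU Jin-shuo, DENG Ying-ying, DENG Juan
Affiliations: Computer School, Wuhan University, Wuhan 430072; International School of Software, Wuhan University, Wuhan 430072; International School of Software, Wuhan University, Wuhan 430072
Funding: This work was supported by the National Natural Science Foundation of China (61303214).
Abstract: Taking online food safety information as its object of study, this paper proposes a disambiguation algorithm that resolves unclear references of proper nouns in the food safety domain. The algorithm builds on an improved TF-IDF feature selection algorithm and combines a hidden Markov model (HMM) with an SVM classifier to disambiguate proper nouns. The paper proposes LN-TF-IDF, a feature extraction algorithm that adds two weighting factors to TF-IDF. Experiments on 202831 texts, evaluated by the F1 score (the harmonic mean of precision and recall), show that the food safety disambiguation algorithm based on the improved TF-IDF outperforms the one based on traditional TF-IDF by 7.31% on average, and that its performance remains relatively stable across data sets crawled at different times.

Keywords: food safety; disambiguation; hidden Markov model; TF-IDF; support vector machine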
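
The evaluation criterion named in the abstract, the F1 score, is the harmonic mean of precision and recall. A minimal sketch of that formula follows; the function name `f1_score` is hypothetical, the input values in the usage note are illustrative only, and nothing here is taken from the paper's experiments.

```python
def f1_score(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall.

    Returns 0.0 when both inputs are zero, where the harmonic
    mean is otherwise undefined.
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

For example, `f1_score(0.5, 0.5)` returns `0.5`. Because the harmonic mean is pulled toward the smaller input, a classifier cannot score well on F1 by maximizing recall alone.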

Disambiguation Algorithm Design and Implementation of Food Safety Issues in Network
LIU Jin-shuo, DENG Ying-ying and DENG Juan. Disambiguation Algorithm Design and Implementation of Food Safety Issues in Network[J]. Computer Science, 2015, 42(Z11): 7-9, 26.
Authors: LIU Jin-shuo, DENG Ying-ying and DENG Juan
Affiliation: Computer School, Wuhan University, Wuhan 430072, China; International School of Software, Wuhan University, Wuhan 430072, China; International School of Software, Wuhan University, Wuhan 430072, China
Abstract: This article studies food safety information on the web and puts forward a disambiguation algorithm that resolves ambiguous references to domain-specific terms in the food safety field. The algorithm builds on an improved TF-IDF feature selection algorithm and combines a hidden Markov model (HMM) with an SVM classifier to disambiguate these terms. The paper also proposes a new feature extraction algorithm, LN-TF-IDF, which adds two weighting factors to traditional TF-IDF. Experiments on 202831 texts, with the F-measure as the evaluation criterion, show that the disambiguation algorithm based on the improved TF-IDF outperforms the one based on the traditional TF-IDF feature selection algorithm by 7.31% on average. The effect of the algorithm is also relatively stable across experimental data sets collected at different times.
Keywords: Food safety; Disambiguation; HMM; TF-IDF; SVM
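
The baseline that the abstract compares against is traditional TF-IDF. A minimal sketch of that baseline weighting is given here for orientation, assuming documents are already tokenized and using the common `tf * log(N / df)` form. It is not LN-TF-IDF: the abstract does not specify the two additional weighting factors, so they are not implemented. The function name `tf_idf` and the sample tokens are hypothetical.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Compute traditional TF-IDF weights for tokenized documents.

    docs: list of token lists. Returns one {term: weight} dict per
    document, where weight = (count / doc length) * log(N / df).
    """
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter()
    for doc in docs:
        df.update(set(doc))

    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Illustrative tokens only, not data from the paper.
sample = [["milk", "melamine", "milk"], ["milk", "powder"]]
sample_weights = tf_idf(sample)
```

In this sample, "milk" occurs in every document and so receives weight 0, while "melamine" occurs in only one and receives a positive weight. In a pipeline like the one the abstract describes, such weights would serve as features for the downstream classifier.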