首页 | 官方网站   微博 | 高级检索  
     

基于Word2Vec的WordNet词语相似度计算研究
引用本文:陈丹华,王艳娜,周子力,赵晓函,李天宇,王凯莉. 基于Word2Vec的WordNet词语相似度计算研究[J]. 计算机工程与应用, 2022, 58(3): 222-229. DOI: 10.3778/j.issn.1002-8331.2009-0090
作者姓名:陈丹华  王艳娜  周子力  赵晓函  李天宇  王凯莉
作者单位:1.曲阜师范大学 网络空间安全学院,山东 曲阜 273100 2.曲阜师范大学 物理工程学院,山东 曲阜 273100
基金项目:山东省自然科学基金(ZR2017MD019);教育部高教司产学合作协同育人项目(201701020098);曲阜师范大学交叉学科研究项目(QFNUSKC291809120);赛尔网络下一代互联网技术创新项目(NGII20190516)。
摘    要:当前大部分WordNet词语相似度计算方法由于未充分考虑词语的语义信息和位置关系,导致相似度的准确率降低.为解决上述问题,提出了一种使用词向量模型Word2Vec计算WordNet词语相似度的新方法.在构建WordNet数据集时提出一种新形式,不再使用传统的文本语料库,同时提出信息位置排列方法对数据集加以处理.利用Wo...

关 键 词:词语相似度  WordNet  Word2Vec  同义词集标号

Research on WordNet Word Similarity Calculation Based on Word2Vec
CHEN Danhua,WANG Yanna,ZHOU Zili,ZHAO Xiaohan,LI Tianyu,WANG Kaili. Research on WordNet Word Similarity Calculation Based on Word2Vec[J]. Computer Engineering and Applications, 2022, 58(3): 222-229. DOI: 10.3778/j.issn.1002-8331.2009-0090
Authors:CHEN Danhua  WANG Yanna  ZHOU Zili  ZHAO Xiaohan  LI Tianyu  WANG Kaili
Affiliation:1.School of Cyber Science and Engineering, Qufu Normal University, Qufu, Shandong 273100, China2.School of Physical Engineering, Qufu Normal University, Qufu, Shandong 273100, China
Abstract:Currently, most WordNet word similarity calculation methods do not fully consider the semantic information and the location relationships of words, leading to the similarity accuracy reduction. To solve these problems, this paper proposes a new method to calculate the WordNet word similarity using the word vector model Word2Vec. A new form of the WordNet data set is proposed instead of using the traditional text corpus, and the information position arrangement method is used to process the data set. The vector representations are obtained by training the WordNet data set with the Word2Vec model. The word similarity calculation task is completed on the open word similarity evaluation sets like R&G-65, M&C-30 and MED38, and the Pearson correlation coefficient comparative experiment is conducted from multiple angels. Experimental results show that Pearson correlation coefficient computed by the similarity value calculated in this paper and the artificial judgement value is significantly improved.
Keywords:word similarity  WordNet  Word2Vec  synset label
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号