首页 | 官方网站   微博 | 高级检索  
     

一种基于词矢量的汉语语义量化模型
引用本文:陈清才,王晓龙.一种基于词矢量的汉语语义量化模型[J].计算机研究与发展,2001,38(2):207-212.
作者姓名:陈清才  王晓龙
作者单位:哈尔滨工业大学计算机科学与工程系
基金项目:国家自然科学基金! (6 99730 15 ),黑龙江省杰出青年基金资助
摘    要:通过建立基于词矢量的汉语语义量化模型来解决语义信息的自动获取及量化问题,描述了模型的建立方法及其在汉语词义排歧中的应用,最后通过构造伪词的方法对模型的语义辨识能力进行了评测。实验表明该语义量化模型具有很好的语义表示能力,并且由于模型的建立是通过对大规模生语料库的统计来完成的,避免了人工对词语语义进行量化时所需的庞大工作量,从而可以运用于许多与语义相关的自然语言处理任务中。

关 键 词:自然语言处理  词矢量  汉语语义量化模型  语料库  人工智能

A WORD VECTOR BASED QUANTIZATION MODEL OF CHINESE WORD SENSE
CHEN Qing-Cai,WANG Xiao-Long.A WORD VECTOR BASED QUANTIZATION MODEL OF CHINESE WORD SENSE[J].Journal of Computer Research and Development,2001,38(2):207-212.
Authors:CHEN Qing-Cai  WANG Xiao-Long
Abstract:A word vector based Chinese word sense quantization model is proposed, which can be used to solve problems such as auto acquisition and quantization of word sense information. The modeling method of the model and its applications in the Chinese word sense disambiguation are further described. And then, the model's ability to discriminate word sense is evaluated by constructing pseudoword. The experiment shows that this model has a good representation for word sense. As the construction of model is done via the statistic of large scale rough corpora, huge workload of manually quantization word senses is avoided. So this model can be applied to many word sense related NLP tasks.
Keywords:natural language processing  word sense quantization model  word sense disambiguation  word vector
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号