A new latent semantic analysis language model

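Cite this article:	Ren Jisheng, Wang Zuoying. A new latent semantic analysis language model[J]. High Technology Letters, 2005, 15(8): 1-5.

Authors:	Ren Jisheng, Wang Zuoying

Affiliation:	Department of Electronic Engineering, Tsinghua University, Beijing 100084

Funding:	Supported by the 863 Program (2001AA114071).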

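Abstract:	This paper proposes a clustering-based method for fast quantized representation of words and derives from it a prediction confidence for the latent semantic analysis language model. Combined with a trigram model through a newly proposed geometrically weighted static interpolation scheme, this yields a new latent semantic analysis language model, which is applied to Mandarin speech recognition. Experiments show that both its efficiency and its performance surpass those of the traditional latent semantic analysis language model based on singular value decomposition. Compared with the trigram model, the recognition error rate falls by about 3.6% to 7.1% in relative terms. The work also offers a new route to further improving latent semantic analysis language models through effective quantized representation of word pairs.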
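Keywords:	language model; speech recognition; N-gram; latent semantic analysis; singular value decomposition; Mandarin speech recognition; model performance; model prediction; interpolation scheme; 量化表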
Received:	2004-12-01
Revised:	2004-12-01
A new latent semantic analysis language model

Ren Jisheng, Wang Zuoying. A new latent semantic analysis language model[J]. High Technology Letters, 2005, 15(8): 1-5.

Authors:	Ren Jisheng, Wang Zuoying

Abstract:	This paper proposes a fast, clustering-based method for quantizing words, with which latent semantic analysis automatically uncovers the salient semantic relationships between words in a given training corpus. The resulting model is combined with a trigram model through a newly proposed static geometric weighting interpolation and applied to Mandarin speech recognition. Experiments show that it outperforms the traditional latent semantic analysis model based on singular value decomposition in both efficiency and performance. Compared with the trigram model, the relative reduction in recognition error rate is about 3.6% to 7.1%. The method also suggests a new way to improve latent semantic analysis models by quantizing word pairs effectively.

Keywords:	language model; speech recognition; N-gram; latent semantic analysis; singular value decomposition
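
The abstract mentions combining the latent semantic analysis model with a trigram model by static geometric weighting, but it does not give the formula. As a rough illustration only, the sketch below shows one common form of geometric (log-linear) interpolation with a fixed weight, followed by renormalization over the vocabulary. The function name `geometric_interpolation`, the parameter `weight`, and the toy probabilities are all assumptions made for this example; the paper's actual scheme, including how its prediction confidence enters, may differ.

```python
import math


def geometric_interpolation(p_trigram, p_lsa, weight):
    """Combine two word distributions over the same vocabulary as
    p(w) proportional to p_trigram(w)**(1 - weight) * p_lsa(w)**weight.

    Hypothetical helper for illustration; not the paper's implementation.
    Both inputs are assumed to be smoothed, so every probability is > 0.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    if p_trigram.keys() != p_lsa.keys():
        raise ValueError("both models must cover the same vocabulary")

    # Work in the log domain: a weighted geometric mean is a weighted sum of logs.
    log_scores = {
        word: (1.0 - weight) * math.log(p_trigram[word])
        + weight * math.log(p_lsa[word])
        for word in p_trigram
    }

    # Renormalize, subtracting the maximum first for numerical stability.
    max_score = max(log_scores.values())
    total = sum(math.exp(score - max_score) for score in log_scores.values())
    return {
        word: math.exp(score - max_score) / total
        for word, score in log_scores.items()
    }


# Toy next-word distributions for one history (made-up numbers).
p_trigram = {"bank": 0.5, "river": 0.3, "loan": 0.2}
p_lsa = {"bank": 0.2, "river": 0.1, "loan": 0.7}

combined = geometric_interpolation(p_trigram, p_lsa, weight=0.3)
for word, prob in sorted(combined.items()):
    print(f"{word}: {prob:.4f}")
```

In this sketch "static" is read as a single weight fixed in advance rather than one that varies with the history. Setting `weight=0.0` recovers the trigram distribution unchanged, and `weight=1.0` recovers the semantic one.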
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.
