基于SVD的唇动视觉语音特征提取技术 Visual-audio feature extraction of lip movements based on SVD期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于SVD的唇动视觉语音特征提取技术

引用本文：	张建明,陶宏,王良民,詹永照,宋顺林.基于SVD的唇动视觉语音特征提取技术[J].江苏大学学报(自然科学版),2004,25(5):426-429.

作者姓名：	张建明陶宏王良民詹永照宋顺林

作者单位：	江苏大学计算机科学与通讯工程学院,江苏,镇江,212013

基金项目：	国家自然科学基金资助项目(60273040)，江苏省高校自然科学基金资助项目(02KJB520003)

摘要：	唇动视觉语音特征提取是音视频驱动的人脸动画唇动表示和唇读研究的关键技术．首先针对彩色视频图像进行唇色增强，对增强后的灰度图像进行闽值分割，获取唇部包围框，并根据口型发音的视觉特征进行初分类；然后进行尺度与灰度归一化处理，对预处理后的图像提取奇异值特征；最后采用基于欧氏距离的模板匹配法对该奇异值特征所包含的视觉语音信息进行测试试验．结果表明该低维度特征包含了大量唇动视觉语音信息，可用于单个人在自然环境下的唇语口型识别．
关键词：	唇动特征提取 SVD 唇读
文章编号：	1671-7775(2004)05-0426-04
修稿时间：	2004年3月11日
Visual-audio feature extraction of lip movements based on SVD

ZHANG Jian-ming,TAO Hong,WANG Liang-min,ZHAN Yong-zhao,SONG Shun-lin.Visual-audio feature extraction of lip movements based on SVD[J].Journal of Jiangsu University:Natural Science Edition,2004,25(5):426-429.

Authors:	ZHANG Jian-ming TAO Hong WANG Liang-min ZHAN Yong-zhao SONG Shun-lin

Abstract:	Visual feature extraction of lip movement is a key issue in video and speech driven face (animation systems. )The approach is like this: firstly enhance chromatic video image, then segment enhanced gray images with thresholds, and finally obtain lip shapes. This method classifies the lip-shapes according to the visual features of pronunciations, regulates the dimensions and grayscales of lip images, and (extracts) features based on SVD from the preprocessed images. Finally, the template (matching algorithm based ) on Euclidean distance is applied. The results show that the character of lower dimensions includes a large number of visual speeches' information, so it can be applied in individual natural conditions.

Keywords:	lip movements feature extraction SVD lipreading
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏