基于肤色分割与改进VGG网络的手语识别 Sign Language Recognition Based on Skin Color Model and Improved VGG Network期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于肤色分割与改进VGG网络的手语识别

引用本文：	包嘉欣,田秋红,杨慧敏,陈影柔.基于肤色分割与改进VGG网络的手语识别[J].计算机系统应用,2020,29(6):47-55.

作者姓名：	包嘉欣田秋红杨慧敏陈影柔

作者单位：	浙江理工大学信息学院,杭州310018;浙江理工大学信息学院,杭州310018;浙江理工大学信息学院,杭州310018;浙江理工大学信息学院,杭州310018

基金项目：	国家自然科学基金(51405448); 浙江理工大学博士科研启动项目(11122932611817); 浙江省大学生科技成果推广项目(14530031661961); 浙江理工大学2019年国家级大学生创新创业训练计划(201910338012)

摘要：	传统的手语识别仅仅依靠人工选取的底层特征完成识别, 难以适应手语图像背景的多样性, 本文提出了一种综合多要素的手语肤色分割与改进VGG网络的手语识别方法. 对采集到的手语图像利用椭圆模型进行初步分割, 根据最大连通域排除背景中的类肤色区域并用质心定位的方法去除手部区域以外的肤色区域, 从而实现手语图像准确分割. 在原有VGG网络的基础上减少卷积及全连接的层数对VGG网络进行改进, 减少了所需的存储容量和参数数量. 将分割后的手语灰度图像作为网络的输入, 采用改进的VGG网络建立手语的识别模型. 通过比较不同结构的网络模型对手语图像的识别率, 表明改进的VGG网络能够有效进行特征学习, 对手语图像的平均识别率都达到97%以上.
关键词：	肤色分割手语识别 VGG 改进VGG网络识别模型
收稿时间：	2019/11/9 0:00:00
修稿时间：	2019/12/9 0:00:00
Sign Language Recognition Based on Skin Color Model and Improved VGG Network

BAO Jia-Xin,TIAN Qiu-Hong,YANG Hui-Min,CHEN Ying-Rou.Sign Language Recognition Based on Skin Color Model and Improved VGG Network[J].Computer Systems& Applications,2020,29(6):47-55.

Authors:	BAO Jia-Xin TIAN Qiu-Hong YANG Hui-Min CHEN Ying-Rou

Affiliation:	School of Information Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China

Abstract:	The traditional sign language recognition only relies on the underlying features selected manually, which is difficult to adapt to the diversity of sign language image background. A method of sign language recognition based on the multi-factor skin color segmentation and the improved VGG network is proposed in the study. The collected sign language images are initially segmented by an elliptic model. The skin color region is excluded according to the maximum connected domain, and the skin color regions outside the hand region is removed by centroid positioning method, so as to realize the accurate segmentation of sign language images. The VGG network is improved by reducing the number of convolution and full connection, which reduces the required storage capacity and the number of parameters. The gray-scale image of the segmented sign language is taken as the input of the network, and the improved VGG network is used to establish the recognition model of sign language. By comparing the different structure of the network model of sign language recognition rate of the image, show that the improved VGG networks can effectively study characteristics, the average image sign language recognition rate is above 97%.

Keywords:	skin color segmentation sign language recognition VGG improved VGG network recognition model
本文献已被万方数据等数据库收录！
	点击此处可从《计算机系统应用》浏览原始摘要信息
	点击此处可从《计算机系统应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏