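A Multi-modal Game User Interface System Based on Video and Speech (基于视频与语音的多通道游戏用户界面系统)
Cite this article: Zeng Xiangyong, Lu Peng, Zhang Mandun, Wang Yangsheng. A Multi-modal Game User Interface System Based on Video and Speech[J]. 计算机辅助设计与图形学学报 (Journal of Computer-Aided Design & Computer Graphics), 2005, 17(10): 2353-2358
Authors: Zeng Xiangyong (曾祥永)  Lu Peng (鲁鹏)  Zhang Mandun (张满囤)  Wang Yangsheng (王阳生)
Affiliation: High-Tech Innovation Center, Institute of Automation, Chinese Academy of Sciences, Beijing 100080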
Funding: National "863" High Technology Research and Development Program of China (2003AA114020)
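Abstract (translated from the Chinese): A multi-modal game user interface system based on video and speech is designed and implemented to enhance the interactivity of computer games and the player's sense of immersion. The system creates and effectively integrates two interaction channels, video and speech, and comprises three modules: face model reconstruction, head pose estimation, and Chinese speech recognition. It can quickly build a personalized face model for a game character, and it lets the player control the character and the progress of the game in real time using head poses and speech commands. Test and application results show that the system is suitable for ordinary game players and real game environments.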

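Keywords (translated from the Chinese): user interface   multi-modal interaction   3D face modeling   head pose estimation   speech recognition   computer games
Received: 2004-07-16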
Revised: 2005-03-30

A Multi-modal Game Player Interface System Based on Video and Speech
Zeng Xiangyong,Lu Peng,Zhang Mandun,Wang Yangsheng. A Multi-modal Game Player Interface System Based on Video and Speech[J]. Journal of Computer-Aided Design & Computer Graphics, 2005, 17(10): 2353-2358
Authors:Zeng Xiangyong  Lu Peng  Zhang Mandun  Wang Yangsheng
Affiliation:Hitic Innovation Center, Institute of Automation, Chinese Academy of Sciences, Beijing 100080
Abstract:To enhance interactivity and player immersion in 3D computer games, we developed an easy-to-use and cost-effective interface system based on video and speech. The system integrates three modules: personalized 3D face modeling, head pose estimation, and Chinese speech recognition. The first module allows a player to generate his or her personalized character with minimal time and manual interaction; the second and third modules let the player control the character in real time using head poses and speech commands. We also developed a multiplayer game demo, 3D Chinese chess, as a test bed for the system. The test results demonstrate that the approaches and the system are feasible and practical.
Keywords:user interface   multi-modal interaction   3D face modeling   head pose estimation   speech recognition   computer games
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.