Institutional Repository
| phoneme-level articulatory animation in pronunciation training | |
| Wang Lan; Chen Hui; Li Sheng; Meng Helen M. | |
| 2012 | |
| 发表期刊 | Speech Communication
![]() |
| ISSN | 1676393 |
| 卷号 | 54期号:7页码:845-856 |
| 摘要 | Speech visualization is extended to use animated talking heads for computer assisted pronunciation training. In this paper, we design a data-driven 3D talking head system for articulatory animations with synthesized articulator dynamics at the phoneme level. A database of AG500 EMA-recordings of three-dimensional articulatory movements is proposed to explore the distinctions of producing the sounds. Visual synthesis methods are then investigated, including a phoneme-based articulatory model with a modified blending method. A commonly used HMM-based synthesis is also performed with a Maximum Likelihood Parameter Generation algorithm for smoothing. The 3D articulators are then controlled by synthesized articulatory movements, to illustrate both internal and external motions. Experimental results have shown the performances of visual synthesis methods by root mean square errors. A perception test is then presented to evaluate the 3D animations, where a word identification accuracy is 91.6% among 286 tests, and an average realism score is 3.5 (1 = bad to 5 = excellent). © 2012 Elsevier B.V. All rights reserved. |
| 收录类别 | ei |
| 部门归属 | (1) Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China; (2) Chinese University of HongKong, China; (3) Institute of Software, Chinese Academy of Sciences, China |
| 语种 | 英语 |
| WOS记录号 | WOS:000305496400001 |
| 引用统计 | |
| 内容类型 | 期刊论文 |
| URI标识 | http://ir.iscas.ac.cn/handle/311060/14731 |
| 专题 | 中国科学院软件研究所 |
| 推荐引用方式 GB/T 7714 | Wang Lan,Chen Hui,Li Sheng,et al. phoneme-level articulatory animation in pronunciation training[J]. Speech Communication,2012,54(7):845-856. |
| APA | Wang Lan,Chen Hui,Li Sheng,&Meng Helen M..(2012).phoneme-level articulatory animation in pronunciation training.Speech Communication,54(7),845-856. |
| MLA | Wang Lan,et al."phoneme-level articulatory animation in pronunciation training".Speech Communication 54.7(2012):845-856. |
| 条目包含的文件 | ||||||
| 文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
| 1-s2.0-S016763931200(1311KB) | 开放获取 | 使用许可 | 请求全文 | |||
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论