ISCAS OpenIR
phoneme-level articulatory animation in pronunciation training
Wang Lan; Chen Hui; Li Sheng; Meng Helen M.
2012
SourceSpeech Communication
ISSN1676393
Volume54Issue:7Pages:845-856
English AbstractSpeech visualization is extended to use animated talking heads for computer assisted pronunciation training. In this paper, we design a data-driven 3D talking head system for articulatory animations with synthesized articulator dynamics at the phoneme level. A database of AG500 EMA-recordings of three-dimensional articulatory movements is proposed to explore the distinctions of producing the sounds. Visual synthesis methods are then investigated, including a phoneme-based articulatory model with a modified blending method. A commonly used HMM-based synthesis is also performed with a Maximum Likelihood Parameter Generation algorithm for smoothing. The 3D articulators are then controlled by synthesized articulatory movements, to illustrate both internal and external motions. Experimental results have shown the performances of visual synthesis methods by root mean square errors. A perception test is then presented to evaluate the 3D animations, where a word identification accuracy is 91.6% among 286 tests, and an average realism score is 3.5 (1 = bad to 5 = excellent). © 2012 Elsevier B.V. All rights reserved.
Indexed Typeei
Department(1) Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China; (2) Chinese University of HongKong, China; (3) Institute of Software, Chinese Academy of Sciences, China
Language英语
WOS IDWOS:000305496400001
Citation statistics
Content Type期刊论文
URIhttp://ir.iscas.ac.cn/handle/311060/14731
Collection中国科学院软件研究所
Recommended Citation
GB/T 7714
Wang Lan,Chen Hui,Li Sheng,et al. phoneme-level articulatory animation in pronunciation training[J]. Speech Communication,2012,54(7):845-856.
APA Wang Lan,Chen Hui,Li Sheng,&Meng Helen M..(2012).phoneme-level articulatory animation in pronunciation training.Speech Communication,54(7),845-856.
MLA Wang Lan,et al."phoneme-level articulatory animation in pronunciation training".Speech Communication 54.7(2012):845-856.
Files in This Item:
File Name/Size DocType Version Access License
1-s2.0-S016763931200(1311KB) 开放获取LicenseApplication Full Text
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Wang Lan]'s Articles
[Chen Hui]'s Articles
[Li Sheng]'s Articles
Baidu academic
Similar articles in Baidu academic
[Wang Lan]'s Articles
[Chen Hui]'s Articles
[Li Sheng]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Wang Lan]'s Articles
[Chen Hui]'s Articles
[Li Sheng]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.