Department of Special Education and Communication Disorders
Document Type
Article
Date of this Version
Fall 9-13-2012
Citation
Wang, J., Samal, A., Green, J. R., & Rudzicz, F. (2012). Whole-word recognition from articulatory movements for silent speech interfaces, InterSpeech, Portland, OR.
Abstract
Articulation-based silent speech interfaces convert silently produced speech movements into audible words. These systems are still experimental, but they have significant potential for facilitating oral communication in persons with laryngectomy or speech impairments. In this paper, we report the results of a novel, real-time algorithm that recognizes whole words based on articulatory movements. This approach differs from prior work, which has focused primarily on phoneme-level recognition based on articulatory features. On a data set of 5,500 isolated word samples collected from ten speakers, our algorithm missed an average of 1.93 words in a sequence of twenty-five words, with an average latency of 0.79 seconds for each word prediction. The results demonstrate the effectiveness of our approach and its potential for building a real-time articulation-based silent speech interface for health applications.
Comments
Copyright (c) 2012 Jun Wang, Ashok Samal, Jordan R. Green, & Frank Rudzicz.