Abstract
An acoustic model for an embedded speech recognition system must exhibit two desirable features; ability to minimize performance degradation in recognition while solving the memory problem under limited system resources. To cope with the challenges, we introduce the state-clustered tied-mixture (SCTM) HMM as an acoustic model optimization. The proposed SCTM modeling shows a significant improvement in recognition performance as well as a solution to sparse training data problem. Moreover, the state weight quantizing method achieves a drastic reduction in model size. In this paper, we describe the acoustic model optimization procedure for embedded speech recognition system and corresponding performance evaluation results.
Original language | English |
---|---|
Title of host publication | 8th International Conference on Spoken Language Processing, ICSLP 2004 |
Publisher | International Speech Communication Association |
Pages | 693-696 |
Number of pages | 4 |
Publication status | Published - 2004 |
Event | 8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, Korea, Republic of Duration: 2004 Oct 4 → 2004 Oct 8 |
Other
Other | 8th International Conference on Spoken Language Processing, ICSLP 2004 |
---|---|
Country/Territory | Korea, Republic of |
City | Jeju, Jeju Island |
Period | 04/10/4 → 04/10/8 |
ASJC Scopus subject areas
- Language and Linguistics
- Linguistics and Language