Achieving a reliable compact acoustic model for embedded speech recognition system with high confusion frequency model handling

Junho Park, Hanseok Ko

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)

Abstract

An acoustic model for an embedded speech recognition system must exhibit two desirable features; the ability to minimize the performance degradation in recognition, while solving the memory problem under the constraint of limited system resources. Moreover, for general speech recognition tasks, context dependent models such as state-clustered tri-phones are used to guarantee the high recognition performance of the embedded system. To cope with these challenges, we introduce the state-clustered tied-mixture (SCTM) HMM as a method of optimizing an acoustic model. The proposed SCTM modeling system offers a significant improvement in recognition performance, as well as providing a solution to sparse training data problems. Moreover, the state weight quantizing method achieves a drastic reduction in the size of the model. However, using models constructed only in this way is insufficient to improve the recognition rate in some tasks where a large mutual similarity exists, such as in the case of the Korean-digit recognition task. Hence, we also construct new dedicated HMM's for all or part of the Korean-digits that have exclusive states using the same Gaussian pool of previous tri-phone models. In this paper, we describe the acoustic model optimization procedure for embedded speech recognition systems and the corresponding performance evaluation results.

Original languageEnglish
Pages (from-to)737-745
Number of pages9
JournalSpeech Communication
Volume48
Issue number6
DOIs
Publication statusPublished - 2006 Jun

Keywords

  • Compact acoustic modeling
  • Embedded speech recognition system
  • Tied-mixture HMM

ASJC Scopus subject areas

  • Software
  • Modelling and Simulation
  • Communication
  • Language and Linguistics
  • Linguistics and Language
  • Computer Vision and Pattern Recognition
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Achieving a reliable compact acoustic model for embedded speech recognition system with high confusion frequency model handling'. Together they form a unique fingerprint.

Cite this