ADAPTATION TO ENVIRONMENT AND SPEAKER USING MAXIMUM LIKELIHOOD NEURAL NETWORKS

Dong Suk Yuk, James Flanagan, Mahesh Krishnamoorthy, Krishna Dayanidhi

Research output: Contribution to conferencePaperpeer-review

2 Citations (Scopus)

Abstract

When there is a mismatch between training and testing conditions, statistical speech recognition algorithms suffer from severe degradation in recognition accuracy. The mismatch could be due to the interference from acoustical environments where systems are actually used or from speakers themselves. In this paper, a neural network based transformation approach is studied to handle the data distribution mismatches between training and testing conditions. The conditional probability that comes from hidden Markov model (HMM) based recognizers is used for the objective function of a neural network. It maximizes the likelihood of the data from a testing environment, and allows global optimization of the network when used with HMM-based recognizers. The new objective function can be used to transform speech feature vectors, or the mean vectors and covariance matrices of a recognizer. The proposed algorithm is evaluated on a noisy distant-talking version of the Resource Management database.

Original languageEnglish
Pages2531-2534
Number of pages4
Publication statusPublished - 1999
Externally publishedYes
Event6th European Conference on Speech Communication and Technology, EUROSPEECH 1999 - Budapest, Hungary
Duration: 1999 Sept 51999 Sept 9

Conference

Conference6th European Conference on Speech Communication and Technology, EUROSPEECH 1999
Country/TerritoryHungary
CityBudapest
Period99/9/599/9/9

Bibliographical note

Publisher Copyright:
© 1999 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999. All rights reserved.

ASJC Scopus subject areas

  • Computer Science Applications
  • Software
  • Linguistics and Language
  • Communication

Fingerprint

Dive into the research topics of 'ADAPTATION TO ENVIRONMENT AND SPEAKER USING MAXIMUM LIKELIHOOD NEURAL NETWORKS'. Together they form a unique fingerprint.

Cite this