TY - GEN
T1 - Speaker adaptive confidence scoring using Bayesian combining
AU - Kim, Tae Yoon
AU - Ko, Hanseok
PY - 2005
Y1 - 2005
N2 - Bayesian combining of confidence measures is proposed for speech recognition. Bayesian combining is achieved by the estimation of joint pdf of confidence feature vector in correct and incorrect hypothesis classes. If the joint pdf in the two classes are correctly estimated, this method guarantees an optimal combining in the minimum Bayes risk sense. Investigating the distribution of confidence features, we found out that the pdfs are well estimated by Gaussian mixture model with full covariance matrix in combining small number of features. In addition, the adaptation of a confidence score by adapting the joint pdf is presented. The proposed methods reduced the classification error rate by 17% from the conventional single feature based confidence scoring method in isolated word Out-of-Vocabulary rejection test.
AB - Bayesian combining of confidence measures is proposed for speech recognition. Bayesian combining is achieved by the estimation of joint pdf of confidence feature vector in correct and incorrect hypothesis classes. If the joint pdf in the two classes are correctly estimated, this method guarantees an optimal combining in the minimum Bayes risk sense. Investigating the distribution of confidence features, we found out that the pdfs are well estimated by Gaussian mixture model with full covariance matrix in combining small number of features. In addition, the adaptation of a confidence score by adapting the joint pdf is presented. The proposed methods reduced the classification error rate by 17% from the conventional single feature based confidence scoring method in isolated word Out-of-Vocabulary rejection test.
UR - http://www.scopus.com/inward/record.url?scp=29144489617&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2005.1415054
DO - 10.1109/ICASSP.2005.1415054
M3 - Conference contribution
AN - SCOPUS:29144489617
SN - 0780388747
SN - 9780780388741
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - I77-I80
BT - 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05
Y2 - 18 March 2005 through 23 March 2005
ER -