Fast speech adaptation in linear spectral domain for additive and convolutional noise

Donghyun Kim, Dongsuk Yook

Research output: Contribution to conferencePaperpeer-review

Abstract

In this paper, we propose a transform-based adaptation technique for robust speech recognition in unknown environments. It uses maximum likelihood spectral transform (MLST) algorithm with additive and convolutional noise parameters. Previously many adaptation algorithms have been proposed in the cepstral domain. Though the cepstral domain may be appropriate for the speech recognition, it is difficult to handle environmental noise directly in the cepstral domain. Therefore our approach deals with such noise in the linear spectral domain in which speech is directly affected by the noise. As a result, we can use a small number of noise parameters for fast adaptation. The experiments evaluated on the FFMTIMIT corpus shows promising result with only a small number of adaptation data.

Original languageEnglish
Pages2557-2560
Number of pages4
Publication statusPublished - 2004
Event8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, Korea, Republic of
Duration: 2004 Oct 42004 Oct 8

Other

Other8th International Conference on Spoken Language Processing, ICSLP 2004
Country/TerritoryKorea, Republic of
CityJeju, Jeju Island
Period04/10/404/10/8

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Fast speech adaptation in linear spectral domain for additive and convolutional noise'. Together they form a unique fingerprint.

Cite this