Autoencoder based domain adaptation for speaker recognition under insufficient channel information

Suwon Shon, Seongkyu Mun, Wooil Kim, Hanseok Ko

    Research output: Contribution to journalConference articlepeer-review

    22 Citations (Scopus)

    Abstract

    In real-life conditions, mismatch between development and test domain degrades speaker recognition performance. To solve the issue, many researchers explored domain adaptation approaches using matched in-domain dataset. However, adaptation would be not effective if the dataset is insufficient to estimate channel variability of the domain. In this paper, we explore the problem of performance degradation under such a situation of insufficient channel information. In order to exploit limited in-domain dataset effectively, we propose an unsupervised domain adaptation approach using Autoencoder based Domain Adaptation (AEDA). The proposed approach combines an autoencoder with a denoising autoencoder to adapt resource-rich development dataset to test domain. The proposed technique is evaluated on the Domain Adaptation Challenge 13 experimental protocols that is widely used in speaker recognition for domain mismatched condition. The results show significant improvements over baselines and results from other prior studies.

    Original languageEnglish
    Pages (from-to)1014-1018
    Number of pages5
    JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
    Volume2017-August
    DOIs
    Publication statusPublished - 2017
    Event18th Annual Conference of the International Speech Communication Association, INTERSPEECH 2017 - Stockholm, Sweden
    Duration: 2017 Aug 202017 Aug 24

    Bibliographical note

    Funding Information:
    This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (No. 2017R1A2B4012720). This subject is supported by Korea Ministry of Environment (MOE) as “Public Technology Program based on Environmental Policy”.

    Publisher Copyright:
    Copyright © 2017 ISCA.

    Keywords

    • Autoencoder
    • Denoising autoencoder
    • Domain mismatch
    • Speaker recognition
    • Unsupervised domain adaptation

    ASJC Scopus subject areas

    • Language and Linguistics
    • Human-Computer Interaction
    • Signal Processing
    • Software
    • Modelling and Simulation

    Fingerprint

    Dive into the research topics of 'Autoencoder based domain adaptation for speaker recognition under insufficient channel information'. Together they form a unique fingerprint.

    Cite this