Reinforced blocking matrix with cross channel projection for speech enhancement

Inho Lee, Yoon Jongsung, Lee Yoonjae, Hanseok Ko

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    1 Citation (Scopus)

    Abstract

    In this paper, we propose a reinforced blocking matrix for the transfer function GSC (TF-GSC) that incorporates a cross-channel projection for speech enhancement. The TF-GSC proposed by Gannot was aimed at improving speech quality, but the desired speech signal becomes somewhat distorted because the reference signals produced by the blocking matrix still contain a significant amount of the desired signal. The proposed reinforcement of the blocking matrix removes the highly correlated components between the inter-channel reference signals using orthogonal projection, thereby completely eliminating the desired signal from them. Representative experiments show that the proposed scheme is effective, as demonstrated by improved average signal-to-noise ratio (SNR) and log spectral distance (LSD).
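
The orthogonal projection step mentioned in the abstract can be illustrated with a minimal, generic sketch. This is not the paper's algorithm: the abstract does not say how the projection is estimated (per frequency bin, per frame, or with which statistics), so the function names `inner` and `remove_projection`, the toy signals, and the choice to project one reference channel onto the other are all assumptions made for illustration only.

```python
def inner(a, b):
    """Inner product <a, b> = sum(a[i] * conj(b[i])); works for real or complex samples."""
    return sum(x * complex(y).conjugate() for x, y in zip(a, b))


def remove_projection(u, v, eps=1e-12):
    """Return u minus its orthogonal projection onto v (hypothetical helper).

    The result is orthogonal to v, i.e. the component of u that is
    correlated with v has been removed.
    """
    energy = inner(v, v).real
    if energy < eps:
        # v carries (almost) no energy, so there is nothing to project onto.
        return [complex(x) for x in u]
    coeff = inner(u, v) / energy
    return [x - coeff * y for x, y in zip(u, v)]


# Toy data (assumed): a leaked "desired" component shared by two reference channels.
leak = [1.0, -2.0, 0.5, 3.0]
noise = [0.3, 0.1, -0.2, 0.05]
ref_1 = [0.4 * s + n for s, n in zip(leak, noise)]  # reference with leakage plus noise
ref_2 = leak                                        # reference dominated by the leakage

residual = remove_projection(ref_1, ref_2)
```

After the call, `residual` has no component along `ref_2`, which can be checked by confirming that `inner(residual, ref_2)` is numerically zero. In a real multichannel system the references would be short-time spectra rather than short lists, and the projection coefficient would typically be estimated from averaged statistics; both points are outside what the abstract specifies.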

    Original language: English
    Title of host publication: Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010
    Publisher: International Speech Communication Association
    Pages: 957-960
    Number of pages: 4
    Publication status: Published - 2010

    Publication series

    Name: Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010

    Bibliographical note

    Funding Information:
    This research was supported by the Seoul R&BD Program (WR080951).

    Keywords

    • GSC (generalized sidelobe canceller)
    • HCI (human-computer interface)
    • Speech enhancement

    ASJC Scopus subject areas

    • Language and Linguistics
    • Speech and Hearing
    • Human-Computer Interaction
    • Signal Processing
    • Software
    • Modelling and Simulation
