Abstract
This paper proposes a novel filter bank composed of dominant Spectral Basis Vectors (SBVs) in a spectrogram. Spectral envelopes represented by the SBVs have shown to be excellent characteristic features for discriminating different acoustic events in noisy environment. Non-negative Matrix Factorization (NMF) and non-negative K-SVD (NKSVD) for part-based and holistic representations extract dominant SBVs from a spectrogram. The effectiveness of the proposed method is demonstrated on a database of real life recordings via experiments, and its robust performance is compared to conventional methods.
Original language | English |
---|---|
Pages (from-to) | 2002-2006 |
Number of pages | 5 |
Journal | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
Volume | 2015-January |
Publication status | Published - 2015 |
Event | 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015 - Dresden, Germany Duration: 2015 Sept 6 → 2015 Sept 10 |
Bibliographical note
Publisher Copyright:Copyright © 2015 ISCA.
Keywords
- Acoustic event recognition
- Dictionary learning
- Dominant spectral basis vector
- K-SVD
- Non-negative matrix factorization
- Robust feature extraction
- Spectral envelope
ASJC Scopus subject areas
- Language and Linguistics
- Human-Computer Interaction
- Signal Processing
- Software
- Modelling and Simulation