Abstract
The unseen noise signal is difficult to anticipate, and various approaches have been developed to address this issue. In our earlier work, we proposed a lightweight dynamic filter by splitting the filter into kernel and spatial parts. This small footprint model showed robust results in an unseen noisy environment. However, a simple pooling process for dividing the feature would limit the performance. In this paper, we propose an efficient dynamic filter to enhance the performance of the existing dynamic filter. Instead of the simple feature mean, we separate the input features as non-overlapping chunks, and separable convolutions take place for each feature direction. We also propose a dynamic filter based attention pooling method. These methods are applied to the kernel part in our previous work, and experiments are carried out for keyword spotting and speaker verification. We confirm that our proposed method performs better in unseen environments than the recently developed models.
Original language | English |
---|---|
Title of host publication | 2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 540-547 |
Number of pages | 8 |
ISBN (Electronic) | 9798350396904 |
DOIs | |
Publication status | Published - 2023 |
Event | 2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Doha, Qatar Duration: 2023 Jan 9 → 2023 Jan 12 |
Publication series
Name | 2022 IEEE Spoken Language Technology Workshop, SLT 2022 - Proceedings |
---|
Conference
Conference | 2022 IEEE Spoken Language Technology Workshop, SLT 2022 |
---|---|
Country/Territory | Qatar |
City | Doha |
Period | 23/1/9 → 23/1/12 |
Bibliographical note
Funding Information:This work was supported by Korea Environment Industry & Technology Institute(KEITI) through Exotic Invasive Species Management Program, funded by Korea Ministry of Environment(MOE) (2021002280004). Corresponding Author:Hanseok Ko.
Publisher Copyright:
© 2023 IEEE.
Keywords
- dynamic filter
- keyword spotting
- speaker verification
ASJC Scopus subject areas
- Computer Vision and Pattern Recognition
- Hardware and Architecture
- Media Technology
- Instrumentation
- Linguistics and Language