Abstract
In recent times, there has been a growing interest in utilizing personalized large models on low-spec devices, such as mobile and CPU-only devices. However, utilizing a personalized large model in the on-device is inefficient, and sometimes limited due to computational cost. To tackle the problem, this paper presents the weights separation method to minimize on-device model weights using parameter-efficient fine-tuning methods. Moreover, some people speak multiple languages in an utterance, as known as code-switching, the personalized ASR model is necessary to address such cases. However, current multilingual speech recognition models are limited to recognizing a single language within each utterance. To tackle this problem, we propose code-switching speech recognition models that incorporate fine-tuned monolingual and multilingual speech recognition models. Additionally, we introduce a gated low-rank adaptation(GLoRA) for parameter-efficient fine-tuning with minimal performance degradation. Our experiments, conducted on Korean-English code-switching datasets, demonstrate that fine-tuning speech recognition models for code-switching surpasses the performance of traditional code-switching speech recognition models trained from scratch. Furthermore, GLoRA enhances parameter-efficient fine-tuning performance compared to conventional LoRA.
Original language | English |
---|---|
Title of host publication | 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 760-764 |
Number of pages | 5 |
ISBN (Electronic) | 9798350374513 |
DOIs | |
Publication status | Published - 2024 |
Externally published | Yes |
Event | 49th IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Seoul, Korea, Republic of Duration: 2024 Apr 14 → 2024 Apr 19 |
Publication series
Name | 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings |
---|
Conference
Conference | 49th IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 |
---|---|
Country/Territory | Korea, Republic of |
City | Seoul |
Period | 24/4/14 → 24/4/19 |
Bibliographical note
Publisher Copyright:© 2024 IEEE.
Keywords
- automatic speech recognition
- code-switching
- on-device
- parameter-efficient fine-tuning
- personalized
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Networks and Communications
- Signal Processing
- Media Technology
- Acoustics and Ultrasonics