Abstract
In recent times, there has been a growing interest in utilizing personalized large models on low-spec devices, such as mobile and CPU-only devices. However, utilizing a personalized large model in the on-device is inefficient, and sometimes limited due to computational cost. To tackle the problem, this paper presents the weights separation method to minimize on-device model weights using parameter-efficient fine-tuning methods. Moreover, some people speak multiple languages in an utterance, as known as code-switching, the personalized ASR model is necessary to address such cases. However, current multilingual speech recognition models are limited to recognizing a single language within each utterance. To tackle this problem, we propose code-switching speech recognition models that incorporate fine-tuned monolingual and multilingual speech recognition models. Additionally, we introduce a gated low-rank adaptation(GLoRA) for parameter-efficient fine-tuning with minimal performance degradation. Our experiments, conducted on Korean-English code-switching datasets, demonstrate that fine-tuning speech recognition models for code-switching surpasses the performance of traditional code-switching speech recognition models trained from scratch. Furthermore, GLoRA enhances parameter-efficient fine-tuning performance compared to conventional LoRA.
| Original language | English |
|---|---|
| Title of host publication | 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 760-764 |
| Number of pages | 5 |
| ISBN (Electronic) | 9798350374513 |
| DOIs | |
| Publication status | Published - 2024 |
| Event | 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Seoul, Korea, Republic of Duration: 2024 Apr 14 → 2024 Apr 19 |
Publication series
| Name | 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings |
|---|
Conference
| Conference | 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 |
|---|---|
| Country/Territory | Korea, Republic of |
| City | Seoul |
| Period | 24/4/14 → 24/4/19 |
Bibliographical note
Publisher Copyright:© 2024 IEEE.
Keywords
- automatic speech recognition
- code-switching
- on-device
- parameter-efficient fine-tuning
- personalized
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Networks and Communications
- Signal Processing
- Media Technology
- Acoustics and Ultrasonics