Gated Low-Rank Adaptation for Personalized Code-Switching Automatic Speech Recognition on the Low-Spec Devices

Gwantae Kim, Bokyeung Lee, Donghyeon Kim, Hanseok Ko

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In recent times, there has been a growing interest in utilizing personalized large models on low-spec devices, such as mobile and CPU-only devices. However, utilizing a personalized large model in the on-device is inefficient, and sometimes limited due to computational cost. To tackle the problem, this paper presents the weights separation method to minimize on-device model weights using parameter-efficient fine-tuning methods. Moreover, some people speak multiple languages in an utterance, as known as code-switching, the personalized ASR model is necessary to address such cases. However, current multilingual speech recognition models are limited to recognizing a single language within each utterance. To tackle this problem, we propose code-switching speech recognition models that incorporate fine-tuned monolingual and multilingual speech recognition models. Additionally, we introduce a gated low-rank adaptation(GLoRA) for parameter-efficient fine-tuning with minimal performance degradation. Our experiments, conducted on Korean-English code-switching datasets, demonstrate that fine-tuning speech recognition models for code-switching surpasses the performance of traditional code-switching speech recognition models trained from scratch. Furthermore, GLoRA enhances parameter-efficient fine-tuning performance compared to conventional LoRA.

Original languageEnglish
Title of host publication2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages760-764
Number of pages5
ISBN (Electronic)9798350374513
DOIs
Publication statusPublished - 2024
Externally publishedYes
Event49th IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Seoul, Korea, Republic of
Duration: 2024 Apr 142024 Apr 19

Publication series

Name2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings

Conference

Conference49th IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024
Country/TerritoryKorea, Republic of
CitySeoul
Period24/4/1424/4/19

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Keywords

  • automatic speech recognition
  • code-switching
  • on-device
  • parameter-efficient fine-tuning
  • personalized

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Signal Processing
  • Media Technology
  • Acoustics and Ultrasonics

Fingerprint

Dive into the research topics of 'Gated Low-Rank Adaptation for Personalized Code-Switching Automatic Speech Recognition on the Low-Spec Devices'. Together they form a unique fingerprint.

Cite this