Abstract
In this paper, we present the KU x Upstage team’s submission for the Special Task on Formality Control on Spoken Language Translation, which involves translating English into four languages with diverse grammatical formality markers. Our methodology comprises two primary components: 1) a language-specific data-driven approach, and 2) the generation of synthetic data through the employment of large-scale language models and empirically-grounded prompt engineering. By adapting methodologies and models to accommodate the unique linguistic properties of each language, we observe a notable enhancement in performance relative to the baseline, substantiating the heightened efficacy of data-driven approaches. Moreover, our devised prompt engineering strategy yields superior synthetic translation instances.
Original language | English |
---|---|
Title of host publication | 20th International Conference on Spoken Language Translation, IWSLT 2023 - Proceedings of the Conference |
Editors | Elizabeth Salesky, Marcello Federico, Marine Carpuat |
Publisher | Association for Computational Linguistics |
Pages | 420-432 |
Number of pages | 13 |
ISBN (Electronic) | 9781959429845 |
DOIs | |
Publication status | Published - 2023 |
Event | 20th International Conference on Spoken Language Translation, IWSLT 2023 - Hybrid, Toronto, Canada Duration: 2023 Jul 13 → 2023 Jul 14 |
Publication series
Name | 20th International Conference on Spoken Language Translation, IWSLT 2023 - Proceedings of the Conference |
---|
Conference
Conference | 20th International Conference on Spoken Language Translation, IWSLT 2023 |
---|---|
Country/Territory | Canada |
City | Hybrid, Toronto |
Period | 23/7/13 → 23/7/14 |
Bibliographical note
Publisher Copyright:© IWSLT 2023.All rights reserved.
ASJC Scopus subject areas
- Language and Linguistics
- Human-Computer Interaction
- Linguistics and Language