Abstract
Translating imagined speech from human brain activity into voice is a challenging but compelling research problem that could provide a new means of human communication via brain signals. Efforts to reconstruct speech from brain activity have shown promise using invasive recordings of spoken speech, but have struggled to reconstruct imagined speech. In this paper, we propose NeuroTalk, which converts non-invasive brain signals of imagined speech into the user's own voice. Our model was trained on EEG recorded during spoken speech and then generalized to the domain of imagined speech, allowing a natural correspondence between the imagined speech and the voice that serves as ground truth. In our framework, an automatic speech recognition (ASR) decoder contributed to decomposing the phonemes of the generated speech, demonstrating the potential of voice reconstruction for unseen words. Our results suggest the potential of speech synthesis from human EEG signals, not only during spoken speech but also from the brain signals of imagined speech.
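The abstract names three pipeline elements: an EEG encoder/generator that maps brain signals to the user's voice, training on spoken-speech EEG as ground truth, and an ASR decoder providing phoneme-level supervision. Below is a minimal, hypothetical PyTorch sketch of such a pipeline. The architecture details (conv+GRU encoder, mel-spectrogram targets, a CTC phoneme head, the dimensions) are assumptions for illustration, not the authors' implementation.

```python
# Hypothetical sketch of a NeuroTalk-style pipeline. Everything below is an
# assumed design: the paper's actual architecture is not specified in this record.
import torch
import torch.nn as nn

class EEGToVoice(nn.Module):
    def __init__(self, n_channels=64, latent_dim=256, n_mels=80, n_phonemes=40):
        super().__init__()
        # EEG encoder: temporal convolution over channels, then a recurrent layer
        self.encoder = nn.Sequential(
            nn.Conv1d(n_channels, latent_dim, kernel_size=7, padding=3),
            nn.GELU(),
        )
        self.rnn = nn.GRU(latent_dim, latent_dim, batch_first=True, bidirectional=True)
        # Generator: projects the latent sequence to mel-spectrogram frames of the voice
        self.generator = nn.Linear(2 * latent_dim, n_mels)
        # ASR head: phoneme logits from generated mels, used for a CTC training signal
        self.asr_head = nn.Linear(n_mels, n_phonemes + 1)  # +1 for the CTC blank

    def forward(self, eeg):                       # eeg: (batch, channels, time)
        h = self.encoder(eeg).transpose(1, 2)     # (batch, time, latent)
        h, _ = self.rnn(h)                        # (batch, time, 2*latent)
        mel = self.generator(h)                   # (batch, time, n_mels)
        phoneme_logits = self.asr_head(mel)       # (batch, time, n_phonemes+1)
        return mel, phoneme_logits

# Example training step: reconstruction loss against mels of the user's own
# spoken voice (the ground truth the abstract mentions), plus a CTC loss so the
# ASR branch decomposes the generated speech into phonemes. Data here is dummy.
model = EEGToVoice()
eeg = torch.randn(2, 64, 400)                     # 2 trials, 64 channels, 400 samples
target_mel = torch.randn(2, 400, 80)              # ground-truth voice mels
mel, logits = model(eeg)
recon_loss = nn.functional.l1_loss(mel, target_mel)
log_probs = logits.log_softmax(-1).transpose(0, 1)   # (time, batch, classes) for CTC
targets = torch.randint(1, 41, (2, 12))           # dummy phoneme label sequences
ctc = nn.CTCLoss(blank=0)(log_probs, targets,
                          torch.full((2,), 400), torch.tensor([12, 12]))
loss = recon_loss + ctc
loss.backward()
```

Because CTC aligns phoneme sequences without frame-level labels, a head like this can in principle score generated speech for words never seen during training, which matches the unseen-word claim in the abstract.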
| Original language | English |
| --- | --- |
| Title of host publication | AAAI-23 Technical Tracks 5 |
| Editors | Brian Williams, Yiling Chen, Jennifer Neville |
| Publisher | AAAI Press |
| Pages | 6030-6038 |
| Number of pages | 9 |
| ISBN (Electronic) | 9781577358800 |
| Publication status | Published - 2023 Jun 27 |
| Event | 37th AAAI Conference on Artificial Intelligence, AAAI 2023 - Washington, United States. Duration: 2023 Feb 7 → 2023 Feb 14 |
Publication series
| Name | Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023 |
| --- | --- |
| Volume | 37 |
Conference
| Conference | 37th AAAI Conference on Artificial Intelligence, AAAI 2023 |
| --- | --- |
| Country/Territory | United States |
| City | Washington |
| Period | 2023 Feb 7 → 2023 Feb 14 |
Bibliographical note
Publisher Copyright: Copyright © 2023, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
ASJC Scopus subject areas
- Artificial Intelligence