Abstract
Despite the lack of invariance problem (the many-to-many mapping between acoustics and percepts), we experience phonetic constancy and typically perceive what a speaker intends. Models of human speech recognition have sidestepped this problem, working with abstract, idealized inputs and deferring the challenge of working with real speech. In contrast, automatic speech recognition powered by deep learning networks have allowed robust, real-world speech recognition. However, the complexities of deep learning architectures and training regimens make it difficult to use them to provide direct insights into mechanisms that may support human speech recognition. We developed a simple network that borrows one element from automatic speech recognition (long short-term memory nodes, which provide dynamic memory for short and long spans). This allows the network to learn to map real speech from multiple talkers to semantic targets with high accuracy. Internal representations emerge that resemble phonetically-organized responses in human superior temporal gyrus, suggesting that the model develops a distributed phonological code despite no explicit training on phonetic or phonemic targets. The ability to work with real speech is a major advance for cognitive models of human speech recognition.
Original language | English |
---|---|
Title of host publication | Proceedings of the 41st Annual Meeting of the Cognitive Science Society |
Subtitle of host publication | Creativity + Cognition + Computation, CogSci 2019 |
Publisher | The Cognitive Science Society |
Pages | 2248-2253 |
Number of pages | 6 |
ISBN (Electronic) | 0991196775, 9780991196777 |
Publication status | Published - 2019 |
Event | 41st Annual Meeting of the Cognitive Science Society: Creativity + Cognition + Computation, CogSci 2019 - Montreal, Canada Duration: 2019 Jul 24 → 2019 Jul 27 |
Publication series
Name | Proceedings of the 41st Annual Meeting of the Cognitive Science Society: Creativity + Cognition + Computation, CogSci 2019 |
---|
Conference
Conference | 41st Annual Meeting of the Cognitive Science Society: Creativity + Cognition + Computation, CogSci 2019 |
---|---|
Country/Territory | Canada |
City | Montreal |
Period | 19/7/24 → 19/7/27 |
Bibliographical note
Publisher Copyright:© Cognitive Science Society: Creativity + Cognition + Computation, CogSci 2019.All rights reserved.
Keywords
- computational models
- deep learning
- neural networks
- spoken word recognition
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Science Applications
- Human-Computer Interaction
- Cognitive Neuroscience