Robust Self Supervised Speech Embeddings for Child-Adult Classification in Interactions involving Children with Autism

  • Rimita Lahiri
  • , Tiantian Feng
  • , Rajat Hebbar
  • , Catherine Lord
  • , So Hyun Kim
  • , Shrikanth Narayanan

Research output: Contribution to journalConference articlepeer-review

Abstract

We address the problem of detecting who spoke when in child-inclusive spoken interactions i.e., automatic child-adult speaker classification. Interactions involving children are richly heterogeneous due to developmental differences. The presence of neurodiversity e.g., due to Autism, contributes additional variability. We investigate the impact of additional pre-training with more unlabelled child speech on the child-adult classification performance. We pre-train our model with child-inclusive interactions, following two recent self-supervision algorithms, Wav2vec 2.0 and WavLM, with a contrastive loss objective. We report 9 − 13% relative improvement over the state-of-the-art baseline with regards to classification F1 scores on two clinical interaction datasets involving children with Autism. We also analyze the impact of pre-training under different conditions by evaluating our model on interactions involving different subgroups of children based on various demographic factors.

Original languageEnglish
Pages (from-to)3557-3561
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume2023-August
DOIs
Publication statusPublished - 2023
Event24rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2023 - Dublin, Ireland
Duration: 2023 Aug 202023 Aug 24

Bibliographical note

Publisher Copyright:
© 2023 International Speech Communication Association. All rights reserved.

Keywords

  • autism
  • child-adult classification
  • self-supervision
  • speech

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Language and Linguistics
  • Modelling and Simulation
  • Human-Computer Interaction

Fingerprint

Dive into the research topics of 'Robust Self Supervised Speech Embeddings for Child-Adult Classification in Interactions involving Children with Autism'. Together they form a unique fingerprint.

Cite this