The Impact of Speaker Diarization on DNN-based Autism Severity Estimation

Marina Eni, Alex Gorodetski, Ilan Dinstein, Yaniv Zigel

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

This paper presents a speech-based system for autism severity estimation combined with automatic speaker diarization. Speaker diarization was performed by two different methods. The first used acoustic features, namely Mel-Frequency Cepstral Coefficients (MFCC) and pitch; the second used x-vectors, which are embeddings extracted from Deep Neural Networks (DNN). In both methods, the speaker diarization was trained using a Fully Connected Deep Neural Network (FCDNN). We then trained a Convolutional Neural Network (CNN) to estimate the severity of autism based on 48 acoustic and prosodic features of speech. One hundred thirty-two young children were recorded in the Autism Diagnostic Observation Schedule (ADOS) examination room using a distant microphone. Of the two diarization methods, the one based on MFCC and pitch achieved the better (lower) Diarization Error Rate (DER), 26.91%. Using this diarization method, the severity estimation system achieved a Pearson correlation of 0.606 between the predicted and the actual autism severity scores (i.e., ADOS scores).
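
The two evaluation measures named in the abstract, DER and Pearson correlation, can be illustrated with a minimal sketch. This is not the authors' evaluation code. The function names `frame_level_der` and `pearson_r`, the label strings, and the toy data are hypothetical. The DER here is a simplified frame-level version that assumes hypothesis labels are already mapped to reference speakers, with no overlapping speech and no scoring collar; the paper's exact scoring protocol is not stated in the abstract.

```python
import math


def frame_level_der(reference, hypothesis):
    """Simplified frame-level Diarization Error Rate.

    Each list holds one speaker label per frame, with None for non-speech.
    Assumes hypothesis labels are already mapped to reference labels
    (no optimal speaker mapping, no overlap, no collar).
    DER = (missed + false alarm + confusion) / reference speech frames.
    """
    if len(reference) != len(hypothesis):
        raise ValueError("reference and hypothesis must have equal length")
    missed = false_alarm = confusion = speech = 0
    for ref, hyp in zip(reference, hypothesis):
        if ref is not None:
            speech += 1
            if hyp is None:
                missed += 1          # speech labelled as non-speech
            elif hyp != ref:
                confusion += 1       # speech given to the wrong speaker
        elif hyp is not None:
            false_alarm += 1         # non-speech labelled as speech
    if speech == 0:
        raise ValueError("reference contains no speech frames")
    return (missed + false_alarm + confusion) / speech


def pearson_r(x, y):
    """Pearson correlation between two equal-length numeric sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    # Toy data only; not taken from the paper.
    ref = ["child", "child", None, "adult", "adult", None]
    hyp = ["child", "adult", "child", None, "adult", None]
    print(frame_level_der(ref, hyp))  # 3 errors over 4 speech frames: 0.75
    print(pearson_r([1, 2, 3], [2, 4, 6]))
```

Note that DER is normalised by reference speech time only, so heavy false alarms can push it above 100%. Published DER figures typically also rely on an optimal speaker mapping step that this sketch omits.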

Original language: English
Title of host publication: 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2022
Publisher: Institute of Electrical and Electronics Engineers
Pages: 3414-3417
Number of pages: 4
ISBN (Electronic): 9781728127828
State: Published - 1 Jan 2022
Event: 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2022 - Glasgow, United Kingdom
Duration: 11 Jul 2022 → 15 Jul 2022

Publication series

Name: Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS
Volume: 2022-July
ISSN (Print): 1557-170X

Conference

Conference: 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2022
Country/Territory: United Kingdom
City: Glasgow
Period: 11/07/22 → 15/07/22

ASJC Scopus subject areas

  • Signal Processing
  • Health Informatics
  • Computer Vision and Pattern Recognition
  • Biomedical Engineering
