TY - GEN
T1 - Toward Explainable Automatic Classification of Children’s Speech Disorders
AU - Shulga, Dima
AU - Silber-Varod, Vered
AU - Benson-Karai, Diamanta
AU - Levi, Ofer
AU - Vashdi, Elad
AU - Lerner, Anat
N1 - Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020/1/1
Y1 - 2020/1/1
N2 - Early and adequate diagnosis of speech disorders can contribute to the quality of the treatment and thus to treatment success rates. Using acoustic analysis of the speech of children with speech disorders may aid therapists in the diagnostic process by identifying the acoustic characteristics that are unique to a specific disorder and that distinguish it from normal speech development. The purpose of this work is to investigate the feasibility of the automatic detection of speech disorders based on children’s voices. In this preliminary study, using a dataset of utterance recordings of 24 children whose mother tongue is Hebrew, we propose an automatic system that may facilitate accurate speech assessment by therapists by providing a preliminary diagnosis and explainable insights about the model’s predictions. We built a serial, two-step network that is both powerful and possibly interpretable. The first step can model the complex relations between acoustic features and the speech disorder while the second can shed light on the utterances that make the greatest contribution to the final classification. Our preliminary results focus on the broad spectrum of speech disorders. In future work, we plan to design a system that will be able to detect childhood apraxia of speech (CAS) specifically and shed light on the differences in the speech of individuals with CAS and those with other speech disorders.
AB - Early and adequate diagnosis of speech disorders can contribute to the quality of the treatment and thus to treatment success rates. Using acoustic analysis of the speech of children with speech disorders may aid therapists in the diagnostic process by identifying the acoustic characteristics that are unique to a specific disorder and that distinguish it from normal speech development. The purpose of this work is to investigate the feasibility of the automatic detection of speech disorders based on children’s voices. In this preliminary study, using a dataset of utterance recordings of 24 children whose mother tongue is Hebrew, we propose an automatic system that may facilitate accurate speech assessment by therapists by providing a preliminary diagnosis and explainable insights about the model’s predictions. We built a serial, two-step network that is both powerful and possibly interpretable. The first step can model the complex relations between acoustic features and the speech disorder while the second can shed light on the utterances that make the greatest contribution to the final classification. Our preliminary results focus on the broad spectrum of speech disorders. In future work, we plan to design a system that will be able to detect childhood apraxia of speech (CAS) specifically and shed light on the differences in the speech of individuals with CAS and those with other speech disorders.
KW - Childhood Apraxia of Speech (CAS)
KW - Deep spectrum
KW - GeMAPS
KW - Speech disorder
UR - http://www.scopus.com/inward/record.url?scp=85092932257&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-60276-5_49
DO - 10.1007/978-3-030-60276-5_49
M3 - Conference contribution
AN - SCOPUS:85092932257
SN - 9783030602758
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 509
EP - 519
BT - Speech and Computer - 22nd International Conference, SPECOM 2020, Proceedings
A2 - Karpov, Alexey
A2 - Potapova, Rodmonga
PB - Springer Science and Business Media Deutschland GmbH
T2 - 22nd International Conference on Speech and Computer, SPECOM 2020
Y2 - 7 October 2020 through 9 October 2020
ER -