Automatic prediction of intelligible speaking rate for individuals with ALS from speech acoustic and articulatory samples.

Journal: International journal of speech-language pathology
Published Date:

Abstract

: This research aimed to automatically predict intelligible speaking rate for individuals with Amyotrophic Lateral Sclerosis (ALS) based on speech acoustic and articulatory samples. Twelve participants with ALS and two normal subjects produced a total of 1831 phrases. NDI Wave system was used to collect tongue and lip movement and acoustic data synchronously. A machine learning algorithm (i.e. support vector machine) was used to predict intelligible speaking rate (speech intelligibility × speaking rate) from acoustic and articulatory features of the recorded samples. Acoustic, lip movement, and tongue movement information separately, yielded a of 0.652, 0.660, and 0.678 and a Root Mean Squared Error (RMSE) of 41.096, 41.166, and 39.855 words per minute (WPM) between the predicted and actual values, respectively. Combining acoustic, lip and tongue information we obtained the highest R (0.712) and the lowest RMSE (37.562 WPM). The results revealed that our proposed analyses predicted the intelligible speaking rate of the participant with reasonably high accuracy by extracting the acoustic and/or articulatory features from one short speech sample. With further development, the analyses may be well-suited for clinical applications that require automatic speech severity prediction.

Authors

  • Jun Wang
    Department of Speech, Language, and Hearing Sciences and the Department of Neurology, The University of Texas at Austin, Austin, TX 78712, USA.
  • Prasanna V Kothalkar
    a Department of Bioengineering , Speech Disorders & Technology Lab.
  • Myungjong Kim
  • Andrea Bandini
    Department of Information Engineering, Università degli Studi di Firenze, Via di S. Marta 3, 50139 Firenze, Italy; Department of Electrical, Electronic and Information Engineering (DEI) "Guglielmo Marconi", Università di Bologna, Viale del Risorgimento 2, 40136, Bologna, Italy. Electronic address: andrea.bandini@uhn.ca.
  • Beiming Cao
    a Department of Bioengineering , Speech Disorders & Technology Lab.
  • Yana Yunusova
    c Department of Speech-Language Pathology , University of Toronto , Toronto , Canada.
  • Thomas F Campbell
    b Callier Center for Communication Disorders, University of Texas at Dallas , Richardson , TX , USA.
  • Daragh Heitzman
    d MDA/ALS Center , Texas , TX , USA.
  • Jordan R Green
    e Department of Communication Sciences and Disorders , MGH Institute of Health Professions , Boston , MA , USA.