AIMC Topic: Speech Acoustics

Showing 1 to 10 of 46 articles

Automated segmentation of child-clinician speech in naturalistic clinical contexts.

Research in developmental disabilities
BACKGROUND: Computational approaches hold significant promise for enhancing diagnosis and therapy in child and adolescent clinical practice. Clinical procedures heavily depend on vocal exchanges and interpersonal dynamics conveyed through speech. Rese...

Computing nasalance with MFCCs and Convolutional Neural Networks.

PloS one
Nasalance is a valuable clinical biomarker for hypernasality. It is computed as the ratio of acoustic energy emitted through the nose to the total energy emitted through the mouth and nose (eNasalance). A new approach is proposed to compute nasalance...
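
The energy ratio described in this abstract can be illustrated with a minimal sketch. It assumes two separately recorded, equal-length channels (nasal and oral) and takes "acoustic energy" to mean the sum of squared samples; the function name `energy_nasalance` and the NumPy-based approach are illustrative choices, not the authors' implementation, and the MFCC/CNN method the paper proposes is not shown.

```python
import numpy as np


def energy_nasalance(nasal, oral):
    """Ratio of nasal energy to total (nasal + oral) energy.

    Assumption: energy is the sum of squared samples of each channel.
    Returns NaN when both channels are silent (total energy is zero).
    """
    nasal = np.asarray(nasal, dtype=float)
    oral = np.asarray(oral, dtype=float)
    e_nasal = np.sum(nasal ** 2)  # energy emitted through the nose
    e_oral = np.sum(oral ** 2)    # energy emitted through the mouth
    total = e_nasal + e_oral
    if total == 0.0:
        return float("nan")
    return float(e_nasal / total)
```

With equal energy in both channels the ratio is 0.5; with a silent nasal channel it is 0.0.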

The voice of depression: speech features as biomarkers for major depressive disorder.

BMC psychiatry
BACKGROUND: Psychiatry faces a challenge due to the lack of objective biomarkers, as current assessments are based on subjective evaluations. Automated speech analysis shows promise in detecting symptom severity in depressed patients. This project ai...

The Mason-Alberta Phonetic Segmenter: a forced alignment system based on deep neural networks and interpolation.

Phonetica
Given an orthographic transcription, forced alignment systems automatically determine boundaries between segments in speech, facilitating the use of large corpora. In the present paper, we introduce a neural network-based forced alignment system, the...

Consistency of the Signature of Phonotraumatic Vocal Hyperfunction Across Different Ambulatory Voice Measures.

Journal of speech, language, and hearing research : JSLHR
PURPOSE: Although different factors and voice measures have been associated with phonotraumatic vocal hyperfunction (PVH), it is unclear what percentage of individuals with PVH exhibit such differences during their daily lives. This study used a mach...

Use of a humanoid robot for auditory psychophysical testing.

PloS one
Tasks in psychophysical tests can at times be repetitive and cause individuals to lose engagement during the test. To facilitate engagement, we propose the use of a humanoid NAO robot, named Sam, as an alternative interface for conducting psychophysi...

Recognition of the Effect of Vocal Exercises by Fuzzy Triangular Naive Bayes, a Machine Learning Classifier: A Preliminary Analysis.

Journal of voice : official journal of the Voice Foundation
OBJECTIVES: Machine learning (ML) methods allow the development of expert systems for pattern recognition and predictive analysis of intervention outcomes. They have been used in Voice Sciences, mainly to discriminate between healthy and dysphonic voice...

Pathological Voice Detection Based on Phase Reconstitution and Convolutional Neural Network.

Journal of voice : official journal of the Voice Foundation
Nonlinear dynamic features can effectively describe the acoustic characteristics of normal and pathological voices. In this paper, phase space reconstruction and a convolutional neural network are used to classify the normal and pathological voice...

Deep-Learning-Based Representation of Vocal Fold Dynamics in Adductor Spasmodic Dysphonia during Connected Speech in High-Speed Videoendoscopy.

Journal of voice : official journal of the Voice Foundation
OBJECTIVE: Adductor spasmodic dysphonia (AdSD) is a neurogenic dystonia, which causes spasms of the laryngeal muscles. This disorder mainly affects production of connected speech. To understand how AdSD affects vocal fold (VF) movements and hence, th...