AIMC Topic: Voice Quality

Clear Filters Showing 31 to 39 of 39 articles

Developing a smart system for binary classification of disordered voices using machine learning.

American journal of otolaryngology
OBJECTIVES: Voice disorder is characterized by disruptions in voice quality caused by issues in vocal fold vibration during phonation. The study explored the application of machine learning, based on the Random Forest (RF) and Decision Tree (DT) mode...

[Current methods of acoustic analysis of voice: a review].

Lin chuang er bi yan hou tou jing wai ke za zhi = Journal of clinical otorhinolaryngology head and neck surgery
Acoustic analysis of the voice, as an objective, quantitative, non-invasive and reproducible method for the evaluation of voice quality, can be used to detect and analyze the acoustic characteristics of normal, artistic or pathological voice. With th...

A transfer learning approach to goodness of pronunciation based automatic mispronunciation detection.

The Journal of the Acoustical Society of America
Goodness of pronunciation (GOP) is the most widely used method for automatic mispronunciation detection. In this paper, a transfer learning approach to GOP based mispronunciation detection when applying maximum F1-score criterion (MFC) training to de...

Convolutional neural network-based automatic classification of midsagittal tongue gestural targets using B-mode ultrasound images.

The Journal of the Acoustical Society of America
Tongue gestural target classification is of great interest to researchers in the speech production field. Recently, deep convolutional neural networks (CNN) have shown superiority to standard feature extraction techniques in a variety of domains. In ...

Auditory feature representation using convolutional restricted Boltzmann machine and Teager energy operator for speech recognition.

The Journal of the Acoustical Society of America
In this letter, authors propose an auditory feature representation technique with the filterbank learned using an annealing dropout convolutional restricted Boltzmann machine (ConvRBM) and noise-robust energy estimation using the Teager energy operat...

Estimating the spectral tilt of the glottal source from telephone speech using a deep neural network.

The Journal of the Acoustical Society of America
Estimation of the spectral tilt of the glottal source has several applications in speech analysis and modification. However, direct estimation of the tilt from telephone speech is challenging due to vocal tract resonances and distortion caused by spe...

Restoring speech following total removal of the larynx by a learned transformation from sensor data to acoustics.

The Journal of the Acoustical Society of America
Total removal of the larynx may be required to treat laryngeal cancer: speech is lost. This article shows that it may be possible to restore speech by sensing movement of the remaining speech articulators and use machine learning algorithms to derive...

Long-Term Voice Outcomes After Robotic Thyroidectomy.

World journal of surgery
BACKGROUND: The purpose of this study was to evaluate the long-term voice function after robotic thyroidectomy in comparison with conventional transcervical thyroidectomy.