Voice Quality - AI Medical Compendium

The machine learning-based prediction of the sound pressure level from pathological and healthy speech signals.

The Journal of the Acoustical Society of America Mar 1, 2025

Vocal intensity is quantified by sound pressure level (SPL). The SPL can be measured by either using a sound level meter or by comparing the energy of the recorded speech signal with the energy of the recorded calibration tone of a known SPL. Neither...

Speech Production Measurement Voice Quality Adult Male Middle Aged Case-Control Studies Signal Processing, Computer-Assisted Sound Spectrography Pressure Speech Acoustics Humans Machine Learning Female Aged

View on PubMed DOI

Developing a smart system for binary classification of disordered voices using machine learning.

American journal of otolaryngology Jan 1, 2025

OBJECTIVES: Voice disorder is characterized by disruptions in voice quality caused by issues in vocal fold vibration during phonation. The study explored the application of machine learning, based on the Random Forest (RF) and Decision Tree (DT) mode...

Databases, Factual Female Aged Young Adult Humans Machine Learning Decision Trees Speech Acoustics Voice Disorders Voice Quality Adult Male Middle Aged ROC Curve

View on PubMed DOI

Digital Avatars and Personalized Voices-How AI Is Helping to Restore Speech to Patients.

JAMA Apr 16, 2024

Humans Artificial Intelligence Speech Equipment and Supplies Voice Aphasia Voice Quality Avatar

View on PubMed DOI

[Current methods of acoustic analysis of voice: a review].

Lin chuang er bi yan hou tou jing wai ke za zhi = Journal of clinical otorhinolaryngology head and neck surgery Dec 1, 2022

Acoustic analysis of the voice, as an objective, quantitative, non-invasive and reproducible method for the evaluation of voice quality, can be used to detect and analyze the acoustic characteristics of normal, artistic or pathological voice. With th...

Humans Artificial Intelligence Acoustics Speech Acoustics Voice Voice Quality

View on PubMed DOI

A transfer learning approach to goodness of pronunciation based automatic mispronunciation detection.

The Journal of the Acoustical Society of America Nov 1, 2017

Goodness of pronunciation (GOP) is the most widely used method for automatic mispronunciation detection. In this paper, a transfer learning approach to GOP based mispronunciation detection when applying maximum F1-score criterion (MFC) training to de...

Speech Production Measurement Voice Quality Judgment Speech Perception Phonetics Speech Acoustics Humans Software Deep Learning Signal Processing, Computer-Assisted Acoustics Markov Chains Pattern Recognition, Automated

View on PubMed DOI

Convolutional neural network-based automatic classification of midsagittal tongue gestural targets using B-mode ultrasound images.

The Journal of the Acoustical Society of America Jun 1, 2017

Tongue gestural target classification is of great interest to researchers in the speech production field. Recently, deep convolutional neural networks (CNN) have shown superiority to standard feature extraction techniques in a variety of domains. In ...

Ultrasonography Signal Processing, Computer-Assisted Tongue Pattern Recognition, Automated Speech Acoustics Voice Quality Biomechanical Phenomena Male Deep Learning Gestures Humans Neural Networks, Computer Female

View on PubMed DOI

Auditory feature representation using convolutional restricted Boltzmann machine and Teager energy operator for speech recognition.

The Journal of the Acoustical Society of America Jun 1, 2017

In this letter, authors propose an auditory feature representation technique with the filterbank learned using an annealing dropout convolutional restricted Boltzmann machine (ConvRBM) and noise-robust energy estimation using the Teager energy operat...

Voice Quality Sound Spectrography Pattern Recognition, Automated Speech Acoustics Speech Production Measurement Machine Learning Time Factors Signal Processing, Computer-Assisted Acoustics Humans Neural Networks, Computer

View on PubMed DOI

Estimating the spectral tilt of the glottal source from telephone speech using a deep neural network.

The Journal of the Acoustical Society of America Apr 1, 2017

Estimation of the spectral tilt of the glottal source has several applications in speech analysis and modification. However, direct estimation of the tilt from telephone speech is challenging due to vocal tract resonances and distortion caused by spe...

Signal Processing, Computer-Assisted Acoustics Sound Spectrography Speech Acoustics Speech Production Measurement Phonation Telephone Voice Quality Humans Female Male Deep Learning Glottis

View on PubMed DOI

Restoring speech following total removal of the larynx by a learned transformation from sensor data to acoustics.

The Journal of the Acoustical Society of America Mar 1, 2017

Total removal of the larynx may be required to treat laryngeal cancer: speech is lost. This article shows that it may be possible to restore speech by sensing movement of the remaining speech articulators and use machine learning algorithms to derive...

Transducers Lip Voice Quality Larynx, Artificial Recovery of Function Signal Processing, Computer-Assisted Prosthesis Design Acoustics Humans Machine Learning Biomechanical Phenomena Time Factors Magnetics Speech Intelligibility Speech Acoustics Laryngectomy Sound Spectrography Tongue Magnets Magnetic Fields

View on PubMed DOI

Long-Term Voice Outcomes After Robotic Thyroidectomy.

World journal of surgery Jan 1, 2016

BACKGROUND: The purpose of this study was to evaluate the long-term voice function after robotic thyroidectomy in comparison with conventional transcervical thyroidectomy.

Thyroid Neoplasms Thyroid Nodule Treatment Outcome Follow-Up Studies Robotics Patient Satisfaction Voice Disorders Voice Quality Male Middle Aged Time Factors Thyroidectomy Humans Female Adult

View on PubMed DOI

AIMC Topic: Voice Quality

The machine learning-based prediction of the sound pressure level from pathological and healthy speech signals.

Developing a smart system for binary classification of disordered voices using machine learning.

Digital Avatars and Personalized Voices-How AI Is Helping to Restore Speech to Patients.

[Current methods of acoustic analysis of voice: a review].

A transfer learning approach to goodness of pronunciation based automatic mispronunciation detection.

Convolutional neural network-based automatic classification of midsagittal tongue gestural targets using B-mode ultrasound images.

Auditory feature representation using convolutional restricted Boltzmann machine and Teager energy operator for speech recognition.

Estimating the spectral tilt of the glottal source from telephone speech using a deep neural network.

Restoring speech following total removal of the larynx by a learned transformation from sensor data to acoustics.

Long-Term Voice Outcomes After Robotic Thyroidectomy.

Popular Topics

Recent Journals

AIMC Topic: Voice Quality

Stay Ahead of Medical AI

Popular Topics

Recent Journals