Speech Acoustics - AI Medical Compendium

A transfer learning approach to goodness of pronunciation based automatic mispronunciation detection.

The Journal of the Acoustical Society of America Nov 1, 2017

Goodness of pronunciation (GOP) is the most widely used method for automatic mispronunciation detection. In this paper, a transfer learning approach to GOP based mispronunciation detection when applying maximum F1-score criterion (MFC) training to de...

Acoustics Markov Chains Pattern Recognition, Automated Deep Learning Humans Signal Processing, Computer-Assisted Software Speech Acoustics Speech Production Measurement Judgment Voice Quality Speech Perception Phonetics

View on PubMed DOI

Convolutional neural network-based automatic classification of midsagittal tongue gestural targets using B-mode ultrasound images.

The Journal of the Acoustical Society of America Jun 1, 2017

Tongue gestural target classification is of great interest to researchers in the speech production field. Recently, deep convolutional neural networks (CNN) have shown superiority to standard feature extraction techniques in a variety of domains. In ...

Ultrasonography Signal Processing, Computer-Assisted Tongue Deep Learning Gestures Pattern Recognition, Automated Speech Acoustics Voice Quality Humans Neural Networks, Computer Female Biomechanical Phenomena Male

View on PubMed DOI

Auditory feature representation using convolutional restricted Boltzmann machine and Teager energy operator for speech recognition.

The Journal of the Acoustical Society of America Jun 1, 2017

In this letter, authors propose an auditory feature representation technique with the filterbank learned using an annealing dropout convolutional restricted Boltzmann machine (ConvRBM) and noise-robust energy estimation using the Teager energy operat...

Humans Neural Networks, Computer Machine Learning Speech Acoustics Speech Production Measurement Acoustics Voice Quality Sound Spectrography Time Factors Signal Processing, Computer-Assisted Pattern Recognition, Automated

View on PubMed DOI

Estimating the spectral tilt of the glottal source from telephone speech using a deep neural network.

The Journal of the Acoustical Society of America Apr 1, 2017

Estimation of the spectral tilt of the glottal source has several applications in speech analysis and modification. However, direct estimation of the tilt from telephone speech is challenging due to vocal tract resonances and distortion caused by spe...

Telephone Voice Quality Glottis Speech Production Measurement Phonation Male Acoustics Deep Learning Humans Sound Spectrography Signal Processing, Computer-Assisted Female Speech Acoustics

View on PubMed DOI

Restoring speech following total removal of the larynx by a learned transformation from sensor data to acoustics.

The Journal of the Acoustical Society of America Mar 1, 2017

Total removal of the larynx may be required to treat laryngeal cancer: speech is lost. This article shows that it may be possible to restore speech by sensing movement of the remaining speech articulators and use machine learning algorithms to derive...

Larynx, Artificial Humans Magnetics Tongue Recovery of Function Speech Intelligibility Lip Magnets Signal Processing, Computer-Assisted Voice Quality Speech Acoustics Magnetic Fields Machine Learning Prosthesis Design Acoustics Laryngectomy Biomechanical Phenomena Sound Spectrography Transducers Time Factors

View on PubMed DOI

Improved speech inversion using general regression neural network.

The Journal of the Acoustical Society of America Sep 1, 2015

The problem of nonlinear acoustic to articulatory inversion mapping is investigated in the feature space using two models, the deep belief network (DBN) which is the state-of-the-art, and the general regression neural network (GRNN). The task is to e...

Speech Production Measurement Numerical Analysis, Computer-Assisted Phonetics Speech Acoustics Databases, Factual Reproducibility of Results Humans Regression Analysis Computer Simulation Male Nonlinear Dynamics Neural Networks, Computer Female Pattern Recognition, Automated

View on PubMed DOI

AIMC Topic: Speech Acoustics

A transfer learning approach to goodness of pronunciation based automatic mispronunciation detection.

Convolutional neural network-based automatic classification of midsagittal tongue gestural targets using B-mode ultrasound images.

Auditory feature representation using convolutional restricted Boltzmann machine and Teager energy operator for speech recognition.

Estimating the spectral tilt of the glottal source from telephone speech using a deep neural network.

Restoring speech following total removal of the larynx by a learned transformation from sensor data to acoustics.

Improved speech inversion using general regression neural network.

Popular Topics

Recent Journals

AIMC Topic: Speech Acoustics

A transfer learning approach to goodness of pronunciation based automatic mispronunciation detection.

Convolutional neural network-based automatic classification of midsagittal tongue gestural targets using B-mode ultrasound images.

Auditory feature representation using convolutional restricted Boltzmann machine and Teager energy operator for speech recognition.

Estimating the spectral tilt of the glottal source from telephone speech using a deep neural network.

Restoring speech following total removal of the larynx by a learned transformation from sensor data to acoustics.

Improved speech inversion using general regression neural network.

Stay Ahead of Medical AI

Popular Topics

Recent Journals