AI Medical Compendium Journal:
The Journal of the Acoustical Society of America

Showing 91 to 100 of 100 articles

Auditory feature representation using convolutional restricted Boltzmann machine and Teager energy operator for speech recognition.

The Journal of the Acoustical Society of America
In this letter, authors propose an auditory feature representation technique with the filterbank learned using an annealing dropout convolutional restricted Boltzmann machine (ConvRBM) and noise-robust energy estimation using the Teager energy operat...

Estimating the spectral tilt of the glottal source from telephone speech using a deep neural network.

The Journal of the Acoustical Society of America
Estimation of the spectral tilt of the glottal source has several applications in speech analysis and modification. However, direct estimation of the tilt from telephone speech is challenging due to vocal tract resonances and distortion caused by spe...

Predicting the perception of performed dynamics in music audio with ensemble learning.

The Journal of the Acoustical Society of America
By varying the dynamics in a musical performance, the musician can convey structure and different expressions. Spectral properties of most musical instruments change in a complex way with the performed dynamics, but dedicated audio features for model...

Restoring speech following total removal of the larynx by a learned transformation from sensor data to acoustics.

The Journal of the Acoustical Society of America
Total removal of the larynx may be required to treat laryngeal cancer: speech is lost. This article shows that it may be possible to restore speech by sensing movement of the remaining speech articulators and use machine learning algorithms to derive...

Auditory inspired machine learning techniques can improve speech intelligibility and quality for hearing-impaired listeners.

The Journal of the Acoustical Society of America
Machine-learning based approaches to speech enhancement have recently shown great promise for improving speech intelligibility for hearing-impaired listeners. Here, the performance of three machine-learning algorithms and one classical algorithm, Wie...

Speaker-dependent multipitch tracking using deep neural networks.

The Journal of the Acoustical Society of America
Multipitch tracking is important for speech and signal processing. However, it is challenging to design an algorithm that achieves accurate pitch estimation and correct speaker assignment at the same time. In this paper, deep neural networks (DNNs) a...

Reliable bearing fault diagnosis using Bayesian inference-based multi-class support vector machines.

The Journal of the Acoustical Society of America
This letter presents a multi-fault diagnosis scheme for bearings using hybrid features extracted from their acoustic emissions and a Bayesian inference-based one-against-all support vector machine (Bayesian OAASVM) for multi-class classification. The...

Improved speech inversion using general regression neural network.

The Journal of the Acoustical Society of America
The problem of nonlinear acoustic to articulatory inversion mapping is investigated in the feature space using two models, the deep belief network (DBN) which is the state-of-the-art, and the general regression neural network (GRNN). The task is to e...

Emotion recognition from sound stimuli based on back-propagation neural networks and electroencephalograms.

The Journal of the Acoustical Society of America
This research aims to explore the feasibility of using back-propagation (BP) neural networks and electroencephalograms (EEGs) to recognize the emotional reactions induced by sound stimuli in the dimensions of pleasure and arousal, as well as compare ...

Compensating for the effects of site and equipment variation on delphinid species identification from their echolocation clicks.

The Journal of the Acoustical Society of America
A concern for applications of machine learning techniques to bioacoustics is whether or not classifiers learn the categories for which they were trained. Unfortunately, information such as characteristics of specific recording equipment or noise envi...