AIMC Topic: Phonetics

Clear Filters Showing 31 to 37 of 37 articles

Phonetic variability constrained bottleneck features for joint speaker recognition and physical task stress detection.

The Journal of the Acoustical Society of America
Normalizing intrinsic variabilities (e.g., variability in speech production brought on by aging, physical or cognitive task stress, Lombard effect, etc.) in speech and speaker recognition models is essential for system robustness. This study focuses ...

EARSHOT: A Minimal Neural Network Model of Incremental Human Speech Recognition.

Cognitive science
Despite the lack of invariance problem (the many-to-many mapping between acoustics and percepts), human listeners experience phonetic constancy and typically perceive what a speaker intends. Most models of human speech recognition (HSR) have side-ste...

Density and Distinctiveness in Early Word Learning: Evidence From Neural Network Simulations.

Cognitive science
High phonological neighborhood density has been associated with both advantages and disadvantages in early word learning. High density may support the formation and fine-tuning of new word sound memories-a process termed lexical configuration (e.g., ...

Semi-supervised learning of a nonnative phonetic contrast: How much feedback is enough?

Attention, perception & psychophysics
Semi-supervised learning refers to learning that occurs when feedback about performance is provided on only a subset of training trials. Algorithms for semi-supervised learning are popular in machine learning because of their minimal reliance on labe...

Evaluating automatic speech recognition systems as quantitative models of cross-lingual phonetic category perception.

The Journal of the Acoustical Society of America
Theories of cross-linguistic phonetic category perception posit that listeners perceive foreign sounds by mapping them onto their native phonetic categories, but, until now, no way to effectively implement this mapping has been proposed. In this pape...

A transfer learning approach to goodness of pronunciation based automatic mispronunciation detection.

The Journal of the Acoustical Society of America
Goodness of pronunciation (GOP) is the most widely used method for automatic mispronunciation detection. In this paper, a transfer learning approach to GOP based mispronunciation detection when applying maximum F1-score criterion (MFC) training to de...

Improved speech inversion using general regression neural network.

The Journal of the Acoustical Society of America
The problem of nonlinear acoustic to articulatory inversion mapping is investigated in the feature space using two models, the deep belief network (DBN) which is the state-of-the-art, and the general regression neural network (GRNN). The task is to e...