AIMC Topic: Phonetics

Clear Filters Showing 1 to 10 of 40 articles

A dataset for recognition of Arabic accents from spoken L2 English speech (ArL2Eng).

Scientific data
This paper introduces the ArL2Eng dataset, a speech corpus of L2 English produced by native speakers of Arabic, and highlights its potential in supporting research into automated language assessment. ArL2Eng comprises audio sequences from speakers of...

Evaluating Mandarin tone pronunciation accuracy for second language learners using a ResNet-based Siamese network.

Scientific reports
Evaluating tone pronunciation is essential for helping second-language (L2) learners master the intricate nuances of Mandarin tones. This article introduces an innovative automatic evaluation method for Mandarin tone pronunciation that employs a Siam...

A study on phonemes recognition method for Mandarin pronunciation based on improved Zipformer-RNN-T(Pruned) modeling.

PloS one
In recent years, empowered by artificial intelligence technologies, computer-assisted language learning systems have gradually become a hot topic of research. Currently, the mainstream pronunciation assessment models rely on advanced speech recogniti...

Does Musical Experience Facilitate Phonetic Accommodation During Human-Robot Interaction?

Journal of speech, language, and hearing research : JSLHR
PURPOSE: This study investigated the effect of musical training on phonetic accommodation in a second language (L2) after interacting with a social robot, exploring the motivations and reasons behind their accommodation strategies.

A Tunable Forced Alignment System Based on Deep Learning: Applications to Child Speech.

Journal of speech, language, and hearing research : JSLHR
PURPOSE: Phonetic forced alignment has a multitude of applications in automated analysis of speech, particularly in studying nonstandard speech such as children's speech. Manual alignment is tedious but serves as the gold standard for clinical-grade ...

Effective Phoneme Decoding With Hyperbolic Neural Networks for High-Performance Speech BCIs.

IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society
OBJECTIVE: Speech brain-computer interfaces (speech BCIs), which convert brain signals into spoken words or sentences, have demonstrated great potential for high-performance BCI communication. Phonemes are the basic pronunciation units. For monosylla...

The Mason-Alberta Phonetic Segmenter: a forced alignment system based on deep neural networks and interpolation.

Phonetica
Given an orthographic transcription, forced alignment systems automatically determine boundaries between segments in speech, facilitating the use of large corpora. In the present paper, we introduce a neural network-based forced alignment system, the...

Artificial Intelligence-Assisted Speech Therapy for /ɹ/: A Single-Case Experimental Study.

American journal of speech-language pathology
PURPOSE: This feasibility trial describes changes in rhotic production in residual speech sound disorder following ten 40-min sessions including artificial intelligence (AI)-assisted motor-based intervention with ChainingAI, a version of Speech Motor...

Accuracy of Speech Sound Analysis: Comparison of an Automatic Artificial Intelligence Algorithm With Clinician Assessment.

Journal of speech, language, and hearing research : JSLHR
PURPOSE: Automatic speech analysis (ASA) and automatic speech recognition systems are increasingly being used in the treatment of speech sound disorders (SSDs). When utilized as a home practice tool or in the absence of the clinician, the ASA system ...

Exploring the effectiveness of reward-based learning strategies for second-language speech sounds.

Psychonomic bulletin & review
Adults struggle to learn non-native speech categories in many experimental settings (Goto, Neuropsychologia, 9(3), 317-323 1971), but learn efficiently in a video game paradigm where non-native speech sounds have functional significance (Lim & Holt, ...