Phonetics - AI Medical Compendium

Brain-to-text decoding with context-aware neural representations and large language models.

Journal of neural engineering Oct 14, 2025

Decoding attempted speech from neural activity offers a promising avenue for restoring communication abilities in individuals with speech impairments. Previous studies have focused on mapping neural activity to text using phonemes as the intermedia...

Humans; Brain-Computer Interfaces; Phonetics; Large Language Models; Brain; Language


Wordsworth: A generative word dataset for comparison of speech representations in humans and neural networks.

Scientific data Sep 26, 2025

Speech perception is fundamental for human communication, but its neural basis is not well understood. Furthermore, while modern neural networks (NNs) can accurately recognize speech, whether they effectively model human speech processing remains unc...

Speech Perception; Phonetics; Neural Networks, Computer; Speech; Humans


Research on optimal deep learning modeling in HaiNan dialect recognition.

Scientific reports Aug 28, 2025

Speech recognition for the HaiNan dialect is complicated by significant differences in phonology, intonation, and grammatical structure among dialects, which also vary strongly by region; this makes the task of dialect-to-Man...

Deep Learning; China; Phonetics; Speech Perception; Humans; Neural Networks, Computer; Language


A dataset for recognition of Arabic accents from spoken L2 English speech (ArL2Eng).

Scientific data Jul 31, 2025

This paper introduces the ArL2Eng dataset, a speech corpus of L2 English produced by native speakers of Arabic, and highlights its potential in supporting research into automated language assessment. ArL2Eng comprises audio sequences from speakers of...

Deep Learning; Speech; Arabs; Humans; Language; Phonetics; Multilingualism


Evaluating Mandarin tone pronunciation accuracy for second language learners using a ResNet-based Siamese network.

Scientific reports Jul 8, 2025

Evaluating tone pronunciation is essential for helping second-language (L2) learners master the intricate nuances of Mandarin tones. This article introduces an innovative automatic evaluation method for Mandarin tone pronunciation that employs a Siam...

Phonetics; Multilingualism; Language; Speech; Humans; Neural Networks, Computer


A study on phonemes recognition method for Mandarin pronunciation based on improved Zipformer-RNN-T(Pruned) modeling.

PloS one May 23, 2025

In recent years, empowered by artificial intelligence technologies, computer-assisted language learning systems have gradually become a hot topic of research. Currently, the mainstream pronunciation assessment models rely on advanced speech recogniti...

China; Speech Recognition Software; Phonetics; Artificial Intelligence; Algorithms; Language; Humans; Neural Networks, Computer


Does Musical Experience Facilitate Phonetic Accommodation During Human-Robot Interaction?

Journal of speech, language, and hearing research : JSLHR Apr 21, 2025

PURPOSE: This study investigated the effect of musical training on phonetic accommodation in a second language (L2) after interacting with a social robot, exploring the motivations and reasons behind their accommodation strategies.

Robotics; Humans; Adult; Male; Speech; Female; Young Adult; Multilingualism; Music; Phonetics; Cues


A Tunable Forced Alignment System Based on Deep Learning: Applications to Child Speech.

Journal of speech, language, and hearing research : JSLHR Mar 31, 2025

PURPOSE: Phonetic forced alignment has a multitude of applications in automated analysis of speech, particularly in studying nonstandard speech such as children's speech. Manual alignment is tedious but serves as the gold standard for clinical-grade ...

Phonetics; Speech Production Measurement; Child Language; Child, Preschool; Child; Humans; Female; Deep Learning; Speech; Male


Effective Phoneme Decoding With Hyperbolic Neural Networks for High-Performance Speech BCIs.

IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society Sep 18, 2024

OBJECTIVE: Speech brain-computer interfaces (speech BCIs), which convert brain signals into spoken words or sentences, have demonstrated great potential for high-performance BCI communication. Phonemes are the basic pronunciation units. For monosylla...

Brain-Computer Interfaces; Female; Phonetics; Young Adult; Cluster Analysis; Electroencephalography; Male; Speech; Adult; Humans; Neural Networks, Computer; Algorithms; Language


The Mason-Alberta Phonetic Segmenter: a forced alignment system based on deep neural networks and interpolation.

Phonetica Sep 5, 2024

Given an orthographic transcription, forced alignment systems automatically determine boundaries between segments in speech, facilitating the use of large corpora. In the present paper, we introduce a neural network-based forced alignment system, the...

Humans; Phonetics; Speech Acoustics; Neural Networks, Computer; Speech

