Speech - AI Medical Compendium

Efficient neural encoding as revealed by bilingualism.

Proceedings of the National Academy of Sciences of the United States of America Aug 19, 2025

The remarkable human capacity for bilingual and multilingual acquisition raises fundamental questions about how the brain develops efficient systems for processing multiple languages. In this study, we used neural network models trained on natural sp...

Humans Neural Networks, Computer Brain Models, Neurological Multilingualism Language Development Language Learning Speech

View on PubMed DOI

A deep learning framework for gender sensitive speech emotion recognition based on MFCC feature selection and SHAP analysis.

Scientific reports Aug 5, 2025

Speech is one of the most efficient methods of communication among humans, inspiring advancements in machine speech processing under Natural Language Processing (NLP). This field aims to enable computers to analyze, comprehend, and generate human lan...

Emotions Natural Language Processing Neural Networks, Computer Algorithms Female Male Deep Learning Speech Humans

View on PubMed DOI

A dataset for recognition of Arabic accents from spoken L2 English speech (ArL2Eng).

Scientific data Jul 31, 2025

This paper introduces the ArL2Eng dataset, a speech corpus of L2 English produced by native speakers of Arabic, and highlights its potential in supporting research into automated language assessment. ArL2Eng comprises audio sequences from speakers of...

Humans Language Deep Learning Phonetics Speech Multilingualism Arabs

View on PubMed DOI

EEG-based speech imagery decoding by dynamic hypergraph learning within projected and selected feature subspaces.

Journal of neural engineering Jul 28, 2025

Speech imagery is a nascent paradigm that is receiving widespread attention in current brain-computer interface (BCI) research. By collecting the electroencephalogram (EEG) data generated when imagining the pronunciation of a sentence or word in huma...

Adult Male Speech Female Young Adult Electroencephalography Brain-Computer Interfaces Imagination Humans Machine Learning

View on PubMed DOI

Multilingual identification of nuanced dimensions of hope speech in social media texts.

Scientific reports Jul 23, 2025

Hope plays a crucial role in human psychology and well-being, yet its expression and detection across languages remain underexplored in natural language processing (NLP). This study presents MIND-HOPE, the first-ever multiclass hope speech detection ...

Social Media Natural Language Processing Multilingualism Hope Language Deep Learning Speech Humans Machine Learning

View on PubMed DOI

Speech emotion recognition based on a stacked autoencoders optimized by PSO based grass fibrous root optimization.

Scientific reports Jul 18, 2025

Effective speech emotion recognition (SER) poses a significant challenge due to the intricate and subjective nature of human emotions. Recognizing emotional states accurately from speech signals has a broad spectrum of practical applications, such as...

Algorithms Deep Learning Speech Support Vector Machine Humans Emotions Neural Networks, Computer Autoencoder

View on PubMed DOI

Voice fatigue subtyping through individual modeling of vocal demand reponses.

Scientific reports Jul 16, 2025

Recognizing individual variability is essential for developing targeted, personalized medical interventions. Vocal fatigue is a prevalent symptom and complaint among occupational voice users, but its identification has yielded mixed results. Vocal fa...

Female Young Adult Adult Male Middle Aged Speech Humans Voice Quality Voice Voice Disorders

View on PubMed DOI

Detecting schizophrenia, bipolar disorder, psychosis vulnerability and major depressive disorder from 5 minutes of online-collected speech.

Translational psychiatry Jul 12, 2025

Psychosis poses substantial social and healthcare burdens. The analysis of speech is a promising approach for the diagnosis and monitoring of psychosis, capturing symptoms like thought disorder and flattened affect. Recent advancements in Natural Lan...

Schizophrenia Psychotic Disorders Bipolar Disorder Natural Language Processing Machine Learning Depressive Disorder, Major Case-Control Studies Female Speech Adult Humans Young Adult Male Middle Aged

View on PubMed DOI

An enhanced deep learning approach for speaker diarization using TitaNet, MarbelNet and time delay network.

Scientific reports Jul 8, 2025

Speaker diarization, identifying "who spoke when," plays a vital role in speech transcription, supervised fine-tuning of large language models, conversational AI, and audio content analysis by providing labeled speaker segments. Traditional speaker d...

Algorithms Humans Deep Learning Neural Networks, Computer Speech

View on PubMed DOI

Evaluating Mandarin tone pronunciation accuracy for second language learners using a ResNet-based Siamese network.

Scientific reports Jul 8, 2025

Evaluating tone pronunciation is essential for helping second-language (L2) learners master the intricate nuances of Mandarin tones. This article introduces an innovative automatic evaluation method for Mandarin tone pronunciation that employs a Siam...

Multilingualism Humans Language Neural Networks, Computer Speech Phonetics

View on PubMed DOI

AIMC Topic: Speech

Efficient neural encoding as revealed by bilingualism.

A deep learning framework for gender sensitive speech emotion recognition based on MFCC feature selection and SHAP analysis.

A dataset for recognition of Arabic accents from spoken L2 English speech (ArL2Eng).

EEG-based speech imagery decoding by dynamic hypergraph learning within projected and selected feature subspaces.

Multilingual identification of nuanced dimensions of hope speech in social media texts.

Speech emotion recognition based on a stacked autoencoders optimized by PSO based grass fibrous root optimization.

Voice fatigue subtyping through individual modeling of vocal demand reponses.

Detecting schizophrenia, bipolar disorder, psychosis vulnerability and major depressive disorder from 5 minutes of online-collected speech.

An enhanced deep learning approach for speaker diarization using TitaNet, MarbelNet and time delay network.

Evaluating Mandarin tone pronunciation accuracy for second language learners using a ResNet-based Siamese network.

Popular Topics

Recent Journals

AIMC Topic: Speech

Don't Miss the Future of Medicine

Popular Topics

Recent Journals