Speech - AI Medical Compendium

A deep learning framework for gender sensitive speech emotion recognition based on MFCC feature selection and SHAP analysis.

Scientific reports Aug 5, 2025

Speech is one of the most efficient methods of communication among humans, inspiring advancements in machine speech processing under Natural Language Processing (NLP). This field aims to enable computers to analyze, comprehend, and generate human lan...

Humans Neural Networks, Computer Algorithms Speech Emotions Female Natural Language Processing Male Deep Learning

View on PubMed DOI

A dataset for recognition of Arabic accents from spoken L2 English speech (ArL2Eng).

Scientific data Jul 31, 2025

This paper introduces the ArL2Eng dataset, a speech corpus of L2 English produced by native speakers of Arabic, and highlights its potential in supporting research into automated language assessment. ArL2Eng comprises audio sequences from speakers of...

Arabs Phonetics Multilingualism Humans Language Deep Learning Speech

View on PubMed DOI

EEG-based speech imagery decoding by dynamic hypergraph learning within projected and selected feature subspaces.

Journal of neural engineering Jul 28, 2025

Speech imagery is a nascent paradigm that is receiving widespread attention in current brain-computer interface (BCI) research. By collecting the electroencephalogram (EEG) data generated when imagining the pronunciation of a sentence or word in huma...

Humans Machine Learning Brain-Computer Interfaces Imagination Female Male Young Adult Speech Adult Electroencephalography

View on PubMed DOI

Multilingual identification of nuanced dimensions of hope speech in social media texts.

Scientific reports Jul 23, 2025

Hope plays a crucial role in human psychology and well-being, yet its expression and detection across languages remain underexplored in natural language processing (NLP). This study presents MIND-HOPE, the first-ever multiclass hope speech detection ...

Language Deep Learning Speech Humans Machine Learning Hope Social Media Natural Language Processing Multilingualism

View on PubMed DOI

Speech emotion recognition based on a stacked autoencoders optimized by PSO based grass fibrous root optimization.

Scientific reports Jul 18, 2025

Effective speech emotion recognition (SER) poses a significant challenge due to the intricate and subjective nature of human emotions. Recognizing emotional states accurately from speech signals has a broad spectrum of practical applications, such as...

Autoencoder Neural Networks, Computer Algorithms Deep Learning Speech Humans Support Vector Machine Emotions

View on PubMed DOI

Voice fatigue subtyping through individual modeling of vocal demand reponses.

Scientific reports Jul 16, 2025

Recognizing individual variability is essential for developing targeted, personalized medical interventions. Vocal fatigue is a prevalent symptom and complaint among occupational voice users, but its identification has yielded mixed results. Vocal fa...

Voice Disorders Voice Quality Speech Voice Humans Female Young Adult Adult Male Middle Aged

View on PubMed DOI

Detecting schizophrenia, bipolar disorder, psychosis vulnerability and major depressive disorder from 5 minutes of online-collected speech.

Translational psychiatry Jul 12, 2025

Psychosis poses substantial social and healthcare burdens. The analysis of speech is a promising approach for the diagnosis and monitoring of psychosis, capturing symptoms like thought disorder and flattened affect. Recent advancements in Natural Lan...

Female Young Adult Adult Male Middle Aged Depressive Disorder, Major Humans Machine Learning Natural Language Processing Case-Control Studies Schizophrenia Speech Psychotic Disorders Bipolar Disorder

View on PubMed DOI

An enhanced deep learning approach for speaker diarization using TitaNet, MarbelNet and time delay network.

Scientific reports Jul 8, 2025

Speaker diarization, identifying "who spoke when," plays a vital role in speech transcription, supervised fine-tuning of large language models, conversational AI, and audio content analysis by providing labeled speaker segments. Traditional speaker d...

Humans Neural Networks, Computer Algorithms Deep Learning Speech

View on PubMed DOI

Evaluating Mandarin tone pronunciation accuracy for second language learners using a ResNet-based Siamese network.

Scientific reports Jul 8, 2025

Evaluating tone pronunciation is essential for helping second-language (L2) learners master the intricate nuances of Mandarin tones. This article introduces an innovative automatic evaluation method for Mandarin tone pronunciation that employs a Siam...

Multilingualism Speech Phonetics Humans Neural Networks, Computer Language

View on PubMed DOI

Prediction of suicide using web based voice recordings analyzed by artificial intelligence.

Scientific reports Jul 4, 2025

The integration of machine learning (ML) and deep learning models in suicide risk assessment has advanced significantly in recent years. In this study, we utilized ML in a case-control design, we predicted completed suicides using publicly available,...

Voice Speech Suicide Internet Machine Learning Humans Risk Assessment Female Artificial Intelligence Adult Case-Control Studies Young Adult Male Deep Learning Middle Aged

View on PubMed DOI

AIMC Topic: Speech

A deep learning framework for gender sensitive speech emotion recognition based on MFCC feature selection and SHAP analysis.

A dataset for recognition of Arabic accents from spoken L2 English speech (ArL2Eng).

EEG-based speech imagery decoding by dynamic hypergraph learning within projected and selected feature subspaces.

Multilingual identification of nuanced dimensions of hope speech in social media texts.

Speech emotion recognition based on a stacked autoencoders optimized by PSO based grass fibrous root optimization.

Voice fatigue subtyping through individual modeling of vocal demand reponses.

Detecting schizophrenia, bipolar disorder, psychosis vulnerability and major depressive disorder from 5 minutes of online-collected speech.

An enhanced deep learning approach for speaker diarization using TitaNet, MarbelNet and time delay network.

Evaluating Mandarin tone pronunciation accuracy for second language learners using a ResNet-based Siamese network.

Prediction of suicide using web based voice recordings analyzed by artificial intelligence.

Popular Topics

Recent Journals

AIMC Topic: Speech

Stay Ahead of Medical AI

Popular Topics

Recent Journals