Speech - AI Medical Compendium

Accurate semi-supervised automatic speech recognition for ordinary and characterized speeches via multi-hypotheses-based curriculum learning.

PloS one Oct 21, 2025

How can we build accurate transcription models for both ordinary speech and characterized speech in a semi-supervised setting? ASR (Automatic Speech Recognition) systems are widely used in various real-world applications, including translation system...

Supervised Machine Learning Speech Recognition Software Algorithms Speech Humans

View on PubMed DOI

Hierarchical dynamic coding coordinates speech comprehension in the human brain.

Proceedings of the National Academy of Sciences of the United States of America Oct 17, 2025

Speech comprehension involves transforming an acoustic waveform into meaning. To do so, the human brain generates a hierarchy of features that converts the sensory input into increasingly abstract language properties. However, little is known about h...

Brain Language Female Humans Speech Perception Magnetoencephalography Male Speech Comprehension Young Adult Adult

View on PubMed DOI

Automated Speech Markers of Alzheimer Dementia: Test of Cross-Linguistic Generalizability.

Journal of medical Internet research Oct 15, 2025

BACKGROUND: Automated speech and language analysis (ASLA) is gaining momentum as a noninvasive, affordable, and scalable approach for the early detection of Alzheimer disease (AD). Nevertheless, the literature presents 2 notable limitations. First, m...

Male Middle Aged Speech Aged Aged, 80 and over Linguistics Alzheimer Disease Language Female Humans

View on PubMed DOI

Speech Emotion Recognition in Mental Health: Systematic Review of Voice-Based Applications.

JMIR mental health Sep 30, 2025

BACKGROUND: The field of speech emotion recognition (SER) encompasses a wide variety of approaches, with artificial intelligence technologies providing improvements in recent years. In the domain of mental health, the links between individuals' emoti...

Speech Emotions Artificial Intelligence Mental Disorders Mental Health Humans

View on PubMed DOI

A Multimodal Depression Consultation Dataset of Speech and Text with HAMD-17 Assessments.

Scientific data Sep 29, 2025

The global surge in depression rates, notably severe in China with over 95 million affected, underscores a dire public health issue. This is exacerbated by a critical shortfall in mental health professionals, highlighting an urgent call for innovativ...

Referral and Consultation Humans Artificial Intelligence Speech China Depression

View on PubMed DOI

Wordsworth: A generative word dataset for comparison of speech representations in humans and neural networks.

Scientific data Sep 26, 2025

Speech perception is fundamental for human communication, but its neural basis is not well understood. Furthermore, while modern neural networks (NNs) can accurately recognize speech, whether they effectively model human speech processing remains unc...

Phonetics Speech Neural Networks, Computer Speech Perception Humans

View on PubMed DOI

Machine learning based classification of imagined speech electroencephalogram data from the amplitude and phase spectrum of frequency domain EEG signal.

Biomedical physics & engineering express Sep 26, 2025

Imagined speech classification involves decoding brain signals to recognize verbalized thoughts or intentions without actual speech production. This technology has significant implications for individuals with speech impairments, offering a means to ...

Electroencephalography Signal Processing, Computer-Assisted Imagination Humans Female Male Algorithms Machine Learning Young Adult Speech Brain Adult

View on PubMed DOI

Searching for effective preprocessing method and CNN based architecture with efficient channel attention on speech emotion recognition.

Scientific reports Sep 24, 2025

Recently, Speech emotion recognition (SER) performance has steadily increased as multiple deep learning architectures have adapted. Especially, convolutional neural network (CNN) models with spectrogram data preprocessing are the most popular approac...

Humans Neural Networks, Computer Deep Learning Speech Algorithms Emotions

View on PubMed DOI

Auto-Masked Audio Spectrogram Transformer for depression detection from speech.

Journal of affective disorders Sep 16, 2025

BACKGROUND: Depression is a psychological disorder characterized by altered self-referential cognition and impaired emotional expression. Traditional diagnostic methods can be costly or intrusive, while Speech-based analysis offers an accessible alte...

Humans Female Adult Depression Deep Learning Sound Spectrography Male Speech Depressive Disorder

View on PubMed DOI

Syllable-based speech characteristics as potential biomarker for differential diagnosis of Parkinson's disease, multiple system atrophy, and cerebellar ataxia.

Journal of neurology Sep 5, 2025

Speech disorders differ between Parkinson's disease (PD) and multiple system atrophy (MSA), but studies focusing on group differences based on syllables or including cerebellar ataxia (CA) are lacking until now. This cross-sectional study aimed to an...

Cerebellar Ataxia Speech Disorders Multiple System Atrophy Female Male Parkinson Disease Speech Aged Middle Aged Diagnosis, Differential Humans Cross-Sectional Studies Biomarkers

View on PubMed DOI

AIMC Topic: Speech

Accurate semi-supervised automatic speech recognition for ordinary and characterized speeches via multi-hypotheses-based curriculum learning.

Hierarchical dynamic coding coordinates speech comprehension in the human brain.

Automated Speech Markers of Alzheimer Dementia: Test of Cross-Linguistic Generalizability.

Speech Emotion Recognition in Mental Health: Systematic Review of Voice-Based Applications.

A Multimodal Depression Consultation Dataset of Speech and Text with HAMD-17 Assessments.

Wordsworth: A generative word dataset for comparison of speech representations in humans and neural networks.

Machine learning based classification of imagined speech electroencephalogram data from the amplitude and phase spectrum of frequency domain EEG signal.

Searching for effective preprocessing method and CNN based architecture with efficient channel attention on speech emotion recognition.

Auto-Masked Audio Spectrogram Transformer for depression detection from speech.

Syllable-based speech characteristics as potential biomarker for differential diagnosis of Parkinson's disease, multiple system atrophy, and cerebellar ataxia.

Popular Topics

Recent Journals

AIMC Topic: Speech

Stay Ahead of Medical AI

Popular Topics

Recent Journals