AIMC Topic: Speech

Showing 31 to 40 of 368 articles

SuperM2M: Supervised and mixture-to-mixture co-learning for speech enhancement and noise-robust ASR.

Neural networks: the official journal of the International Neural Network Society
The current dominant approach for neural speech enhancement is based on supervised learning using simulated training data. The trained models, however, often exhibit limited generalizability to real-recorded data. To address this limitation, this paper inves...
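
The supervised setup the abstract refers to is typically built by mixing clean speech with noise at a chosen signal-to-noise ratio to form (noisy input, clean target) training pairs. A minimal sketch of that simulation step (function and variable names are illustrative, not from the paper):

```python
import numpy as np

def mix_at_snr(clean: np.ndarray, noise: np.ndarray, snr_db: float):
    """Create a simulated noisy mixture by scaling noise to a target SNR."""
    noise = noise[: len(clean)]                      # align lengths
    clean_power = np.mean(clean ** 2) + 1e-12
    noise_power = np.mean(noise ** 2) + 1e-12
    # Scale noise so that 10*log10(clean_power / scaled_noise_power) == snr_db
    scale = np.sqrt(clean_power / (noise_power * 10 ** (snr_db / 10)))
    mixture = clean + scale * noise
    return mixture, clean                            # (model input, training target)

# Usage: pair a clean utterance with a noise clip at 5 dB SNR
rng = np.random.default_rng(0)
clean = rng.standard_normal(16000)   # stand-in for 1 s of 16 kHz speech
noise = rng.standard_normal(16000)
noisy, target = mix_at_snr(clean, noise, snr_db=5.0)
```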

Harnessing emotion and intonation in speech to improve robot acceptance.

Science robotics
The use of emotional words and expressive voices in robots alters the attribution of agency and experience by humans.

Building a Gender-Bias-Resistant Super Corpus as a Deep Learning Baseline for Speech Emotion Recognition.

Sensors (Basel, Switzerland)
The focus on Speech Emotion Recognition has dramatically increased in recent years, driven by the need for automatic speech-recognition-based systems and intelligent assistants to enhance user experience by incorporating emotional content. While deep...
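
A "gender-bias-resistant super corpus" implies merging several SER datasets while equalizing gender representation per emotion. One plausible way to do this, sketched below with an assumed item schema ('emotion', 'gender', 'path') that is illustrative rather than the paper's:

```python
import random
from collections import defaultdict

def gender_balanced_merge(corpora, seed=0):
    """Merge SER corpora, downsampling so each (emotion, gender) cell
    contains the same number of utterances."""
    cells = defaultdict(list)
    for corpus in corpora:
        for item in corpus:
            cells[(item["emotion"], item["gender"])].append(item)
    n = min(len(v) for v in cells.values())          # size of the smallest cell
    rng = random.Random(seed)
    balanced = []
    for items in cells.values():
        balanced.extend(rng.sample(items, n))        # downsample each cell to n
    return balanced
```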

Exploring emotional climate recognition in peer conversations through bispectral features and affect dynamics.

Computer methods and programs in biomedicine
BACKGROUND AND OBJECTIVE: Emotion recognition in conversations using artificial intelligence (AI) has gained significant attention due to its potential to provide insights into human social behavior. This study extends AI-based emotion recognition to...
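
The bispectrum is the standard third-order spectrum, B(f1, f2) = E[X(f1) X(f2) X*(f1 + f2)], estimated by averaging over signal frames. A generic FFT-based estimator (a sketch of the feature family the abstract names, not the paper's exact pipeline):

```python
import numpy as np

def bispectrum(x, frame_len=256, hop=128):
    """Direct (FFT-based) bispectrum estimate, averaged over frames:
    B(f1, f2) = E[ X(f1) * X(f2) * conj(X(f1 + f2)) ]."""
    frames = [x[i:i + frame_len] for i in range(0, len(x) - frame_len + 1, hop)]
    nf = frame_len // 2                              # keep non-negative frequencies
    acc = np.zeros((nf, nf), dtype=complex)
    f = np.arange(nf)
    for frame in frames:
        X = np.fft.fft(frame * np.hanning(frame_len))
        # Outer product gives X(f1)*X(f2); the conj term is indexed at f1+f2
        acc += np.outer(X[f], X[f]) * np.conj(X[(f[:, None] + f[None, :]) % frame_len])
    return acc / max(len(frames), 1)
```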

A multi-dilated convolution network for speech emotion recognition.

Scientific reports
Speech emotion recognition (SER) is an important application in Affective Computing and Artificial Intelligence. Recently, there has been a significant interest in Deep Neural Networks using speech spectrograms. As the two-dimensional representation ...
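
Multi-dilated convolution blocks are commonly built as parallel 2-D convolutions over the spectrogram with different dilation rates, concatenated along the channel axis to capture several receptive-field sizes at once. A hedged PyTorch sketch of that general pattern (not the paper's exact architecture):

```python
import torch
import torch.nn as nn

class MultiDilatedBlock(nn.Module):
    """Parallel 2-D convolutions with different dilation rates over a
    spectrogram, concatenated along channels."""
    def __init__(self, in_ch, out_ch, dilations=(1, 2, 4)):
        super().__init__()
        self.branches = nn.ModuleList(
            nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=d, dilation=d)
            for d in dilations
        )
        self.act = nn.ReLU()

    def forward(self, x):                  # x: (batch, channels, freq, time)
        return self.act(torch.cat([b(x) for b in self.branches], dim=1))

# Usage: one block on a batch of 128x128 log-mel spectrograms
spec = torch.randn(8, 1, 128, 128)
out = MultiDilatedBlock(1, 16)(spec)       # -> (8, 48, 128, 128)
```

Padding each branch by its dilation rate keeps all branch outputs the same spatial size, so they can be concatenated directly.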

Machine learning-assisted wearable sensing systems for speech recognition and interaction.

Nature communications
The human voice stands out for its rich information transmission capabilities. However, voice communication is susceptible to interference from noisy environments and obstacles. Here, we propose a wearable wireless flexible skin-attached acoustic sen...

A unified acoustic-to-speech-to-language embedding space captures the neural basis of natural language processing in everyday conversations.

Nature human behaviour
This study introduces a unified computational framework connecting acoustic, speech and word-level linguistic structures to study the neural basis of everyday conversations in the human brain. We used electrocorticography to record neural signals acr...
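
Studies of this kind typically quantify the embedding-to-brain mapping with linear encoding models: model embeddings for each word predict the neural response, scored by cross-validated R². A toy sketch of that standard analysis (data shapes and the sklearn-based setup are assumptions, not the paper's code):

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_score

# Toy stand-ins: per-word embeddings (e.g., from a speech/language model)
# and the response of one electrode at a fixed lag.
rng = np.random.default_rng(0)
embeddings = rng.standard_normal((500, 768))       # 500 words x 768 dims
neural = embeddings @ rng.standard_normal(768) * 0.1 + rng.standard_normal(500)

# Linear encoding model: how well do the embeddings predict the electrode?
scores = cross_val_score(Ridge(alpha=100.0), embeddings, neural,
                         cv=5, scoring="r2")
print("mean cross-validated R^2:", scores.mean())
```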

Linguistic cues for automatic assessment of Alzheimer's disease across languages.

Journal of Alzheimer's disease: JAD
BACKGROUND: Most common forms of dementia, including Alzheimer's disease, are associated with alterations in spoken language. OBJECTIVE: This study explores the potential of a speech-based machine learning (ML) approach in estimating cognitive impairment,...
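
Speech-based dementia screening commonly derives transcript-level linguistic cues (lexical diversity, utterance length, filler rate) and feeds them to a classifier. A small sketch of that feature-extraction step; the feature set here is illustrative, not the paper's:

```python
import re

def linguistic_cues(transcript: str) -> dict:
    """A few transcript-level cues of the kind used in speech-based
    dementia screening."""
    words = re.findall(r"[a-zA-Z']+", transcript.lower())
    sentences = [s for s in re.split(r"[.!?]+", transcript) if s.strip()]
    fillers = {"uh", "um", "er", "eh"}
    return {
        "type_token_ratio": len(set(words)) / max(len(words), 1),
        "mean_sentence_len": len(words) / max(len(sentences), 1),
        "filler_rate": sum(w in fillers for w in words) / max(len(words), 1),
    }

print(linguistic_cues("Um, the boy is... uh, the boy takes the cookie. He falls."))
```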

MemoCMT: multimodal emotion recognition using cross-modal transformer-based feature fusion.

Scientific reports
Speech emotion recognition has seen a surge in transformer models, which excel at understanding the overall message by analyzing long-term patterns in speech. However, these models come at a computational cost. In contrast, convolutional neural netwo...
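
Cross-modal transformer fusion usually means one modality's features attend over the other's via multi-head cross-attention before classification. A generic PyTorch sketch of that mechanism (dimensions and pooling are assumptions, not MemoCMT's exact design):

```python
import torch
import torch.nn as nn

class CrossModalFusion(nn.Module):
    """Text features attend over audio features via multi-head
    cross-attention; the fused vector feeds an emotion classifier."""
    def __init__(self, dim=256, heads=4, n_classes=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.head = nn.Linear(dim, n_classes)

    def forward(self, text_feats, audio_feats):
        # text_feats: (B, Lt, dim) queries; audio_feats: (B, La, dim) keys/values
        fused, _ = self.attn(text_feats, audio_feats, audio_feats)
        return self.head(fused.mean(dim=1))          # pool over text tokens

logits = CrossModalFusion()(torch.randn(2, 10, 256), torch.randn(2, 50, 256))
```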

Multi-source sparse broad transfer learning for Parkinson's disease diagnosis via speech.

Medical & biological engineering & computing
Diagnosing Parkinson's disease (PD) via speech is crucial for its non-invasive and convenient data collection. However, the small sample size of PD speech data impedes accurate recognition of PD speech. Therefore, we propose a novel multi-source spar...
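
A Broad Learning System, the base model behind "broad transfer learning", maps inputs through random feature nodes and nonlinear enhancement nodes, then solves a ridge-regularized output layer in closed form. A generic BLS sketch; the paper's multi-source sparse variant additionally weights source corpora and imposes sparsity:

```python
import numpy as np

def broad_learning_fit(X, Y, n_feature=40, n_enhance=60, reg=1e-2, seed=0):
    """Basic Broad Learning System: random feature nodes plus nonlinear
    enhancement nodes, with a ridge solution for the output weights."""
    rng = np.random.default_rng(seed)
    Wf = rng.standard_normal((X.shape[1], n_feature))
    Z = X @ Wf                                        # feature nodes
    We = rng.standard_normal((n_feature, n_enhance))
    H = np.tanh(Z @ We)                               # enhancement nodes
    A = np.hstack([Z, H])
    # Ridge-regularized pseudo-inverse for the output weights
    W = np.linalg.solve(A.T @ A + reg * np.eye(A.shape[1]), A.T @ Y)
    return (Wf, We, W)

def broad_learning_predict(model, X):
    Wf, We, W = model
    Z = X @ Wf
    return np.hstack([Z, np.tanh(Z @ We)]) @ W
```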