Speech - AI Medical Compendium

Mandarin Speech Reconstruction from Tongue Motion Ultrasound Images based on Generative Adversarial Networks.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference Jul 1, 2024

Speech impairment resulting from laryngectomy causes severe physiological and psychological distress to laryngectomee. In clinical practice, the upper vocal tract articulatory organs function normally in most laryngectomee. The potential to reconstru...

Ultrasonography Image Processing, Computer-Assisted Tongue Generative Adversarial Networks Motion Neural Networks, Computer Humans Language Speech

View on PubMed DOI

Emotion Recognition from Speech Signals by Mel-Spectrogram and a CNN-RNN.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference Jul 1, 2024

Speech emotion recognition (SER) in health applications can offer several benefits by providing insights into the emotional well-being of individuals. In this work, we propose a method for SER using time-frequency representation of the speech signals...

Algorithms Signal Processing, Computer-Assisted Speech Humans Emotions Neural Networks, Computer

View on PubMed DOI

Detecting Post-Stroke Aphasia Via Brain Responses to Speech in a Deep Learning Framework.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference Jul 1, 2024

Aphasia, a language disorder primarily caused by a stroke, is traditionally diagnosed using behavioral language tests. However, these tests are time-consuming, require manual interpretation by trained clinicians, suffer from low ecological validity, ...

Support Vector Machine Electroencephalography Aphasia Stroke Middle Aged Humans Deep Learning Speech Female Adult Aged Male Brain

View on PubMed DOI

Exploring Self-Supervised Models for Depressive Disorder Detection: A Study on Speech Corpora.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference Jul 1, 2024

Automatic detection of depressive disorder from speech signals can help improve medical diagnosis reliability. However, a significant challenge in this field is that most of the available depression datasets are relatively small, which limits the eff...

Support Vector Machine Bayes Theorem Signal Processing, Computer-Assisted Algorithms Speech Supervised Machine Learning Depressive Disorder Humans

View on PubMed DOI

Research on Tone Enhancement of Mandarin Pitch Controllable Electrolaryngeal Speech Based on Deep Learning.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference Jul 1, 2024

The deep learning-based electrolaryngeal (EL) voice conversion methods have achieved good results in non-tonal languages. However, the effectiveness in tonal languages, such as Mandarin Chinese (Mandarin), remains suboptimal. The reason may be that t...

Humans Language Deep Learning Signal Processing, Computer-Assisted Speech Speech Acoustics China Speech, Alaryngeal

View on PubMed DOI

EmoNet: Deep Learning-based Emotion Climate Recognition Using Peers' Conversational Speech, Affect Dynamics, and Physiological Data.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference Jul 1, 2024

Understanding the emotional dynamics within social interactions is crucial for meaningful interpretation. Despite progress in emotion recognition systems, recognizing the collective emotional climate among peers has been understudied. Addressing this...

Peer Group Heart Rate Galvanic Skin Response Humans Deep Learning Speech Emotions Signal Processing, Computer-Assisted

View on PubMed DOI

Enhancing Word-Level Imagined Speech BCI Through Heterogeneous Transfer Learning.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference Jul 1, 2024

In this study, we proposed a novel heterogeneous transfer learning approach named Focused Speech Feature Transfer Learning (FSFTL), aimed at enhancing the performance of electroencephalogram (EEG)-based word-level Imagined Speech (IS) Brain-Computer ...

Humans Machine Learning Signal Processing, Computer-Assisted Brain-Computer Interfaces Imagination Speech Algorithms Electroencephalography

View on PubMed DOI

First 'bilingual' brain-reading device decodes Spanish and English words.

Nature May 1, 2024

Multilingualism Humans Artificial Intelligence Brain England Prostheses and Implants Male Spain Speech

View on PubMed DOI

Digital Avatars and Personalized Voices-How AI Is Helping to Restore Speech to Patients.

JAMA Apr 16, 2024

Artificial Intelligence Voice Quality Voice Humans Speech Equipment and Supplies Avatar Aphasia

View on PubMed DOI

A unified beamforming and source separation model for static and dynamic human-robot interaction.

JASA express letters Mar 1, 2024

This paper presents a unified model for combining beamforming and blind source separation (BSS). The validity of the model's assumptions is confirmed by recovering target speech information in noise accurately using Oracle information. Using real sta...

Robotics Humans Speech Signal-To-Noise Ratio

View on PubMed DOI

AIMC Topic: Speech

Mandarin Speech Reconstruction from Tongue Motion Ultrasound Images based on Generative Adversarial Networks.

Emotion Recognition from Speech Signals by Mel-Spectrogram and a CNN-RNN.

Detecting Post-Stroke Aphasia Via Brain Responses to Speech in a Deep Learning Framework.

Exploring Self-Supervised Models for Depressive Disorder Detection: A Study on Speech Corpora.

Research on Tone Enhancement of Mandarin Pitch Controllable Electrolaryngeal Speech Based on Deep Learning.

EmoNet: Deep Learning-based Emotion Climate Recognition Using Peers' Conversational Speech, Affect Dynamics, and Physiological Data.

Enhancing Word-Level Imagined Speech BCI Through Heterogeneous Transfer Learning.

First 'bilingual' brain-reading device decodes Spanish and English words.

Digital Avatars and Personalized Voices-How AI Is Helping to Restore Speech to Patients.

A unified beamforming and source separation model for static and dynamic human-robot interaction.

Popular Topics

Recent Journals

AIMC Topic: Speech

Don't Miss the Future of Medicine

Popular Topics

Recent Journals