AIMC Topic: Speech

Showing 91 to 100 of 368 articles

Spatial reconstructed local attention Res2Net with F0 subband for fake speech detection.

Neural Networks: the official journal of the International Neural Network Society
The rhythm of bonafide speech is often difficult to replicate, which causes the fundamental frequency (F0) of synthetic speech to differ significantly from that of real speech. It is expected that the F0 feature contains the discriminative in...
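The F0 feature the abstract refers to can be illustrated with a minimal sketch: a toy autocorrelation-based F0 estimator run on a synthetic tone. This is not the paper's subband method (real anti-spoofing systems use robust trackers such as YIN); the function name and parameters here are illustrative assumptions.

```python
# Toy F0 estimation by autocorrelation -- illustrative only, not the
# paper's F0-subband Res2Net pipeline.
import numpy as np

def estimate_f0(frame: np.ndarray, sr: int, fmin: float = 50.0,
                fmax: float = 500.0) -> float:
    """Return the lag of the strongest autocorrelation peak as an F0 guess."""
    frame = frame - frame.mean()
    # One-sided autocorrelation: ac[k] measures self-similarity at lag k.
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    lo, hi = int(sr / fmax), int(sr / fmin)   # restrict to plausible pitch lags
    lag = lo + int(np.argmax(ac[lo:hi]))
    return sr / lag

sr = 16000
t = np.arange(int(0.05 * sr)) / sr            # one 50 ms frame
tone = np.sin(2 * np.pi * 120.0 * t)          # synthetic 120 Hz "voiced" signal
print(round(estimate_f0(tone, sr), 1))        # close to 120 Hz
```

A spoofing detector would compute such F0 trajectories per frame and feed their statistics (or subband representations, as in the paper) to a classifier.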

Validation of Machine Learning-Based Assessment of Major Depressive Disorder from Paralinguistic Speech Characteristics in Routine Care.

Depression and anxiety
New developments in machine learning-based analysis of speech can be hypothesized to facilitate the long-term monitoring of major depressive disorder (MDD) during and after treatment. To test this hypothesis, we collected 550 speech samples from tele...

Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model.

Journal of affective disorders
BACKGROUND: Prior research has associated spoken language use with depression, yet studies often involve small or non-clinical samples and face challenges in the manual transcription of speech. This paper aimed to automatically identify depression-re...
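The topic-extraction step the abstract describes can be sketched with classical LDA on toy transcripts. Note this is a stand-in: the study pairs an automatic speech recognition system with a deep learning topic model, not sklearn's LDA, and the example transcripts below are invented.

```python
# Illustrative topic extraction from speech transcripts using classical
# LDA (the paper uses a deep topic model; this is only a sketch).
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer

transcripts = [                      # invented stand-ins for ASR output
    "slept badly and felt tired all day",
    "could not sleep and felt tired again",
    "enjoyed a walk with friends in the park",
    "met friends for a long walk today",
]
X = CountVectorizer(stop_words="english").fit_transform(transcripts)
lda = LatentDirichletAllocation(n_components=2, random_state=0).fit(X)
doc_topics = lda.transform(X)        # per-transcript topic mixture, rows sum to 1
print(doc_topics.shape)
```

Downstream, the per-document topic mixtures (rather than raw words) are what get correlated with depression measures.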

Toward Generalizable Machine Learning Models in Speech, Language, and Hearing Sciences: Estimating Sample Size and Reducing Overfitting.

Journal of Speech, Language, and Hearing Research: JSLHR
PURPOSE: Many studies using machine learning (ML) in speech, language, and hearing sciences rely upon cross-validations with single data splitting. This study's first purpose is to provide quantitative evidence that would incentivize researchers to i...
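The contrast the abstract draws, a single data split versus repeated cross-validation, can be sketched as follows. The data and model are illustrative stand-ins, not the study's.

```python
# Repeated stratified k-fold CV instead of a single train/test split:
# 5 folds x 10 repeats = 50 accuracy estimates, whose spread exposes
# the variance a single split would hide.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import RepeatedStratifiedKFold, cross_val_score

X, y = make_classification(n_samples=200, n_features=10, random_state=0)
clf = LogisticRegression(max_iter=1000)

cv = RepeatedStratifiedKFold(n_splits=5, n_repeats=10, random_state=0)
scores = cross_val_score(clf, X, y, cv=cv)
print(f"mean={scores.mean():.3f} sd={scores.std():.3f} n={len(scores)}")
```

Reporting the mean and standard deviation over all 50 folds, rather than one split's score, is the kind of practice the article argues for.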

Decoding Single and Paired Phonemes Using 7T Functional MRI.

Brain topography
Several studies have shown that mouth movements related to the pronunciation of individual phonemes are represented in the sensorimotor cortex. This would theoretically allow for brain computer interfaces that are capable of decoding continuous speec...

Speech emotion analysis using convolutional neural network (CNN) and gamma classifier-based error correcting output codes (ECOC).

Scientific reports
Speech emotion analysis is one of the most basic requirements for the evolution of Artificial Intelligence (AI) in the field of human-machine interaction. Accurate emotion recognition in speech can be effective in applications such as online support,...
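The ECOC component named in the title can be sketched with sklearn's `OutputCodeClassifier`. This is only an illustration of the coding scheme: the paper pairs ECOC with CNN features and a gamma classifier, neither of which appears here, and the four toy classes merely stand in for emotion labels.

```python
# Error-correcting output codes (ECOC) for multi-class labels -- a sketch.
# Base learner and data are stand-ins; the paper uses a gamma classifier
# on CNN-derived speech features.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.multiclass import OutputCodeClassifier

# Toy features with 4 classes standing in for e.g. happy/sad/angry/neutral.
X, y = make_classification(n_samples=400, n_features=20, n_informative=8,
                           n_classes=4, random_state=0)

# code_size=2.0: each class gets a binary codeword twice as long as the
# number of classes; prediction picks the class with the nearest codeword,
# so a few wrong binary decisions can still be "corrected".
ecoc = OutputCodeClassifier(LogisticRegression(max_iter=1000),
                            code_size=2.0, random_state=0)
ecoc.fit(X, y)
print(f"training accuracy: {ecoc.score(X, y):.3f}")
```

The redundancy in the codewords is what gives ECOC its robustness to individual binary-classifier errors.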

Reliability and validity of a widely-available AI tool for assessment of stress based on speech.

Scientific reports
Cigna's online stress management toolkit includes an AI-based tool that purports to evaluate a person's psychological stress level based on analysis of their speech, the Cigna StressWaves Test (CSWT). In this study, we evaluate the claim that the CSW...

Towards audio-based identification of Ethio-Semitic languages using recurrent neural network.

Scientific reports
In recent times, there has been increasing interest in employing technology to process natural language with the aim of providing information that can benefit society. Language identification refers to the process of detecting which speech a speaker app...

A Korean emotion-factor dataset for extracting emotion and factors in Korean conversations.

Scientific reports
Humans express their emotions in various ways, such as through facial expressions and voices. In particular, emotions are directly expressed or indirectly implied in the text of utterances. Research on the technology to identify emotions included in h...

Speech extraction from vibration signals based on deep learning.

PloS one
Extracting speech information from vibration response signals is a typical system identification problem, and the traditional method is too sensitive to deviations in model parameters, noise, boundary conditions, and position. A method was propo...