AIMC Topic: Speech

Clear Filters Showing 111 to 120 of 395 articles

Improving span-based Aspect Sentiment Triplet Extraction with part-of-speech filtering and contrastive learning.

Neural networks : the official journal of the International Neural Network Society
Aspect Sentiment Triple Extraction (ASTE), a subtask of fine-grained sentiment analysis, aims to extract aspect terms, opinion terms, and their corresponding sentiment polarities from sentences. Previous methods often enumerated all possible spans of...

Machine learning decoding of single neurons in the thalamus for speech brain-machine interfaces.

Journal of neural engineering
. Our goal is to decode firing patterns of single neurons in the left ventralis intermediate nucleus (Vim) of the thalamus, related to speech production, perception, and imagery. For realistic speech brain-machine interfaces (BMIs), we aim to charact...

Crossmixed convolutional neural network for digital speech recognition.

PloS one
Digital speech recognition is a challenging problem that requires the ability to learn complex signal characteristics such as frequency, pitch, intensity, timbre, and melody, which traditional methods often face issues in recognizing. This article in...

Improving speech depression detection using transfer learning with wav2vec 2.0 in low-resource environments.

Scientific reports
Depression, a pervasive global mental disorder, profoundly impacts daily lives. Despite numerous deep learning studies focused on depression detection through speech analysis, the shortage of annotated bulk samples hampers the development of effectiv...

Imagined speech classification exploiting EEG power spectrum features.

Medical & biological engineering & computing
Imagined speech recognition has developed as a significant topic of research in the field of brain-computer interfaces. This innovative technique has great promise as a communication tool, providing essential help to those with impairments. An imagin...

Spatial reconstructed local attention Res2Net with F0 subband for fake speech detection.

Neural networks : the official journal of the International Neural Network Society
The rhythm of bonafide speech is often difficult to replicate, which causes that the fundamental frequency (F0) of synthetic speech is significantly different from that of real speech. It is expected that the F0 feature contains the discriminative in...

Validation of Machine Learning-Based Assessment of Major Depressive Disorder from Paralinguistic Speech Characteristics in Routine Care.

Depression and anxiety
New developments in machine learning-based analysis of speech can be hypothesized to facilitate the long-term monitoring of major depressive disorder (MDD) during and after treatment. To test this hypothesis, we collected 550 speech samples from tele...

Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model.

Journal of affective disorders
BACKGROUND: Prior research has associated spoken language use with depression, yet studies often involve small or non-clinical samples and face challenges in the manual transcription of speech. This paper aimed to automatically identify depression-re...

Toward Generalizable Machine Learning Models in Speech, Language, and Hearing Sciences: Estimating Sample Size and Reducing Overfitting.

Journal of speech, language, and hearing research : JSLHR
PURPOSE: Many studies using machine learning (ML) in speech, language, and hearing sciences rely upon cross-validations with single data splitting. This study's first purpose is to provide quantitative evidence that would incentivize researchers to i...

Decoding Single and Paired Phonemes Using 7T Functional MRI.

Brain topography
Several studies have shown that mouth movements related to the pronunciation of individual phonemes are represented in the sensorimotor cortex. This would theoretically allow for brain computer interfaces that are capable of decoding continuous speec...