Speech - AI Medical Compendium

Improving Speech Emotion Recognition With Adversarial Data Augmentation Network.

IEEE transactions on neural networks and learning systems Jan 5, 2022

When training data are scarce, it is challenging to train a deep neural network without causing the overfitting problem. For overcoming this challenge, this article proposes a new data augmentation network-namely adversarial data augmentation network...

Speech Emotions Entropy Neural Networks, Computer Machine Learning

View on PubMed DOI

Natural Language Processing markers in first episode psychosis and people at clinical high-risk.

Translational psychiatry Dec 13, 2021

Recent work has suggested that disorganised speech might be a powerful predictor of later psychotic illness in clinical high risk subjects. To that end, several automated measures to quantify disorganisation of transcribed speech have been proposed. ...

Natural Language Processing Humans Psychotic Disorders Biomarkers Speech Cognition

View on PubMed DOI

Generalisation Gap of Keyword Spotters in a Cross-Speaker Low-Resource Scenario.

Sensors (Basel, Switzerland) Dec 12, 2021

Models for keyword spotting in continuous recordings can significantly improve the experience of navigating vast libraries of audio recordings. In this paper, we describe the development of such a keyword spotting system detecting regions of interest...

Acoustics Data Curation Language Speech Humans Neural Networks, Computer

View on PubMed DOI

Presentation Attack Detection on Limited-Resource Devices Using Deep Neural Classifiers Trained on Consistent Spectrogram Fragments.

Sensors (Basel, Switzerland) Nov 20, 2021

The presented paper is concerned with detection of presentation attacks against unsupervised remote biometric speaker verification, using a well-known challenge-response scheme. We propose a novel approach to convolutional phoneme classifier training...

Language Speech Neural Networks, Computer Databases, Factual

View on PubMed DOI

Multimodal Emotion Recognition on RAVDESS Dataset Using Transfer Learning.

Sensors (Basel, Switzerland) Nov 18, 2021

Emotion Recognition is attracting the attention of the research community due to the multiple areas where it can be applied, such as in healthcare or in road safety systems. In this paper, we propose a multimodal emotion recognition system that relie...

Emotions Learning Speech Neural Networks, Computer Machine Learning

View on PubMed DOI

The Impact of Attention Mechanisms on Speech Emotion Recognition.

Sensors (Basel, Switzerland) Nov 12, 2021

Speech emotion recognition (SER) plays an important role in real-time applications of human-machine interaction. The Attention Mechanism is widely used to improve the performance of SER. However, the applicable rules of attention mechanism are not de...

Humans Neural Networks, Computer Speech Emotions Perception

View on PubMed DOI

Application of Neural Network Algorithm Based on Principal Component Image Analysis in Band Expansion of College English Listening.

Computational intelligence and neuroscience Nov 12, 2021

With the development of information technology, band expansion technology is gradually applied to college English listening teaching. This technology aims to recover broadband speech signals from narrowband speech signals with a limited frequency ban...

Speech Principal Component Analysis Algorithms Humans Language Neural Networks, Computer

View on PubMed DOI

Audio-Driven Robot Upper-Body Motion Synthesis.

IEEE transactions on cybernetics Nov 9, 2021

Body language is an important aspect of human communication, which an effective human-robot interaction interface should mimic well. Human beings exchange information and convey their thoughts and feelings through gaze, facial expressions, body langu...

Humans Speech Hand Facial Expression Gestures Robotics

View on PubMed DOI

Environmental sound classification using temporal-frequency attention based convolutional neural network.

Scientific reports Nov 3, 2021

Environmental sound classification is one of the important issues in the audio recognition field. Compared with structured sounds such as speech and music, the time-frequency structure of environmental sounds is more complicated. In order to learn ti...

Music Recognition, Psychology Sound Attention Motivation Semantics Algorithms Noise Research Design Acoustics Models, Theoretical Learning Models, Statistical Speech Humans Neural Networks, Computer

View on PubMed DOI

Application of Deep Learning Models for Automated Identification of Parkinson's Disease: A Review (2011-2021).

Sensors (Basel, Switzerland) Oct 23, 2021

Parkinson's disease (PD) is the second most common neurodegenerative disorder affecting over 6 million people globally. Although there are symptomatic treatments that can increase the survivability of the disease, there are no curative treatments. Th...

Parkinson Disease Gait Deep Learning Speech Humans Artificial Intelligence

View on PubMed DOI

AIMC Topic: Speech

Improving Speech Emotion Recognition With Adversarial Data Augmentation Network.

Natural Language Processing markers in first episode psychosis and people at clinical high-risk.

Generalisation Gap of Keyword Spotters in a Cross-Speaker Low-Resource Scenario.

Presentation Attack Detection on Limited-Resource Devices Using Deep Neural Classifiers Trained on Consistent Spectrogram Fragments.

Multimodal Emotion Recognition on RAVDESS Dataset Using Transfer Learning.

The Impact of Attention Mechanisms on Speech Emotion Recognition.

Application of Neural Network Algorithm Based on Principal Component Image Analysis in Band Expansion of College English Listening.

Audio-Driven Robot Upper-Body Motion Synthesis.

Environmental sound classification using temporal-frequency attention based convolutional neural network.

Application of Deep Learning Models for Automated Identification of Parkinson's Disease: A Review (2011-2021).

Popular Topics

Recent Journals

AIMC Topic: Speech

Don't Miss the Future of Medicine

Popular Topics

Recent Journals