AIMC Topic: Speech

Showing 241 to 250 of 395 articles

Deep ANC: A deep learning approach to active noise control.

Neural networks : the official journal of the International Neural Network Society
Traditional active noise control (ANC) methods are based on adaptive signal processing with the least mean square algorithm as the foundation. They are linear systems and do not perform satisfactorily in the presence of nonlinear distortions. In this...

Combining a parallel 2D CNN with a self-attention Dilated Residual Network for CTC-based discrete speech emotion recognition.

Neural networks : the official journal of the International Neural Network Society
A challenging issue in the field of the automatic recognition of emotion from speech is the efficient modelling of long temporal contexts. Moreover, when incorporating long-term temporal dependencies between features, recurrent neural network (RNN) a...

Residual Neural Network precisely quantifies dysarthria severity-level based on short-duration speech segments.

Neural networks : the official journal of the International Neural Network Society
Recently, we have witnessed Deep Learning methodologies gaining significant attention for severity-based classification of dysarthric speech. Detecting dysarthria and quantifying its severity are of paramount importance in various real-life application...

Multi-Path and Group-Loss-Based Network for Speech Emotion Recognition in Multi-Domain Datasets.

Sensors (Basel, Switzerland)
Speech emotion recognition (SER) is a natural method of recognizing individual emotions in everyday life. To distribute SER models to real-world applications, some key challenges must be overcome, such as the lack of datasets tagged with emotion labe...

Human cortical encoding of pitch in tonal and non-tonal languages.

Nature communications
Languages can use a common repertoire of vocal sounds to signify distinct meanings. In tonal languages, such as Mandarin Chinese, pitch contours of syllables distinguish one word from another, whereas in non-tonal languages, such as English, pitch is...

Biosignal Sensors and Deep Learning-Based Speech Recognition: A Review.

Sensors (Basel, Switzerland)
Voice is one of the essential mechanisms for communicating and expressing one's intentions as a human being. There are several causes of voice inability, including disease, accident, vocal abuse, medical surgery, ageing, and environmental pollution, ...

Emotion Detection for Social Robots Based on NLP Transformers and an Emotion Ontology.

Sensors (Basel, Switzerland)
For social robots, knowledge regarding human emotional states is an essential part of adapting their behavior or associating emotions to other entities. Robots gather the information from which emotion detection is processed via different media, such...

Toward Using Twitter for Tracking COVID-19: A Natural Language Processing Pipeline and Exploratory Data Set.

Journal of medical Internet research
BACKGROUND: In the United States, the rapidly evolving COVID-19 outbreak, the shortage of available testing, and the delay of test results present challenges for actively monitoring its spread based on testing alone.

Preictal state detection using prodromal symptoms: A machine learning approach.

Epilepsia
A reliable identification of a high-risk state for upcoming seizures may allow for preemptive treatment and improve the quality of patients' lives. We evaluated the ability of prodromal symptoms to predict preictal states using a machine learning (ML...

Stacked DeBERT: All attention in incomplete data for text classification.

Neural networks : the official journal of the International Neural Network Society
In this paper, we propose Stacked DeBERT, short for Stacked Denoising Bidirectional Encoder Representations from Transformers. This novel model improves robustness in incomplete data, when compared to existing systems, by designing a novel encoding sc...