AI Medical Compendium Topic

Explore the latest research on artificial intelligence and machine learning in medicine.

Speech

Natural Language Processing markers in first episode psychosis and people at clinical high-risk.

Translational psychiatry
Recent work has suggested that disorganised speech might be a powerful predictor of later psychotic illness in clinical high-risk subjects. To that end, several automated measures to quantify disorganisation of transcribed speech have been proposed. ...

Generalisation Gap of Keyword Spotters in a Cross-Speaker Low-Resource Scenario.

Sensors (Basel, Switzerland)
Models for keyword spotting in continuous recordings can significantly improve the experience of navigating vast libraries of audio recordings. In this paper, we describe the development of such a keyword spotting system detecting regions of interest...

Presentation Attack Detection on Limited-Resource Devices Using Deep Neural Classifiers Trained on Consistent Spectrogram Fragments.

Sensors (Basel, Switzerland)
The presented paper is concerned with detection of presentation attacks against unsupervised remote biometric speaker verification, using a well-known challenge-response scheme. We propose a novel approach to convolutional phoneme classifier training...

Multimodal Emotion Recognition on RAVDESS Dataset Using Transfer Learning.

Sensors (Basel, Switzerland)
Emotion Recognition is attracting the attention of the research community due to the multiple areas where it can be applied, such as in healthcare or in road safety systems. In this paper, we propose a multimodal emotion recognition system that relie...

The Impact of Attention Mechanisms on Speech Emotion Recognition.

Sensors (Basel, Switzerland)
Speech emotion recognition (SER) plays an important role in real-time applications of human-machine interaction. The Attention Mechanism is widely used to improve the performance of SER. However, the applicable rules of attention mechanism are not de...

Application of Neural Network Algorithm Based on Principal Component Image Analysis in Band Expansion of College English Listening.

Computational intelligence and neuroscience
With the development of information technology, band expansion technology is gradually applied to college English listening teaching. This technology aims to recover broadband speech signals from narrowband speech signals with a limited frequency ban...

Audio-Driven Robot Upper-Body Motion Synthesis.

IEEE transactions on cybernetics
Body language is an important aspect of human communication, which an effective human-robot interaction interface should mimic well. Human beings exchange information and convey their thoughts and feelings through gaze, facial expressions, body langu...

Environmental sound classification using temporal-frequency attention based convolutional neural network.

Scientific reports
Environmental sound classification is one of the important issues in the audio recognition field. Compared with structured sounds such as speech and music, the time-frequency structure of environmental sounds is more complicated. In order to learn ti...

Application of Deep Learning Models for Automated Identification of Parkinson's Disease: A Review (2011-2021).

Sensors (Basel, Switzerland)
Parkinson's disease (PD) is the second most common neurodegenerative disorder affecting over 6 million people globally. Although there are symptomatic treatments that can increase the survivability of the disease, there are no curative treatments. Th...

One-dimensional convolutional neural network and hybrid deep-learning paradigm for classification of specific language impaired children using their speech.

Computer methods and programs in biomedicine
BACKGROUND AND OBJECTIVE: Screening children for communicational disorders such as specific language impairment (SLI) is always challenging as it requires clinicians to follow a series of steps to evaluate the subjects. Artificial intelligence and co...