AIMC Topic: Speech

Clear Filters Showing 341 to 350 of 368 articles

A two-stage deep learning algorithm for talker-independent speaker separation in reverberant conditions.

The Journal of the Acoustical Society of America
Speaker separation is a special case of speech separation, in which the mixture signal comprises two or more speakers. Many talker-independent speaker separation methods have been introduced in recent years to address this problem in anechoic conditi...

Voice Command Recognition Using Biologically Inspired Time-Frequency Representation and Convolutional Neural Networks.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
Voice command is an important interface between human and technology in healthcare, such as for hands-free control of surgical robots and in patient care technology. Voice command recognition can be cast as a speech classification task, where convolu...

Parkinson's Disease Classification using Pitch Synchronous Speech Segments and Fine Gaussian Kernels based SVM.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
Researchers have been using signal processing based methods to assess speech from Parkinson's disease (PD) patients and identify the contrasting features in comparison to speech from healthy controls (HC). The methodologies follow conventional approa...

Spoken words as biomarkers: using machine learning to gain insight into communication as a predictor of anxiety.

Journal of the American Medical Informatics Association : JAMIA
OBJECTIVE: The goal of this study was to explore whether features of recorded and transcribed audio communication data extracted by machine learning algorithms can be used to train a classifier for anxiety.

EARSHOT: A Minimal Neural Network Model of Incremental Human Speech Recognition.

Cognitive science
Despite the lack of invariance problem (the many-to-many mapping between acoustics and percepts), human listeners experience phonetic constancy and typically perceive what a speaker intends. Most models of human speech recognition (HSR) have side-ste...

Artificial Intelligence, Speech, and Language Processing Approaches to Monitoring Alzheimer's Disease: A Systematic Review.

Journal of Alzheimer's disease : JAD
BACKGROUND: Language is a valuable source of clinical information in Alzheimer's disease, as it declines concurrently with neurodegeneration. Consequently, speech and language data have been extensively studied in connection with its diagnosis.

Decoding Speech from Single Trial MEG Signals Using Convolutional Neural Networks and Transfer Learning.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
Decoding speech directly from the brain has the potential for the development of the next generation, more efficient brain computer interfaces (BCIs) to assist in the communication of patients with locked-in syndrome (fully paralyzed but aware). In t...

A Comparative Study of Features for Acoustic Cough Detection Using Deep Architectures.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
Automatic cough detection is key to tracking the condition of patients suffering from tuberculosis. We evaluate various acoustic features for performing cough detection using deep architectures. As most previous studies have adopted features designed...

Differentiating post-cancer from healthy tongue muscle coordination patterns during speech using deep learning.

The Journal of the Acoustical Society of America
The ability to differentiate post-cancer from healthy tongue muscle coordination patterns is necessary for the advancement of speech motor control theories and for the development of therapeutic and rehabilitative strategies. A deep learning approach...

Live human-robot interactive public demonstrations with automatic emotion and personality prediction.

Philosophical transactions of the Royal Society of London. Series B, Biological sciences
Communication with humans is a multi-faceted phenomenon where the emotions, personality and non-verbal behaviours, as well as the verbal behaviours, play a significant role, and human-robot interaction (HRI) technologies should respect this complexit...