AIMC Topic: Speech

Clear Filters Showing 301 to 310 of 368 articles

Context-aware data augmentation for enhanced speech command recognition in industrial environments.

Scientific reports
In Human-Robot Interaction, speech is one of the most intuitive and effective communication channel. In Industry 4.0, speech-based communication can significantly enhance productivity and efficiency on production lines. However, deploying a Speech Co...

A Review of Challenges in Speech-Based Conversational AI for Elderly Care.

Studies in health technology and informatics
Artificially intelligent systems optimized for speech conversation are appearing at a fast pace. Such models are interesting from a healthcare perspective, as these voice-controlled assistants may support the elderly and enable remote health monitori...

Using Gesture and Speech to Control Surgical Lighting Systems: Mixed Methods Study.

JMIR human factors
BACKGROUND: Surgical lighting systems (SLSs) provide optimal lighting conditions for operating room personnel. Current systems are mainly adjusted by hand; surgeons either accommodate the light themselves or communicate their requirements to an assis...

Hearing vocals to recognize schizophrenia: speech discriminant analysis with fusion of emotions and features based on deep learning.

BMC psychiatry
BACKGROUND AND OBJECTIVE: Accurate detection of schizophrenia poses a grand challenge as a complex and heterogeneous mental disorder. Current diagnostic criteria rely primarily on clinical symptoms, which may not fully capture individual differences ...

[Neural network for auditory speech enhancement featuring feedback-driven attention and lateral inhibition].

Sheng wu yi xue gong cheng xue za zhi = Journal of biomedical engineering = Shengwu yixue gongchengxue zazhi
The processing mechanism of the human brain for speech information is a significant source of inspiration for the study of speech enhancement technology. Attention and lateral inhibition are key mechanisms in auditory information processing that can ...

Speech Detection via Respiratory Inductance Plethysmography, Thoracic Impedance, Accelerometers, and Gyroscopes: A Machine Learning-Informed Comparative Study.

Psychophysiology
Speech production interferes with the measurement of changes in cardiac vagal activity during acute stress by attenuating the expected drop in heart rate variability. Speech also induces cardiac sympathetic changes similar to those induced by psychol...

Audio-visual source separation with localization and individual control.

PloS one
The growing reliance on video conferencing software brings significant benefits but also introduces challenges, particularly in managing audio quality. In multi-participant settings, ambient noise and interruptions can hinder speaker recognition and ...

The JIBO Kids Corpus: A speech dataset of child-robot interactions in a classroom environment.

JASA express letters
This paper describes an original dataset of children's speech, collected through the use of JIBO, a social robot. The dataset encompasses recordings from 110 children, aged 4-7 years old, who participated in a letter and digit identification task and...

Artificial intelligence classifies primary progressive aphasia from connected speech.

Brain : a journal of neurology
Neurodegenerative dementia syndromes, such as primary progressive aphasias (PPA), have traditionally been diagnosed based, in part, on verbal and non-verbal cognitive profiles. Debate continues about whether PPA is best divided into three variants an...

Personalised Speech-Based PTSD Prediction Using Weighted-Instance Learning.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
Post-traumatic stress disorder (PTSD) is a prevalent disorder that can develop in people who have experienced very stressful, shocking, or distressing events. It has great influence on peoples' daily life and can affect their mental, physical, or soc...