AIMC Topic: Speech

Showing 31 to 40 of 395 articles

Prediction of suicide using web based voice recordings analyzed by artificial intelligence.

Scientific reports
The integration of machine learning (ML) and deep learning models in suicide risk assessment has advanced significantly in recent years. In this study, we used ML in a case-control design to predict completed suicides using publicly available,...

LSTM autoencoder based parallel architecture for deepfake audio detection with dynamic residual encoding and feature fusion.

Scientific reports
With the rapid advancement of synthetic speech technologies, detecting deepfake audio has become essential for preventing impersonation and misinformation. This study aims to enhance detection performance by addressing limitations in existing models,...

End-to-end feature fusion for jointly optimized speech enhancement and automatic speech recognition.

Scientific reports
Speech enhancement (SE) and automatic speech recognition (ASR) in real-time processing involve improving the quality and intelligibility of speech signals on the fly, ensuring accurate transcription as the speech unfolds. SE eliminates unwanted backg...

MS-EmoBoost: a novel strategy for enhancing self-supervised speech emotion representations.

Scientific reports
Extracting richer emotional representations from raw speech is one of the key approaches to improving the accuracy of Speech Emotion Recognition (SER). In recent years, there has been a trend in utilizing self-supervised learning (SSL) for extracting...

A novel speech signal feature extraction technique to detect speech impairment in children accurately.

Computers in biology and medicine
Speech signal processing and extracting useful information from speech signals are necessary for speech language impairment (SLI) detection in children. Although different features have been suggested for SLI detection, there is still scope for ...

Speech imagery brain-computer interfaces: a systematic literature review.

Journal of neural engineering
Speech Imagery (SI) refers to the mental experience of hearing speech and may be the core of verbal thinking for people who undergo internal monologues. It belongs to the set of possible mental imagery states that produce kinesthetic experiences whos...

A novel Swin transformer based framework for speech recognition for dysarthria.

Scientific reports
Dysarthria frequently occurs in individuals with disorders such as stroke, Parkinson's disease, cerebral palsy, and other neurological disorders. Well-timed detection and management of dysarthria in these patients is imperative for efficiently handli...

AI-powered remote monitoring of brain responses to clear and incomprehensible speech via speckle pattern analysis.

Journal of biomedical optics
SIGNIFICANCE: Functional magnetic resonance imaging provides high spatial resolution but is limited by cost, infrastructure, and the constraints of an enclosed scanner. Portable methods such as functional near-infrared spectroscopy and electroencepha...

An EEG-based imagined speech recognition using CSP-TP feature fusion for enhanced BCI communication.

Behavioural brain research
BACKGROUND: Imagined speech has emerged as a promising paradigm for intuitive control of brain-computer interface (BCI)-based communication systems, providing a means of communication for individuals with severe brain disabilities. In this work, a no...

Feature and classifier-level domain adaptation in DistilHuBERT for cross-corpus speech emotion recognition.

Computers in biology and medicine
Cross-corpus speech emotion recognition (CCSER) aims to develop robust models capable of accurately identifying a speaker's emotional state across diverse datasets. This task is challenged by variations in dataset characteristics, such as differences...