Speech is one of the most efficient methods of communication among humans, inspiring advancements in machine speech processing under Natural Language Processing (NLP). This field aims to enable computers to analyze, comprehend, and generate human lan...
This paper introduces the ArL2Eng dataset, a speech corpus of L2 English produced by native speakers of Arabic, and highlights its potential in supporting research into automated language assessment. ArL2Eng comprises audio sequences from speakers of...
Speech imagery is a nascent paradigm that is receiving widespread attention in current brain-computer interface (BCI) research. By collecting the electroencephalogram (EEG) data generated when imagining the pronunciation of a sentence or word in huma...
Hope plays a crucial role in human psychology and well-being, yet its expression and detection across languages remain underexplored in natural language processing (NLP). This study presents MIND-HOPE, the first-ever multiclass hope speech detection ...
Effective speech emotion recognition (SER) poses a significant challenge due to the intricate and subjective nature of human emotions. Recognizing emotional states accurately from speech signals has a broad spectrum of practical applications, such as...
Recognizing individual variability is essential for developing targeted, personalized medical interventions. Vocal fatigue is a prevalent symptom and complaint among occupational voice users, but its identification has yielded mixed results. Vocal fa...
Psychosis poses substantial social and healthcare burdens. The analysis of speech is a promising approach for the diagnosis and monitoring of psychosis, capturing symptoms like thought disorder and flattened affect. Recent advancements in Natural Lan...
Speaker diarization, identifying "who spoke when," plays a vital role in speech transcription, supervised fine-tuning of large language models, conversational AI, and audio content analysis by providing labeled speaker segments. Traditional speaker d...
Evaluating tone pronunciation is essential for helping second-language (L2) learners master the intricate nuances of Mandarin tones. This article introduces an innovative automatic evaluation method for Mandarin tone pronunciation that employs a Siam...
The integration of machine learning (ML) and deep learning models in suicide risk assessment has advanced significantly in recent years. In this study, we utilized ML in a case-control design to predict completed suicides using publicly available,...