AIMC Topic: Speech

Clear Filters Showing 51 to 60 of 368 articles

Computing nasalance with MFCCs and Convolutional Neural Networks.

PloS one
Nasalance is a valuable clinical biomarker for hypernasality. It is computed as the ratio of acoustic energy emitted through the nose to the total energy emitted through the mouth and nose (eNasalance). A new approach is proposed to compute nasalance...

Momentary Depression Severity Prediction in Patients With Acute Depression Who Undergo Sleep Deprivation Therapy: Speech-Based Machine Learning Approach.

JMIR mental health
BACKGROUND: Mobile devices for remote monitoring are inevitable tools to support treatment and patient care, especially in recurrent diseases such as major depressive disorder. The aim of this study was to learn if machine learning (ML) models based ...

Early detection of high blood pressure from natural speech sounds with graph diffusion network.

Computers in biology and medicine
This study presents an innovative approach to cuffless blood pressure prediction by integrating speech and demographic features. With a focus on non-invasive monitoring, especially in remote regions, our model harnesses speech signals and demographic...

A pilot study for speech assessment to detect the severity of Parkinson's disease: An ensemble approach.

Computers in biology and medicine
BACKGROUND: Changes in voice are a symptom of Parkinson's disease and used to assess the progression of the condition. However, natural differences in the voices of people can make this challenging. Computerized binary speech classification can ident...

Speech-based personality prediction using deep learning with acoustic and linguistic embeddings.

Scientific reports
This study introduces a novel method for predicting the Big Five personality traits through the analysis of speech samples, advancing the field of computational personality assessment. We collected data from 2045 participants who completed a self-rep...

Multimodal machine learning for language and speech markers identification in mental health.

BMC medical informatics and decision making
BACKGROUND: There are numerous papers focusing on diagnosing mental health disorders using unimodal and multimodal approaches. However, our literature review shows that the majority of these studies either use unimodal approaches to diagnose a variet...

Detection of hate: speech tweets based convolutional neural network and machine learning algorithms.

Scientific reports
There is no doubt that social media sites have provided many benefits to humanity, such as sharing information continuously and communicating with others easily. It also seems that social media sites have many advantages, but in addition to these adv...

Deep temporal representation learning for language identification.

Neural networks : the official journal of the International Neural Network Society
Language identification (LID) is a key component in downstream tasks. Recently, the self-supervised speech representation learned by Wav2Vec 2.0 (W2V2) has been demonstrated to be very effective for various speech-related tasks. In LID, it is commonl...

Roman urdu hate speech detection using hybrid machine learning models and hyperparameter optimization.

Scientific reports
With the rapid increase of users over social media, cyberbullying, and hate speech problems have arisen over the past years. Automatic hate speech detection (HSD) from text is an emerging research problem in natural language processing (NLP). Researc...

The voice of depression: speech features as biomarkers for major depressive disorder.

BMC psychiatry
BACKGROUND: Psychiatry faces a challenge due to the lack of objective biomarkers, as current assessments are based on subjective evaluations. Automated speech analysis shows promise in detecting symptom severity in depressed patients. This project ai...