AIMC Topic: Speech

Clear Filters Showing 141 to 150 of 395 articles

Evidence of a predictive coding hierarchy in the human brain listening to speech.

Nature human behaviour
Considerable progress has recently been made in natural language processing: deep learning algorithms are increasingly able to generate, summarize, translate and classify texts. Yet, these language models still fail to match the language abilities of...

Robot leadership-Investigating human perceptions and reactions towards social robots showing leadership behaviors.

PloS one
Human-robot interaction research has shown that social robots can interact with humans in complex social situations and display leadership-related behaviors. Therefore, social robots could be able to take on leadership roles. The aim of our study was...

Multimodal Sensor-Input Architecture with Deep Learning for Audio-Visual Speech Recognition in Wild.

Sensors (Basel, Switzerland)
This paper investigates multimodal sensor architectures with deep learning for audio-visual speech recognition, focusing on in-the-wild scenarios. The term "in the wild" is used to describe AVSR for unconstrained natural-language audio streams and vi...

Human-Computer Interaction with a Real-Time Speech Emotion Recognition with Ensembling Techniques 1D Convolution Neural Network and Attention.

Sensors (Basel, Switzerland)
Emotions have a crucial function in the mental existence of humans. They are vital for identifying a person's behaviour and mental condition. Speech Emotion Recognition (SER) is extracting a speaker's emotional state from their speech signal. SER is ...

A Survey on Low-Latency DNN-Based Speech Enhancement.

Sensors (Basel, Switzerland)
This paper presents recent advances in low-latency, single-channel, deep neural network-based speech enhancement systems. The sources of latency and their acceptable values in different applications are described. This is followed by an analysis of t...

A Deep Learning Method Using Gender-Specific Features for Emotion Recognition.

Sensors (Basel, Switzerland)
Speech reflects people's mental state and using a microphone sensor is a potential method for human-computer interaction. Speech recognition using this sensor is conducive to the diagnosis of mental illnesses. The gender difference of speakers affect...

Qualitative and Artificial Intelligence-based Sentiment Analyses of Anti-LGBTI+ Hate Speech on Twitter in Turkey.

Issues in mental health nursing
The aim of this study was to evaluate hate speech in Turkish LGBTI+-related tweets during a one-month period of artificial intelligence-based sentiment analyses. Turkish tweets related to LGBTI+, were retrieved using Python library Tweepy and were ev...

Classification of Depression and Its Severity Based on Multiple Audio Features Using a Graphical Convolutional Neural Network.

International journal of environmental research and public health
Audio features are physical features that reflect single or complex coordinated movements in the vocal organs. Hence, in speech-based automatic depression classification, it is critical to consider the relationship among audio features. Here, we prop...

Intelligent speech technologies for transcription, disease diagnosis, and medical equipment interactive control in smart hospitals: A review.

Computers in biology and medicine
The growing and aging of the world population have driven the shortage of medical resources in recent years, especially during the COVID-19 pandemic. Fortunately, the rapid development of robotics and artificial intelligence technologies help to adap...

Detecting Lombard Speech Using Deep Learning Approach.

Sensors (Basel, Switzerland)
Robust Lombard speech-in-noise detecting is challenging. This study proposes a strategy to detect Lombard speech using a machine learning approach for applications such as public address systems that work in near real time. The paper starts with the ...