AIMC Topic: Speech

Clear Filters Showing 171 to 180 of 395 articles

A novel multimodal fusion framework for early diagnosis and accurate classification of COVID-19 patients using X-ray images and speech signal processing techniques.

Computer methods and programs in biomedicine
BACKGROUND AND OBJECTIVE: COVID-19 outbreak has become one of the most challenging problems for human being. It is a communicable disease caused by a new coronavirus strain, which infected over 375 million people already and caused almost 6 million d...

Auditory Speech Based Alerting System for Detecting Dummy Number Plate via Video Processing Data sets.

Computational intelligence and neuroscience
Spectrum of applications in computer vision use object detection algorithms driven by the power of AI and ML algorithms. State of art detection models like faster Region based convolutional Neural Network (RCNN), Single Shot Multibox Detector (SSD), ...

Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations.

Sensors (Basel, Switzerland)
The study of understanding sentiment and emotion in speech is a challenging task in human multimodal language. However, in certain cases, such as telephone calls, only audio data can be obtained. In this study, we independently evaluated sentiment an...

Frequency, Time, Representation and Modeling Aspects for Major Speech and Audio Processing Applications.

Sensors (Basel, Switzerland)
There are many speech and audio processing applications and their number is growing. They may cover a wide range of tasks, each having different requirements on the processed speech or audio signals and, therefore, indirectly, on the audio sensors as...

A novel speech emotion recognition method based on feature construction and ensemble learning.

PloS one
In the field of Human-Computer Interaction (HCI), speech emotion recognition technology plays an important role. Facing a small number of speech emotion data, a novel speech emotion recognition method based on feature construction and ensemble learni...

Deep Learning Scoring Model in the Evaluation of Oral English Teaching.

Computational intelligence and neuroscience
This study is aimed at improving the accuracy of oral English recognition and proposing evaluation measures with better performance. This work is based on related theories such as deep learning, speech recognition, and oral English practice. As the l...

Improvement of Speech Recognition Technology in Piano Music Scene Based on Deep Learning of Internet of Things.

Computational intelligence and neuroscience
The main goal of speech recognition technology is to use computers to convert human analog speech signals into computer-generated signals, such as behavior patterns or binary codes. Different from speaker identification and speaker confirmation, the ...

Deep Learning Models for Fast Retrieval and Extraction of French Speech Vocabulary Applications.

Computational intelligence and neuroscience
Due to the large French vocabulary, how quickly retrieve and accurately identify the required vocabulary is still a big challenge in French learning. In view of the above problems, we introduce a deep learning algorithm in this study to upgrade and o...

Research on Chinese Speech Emotion Recognition Based on Deep Neural Network and Acoustic Features.

Sensors (Basel, Switzerland)
In recent years, the use of Artificial Intelligence for emotion recognition has attracted much attention. The industrial applicability of emotion recognition is quite comprehensive and has good development potential. This research uses voice emotion ...

Arabic Speech Analysis for Classification and Prediction of Mental Illness due to Depression Using Deep Learning.

Computational intelligence and neuroscience
Depression is a global prevalent ailment for possible mental illness or mental disorder globally. Recognizing depressed early signs is critical for evaluating and preventing mental illness. With the progress of machine learning, it is possible to mak...