AIMC Topic: Speech Recognition Software

Clear Filters Showing 81 to 90 of 94 articles

A deep learning algorithm to increase intelligibility for hearing-impaired listeners in the presence of a competing talker and reverberation.

The Journal of the Acoustical Society of America
For deep learning based speech segregation to have translational significance as a noise-reduction tool, it must perform in a wide variety of acoustic environments. In the current study, performance was examined when target speech was subjected to in...

Talker change detection: A comparison of human and machine performance.

The Journal of the Acoustical Society of America
The automatic analysis of conversational audio remains difficult, in part, due to the presence of multiple talkers speaking in turns, often with significant intonation variations and overlapping speech. The majority of prior work on psychoacoustic sp...

Vision-referential speech enhancement of an audio signal using mask information captured as visual data.

The Journal of the Acoustical Society of America
This paper describes a vision-referential speech enhancement of an audio signal using mask information captured as visual data. Smartphones and tablet devices have become popular in recent years. Most of them not only have a microphone but also a cam...

[Artificial intelligence for future MD].

Giornale italiano di nefrologia : organo ufficiale della Societa italiana di nefrologia
Health care workers need artificial intelligence. Artificial intelligence is a set of studies and techniques that tend to the realization of machines, which solve complex problems automatically, simulating or emulating human intelligence activities. ...

Automatic speech recognition: A primer for speech-language pathology researchers.

International journal of speech-language pathology
Automatic speech recognition (ASR) is increasingly becoming an integral component of our daily lives. This trend is in large part due to recent advances in machine learning, and specifically in deep learning, that have led to accurate ASR across nume...

Conversational agents in healthcare: a systematic review.

Journal of the American Medical Informatics Association : JAMIA
OBJECTIVE: Our objective was to review the characteristics, current applications, and evaluation measures of conversational agents with unconstrained natural language input capabilities used for health-related purposes.

Evaluating automatic speech recognition systems as quantitative models of cross-lingual phonetic category perception.

The Journal of the Acoustical Society of America
Theories of cross-linguistic phonetic category perception posit that listeners perceive foreign sounds by mapping them onto their native phonetic categories, but, until now, no way to effectively implement this mapping has been proposed. In this pape...

Joint Dictionary Learning-Based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech Intelligibility After Oral Surgery.

IEEE transactions on bio-medical engineering
This paper focuses on machine learning based voice conversion (VC) techniques for improving the speech intelligibility of surgical patients who have had parts of their articulators removed. Because of the removal of parts of the articulator, a patie...

Unsupervised modulation filter learning for noise-robust speech recognition.

The Journal of the Acoustical Society of America
The modulation filtering approach to robust automatic speech recognition (ASR) is based on enhancing perceptually relevant regions of the modulation spectrum while suppressing the regions susceptible to noise. In this paper, a data-driven unsupervise...

Natural Language Understanding Performance & Use Considerations in Virtual Medical Encounters.

Studies in health technology and informatics
A virtual standardized patient (VSP) prototype was tested for natural language understanding (NLU) performance. The conversational VSP was evaluated in a controlled 61 subject study over four repetitions of a patient case. The prototype achieved more...