AI Medical Compendium Topic

Explore the latest research on artificial intelligence and machine learning in medicine.

Speech Perception

Showing 11 to 20 of 111 articles

Clear Filters

ADT Network: A Novel Nonlinear Method for Decoding Speech Envelopes From EEG Signals.

Trends in hearing
Decoding speech envelopes from electroencephalogram (EEG) signals holds potential as a research tool for objectively assessing auditory processing, which could contribute to future developments in hearing loss diagnosis. However, current methods stru...

Deep learning-based auditory attention decoding in listeners with hearing impairment.

Journal of neural engineering
This study develops a deep learning (DL) method for fast auditory attention decoding (AAD) using electroencephalography (EEG) from listeners with hearing impairment (HI). It addresses three classification tasks: differentiating noise from speech-in-n...

Recovering speech intelligibility with deep learning and multiple microphones in noisy-reverberant situations for people using cochlear implants.

The Journal of the Acoustical Society of America
For cochlear implant (CI) listeners, holding a conversation in noisy and reverberant environments is often challenging. Deep-learning algorithms can potentially mitigate these difficulties by enhancing speech in everyday listening environments. This ...

Deep learning restores speech intelligibility in multi-talker interference for cochlear implant users.

Scientific reports
Cochlear implants (CIs) do not offer the same level of effectiveness in noisy environments as in quiet settings. Current single-microphone noise reduction algorithms in hearing aids and CIs only remove predictable, stationary noise, and are ineffecti...

Using deep learning to improve the intelligibility of a target speaker in noisy multi-talker environments for people with normal hearing and hearing loss.

The Journal of the Acoustical Society of America
Understanding speech in noisy environments is a challenging task, especially in communication situations with several competing speakers. Despite their ongoing improvement, assistive listening devices and speech processing approaches still do not per...

Machine Learning Recognizes Frequency-Following Responses in American Adults: Effects of Reference Spectrogram and Stimulus Token.

Perceptual and motor skills
Electrophysiological research has been widely utilized to study brain responses to acoustic stimuli. The frequency-following response (FFR), a non-invasive reflection of how the brain encodes acoustic stimuli, is a particularly propitious electrophys...

Exploring the effectiveness of reward-based learning strategies for second-language speech sounds.

Psychonomic bulletin & review
Adults struggle to learn non-native speech categories in many experimental settings (Goto, Neuropsychologia, 9(3), 317-323 1971), but learn efficiently in a video game paradigm where non-native speech sounds have functional significance (Lim & Holt, ...

Speech intelligibility prediction based on a physiological model of the human ear and a hierarchical spiking neural network.

The Journal of the Acoustical Society of America
A speech intelligibility (SI) prediction model is proposed that includes an auditory preprocessing component based on the physiological anatomy and activity of the human ear, a hierarchical spiking neural network, and a decision back-end processing b...

Deep-learning models reveal how context and listener attention shape electrophysiological correlates of speech-to-language transformation.

PLoS computational biology
To transform continuous speech into words, the human brain must resolve variability across utterances in intonation, speech rate, volume, accents and so on. A promising approach to explaining this process has been to model electroencephalogram (EEG) ...

Speech recognition using an english multimodal corpus with integrated image and depth information.

Scientific reports
Traditional English corpora mainly collect information from a single modality, but lack information from multimodal information, resulting in low quality of corpus information and certain problems with recognition accuracy. To solve the above problem...