AIMC Topic: Speech Perception

Showing 1 to 10 of 114 articles

Single-microphone deep envelope separation based auditory attention decoding for competing speech and music.

Journal of neural engineering
In this study, we introduce an end-to-end single microphone deep learning system for source separation and auditory attention decoding (AAD) in a competing speech and music setup. Deep source separation is applied directly on the envelope of the obse...
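As a loose illustration only (not the paper's actual pipeline), the kind of slow amplitude envelope that envelope-domain separation and AAD systems operate on is commonly computed as the magnitude of the Hilbert analytic signal followed by a low-pass filter. The 8 Hz cutoff, filter order, and SciPy-based implementation below are assumptions for the sketch:

```python
import numpy as np
from scipy.signal import hilbert, butter, filtfilt

def speech_envelope(signal, fs, cutoff_hz=8.0):
    """Slow amplitude envelope: |Hilbert analytic signal|, then low-pass.

    cutoff_hz=8.0 is an illustrative choice (speech envelopes are
    dominated by modulations below ~10 Hz), not a value from the paper.
    """
    env = np.abs(hilbert(signal))                     # instantaneous amplitude
    b, a = butter(4, cutoff_hz / (fs / 2), btype="low")
    return filtfilt(b, a, env)                        # zero-phase smoothing

# Usage: a 1 s tone at 16 kHz, amplitude-modulated at 4 Hz
fs = 16000
t = np.arange(fs) / fs
carrier = np.sin(2 * np.pi * 440 * t)
modulator = 0.5 * (1 + np.sin(2 * np.pi * 4 * t))
env = speech_envelope(modulator * carrier, fs)        # tracks the modulator
```

The recovered envelope closely follows the 4 Hz modulator, which is the low-rate signal that competing-source decoders compare against neural recordings.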

Automatic development of speech-in-noise hearing tests using machine learning.

Scientific reports
Understanding speech in noisy environments is a primary challenge for individuals with hearing loss, affecting daily communication and quality of life. Traditional speech-in-noise tests are essential for screening and diagnosing hearing loss but are ...

Natural language processing models reveal neural dynamics of human conversation.

Nature communications
Through conversation, humans engage in a complex process of alternating speech production and comprehension to communicate. The neural mechanisms that underlie these complementary processes through which information is precisely conveyed by language,...

Enhancing visual speech perception through deep automatic lipreading: A systematic review.

Computers in biology and medicine
Communication involves exchanging information between individuals or groups through various media sources. However, limitations such as hearing loss can make it difficult for some individuals to understand the information delivered during speech comm...

A unified acoustic-to-speech-to-language embedding space captures the neural basis of natural language processing in everyday conversations.

Nature human behaviour
This study introduces a unified computational framework connecting acoustic, speech and word-level linguistic structures to study the neural basis of everyday conversations in the human brain. We used electrocorticography to record neural signals acr...

Incremental accumulation of linguistic context in artificial and biological neural networks.

Nature communications
Large Language Models (LLMs) have shown success in predicting neural signals associated with narrative processing, but their approach to integrating context over large timescales differs fundamentally from that of the human brain. In this study, we s...

Endpoint-aware audio-visual speech enhancement utilizing dynamic weight modulation based on SNR estimation.

Neural networks: the official journal of the International Neural Network Society
Integrating visual features has been proven effective for deep learning-based speech quality enhancement, particularly in highly noisy environments. However, these models may suffer from redundant information, resulting in performance deterioration w...

Unraveling the Differential Efficiency of Dorsal and Ventral Pathways in Visual Semantic Decoding.

International journal of neural systems
Visual semantic decoding aims to extract perceived semantic information from the visual responses of the human brain and convert it into interpretable semantic labels. Although significant progress has been made in semantic decoding across individual...

Deep-learning models reveal how context and listener attention shape electrophysiological correlates of speech-to-language transformation.

PLoS computational biology
To transform continuous speech into words, the human brain must resolve variability across utterances in intonation, speech rate, volume, accents and so on. A promising approach to explaining this process has been to model electroencephalogram (EEG) ...

Speech recognition using an English multimodal corpus with integrated image and depth information.

Scientific reports
Traditional English corpora mainly collect information from a single modality and lack multimodal information, resulting in low corpus quality and certain problems with recognition accuracy. To solve the above problem...