Speech - AI Medical Compendium

Convolutional fusion network for monaural speech enhancement.

Neural networks : the official journal of the International Neural Network Society - May 25, 2021

Convolutional neural network (CNN) based methods, such as the convolutional encoder-decoder network, offer state-of-the-art results in monaural speech enhancement. In the conventional encoder-decoder network, large kernel size is often used to enhanc...

Neural Networks, Computer; Speech

Streaming cascade-based speech translation leveraged by a direct segmentation model.

Neural networks : the official journal of the International Neural Network Society - May 17, 2021

The cascade approach to Speech Translation (ST) is based on a pipeline that concatenates an Automatic Speech Recognition (ASR) system followed by a Machine Translation (MT) system. Nowadays, state-of-the-art ST systems are populated with deep neural ...

Speech; Speech Recognition Software; Language; Neural Networks, Computer

Learning to recognize while learning to speak: Self-supervision and developing a speaking motor.

Neural networks : the official journal of the International Neural Network Society - May 14, 2021

Traditionally, learning speech synthesis and speech recognition were investigated as two separate tasks. This separation hinders incremental development for concurrent synthesis and recognition, where partially-learned synthesis and partially-learned...

Recognition, Psychology; Speech; Time Perception; Neural Networks, Computer; Machine Learning; Humans

Anti-transfer learning for task invariance in convolutional neural networks for speech processing.

Neural networks : the official journal of the International Neural Network Society - May 14, 2021

We introduce the novel concept of anti-transfer learning for speech processing with convolutional neural networks. While transfer learning assumes that the learning process for a target task will benefit from re-using representations learned for anot...

Learning; Speech; Machine Learning; Neural Networks, Computer

Combination of deep speaker embeddings for diarisation.

Neural networks : the official journal of the International Neural Network Society - Apr 21, 2021

Significant progress has recently been made in speaker diarisation after the introduction of d-vectors as speaker embeddings extracted from neural network (NN) speaker classifiers for clustering speech segments. To extract better-performing and more ...

Speech; Humans; Cluster Analysis; Neural Networks, Computer

A dual-stream deep attractor network with multi-domain learning for speech dereverberation and separation.

Neural networks : the official journal of the International Neural Network Society - Apr 21, 2021

Deep attractor networks (DANs) perform speech separation with discriminative embeddings and speaker attractors. Compared with methods based on the permutation invariant training (PIT), DANs define a deep embedding space and deliver a more elaborate r...

Speech; Fourier Analysis; Cluster Analysis; Deep Learning; Humans

Deep neural network-based generalized sidelobe canceller for dual-channel far-field speech recognition.

Neural networks : the official journal of the International Neural Network Society - Apr 19, 2021

The traditional generalized sidelobe canceller (GSC) is a common speech enhancement front end to improve the noise robustness of automatic speech recognition (ASR) systems in the far-field cases. However, the traditional GSC is optimized based on the...

Noise; Speech; Speech Recognition Software; Humans; Deep Learning

What Can Network Science Tell Us About Phonology and Language Processing?

Topics in cognitive science - Apr 9, 2021

Contemporary psycholinguistic models place significant emphasis on the cognitive processes involved in the acquisition, recognition, and production of language but neglect many issues related to the representation of language-related information in t...

Speech Perception; Phonetics; Models, Psychological; Psycholinguistics; Recognition, Psychology; Adult; Language; Cognition; Speech; Humans; Neural Networks, Computer

Improving Loanword Identification in Low-Resource Language with Data Augmentation and Multiple Feature Fusion.

Computational intelligence and neuroscience - Apr 8, 2021

Loanword identification has been studied in recent years to alleviate data sparseness in several natural language processing (NLP) tasks, such as machine translation, cross-lingual information retrieval, and so on. However, recent studies on this topic usu...

Information Storage and Retrieval; Russia; Speech; Natural Language Processing; Language

Deep learning architectures for estimating breathing signal and respiratory parameters from speech recordings.

Neural networks : the official journal of the International Neural Network Society - Apr 5, 2021

Respiration is an essential and primary mechanism for speech production. We first inhale and then produce speech while exhaling. When we run out of breath, we stop speaking and inhale. Though this process is involuntary, speech production involves a ...

Linguistics; Humans; Deep Learning; Male; Respiration; Speech; Young Adult; Female; Adult
