
Automatic Speech Recognition [electronic resource] : A Deep Learning Approach / by Dong Yu, Li Deng.

Material type: Text
Series: Signals and Communication Technology
Publisher: London : Springer London : Imprint: Springer, 2015
Description: XXVI, 321 p. 62 illus. online resource
Content type:
  • text
Media type:
  • computer
Carrier type:
  • online resource
ISBN:
  • 9781447157793
Additional physical formats: Printed edition: No title
DDC classification:
  • 621.382 23
Contents:
Section 1: Automatic speech recognition: Background -- Feature extraction: basic frontend -- Acoustic model: Gaussian mixture hidden Markov model -- Language model: stochastic N-gram -- Historical reviews of speech recognition research: 1st, 2nd, 3rd, 3.5th, and 4th generations -- Section 2: Advanced feature extraction and transformation -- Unsupervised feature extraction -- Discriminative feature transformation -- Section 3: Advanced acoustic modeling -- Conditional random field (CRF) and hidden conditional random field (HCRF) -- Deep-Structured CRF -- Semi-Markov conditional random field -- Deep stacking models -- Deep neural network - hidden Markov hybrid model -- Section 4: Advanced language modeling -- Discriminative Language model -- Log-linear language model -- Neural network language model.
In: Springer eBooks
Summary: This book provides a comprehensive overview of recent advances in automatic speech recognition, with a focus on deep learning models, including deep neural networks and many of their variants. This is the first automatic speech recognition book dedicated to the deep learning approach. In addition to a rigorous mathematical treatment of the subject, the book presents insights into, and the theoretical foundations of, a series of highly successful deep learning models.
No physical items for this record
