Robust Automatic Speech Recognition: A Bridge To Practical Applications by Jinyu LiRobust Automatic Speech Recognition: A Bridge To Practical Applications by Jinyu Li

Robust Automatic Speech Recognition: A Bridge To Practical Applications

byJinyu Li, Li Deng, Reinhold Haeb-Umbach

Hardcover | October 12, 2015

Pricing and Purchase Info


Earn 933 plum® points

Prices and offers may vary in store


In stock online

Ships free on orders over $25

Not available in stores


Robust Automatic Speech Recognition: A Bridge to Practical Applicationsestablishes a solid foundation for automatic speech recognition that is robust against acoustic environmental distortion. It provides a thorough overview of classical and modern noise-and reverberation robust techniques that have been developed over the past thirty years, with an emphasis on practical methods that have been proven to be successful and which are likely to be further developed for future applications. The strengths and weaknesses of robustness-enhancing speech recognition techniques are carefully analyzed. The book covers noise-robust techniques designed for acoustic models which are based on both Gaussian mixture models and deep neural networks. In addition, a guide to selecting the best methods for practical applications is provided. The reader will:

  • Gain a unified, deep and systematic understanding of the state-of-the-art technologies for robust speech recognition
  • Learn the links and relationship between alternative technologies for robust speech recognition
  • Be able to use the technology analysis and categorization detailed in the book to guide future technology development
  • Be able to develop new noise-robust methods in the current era of deep learning for acoustic modeling in speech recognition

  • The first book that provides a comprehensive review on noise and reverberation robust speech recognition methods in the era of deep neural networks
  • Connects robust speech recognition techniques to machine learning paradigms with rigorous mathematical treatment
  • Provides elegant and structural ways to categorize and analyze noise-robust speech recognition techniques
  • Written by leading researchers who have been actively working on the subject matter in both industrial and academic organizations for many years
Jinyu Li received a Ph.D. degree from Georgia Institute of Technology, U.S. From 2000 to 2003, he was a Researcher at Intel China Research Center and a Research Manager at iFlytek, China. Currently, he is a Principal Applied Scientist at Microsoft, working as a technical lead to design and improve speech modeling algorithms and technol...
Title:Robust Automatic Speech Recognition: A Bridge To Practical ApplicationsFormat:HardcoverDimensions:306 pages, 9.41 × 7.24 × 0.98 inPublished:October 12, 2015Publisher:Academic PressLanguage:English

The following ISBNs are associated with this title:

ISBN - 10:0128023988

ISBN - 13:9780128023983


Table of Contents

  1. Introduction
  2. Fundamental of speech recognition
  3. Background of robust speech recognition

    1. Processing in the Feature and Model Domains
    2. Compensation with prior knowledge
    3. Explicit distortion modeling
    4. Uncertainty processing
    5. Joint model training
    6. Reverberant speech recognition
    7. Multi-channel processing
    8. Summary and Future Directions