Robust Speech Recognition of Uncertain or Missing Data (eBook)

Theory and Applications
eBook Download: PDF
2011 | 2011
XVIII, 380 Seiten
Springer Berlin (Verlag)
978-3-642-21317-5 (ISBN)

Lese- und Medienproben

Robust Speech Recognition of Uncertain or Missing Data -
Systemvoraussetzungen
96,29 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen

Automatic speech recognition suffers from a lack of robustness with respect to noise, reverberation and interfering speech. The growing field of speech recognition in the presence of missing or uncertain input data seeks to ameliorate those problems by using not only a preprocessed speech signal but also an estimate of its reliability to selectively focus on those segments and features that are most reliable for recognition. This book presents the state of the art in recognition in the presence of uncertainty, offering examples that utilize uncertainty information for noise robustness, reverberation robustness, simultaneous recognition of multiple speech signals, and audiovisual speech recognition.

The book is appropriate for scientists and researchers in the field of speech recognition who will find an overview of the state of the art in robust speech recognition, professionals working in speech recognition who will find strategies for improving recognition results in various conditions of mismatch, and lecturers of advanced courses on speech processing or speech recognition who will find a reference and a comprehensive introduction to the field. The book assumes an understanding of the fundamentals of speech recognition using Hidden Markov Models.



Prof. Dr.-Ing. Dorothea Kolossa is a professor at the Institut für Kommunikationsakustik of the Ruhr-Universität Bochum, Germany; her research interests are automatic speech recognition, digital speech signal processing, and blind source separation.

Prof. Dr.-Ing. Reinhold Haeb-Umbach heads the Dept. of Communications Engineering of the University of Paderborn, Germany; his research interest are speech signal processing and automatic speech recognition, statistical learning and pattern recognition, and signal processing for digital communications.

 

Prof. Dr.-Ing. Dorothea Kolossa is a professor at the Institut für Kommunikationsakustik of the Ruhr-Universität Bochum, Germany; her research interests are automatic speech recognition, digital speech signal processing, and blind source separation.Prof. Dr.-Ing. Reinhold Haeb-Umbach heads the Dept. of Communications Engineering of the University of Paderborn, Germany; his research interest are speech signal processing and automatic speech recognition, statistical learning and pattern recognition, and signal processing for digital communications. 

Chap. 1 – Introduction.- Part I – Theoretical Foundations.- Chap. 2 – Uncertainty Decoding and Conditional Bayesian Estimation.- Chap. 3 – Uncertainty Propagation.- Part II – Applications.- Chap. 4 – Front-End, Back-End, and Hybrid Techniques for Noise-Robust Speech Recognition.- Chap. 5 – Model-Based Approaches to Handling Uncertainty.- Chap. 6 – Reconstructing Noise-Corrupted Spectrographic Components for Robust Speech Recognition.- Chap. 7 – Automatic Speech Recognition Using Missing Data Techniques: Handling of Real-World Data.- Chap. 8 – Conditional Bayesian Estimation Employing a Phase-Sensitive Estimation Model for Noise-Robust Speech Recognition.-  Part III – Reverberation Robustness.- Chap. 9 – Variance Compensation for Recognition of Reverberant Speech with Dereverberation Processing.- Chap. 10 – A Model-Based Approach to Joint Compensation of Noise and Reverberation for Speech Recognition.- Part IV – Applications: Multiple Speakers and Modalities.- Chap. 11 – Evidence Modelling for Missing Data Speech Recognition Using Small Microphone Arrays.- Chap. 12 – Recognition of Multiple Speech Sources Using ICA.- Chap. 13 – Use of Missing and Unreliable Data for Audiovisual Speech Recognition.-  Index.

Erscheint lt. Verlag 14.7.2011
Zusatzinfo XVIII, 380 p.
Verlagsort Berlin
Sprache englisch
Themenwelt Informatik Theorie / Studium Künstliche Intelligenz / Robotik
Technik Elektrotechnik / Energietechnik
Schlagworte Audiovisual speech recognition • Deconvolution • Missing feature theory • Noise robustness • packet loss • Source Separation • Speech processing • Speech Recognition • Uncertainty decoding
ISBN-10 3-642-21317-0 / 3642213170
ISBN-13 978-3-642-21317-5 / 9783642213175
Haben Sie eine Frage zum Produkt?
Wie bewerten Sie den Artikel?
Bitte geben Sie Ihre Bewertung ein:
Bitte geben Sie Daten ein:
PDFPDF (Wasserzeichen)
Größe: 5,9 MB

DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasser­zeichen und ist damit für Sie persona­lisiert. Bei einer missbräuch­lichen Weiter­gabe des eBooks an Dritte ist eine Rück­ver­folgung an die Quelle möglich.

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seiten­layout eignet sich die PDF besonders für Fach­bücher mit Spalten, Tabellen und Abbild­ungen. Eine PDF kann auf fast allen Geräten ange­zeigt werden, ist aber für kleine Displays (Smart­phone, eReader) nur einge­schränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich
der Praxis-Guide für Künstliche Intelligenz in Unternehmen - Chancen …

von Thomas R. Köhler; Julia Finkeissen

eBook Download (2024)
Campus Verlag
38,99