Automatic Speech and Speaker Recognition

Advanced Topics
Book | Hardcover
518 pages
1996
Springer (publisher)
978-0-7923-9706-9 (ISBN)
213.99 incl. VAT
Research in the field of automatic speech and speaker recognition has made a number of significant advances in the last two decades, influenced by advances in signal processing, algorithms, architectures, and hardware. These advances include: the adoption of a statistical pattern recognition paradigm; the use of the hidden Markov modeling framework to characterize both the spectral and the temporal variations in the speech signal; the use of a large set of speech utterance examples from a large population of speakers to train the hidden Markov models of some fundamental speech units; the organization of speech and language knowledge sources into a structural finite state network; and the use of dynamic-programming-based heuristic search methods to find the best word sequence in the lexical network corresponding to the spoken utterance.
Automatic Speech and Speaker Recognition: Advanced Topics groups together in a single volume a number of important topics on speech and speaker recognition, topics which are of fundamental importance but not yet covered in detail in existing textbooks. Although no explicit partition is given, the book is divided into five parts: Chapters 1-2 are devoted to technology overviews; Chapters 3-12 discuss acoustic modeling of fundamental speech units and lexical modeling of words and pronunciations; Chapters 13-15 address the issues related to flexibility and robustness; Chapters 16-18 concern the theoretical and practical issues of search; Chapters 19-20 give two examples of algorithmic and implementation aspects of recognition system realization.
Audience: A reference book for speech researchers and graduate students interested in pursuing potential research on the topic. May also be used as a text for advanced courses on the subject.

1 An Overview of Automatic Speech Recognition
2 An Overview of Speaker Recognition Technology
3 Maximum Mutual Information Estimation of Hidden Markov Models
4 Bayesian Adaptive Learning and MAP Estimation of HMM
5 Statistical and Discriminative Methods for Speech Recognition
6 Context Dependent Vector Quantization for Speech Recognition
7 Hidden Markov Network for Precise Acoustic Modeling
8 From HMMs to Segment Models: Stochastic Modeling for CSR
9 Voice Identification Using Nonparametric Density Matching
10 The Use of Recurrent Networks in Continuous Speech Recognition
11 Hybrid Connectionist Models for Continuous Speech Recognition
12 Automatic Generation of Detailed Pronunciation Lexicons
13 Word Spotting: Extracting Partial Information from Continuous Utterances
14 Spectral Dynamics for Speech Recognition under Adverse Conditions
15 Signal Processing for Robust Speech Recognition
16 Dynamic Programming Search: from Digit Strings to Large Vocabulary Word Graphs
17 Fast Matching Techniques
18 Multiple-Pass Search Strategies
19 Issues in Practical Large Vocabulary Isolated Word Recognition: The IBM Tangora System
20 From Sphinx-II to Whisper: Making Speech Recognition Usable

Publication date (per publisher): 31 March 1996
Series: The Springer International Series in Engineering and Computer Science; 355
Additional info: XVI, 518 p.
Place of publication: Dordrecht
Language: English
Dimensions: 155 x 235 mm
Subject areas: Mathematics / Computer Science > Computer Science > Theory / Studies
Natural Sciences > Physics / Astronomy > Mechanics
Engineering > Electrical Engineering / Energy Technology
ISBN-10: 0-7923-9706-1 / 0792397061
ISBN-13: 978-0-7923-9706-9 / 9780792397069
Condition: New