Novel Techniques for Dialectal Arabic Speech Recognition - Mohamed Elmahdy, Rainer Gruhn, Wolfgang Minker

Novel Techniques for Dialectal Arabic Speech Recognition

Buch | Softcover
110 Seiten
2014
Springer-Verlag New York Inc.
978-1-4899-9945-0 (ISBN)
106,99 inkl. MwSt
Novel Techniques for Dialectal Arabic Speech describes approaches to improve automatic speech recognition for dialectal Arabic. Since speech resources for dialectal Arabic speech recognition are very sparse, the authors describe how existing Modern Standard Arabic (MSA) speech data can be applied to dialectal Arabic speech recognition, while assuming that MSA is always a second language for all Arabic speakers.

In this book, Egyptian Colloquial Arabic (ECA) has been chosen as a typical Arabic dialect. ECA is the first ranked Arabic dialect in terms of number of speakers, and a high quality ECA speech corpus with accurate phonetic transcription has been collected. MSA acoustic models were trained using news broadcast speech. In order to cross-lingually use MSA in dialectal Arabic speech recognition, the authors have normalized the phoneme sets for MSA and ECA. After this normalization, they have applied state-of-the-art acoustic model adaptation techniques like Maximum Likelihood Linear Regression (MLLR) and Maximum A-Posteriori (MAP) to adapt existing phonemic MSA acoustic models with a small amount of dialectal ECA speech data. Speech recognition results indicate a significant increase in recognition accuracy compared to a baseline model trained with only ECA data.

Fundamentals.- Speech Corpora.- Phonemic Acoustic Modeling.- Graphemic Acoustic Modeling.- Phonetic Transcription Using the Arabic Chat Alphabet.

Erscheint lt. Verlag 13.4.2014
Zusatzinfo XXII, 110 p.
Verlagsort New York
Sprache englisch
Maße 155 x 235 mm
Themenwelt Geisteswissenschaften Sprach- / Literaturwissenschaft Sprachwissenschaft
Informatik Theorie / Studium Künstliche Intelligenz / Robotik
Technik Elektrotechnik / Energietechnik
Technik Nachrichtentechnik
Schlagworte Arabic dialect • Arabic speech recognition • ECA • ECA speech data • map • MLLR • MSA
ISBN-10 1-4899-9945-0 / 1489999450
ISBN-13 978-1-4899-9945-0 / 9781489999450
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Wie bewerten Sie den Artikel?
Bitte geben Sie Ihre Bewertung ein:
Bitte geben Sie Daten ein:
Mehr entdecken
aus dem Bereich
Künstliche Intelligenz, Macht und das größte Dilemma des 21. …

von Mustafa Suleyman; Michael Bhaskar

Buch | Hardcover (2024)
C.H.Beck (Verlag)
28,00