Time Domain Representation of Speech Sounds - Asoke Kumar Datta

Time Domain Representation of Speech Sounds (eBook)

A Case Study in Bangla
eBook Download: PDF
2018 | 1st ed. 2018
XVI, 154 pages
Springer Singapore (publisher)
978-981-13-2303-4 (ISBN)
96.29 incl. VAT

The book presents the history of time-domain representation and the extent of its development alongside that of spectral-domain representation in the cognitive and technology domains. It discusses all the cognitive experiments related to this development, along with details of technological developments in both automatic speech recognition (ASR) and text-to-speech synthesis (TTS), and introduces a viable time-domain representation for both objective and subjective analysis as an alternative to the well-known spectral representation.

The book also includes a new cohort study on the use of lexical knowledge in ASR.

India has numerous official dialects, and spoken-language technology development is a burgeoning area. In fact, TTS and ASR taken together constitute the most important technology for empowering people. As such, the book describes time-domain representation in such a way that it can be easily and seamlessly incorporated into ASR and TTS research and development. In short, it is a valuable guidebook for the development of ASR and TTS in all the Indian Standard Dialects using signal-domain parameters.




Prof. Asoke Kumar Datta, MSc (Pure Math), worked at the Indian Statistical Institute (ISI) from 1955 to 1994. He retired as Head of the Electronics and Communication Sciences Department and is an ISI Visiting Professor. He is President, BOM-BOM, Kolkata; Senior Guest Researcher, Sir C V Raman Centre for Physics and Music, JU; Executive Member, Society for Natural Language Technology Research, Kolkata; and Life Member, Acoustical Society of India. He received the J C Bose Memorial Award, 1969; the Sir C V Raman Award, 1982-83 and 1998-99; the S K Mitra Memorial Award, 1984; and the Sri C Achyut Menon Prize, 2001. His areas of academic interest include pattern recognition, AI, speech, music and consciousness.

Chapter 1. Introduction
Chapter 2. Spectral Domain
Chapter 3. Cognition of Phones
Chapter 4. Signal Processing
Chapter 5. Time Domain Representation of Phones
Chapter 6. Role of Lexical Knowledge in ASR
Chapter 7. Random Perturbations
Chapter 8. Non linearity in Speech signal

Publication date (per publisher) 3 November 2018
Additional info XVI, 154 p. 117 illus., 27 illus. in color.
Place of publication Singapore
Language English
Subject areas Mathematics / Computer Science > Computer Science > Operating Systems / Servers
Computer Science > Software Development > User Interfaces (HCI)
Computer Science > Theory / Studies > Artificial Intelligence / Robotics
Engineering > Electrical Engineering / Energy Technology
Keywords Automatic speech recognition (ASR) • Cognitive Development • lexical knowledge • perturbations • Signal Processing • Speech processing • Text to Speech Synthesis (TTS)
ISBN-10 981-13-2303-8 / 9811323038
ISBN-13 978-981-13-2303-4 / 9789811323034
PDF (watermark)
Size: 7.9 MB

DRM: Digital watermark
This eBook contains a digital watermark and is therefore personalized for you. If the eBook is improperly passed on to third parties, it can be traced back to the source.

File format: PDF (Portable Document Format)
With its fixed page layout, PDF is particularly suitable for technical books with columns, tables, and figures. A PDF can be displayed on almost all devices but is of only limited suitability for small displays (smartphone, eReader).

System requirements:
PC/Mac: You can read this eBook on a PC or Mac. You need a PDF viewer, e.g. Adobe Reader or Adobe Digital Editions.
eReader: This eBook can be read on (almost) all eBook readers. It is not compatible with the Amazon Kindle, however.
Smartphone/Tablet: Whether Apple or Android, you can read this eBook. You need a PDF viewer, e.g. the free Adobe Digital Editions app.

Buying eBooks from abroad
For tax law reasons, we can sell eBooks only within Germany and Switzerland. Regrettably, we cannot fulfill eBook orders from other countries.
