Speech Enhancement in the STFT Domain

Buch | Softcover
VII, 109 Seiten
2011 | 2012
Springer Berlin (Verlag)
978-3-642-23249-7 (ISBN)
58,84 inkl. MwSt
This work addresses this problem in the short-time Fourier transform (STFT) domain. We divide the general problem into five basic categories depending on the number of microphones being used and whether the interframe or interband correlation is considered. The first category deals with the single-channel problem where STFT coefficients at different frames and frequency bands are assumed to be independent. In this case, the noise reduction filter in each frequency band is basically a real gain. Since a gain does not improve the signal-to-noise ratio (SNR) for any given subband and frame, the noise reduction is basically achieved by liftering the subbands and frames that are less noisy while weighing down on those that are more noisy. The second category also concerns the single-channel problem. The difference is that now the interframe correlation is taken into account and a filter is applied in each subband instead of just a gain.
The advantage of using the interframe correlation is that we can improve not only the long-time fullband SNR, but the frame-wise subband SNR as well. The third and fourth classes discuss the problem of multichannel noise reduction in the STFT domain with and without interframe correlation, respectively. In the last category, we consider the interband correlation in the design of the noise reduction filters. We illustrate the basic principle for the single-channel case as an example, while this concept can be generalized to other scenarios. In all categories, we propose different optimization cost functions from which we derive the optimal filters and we also define the performance measures that help analyzing them.

Introduction.- Single-Channel Speech Enhancement with a Gain.- Single-Channel Speech Enhancement with a Filter.- Multichannel Speech Enhancement with Gains.- Multichannel Speech Enhancement with Filters.- The Bifrequency Spectrum in Speech Enhancement.- Summary and Perspectives.

From the reviews:

"This work addresses the problem in the short-time Fourier transform (STFT) domain. The general problem is divided into five basic categories depending on the number of microphones being used and whether the interframe or interband correlation is considered. ... This book is mainly a research book for people doing research in electrical and computer engineering." (Yuehua Wu, Zentralblatt MATH, Vol. 1242, 2012)

Erscheint lt. Verlag 16.9.2011
Reihe/Serie SpringerBriefs in Electrical and Computer Engineering
Zusatzinfo VII, 109 p. 5 illus.
Verlagsort Berlin
Sprache englisch
Maße 155 x 235 mm
Gewicht 193 g
Themenwelt Technik Elektrotechnik / Energietechnik
Schlagworte linearly constrained minimum variance (LCMV) filte • linearly constrained minimum variance (LCMV) filter • maximum signal-to-noise ratio (SNR) filter • Microphone Arrays • minimum variance distortionless response (MVDR) fi • minimum variance distortionless response (MVDR) filter • prediction filter • short-time Fourier transform (STFT) domain • single-channel and multichannel • Speech Enhancement • tradeoff filter • Wiener Filter
ISBN-10 3-642-23249-3 / 3642232493
ISBN-13 978-3-642-23249-7 / 9783642232497
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Wie bewerten Sie den Artikel?
Bitte geben Sie Ihre Bewertung ein:
Bitte geben Sie Daten ein:
Mehr entdecken
aus dem Bereich
DIN-Normen und Technische Regeln für die Elektroinstallation

von DIN; ZVEH; Burkhard Schulze

Buch | Softcover (2023)
Beuth (Verlag)
86,00