Personalized Dereverberation of Speech
Classic non-blind speech dereverberation methods produce high- quality results only when the precise impulse response is known. Alternatively, learning-based blind methods cannot ensure ad- equate dereverberation in all environments. We propose an environment- and speaker-specific approach combining the ad- vantages of both approaches. With a simple, one-time personal- ization step, our model generalizes a single measured impulse response to its spatial neighborhood. Specifically, the two-stage model performs feature-based Wiener deconvolution followed by a network-based refinement. Extensive experimental results indicate that our approach quantitatively and qualitatively out- performs the state-of-the-art methods. Additional user studies confirm that our method is overwhelmingly favored by listeners.
Project information
- Category Speech Enhancement, Deep Learning
- Project Page Personalized Dereverberation of Speech
- Publication Page ISCA Archive
- Paper PDF