A Speech Enhancement Method Based on Kalman Filtering

Full Text (PDF, 178KB), PP.55-61

Views: 0 Downloads: 0

Author(s)

Chaogang Wu 1,* Bo Li 1 Jin Zheng 1

1. Beihang University, Beijing 100191, China

* Corresponding author.

DOI: https://doi.org/10.5815/ijwmt.2011.02.08

Received: 4 Jan. 2011 / Revised: 11 Feb. 2011 / Accepted: 14 Mar. 2011 / Published: 15 Apr. 2011

Index Terms

Speech enhancement, LF glottal flow, source separation

Abstract

The enhancement of speech degraded by non-stationary interferers is a highly relevant and difficult task for many signal processing applications. In this study, we present a monaural speech enhancement method based on spectral subtraction and Kalman filtering (KF) by extracting the Liljencrants–Fant (LF) excitation during voiced speech, in which the nature of glottal flow can be maintained. Therefore, the approach could preserve the glottal pulse's nature characteristic in Kalman filtering and thus achieve significant improvements on objective quality. The quality of the enhanced speech has been evaluated by perceptual evaluation of speech quality (PESQ) score. The results indicate that the proposed algorithm could improve the output speech quality compared with the conventional KF algorithm and sub-band spectral subtraction.

Cite This Paper

Chaogang Wu,Bo Li,Jin Zheng,"A Speech Enhancement Method Based on Kalman Filtering", IJWMT, vol.1, no.2, pp.55-61, 2011. DOI: 10.5815/ijwmt.2011.02.08

Reference

[1] D. Vincent, O. Rosec, and T. Chonavel, "A New Method for Speech Synthesis and Transformation Based On an ARX-LF Source-Filter Decomposition and HNM Modeling," ICASSP. pp. 525–528, 2007.

[2] H. Zhao, and X. Zou, "A Speech Enhancement Preprocessor for Low Bit Rate Speech Coding," Pacific-Asia Conference on Circuits, Communications and System. pp. 443–445, 2009.

[3] Christian D. Sigg, Tomas Dikk, and Joachim M. Buhmann, "Speech Enhancement with Sparse Coding in Learning," Dictionaries in Proc. ICASSP. pp. 4758–4761, 2010.

[4] B. Yegnanarayana, S. R. Mahadeva Prasanna and K. Sreenivasa Rao, "Speech Enhancement Using Excitation Source Information," ICASSP. vol. 1, pp. 541–544, 2002.

[5] D. Vincent, O. Rosec, and T. Chonavel, "Estimation of LF glottal source parameters based on ARX model," Interspeech. pp. 333–336, 2005.