IJWMT Vol. 1, No. 2, 15 Apr. 2011
Pages: 55-61
Keywords: Speech enhancement, LF glottal flow, source separation
Enhancing speech degraded by non-stationary interferers is an important and difficult task in many signal processing applications. In this study, we present a monaural speech enhancement method that combines spectral subtraction with Kalman filtering (KF). During voiced speech, the method extracts the Liljencrants–Fant (LF) excitation, so that the nature of the glottal flow is maintained. The approach can therefore preserve the natural characteristics of the glottal pulse within the Kalman filter and thus achieve significant improvements in objective quality. The quality of the enhanced speech was evaluated using the perceptual evaluation of speech quality (PESQ) score. The results indicate that the proposed algorithm can improve output speech quality compared with the conventional KF algorithm and sub-band spectral subtraction.
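As a rough illustration of the Kalman filtering step named in the abstract, the sketch below runs a scalar Kalman filter on a first-order autoregressive signal observed in white noise. It is not the authors' method: the LF excitation model, the spectral subtraction stage, and the higher-order speech model are all omitted, and the function names (`simulate_ar1`, `kalman_ar1`, `mse`) and parameter values are hypothetical choices made only for this example.

```python
import random


def simulate_ar1(n, a, q, r, seed=0):
    """Generate a clean AR(1) signal and a noisy observation of it.

    Assumed toy model (not taken from the paper):
        x[k] = a * x[k-1] + w[k],  w ~ N(0, q)
        y[k] = x[k] + v[k],        v ~ N(0, r)
    """
    rng = random.Random(seed)
    clean, noisy = [], []
    x = 0.0
    for _ in range(n):
        x = a * x + rng.gauss(0.0, q ** 0.5)
        clean.append(x)
        noisy.append(x + rng.gauss(0.0, r ** 0.5))
    return clean, noisy


def kalman_ar1(observations, a, q, r, x0=0.0, p0=1.0):
    """Scalar Kalman filter for the AR(1)-plus-noise model above."""
    x, p = x0, p0
    estimates = []
    for y in observations:
        # Predict: propagate the state estimate and its error variance.
        x_pred = a * x
        p_pred = a * a * p + q
        # Update: blend the prediction with the new observation.
        gain = p_pred / (p_pred + r)
        x = x_pred + gain * (y - x_pred)
        p = (1.0 - gain) * p_pred
        estimates.append(x)
    return estimates


def mse(u, v):
    """Mean squared error between two equal-length sequences."""
    return sum((ui - vi) ** 2 for ui, vi in zip(u, v)) / len(u)


if __name__ == "__main__":
    clean, noisy = simulate_ar1(n=2000, a=0.95, q=0.1, r=1.0, seed=1)
    enhanced = kalman_ar1(noisy, a=0.95, q=0.1, r=1.0)
    print("MSE of noisy input:    ", mse(noisy, clean))
    print("MSE of filtered output:", mse(enhanced, clean))
```

In this toy setting the filter is given the true model parameters, so the filtered output should track the clean signal more closely than the noisy input does. A speech enhancement system has to estimate those parameters from the degraded signal, and would be scored with a perceptual measure such as PESQ rather than mean squared error.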
Chaogang Wu, Bo Li, Jin Zheng, "A Speech Enhancement Method Based on Kalman Filtering", IJWMT, vol. 1, no. 2, pp. 55-61, 2011. DOI: 10.5815/ijwmt.2011.02.08