Optimized Speech Compression Algorithm Based on Wavelets Techniques and its Real Time Implementation on DSP

Full Text (PDF, 444KB), PP.33-41

Views: 0 Downloads: 0

Author(s)

Noureddine Aloui 1,* Souha Bousselmi 1 Adnane Cherif 1

1. Sciences Faculty of Tunis, Innov’COM Laboratory, University of Tunis El-Manar, 2092, Tunisia

* Corresponding author.

DOI: https://doi.org/10.5815/ijitcs.2015.03.05

Received: 12 Jun. 2014 / Revised: 7 Oct. 2014 / Accepted: 4 Dec. 2014 / Published: 8 Feb. 2015

Index Terms

Speech Compression, Discrete Wavelet Transform, Voice Activity Detection, Hardware Implementation, Digital Signal Processor, RTW

Abstract

This paper presents an optimized speech compression algorithm using discrete wavelet transform, and its real time implementation on fixed-point digital signal processor (DSP). The optimized speech compression algorithm presents the advantages to ensure low complexity, low bit rate and achieve high speech coding efficiency, and this by adding a voice activity detector (VAD) module before the application of the discrete wavelet transform. The VAD module avoids the computation of the discrete wavelet coefficients during the inactive voice signal. In addition, a real-time implementation of the optimized speech compression algorithm is performed using fixed-point processor. The optimized and the original algorithms are evaluated and compared in terms of CPU time (sec), Cycle count (MCPS), Memory consumption (Ko), Compression Ratio (CR), Signal to Noise Ratio (SNR), Peak Signal to Noise Ratio (PSNR) and Normalized Root Mean Square Error (NRMSE).

Cite This Paper

Noureddine Aloui, Souha Bousselmi, Adnane Cherif, "Optimized Speech Compression Algorithm Based on Wavelets Techniques and its Real Time Implementation on DSP", International Journal of Information Technology and Computer Science(IJITCS), vol.7, no.3, pp.33-41, 2015. DOI:10.5815/ijitcs.2015.03.05

Reference

[1]O. Khalifa, S. Habib Harding, A. Hashim, "Compression using Wavelet Transform", Signal Processing: An International Journal, vol. 2, pp. 17-26, 2008.

[2]R.A. DeVore, B. Jawerth, B.J. Lucier, "Image compression through wavelet transform coding", IEEE Transactions on Information Theory, vol. 38, pp. 719-746, March 1992.

[3]H. Khalil, A. F. Atiya, S. Shaheen "Three-Dimensional Video Compression Using Subband/Wavelet Transform with Lower Buffering Requirements", IEEE transactions on image processing, vol. 8, June 1999.

[4]S. Ayat, M.T. Manzuri-Shalmani, R. Dianat, "An improved wavelet-based speech enhancement by using speech signal features", Computers & Electrical Engineering, vol. 32, pp. 411–425, November 2006.

[5]B. Walczak, B. van den Bogaert, D. Luc Massart, "Application of Wavelet Packet Transform in Pattern Recognition of Near-IR Data", Analytical Chemistry, vol. 68, pp. 1742-1747, May 15, 1996.

[6]S. Ardhapurkar, R. Manthalkar, S. Gajre “Interpretation of Normal and Pathological ECG Beats using Multiresolution Wavelet Analysis”, I.J. Information Technology and Computer Science, vol. 5, pp. 1-14, December 2012.

[7]G. Rajesh, A. Kumar and K. Ranjeet, “Speech Compression using Different Transform Techniques”, IEEE International Conference on Computer and Communication Technology(ICCCT), pp. 146-151, 2011.

[8]Hatem Elaydi, “Speech compression using wavelets”, 2010, http://site.iugaza.edu.ps/helaydi/files/2010/02/Elaydi.pdf

[9]S. Dusan , J. L. Flanagan, A. Karve, M. Balaraman, “Speech compression using polynomial approximation”, IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, pp. 387-395 , 2007.

[10]S. M. Joseph, “Spoken digit compression using wavelet packet”, IEEE international conference on signal and image processing (ICSIP-2010), pp. 255-259, 2010.

[11]Z. Dan, S. Ma, “Speech Compression with Best Wavelet Packet Transform and SPIHT Algorithm”, International Conference on Computer Modeling and Simulation (ICCMS '10-2010), pp. 360-363, 2010.

[12]E. B. Fgee, W. J. Philips et W. Robertson, “Comparing audio compression using wavelet with other audio compression schemes”, IEEE Proceedings, Electrical and Computer Engineering, vol. 2, pp. 698-701, 1999.

[13]M. Abdul Mawla, A. Najih, Abdul Rahman Ramli, Azizah Ibrahim, Syed A.R, “Comparing Speech Compression Using Wavelets With Other Speech Compression Schemes”, Student Conference on Research and Development, Proceedings, pp. 55-58, 2003.

[14]N. Sriram, “Comparison of CELP speech coder with a wavelet method”, Thesis, 2006.

[15]A. Grossman and J. Morlet, “Decomposition of Hardy functions into square integrable wavelets of constant shape”, SIAM J. Math. Anal., vol. 15, pp. 723-736, 1984.

[16]P. Gilles Lemariés-Rieusset, “Sur l’existence des analyses multi-résolutions en théorie des ondelettes”, Revista Mathematica Ibero Americana, vol. 8, pp. 457-474, 1992.

[17]S. Mallat, “Multiresolution approximations and wavelet orthonormal bases of L2(R)”, Trans. Amer. Math. Soc;. 315, pp. 69-87, 1989.

[18]W. Kinsner, A. Langi, “Speech and Image Signal Compression with Wavelets”, IEEE Wescanex Conference Proceedings, New York, NY, pp. 368-375, 1993.

[19]S. Kumar, V. K. Chaudhari, R. K. Singh,D. Varshney, “A New Algorithm for Voice Signal Compression (VSC) and Analysis Suitable for Limited Storage Devices Using MatLab”, International Journal of Computer and Electrical Engineering, vol. 1, pp. 1793-8163, 2009.

[20]J. I. Agbinya, “Discrete wavelet transform techniques in speech processing”, TENCON 96. Proceedings, IEEE TENCON. Digital Signal Processing Applications, vol. 2, pp. 514–519, 1996.

[21]D.L. Donoho, “De-Noising by softthresholding”, IEEE Transaction on Inf. Theory, Vol. 41, pp.613-627, 1995.

[22]M. Misiti, Y. Misiti, G. Oppenheim et J. Poggi, “Matlab Wavelet Tool Box”, The Math Works Inc., 2000.

[23]Y. Ephram and D. Malah, “Speech Enhancement Using a- Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator”, IEEE Transaction on acoustics, speech and signal processing, vol. 32, pp. 1109-1121, 1984.

[24]G. Madre, E.H. Baghious, S. Azou et G. Burel, ” Design of a variable rate algorithm for CS-ACELP Coder”, IEEE International Conference on Communications (ICC’03), Anchorage, AK,USA, 2003.

[25]ITU-T Recommandation G.729 - Annex B, ”Shéma de compression des silences pour la recommandation G.729 optimisé pour les terminaux conformes à la recommendation V.70”, 1996.

[26]J. Garofolo, “Darpa Timit: Acoustic-phonetic Continuous Speech Corpus CD-ROM”, US Department of Commerce, National Institute of Standards and Technology, 1993.

[27]MathWorks, “Real-Time Workshop For Use with SIMULINK”, User’s Guide, Version 6, June 2004.

[28]Texas instruments, “TMS320 DSP/BIOS”, v5.42, User’s Guide, spru423I, Août 2012.