Text Localization and Character Extraction in Natural Scene Images using Contourlet Transform and SVM Classifier

Full Text (PDF, 647KB), PP.36-42

Author(s)

Shivanand Seeri 1,*, Jagadeesh D. Pujari 2, P. S. Hiremath 1

1. Department of Computer Applications (MCA), KLE Technological University, BVBCET Campus, Hubballi, Karnataka

2. Department of Information Science & Engineering, S.D.M. College of Engg. & Tech., Dharwad, Karnataka

* Corresponding author.

DOI: https://doi.org/10.5815/ijigsp.2016.05.02

Received: 15 Jan. 2016 / Revised: 26 Feb. 2016 / Accepted: 1 Apr. 2016 / Published: 8 May 2016

Index Terms

Natural scene images, Text localization, Contourlet transform, SVM classifier, GLCM, horizontal profile, vertical profile

Abstract

This study proposes a method for text region localization and character extraction in natural scene images with complex backgrounds. The hybrid methodology extracts multilingual text from natural scene images with cluttered backgrounds in four steps. First, potential text regions are extracted from the image based on edge features obtained with the contourlet transform. Second, each potential region is classified as text or non-text using gray-level co-occurrence matrix (GLCM) features and an SVM classifier. Third, multiple lines within the localized text regions are detected and segmented using horizontal profiles. Finally, each character of a segmented line is extracted using vertical profiles. Experiments were carried out on images drawn from the authors' own dataset and the ICDAR dataset, and performance is measured in terms of precision and recall. The results demonstrate the effectiveness of the proposed method, which can be used as an efficient method for text recognition in natural scene images.
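
The third and fourth steps (line segmentation with horizontal profiles, then character extraction with vertical profiles) can be illustrated with a minimal sketch. It assumes the localized text region has already been binarized so that text pixels are 1 and background pixels are 0. The function names, the `min_count` threshold, and the toy `region` are hypothetical and are not taken from the paper; the authors' actual implementation may differ.

```python
from typing import List, Tuple

Image = List[List[int]]  # binary image: 1 = text pixel, 0 = background (assumed)


def horizontal_profile(img: Image) -> List[int]:
    """Count of text pixels in each row."""
    return [sum(row) for row in img]


def vertical_profile(img: Image) -> List[int]:
    """Count of text pixels in each column."""
    return [sum(col) for col in zip(*img)]


def runs(profile: List[int], min_count: int = 1) -> List[Tuple[int, int]]:
    """Half-open (start, end) index ranges where the profile is >= min_count."""
    spans = []
    start = None
    for i, value in enumerate(profile):
        if value >= min_count and start is None:
            start = i                      # a run of text rows/columns begins
        elif value < min_count and start is not None:
            spans.append((start, i))       # a gap closes the current run
            start = None
    if start is not None:
        spans.append((start, len(profile)))
    return spans


def segment_characters(img: Image) -> List[List[Tuple[int, int, int, int]]]:
    """For each text line, return character boxes as (top, bottom, left, right)."""
    lines = []
    for top, bottom in runs(horizontal_profile(img)):    # split into lines
        band = img[top:bottom]
        lines.append([(top, bottom, left, right)          # split line into characters
                      for left, right in runs(vertical_profile(band))])
    return lines


# Toy region: two text lines separated by one empty row.
region = [
    [1, 1, 0, 1, 0],
    [1, 1, 0, 1, 0],
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1],
]
boxes = segment_characters(region)
```

Splitting on zero-valued gaps in the profile is the simplest choice; touching characters or skewed lines would need a higher `min_count` or additional processing, which this sketch does not attempt.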

Cite This Paper

Shivananda V. Seeri, J. D. Pujari, P. S. Hiremath, "Text Localization and Character Extraction in Natural Scene Images using Contourlet Transform and SVM Classifier", International Journal of Image, Graphics and Signal Processing (IJIGSP), Vol. 8, No. 5, pp. 36-42, 2016. DOI: 10.5815/ijigsp.2016.05.02
