A Systematic Literature Review on Spell Checkers for Bangla Language

Full Text (PDF, 620KB), PP.40-47

Views: 0 Downloads: 0

Author(s)

Prianka Mandal 1,* B M Mainul Hossain 1

1. Institute of Information Technology, University of Dhaka, Dhaka, 1000, Bangladesh

* Corresponding author.

DOI: https://doi.org/10.5815/ijmecs.2017.06.06

Received: 12 Mar. 2017 / Revised: 9 Apr. 2017 / Accepted: 13 May 2017 / Published: 8 Jun. 2017

Index Terms

Systematic Literature Review, Spelling Errors, Spell Detecting, Spell Checking, Spell Checker, Bangla Language, Misspelled Word

Abstract

Spell checkers check whether a word is misspelled and provide suggestions to correct it. Detection and correction of spelling errors in Bangla language which is the seventh most spoken native language in the world, is very onerous because of the complex rules of Bangla spelling. There is no systematic literature review on this research topic. In this paper, we present a systematic literature review on checking and correcting spelling errors in Bangla language. We investigate the current methods used for spell checking and find out what challenges are addressed by those methods. We also report the limitations of those methods. Recent relevant studies are selected based on a set of significant criteria. Our results indicate that there are research gaps in this research topic and has a potential for further investigation.

Cite This Paper

Prianka Mandal, B M Mainul Hossain, "A Systematic Literature Review on Spell Checkers for Bangla Language", International Journal of Modern Education and Computer Science(IJMECS), Vol.9, No.6, pp.40-47, 2017. DOI:10.5815/ijmecs.2017.06.06

Reference

[1]Keele, Staffs. "Guidelines for Performing Systematic Literature Reviews in Software Engineering." Technical report, Ver. 2.3 EBSE Technical Report. EBSE. 2007.
[2]Kukich, Karen. "Techniques for Automatically Correcting Words in Text." ACM Computing Surveys (CSUR) 24.4 (1992): 377-439.
[3]P. Kundu and B.B. Chaudhuri (1999) "Error Pattern in Bangla Text". International Journal of Dravidian Linguistics. 28(2): 49-88.
[4]D. E. Knuth, The Art of Computer Programming, Vol. 3, Addison-Wesley Publishing Company, Reading, Massachusetts, 2nd edition, 1982.
[5]Lawrence Phillips, “Hanging on the Metaphone”, Computer Language, 7(12), 1990.
[6]Lawrence Phillips, “The Double Metaphone Search Algorithm”, C/C++ Users Journal, 18(6), June, 2000.
[7]T. N. Gadd, “PHONIX: The Algorithm”, Program, 24(4), pp. 363-366, 1990.
[8]Levenshtein, V. I. (1966). Binary Codes Capable of Correcting Deletions, Insertions, and Reversals. 10(8), 707–710.
[9]W. Kraaij and R. Pohlman, “Viewing Stemming as Recall Enhancement”, In the Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1996, pp. 40–48.
[10]Lovins, Julie Beth (1968). "Development of a Stemming Algorithm". Mechanical Translation and Computational Linguistics 11: 22–31.
[11]Porter, Martin F.1980. An Algorithm for Suffix Stripping. Program 14 (3): 130-137.
[12]C. E. Shannon, “Prediction and Entropy of Printed English,” Bell Sys. Tec. J. (30):50–64, 1951.
[13]Farag Ahmed, Ernesto William De Luca, and Andreas Nürnberger, “Revised N-Gram based Automatic Spelling Correction Tool to Improve Retrieval Effectiveness”, August 22, 2009.
[14]Chaudhuri, Bidyut Baran. "Reversed Word Dictionary and Phonetically Similar Word Grouping based Spell-checker to Bangla Text." Proc. LESAL Workshop, Mumbai. 2001.
[15]Naushad UzZaman and Mumit Khan, “A Bangla Phonetic Encoding for Better Spelling Suggestions”, Proc. 7th International Conference on Computer and Information Technology, Dhaka, December, 2004.
[16]UzZaman, Naushad, and Mumit Khan. "A Double Metaphone Encoding for Bangla and its Application in Spelling Checker." 2005 International Conference on Natural Language Processing and Knowledge Engineering. IEEE, 2005.
[17]Islam, Md, Md Uddin, and Mumit Khan. “A Light Weight Stemmer for Bengali and its Use in Spelling Checker,” Proc. 1st Intl. Conf. on Digital Comm. and Computer Applications (DCCA07), Irbid, Jordan, March 19-23, 2007.
[18]N. UzZaman and M. Khan, “A Comprehensive Bangla Spelling Checker”, In the Proceeding of the International Conference on Computer Processing on Bengali (ICCPB), Dhaka, Bangladesh, 2006.
[19]Hoque, Md Tamjidul, and Md Kaykobad. "Coding System for Bangla Spell Checker." 5th International Conference on Computer and Information Technology. 2002.
[20]Abdullah, Md Munshi, Md Zahurul Islam, and Mumit Khan. "Error-tolerant Finite-state Recognizer and String Pattern Similarity Based Spelling-Checker for Bangla." Proceeding of 5th International Conference on Natural Language Processing (ICON). 2007.
[21]Chaudhuri, Bidyut Baran. "Towards Indian Language Spell-checker Design." Language Engineering Conference, 2002. Proceedings. IEEE, 2002.
[22]Abdullah, A. B. A., and Ashfaq Rahman. "A Generic Spell Checker Engine for South Asian Languages." Conference on Software Engineering and Applications (SEA 2003). 2003.
[23]Murshed, M. Manzur, Mahbubur Rahman Syed, and M. Kaykobad. "A Linguistically Sortable Bengali Coding System and its Application in Spell Checking: A Case Study of Multilingual Applications." Interactive multimedia systems (2002): 251.
[24]Khan, Nur Hossain, et al. "Checking the Correctness of Bangla Words using N-Gram." International Journal of Computer Application 89.11 (2014).
[25]Haque, Md Tamjidul, and M. Kaykobad. "Use of Phonetic Similarity for Bangla Spell Checker." Proc. 5th International Conference on Computer and Information Technology. 2002.
[26]Abdullah, A. B. A., and Ashfaq Rahman. "A Different Approach in Spell Checking for South Asian Languages." Proc. 2nd International Conference on Information Technology for Applications (ICITA), China. 2004.
[27]Abdullah, Arif Billah Al-Mahmud, and Ashfaq Rahman. "Spell Checker for Bangla Language: An Implementation Perspective." Proc. 6th International Conference on Computer and Information Technology, Dhaka, Bangladesh. 2003.
[28]UzZaman, Naushad. "Phonetic Encoding for Bangla and its Application to Spelling Checker, Name Searching, Transliteration and Cross Language Information Retrieval." Undergraduate thesis (Computer Science), BRAC University (2005).
[29]Bhowmik, Kowshik, Afsana Zarin Chowdhury, and Sushmita Mondal. Development of A Word Based Spell Checker for Bangla Language. Diss. Department of Computer Science and Engineering, Military Institute of Science and Technology, 2014.
[30]Asadullah, Munshi. Finite State Recognizer and String Similarity based Spelling Checker for Bangla. Diss. BRAC University, 2007.
[31]Govilkar, Sharvari S., J. W. Bakal, and Sagar R. Kulkarni. "Extraction of Root Words using Morphological Analyzer for Devanagari Script." International Journal of Information Technology and Computer Science (IJITCS) 8.1 (2016): 33.
[32]Aktaruzzaman, Md, and Md Farukuzzaman Khan. "A New Technique for Segmentation of Handwritten Numerical Strings of Bangla Language." International Journal of Information Technology and Computer Science (IJITCS) 5.5 (2013): 38.
[33]Doumi, Noureddine, et al. "A Semi-Automatic and Low Cost Approach to Build Scalable Lemma-based Lexical Resources for Arabic Verbs." International Journal of Information Technology and Computer Science (IJITCS) 8.2 (2016): 1.
[34]Divya, K. S., R. Subha, and S. Palaniswami. "Similar Words Identification Using Naive and TF-IDF Method." International Journal of Information Technology and Computer Science (IJITCS) 6.11 (2014): 42.