J. W. Bakal

Work place: SJCOE, Mumbai, India

E-mail: bakaljw@gmail.com

Website:

Research Interests: Information Security, Information Retrieval, Information-Theoretic Security

Biography

Dr. J. W. Bakal received M.Tech in Electronics Engineering, from Marathwada University. Later, He has completed his Ph.D. in the field of Computer Engineering from Bharati University, Pune. He is a PhD supervisor in CSE at University of Mumbai. He is presently working as principal at the S.S. Jondhale College of Engineering, Thane, India. He was a chairman of board of studies in Information Technology in University of Mumbai. His research interests are Telecomm Networking, Mobile Computing and Information Security. He has publications in journals, conference proceedings, and books in his credits. During his academics tenure, he has attended, organized and conducted training programs in Computer and Electronics branches. He is life member of professional societies such as IETE, ISTE INDIA. He is also a member of IEEE. He has prominently worked for IETE as a chairman, Mumbai section.

Author Articles
Extraction of Root Words using Morphological Analyzer for Devanagari Script

By Sharvari S. Govilkar J. W. Bakal Sagar R. Kulkarni

DOI: https://doi.org/10.5815/ijitcs.2016.01.04, Pub. Date: 8 Jan. 2016

In India, more than 300 million people use Devanagari script for documentation. In Devanagari script, Marathi and Hindi are mainly used as primary language of Maharashtra state and national language of India respectively. As compared with English script, Devanagari script is reach of morphemes. Thus the lemmatization of Devanagari script is quite complex than that of English script. There is lack of resources for Devanagari script such as WordNet, ontology representation, parsing the keywords and their part of speech. Thus the overall task of information retrieval becomes complex and time consuming. Devanagari script document always carries suffixes which may cause problem in accurate information retrieval. We propose a method of extracting root words from Devanagari script document which can be used for information retrieval, text summarization, text categorization, ontology building etc. An attempt is made to design the Morphological Analyzer for Devanagari script. We have designed CORPUS containing more than 3000 possible stop words and suffixes for Marathi language. Morphological Analyzer can acts as a preliminary stage for developing any information retrieval application in Devanagari script. We have conducted the experiments on randomly selected Marathi documents and we found the accuracy of designed morphological analyzer is up to 96%.

[...] Read more.
Other Articles