Sagar R. Kulkarni

Work place: Department of Computer Engineering, PIIT, New Panvel, India

E-mail: skulkarni@mes.ac.in

Website:

Research Interests: Natural Language Processing, Computer Architecture and Organization, Data Structures and Algorithms, Programming Language Theory

Biography

Sagar Kulkarni is Assistant Professor in Computer Engineering department at PIIT, New Panvel, University of Mumbai, India. He has completed M.E. in Computer Engineering from University of Mumbai, India. Sagar has received BE in CSE from Shivaji University, Kolhapur. He is having Eight years of experience in teaching. His area of interest are Text Mining and Summarization, System Programming and Compiler construction, Natural Language Processing, Information Retrieval etc.

Author Articles
Extraction of Root Words using Morphological Analyzer for Devanagari Script

By Sharvari S. Govilkar J. W. Bakal Sagar R. Kulkarni

DOI: https://doi.org/10.5815/ijitcs.2016.01.04, Pub. Date: 8 Jan. 2016

In India, more than 300 million people use Devanagari script for documentation. In Devanagari script, Marathi and Hindi are mainly used as primary language of Maharashtra state and national language of India respectively. As compared with English script, Devanagari script is reach of morphemes. Thus the lemmatization of Devanagari script is quite complex than that of English script. There is lack of resources for Devanagari script such as WordNet, ontology representation, parsing the keywords and their part of speech. Thus the overall task of information retrieval becomes complex and time consuming. Devanagari script document always carries suffixes which may cause problem in accurate information retrieval. We propose a method of extracting root words from Devanagari script document which can be used for information retrieval, text summarization, text categorization, ontology building etc. An attempt is made to design the Morphological Analyzer for Devanagari script. We have designed CORPUS containing more than 3000 possible stop words and suffixes for Marathi language. Morphological Analyzer can acts as a preliminary stage for developing any information retrieval application in Devanagari script. We have conducted the experiments on randomly selected Marathi documents and we found the accuracy of designed morphological analyzer is up to 96%.

[...] Read more.
Other Articles