Work place: Department of Computer Science, B.B. Ambedkar University, Lucknow-226025, India
E-mail: arya.chandrakala@gmail.com
Research Interests: Computer systems and computational processes, Information Systems, Data Structures and Algorithms, Information Theory, Algorithmic Information Theory
Biography
Chandrakala Arya is a Research Scholar in the Department of Computer Science at B.B. Ambedkar University, Lucknow, India. She received her M.C.A. degree in 2011 from Uttarakhand Technical University. Her research interests include Information Extraction and Text Summarization. She has published research papers in international conferences.
By Chandrakala Arya, Sanjay K. Dwivedi
DOI: https://doi.org/10.5815/ijeme.2018.01.06, Pub. Date: 8 Jan. 2018
Keyphrase extraction from news web pages is an important task for news document retrieval and summarization. Keyphrases act like index terms that capture the important information in a document and offer a concise, precise description of its content. A keyphrase may be a single word or a combination of words that represents an important concept in a text document. The aim of this paper is to develop and evaluate an automatic keyphrase extraction approach for news web pages. Our approach identifies candidate keyphrases in a document and selects those with the highest weight scores. The weight formula combines a feature set that includes TF*IDF, phrase distance in the document, and a lexical chain based on WordNet that represents semantic relations between words. The experimental results show that our approach performs better than contemporary approaches.
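The weighting idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the actual formula, so the linear combination, the weights, the stopword list, the n-gram candidate rule, and all function names here are assumptions made for illustration. The lexical chain feature is not computed; it is passed in as a hypothetical precomputed dictionary (chain_score), which in practice might come from a WordNet-based step.

```python
import math
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not from the paper).
STOPWORDS = {"a", "an", "the", "of", "in", "on", "for", "and", "to",
             "is", "are", "from", "that", "by", "with"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def candidates(tokens, max_len=3):
    """Yield (start_index, phrase) for n-grams that contain no stopword."""
    for n in range(1, max_len + 1):
        for i in range(len(tokens) - n + 1):
            gram = tokens[i:i + n]
            if not any(t in STOPWORDS for t in gram):
                yield i, " ".join(gram)


def score_keyphrases(doc, corpus, chain_score=None, weights=(1.0, 1.0, 1.0)):
    """Score candidates by an assumed weighted sum of three features:
    TF*IDF, position of first occurrence, and a supplied chain score."""
    chain_score = chain_score or {}
    w_tfidf, w_dist, w_chain = weights
    tokens = tokenize(doc)
    if not tokens:
        return {}

    counts = Counter()
    first_pos = {}
    for i, phrase in candidates(tokens):
        counts[phrase] += 1
        first_pos.setdefault(phrase, i)  # keep the earliest position

    # Candidate phrase sets of every corpus document, used for document frequency.
    corpus_phrases = [{p for _, p in candidates(tokenize(d))} for d in corpus]
    n_docs = len(corpus)

    scores = {}
    for phrase, count in counts.items():
        tf = count / len(tokens)
        df = sum(phrase in phrases for phrases in corpus_phrases)
        idf = math.log((1 + n_docs) / (1 + df)) + 1.0  # smoothed IDF
        # Phrases that first appear earlier in the document score higher.
        distance = 1.0 - first_pos[phrase] / len(tokens)
        scores[phrase] = (w_tfidf * tf * idf
                          + w_dist * distance
                          + w_chain * chain_score.get(phrase, 0.0))
    return scores


def top_keyphrases(doc, corpus, k=5, chain_score=None):
    """Return the k highest-scoring candidate phrases."""
    scores = score_keyphrases(doc, corpus, chain_score)
    return sorted(scores, key=scores.get, reverse=True)[:k]
```

Calling top_keyphrases(doc, corpus, k=5) with a news article as doc and a list of articles as corpus returns the five highest-weighted candidates. The sketch ignores sentence boundaries, so some n-grams span two sentences; a fuller version would split sentences first and could filter candidates by part of speech.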