Syntactic and Sentence Feature Based Hybrid Approach for Text Summarization

Full Text (PDF, 277KB), PP.38-46

Views: 0 Downloads: 0

Author(s)

D.Y. Sakhare 1,* Raj Kumar 2

1. Bharati Veedyapeeth Deemed University, Pune, Maharashtra, India

2. DRDO, Scientist ā€—Dā€˜, DIAT, Khadakwasla, Pune, Maharashtra, India

* Corresponding author.

DOI: https://doi.org/10.5815/ijitcs.2014.03.05

Received: 24 May 2013 / Revised: 20 Oct. 2013 / Accepted: 11 Dec. 2013 / Published: 8 Feb. 2014

Index Terms

Text Summarization, Dependency Grammar, Syntactic Structure, Feature Score, POS Tagger, DUC 2002

Abstract

Recently, there has been a significant research in automatic text summarization using feature-based techniques in which most of them utilized any one of the soft computing techniques. But, making use of syntactic structure of the sentences for text summarization has not widely applied due to its difficulty of handling it in summarization process. On the other hand, feature-based technique available in the literature showed efficient results in most of the techniques. So, combining syntactic structure into the feature-based techniques is surely smooth the summarization process in a way that the efficiency can be achieved. With the intention of combining two different techniques, we have presented an approach of text summarization that combines feature and syntactic structure of the sentences. Here, two neural networks are trained based on the feature score and the syntactic structure of sentences. Finally, the two neural networks are combined with weighted average to find the sentence score of the sentences. The experimentation is carried out using DUC 2002 dataset for various compression ratios. The results showed that the proposed approach achieved F-measure of 80% for the compression ratio 50 % that proved the better results compared with the existing techniques.

Cite This Paper

D.Y. Sakhare, Raj Kumar, "Syntactic and Sentence Feature Based Hybrid Approach for Text Summarization", International Journal of Information Technology and Computer Science(IJITCS), vol.6, no.3, pp.38-46, 2014. DOI:10.5815/ijitcs.2014.03.05

Reference

[1]Automatic text summarization using sentence Features: a review, International J. of Engg. Research & Indu. Appls. (IJERIA). ISSN 0974-1518, Vol.4, No. IV, November 2011, pp. 31- 42.

[2]Oi Mean Foong, Alan Oxley and Suziah Sulaiman, ‘Challenges and trends in automatic text summarization, IJITT, Vol. 1, Issue 1, 2010 pp 34-39’.

[3]Joakim Nivre, “Dependency Grammar and Dependency Parsing”, In MSI report 05133, 2005.

[4]Luhn , ‘The Automatic Creation of Literature Abstracts’, IBM Journal April 1958 pp. 159–165. 

[5]Baxendale, ‘Machine-made Index for Technical Literature’An Experiment‘, IBM Journal of Research Development, Vol. 2, No.4, pp. 354-361, 1958. 

[6]Edmundson, ‘New Methods in Automatic Extracting’, Journal of the Association for Computing Machinery, Vol 16, No 2, April 1969, PP. 264-285. 

[7]Pollock and Zamora, ‘Automatic Abstracting Research at Chemical Abstracts Service’,Journal of Chemical Information and Computer Sciences, 15(4), 226-232,1975. 

[8]Kathleen McKeown, ‘Discourse Strategies for Generating Natural Language Text’, Department ofComputer Science, Columbia University, New York, 1982 

[9]Turney, Learning to extract keyphrases from text’, technical report ERB-1057. (NRC#41622), National Research Council, Institute for Information Technology, 1999. 

[10]Marcu, ‘The automatic construction of large-scale corpora for summarization research’. In Proceedings of the 22nd International Conference on Research and Development in Information Retrieval, University of California, Berkeley, August 1999.

[11]Boguraev, Kennedy and Bellamy, ‘Dynamic presentation of phrasal-based document abstractions’, 32nd International Conference on System Sciences, 1999. 

[12]Brandow, R., Mitze, K., Rau,’ Automatic condensation of electronic publications by sentence selection’. Information Processing anagement,31(5):675-685, 1995. 

[13]Radev, R., Blair-goldensohn, S, Zhang, Z., ‘Experiments in Single and Multi-Docuemtn Summarization using MEAD’. In First Document Understanding Conference, New Orleans, LA, 2001. 

[14]Jing, Hongyan and Kathleen McKeown., ‘Cut and paste based text summarization’. In 1st Conference of the North American Chapter of the Association for Computational Linguistics , 2000

[15]Nadira Begum, Mohamed Abdel Fattah, Fuji Ren, ‘Automatic text summarization using support vector machine’, International Journal of Innovative Computing, Volume 5, pp 1987-1996, 2009. 

[16]Rafeeq Al-Hashemi, ‘Text Summarization Extraction System (TSES)Using Extracted Keywords’, International Arab Journal of e-Technology, Vol. 1, No. 4, pp 164-168, 2010

[17]Rajesh Prasad, Uday Kulkarni, Implementation and Evaluation of Evolutionary Connectionist Approaches to Automated Text Summarization’ Journal of Computer Science, 2010 ISSN 1549-3636, pp1366-1376.

[18]Dipti..Sakhare, Rakjumar ‘Syntactical Knowledge based Stemmer for Automatic Document Summarization’, CIIT international journal of data mining knowledge engineering print: ISSN 0974 – 9683 & online: ISSN 0974 – 9578 Issue: march 2012 doi: dmke032012002.

[19]DUC.nist.gov/data.html 

[20]Dipti..Sakhare, Rakjumar “Neural network based approach to study the effect of feature selection on document summarization” (IJET) ISSN: 0975-4024, Vol 5, Issue No 3, Jun-Jul 2013 pp 2585-2593.