Opinion Mining of Online Product Reviews from Traditional LDA Topic Clusters using Feature Ontology Tree and Sentiwordnet

Full Text (PDF, 455KB), PP.34-44

Author(s)

D. Teja Santosh 1, K. Sudheer Babu 1, S.D.V. Prasad 1, A. Vivekananda 1

1. GITAM University, Rudraram, Telangana, India

* Corresponding author.

DOI: https://doi.org/10.5815/ijeme.2016.06.04

Received: 7 Jul. 2016 / Revised: 6 Sep. 2016 / Accepted: 10 Oct. 2016 / Published: 8 Nov. 2016

Index Terms

Document indicator, Feature Ontology Tree, Latent Dirichlet Allocation, Opinion word, SentiWordNet

Abstract

Online product reviews capture users' perspectives on the product features they have experienced. Product features and the corresponding opinions form a major part of analyzing online product reviews. Approaches to extracting features from a large number of reviews fall into three major categories: language rules, sequence labeling, and topic modeling. Latent Dirichlet Allocation (LDA) is one such topic model; it clusters document words into topics learned without supervision, using Dirichlet priors. In the product review domain, the clustered words are the features and opinion words. To identify appropriate product features from these clusters, a hierarchical, domain-independent Feature Ontology Tree (FOT) is applied to the LDA clusters. The opinion-bearing words of the obtained product features are identified using the document indicators available from the topic matrix of LDA. These indicators make it possible to backtrack to the online review in which the product feature occurs. The polarity of each opinion-bearing word is calculated with the help of SentiWordNet. This approach improves the accuracy of the features extracted from LDA topic clusters, and the machine interpretation of opinion-word polarity is satisfactory.
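
The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the reviews, the topic clusters, the tree, and the scores are hypothetical toy data standing in for real LDA output, a real Feature Ontology Tree, and SentiWordNet, and the rule of treating any lexicon word in the backtracked review as the opinion word is a simplification rather than the authors' method.

```python
# Toy sketch of the pipeline outlined in the abstract. All data below is
# hypothetical; the dictionaries stand in for real LDA output, a real
# Feature Ontology Tree, and SentiWordNet scores.

# Reviews, keyed by document indicator.
reviews = {
    0: "the battery is excellent",
    1: "the screen is dull",
    2: "delivery was quick",
}

# Stand-in for LDA topic clusters: topic id -> (word, document indicator) pairs.
topic_clusters = {
    0: [("battery", 0), ("excellent", 0), ("screen", 1)],
    1: [("dull", 1), ("delivery", 2), ("quick", 2)],
}

# Stand-in for a Feature Ontology Tree: parent feature -> child features.
feature_ontology_tree = {
    "phone": ["battery", "screen"],
    "battery": [],
    "screen": [],
}

# Stand-in for SentiWordNet: word -> (positive score, negative score).
toy_lexicon = {
    "excellent": (0.875, 0.0),
    "dull": (0.0, 0.625),
    "quick": (0.5, 0.0),
}


def ontology_features(tree):
    """Collect every node name in the tree (parents and children)."""
    names = set(tree)
    for children in tree.values():
        names.update(children)
    return names


def polarity_label(pos, neg):
    """Map a (positive, negative) score pair to a coarse polarity label."""
    score = pos - neg
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def mine_opinions(clusters, tree, docs, lexicon):
    """Return (feature, opinion word, polarity) triples."""
    features = ontology_features(tree)
    results = []
    for words in clusters.values():
        for word, doc_id in words:
            if word not in features:
                continue  # keep only cluster words the tree lists as features
            # Backtrack to the review through the document indicator.
            for token in docs[doc_id].split():
                if token in lexicon:
                    results.append((word, token, polarity_label(*lexicon[token])))
    return results


opinions = mine_opinions(topic_clusters, feature_ontology_tree, reviews, toy_lexicon)
for feature, opinion_word, polarity in opinions:
    print(feature, opinion_word, polarity)
```

In this toy run, "delivery" is dropped because the hypothetical tree does not list it as a feature, which mirrors the filtering role the abstract assigns to the FOT.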

Cite This Paper

D. Teja Santosh, K. Sudheer Babu, S.D.V. Prasad, A. Vivekananda, "Opinion Mining of Online Product Reviews from Traditional LDA Topic Clusters using Feature Ontology Tree and Sentiwordnet", International Journal of Education and Management Engineering (IJEME), Vol. 6, No. 6, pp. 34-44, 2016. DOI: 10.5815/ijeme.2016.06.04
