Identifying Sentiment in Web Multi-topic Documents

Full Text (PDF, 173KB), PP.10-16

Views: 0 Downloads: 0

Author(s)

Na Fan 1,*

1. Chang ’an University Xi’ an, China

* Corresponding author.

DOI: https://doi.org/10.5815/ijwmt.2012.01.02

Received: 27 Oct. 2011 / Revised: 25 Nov. 2011 / Accepted: 3 Jan. 2012 / Published: 15 Feb. 2012

Index Terms

Analyzing Sentiment, Multi-topic Text, Parametric Mixture Model

Abstract

Most of web documents coverage multiple topic. Identifying sentiment of multi-topic documents is a challenge task. In this paper, we proposed a new method to solve this problem. The method firstly reveals the latent topical facets in documents by Parametric Mixture Model. By focusing on modeling the generation process of a document with multiple topics, we can extract specific properties of documents with multiple topics. PMM models documents with multiple topics by mixing model parameters of each single topic. In order to analyze sentiment of each topic, conditional random fields techniques is used to identify sentiment. Empirical experiments on test datasets show that this approach is effective for extracting subtopics and revealing sentiments of each topic. Moreover, this method is quite general and can be applied to any kinds of text collections.

Cite This Paper

Na Fan,"Identifying Sentiment in Web Multi-topic Documents", IJWMT, vol.2, no.1, pp.10-16, 2012. DOI: 10.5815/ijwmt.2012.01.02

Reference

[1] Liu Jing, Zhong Wei-Cai, Liu Fang, Jiao Li-Cheng. Classification based on organization coevolutionary algorithm. Chinese Journal of Computers, 2003, 26(4): 446-453 (in Chinese).

[2] D. T. Larose, Discovering Knowledge in Data: An Introduction to Data Mining, Wiley Publishers, 2004.

[3] U. Fayyad, “Data mining and knowledge discovery in databases: implications for scientific database,” Proc. of the 9th International Conference on Scientific and Statistical Database Management, pp. 2-11, 1997.

[4] C. Apte, B. Liu, E. P. D. Pednault, and P. Smyth,“Business applications of data mining,” Communications of the ACM, vol. 45, no. 8, pp. 49-53, 2002.

[5] M. Garofalakis, R. Rastogi, and K. Shim, “Mining sequential patterns with regular expression constraints,”IEEE Trans. on Knowledge and Data Engineering, vol.14, no. 3, pp. 530-552, 2002.

[6] Zadeh, L.A. Fuzzy Sets. Information and Control 8:338:353 1965.