A HYBRID APPROACH FOR SUPERVISED TWITTER SENTIMENT CLASSIFICATION.

revathy kumaresan

Abstract


Micro blogging Websites like Twitter, Facebook have become rich source of opinions. This information can be leveraged by different communities to perform sentiment analysis. There is a need for automatically detecting the polarity of Twitter messages.  A semantic sentiment mining system is proposed to determine the contextual polarity of a sentence. This hybrid approach uses three different machine learning models for classifying the sentiment as positive and negative. The system presents more significant approach towards the contextual information in the document which is one of the drawbacks of the systems which are available for determining contextual information. The first model uses rule-based classification based on compositional semantic rules that identifies expression level polarity. The second one performs sense-based classification based on WordNet senses as features to Support Vector Machine classifier. Further to provide a meaningful classification, semantics are incorporated as additional feature into the training data by the interpolation method. Thus, the third model performs entity-level analysis based on concepts obtained. The outputs of three models are handled by knowledge inference system to predict the polarity of sentence. This system is expected to produce better results when compared to the baseline system performance. The system aims to predict consumer moods and the attitude in real-time which can be efficiently utilized by the firms to increase productivity and revenue.


Keywords


TWITTER SENTIMENT CLASSIFICATION; HYBRID APPROACH; SEMANTIC ANALYSIS;SENTIMENT ANALYSIS

Full Text:

PDF

References


B. Pang, L. Lee, S. Vaithyanathan, (2002), “Thumbs up? Sentiment classification using Machine learning Techniquesâ€, in: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, Volume 10, pg. 79–86.

K. Dave, S. Lawrence, D.M. Pennock, (2003), “Mining the peanut gallery: opinion extraction and semantic Classification of product reviewsâ€, in: Proceedings of the 12th International Conference on World Wide Web, pg. 519 – 528.

T. Kudo, Y. Matsumoto, (2004), “A boosting algorithm for classification of semi-structured Textâ€, in: Proceedings of EMNLP.

C. Chen, F. Ibekwe-SanJuan, E. SanJuan, C. Weaver, (2006), “Visual analysis of conflicting opinionsâ€, in: Visual Analytics Science and Technology, IEEE Symposium On, pg. 59–66.

M.Annett, G. Kondrak, (2008), “A comparison of sentiment analysis techniques: polarizing movie Blogsâ€, in: Advances in Artificial Intelligence, pg: 25-35.

A.Go, R. Bhayani, L. Huang, (2009), “Twitter sentiment classification using distant supervisionâ€, in: CS224N Project Report, Stanford, pg. 1–12.D.

Davidov, O. Tsur, A. Rappoport, (2003), “Enhanced sentiment learning using Twitter hashtags and smileysâ€, in: Proceedings of the 23rd International Conference on Computational Linguistics, Posters, pg. 241–249.

Magdalini Eirinaki , Shamita Pisal , Japinder Singh (2012), “feature-based opinion mining and ranking†, Journal of Computer and System Sciences 78 (2012) 1175–1184.

Karan Chawla, Ankit Ramteke, Pushpak Bhattacharya, “IITB-Sentiment-Analysts: Participation in Sentiment Analysis in Twitter SemEval 2013 Taskâ€

L. Barbosa, J. Feng. “Robust Sentiment Detection on Twitter from Biased and Noisy Dataâ€. COLING 2010: Poster Volume, pp. 36-44.

Balamurali A , Aditya Joshi, Pushpak Bhattacharyya, “Robust Sense-Based Sentiment Classificationâ€, Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis, ACL-HLT 2011, pages 132–138,24 June, 2011, Portland, Oregon, USAc 2011 Association for Computational Linguistics.

Y. Choi, and C. Cardie, Learning with compositional semantics as structural inference for subsentential sentiment analysis. Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 793–801, 2008.

Kunpeng Zhang, Yu Cheng, Yusheng Xie, Daniel Honbo Ankit Agrawal, Diana Palsetia, Kathy Lee, Wei-keng Liao, and Alok Choudhary, “SES: Sentiment Elicitation System for Social Media Dataâ€, 2011 11th IEEE International Conference on Data Mining Workshops.

Hassan Saif, Yulan He and Harith Alani, 2012 , “Semantic Sentiment Analysis of Twitterâ€.

George A. Miller. 1995. Wordnet: A lexical database for english. Comsmunications of the ACM, 38:39–41.

N. Cristianini and J. Shawe-Taylor. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods. Cambridge University Press, March 2000.

Pak, A. and Paroubek, P. 2010. Twitter as a Corpus for Sentiment Analysis and Opinion Mining, in 'Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)' , European Language Resources Association(ELRA), Valletta, Malta.

Dekang Lin. 1998. An information-theoretic definition of similarity. In Proc. of the 15th International Conference on Machine Learning,pages 296–304

Satanjeev Banerjee and Ted Pedersen. 2002. An adapted lesk algorithm for word sense isambiguation using wordnet. In Proc. of CICLing’02, pages 136–145, London, UK

Claudia Leacock and Martin Chodorow. 1998. Combining local context with wordnet similarity for word sense identification. In WordNet: A Lexical Reference System and its Application.

Helmut Schmid. 1994. Probabilistic part-of-speech tagging using decision trees.

http://en.wikipedia.org/wiki/Listofemoticons

http://en.wikipedia.org/wiki/Twitter

www.alchemyapi.com

www.zemanta.com

http://twitter.com

http://tumblr.com

http://facebook.com


Refbacks

  • There are currently no refbacks.


ISSN: 1694-2507 (Print)

ISSN: 1694-2108 (Online)