Wikipedia based semantic smoothing for twitter sentiment classification

Dilara Torunoglu, Gurkan Telseren, Ozgun Sagturk, Murat C. Ganiz

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

18 Citations (Scopus)

Abstract

Sentiment classification is one of the important and popular application areas for text classification in which texts are labeled as positive and negative. Moreover, Naïve Bayes (NB) is one of the mostly used algorithms in this area. NB having several advantages on lower complexity and simpler training procedure, it suffers from sparsity. Smoothing can be a solution for this problem, mostly Laplace Smoothing is used; however in this paper we propose Wikipedia based semantic smoothing approach. In our study we extend semantic approach by using Wikipedia article titles that exist in training documents, categories and redirects of these articles as topic signatures. Results of the extensive experiments show that our approach improves the performance of NB and even can exceed the accuracy of SVM on Twitter Sentiment 140 dataset.

Original languageEnglish
Title of host publication2013 IEEE International Symposium on Innovations in Intelligent Systems and Applications, IEEE INISTA 2013
DOIs
Publication statusPublished - 2013
Externally publishedYes
Event2013 IEEE International Symposium on Innovations in Intelligent Systems and Applications, IEEE INISTA 2013 - Albena, Bulgaria
Duration: 19 Jun 201321 Jun 2013

Publication series

Name2013 IEEE International Symposium on Innovations in Intelligent Systems and Applications, IEEE INISTA 2013

Conference

Conference2013 IEEE International Symposium on Innovations in Intelligent Systems and Applications, IEEE INISTA 2013
Country/TerritoryBulgaria
CityAlbena
Period19/06/1321/06/13

Keywords

  • semantic smoothing
  • text classification
  • twitter corpus
  • wiki concept
  • wikipedi
  • wikipedia

Fingerprint

Dive into the research topics of 'Wikipedia based semantic smoothing for twitter sentiment classification'. Together they form a unique fingerprint.

Cite this