A comparison study on ensemble strategies and feature sets for sentiment analysis

Deniz Aldogan*, Yusuf Yaslan

*Bu çalışma için yazışmadan sorumlu yazar

Araştırma sonucu: Kitap/Rapor/Konferans Bildirisinde BölümKonferans katkısıbilirkişi

3 Atıf (Scopus)

Özet

This paper is devoted to the comparison of different common base and ensemble classifiers for sentiment classification of reviews. It is also aimed to generate different feature sets and to observe their contribution to the classification accuracy. In detail, these feature sets are formed in an hierarchical manner, which is accomplished by first forming part-of-speech (POS) based word groups and then utilizing feature frequencies, SentiWordNet scores and their combination to obtain feature sets. In addition, several common base classifiers, namely Multinominal Naive Bayes (MNB), Support Vector Machine (SVM), Voted Perceptron (VP), K-Nearest Neighbor (k-NN), as well as common ensemble strategies, Random Forests (RFs), Stacking and Random Subspace (RSS) are each tested on the generated feature sets. Also, the Behavior-Knowledge Space (BKS) method has been derived to be applied on the set of outcomes for different algorithm and feature set combinations. Furthermore, a probability based meta-classifier technique has been tested on this set of outcomes. Finally, Information Gain (IG) feature selection technique has been applied to reduce the feature spaces. The experiments are conducted on a widely used movie review dataset and an equally common multi-domain review dataset. The results indicate that the probabilistic ensemble method generally gives comparatively better results than the other algorithms tested on the chosen datasets and that IG method can be utilized to save computational time while maintaining allowable accuracy.

Orijinal dilİngilizce
Ana bilgisayar yayını başlığıInformation Sciences and Systems 2015 - 30th International Symposium on Computer and Information Sciences, ISCIS 2015
EditörlerOmer H. Abdelrahman, Gokce Gorbil, Ricardo Lent, Erol Gelenbe
YayınlayanSpringer Verlag
Sayfalar359-370
Sayfa sayısı12
ISBN (Basılı)9783319226347
DOI'lar
Yayın durumuYayınlandı - 2016
Etkinlik30th International Symposium on Computer and Information Sciences, ISCIS 2015 - London, United Kingdom
Süre: 21 Eyl 201524 Eyl 2015

Yayın serisi

AdıLecture Notes in Electrical Engineering
Hacim363
ISSN (Basılı)1876-1100
ISSN (Elektronik)1876-1119

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???30th International Symposium on Computer and Information Sciences, ISCIS 2015
Ülke/BölgeUnited Kingdom
ŞehirLondon
Periyot21/09/1524/09/15

Bibliyografik not

Publisher Copyright:
© Springer International Publishing Switzerland 2016.

Parmak izi

A comparison study on ensemble strategies and feature sets for sentiment analysis' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

Alıntı Yap