Abstract
One of the most challenging problems of sentiment analysis on social media is that labelling huge amounts of instances can be very expensive. Active learning has been proposed to overcome this problem and to provide means for choosing the most useful training instances. In this study, we introduce active learning to a framework which is comprised of most popular base and ensemble approaches for sentiment analysis. In addition, the implemented framework contains two ensemble approaches, i.e. a probabilistic algorithm and a derived version of Behavior Knowledge Space (BKS) algorithm. The Shannon Entropy approach was utilized for choosing among training data during active learning process and it was compared with maximum disagreement method and random selection of instances. It was observed that the former method causes better accuracies in less number of iterations. The above methods were tested on Cornell movie review dataset and a popular multi-domain product review dataset.
Original language | English |
---|---|
Pages (from-to) | 311-323 |
Number of pages | 13 |
Journal | Computers and Electrical Engineering |
Volume | 57 |
DOIs | |
Publication status | Published - 1 Jan 2017 |
Bibliographical note
Publisher Copyright:© 2016
Keywords
- Active learning
- Artificial intelligence
- Ensemble learning
- Machine learning
- Sentiment analysis