Abstract
In recent years, analyzing data streams has attracted considerable attention in different fields of computer science. In this paper, two different frameworks, namely MOA and Spark MLlib, are examined for linear regression on streaming data. The focus is placed on determining how well the linear regression techniques implemented in the frameworks that could be used to model the data streams. We also examine the challenges of massive data streams and how MOA and Spark Streaming solve these kinds of challenges. As a result of the experiments, we see that although the usage of MOA is more easier than Spark MLlib, Spark MLlib linear regression performance on streaming data is better.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015 |
Editors | Jian Pei, Jie Tang, Fabrizio Silvestri |
Publisher | Association for Computing Machinery, Inc |
Pages | 1244-1247 |
Number of pages | 4 |
ISBN (Electronic) | 9781450338547 |
DOIs | |
Publication status | Published - 25 Aug 2015 |
Event | IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015 - Paris, France Duration: 25 Aug 2015 → 28 Aug 2015 |
Publication series
Name | Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015 |
---|
Conference
Conference | IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2015 |
---|---|
Country/Territory | France |
City | Paris |
Period | 25/08/15 → 28/08/15 |
Bibliographical note
Publisher Copyright:© 2015 ACM.
Keywords
- Data streams
- MOA
- Spark MLlib
- Spark streaming
- Stream mining
- Streaming linear regression