Abstract
GitHub is the largest open source software development platform with millions of repositories on variety of topics. The number of stars received by a repository is often considered as a measure of its popularity. Predicting the number of stars of a repository has been associated with the number of forks, commits, followers, documentation size, and programming language in the literature. We extend prior studies in terms of input features and algorithm: We define six features from GitHub events corresponding to the development activities, and additional six features incorporating the influence of users (followers and contributors) on the popularity of projects into their development activities. We propose a time-series based forecast model using Recurrent Neural Networks to predict the number of stars received in consecutive k days. We assess the performance of our proposed model with varying k (1, 7, 14, 30 days) and with varying input features. Our analysis on five topmost starred repositories in data visualization area shows that the error rate ranges between 19.76 and 70.57 among the projects. The best performing models use either features from development activities only, or all metrics including all the features.
Original language | English |
---|---|
Title of host publication | Open Source Systems - 15th IFIP WG 2.13 International Conference, OSS 2019, Proceedings |
Editors | Francis Bordeleau, Alberto Sillitti, Paulo Meirelles, Valentina Lenarduzzi |
Publisher | Springer New York LLC |
Pages | 80-90 |
Number of pages | 11 |
ISBN (Print) | 9783030208820 |
DOIs | |
Publication status | Published - 2019 |
Event | 15th International Conference on Open Source Systems, OSS 2019 - Montreal, Canada Duration: 26 May 2019 → 27 May 2019 |
Publication series
Name | IFIP Advances in Information and Communication Technology |
---|---|
Volume | 556 |
ISSN (Print) | 1868-4238 |
ISSN (Electronic) | 1868-422X |
Conference
Conference | 15th International Conference on Open Source Systems, OSS 2019 |
---|---|
Country/Territory | Canada |
City | Montreal |
Period | 26/05/19 → 27/05/19 |
Bibliographical note
Publisher Copyright:© IFIP International Federation for Information Processing 2019.
Funding
Acknowledgments. This research is supported in part by Scientific Research Projects Division of Istanbul Technical University with project number MGA-2017-40712 and Scientific and Technological Research Council of Turkey with project number 5170048.
Funders | Funder number |
---|---|
Türkiye Bilimsel ve Teknolojik Araştirma Kurumu | 5170048 |
Istanbul Teknik Üniversitesi | MGA-2017-40712 |
Keywords
- Open source projects
- Predicting stars
- Recurrent Neural Networks