Mining Github for Factors That Affect Open Source Software Sustainability
DOI:
https://doi.org/10.15379/ijmst.v10i4.1859
Keywords:
Software, Repositories, GitHub, Regression, Data mining, Open Source Software, Sustainability.
Abstract
Open-source software (OSS) is everywhere. Its availability gives software engineering researchers deep insight into the factors that affect a project's success. Some OSS repositories grow in popularity and evolve over time, while others survive for only a couple of months. This study aims to help developers identify the internal factors that affect the sustainability of their software. It proceeds in two steps: first, it identifies the application domain in highest demand; second, it observes the most popular repositories in that domain for about three years to identify the factors that help them survive (sustainability factors). Data mining algorithms (classification and regression) are used to find the GitHub factors that affect the sustainability of OSS.