Permalink home for my Conference presentation at the 2021 Graduate Research School Symposium at AUT, held in Auckland, New Zealand. Hosted by Auckland University of Technology (online) November 26, 2021.
Presently the conference website is hosted on Whova. Full schedule with abstracts can be found here.
Author: Jeff Nijsse (jeff.nijsse@aut.ac.nz)
Department of Computer & Mathematical Sciences, Auckland University of Technology, Auckland, New Zealand
open-source software, GitHub, blockchain, structural equation modelling
GitHub represents the largest open-source software (OSS) community hosting over 100 million repositories by 56 million developers. Since the rise of Bitcoin, blockchain projects have grown dramatically and 84% of the top 200 are hosted on GitHub. Growth inspires many new initiatives and clones, but few projects survive. Identifying healthy projects can assist developers looking to contribute to OSS and researchers seeking innovative teams in addition to reducing fraudulent activity.
This presentation is part of a larger ongoing study investigating OSS blockchain performance and health. The hypothesis to be studied is that the health of an OSS project can be determined through publicly available data. This will be investigated by answering the research question: How can factors that influence the health of OSS be identified? The methodology is adapted from a framework for extracting, processing, and analysing OSS that combines eleven variables into a measurement model. Latent variables are introduced to represent the constructs of community engagement, project robustness, and public interest. Statistical analysis methods are employed both for the development and specification of the measurement model and the construction and evaluation of its structural efficacy. Validation is by two mechanisms: confirmatory factor analysis applied to the measurement model and structural equation modelling to estimate the validity of the relationships.
In the broader study a tool is to be developed to monitor health and highlight areas for innovation. Preliminary findings indicate that community engagement and project interest are positively correlated to robustness, which, in turn, is expected to be a predictor of software project health. A future hypothesis is that healthy projects can be a leading indicator of innovative technology and practice. If OSS communities and projects can be monitored for health, stake holders can identify areas of strength and weakness, extending to industries beyond blockchain.
GitHub_GRS_symposium_slides_Nijsse.pdf
- GitHub, “Empowering Healthy Communities,” 2020. [Online]. Available: https://octoverse.github.com
- M. Goeminne and T. Mens, “Analyzing ecosystems for open source software developer communities,” in Software Ecosystems: Analyzing and Managing Business Networks in the Software Industry, no. 2013, S. Jansen, Ed. 2013, pp. 247–275.
- J. F. Hair Jr., W. C. Black, B. J. Babin, and R. E. Anderson, Multivariate Data Analysis, Seventh. Essex: Pearson Education Limited, 2014.
- Y. Rosseel, "lavaan: An R Package for Structural Equation Modeling", In Journal of Statistical Software (Vol. 48, Issue 2, pp. 1–36). 2012. http://www.jstatsoft.org/v48/i02/
- Y. Rosseel, “Structural equation modeling with lavaan,” Geneve, 2020. https://users.ugent.be/~yrosseel/lavaan/geneve2020/lavaan_oneday_geneve2020.pdf
- J.-F. Schrape, “Open Source Projects as Incubators of Innovation: From Niche Phenomenon to Integral Part of the Software Industry,” SSRN Electron. J., 2017, doi: 10.2139/ssrn.2977352.
- S. Nakamoto, “Bitcoin: A Peer-to-Peer Electronic Cash System,” 2008. https://bitcoin.org/bitcoin.pdf