Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

2012

Brigham Young University

Software engineering

Articles 1 - 2 of 2

Full-Text Articles in Physical Sciences and Mathematics

Analysis And Characterization Of Author Contribution Patterns In Open Source Software Development, Quinn Carlson Taylor Mar 2012

Analysis And Characterization Of Author Contribution Patterns In Open Source Software Development, Quinn Carlson Taylor

Theses and Dissertations

Software development is a process fraught with unpredictability, in part because software is created by people. Human interactions add complexity to development processes, and collaborative development can become a liability if not properly understood and managed. Recent years have seen an increase in the use of data mining techniques on publicly-available repository data with the goal of improving software development processes, and by extension, software quality. In this thesis, we introduce the concept of author entropy as a metric for quantifying interaction and collaboration (both within individual files and across projects), present results from two empirical observational studies of open-source …


Commit Patterns And Threats To Validity In Analysis Of Open Source Software Repositories, Alexander Curtis Maclean Feb 2012

Commit Patterns And Threats To Validity In Analysis Of Open Source Software Repositories, Alexander Curtis Maclean

Theses and Dissertations

In the course of studying the effects of programming in multiple languages, we unearthed troubling trends in SourceForge artifacts. Our initial studies suggest that programming in multiple languages concurrently negatively affects developer productivity. While addressing our initial question of interest, we discovered a pattern of monolithic commits in the SourceForge community. Consequently, we also report on the effects that this pattern of commits can have when using SourceForge as a data-source for temporal analysis of open source projects or for studies of individual developers.