Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Data Science

PDF

Dissertations, Master's Theses and Master's Reports

2021

Machine learning

Articles 1 - 1 of 1

Full-Text Articles in Physical Sciences and Mathematics

Using Text Mining And Machine Learning Classifiers To Analyze Stack Overflow, Taylor Morris Jan 2021

Using Text Mining And Machine Learning Classifiers To Analyze Stack Overflow, Taylor Morris

Dissertations, Master's Theses and Master's Reports

StackOverflow is an extensively used platform for programming questions. In this report, text mining and machine learning classifiers such as decision trees and Naive Bayes are used to evaluate whether a given question posted on StackOverflow will be closed or answered. While multiple models were used in the analysis, the performance for the models was no better than the majority classifier. Future work to develop better performing classifiers to understand why a question is closed or answered will require additional natural language processing or methods to address the imbalanced data.