Open Access. Powered by Scholars. Published by Universities.®

Business Commons

Series

Physical Sciences and Mathematics

Kennesaw State University

2019

Full-Text Articles in Business

A Descriptive Study Of Variable Discretization And Cost-Sensitive Logistic Regression On Imbalanced Credit Data, Lili Zhang, Jennifer Priestley, Herman Ray, Soon Tan, Jul 2019

Published and Grey Literature from PhD Candidates

Training classification models on imbalanced data tends to bias them toward the majority class. In this paper, we demonstrate how variable discretization and cost-sensitive logistic regression help mitigate this bias on an imbalanced credit scoring dataset. We further apply the variable discretization technique to data from other domains, demonstrating its potential as a generic technique for classifying imbalanced data beyond credit scoring. The performance measures include ROC curves, area under the ROC curve (AUC), Type I error, Type II error, accuracy, and F1 score. The results show that proper variable discretization and cost-sensitive logistic regression with …
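
To make the terms in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It shows one common form of a cost-sensitive logistic loss, in which errors on each class are scaled by a misclassification cost, along with several of the listed performance measures computed from a confusion matrix. The function names, the cost parameters `cost_pos` and `cost_neg`, and the convention that Type I error is the false positive rate and Type II error is the false negative rate are all assumptions for illustration; the abstract does not specify them.

```python
import math


def weighted_log_loss(y_true, p_pred, cost_pos=1.0, cost_neg=1.0):
    """Mean logistic loss with per-class costs (hypothetical cost scheme).

    cost_pos scales the penalty on positive (label 1) examples and
    cost_neg scales the penalty on negative (label 0) examples.
    """
    eps = 1e-12  # clip probabilities to avoid log(0)
    total = 0.0
    for y, p in zip(y_true, p_pred):
        p = min(max(p, eps), 1.0 - eps)
        if y == 1:
            total += -cost_pos * math.log(p)
        else:
            total += -cost_neg * math.log(1.0 - p)
    return total / len(y_true)


def _ratio(numerator, denominator):
    """Return numerator / denominator, or 0.0 when the denominator is zero."""
    return numerator / denominator if denominator else 0.0


def confusion_metrics(y_true, y_pred):
    """Compute error rates, accuracy, and F1 from binary labels.

    Assumed convention: Type I error = FP / (FP + TN),
    Type II error = FN / (FN + TP).
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for y, yh in pairs if y == 1 and yh == 1)
    tn = sum(1 for y, yh in pairs if y == 0 and yh == 0)
    fp = sum(1 for y, yh in pairs if y == 0 and yh == 1)
    fn = sum(1 for y, yh in pairs if y == 1 and yh == 0)
    precision = _ratio(tp, tp + fp)
    recall = _ratio(tp, tp + fn)
    return {
        "type_i_error": _ratio(fp, fp + tn),
        "type_ii_error": _ratio(fn, fn + tp),
        "accuracy": _ratio(tp + tn, len(pairs)),
        "f1": _ratio(2 * precision * recall, precision + recall),
    }


# Toy imbalanced labels (illustrative only, not data from the paper).
y_true = [1, 0, 0, 0, 0, 0, 0, 1]
y_pred = [1, 0, 0, 0, 0, 0, 1, 0]
metrics = confusion_metrics(y_true, y_pred)
```

Setting `cost_pos` above `cost_neg` penalizes missed minority-class cases more heavily, which is one way a model can be pushed away from always predicting the majority class. On imbalanced data, accuracy alone can look high even when the Type II error is large, which is why it helps to report the measures together.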