Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Full-Text Articles in Physical Sciences and Mathematics
Multicollinearity Applied Stepwise Stochastic Imputation: A Large Dataset Imputation Through Correlation‑Based Regression, Benjamin D. Leiby, Darryl K. Ahner
Faculty Publications
This paper presents a stochastic imputation approach for large datasets that uses a correlation selection methodology for cases where preferred commercial packages struggle to iterate due to numerical problems. A proposed guard rail modification, based on each variable's range, improves the convergence rate of data elements while increasing confidence in the plausibility of the imputations. A large country conflict dataset motivates the need to impute missing values where missingness is well above the common 20% threshold. The Multicollinearity Applied Stepwise Stochastic Imputation methodology (MASS-impute) capitalizes on correlation between variables within the dataset and uses model residuals to estimate unknown values. Examination of the …
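The general idea in the abstract (pick predictors by correlation, regress, add a residual draw, and keep imputations inside a variable's range) can be illustrated with a minimal sketch. This is not the authors' MASS-impute algorithm: the function name `stochastic_impute`, the use of a single most-correlated predictor, the simple linear fit, the resampling of empirical residuals, and the clipping to the observed minimum and maximum are all assumptions made for illustration.

```python
import numpy as np

def stochastic_impute(X, rng=None):
    """Hypothetical sketch: fill NaNs column by column using the single
    most-correlated other column, plus a randomly drawn model residual."""
    rng = np.random.default_rng(rng)
    X = np.asarray(X, dtype=float)
    out = X.copy()
    n_cols = X.shape[1]

    for j in range(n_cols):
        miss = np.isnan(X[:, j])
        if not miss.any():
            continue

        # Assumption: choose the predictor with the largest absolute
        # pairwise correlation on rows where both columns are observed.
        best, best_r = None, 0.0
        for k in range(n_cols):
            if k == j:
                continue
            both = ~np.isnan(X[:, j]) & ~np.isnan(X[:, k])
            if both.sum() < 3:
                continue
            if np.std(X[both, j]) == 0 or np.std(X[both, k]) == 0:
                continue
            r = abs(np.corrcoef(X[both, j], X[both, k])[0, 1])
            if r > best_r:
                best, best_r = k, r
        if best is None:
            continue  # no usable predictor; leave the column as-is

        # Fit a simple linear regression on the complete pairs.
        both = ~np.isnan(X[:, j]) & ~np.isnan(X[:, best])
        slope, intercept = np.polyfit(X[both, best], X[both, j], 1)
        resid = X[both, j] - (slope * X[both, best] + intercept)

        # Only rows where the predictor itself is observed can be filled.
        fill = miss & ~np.isnan(X[:, best])
        if not fill.any():
            continue

        # Stochastic step: prediction plus a resampled empirical residual.
        draws = rng.choice(resid, size=int(fill.sum()))
        pred = slope * X[fill, best] + intercept + draws

        # Assumed "guard rail": clip to the observed range of the variable.
        lo, hi = np.nanmin(X[:, j]), np.nanmax(X[:, j])
        out[fill, j] = np.clip(pred, lo, hi)

    return out
```

Calling `stochastic_impute(data, rng=0)` on a two-dimensional array with `np.nan` marking missing entries returns a copy with the fillable entries imputed; observed values are left unchanged. A single pass is shown here, whereas the abstract's mention of convergence suggests the published method iterates.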