Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Wayne State University

Journal

Data mining

Publication Year

Articles 1 - 4 of 4

Full-Text Articles in Physical Sciences and Mathematics

The Probit Link Function In Generalized Linear Models For Data Mining Applications, Mehdi Razzaghi May 2013

The Probit Link Function In Generalized Linear Models For Data Mining Applications, Mehdi Razzaghi

Journal of Modern Applied Statistical Methods

The use of logistic regression for outcome classification of dichotomous variables is well known in data mining applications. The estimated probability of the logit transformation belongs to the class of canonical link functions that follow from particular probability distribution functions. A closely related model is the probit link which can be used for binary responses. Although the probit link is not canonical, in some cases the overall fit of the model can be improved by using non-canonical link functions. This article reviews the properties of the probit link function and discusses its applications in data mining problems. Contrasts and comparisons …


Data Mining Ceo Compensation, Susan M. Adams, Atul Gupta, Dominique M. Haughton, John D. Leeth Nov 2008

Data Mining Ceo Compensation, Susan M. Adams, Atul Gupta, Dominique M. Haughton, John D. Leeth

Journal of Modern Applied Statistical Methods

The need to pre-specify expected interactions between variables is an issue in multiple regression. Theoretical and practical considerations make it impossible to pre-specify all possible interactions. The functional form of the dependent variable on the predictors is unknown in many cases. Two ways are described in which the data mining technique Multivariate Adaptive Regression Splines (MARS) can be utilized: first, to obtain possible improvements in model specification, and second, to test for the robustness of findings from a regression analysis. An empirical illustration is provided to show how MARS can be used for both purposes.


An Exploration Of Using Data Mining In Educational Research, Yonghong Jade Xu May 2005

An Exploration Of Using Data Mining In Educational Research, Yonghong Jade Xu

Journal of Modern Applied Statistical Methods

Technology advances popularized large databases in education. Traditional statistics have limitations for analyzing large quantities of data. This article discusses data mining by analyzing a data set with three models: multiple regression, data mining, and a combination of the two. It is concluded that data mining is applicable in educational research.


Shifting Goals And Mounting Challenges For Statistical Methodology, Pranab K. Sen May 2002

Shifting Goals And Mounting Challenges For Statistical Methodology, Pranab K. Sen

Journal of Modern Applied Statistical Methods

Modern interdisciplinary research in statistical science encompasses a wide field: agriculture, biology, biomedical sciences along with bioinformatics, clinical sciences, education, environmental and public health disciplines, genomic science, industry, molecular genetics, socio-behavior, socio-economics, toxicology, and a variety of other disciplines. Statistical science has historically had mathematical perspectives dominating theoretical and methodological developments. Yet, the advent of modern information technology has opened the doors for highly computation intensive statistical tools (i.e., software), wherein mathematical aspects are often de-emphasized. Knowledge discovery and data mining (KDDM) is now becoming a dominating force, with bioinformatics as a notable example. In view of this apparent discordance …