Categorical Data Analysis Commons

Open Access. Powered by Scholars. Published by Universities.

12 Institutions 101 Full-Text Articles 149 Authors 10,431 Downloads

Recent Articles in Categorical Data Analysis

The Net Reclassification Index (Nri): A Misleading Measure Of Prediction Improvement With Miscalibrated Or Overfit Models, Margaret Pepe, Jin Fang, Ziding Feng, Thomas Gerds, Jorgen Hilden COBRA

The Net Reclassification Index (Nri): A Misleading Measure Of Prediction Improvement With Miscalibrated Or Overfit Models, Margaret Pepe, Jin Fang, Ziding Feng, Thomas Gerds, Jorgen Hilden

UW Biostatistics Working Paper Series

The Net Reclassification Index (NRI) is a very popular measure for evaluating the improvement in prediction performance gained by adding a marker to a set of baseline predictors. However, the statistical properties of this novel measure have not been explored in depth. We demonstrate the alarming result that the NRI statistic calculated on a large test dataset using risk models derived from a training set is likely to be positive even when the new marker has no predictive information. A related theoretical example is provided in which a miscalibrated risk model that includes an uninformative marker is proven to erroneously ...


Analysis Of Alcohol Use Among Pregnant Women In San Luis Obispo County, Samantha Law California Polytechnic State University

Analysis Of Alcohol Use Among Pregnant Women In San Luis Obispo County, Samantha Law

Statistics

Drinking alcohol during pregnancy is harmful to the fetus, and can lead to serious alcohol related developmental birth defects. Utilizing prenatal screening, such as the 4P’s Plus© screening tool, during a woman’s first prenatal doctors visit can help educate women and reduce continued alcohol use during pregnancy. Currently the CDC reports that 1 in 13 women in the US drink alcohol while pregnant compared to local reports that 1 in 3 women in San Luis Obispo County continue to drink alcohol during pregnancy. A primary concern for many local county health care experts and organizations is to raise ...


Analysis Of Median Household Income Differences Between Election Day-Vbm And Eip Voters, Mark Salling, Norman Robbins Cleveland State University

Analysis Of Median Household Income Differences Between Election Day-Vbm And Eip Voters, Mark Salling, Norman Robbins

Urban Publications

Analysis of early in-person (EIP) voting in 2008 in Cuyahoga County shows that African-American, white, and Hispanic voters who used EIP voting had significantly lower incomes than members of those same groups who voted on election day or by mail. This result applies to those voting EIP on weekdays, extended weekday hours, weekends, and the three days before election day.


Group Testing Regression Models, Boan Zhang University of Nebraska - Lincoln

Group Testing Regression Models, Boan Zhang

Dissertations and Theses in Statistics

Group testing, where groups of individual specimens are composited to test for the presence or absence of a disease (or some other binary characteristic), is a procedure commonly used to reduce the costs of screening a large number of individuals. Statistical research in group testing has traditionally focused on a homogeneous population, where individuals are assumed to have the same probability of having a disease. However, individuals often have different risks of positivity, so recent research has examined regression models that allow for heterogeneity among individuals within the population. This dissertation focuses on two problems involving group testing regression models ...


Comparative Analysis Of Dispersion Parameter Estimates In Loglinear Modeling: Applied To E-Commerce Sales And Customer Data, Scott Davis California Polytechnic State University

Comparative Analysis Of Dispersion Parameter Estimates In Loglinear Modeling: Applied To E-Commerce Sales And Customer Data, Scott Davis

Statistics

When loglinear models are applied to count data the issue of over-dispersion often arises. Moment and maximum likelihood estimation methods in accounting for over-dispersion are widely used because they allow for model checking tools such as Chi-square, F, and likelihood ratio tests. Here is a comparison between R functions that each uses one method; glm.nb uses MLE, and glm.poisson.disp uses MME. The Index of Dissimilarity and visual model selection (ECDF plots) are also incorporated. These are applied to sales data using product and customer information compiled over the last five years that was generously provided by an ...


Library Technical Services Process Improvement Based On Lean, Richard J. Zwiercan, Cyrus Zarganj Ford, Greg W. Voelker University of Nevada, Las Vegas

Library Technical Services Process Improvement Based On Lean, Richard J. Zwiercan, Cyrus Zarganj Ford, Greg W. Voelker

Presentations (Libraries)

Lean Thinking … is to see and eliminate Muda ‘waste’ – which is essentially any activity in which absorbs resources but creates no value.