Open Access. Powered by Scholars. Published by Universities.®

Categorical Data Analysis Commons

Open Access. Powered by Scholars. Published by Universities.®

448 Full-Text Articles 668 Authors 259,886 Downloads 101 Institutions

All Articles in Categorical Data Analysis

Faceted Search

448 full-text articles. Page 16 of 18.

Analysis Of Alcohol Use Among Pregnant Women In San Luis Obispo County, Samantha Law 2012 California Polytechnic State University, San Luis Obispo

Analysis Of Alcohol Use Among Pregnant Women In San Luis Obispo County, Samantha Law

Statistics

Drinking alcohol during pregnancy is harmful to the fetus, and can lead to serious alcohol related developmental birth defects. Utilizing prenatal screening, such as the 4P’s Plus© screening tool, during a woman’s first prenatal doctors visit can help educate women and reduce continued alcohol use during pregnancy. Currently the CDC reports that 1 in 13 women in the US drink alcohol while pregnant compared to local reports that 1 in 3 women in San Luis Obispo County continue to drink alcohol during pregnancy. A primary concern for many local county health care experts and organizations is to raise awareness that …


An Analysis Of Risk Reduction Choices In Dcis Breast Cancer Patients, Lauren Soltesz 2012 California Polytechnic State University, San Luis Obispo

An Analysis Of Risk Reduction Choices In Dcis Breast Cancer Patients, Lauren Soltesz

Statistics

The main focus of this paper was to evaluate possible demographic and clinical characteristics associated with a woman’s choice of breast conserving surgery (BCS), unilateral mastectomy (ULM), or bilateral risk reduction mastectomy (BRRM). The cohort consisted of patients presenting to the City of Hope National Medical Center with ductal carcinoma in situ breast cancer who elected to have cancer directed surgery (N=305). Analyses to examine associations of patient characteristics with type of surgery were conducted using a multinomial logistic regression. Results showed that older women were more likely to choose breast conserving surgery over bilateral risk reduction mastectomy than younger …


Group Testing Regression Models, Boan Zhang 2012 University of Nebraska-Lincoln

Group Testing Regression Models, Boan Zhang

Department of Statistics: Dissertations, Theses, and Student Work

Group testing, where groups of individual specimens are composited to test for the presence or absence of a disease (or some other binary characteristic), is a procedure commonly used to reduce the costs of screening a large number of individuals. Statistical research in group testing has traditionally focused on a homogeneous population, where individuals are assumed to have the same probability of having a disease. However, individuals often have different risks of positivity, so recent research has examined regression models that allow for heterogeneity among individuals within the population. This dissertation focuses on two problems involving group testing regression models. …


Comparative Analysis Of Dispersion Parameter Estimates In Loglinear Modeling: Applied To E-Commerce Sales And Customer Data, Scott Davis 2012 California Polytechnic State University, San Luis Obispo

Comparative Analysis Of Dispersion Parameter Estimates In Loglinear Modeling: Applied To E-Commerce Sales And Customer Data, Scott Davis

Statistics

When loglinear models are applied to count data the issue of over-dispersion often arises. Moment and maximum likelihood estimation methods in accounting for over-dispersion are widely used because they allow for model checking tools such as Chi-square, F, and likelihood ratio tests. Here is a comparison between R functions that each uses one method; glm.nb uses MLE, and glm.poisson.disp uses MME. The Index of Dissimilarity and visual model selection (ECDF plots) are also incorporated. These are applied to sales data using product and customer information compiled over the last five years that was generously provided by an e-commerce company.


Analysis Of Median Household Income Differences Between Election Day-Vbm And Eip Voters, Mark Salling, Norman Robbins 2012 Cleveland State University

Analysis Of Median Household Income Differences Between Election Day-Vbm And Eip Voters, Mark Salling, Norman Robbins

All Maxine Goodman Levin School of Urban Affairs Publications

Analysis of early in-person (EIP) voting in 2008 in Cuyahoga County shows that African-American, white, and Hispanic voters who used EIP voting had significantly lower incomes than members of those same groups who voted on election day or by mail. This result applies to those voting EIP on weekdays, extended weekday hours, weekends, and the three days before election day.


Adventures In Library Salary Surveys, Scott L. Schaffer 2012 University of Vermont

Adventures In Library Salary Surveys, Scott L. Schaffer

UVM Libraries Conference Day

Salary surveys are an important tool for the library community and the administrators and boards responsible for the oversight of libraries. However, such assessments must be constructed and analyzed with great care. The Vermont Library Association Personnel Committee has conducted three salary surveys over the past several years, one focusing on academic libraries and two on public libraries. Significant issues have included confidentiality, participation rate, definitions, length and difficulty of questions, collection of data, and representativeness. Suggestions and lessons learned will be shared.


Bailey/Howe Reference Analytics: What Two Years Of Data Tell Us, Elizabeth Berman 2012 University of Vermont, Bailey/Howe Library

Bailey/Howe Reference Analytics: What Two Years Of Data Tell Us, Elizabeth Berman

UVM Libraries Conference Day

Analyzing the last two academic years (2010-2011 and 2011-2012) of reference-desk statistics, this presentation will highlight trends at the Bailey/Howe Reference Desk, and offer scenarios for the future of reference services.


Investigation Of Trends And Predictive Effectiveness Of Crash Severity Models, James E. Mooradian 2012 University of Connecticut

Investigation Of Trends And Predictive Effectiveness Of Crash Severity Models, James E. Mooradian

Master's Theses

This thesis describes analysis using ordinal logistic regression to uncover temporal patterns in the severity level (fatal, serious injury, minor injury, slight injury or no injury) for persons involved in highway crashes in Connecticut, focusing on the demographic split between senior travelers (65 years and over) and non-senior travelers. Existing state sources provide data describing the time and weather conditions for each crash and the vehicles and persons involved over the time period from 1995 to 2009 as well as the traffic volumes and the characteristics of the roads on which these crashes occurred. Findings indicate an overall increase in …


Computing Highly Accurate Or Exact P-Values Using Importance Sampling, Chris Lloyd 2012 Melbourne Business School

Computing Highly Accurate Or Exact P-Values Using Importance Sampling, Chris Lloyd

Chris J. Lloyd

Especially for discrete data, standard first order P-values can suffer from poor accuracy, even for quite large sample sizes. Moreover, different test statistics can give practically different results. There are several approaches to computing P-values which do not suffer these defects, such as parametric bootstrap P-values or the partially maximised P-values of Berger and Boos (1994).

Both these methods require computing the exact tail probability of the approximate P-value as a function of the nuisance parameter/s, known as the significance profile. For most practical problems, this is not computationally feasible. I develop an importance sampling approach to this problem. A …


Using The R Library Rpanel For Gui-Based Simulations In Introductory Statistics Courses, Ryan M. Allison 2012 California Polytechnic State University, San Luis Obispo

Using The R Library Rpanel For Gui-Based Simulations In Introductory Statistics Courses, Ryan M. Allison

Statistics

As a student, I noticed that the statistical package R (http://www.r-project.org) would have several benefits of its usage in the classroom. One benefit to the package is its free and open-source nature. This would be a great benefit for instructors and students alike since it would be of no cost to use, unlike other statistical packages. Due to this, students could continue using the program after their statistical courses and into their professional careers. It would be good to expose students while they are in school to a tool that professionals use in industry. R also has powerful …


Analyzing Financial Data Through Interactive Visualization, Fizi Yadav 2012 Purdue University

Analyzing Financial Data Through Interactive Visualization, Fizi Yadav

Purdue Polytechnic Masters Theses

This investigation explored the role data visualization in the scientific community and its effect on the cognitive ability of an individual. The research attempted to answer the question, “ Whether appropriate data visualization helps in better understanding of corporate financial data?” and used quantitative methods to gain an insight. Participants in the study were divided into two groups of 30 each, with one group receiving the treatment devised for the research and the other using the common prevalent method. Both the groups were subjected to a test, based upon the analysis of which a conclusion was derived. The subjects were …


An Annotated Bibliography Of Methods For Analyzing Correlated Categorical Data, Mark Ashby, John Neuhaus, Walter Hauck, Peter Bacchetti, David Heilbron, Nicholas Jewell, Mark Segal, Robert Fusaro 2012 Genentech, South San Francisco, CA

An Annotated Bibliography Of Methods For Analyzing Correlated Categorical Data, Mark Ashby, John Neuhaus, Walter Hauck, Peter Bacchetti, David Heilbron, Nicholas Jewell, Mark Segal, Robert Fusaro

Mark R Segal

This paper provides an annotated bibliography of over 100 articles concerning methods for analyzing correlated categorical response data. Most of the papers listed here concern categorical regression models and estimation, with particular emphasis on binary responses. The papers are classified by several characteristics which group them according to common themes. The bibliography serves as a reference of methods for analysts of correlated categorical data, as well as for persons interested in methodologic work in this active area of statistical research.


R Code: A Non-Iterative Implementation Of Tango's Score Confidence Interval For A Paired Difference Of Proportions, Zhao Yang 2012 Quintiles Inc

R Code: A Non-Iterative Implementation Of Tango's Score Confidence Interval For A Paired Difference Of Proportions, Zhao Yang

Zhao (Tony) Yang, Ph.D.

For matched-pair binary data, a variety of approaches have been proposed for the construction of a confidence interval (CI) for the difference of marginal probabilities between two procedures. The score-based approximate CI has been shown to outperform other asymptotic CIs. Tango’s method provides a score CI by inverting a score test statistic using an iterative procedure. In the developed R code, we propose an efficient non-iterative method with closed-form expression to calculate Tango’s CIs. Examples illustrate the practical application of the new approach.


The Bivariate Rank-Based Concordance Index For Ordinal And Tied Data, Emanuela Raffinetti, Pier Alda Ferrari 2012 department of economics, business and statistics

The Bivariate Rank-Based Concordance Index For Ordinal And Tied Data, Emanuela Raffinetti, Pier Alda Ferrari

Emanuela Raffinetti

No abstract provided.


Proportional Mean Residual Life Model For Right-Censored Length-Biased Data, Gary KWUN CHUEN Chan, Ying Qing Chen, Chongzhi Di 2012 University of Washington

Proportional Mean Residual Life Model For Right-Censored Length-Biased Data, Gary Kwun Chuen Chan, Ying Qing Chen, Chongzhi Di

Chongzhi Di

To study disease association with risk factors in epidemiologic studies, cross-sectional sampling is often more focused and less costly for recruiting study subjects who have already experienced initiating events. For time-to-event outcome, however, such a sampling strategy may be length-biased. Coupled with censoring, analysis of length-biased data can be quite challenging, due to the so-called “induced informative censoring” in which the survival time and censoring time are correlated through a common backward recurrence time. We propose to use the proportional mean residual life model of Oakes and Dasu (1990) for analysis of censored length-biased survival data. Several nonstandard data structures, …


Racial And Ethnic Proportions Of Early In-Person Voters In Cuyahoga County, General Election 2008, And Implications For 2012, Norman Robbins, Mark Salling 2012 Cleveland State University

Racial And Ethnic Proportions Of Early In-Person Voters In Cuyahoga County, General Election 2008, And Implications For 2012, Norman Robbins, Mark Salling

All Maxine Goodman Levin School of Urban Affairs Publications

No abstract provided.


Using Etf Data To Monitor Systemic Risk, Melissa A. Siemer, Brett Amidan 2012 Sonoma State University

Using Etf Data To Monitor Systemic Risk, Melissa A. Siemer, Brett Amidan

STAR Program Research Presentations

In 2010, President Obama signed The Dodd-Frank Wall Street Reform and Consumer Protection Act, which requires financially based government agencies and 35 major US banks to monitor systemic risk. This was in response to the recent near financial collapse due to major mistakes made by organizations generally considered "too big to fail". Systemic risk is the risk of the collapse of an entire financial system or market. It refers to the risks imposed by interdependencies in a system or market, where the failure of a single entity or cluster of entities can cause a cascading failure. In April 2011, the …


Demographic And Socioeconomic Conditions And A Patron Borrowing Analysis Of Cleveland Public Library Branch And Main Libraries, Mark Salling 2012 Cleveland State University

Demographic And Socioeconomic Conditions And A Patron Borrowing Analysis Of Cleveland Public Library Branch And Main Libraries, Mark Salling

All Maxine Goodman Levin School of Urban Affairs Publications

We provide here an analysis of the demographic and socioeconomic characteristics of the Cleveland Public Library’s (CPL) service area and that of the neighborhoods in which the library’s patrons live. We also describe the borrowing patterns for the branch and downtown, Main Library, locations. The census-based demographic and socioeconomic data used for the analysis include income, number of children, race, Hispanic ethnicity, language spoken at home, ability to speak English, public-versus-private school attendance by grade level, housing tenure (owner/renter), educational attainment, employment status, and place of employment (Cleveland versus other). Data from the 2010 census and the Census Bureau’s 2005-2009 …


Alternatives To Mixture Model Analysis Of Correlated Binomial Data, N. Rao Chaganty, Roy Sabo, Yihao Deng 2012 Old Dominion University

Alternatives To Mixture Model Analysis Of Correlated Binomial Data, N. Rao Chaganty, Roy Sabo, Yihao Deng

Mathematics & Statistics Faculty Publications

While univariate instances of binomial data are readily handled with generalized linear models, cases of multivariate or repeated measure binomial data are complicated by the possibility of correlated responses. Likelihood-based estimation can be applied by using mixture distribution models, though this approach can present computational challenges. The logistic transformation can be used to bypass these concerns and allow for alternative estimating procedures. One popular alternative is the generalized estimating equation (GEE) method, though systematic errors can lead to infeasible correlation estimates or nonconvergence problems. Our approach is the coupling of quasileast squares (QLSs) method with a rarely used matrix factorization, …


A Quality Improvement Initiative Using A Novel Travel Survey To Define High-Risk International Travel And Promote Patient-Centered Counseling, Craig A. Mackaness DO, Mark Knouse MD, Suzanne J. Templer DO, Deepti Verma MD, Allison Osborne BA, Michael J. Weiss MPH 2012 Lehigh Valley Health Network

A Quality Improvement Initiative Using A Novel Travel Survey To Define High-Risk International Travel And Promote Patient-Centered Counseling, Craig A. Mackaness Do, Mark Knouse Md, Suzanne J. Templer Do, Deepti Verma Md, Allison Osborne Ba, Michael J. Weiss Mph

Department of Medicine

No abstract provided.


Digital Commons powered by bepress