Physical Sciences and Mathematics | Open Access Articles

Models And Software Development For Interval-Censored Data, Chun Pan Jan 2013

Models And Software Development For Interval-Censored Data, Chun Pan

Theses and Dissertations

Interval-censored time-to-event data occur naturally in studies of diseases where the symptoms are not directly observable, and periodic clinical examinations are required for detection. Due to the lack of well-established procedures, interval-censored data have been conventionally treated as right-censored data, however, this introduces bias at the first place. This dissertation focuses on methodological research and software development for interval-censored data. Specifically, it consists of three projects. The first project is to create an R package for regression analysis and survival curve estimation of interval-censored data based on several published papers by our research team. In the second project, a Bayesian …

Go to article

Protein Identification Using Bayesian Stochastic Search, Christina Nicole Lewis Jan 2013

Protein Identification Using Bayesian Stochastic Search, Christina Nicole Lewis

Theses and Dissertations

Current methods for protein identification in tandem mass spectrometry (MS/MS) involve database searches or de novo peptide sequencing, with database searches being the standard method. With database searches, issues arise when the species is not in the database. Shortcomings of de novo peptide sequencing and database searches include chemical noise, overly complex fragments, and incomplete b and y ion sequences. Here we present a Bayesian approach to identifying peptides. Our model uses prior information about the average relative abundances of bond cleavages and the prior probability of any particular amino acid sequence. The proposed likelihood function is composed of two …

Go to article

A New Method For The Comparison Of Survival Distributions, Jaymie Shanahan Jan 2013

A New Method For The Comparison Of Survival Distributions, Jaymie Shanahan

Theses and Dissertations

The assessment of overall homogeneity of time-to-event curves is a key element in survival analysis in biomedical research. The currently commonly used testing methods, e.g. log-rank test, Wilcoxon test, and Kolmogorov-Smirnov test, may have a significant loss of statistical testing power under certain circumstances. In this thesis we replicate a testing method (Lin & Xu, 2009) that is robust for the comparison of the overall homogeneity of survival curves based on the absolute difference of the area under the survival curves using normal approximation by Greenwood's formula, and propose a new weight component to their test statistic. The weight component …

Go to article

Permutation Testing For Covariance Matrices, With Applications In Shape Analysis, Blake Cassidy Hill Jan 2013

Permutation Testing For Covariance Matrices, With Applications In Shape Analysis, Blake Cassidy Hill

Theses and Dissertations

In many applications, it is of interest to compare covariance structures. In this work, we propose hypothesis tests for comparing covariance matrices for data in different groups, especially in shape analysis. The main motivation for the work is comparing covariance matrices of the size and shapes of damaged versus undamaged DNA molecules. A practical motivation behind analyzing the differences between these DNA covariance matrices is to compare the variation between the two groups during situations where the molecules are repairing. The testing methods proposed in this dissertation consist of three types of permutation testing methods for differences in covariance structures. …

Go to article

Heaped Data In Count Models, Tammy Harris Jan 2013

Heaped Data In Count Models, Tammy Harris

Theses and Dissertations

Heaped data result when subjects who recall the frequency of events prefer for reporting from a limited set of rounded responses or preferred digits over reporting exact counts. These rounded responses and digit preferences (also referred to as data coarsening) could be characterized by reported frequencies (or counts) favoring multiples of 20, reporting counts ending with 0 or 5, or a preference for reporting an even number over an odd number or vice versa. This mixture of values is a type of measurement error (pattern of misreporting) that can lead to biased estimation and imprecision in discrete quantitative data. Sometimes …

Go to article

Estimation And Q-Matrix Validation For Diagnostic Classification Models, Yuling Feng Jan 2013

Estimation And Q-Matrix Validation For Diagnostic Classification Models, Yuling Feng

Theses and Dissertations

Diagnostic classification models (DCMs) are structured latent class models widely discussed in the field of psychometrics. They model subjects' underlying attribute patterns and classify subjects into unobservable groups based on their mastery of attributes required to answer the items correctly. The effective implementation of DCMs depends on correct specification of a Q-matrix which is a binary matrix linking attribute patterns to items. Current literature on assessing the appropriateness of Q-matrix specifications has focused on validation methods for the deterministic-input, noisy-and-gate (DINA) model. The goal of the study is to develop general Q-matrix validation methods that can be applied to a …

Go to article

Advanced Methodology Developments In Mixture Cure Models, Chao Cai Jan 2013

Advanced Methodology Developments In Mixture Cure Models, Chao Cai

Theses and Dissertations

Modern medical treatments have substantially improved cure rates for many chronic diseases and have generated increasing interest in appropriate statistical models to handle survival data with non-negligible cure fractions. The mixture cure models are designed to model such data set, which assume that studied population is a mixture of being cured and uncured. In this dissertation, I will develop two programs named smcure and NPHMC in R. The first program aims to facilitate estimating two popular mixture cure models: the proportional hazards (PH) mixture cure model and accelerated failure time (AFT) mixture cure model. The second program focuses on designing …

Go to article

Modeling Mixed Unfolding/Monotone Dichotomous Item Exams, Na Yang Jan 2013

Modeling Mixed Unfolding/Monotone Dichotomous Item Exams, Na Yang

Theses and Dissertations

Item response theory (IRT) is widely applied to analyze educational and psychological assessments. Readily available IRT implementations allow for two common types of models: monotone models used for dominance scales (Guttman 1950; Rasch 1960/1980; Birnbaum 1968; Mokken 1971) and unfolding models used for proximity scales (Coombs, 1964; Andrich, 1996; Roberts, Donoghue and Laughlin, 2000).

When an exam contains items following both types of models, there is currently no method to distinguish the item types, estimate their characteristics, or estimate the examinee characteristics. Thus, there is no existing methodology to simultaneously analyze items like ``At a minimum, I am in favor …

Go to article

The Complete Plus-Minus: A Case Study Of The Columbus Blue Jackets, Nathan Spagnola Jan 2013

The Complete Plus-Minus: A Case Study Of The Columbus Blue Jackets, Nathan Spagnola

Theses and Dissertations

A new hockey statistic termed the Complete Plus-Minus (CPM) was created to calculate the abilities of hockey players in the National Hockey League (NHL). This new statistic was used to analyze the Columbus Blue Jackets for the 2011-2012 season. The CPM for the Blue Jackets was created using two logistic regressions that modeled a goal being scored for and against the Blue Jackets. Whether a goal was scored for or against the team were the responses, while events on the ice were the predictors in the model. It was found that the team's poor performance was due to a weak …

Go to article

Physical Sciences and Mathematics Commons^™

Full-Text Articles in Physical Sciences and Mathematics

Models And Software Development For Interval-Censored Data, Chun Pan

Theses and Dissertations

Protein Identification Using Bayesian Stochastic Search, Christina Nicole Lewis

Theses and Dissertations

A New Method For The Comparison Of Survival Distributions, Jaymie Shanahan

Theses and Dissertations

Permutation Testing For Covariance Matrices, With Applications In Shape Analysis, Blake Cassidy Hill

Theses and Dissertations

Heaped Data In Count Models, Tammy Harris

Theses and Dissertations

Estimation And Q-Matrix Validation For Diagnostic Classification Models, Yuling Feng

Theses and Dissertations

Advanced Methodology Developments In Mixture Cure Models, Chao Cai

Theses and Dissertations

Modeling Mixed Unfolding/Monotone Dichotomous Item Exams, Na Yang

Theses and Dissertations

The Complete Plus-Minus: A Case Study Of The Columbus Blue Jackets, Nathan Spagnola

Theses and Dissertations