Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Institution
- Publication Year
Articles 1 - 14 of 14
Full-Text Articles in Physical Sciences and Mathematics
Finite Mixtures Of Mean-Parameterized Conway-Maxwell-Poisson Models, Dongying Zhan
Finite Mixtures Of Mean-Parameterized Conway-Maxwell-Poisson Models, Dongying Zhan
Theses and Dissertations--Statistics
For modeling count data, the Conway-Maxwell-Poisson (CMP) distribution is a popular generalization of the Poisson distribution due to its ability to characterize data over- or under-dispersion. While the classic parameterization of the CMP has been well-studied, its main drawback is that it is does not directly model the mean of the counts. This is mitigated by using a mean-parameterized version of the CMP distribution. In this work, we are concerned with the setting where count data may be comprised of subpopulations, each possibly having varying degrees of data dispersion. Thus, we propose a finite mixture of mean-parameterized CMP distributions. An …
Non-Inferiority Testing: Kernel Estimation And Overlap Measure, Larie C. Ward
Non-Inferiority Testing: Kernel Estimation And Overlap Measure, Larie C. Ward
Electronic Theses and Dissertations
In non-inferiority testing, the decision of whether a proposed treatment is non-inferior to a reference treatment depends on model assumptions and choices of acceptable tolerance limits. Here, we consider a method that employs kernels to estimate the probability density functions of both the experimental and reference populations from two independent samples. Based on these densities, we introduce a quantity called the overlap coefficient or overlap measure. A bootstrap technique is helpful in exploring the distribution and variance empirically. We derive the distribution of this measure and define a hypothesis test that can be applied to the non-inferiority setting under some …
A Flexible Zero-Inflated Poisson Regression Model, Eric S. Roemmele
A Flexible Zero-Inflated Poisson Regression Model, Eric S. Roemmele
Theses and Dissertations--Statistics
A practical problem often encountered with observed count data is the presence of excess zeros. Zero-inflation in count data can easily be handled by zero-inflated models, which is a two-component mixture of a point mass at zero and a discrete distribution for the count data. In the presence of predictors, zero-inflated Poisson (ZIP) regression models are, perhaps, the most commonly used. However, the fully parametric ZIP regression model could sometimes be restrictive, especially with respect to the mixing proportions. Taking inspiration from some of the recent literature on semiparametric mixtures of regressions models for flexible mixture modeling, we propose a …
Evaluation Of Using The Bootstrap Procedure To Estimate The Population Variance, Nghia Trong Nguyen
Evaluation Of Using The Bootstrap Procedure To Estimate The Population Variance, Nghia Trong Nguyen
Electronic Theses and Dissertations
The bootstrap procedure is widely used in nonparametric statistics to generate an empirical sampling distribution from a given sample data set for a statistic of interest. Generally, the results are good for location parameters such as population mean, median, and even for estimating a population correlation. However, the results for a population variance, which is a spread parameter, are not as good due to the resampling nature of the bootstrap method. Bootstrap samples are constructed using sampling with replacement; consequently, groups of observations with zero variance manifest in these samples. As a result, a bootstrap variance estimator will carry a …
Statistical Methodology For Data With Multiple Limits Of Detection, Robert M. Flikkema
Statistical Methodology For Data With Multiple Limits Of Detection, Robert M. Flikkema
Dissertations
Limitations of instruments used to collect continuous data sometimes lead to obtaining observations lower than a limit of detection. These observations are known as nondetects. They could be zeroes, or positive numbers, but they are too small to be recorded by a measuring device. Nondetects frequently occur in environmental data. Trace amounts of chemicals can exist in soil or groundwater and are undetectable by a machine reading. These observations pose a problem to researchers since the true values are unknown.
Simulations in the literature have led to inconsistent conclusions regarding what estimation technique to use with nondetect data when estimating …
Causal Inference In Observational Studies With Clustered Data, Meng Wu
Causal Inference In Observational Studies With Clustered Data, Meng Wu
Legacy Theses & Dissertations (2009 - 2024)
In this thesis, we study causal inference in observational studies with clustered data.
Per-Contact Infectivity Of Hcv Associated With Injection Exposures In A Prospective Cohort Of Young Injection Drug Users In San Francisco, Ca (Ufo Study), Yuridia Leyva
Mathematics & Statistics ETDs
Sharing needles and ancillary injection drug equipment places injection drug users (IDU) at risk for Hepatitis C Virus (HCV), a highly infectious blood-borne virus. A limited number of studies have analyzed the per-contact infectivity of HCV associated with the use of previously-used needles, but per-contact infectivity of ancillary injecting equipment has not been previously investigated. Our goal is to estimate the per-contact infectivity of HCV associated with (1) injecting with another person's previously-used needle, classified as receptive needle sharing (RNS), and (2) using another person's previously-used ancillary injecting equipment, such as cookers to melt drugs and cottons to strain impurities …
Comparing Bootstrap And Jackknife Variance Estimation Methods For Area Under The Roc Curve Using One-Stage Cluster Survey Data, Allison Dunning
Comparing Bootstrap And Jackknife Variance Estimation Methods For Area Under The Roc Curve Using One-Stage Cluster Survey Data, Allison Dunning
Theses and Dissertations
The purpose of this research is to examine the bootstrap and jackknife as methods for estimating the variance of the AUC from a study using a complex sampling design and to determine which characteristics of the sampling design effects this estimation. Data from a one-stage cluster sampling design of 10 clusters was examined. Factors included three true AUCs (.60, .75, and .90), three prevalence levels (50/50, 70/30, 90/10) (non-disease/disease), and finally three number of clusters sampled (2, 5, or 7). A simulated sample was constructed for each of the 27 combinations of AUC, prevalence and number of clusters. Estimates of …
Characterizing The Statistical Properties And Global Distribution Of Dansgaard-Oeschger Events, Andrea Michelle Thomas
Characterizing The Statistical Properties And Global Distribution Of Dansgaard-Oeschger Events, Andrea Michelle Thomas
Theses and Dissertations
Ice core records from Greenland have shown times of rapid warming during the most recent glacial period, called Dansgaard-Oeschger (D-O) events. D-O events are important to our understanding of both past climate systems and modern climate volatility. In this paper, we present new approaches for statistically evaluating the existence of cyclicity in D-O events and the possible lagged correlation between the Greenland and Antarctica temperature records. Specifically, we consider permutation testing and bootstrapping methodologies for assessing the cyclicity of D-O events and the correlation between the Greenland and Antarctica records. We find that there is not enough evidence to conclude …
Assessing Multivariate Heritability Through Nonparametric Methods, Benjamin Alan Carper
Assessing Multivariate Heritability Through Nonparametric Methods, Benjamin Alan Carper
Theses and Dissertations
The similarities between generations of living subjects are often quantified by heritability. By distinguishing genotypic variation, or variation due to parental pairings, from phenotypic variation, or normal intraspecies variation, the heritability of traits can be estimated. Due to the multivariate nature of many traits, such as size and shape, computation of heritability can be difficult. Also, assessment of the variation of the heritability estimate is extremely difficult. This study uses nonparametric methods, namely the randomization test and the bootstrap, to obtain both a measure of the extremity of the observed heritability and an assessment of the uncertainty.
A Method For Finding Standard Error Estimates For Rma Expression Levels Using Bootstrap, Gabriel Nicholas
A Method For Finding Standard Error Estimates For Rma Expression Levels Using Bootstrap, Gabriel Nicholas
All Graduate Plan B and other Reports, Spring 1920 to Spring 2023
Oligonucleotide arrays are used in many applications. Affymetrix GeneChip arrays are widely used. Before researchers can use the information from these arrays, the raw data must be transformed and summarized into a more meaningful and usable form. One of the more popular methods for doing so is RMA (Robust Multi-array Analysis).
A problem with RMA is that the end result (estimated gene expression levels) is based on a fairly complicated process that is unusual. Specifically, there is no closed-form estimate of standard errors for the estimated gene expression levels. The current recommendation is to use a naive estimate for the …
Comparing The Statistical Tests For Homogeneity Of Variances., Zhiqiang Mu
Comparing The Statistical Tests For Homogeneity Of Variances., Zhiqiang Mu
Electronic Theses and Dissertations
Testing the homogeneity of variances is an important problem in many applications since statistical methods of frequent use, such as ANOVA, assume equal variances for two or more groups of data. However, testing the equality of variances is a difficult problem due to the fact that many of the tests are not robust against non-normality. It is known that the kurtosis of the distribution of the source data can affect the performance of the tests for variance. We review the classical tests and their latest, more robust modifications, some other tests that have recently appeared in the literature, and use …
Comparison Of Bootstrap With Other Tests For Several Distributions, Yu-Yu Wong
Comparison Of Bootstrap With Other Tests For Several Distributions, Yu-Yu Wong
All Graduate Theses and Dissertations, Spring 1920 to Summer 2023
This paper discusses results of a computer simulation to investigate several different tests when sampling several distributions. The hypothesis H0: μ=0 was tested against H0: μ≠0, using the usual t-test, trimmed t-test, the Jackkinfe, the Bootstrap and signed-rank test. The p-values and empirical power show that the Bootstrap is as good as the t-test. The Jackknife procedure is too liberal, always obtaining small p-values. The signed-rank is a fairly good test if the data follows the Cauchy Distribution.
Comparison Of Bootstrap And Jacknife Statistical Procedures, Amanuel Gobena
Comparison Of Bootstrap And Jacknife Statistical Procedures, Amanuel Gobena
All Graduate Plan B and other Reports, Spring 1920 to Spring 2023
This report compares the bootstrapping to jacknifing statistical procedures in terms in bias, confidence interval and estimation of median. Related literature have been reviewed. A bootstrap allows a researcher to get an approximation to the distribution of possibly complicated statistical summaries. It is based on random sampling with replacement from experimental units. Jacknife has also been in operation prior to bootstrapping statistical procedure. The jacknife divides the data into subgroups and obtains partial estimates of these subgroups by omitting one subgroup at a time. When both of these statistical resampling procedures are compared the bootstrap has less bias, more accurate …