Statistical Methodology | Open Access Articles

The Distribution Of The Significance Level, Paul O. Monnu Jan 2024

The Distribution Of The Significance Level, Paul O. Monnu

Electronic Theses and Dissertations

Reporting the p-value is customary when conducting a test of hypothesis or significance. The likelihood of getting a fictitious second sample and presuming the null hypothesis is correct is the p-value. The significance level is a statistic that interests us to investigate. Being a statistic, it has a distribution. For the F-test in a one-way ANOVA and the t-tests for population means, we define the significance level, its observed value, and the observed significance level. It is possible to derive the significance level distribution. The t-test and the F-test are not without controversy. Specifically, we demonstrate that as sample size …

Go to article

Multiple Imputation Using Influential Exponential Tilting In Case Of Non-Ignorable Missing Data, Kavita Gohil Jan 2020

Multiple Imputation Using Influential Exponential Tilting In Case Of Non-Ignorable Missing Data, Kavita Gohil

Electronic Theses and Dissertations

Modern research strategies rely predominantly on three steps, data collection, data analysis, and inference. In research, if the data is not collected as designed, researchers may face challenges of having incomplete data, especially when it is non-ignorable. These situations affect the subsequent steps of evaluation and make them difficult to perform. Inference with incomplete data is a challenging task in data analysis and clinical trials when missing data related to the condition under the study. Moreover, results obtained from incomplete data are prone to biases. Parameter estimation with non-ignorable missing data is even more challenging to handle and extract useful …

Go to article

Generalization Of Kullback-Leibler Divergence For Multi-Stage Diseases: Application To Diagnostic Test Accuracy And Optimal Cut-Points Selection Criterion, Chen Mo Jan 2020

Generalization Of Kullback-Leibler Divergence For Multi-Stage Diseases: Application To Diagnostic Test Accuracy And Optimal Cut-Points Selection Criterion, Chen Mo

Electronic Theses and Dissertations

The Kullback-Leibler divergence (KL), which captures the disparity between two distributions, has been considered as a measure for determining the diagnostic performance of an ordinal diagnostic test. This study applies KL and further generalizes it to comprehensively measure the diagnostic accuracy test for multi-stage (K > 2) diseases, named generalized total Kullback-Leibler divergence (GTKL). Also, GTKL is proposed as an optimal cut-points selection criterion for discriminating subjects among different disease stages. Moreover, the study investigates a variety of applications of GTKL on measuring the rule-in/out potentials in the single-stage and multi-stage levels. Intensive simulation studies are conducted to compare the performance …

Go to article

Nonparametric Misclassification Simulation And Extrapolation Method And Its Application, Congjian Liu Jan 2020

Nonparametric Misclassification Simulation And Extrapolation Method And Its Application, Congjian Liu

Electronic Theses and Dissertations

The misclassification simulation extrapolation (MC-SIMEX) method proposed by Küchenho et al. is a general method of handling categorical data with measurement error. It consists of two steps, the simulation and extrapolation steps. In the simulation step, it simulates observations with varying degrees of measurement error. Then parameter estimators for varying degrees of measurement error are obtained based on these observations. In the extrapolation step, it uses a parametric extrapolation function to obtain the parameter estimators for data with no measurement error. However, as shown in many studies, the parameter estimators are still biased as a result of the parametric extrapolation …

Go to article

Variable Selection In Accelerated Failure Time (Aft) Frailty Models: An Application Of Penalized Quasi-Likelihood, Sarbesh R. Pandeya Jan 2019

Variable Selection In Accelerated Failure Time (Aft) Frailty Models: An Application Of Penalized Quasi-Likelihood, Sarbesh R. Pandeya

Electronic Theses and Dissertations

Variable selection is one of the standard ways of selecting models in large scale datasets. It has applications in many fields of research study, especially in large multi-center clinical trials. One of the prominent methods in variable selection is the penalized likelihood, which is both consistent and efficient. However, the penalized selection is significantly challenging under the influence of random (frailty) covariates. It is even more complicated when there is involvement of censoring as it may not have a closed-form solution for the marginal log-likelihood. Therefore, we applied the penalized quasi-likelihood (PQL) approach that approximates the solution for such a …

Go to article

Some New And Generalized Distributions Via Exponentiation, Gamma And Marshall-Olkin Generators With Applications, Hameed Abiodun Jimoh Jan 2018

Some New And Generalized Distributions Via Exponentiation, Gamma And Marshall-Olkin Generators With Applications, Hameed Abiodun Jimoh

Electronic Theses and Dissertations

Three new generalized distributions developed via completing risk, gamma generator, Marshall-Olkin generator and exponentiation techniques are proposed and studied. Structural properties including quantile functions, hazard rate functions, moment, conditional moments, mean deviations, R\'enyi entropy, distribution of order statistics and maximum likelihood estimates are presented. Monte Carlo simulation is employed to examine the performance of the proposed distributions. Applications of the generalized distributions to real lifetime data are presented to illustrate the usefulness of the models.

Go to article

Missing Data In Clinical Trial: A Critical Look At The Proportionality Of Mnar And Mar Assumptions For Multiple Imputation, Theophile B. Dipita Jan 2016

Missing Data In Clinical Trial: A Critical Look At The Proportionality Of Mnar And Mar Assumptions For Multiple Imputation, Theophile B. Dipita

Electronic Theses and Dissertations

Randomized control trial is a gold standard of research studies. Randomization helps reduce bias and infer causality. One constraint of these studies is that it depends on participants to obtain the desired data. Whatever the researcher can do, there is a possibility to end up with incomplete data. The problem is more relevant in clinical trials when missing data can be related to the condition under study. The benefits of randomization is compromised by missing data. Multiple imputation is a valid method of treating missing data under the assumption of MAR. Unfortunately this is an unverified assumptions. Current practice advise …

Go to article

Exponentially Weighted Moving Average Charts For Monitoring The Process Generalized Variance, Anna Khamitova Jan 2014

Exponentially Weighted Moving Average Charts For Monitoring The Process Generalized Variance, Anna Khamitova

Electronic Theses and Dissertations

The exponentially weighted moving average chart based on the sample generalized variance is studied under the independent multivariate normal model for the vector of quality measurements. The performance of the chart is based on an analysis of the chart's initial and steady-state run length distributions. The three methods that are commonly used to determinate run length distribution, simulation, the integral equation method, and the Markov chain approximation are discussed. The integral equation and Markov chain approaches are analytical methods that require a nu- merical method for determining the probability density and cumulative distribution functions describing the distribution of the sample …

Go to article

Income Inequality Measures And Statistical Properties Of Weighted Burr-Type And Related Distributions, Meznah R. Al Buqami Jan 2013

Income Inequality Measures And Statistical Properties Of Weighted Burr-Type And Related Distributions, Meznah R. Al Buqami

Electronic Theses and Dissertations

In this thesis, tail conditional expectation (TCE) in risk analysis, an important measure for right-tail risk, is presented. This value is generally based on the quantile of the loss distribution. Explicit formulas of several tail conditional expectations and inequality measures for Dagum-type models are derived. In addition, a new class of weighted Burr-III (WBIII) distribution is presented. The statistical properties of this distribution including hazard and reverse hazard functions, moments, coefficient of variation, skewness, and kurtosis, inequality measures, entropy are derived. Also, Fisher information and maximum likelihood estimates of the model parameters are obtained.

Go to article

Finding A Better Confidence Interval For A Single Regression Changepoint Using Different Bootstrap Confidence Interval Procedures, Bodhipaksha Thilakarathne Oct 2012

Finding A Better Confidence Interval For A Single Regression Changepoint Using Different Bootstrap Confidence Interval Procedures, Bodhipaksha Thilakarathne

Electronic Theses and Dissertations

Recently a number of papers have been published in the area of regression changepoints but there is not much literature concerning confidence intervals for regression changepoints. The purpose of this paper is to find a better bootstrap confidence interval for a single regression changepoint. ("Better" confidence interval means having a minimum length and coverage probability which is close to a chosen significance level). Several methods will be used to find bootstrap confidence intervals. Among those methods a better confidence interval will be presented.

Go to article

Statistical Methodology Commons^™

Full-Text Articles in Statistical Methodology

The Distribution Of The Significance Level, Paul O. Monnu

Electronic Theses and Dissertations

Multiple Imputation Using Influential Exponential Tilting In Case Of Non-Ignorable Missing Data, Kavita Gohil

Electronic Theses and Dissertations

Generalization Of Kullback-Leibler Divergence For Multi-Stage Diseases: Application To Diagnostic Test Accuracy And Optimal Cut-Points Selection Criterion, Chen Mo

Electronic Theses and Dissertations

Nonparametric Misclassification Simulation And Extrapolation Method And Its Application, Congjian Liu

Electronic Theses and Dissertations

Variable Selection In Accelerated Failure Time (Aft) Frailty Models: An Application Of Penalized Quasi-Likelihood, Sarbesh R. Pandeya

Electronic Theses and Dissertations

Some New And Generalized Distributions Via Exponentiation, Gamma And Marshall-Olkin Generators With Applications, Hameed Abiodun Jimoh

Electronic Theses and Dissertations

Missing Data In Clinical Trial: A Critical Look At The Proportionality Of Mnar And Mar Assumptions For Multiple Imputation, Theophile B. Dipita

Electronic Theses and Dissertations

Exponentially Weighted Moving Average Charts For Monitoring The Process Generalized Variance, Anna Khamitova

Electronic Theses and Dissertations

Income Inequality Measures And Statistical Properties Of Weighted Burr-Type And Related Distributions, Meznah R. Al Buqami

Electronic Theses and Dissertations

Finding A Better Confidence Interval For A Single Regression Changepoint Using Different Bootstrap Confidence Interval Procedures, Bodhipaksha Thilakarathne

Electronic Theses and Dissertations