Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 20 of 20

Full-Text Articles in Statistics and Probability

Compound Sums, Their Distributions, And Actuarial Pricing, Ang Li Oct 2021

Compound Sums, Their Distributions, And Actuarial Pricing, Ang Li

Electronic Thesis and Dissertation Repository

Compound risk models are widely used in insurance companies to mathematically describe their aggregate amount of losses during certain time period. However, evaluation of the distribution of compound random variables and the computation of the relevant risk measures are non-trivial. Therefore, the main purpose of this thesis is to study the bounds and simulation methods for both univariate and multivariate compound distributions. The premium setting principles related to dependent multivariate compound distributions are studied. .

In the first part of this thesis, we consider the upper and lower bounds of the tail of bivariate compound distributions. Our results extend those …


Can Auxiliary Information Improve Rasch Estimation At Small Sample Sizes?, Derek Sauder May 2020

Can Auxiliary Information Improve Rasch Estimation At Small Sample Sizes?, Derek Sauder

Dissertations, 2020-current

The Rasch model is commonly used to calibrate multiple choice items. However, the sample sizes needed to estimate the Rasch model can be difficult to attain (e.g., consider a small testing company trying to pretest new items). With small sample sizes, auxiliary information besides the item responses may improve estimation of the item parameters. The purpose of this study was to determine if incorporating item property information (i.e., characteristics of the items related to item difficulty) in a random effects linear logistic test model (RE-LLTM) would improve estimation of item difficulty. A simulation study was conducted that varied sample size, …


Propensity Score Matching And Generalized Boosted Modeling In The Context Of Model Misspecification: A Simulation Study, Briana G. Craig May 2020

Propensity Score Matching And Generalized Boosted Modeling In The Context Of Model Misspecification: A Simulation Study, Briana G. Craig

Masters Theses, 2020-current

In the absence of random assignment, researchers must consider the impact of selection bias – pre-existing covariate differences between groups due to differences among those entering into treatment and those otherwise unable to participate. Propensity score matching (PSM) and generalized boosted modeling (GBM) are two quasi-experimental pre-processing methods that strive to reduce the impact of selection bias before analyzing a treatment effect. PSM and GBM both examine a treatment and comparison group and either match or weight members of those groups to create new, balanced groups. The new, balanced groups theoretically can then be used as a proxy for the …


Exploring The Variance Of The Sample Variance Through Estimation And Simulation, Christina Stradwick Jan 2019

Exploring The Variance Of The Sample Variance Through Estimation And Simulation, Christina Stradwick

Theses, Dissertations and Capstones

In this thesis, we examine properties of the variance of the sample variance, which we will denote V (S 2 ). We derive a formula for this variance and show that it only depends on the sample size, variance, and kurtosis of the underlying distribution. We also derive the maximum likelihood estimators for this parameter, Vˆ (S 2 ), under the normal, exponential, Bernoulli, and Poisson distributions and end the thesis with simulations demonstrating the distributions of these estimators.


Predictions Generated From A Simulation Engine For Gene Expression Micro-Arrays For Use In Research Laboratories, Gopinath R. Mavankal, John Blevins, Dominique Edwards, Monnie Mcgee, Andrew Hardin Jul 2018

Predictions Generated From A Simulation Engine For Gene Expression Micro-Arrays For Use In Research Laboratories, Gopinath R. Mavankal, John Blevins, Dominique Edwards, Monnie Mcgee, Andrew Hardin

SMU Data Science Review

In this paper we introduce the technical components, the biology and data science involved in the use of microarray technology in biological and clinical research. We discuss how laborious experimental protocols involved in obtaining this data used in laboratories could benefit from using simulations of the data. We discuss the approach used in the simulation engine from [7]. We use this simulation engine to generate a prediction tool in Power BI, a Microsoft, business intelligence tool for analytics and data visualization [22]. This tool could be used in any laboratory using micro-arrays to improve experimental design by comparing how predicted …


A Comparison Of Some Confidence Intervals For Estimating The Kurtosis Parameter, Guensley Jerome Jun 2017

A Comparison Of Some Confidence Intervals For Estimating The Kurtosis Parameter, Guensley Jerome

FIU Electronic Theses and Dissertations

Several methods have been proposed to estimate the kurtosis of a distribution. The three common estimators are: g2, G2 and b2. This thesis addressed the performance of these estimators by comparing them under the same simulation environments and conditions. The performance of these estimators are compared through confidence intervals by determining the average width and probabilities of capturing the kurtosis parameter of a distribution. We considered and compared classical and non-parametric methods in constructing these intervals. Classical method assumes normality to construct the confidence intervals while the non-parametric methods rely on bootstrap techniques. The bootstrap …


On Some Test Statistics For Testing The Population Skewness And Kurtosis: An Empirical Study, Yawen Guo Aug 2016

On Some Test Statistics For Testing The Population Skewness And Kurtosis: An Empirical Study, Yawen Guo

FIU Electronic Theses and Dissertations

The purpose of this thesis is to propose some test statistics for testing the skewness and kurtosis parameters of a distribution, not limited to a normal distribution. Since a theoretical comparison is not possible, a simulation study has been conducted to compare the performance of the test statistics. We have compared both parametric methods (classical method with normality assumption) and non-parametric methods (bootstrap in Bias Corrected Standard Method, Efron’s Percentile Method, Hall’s Percentile Method and Bias Corrected Percentile Method). Our simulation results for testing the skewness parameter indicate that the power of the tests differs significantly across sample sizes, the …


Design & Analysis Of A Computer Experiment For An Aerospace Conformance Simulation Study, Ryan W. Gryder Jan 2016

Design & Analysis Of A Computer Experiment For An Aerospace Conformance Simulation Study, Ryan W. Gryder

Theses and Dissertations

Within NASA's Air Traffic Management Technology Demonstration # 1 (ATD-1), Interval Management (IM) is a flight deck tool that enables pilots to achieve or maintain a precise in-trail spacing behind a target aircraft. Previous research has shown that violations of aircraft spacing requirements can occur between an IM aircraft and its surrounding non-IM aircraft when it is following a target on a separate route. This research focused on the experimental design and analysis of a deterministic computer simulation which models our airspace configuration of interest. Using an original space-filling design and Gaussian process modeling, we found that aircraft delay assignments …


Global Network Inference From Ego Network Samples: Testing A Simulation Approach, Jeffrey A. Smith Apr 2015

Global Network Inference From Ego Network Samples: Testing A Simulation Approach, Jeffrey A. Smith

Department of Sociology: Faculty Publications

Network sampling poses a radical idea: that it is possible to measure global network structure without the full population coverage assumed in most network studies. Network sampling is only useful, however, if a researcher can produce accurate global network estimates. This article explores the practicality of making network inference, focusing on the approach introduced in Smith (2012). The method uses sampled ego network data and simulation techniques to make inference about the global features of the true, unknown network. The validity check here includes more difficult scenarios than previous tests, including those that go beyond the initial scope conditions of …


Exponentially Weighted Moving Average Charts For Monitoring The Process Generalized Variance, Anna Khamitova Jan 2014

Exponentially Weighted Moving Average Charts For Monitoring The Process Generalized Variance, Anna Khamitova

Electronic Theses and Dissertations

The exponentially weighted moving average chart based on the sample generalized variance is studied under the independent multivariate normal model for the vector of quality measurements. The performance of the chart is based on an analysis of the chart's initial and steady-state run length distributions. The three methods that are commonly used to determinate run length distribution, simulation, the integral equation method, and the Markov chain approximation are discussed. The integral equation and Markov chain approaches are analytical methods that require a nu- merical method for determining the probability density and cumulative distribution functions describing the distribution of the sample …


Simulating Non-Normal Distributions With Specified L-Moments And L-Correlations, Todd C. Headrick, Mohan D. Pant May 2012

Simulating Non-Normal Distributions With Specified L-Moments And L-Correlations, Todd C. Headrick, Mohan D. Pant

Mohan Dev Pant

This paper derives a procedure for simulating continuous non-normal distributions with specified L-moments and L-correlations in the context of power method polynomials of order three. It is demonstrated that the proposed procedure has computational advantages over the traditional product-moment procedure in terms of solving for intermediate correlations. Simulation results also demonstrate that the proposed L-moment-based procedure is an attractive alternative to the traditional procedure when distributions with more severe departures from normality are considered. Specifically, estimates of L-skew and L-kurtosis are superior to the conventional estimates of skew and kurtosis in terms of both relative bias and relative standard error. …


Using The R Library Rpanel For Gui-Based Simulations In Introductory Statistics Courses, Ryan M. Allison May 2012

Using The R Library Rpanel For Gui-Based Simulations In Introductory Statistics Courses, Ryan M. Allison

Statistics

As a student, I noticed that the statistical package R (http://www.r-project.org) would have several benefits of its usage in the classroom. One benefit to the package is its free and open-source nature. This would be a great benefit for instructors and students alike since it would be of no cost to use, unlike other statistical packages. Due to this, students could continue using the program after their statistical courses and into their professional careers. It would be good to expose students while they are in school to a tool that professionals use in industry. R also has powerful …


The Quotient Of The Beta-Weibull Distribution, Nonhle Channon Mdziniso Jan 2012

The Quotient Of The Beta-Weibull Distribution, Nonhle Channon Mdziniso

Theses, Dissertations and Capstones

A new class of distributions recently developed involves the logit of the beta distribution. Among this class of distributions are, the beta-Normal (Eugene et al. [15]); beta-Gumbel (Nadarajah and Kotz [18]); beta-Exponential (Nadarajah and Kotz [19]); beta-Weibull (Famoye et al. [6]); beta-Rayleigh (Akinsete and Lowe [3]); beta-Laplace (Kozubowshi and Nadarajah [20]); and beta-Pareto (Akinsete et al. [4]), among a few others. Many useful statistical properties arising from these distributions and their applications to real life data have been discussed in literature. One approach by which a new statistical distribution is generated is by the transformation of random variables having known …


On Simulating Univariate And Multivariate Burr Type Iii And Type Xii Distributions, Todd C. Headrick, Mohan D. Pant, Yanyan Sheng Mar 2010

On Simulating Univariate And Multivariate Burr Type Iii And Type Xii Distributions, Todd C. Headrick, Mohan D. Pant, Yanyan Sheng

Mohan Dev Pant

This paper describes a method for simulating univariate and multivariate Burr Type III and Type XII distributions with specified correlation matrices. The methodology is based on the derivation of the parametric forms of a pdf and cdf for this family of distributions. The paper shows how shape parameters can be computed for specified values of skew and kurtosis. It is also demonstrated how to compute percentage points and other measures of central tendency such as the mode, median, and trimmed mean. Examples are provided to demonstrate how this Burr family can be used in the context of distribution fitting using …


Simulating Multivariate G-And-H Distributions, Rhonda K. Kowalchuk, Todd C. Headrick Jan 2010

Simulating Multivariate G-And-H Distributions, Rhonda K. Kowalchuk, Todd C. Headrick

Todd Christopher Headrick

The Tukey family of g-and-h distributions is often used to model univariate real-world data. There is a paucity of research demonstrating appropriate multivariate data generation using the g-and-h family of distributions with specified correlations. Therefore, the methodology and algorithms are presented to extend the g-and-h family from univariate to multivariate data generation. An example is provided along with a Monte Carlo simulation demonstrating the methodology. In addition, algorithms written in Mathematica 7.0 are available from the authors for implementing the procedure.


Statistical Simulation: Power Method Polynomials And Other Transformations, Todd C. Headrick Jan 2010

Statistical Simulation: Power Method Polynomials And Other Transformations, Todd C. Headrick

Todd Christopher Headrick

Although power method polynomials based on the standard normal distributions have been used in many different contexts for the past 30 years, it was not until recently that the probability density function (pdf) and cumulative distribution function (cdf) were derived and made available. Focusing on both univariate and multivariate nonnormal data generation, Statistical Simulation: Power Method Polynomials and Other Transformations presents techniques for conducting a Monte Carlo simulation study. It shows how to use power method polynomials for simulating univariate and multivariate nonnormal distributions with specified cumulants and correlation matrices. The book first explores the methodology underlying the power method, …


Simulating Controlled Variate And Rank Correlations Based On The Power Method Transformation, Todd C. Headrick, Simon Y. Aman, T. Mark Beasley Dec 2007

Simulating Controlled Variate And Rank Correlations Based On The Power Method Transformation, Todd C. Headrick, Simon Y. Aman, T. Mark Beasley

Todd Christopher Headrick

The power method transformation is a popular algorithm used for simulating correlated non normal continuous variates because of its simplicity and ease of execution. Statistical models may consist of continuous and (or) ranked variates. In view of this, the methodology is derived for simulating controlled correlation structures between non normal (a) variates, (b) ranks, and (c) variates with ranks in the context of the power method. The correlation structure between variate-values and their associated rank-order is also derived for the power method. As such, a measure of the potential loss of information is provided when ranks are used in place …


The Power Method Transformation: Its Probability Density Function, Distribution Function, And Its Further Use For Fitting Data, Todd C. Headrick, Rhonda K. Kowalchuk Mar 2007

The Power Method Transformation: Its Probability Density Function, Distribution Function, And Its Further Use For Fitting Data, Todd C. Headrick, Rhonda K. Kowalchuk

Todd Christopher Headrick

The power method polynomial transformation is a popular algorithm used for simulating non-normal distributions because of its simplicity and ease of execution. The primary limitations of the power method transformation are that its probability density function (pdf) and cumulative distribution function (cdf) are unknown. In view of this, the power method’s pdf and cdf are derived in general form. More specific properties are also derived for determining if a given transformation will also have an associated pdf in the context of polynomials of order three and five. Numerical examples and parametric plots of power method densities are provided to confirm …


Simulating Correlated Multivariate Nonnormal Distributions: Extending The Fleishman Power Method, Todd C. Headrick, Shlomo S. Sawilowsky Mar 1999

Simulating Correlated Multivariate Nonnormal Distributions: Extending The Fleishman Power Method, Todd C. Headrick, Shlomo S. Sawilowsky

Todd Christopher Headrick

A procedure for generating multivariate nonnormal distributions is proposed. Our procedure generates average values of intercorrelations much closer to population parameters than competing procedures for skewed and/or heavy tailed distributions and for small sample sizes. Also, it eliminates the necessity of conducting a factorization procedure on the population correlation matrix that underlies the random deviates, and it is simpler to code in a programming language (e.g,, FORTRAN). Numerical examples demonstrating the procedures are given. Monte Carlo results indicate our procedure yields excellent agreement between population parameters and average values of intercorrelation, skew, and kurtosis.


Simulating Regional Interindustry Models For Western States, William A. Schaffer, Kong Chu Jan 1969

Simulating Regional Interindustry Models For Western States, William A. Schaffer, Kong Chu

Applications

Although regional input-output models are now most frequently constructed on the basis of reasonably adequate surveys, simulation (estimating) techniques not based on original survey data are still in use by many regional scientists for quick and less costly results. We will modify our original aggregation procedures, examine our results through several statistical tests of tables constructed for three Western states, and discuss a possible correction procedure for improving raw estimates of interindustry transactions.