Open Access. Powered by Scholars. Published by Universities.®

Applied Statistics Commons

Open Access. Powered by Scholars. Published by Universities.®

Old Dominion University

Discipline
Keyword
Publication Year
Publication
Publication Type

Articles 1 - 30 of 42

Full-Text Articles in Applied Statistics

A New Method To Determine The Posterior Distribution Of Coefficient Alpha, John Mart V. Delosreyes Oct 2023

A New Method To Determine The Posterior Distribution Of Coefficient Alpha, John Mart V. Delosreyes

Psychology Theses & Dissertations

There is a focus within the behavioral/social sciences on non-physical, psychological constructs (i.e., constructs). These constructs are indirectly measured using measurement instruments that consist of questions that capture the manifestations of these constructs. The indirect nature of measuring constructs results in a need of ensuring that measurement instruments are reliable. The most popular statistic used to estimate reliability is coefficient alpha as it is easy to compute and has properties that make it desirable to use. Coefficient alpha’s popularity has resulted in a wide breadth of research into its qualities. Notably, research about coefficient alpha’s distribution has led to developments …


Statistical Methods For Meta-Analysis In Large-Scale Genomic Experiments, Wimarsha Thathsarani Jayanetti Dec 2022

Statistical Methods For Meta-Analysis In Large-Scale Genomic Experiments, Wimarsha Thathsarani Jayanetti

Mathematics & Statistics Theses & Dissertations

Recent developments in high throughput genomic assays have opened up the possibility of testing hundreds and thousands of genes simultaneously. With the availability of vast amounts of public databases, researchers tend to combine genomic analysis results from multiple studies in the form of a meta-analysis. Meta-analysis methods can be broadly classified into two main categories. The first approach is to combine the statistical significance (pvalues) of the genes from each individual study, and the second approach is to combine the statistical estimates (effect sizes) from the individual studies. In this dissertation, we will discuss how adherence to the standard null …


The Online Ordering Behaviors Among Participants In The Oklahoma Women, Infants, And Children Program: A Cross-Sectional Analysis, Qi Zhang, Kayoung Park, Junzhou Zhang, Chuanyi Tang Jan 2022

The Online Ordering Behaviors Among Participants In The Oklahoma Women, Infants, And Children Program: A Cross-Sectional Analysis, Qi Zhang, Kayoung Park, Junzhou Zhang, Chuanyi Tang

Community & Environmental Health Faculty Publications

The Special Supplemental Nutrition Program for Women, Infants, and Children (WIC) is a nutrition assistance program in the United States (U.S.). Participants in the program redeem their prescribed food benefits in WIC-authorized grocery stores. Online ordering is an innovative method being pilot-tested in some stores to facilitate WIC participants' food benefit redemption, which has become especially important in the COVID-19 pandemic. The present research aimed to examine the online ordering (OO) behaviors among 726 WIC households who adopted WIC OO in a grocery chain, XYZ (anonymous) store, in Oklahoma (OK). These households represented approximately 5% of WIC households who redeemed …


A Copula Model Approach To Identify The Differential Gene Expression, Prasansha Liyanaarachchi Dec 2021

A Copula Model Approach To Identify The Differential Gene Expression, Prasansha Liyanaarachchi

Mathematics & Statistics Theses & Dissertations

Deoxyribonucleic acid, more commonly known as DNA, is a complex double helix-shaped molecule present in all living organisms and hosts thousands of genes. However, only a few genes exhibit differential expression and play a vital role in a particular disease such as breast cancer. Microarray technology is one of the modern technologies developed to study these gene expressions. There are two major microarray technologies available for expression analysis: Spotted cDNA array and oligonucleotide array. The focus of our research is the statistical analysis of data that arises from the spotted cDNA microarray. Numerous models have been proposed in the literature …


Empirical Modeling Of Tilt-Rotor Aerodynamic Performance, Michael C. Stratton Oct 2021

Empirical Modeling Of Tilt-Rotor Aerodynamic Performance, Michael C. Stratton

Mechanical & Aerospace Engineering Theses & Dissertations

There has been increasing interest into the performance of electric vertical takeoff and landing (eVTOL) aircraft. The propellers used for the eVTOL propulsion systems experience a broad range of aerodynamic conditions, not typically experienced by propellers in forward flight, that includes large incidence angles relative to the oncoming airflow. Formal experiment design and analysis techniques featuring response surface methods were applied to a subscale, tilt-rotor wind tunnel test for three, four, five, and six blade, 16-inch diameter, propeller configurations in support of development of the NASA LA-8 aircraft. Investigation of low-speed performance included a maximum speed of 12 m/s and …


D-Vine Copula Model For Dependent Binary Data, Huihui Lin, N. Rao Chaganty Apr 2020

D-Vine Copula Model For Dependent Binary Data, Huihui Lin, N. Rao Chaganty

College of Sciences Posters

High-dimensional dependent binary data are prevalent in a wide range of scientific disciplines. A popular method for analyzing such data is the Multivariate Probit (MP) model. But the MP model sometimes fails even within a feasible range of binary correlations, because the underlying correlation matrix of the latent variables may not be positive definite. In this research, we proposed pair copula models, assuming the dependence between the binary variables is first order autoregressive (AR(1))or equicorrelated structure. Also, when Archimediean copula is used, most paper converted Kendall Tau to corresponding copula parameter, there is no explicit function of Pearson’s correlation coefficient …


Rotorcraft Blade Angle Calibration Methods, Brian David Calvert Jr. Apr 2020

Rotorcraft Blade Angle Calibration Methods, Brian David Calvert Jr.

Mechanical & Aerospace Engineering Theses & Dissertations

The most vital system of a rotorcraft is the rotor system due to its effects on the overall flight quality of the vehicle. Therefore, it is of importance to be able to accurately determine blade position during flight so that fine adjustments can be made to ensure a safe and efficient flight. In this study, a current calibration method focusing on the pitch, flap, and lead-lag blade angles is analyzed and found to have larger than acceptable error associated with the sensor calibrations. A literature review is conducted which reveals four novel methods that can potentially increase the accuracy of …


Nonparametric False Discovery Rate Control For Identifying Simultaneous Signals, Sihai Dave Zhao, Yet Tian Nguyen Jan 2020

Nonparametric False Discovery Rate Control For Identifying Simultaneous Signals, Sihai Dave Zhao, Yet Tian Nguyen

Mathematics & Statistics Faculty Publications

It is frequently of interest to identify simultaneous signals, defined as features that exhibit statistical significance across each of several independent experiments. For example, genes that are consistently differentially expressed across experiments in different animal species can reveal evolutionarily conserved biological mechanisms. However, in some problems the test statistics corresponding to these features can have complicated or unknown null distributions. This paper proposes a novel nonparametric false discovery rate control procedure that can identify simultaneous signals even without knowing these null distributions. The method is shown, theoretically and in simulations, to asymptotically control the false discovery rate. It was also …


Copula-Based Zero-Inflated Count Time Series Models, Mohammed Sulaiman Alqawba Jul 2019

Copula-Based Zero-Inflated Count Time Series Models, Mohammed Sulaiman Alqawba

Mathematics & Statistics Theses & Dissertations

Count time series data are observed in several applied disciplines such as in environmental science, biostatistics, economics, public health, and finance. In some cases, a specific count, say zero, may occur more often than usual. Additionally, serial dependence might be found among these counts if they are recorded over time. Overlooking the frequent occurrence of zeros and the serial dependence could lead to false inference. In this dissertation, we propose two classes of copula-based time series models for zero-inflated counts with the presence of covariates. Zero-inflated Poisson (ZIP), zero-inflated negative binomial (ZINB), and zero-inflated Conway-Maxwell-Poisson (ZICMP) distributed marginals of the …


Spatio-Temporal Cluster Detection And Local Moran Statistics Of Point Processes, Jennifer L. Matthews Apr 2019

Spatio-Temporal Cluster Detection And Local Moran Statistics Of Point Processes, Jennifer L. Matthews

Mathematics & Statistics Theses & Dissertations

Moran's index is a statistic that measures spatial dependence, quantifying the degree of dispersion or clustering of point processes and events in some location/area. Recognizing that a single Moran's index may not give a sufficient summary of the spatial autocorrelation measure, a local indicator of spatial association (LISA) has gained popularity. Accordingly, we propose extending LISAs to time after partitioning the area and computing a Moran-type statistic for each subarea. Patterns between the local neighbors are unveiled that would not otherwise be apparent. We consider the measures of Moran statistics while incorporating a time factor under simulated multilevel Palm distribution, …


Latent Choice Models To Account For Misclassification Errors In Discrete Transportation Data, Lacramioara Elena Balan Apr 2019

Latent Choice Models To Account For Misclassification Errors In Discrete Transportation Data, Lacramioara Elena Balan

Civil & Environmental Engineering Theses & Dissertations

One of the most fundamental tasks when it comes to analyzing data using statistical methods is to understand the relationship between the explanatory variables and the outcome. Misclassification of explanatory variables is a common risk when using statistical modeling techniques. In this dissertation, we define ‘misclassification,’ as a response that is reported or recorded in the wrong category; for example, a variable is registered as a one when it should have the value zero. Misclassification can easily happen in any data; for example, in an interview setting where the respondent misunderstands the question or the interviewer checks the wrong box. …


Controlling For Confounding Via Propensity Score Methods Can Result In Biased Estimation Of The Conditional Auc: A Simulation Study, Hadiza I. Galadima, Donna K. Mcclish Jan 2019

Controlling For Confounding Via Propensity Score Methods Can Result In Biased Estimation Of The Conditional Auc: A Simulation Study, Hadiza I. Galadima, Donna K. Mcclish

Community & Environmental Health Faculty Publications

In the medical literature, there has been an increased interest in evaluating association between exposure and outcomes using nonrandomized observational studies. However, because assignments to exposure are not random in observational studies, comparisons of outcomes between exposed and nonexposed subjects must account for the effect of confounders. Propensity score methods have been widely used to control for confounding, when estimating exposure effect. Previous studies have shown that conditioning on the propensity score results in biased estimation of conditional odds ratio and hazard ratio. However, research is lacking on the performance of propensity score methods for covariate adjustment when estimating the …


Density Estimation Of Spatio-Temporal Point Patterns Using Moran’S Statistics, Jennifer L. Lorio, Norou Diawara, Lance A. Waller Mar 2018

Density Estimation Of Spatio-Temporal Point Patterns Using Moran’S Statistics, Jennifer L. Lorio, Norou Diawara, Lance A. Waller

Mathematics & Statistics Faculty Publications

Moran’s Index is a statistic that measures spatial autocorrelation, quantifying the degree of dispersion (or spread) of objects in space. When investigating data in an area, a single Moran statistic may not give a sufficient summary of the autocorrelation spread. However, by partitioning the area and taking the Moran statistic of each subarea, we discover patterns of the local neighbors not otherwise apparent. In this paper, we consider the model of the spread of an infectious disease, incorporate time factor, and simulate a multilevel Poisson process where the dependence among the levels is captured by the rate of increase of …


The Use Of Item Response Theory In Survey Methodology: Application In Seat Belt Data, Mark K. Ledbetter, Norou Diawara, Bryan E. Porter Jan 2018

The Use Of Item Response Theory In Survey Methodology: Application In Seat Belt Data, Mark K. Ledbetter, Norou Diawara, Bryan E. Porter

Mathematics & Statistics Faculty Publications

Problem: Several approaches to analyze survey data have been proposed in the literature. One method that is not popular in survey research methodology is the use of item response theory (IRT). Since accurate methods to make prediction behaviors are based upon observed data, the design model must overcome computation challenges, but also consideration towards calibration and proficiency estimation. The IRT model deems to be offered those latter options. We review that model and apply it to an observational survey data. We then compare the findings with the more popular weighted logistic regression. Method: Apply IRT model to the observed data …


A Proposed Taxonomy For The Systems Statistical Engineering Body Of Knowledge, Teddy Steven Cotter Jan 2018

A Proposed Taxonomy For The Systems Statistical Engineering Body Of Knowledge, Teddy Steven Cotter

Engineering Management & Systems Engineering Faculty Publications

In the ASEM-IAC 2012, Cotter (2012) identified the gaps in knowledge that statistical engineering needs to address, explored additional gaps in knowledge not addressed in the prior works, and set forth a working definition of and body of knowledge for statistical engineering. In the ASEM-IAC 2015, Cotter (2015) proposed a systemic causal Bayesian hierarchical model that addressed the knowledge gap needed to integrate deterministic mathematical engineering causal models within a stochastic framework. Missing, however, is the framework for specifying the hierarchical qualitative systems structures necessary and sufficient for specifying systemic causal Bayesian hierarchical models. In the ASEM-IAC 2016, Cotter (2016) …


Barriers To Counseling Among Human Service Professionals: The Development And Validation Of The Fit, Stigma, & Value Scale, Edward S. Neukrug, Michael T. Kalkbrenner, Sandy-Ann M. Griffith Jan 2017

Barriers To Counseling Among Human Service Professionals: The Development And Validation Of The Fit, Stigma, & Value Scale, Edward S. Neukrug, Michael T. Kalkbrenner, Sandy-Ann M. Griffith

Counseling & Human Services Faculty Publications

This study sought to confirm rates of attendance in counseling of human service professionals and validate a 32-item questionnaire designed to identify barriers to counseling seeking behavior among this population. Results indicated that a large percentage of human service professionals attend counseling, with males and females attending at similar rates and non-Caucasians attending at lower rates. A multivariate analysis of variance and descriptive statistics identified the most common barriers to attendance in counseling and examined demographic differences in participants’ sensitivity towards barriers to attendance in counseling. A Principal Factor Analysis (PFA) revealed three subscales (fit, value, and stigma), which we …


Exploring New Models For Seatbelt Use In Survey Data, Mark K. Ledbetter, Norou Diawara, Bryan E. Porter Oct 2016

Exploring New Models For Seatbelt Use In Survey Data, Mark K. Ledbetter, Norou Diawara, Bryan E. Porter

Virginia Journal of Science

Problem: Several approaches to analyze seatbelt use have been proposed in the literature. Two methods that has not been explored are the use of unweighted and weighted logistic regression model and the use of item response theory (IRT) or the Rasch model. Since accurate methods to predict seatbelt use behavior based upon observed data must include a built-in design method and model, and overcome computation challenges, weighted and IRT method deem to be other options for an observational survey of seat belt use in the state of Virginia.

Method: The observed data from 136 sites within the Commonwealth …


Zero-Inflated Models To Identify Transcription Factor Binding Sites In Chip-Seq Experiments, Sameera Dhananjaya Viswakula Apr 2015

Zero-Inflated Models To Identify Transcription Factor Binding Sites In Chip-Seq Experiments, Sameera Dhananjaya Viswakula

Mathematics & Statistics Theses & Dissertations

It is essential to determine the protein-DNA binding sites to understand many biological processes. A transcription factor is a particular type of protein that binds to DNA and controls gene regulation in living organisms. Chromatin immunoprecipitation followed by highthroughput sequencing (ChIP-seq) is considered the gold standard in locating these binding sites and programs use to identify DNA-transcription factor binding sites are known as peak-callers. ChIP-seq data are known to exhibit considerable background noise and other biases. In this study, we propose a negative binomial model (NB), a zero-inflated Poisson model (ZIP) and a zero-inflated negative binomial model (ZINB) for peak-calling. …


Analysis Of Continuous Longitudinal Data With Arma(1, 1) And Antedependence Correlation Structures, Sirisha Mushti Apr 2013

Analysis Of Continuous Longitudinal Data With Arma(1, 1) And Antedependence Correlation Structures, Sirisha Mushti

Mathematics & Statistics Theses & Dissertations

Longitudinal or repeated measure data are common in biomedical and clinical trials. These data are often collected on individuals at scheduled times resulting in dependent responses. Inference methods for studying the behavior of responses over time as well as methods to study the association with certain risk factors or covariates taking into account the dependencies are of great importance. In this research we focus our study on the analysis of continuous longitudinal data. To model the dependencies of the responses over time, we consider appropriate correlation structures generated by the stationary and non-stationary time-series models. We develop new estimation procedures …


Modeling Martian Planetary Entry Descent And Landing Using Monte Carlo Driven Response Surface Methodology, Narcrisha S. Norman Oct 2012

Modeling Martian Planetary Entry Descent And Landing Using Monte Carlo Driven Response Surface Methodology, Narcrisha S. Norman

Mechanical & Aerospace Engineering Theses & Dissertations

Response surface methodology (RSM) is a statistical method that explores the relationships between several descriptive variables and one or more response variables. For over sixty years, among other areas, it has been utilized in quality engineering, process engineering, aircraft engineering, economics, chemical engineering, automotive engineering and design/technique optimization. In this dissertation, RSM is utilized to produce regression models that represent the planetary entry, descent and landing (EDL) process. A complete understanding of EDL process is an essential component of any planetary exploration. Research in this area is ongoing and confidence in the ability to explore known celestial bodies is growing. …


Meta-Heuristics Analysis For Technologically Complex Programs: Understanding The Impact Of Total Constraints For Schedule, Quality And Cost, Henry Darrel Webb Jul 2012

Meta-Heuristics Analysis For Technologically Complex Programs: Understanding The Impact Of Total Constraints For Schedule, Quality And Cost, Henry Darrel Webb

Engineering Management & Systems Engineering Projects for D. Eng. Degree

Program management data associated with a technically complex radio frequency electronics base communication system has been collected and analyzed to identify heuristics which may be utilized in addition to existing processes and procedures to provide indicators that a program is trending to failure. Analysis of the collected data includes detailed schedule analysis, detailed earned value management analysis and defect analysis within the framework of a Firm Fixed Price (FFP) incentive fee contract.

This project develops heuristics and provides recommendations for analysis of complex project management efforts such as those discussed herein. The analysis of the effects of the constraints on …


A Statistical Model To Determine Multiple Binding Sites Of A Transcription Factor On Dna Using Chip-Seq Data, Rasika Jayatillake Jul 2012

A Statistical Model To Determine Multiple Binding Sites Of A Transcription Factor On Dna Using Chip-Seq Data, Rasika Jayatillake

Mathematics & Statistics Theses & Dissertations

Protein-DNA interaction is vital to many biological processes in cells such as cell division, embryo development and regulating gene expression. Chromatin Immunoprecipitation followed by massively parallel sequencing (ChIP-seq) is a new technology that can reveal protein binding sites in genome with superior accuracy. Although many methods have been proposed to find binding sites for ChIP-seq data, they can find only one binding site within a short region of the genome. In this study we introduce a statistical model to identify multiple binding sites of a transcription factor within a short region of the genome using the ChIP-seq data. Mapped sequence …


Response Surface Optimization Of Electron Beam Freeform Fabrication Depositions Using Design Of Experiments, Patricia A. Quigley Jul 2012

Response Surface Optimization Of Electron Beam Freeform Fabrication Depositions Using Design Of Experiments, Patricia A. Quigley

Engineering Management & Systems Engineering Theses & Dissertations

The Electron Beam Freeform Fabrication (EBF3 ) System is a material depositing, layer additive technique that produces three dimensional (3D) parts out of a wide range of metals in high vacuum, using an electron beam and wire feedstock. Screening deposition trials on a titanium alloy, Ti-6Al-4V, at the National Aeronautics Space Administration (NASA) revealed selective vaporization of the aluminum content of linear prototypes when subjected to chemical analysis. In this study, the aluminum content, bead height and bead width output responses were analyzed from a systematic study of the effects that the interactions of the EBF3 processing parameters …


Analysis Of Discrete Choice Probit Models With Structured Correlation Matrices, Bhaskara Ravi Jan 2012

Analysis Of Discrete Choice Probit Models With Structured Correlation Matrices, Bhaskara Ravi

Mathematics & Statistics Theses & Dissertations

Discrete choice models are very popular in Economics and the conditional logit model is the most widely used model to analyze consumer choice behavior, which was introduced in a seminal paper by McFadden (1974). This model is based on the assumption that the unobserved factors, which determine the consumer choices, are independent and follow a Gumbel distribution, widely known as the Independence of irrelevant Alternatives (IIA) assumption. Alternate models that relax IIA assumption are the Generalized Extreme Value (GEV) models, which allow dependency between unobserved factors. However, GEV models do not incorporate all dependency patterns, other choice behaviors such as …


The Doubly Inflated Poisson And Related Regression Models, Manasi Sheth-Chandra Jan 2011

The Doubly Inflated Poisson And Related Regression Models, Manasi Sheth-Chandra

Mathematics & Statistics Theses & Dissertations

Most real life count data consists of some values that are more frequent than allowed by the common parametric families of distributions. For data consisting of only excess zeros, in a seminal paper Lambert (1992) introduced Zero-Inflated Poisson (ZIP) model, which is a mixture model that accounts for the inflated zeros. In this thesis, two Doubly Inflated Poisson (DIP) probability models, DIP (p, λ) and DIP ( p1, p2, λ), are discussed for situations where there is another inflated value k > 0 besides the inflated zeros. The distributional properties such as identifiability, moments, and conditional probabilities …


Rao's Quadratic Entropy And Some New Applications, Yueqin Zhao Apr 2010

Rao's Quadratic Entropy And Some New Applications, Yueqin Zhao

Mathematics & Statistics Theses & Dissertations

Many problems in statistical inference are formulated as testing the diversity of populations. The entropy functions measure the similarity of a distribution function to the uniform distribution and hence can be used as a measure of diversity. Rao (1982a) proposed the concept of quadratic entropy. Its concavity property makes the decomposition similar to ANOVA for categorical data feasible. In this thesis, after reviewing the properties and providing a modification to quadratic entropy, various applications of quadratic entropy are explored. First, analysis of quadratic entropy with the suggested modification to analyze the contingency table data is explored. Then its application to …


Canonical Correlation Analysis For Longitudinal Data, Raymond Mccollum Jan 2010

Canonical Correlation Analysis For Longitudinal Data, Raymond Mccollum

Mathematics & Statistics Theses & Dissertations

Data (multivariate data) on two sets of vectors commonly occur in applications. Statistical analysis of these data is usually done using a canonical correlation analysis (CCA). Occurrence of these data at multiple occasions or conditions leads to longitudinal multivariate data for a CCA. We address the problem of canonical correlation analysis on longitudinal data when the data have a Kronecker product covariance structure. Using structured correlation matrices we model the dependency of repeatedly observed data. Recent work of Srivastava, Nahtman, and von Rosen (2008) developed an iterative algorithm to determine the maximum likelihood estimate of the Kronecker product covariance structure …


Analysis Of Models For Longitudinal And Clustered Binary Data, Weiming Yang Jan 2010

Analysis Of Models For Longitudinal And Clustered Binary Data, Weiming Yang

Mathematics & Statistics Theses & Dissertations

This dissertation deals with modeling and statistical analysis of longitudinal and clustered binary data. Such data consists of observations on a dichotomous response variable generated from multiple time or cluster points, that exhibit either decaying correlation or equi-correlated dependence. The current literature addresses modeling the dependence using an appropriate correlation structure, but ignores the feasible bounds on the correlation parameter imposed by the marginal means.

The first part of this dissertation deals with two multivariate probability models, the first order Markov chain model and the multivariate probit model, that adhere to the feasible bounds on the correlation. For both the …


Canonical Correlation And Correspondence Analysis Of Longitudinal Data, Jayesh Srivastava Apr 2007

Canonical Correlation And Correspondence Analysis Of Longitudinal Data, Jayesh Srivastava

Mathematics & Statistics Theses & Dissertations

Assessing the relationship between two sets of multivariate vectors is an important problem in statistics. Canonical correlation coefficients are used to study these relationships. Canonical correlation analysis (CCA) is a general multivariate method that is mainly used to study relationships when both sets of variables are quantitative. When the variables are qualitative (categorical), a technique called correspondence analysis (CA) is used. Canonical correspondence analysis (CCPA) is used to deal with the case when one set of variables is categorical and the other set is quantitative. By exploiting the interrelationships between these three techniques we first provide a theoretical basis for …


Modeling And Efficient Estimation Of Intra-Family Correlations, Roy Sabo Jan 2007

Modeling And Efficient Estimation Of Intra-Family Correlations, Roy Sabo

Mathematics & Statistics Theses & Dissertations

Familial data occur when observations are taken on multiple members of the same family. Due to relationships between these members, both genetic and by cohabitation, their response variables will likely exhibit some form of dependence. Most of the existing literature models this dependence with an equicorrelated structure. This structure is appropriate when the dependencies between family members are similar, such as in genetic studies, but not in cases where we expect the dependencies to differ, such as behavioral comparisons across different age groups. In this dissertation we first discuss an alternative structure based upon first-order autoregressive correlation. Specifically we create …