Open Access. Powered by Scholars. Published by Universities.®
- Discipline
-
- Applied Statistics (6)
- Engineering (5)
- Probability (5)
- Artificial Intelligence and Robotics (4)
- Computer Sciences (4)
-
- Biostatistics (3)
- Longitudinal Data Analysis and Time Series (3)
- Social and Behavioral Sciences (3)
- Computational Engineering (2)
- Economics (2)
- Life Sciences (2)
- Operations Research, Systems Engineering and Industrial Engineering (2)
- Statistical Methodology (2)
- Survival Analysis (2)
- Aerospace Engineering (1)
- Animal Sciences (1)
- Applied Mathematics (1)
- Aquaculture and Fisheries (1)
- Business (1)
- Categorical Data Analysis (1)
- Data Science (1)
- Design of Experiments and Sample Surveys (1)
- Education (1)
- Genetics (1)
- Genetics and Genomics (1)
- Industrial Engineering (1)
- Marketing (1)
- Keyword
-
- Agent-based simulation (2)
- Bivariate exponential (2)
- Gaussian copula (2)
- Logistic regression (2)
- Maximum likelihood (2)
-
- Quasi-least squares (2)
- AR(1) (1)
- AUC (1)
- Admixture (1)
- Ancestry (1)
- Artificial intelligence (1)
- Autoregressive correlation (1)
- Avoid-ability (1)
- Bias (1)
- Big data (1)
- Binomial data (1)
- Blackjack (1)
- Clayton copula (1)
- Copula (1)
- Correlated discrete choice models (1)
- Covariate adjustment (1)
- Creel survey (1)
- D-vine pair-copula (1)
- Data-driven modeling (1)
- Decision governance (1)
- Dependent discrete choice models (1)
- Derivative of multivariate normal CDF (1)
- Dirac delta (1)
- Doubly inflated Poisson (1)
- Equicorrelated (1)
- Publication Year
- Publication
-
- Mathematics & Statistics Faculty Publications (7)
- Mathematics & Statistics Theses & Dissertations (7)
- Community & Environmental Health Faculty Publications (1)
- Computational Modeling & Simulation Engineering Theses & Dissertations (1)
- Engineering Management & Systems Engineering Faculty Publications (1)
- Publication Type
Articles 1 - 22 of 22
Full-Text Articles in Statistical Models
Statistical Approach To Quantifying Interceptability Of Interaction Scenarios For Testing Autonomous Surface Vessels, Benjamin E. Hargis, Yiannis E. Papelis
Statistical Approach To Quantifying Interceptability Of Interaction Scenarios For Testing Autonomous Surface Vessels, Benjamin E. Hargis, Yiannis E. Papelis
Modeling, Simulation and Visualization Student Capstone Conference
This paper presents a probabilistic approach to quantifying interceptability of an interaction scenario designed to test collision avoidance of autonomous navigation algorithms. Interceptability is one of many measures to determine the complexity or difficulty of an interaction scenario. This approach uses a combined probability model of capability and intent to create a predicted position probability map for the system under test. Then, intercept-ability is quantified by determining the overlap between the system under test probability map and the intruder’s capability model. The approach is general; however, a demonstration is provided using kinematic capability models and an odometry-based intent model.
Novel Statistical Analysis In The Context Of A Comprehensive Needs Assessment For Secondary Stem Recruitment, Norou Diawara, Sarah Ferguson, Melva Grant, Kumer Das
Novel Statistical Analysis In The Context Of A Comprehensive Needs Assessment For Secondary Stem Recruitment, Norou Diawara, Sarah Ferguson, Melva Grant, Kumer Das
Mathematics & Statistics Faculty Publications
There is a myriad of career opportunities stemming from science, technology, engineering, and mathematics (STEM) disciplines. In addition to careers in corporate settings, teaching is a viable career option for individuals pursuing degrees in STEM disciplines. With national shortages of secondary STEM teachers, efforts to recruit, train, and retain quality STEM teachers is greatly important. Prior to exploring ways to attract potential STEM teacher candidates to pursue teacher training programs, it is important to understand the perceived value that potential recruits place on STEM careers, disciplines, and the teaching profession. The purpose of this study was to explore students’ perceptions …
D-Vine Pair-Copula Models For Longitudinal Binary Data, Huihui Lin
D-Vine Pair-Copula Models For Longitudinal Binary Data, Huihui Lin
Mathematics & Statistics Theses & Dissertations
Dependent longitudinal binary data are prevalent in a wide range of scientific disciplines, including healthcare and medicine. A popular method for analyzing such data is the multivariate probit (MP) model. The motivation for this dissertation stems from the fact that the MP model fails even the binary correlations are within the feasible range. The reason being the underlying correlation matrix of the latent variables in the MP model may not be positive definite. In this dissertation, we study alternatives that are based on D-vine pair-copula models. We consider both the serial dependence modeled by the first order autoregressive (AR(1)) and …
A Data-Driven Approach For Modeling Agents, Hamdi Kavak
A Data-Driven Approach For Modeling Agents, Hamdi Kavak
Computational Modeling & Simulation Engineering Theses & Dissertations
Agents are commonly created on a set of simple rules driven by theories, hypotheses, and assumptions. Such modeling premise has limited use of real-world data and is challenged when modeling real-world systems due to the lack of empirical grounding. Simultaneously, the last decade has witnessed the production and availability of large-scale data from various sensors that carry behavioral signals. These data sources have the potential to change the way we create agent-based models; from simple rules to driven by data. Despite this opportunity, the literature has neglected to offer a modeling approach to generate granular agent behaviors from data, creating …
Controlling For Confounding Via Propensity Score Methods Can Result In Biased Estimation Of The Conditional Auc: A Simulation Study, Hadiza I. Galadima, Donna K. Mcclish
Controlling For Confounding Via Propensity Score Methods Can Result In Biased Estimation Of The Conditional Auc: A Simulation Study, Hadiza I. Galadima, Donna K. Mcclish
Community & Environmental Health Faculty Publications
In the medical literature, there has been an increased interest in evaluating association between exposure and outcomes using nonrandomized observational studies. However, because assignments to exposure are not random in observational studies, comparisons of outcomes between exposed and nonexposed subjects must account for the effect of confounders. Propensity score methods have been widely used to control for confounding, when estimating exposure effect. Previous studies have shown that conditioning on the propensity score results in biased estimation of conditional odds ratio and hazard ratio. However, research is lacking on the performance of propensity score methods for covariate adjustment when estimating the …
New Approaches To Model Simulated Spatio-Temporal Moran's Index, Nhan Bu, Jennifer Lorio, Norou Diawara, Kumar Das, Lance Waller
New Approaches To Model Simulated Spatio-Temporal Moran's Index, Nhan Bu, Jennifer Lorio, Norou Diawara, Kumar Das, Lance Waller
Mathematics & Statistics Faculty Publications
The Moran's index is a statistic that measures spatial autocorrelation; it quantifies the degree of dispersion (or clustering) of objects in space. However, when investigating data over a general area, a single global Moran statistic may not give a sufficient summary of the spread, behavior, features or latent surfaces shared by neighboring areas; rather, by partitioning the area and taking the Moran statistic of each divided subareas, we can discover patterns of the local neighbors not otherwise apparent. In this paper, we present a simulation experiment where the local Moran values are computed and a time variable is added to …
Supervised Classification Using Finite Mixture Copula, Sumen Sen, Norou Diawara
Supervised Classification Using Finite Mixture Copula, Sumen Sen, Norou Diawara
Mathematics & Statistics Faculty Publications
Use of copula for statistical classification is recent and gaining popularity. For example, statistical classification using copula has been proposed for automatic character recognition, medical diagnostic and most recently in data mining. Classical discrimination rules assume normality. But in this data age time, this assumption is often questionable. In fact features of data could be a mixture of discrete and continues random variables. In this paper, mixture copula densities are used to model class conditional distributions. Such types of densities are useful when the marginal densities of the vector of features are not normally distributed and are of a mixed …
Augmenting Bottom-Up Metamodels With Predicates, Ross J. Gore, Saikou Diallo, Christopher Lynch, Jose Padilla
Augmenting Bottom-Up Metamodels With Predicates, Ross J. Gore, Saikou Diallo, Christopher Lynch, Jose Padilla
VMASC Publications
Metamodeling refers to modeling a model. There are two metamodeling approaches for ABMs: (1) top-down and (2) bottom-up. The top down approach enables users to decompose high-level mental models into behaviors and interactions of agents. In contrast, the bottom-up approach constructs a relatively small, simple model that approximates the structure and outcomes of a dataset gathered fromthe runs of an ABM. The bottom-up metamodel makes behavior of the ABM comprehensible and exploratory analyses feasible. Formost users the construction of a bottom-up metamodel entails: (1) creating an experimental design, (2) running the simulation for all cases specified by the design, (3) …
Human-Intelligence/Machine-Intelligence Decision Governance: An Analysis From Ontological Point Of View, Faisal Mahmud, Teddy Steven Cotter
Human-Intelligence/Machine-Intelligence Decision Governance: An Analysis From Ontological Point Of View, Faisal Mahmud, Teddy Steven Cotter
Engineering Management & Systems Engineering Faculty Publications
The increasing CPU power and memory capacity of computers, and now computing appliances, in the 21st century has allowed accelerated integration of artificial intelligence (AI) into organizational processes and everyday life. Artificial intelligence can now be found in a wide range of organizational processes including medical diagnosis, automated stock trading, integrated robotic production systems, telecommunications routing systems, and automobile fuzzy logic controllers. Self-driving automobiles are just the latest extension of AI. This thrust of AI into organizations and everyday life rests on the AI community’s unstated assumption that “…every aspect of human learning and intelligence could be so precisely described …
Exploring New Models For Seatbelt Use In Survey Data, Mark K. Ledbetter, Norou Diawara, Bryan E. Porter
Exploring New Models For Seatbelt Use In Survey Data, Mark K. Ledbetter, Norou Diawara, Bryan E. Porter
Virginia Journal of Science
Problem: Several approaches to analyze seatbelt use have been proposed in the literature. Two methods that has not been explored are the use of unweighted and weighted logistic regression model and the use of item response theory (IRT) or the Rasch model. Since accurate methods to predict seatbelt use behavior based upon observed data must include a built-in design method and model, and overcome computation challenges, weighted and IRT method deem to be other options for an observational survey of seat belt use in the state of Virginia.
Method: The observed data from 136 sites within the Commonwealth …
Analysis Off Dependent Discrete Choices Using Gaussian Copula, Arjun Poddar
Analysis Off Dependent Discrete Choices Using Gaussian Copula, Arjun Poddar
Mathematics & Statistics Theses & Dissertations
A popular tool for analyzing product choices of consumers is the well-known conditional logit discrete choice model. Originally publicized by McFadden (1974), this model assumes that the random components of the underlying latent utility functions of the consumers follow independent Gumbel distributions. However, in practice the independence assumption may be violated and a more reasonable model should account for the dependence of the utilities. In this dissertation we use the Gaussian copula with compound symmetric and autoregressive of order one correlation matrices to construct a general multivariate model for the joint distribution of the utilities. The induced correlations on the …
Alternatives To Mixture Model Analysis Of Correlated Binomial Data, N. Rao Chaganty, Roy Sabo, Yihao Deng
Alternatives To Mixture Model Analysis Of Correlated Binomial Data, N. Rao Chaganty, Roy Sabo, Yihao Deng
Mathematics & Statistics Faculty Publications
While univariate instances of binomial data are readily handled with generalized linear models, cases of multivariate or repeated measure binomial data are complicated by the possibility of correlated responses. Likelihood-based estimation can be applied by using mixture distribution models, though this approach can present computational challenges. The logistic transformation can be used to bypass these concerns and allow for alternative estimating procedures. One popular alternative is the generalized estimating equation (GEE) method, though systematic errors can lead to infeasible correlation estimates or nonconvergence problems. Our approach is the coupling of quasileast squares (QLSs) method with a rarely used matrix factorization, …
Analysis Of Discrete Choice Probit Models With Structured Correlation Matrices, Bhaskara Ravi
Analysis Of Discrete Choice Probit Models With Structured Correlation Matrices, Bhaskara Ravi
Mathematics & Statistics Theses & Dissertations
Discrete choice models are very popular in Economics and the conditional logit model is the most widely used model to analyze consumer choice behavior, which was introduced in a seminal paper by McFadden (1974). This model is based on the assumption that the unobserved factors, which determine the consumer choices, are independent and follow a Gumbel distribution, widely known as the Independence of irrelevant Alternatives (IIA) assumption. Alternate models that relax IIA assumption are the Generalized Extreme Value (GEV) models, which allow dependency between unobserved factors. However, GEV models do not incorporate all dependency patterns, other choice behaviors such as …
The Doubly Inflated Poisson And Related Regression Models, Manasi Sheth-Chandra
The Doubly Inflated Poisson And Related Regression Models, Manasi Sheth-Chandra
Mathematics & Statistics Theses & Dissertations
Most real life count data consists of some values that are more frequent than allowed by the common parametric families of distributions. For data consisting of only excess zeros, in a seminal paper Lambert (1992) introduced Zero-Inflated Poisson (ZIP) model, which is a mixture model that accounts for the inflated zeros. In this thesis, two Doubly Inflated Poisson (DIP) probability models, DIP (p, λ) and DIP ( p1, p2, λ), are discussed for situations where there is another inflated value k > 0 besides the inflated zeros. The distributional properties such as identifiability, moments, and conditional probabilities …
Probability Models For Blackjack Poker, Charlie H. Cooke
Probability Models For Blackjack Poker, Charlie H. Cooke
Mathematics & Statistics Faculty Publications
For simplicity in calculation, previous analyses of blackjack poker have employed models which employ sampling with replacement. in order to assess what degree of error this may induce, the purpose here is to calculate results for a typical hand where sampling without replacement is employed. It is seen that significant error can result when long runs are required to complete the hand. The hand examined is itself of particular interest, as regards both its outstanding expectations of high yield and certain implications for pair splitting of two nines against the dealer's seven. Theoretical and experimental methods are used in order …
The Joint Distribution Of Bivariate Exponential Under Linearly Related Model, Norou Diawara, Kumer Pial Das
The Joint Distribution Of Bivariate Exponential Under Linearly Related Model, Norou Diawara, Kumer Pial Das
Mathematics & Statistics Faculty Publications
In this paper, fundamental results of the joint distribution of the bivariate exponential distributions are established. The positive support multivariate distribution theory is important in reliability and survival analysis, and we applied it to the case where more than one failure or survival is observed in a given study. Usually, the multivariate distribution is restricted to those with marginal distributions of a specified and familiar lifetime family. The family of exponential distribution contains the absolutely continuous and discrete case models with a nonzero probability on a set of measure zero. Examples are given, and estimators are developed and applied to …
Linear Dependency For The Difference In Exponential Regression, Indika Sathish, Norou Diawara
Linear Dependency For The Difference In Exponential Regression, Indika Sathish, Norou Diawara
Mathematics & Statistics Faculty Publications
In the field of reliability, a lot has been written on the analysis of phenomena that are related. Estimation of the difference of two population means have been mostly formulated under the no-correlation assumption. However, in many situations, there is a correlation involved. This paper addresses this issue. A sequential estimation method for linearly related lifetime distributions is presented. Estimations for the scale parameters of the exponential distribution are given under square error loss using a sequential prediction method. Optimal stopping rules are discussed using concepts of mean criteria, and numerical results are presented.
Multiple Imputation To Correct For Measurement Error In Admixture Estimates In Genetic Structured Association Testing, Miguel A. Padilla, Jamin Divers, Laura K. Vaughan, David B. Allison, Hemant K. Tiwari
Multiple Imputation To Correct For Measurement Error In Admixture Estimates In Genetic Structured Association Testing, Miguel A. Padilla, Jamin Divers, Laura K. Vaughan, David B. Allison, Hemant K. Tiwari
Psychology Faculty Publications
Objectives: Structured association tests ( SAT), like any statistical model, assumes that all variables are measured without error. Measurement error can bias parameter estimates and confound residual variance in linear models. It has been shown that admixture estimates can be contaminated with measurement error causing SAT models to suffer from the same afflictions. Multiple imputation (MI) is presented as a viable tool for correcting measurement error problems in SAT linear models with emphasis on correcting measurement error contaminated admixture estimates. Methods: Several MI methods are presented and compared, via simulation, in terms of controlling Type I error rates for both …
Modeling And Efficient Estimation Of Intra-Family Correlations, Roy Sabo
Modeling And Efficient Estimation Of Intra-Family Correlations, Roy Sabo
Mathematics & Statistics Theses & Dissertations
Familial data occur when observations are taken on multiple members of the same family. Due to relationships between these members, both genetic and by cohabitation, their response variables will likely exhibit some form of dependence. Most of the existing literature models this dependence with an equicorrelated structure. This structure is appropriate when the dependencies between family members are similar, such as in genetic studies, but not in cases where we expect the dependencies to differ, such as behavioral comparisons across different age groups. In this dissertation we first discuss an alternative structure based upon first-order autoregressive correlation. Specifically we create …
Estimation Of Parameters In Replicated Time Series Regression Models, Genming Shi
Estimation Of Parameters In Replicated Time Series Regression Models, Genming Shi
Mathematics & Statistics Theses & Dissertations
The time series regression model was widely studied in the literature by several authors. However, statistical analysis of replicated time series regression models has received little attention. In this thesis, we study the application of quasi-least squares, a relatively new method, to estimate the parameters in replicated time series models with general ARMA( p, q) correlation structure. We also study several established methods for estimating the parameters in those models, including the maximum likelihood, method of moments, and the GEE method. Asymptotic comparisons of the methods are made bV fixing the number of repeated measurements in each series, and …
Mark-Recapture Creel Survey And Survival Models, Shampa Saha
Mark-Recapture Creel Survey And Survival Models, Shampa Saha
Mathematics & Statistics Theses & Dissertations
In this dissertation, we consider a model based approach to the estimation of exploitation rate of a fish population by combining mark-recapture procedures with a creel survey. We also consider the analysis of a proportional hazards survival model for randomly censored observations, known as the Koziol-Green model. The model assumes that the lifetime survivor function is a power of the censored time survivor function.
In Chapter 2, we introduce the model based approach to the estimation of the exploitation rate of a fish population by combining mark-recapture procedures with a creel survey. We assume that in the beginning of a …
Comparing Traditional Statistical Models With Neural Network Models: The Case Of The Relation Of Human Performance Factors To The Outcomes Of Military Combat, William Oliver Hedgepeth
Comparing Traditional Statistical Models With Neural Network Models: The Case Of The Relation Of Human Performance Factors To The Outcomes Of Military Combat, William Oliver Hedgepeth
Engineering Management & Systems Engineering Theses & Dissertations
Statistics and neural networks are analytical methods used to learn about observed experience. Both the statistician and neural network researcher develop and analyze data sets, draw relevant conclusions, and validate the conclusions. They also share in the challenge of creating accurate predictions of future events with noisy data.
Both analytical methods are investigated. This is accomplished by examining the veridicality of both with real system data. The real system used in this project is a database of 400 years of historical military combat. The relationships among the variables represented in this database are recognized as being hypercomplex and nonlinear.
The …