Open Access. Powered by Scholars. Published by Universities.®

Applied Statistics Commons

Open Access. Powered by Scholars. Published by Universities.®

Conference

Discipline
Institution
Keyword
Publication Year
Publication
File Type

Articles 1 - 30 of 609

Full-Text Articles in Applied Statistics

Multi-Type Branching Processes In Time-Varying Environments, Arash Jamshidpey May 2024

Multi-Type Branching Processes In Time-Varying Environments, Arash Jamshidpey

Biology and Medicine Through Mathematics Conference

No abstract provided.


Session 6: Model-Based Clustering Analysis On The Spatial-Temporal And Intensity Patterns Of Tornadoes, Yana Melnykov, Yingying Zhang, Rong Zheng Feb 2024

Session 6: Model-Based Clustering Analysis On The Spatial-Temporal And Intensity Patterns Of Tornadoes, Yana Melnykov, Yingying Zhang, Rong Zheng

SDSU Data Science Symposium

Tornadoes are one of the nature’s most violent windstorms that can occur all over the world except Antarctica. Previous scientific efforts were spent on studying this nature hazard from facets such as: genesis, dynamics, detection, forecasting, warning, measuring, and assessing. While we want to model the tornado datasets by using modern sophisticated statistical and computational techniques. The goal of the paper is developing novel finite mixture models and performing clustering analysis on the spatial-temporal and intensity patterns of the tornadoes. To analyze the tornado dataset, we firstly try a Gaussian distribution with the mean vector and variance-covariance matrix represented as …


Session 6: The Size-Biased Lognormal Mixture With The Entropy Regularized Algorithm, Tatjana Miljkovic, Taehan Bae Feb 2024

Session 6: The Size-Biased Lognormal Mixture With The Entropy Regularized Algorithm, Tatjana Miljkovic, Taehan Bae

SDSU Data Science Symposium

A size-biased left-truncated Lognormal (SB-ltLN) mixture is proposed as a robust alternative to the Erlang mixture for modeling left-truncated insurance losses with a heavy tail. The weak denseness property of the weighted Lognormal mixture is studied along with the tail behavior. Explicit analytical solutions are derived for moments and Tail Value at Risk based on the proposed model. An extension of the regularized expectation–maximization (REM) algorithm with Shannon's entropy weights (ewREM) is introduced for parameter estimation and variability assessment. The left-truncated internal fraud data set from the Operational Riskdata eXchange is used to illustrate applications of the proposed model. Finally, …


Data Quality Checks: Implementation With Popular Data Collection Crowdsourcing Platforms, James Down, Gregory Balkcom, Kristine Duncan, Ngan (An) Truong, Andrew Lewis Nov 2023

Data Quality Checks: Implementation With Popular Data Collection Crowdsourcing Platforms, James Down, Gregory Balkcom, Kristine Duncan, Ngan (An) Truong, Andrew Lewis

Symposium of Student Scholars

The utilization of online crowdsourcing platforms for data collection has increased over the past two decades in the field of public health due to the ease of use, the cost-saving benefits, the speed of the data collection process, and the accessibility of a potentially true representative population. Although these platforms offer many advantages to researchers, significant drawbacks exist, such as poor data quality, that threaten the reliability and validity of the study. Previous studies have examined data quality concerns, but differences in results arise due to variations in study designs, disciplinary contexts, and the platforms being investigated. Therefore, this study …


Baa-Ttling Sore Mouth In Sheep With Mathematical Modeling, David C. Elzinga, W. Christopher Strickland Nov 2023

Baa-Ttling Sore Mouth In Sheep With Mathematical Modeling, David C. Elzinga, W. Christopher Strickland

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


Msis-Glenn: Natural Selection In Wolves Leads To Domesticated Dogs Predicted By Agent-Based Model Simulations, Alex Capaldi, David C. Elzinga Nov 2023

Msis-Glenn: Natural Selection In Wolves Leads To Domesticated Dogs Predicted By Agent-Based Model Simulations, Alex Capaldi, David C. Elzinga

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


Incorporating Interventions To An Extended Seird Model With Vaccination: Application To Covid-19 In Qatar, Elizabeth B. Amona Nov 2023

Incorporating Interventions To An Extended Seird Model With Vaccination: Application To Covid-19 In Qatar, Elizabeth B. Amona

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


Mathematical Modeling Of The Impact Of Lobbying On Climate Policy, Andrew Jacoby, Claire Hannah, James Hutchinson, Jasmine Narehood, Aditi Ghosh, Padmanabhan Seshaiyer Nov 2023

Mathematical Modeling Of The Impact Of Lobbying On Climate Policy, Andrew Jacoby, Claire Hannah, James Hutchinson, Jasmine Narehood, Aditi Ghosh, Padmanabhan Seshaiyer

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


The Relationship Between Pets Owned And Gender Amongst College Students, Eman Asghar, Bryce Camuso, Josh Feirick Oct 2023

The Relationship Between Pets Owned And Gender Amongst College Students, Eman Asghar, Bryce Camuso, Josh Feirick

Harrisburg University Research Symposium: Highlighting Research, Innovation, & Creativity

There is a gap between the number of men and women that own dogs. The Mintel Press Team states that "71% of men aged 18-44 own a dog compared to 60% of their female counterparts" (Mintel Press Team, 2016, para: 1). While this statistic includes people who are college-aged, it does not mention college students specifically. Thus, research into the relationship between pets owned and gender amongst college students is an important topic that warrants looking into.


Reducing Uncertainty In Sea-Level Rise Prediction: A Spatial-Variability-Aware Approach, Subhankar Ghosh, Shuai An, Arun Sharma, Jayant Gupta, Shashi Shekhar, Aneesh Subramanian Oct 2023

Reducing Uncertainty In Sea-Level Rise Prediction: A Spatial-Variability-Aware Approach, Subhankar Ghosh, Shuai An, Arun Sharma, Jayant Gupta, Shashi Shekhar, Aneesh Subramanian

I-GUIDE Forum

Given multi-model ensemble climate projections, the goal is to accurately and reliably predict future sea-level rise while lowering the uncertainty. This problem is important because sea-level rise affects millions of people in coastal communities and beyond due to climate change's impacts on polar ice sheets and the ocean. This problem is challenging due to spatial variability and unknowns such as possible tipping points (e.g., collapse of Greenland or West Antarctic ice-shelf), climate feedback loops (e.g., clouds, permafrost thawing), future policy decisions, and human actions. Most existing climate modeling approaches use the same set of weights globally, during either regression or …


Statistical Methods To Generate Artificial Slot Floor Data For The Advancement Of Casino Related Research, Courtney Bonner, Anastasia (Stasi) D. Baran, Jason D. Fiege, Saman Muthukumarana May 2023

Statistical Methods To Generate Artificial Slot Floor Data For The Advancement Of Casino Related Research, Courtney Bonner, Anastasia (Stasi) D. Baran, Jason D. Fiege, Saman Muthukumarana

International Conference on Gambling & Risk Taking

Abstract:

A common difficulty when researching gambling topics is the availability of high-quality data sets for development and testing. Due to the high level of secrecy within the gambling industry, if data is obtained for research purposes it is often prohibitively obfuscated, incomplete, or aggregated. Although these data have allowed for advancement in academic work, it leaves both the researchers and readers left wondering about what would be possible if more detailed data sets were available. To mitigate the paucity of data available to researchers, we present a Markov chain-based statistical process for producing artificial event data for a simulated …


Stake Size And Wagering In A Professional Betting Environment – When Data Affects Decision Making, Anthony Bedford, Tristan Barnett May 2023

Stake Size And Wagering In A Professional Betting Environment – When Data Affects Decision Making, Anthony Bedford, Tristan Barnett

International Conference on Gambling & Risk Taking

In this work, we discuss the structure of a number of professional wagering organisations, and how they attempt to deal with the “Ender’s Game” effect – when knowledge of the true nature of the ‘war being wagered’ may have affected the process and choice of betting. We analyse the responses from professional wagering and betting organisations, whom operate predominately in Horseracing and sportsbetting, and they identify the importance of separation of decisions around choices to make and the stakes and size of wagers that are linked to the decisions. The proposed model, practically carried out by one company, is an …


Analytical Approach For Monitoring The Behavior Of Patients With Pancreatic Adenocarcinoma At Different Stages As A Function Of Time, Aditya Chakaborty Dr, Chris P. Tsokos Dr May 2023

Analytical Approach For Monitoring The Behavior Of Patients With Pancreatic Adenocarcinoma At Different Stages As A Function Of Time, Aditya Chakaborty Dr, Chris P. Tsokos Dr

Biology and Medicine Through Mathematics Conference

No abstract provided.


Employee Attrition: Analyzing Factors Influencing Job Satisfaction Of Ibm Data Scientists, Graham Nash Apr 2023

Employee Attrition: Analyzing Factors Influencing Job Satisfaction Of Ibm Data Scientists, Graham Nash

Symposium of Student Scholars

Employee attrition is a relevant issue that every business employer must consider when gauging the effectiveness of their employees. Whether or not an employee chooses to leave their job can come from a multitude of factors. As a result, employers need to develop methods in which they can measure attrition by calculating the several qualities of their employees. Factors like their age, years with the company, which department they work in, their level of education, their job role, and even their marital status are all considered by employers to assist in predicting employee attrition. This project will be analyzing a …


A Chairpersons Guide To Managing Time And Stress, Christian K. Hansen Mar 2023

A Chairpersons Guide To Managing Time And Stress, Christian K. Hansen

Academic Chairpersons Conference Proceedings

In this interactive workshop we discuss time and stress management specifically from the perspective of a department chairperson responsible for leading an academic department through numerous internal and external challenges. The focus will be on practical strategies for effective use of time, not only at a personal level, but also at a department wide level.


Two-Stage Approach For Forensic Handwriting Analysis, Ashlan J. Simpson, Danica M. Ommen Feb 2023

Two-Stage Approach For Forensic Handwriting Analysis, Ashlan J. Simpson, Danica M. Ommen

SDSU Data Science Symposium

Trained experts currently perform the handwriting analysis required in the criminal justice field, but this can create biases, delays, and expenses, leaving room for improvement. Prior research has sought to address this by analyzing handwriting through feature-based and score-based likelihood ratios for assessing evidence within a probabilistic framework. However, error rates are not well defined within this framework, making it difficult to evaluate the method and can lead to making a greater-than-expected number of errors when applying the approach. This research explores a method for assessing handwriting within the Two-Stage framework, which allows for quantifying error rates as recommended by …


A Characterization Of Bias Introduced Into Forensic Source Identification When There Is A Subpopulation Structure In The Relevant Source Population., Dylan Borchert, Semhar Michael, Christopher Saunders Feb 2023

A Characterization Of Bias Introduced Into Forensic Source Identification When There Is A Subpopulation Structure In The Relevant Source Population., Dylan Borchert, Semhar Michael, Christopher Saunders

SDSU Data Science Symposium

In forensic source identification the forensic expert is responsible for providing a summary of the evidence that allows for a decision maker to make a logical and coherent decision concerning the source of some trace evidence of interest. The academic consensus is usually that this summary should take the form of a likelihood ratio (LR) that summarizes the likelihood of the trace evidence arising under two competing propositions. These competing propositions are usually referred to as the prosecution’s proposition, that the specified source is the actual source of the trace evidence, and the defense’s proposition, that another source in a …


Application Of Gaussian Mixture Models To Simulated Additive Manufacturing, Jason Hasse, Semhar Michael, Anamika Prasad Feb 2023

Application Of Gaussian Mixture Models To Simulated Additive Manufacturing, Jason Hasse, Semhar Michael, Anamika Prasad

SDSU Data Science Symposium

Additive manufacturing (AM) is the process of building components through an iterative process of adding material in specific designs. AM has a wide range of process parameters that influence the quality of the component. This work applies Gaussian mixture models to detect clusters of similar stress values within and across components manufactured with varying process parameters. Further, a mixture of regression models is considered to simultaneously find groups and also fit regression within each group. The results are compared with a previous naive approach.


Finite Mixture Modeling For Hierarchically Structured Data With Application To Keystroke Dynamics, Andrew Simpson, Semhar Michael Feb 2023

Finite Mixture Modeling For Hierarchically Structured Data With Application To Keystroke Dynamics, Andrew Simpson, Semhar Michael

SDSU Data Science Symposium

Keystroke dynamics has been used to both authenticate users of computer systems and detect unauthorized users who attempt to access the system. Monitoring keystroke dynamics adds another level to computer security as passwords are often compromised. Keystrokes can also be continuously monitored long after a password has been entered and the user is accessing the system for added security. Many of the current methods that have been proposed are supervised methods in that they assume that the true user of each keystroke is known apriori. This is not always true for example with businesses and government agencies which have internal …


Session 8: Ensemble Of Score Likelihood Ratios For The Common Source Problem, Federico Veneri, Danica M. Ommen Feb 2023

Session 8: Ensemble Of Score Likelihood Ratios For The Common Source Problem, Federico Veneri, Danica M. Ommen

SDSU Data Science Symposium

Machine learning-based Score Likelihood Ratios have been proposed as an alternative to traditional Likelihood Ratios and Bayes Factor to quantify the value of evidence when contrasting two opposing propositions.

Under the common source problem, the opposing proposition relates to the inferential problem of assessing whether two items come from the same source. Machine learning techniques can be used to construct a (dis)similarity score for complex data when developing a traditional model is infeasible, and density estimation is used to estimate the likelihood of the scores under both propositions.

In practice, the metric and its distribution are developed using pairwise comparisons …


Learning From Public Spaces In Historic Cities, Cody Josh Kucharski Nov 2022

Learning From Public Spaces In Historic Cities, Cody Josh Kucharski

Symposium of Student Scholars

Successful public spaces in cities are key for enhancing social cohesion and improving health and safety. Learning from historic cities involves the development of representational and analytical tools aimed at capturing their essence as places of human interaction. The research reports findings of the spatial analysis of twenty Adriatic and Ionian coastal cities, which addresses the question of how the network of public spaces calibrates different degrees of spatial enclosure necessary for creating successful social interactions. Cities in the littoral region include well-preserved historic centers that are renowned for the successful integration of urban squares into the urban fabric. For …


Evaluation Of Circular Logistic Regression Models With Asymmetrical Link Functions, Feridun Tasdan Nov 2022

Evaluation Of Circular Logistic Regression Models With Asymmetrical Link Functions, Feridun Tasdan

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


Incorporating Interventions To An Extended Seird Model With Vaccination: Application To Covid-19 In Qatar, Elizabeth Amona Nov 2022

Incorporating Interventions To An Extended Seird Model With Vaccination: Application To Covid-19 In Qatar, Elizabeth Amona

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


Bias-Corrected Bagging In Active Learning With An Actuarial Application, Yangxuan Xu Aug 2022

Bias-Corrected Bagging In Active Learning With An Actuarial Application, Yangxuan Xu

Undergraduate Student Research Internships Conference

The variable annuity (VA) is a modern insurance product that offers certain guaranteed protection and tax-deferred treatment. Because of the inherent complexity of guarantees’ payoff, the closed-form solution of fair market values (FMVs) is often not available. Most insurance companies depend on Monte Carlo (MC) simulation to price the FMVs of these products, which is an extremely computational intensive and time-consuming approach. The metamodeling approach can be used to circumvent the heavy computation.

In the modeling stage, the bagged tree method has proved to outperform other parametric approaches. Also, a bias-corrected (BC) bagging model was tried and showed significant improvement …


The Q-Analogue Of The Extended Generalized Gamma Distribution, Wenhao Chen Aug 2022

The Q-Analogue Of The Extended Generalized Gamma Distribution, Wenhao Chen

Undergraduate Student Research Internships Conference

This project introduces a flexible univariate probability model referred to as the q-analogue of the Extended Generalized Gamma (or q-EGG) distribution, which encompasses the majority of the most frequently used continuous distributions, including the gamma, Weibull, logistic, type-1 and type-2 beta, Gaussian, Cauchy, Student-t and F. Closed form representations of its moments and cumulative distribution function are provided. Additionally, computational techniques are proposed for determining estimates of its parameters. Both the method of moments and the maximum likelihood approach are utilized. The effect of each parameter is also graphically illustrated. Certain data sets are modeled with q-EGG distributions; goodness of …


Investigation Of Key Factors To Earthquake Insurance Take-Up Rates In Quebec And British Columbia Households And Prediction Model Building, Yongcheng Jiang Aug 2022

Investigation Of Key Factors To Earthquake Insurance Take-Up Rates In Quebec And British Columbia Households And Prediction Model Building, Yongcheng Jiang

Undergraduate Student Research Internships Conference

Maintaining an adequate level of earthquake take-up rate could protect the insurance industry from systemic failure. Past research has shown that British Columbia and Quebec have significant differences in earthquake insurance take-up rate. This report investigates key factors from the structure (default options and various types) of the insurance plan and personal characteristics along with socioeconomic/demographic profiles that affect the demand for earthquake protection in the form of insurance. The report also provides a prediction model for earthquake insurance take-up rate. The results show an importance ranking of key factors of earthquake insurance take up, the most important three are …


Financial Literacy: Self-Evaluation And Reality, Yangsijia Wang Aug 2022

Financial Literacy: Self-Evaluation And Reality, Yangsijia Wang

Undergraduate Student Research Internships Conference

This study is on the topic of financial literacy, with the data source containing information on clients' demographic information and self-evaluation, change in account value, and trade record, three major problems were investigated: first, whether a client's demographic traits are related to his/her self-evaluation of financial knowledge level; second, does the trading behaviour differ for clients who self-identified as in different financial knowledge groups; and third, do people who self-identified as financially knowledgeable have better investment result. Data manipulation was done using SQL and R. Exploratory analysis including multiple types of plots and proportion tables was used to derive the …


Model Averaging In Agriculture And Natural Resources: What Is It? When Is It Useful? When Is It A Distraction?, Philip M. Dixon May 2022

Model Averaging In Agriculture And Natural Resources: What Is It? When Is It Useful? When Is It A Distraction?, Philip M. Dixon

Conference on Applied Statistics in Agriculture and Natural Resources

I use two examples to illustrate three methods for model averaging: using AIC weights, using BIC weights, and fully Bayesian analyses. The first example is a capture-recapture study that estimates the population size by averaging over 4 models for capture probabilities. The second is an analysis of a study of logging impacts on Curculionid weevils using a before-after-control-impact (BACI) study design. The estimated impact is averaged over 4 ecologically relevant models.

Both examples demonstrate the sensitivity of model weights, or posterior model probabilities, to the choice of prior model probabilities and prior distributions for parameters. The model averaged estimates and …


A Robust Clustering Method Using Compositional Data Restrictions: Studying Wood Properties In The Reforestation Of Portugal, Pamela M. Chiroque-Solano, Guido A. Moreira May 2022

A Robust Clustering Method Using Compositional Data Restrictions: Studying Wood Properties In The Reforestation Of Portugal, Pamela M. Chiroque-Solano, Guido A. Moreira

Conference on Applied Statistics in Agriculture and Natural Resources

Classification of multivariate observations while preserving the data’s natural restriction is a challenge. Special properties such as identifiability, interpretability, and others need to be cared for to build a new approach. To avoid these complications, many transformation algorithms have been developed to use traditional models.In this context, the aim of this work is to propose a robust probabilistic distance algorithm to classify compositional data. Based on the probabilistic distance (PD) clustering approach, the proposal identifies clusters minimizing a joint distance function, JDF, which is part of a dissimilarity measure. This measure combines the PD clustering approach with the density of …


Random Regression For Modeling Semen Fertility In Hf Purebred And Crossbred Bulls Using A Bayesian Framework, Vrinda Ambike, R. Venkataramanan, S. M. K. Karthickeyan, K. G. Tirumurugaan, Kaustubh Bhave, M. Swaminathan May 2022

Random Regression For Modeling Semen Fertility In Hf Purebred And Crossbred Bulls Using A Bayesian Framework, Vrinda Ambike, R. Venkataramanan, S. M. K. Karthickeyan, K. G. Tirumurugaan, Kaustubh Bhave, M. Swaminathan

Conference on Applied Statistics in Agriculture and Natural Resources

Data on insemination records of Holstein Friesian (HF) purebred (n=45,497) and crossbred (n=58,497) collected from the BAIF Research Foundation were utilized. The conception rate was modeled as a binary trait, using linear repeatability models. Random regression models (RRM) were used to obtain the trajectory of variance components across age of the bulls. Legendre Polynomials up to order of fit of 4 were used for the random effects of additive genetic and permanent environmental effects. 200,000 Gibbs samples were generated with a burn-in of 20,000 and thinning interval of 50 using the THRGIBBS1F90 program. Heritability estimates were very low (0.1) in …