Open Access. Powered by Scholars. Published by Universities.®

Applied Statistics Commons

Open Access. Powered by Scholars. Published by Universities.®

2020

Discipline
Institution
Keyword
Publication
Publication Type
File Type

Articles 1 - 30 of 175

Full-Text Articles in Applied Statistics

The Family Of Bicircular Matroids Closed Under Duality, Vaidy Sivaraman, Daniel Slilaty Dec 2020

The Family Of Bicircular Matroids Closed Under Duality, Vaidy Sivaraman, Daniel Slilaty

Mathematics and Statistics Faculty Publications

We characterize the 3-connected members of the intersection of the class of bicircular and cobi- circular matroids. Aside from some exceptional matroids with rank and corank at most 5, this class consists of just the free swirls and their minors.


Analysis And Implementation Of The Maximum Likelihood Expectation Maximization Algorithm For Find, Angus Boyd Jameson Dec 2020

Analysis And Implementation Of The Maximum Likelihood Expectation Maximization Algorithm For Find, Angus Boyd Jameson

Student Research Projects

This thesis presents an organized explanation and breakdown of the Maximum Likelihood Expectation Maximization image reconstruction algorithm. This background research was used to develop a means of implementing the algorithm into the imaging code for UNH's Field Deployable Imaging Neutron Detector to improve its ability to resolve complex neutron sources. This thesis provides an overview for this implementation scheme, and include the results of a couple of reconstruction tests for the algorithm. A discussion is given on the current state of the algorithm and its integration with the neutron detector system, and suggestions are given for how the work and …


Variation In Personality Among Semi-Wild Myanmar Timber Elephants, Sateesh Venkatesh Dec 2020

Variation In Personality Among Semi-Wild Myanmar Timber Elephants, Sateesh Venkatesh

Theses and Dissertations

This study examines two personality traits: exploration and neophobia, which could influence human-elephant conflicts. Thirty-one semi-wild elephants were tested over two trials using a custom novel puzzle tube containing three tasks and three rewards. Our studies show that elephants do vary significantly between individuals in both exploration and neophobia.


Bayesian Semi-Supervised Keyphrase Extraction And Jackknife Empirical Likelihood For Assessing Heterogeneity In Meta-Analysis, Guanshen Wang Dec 2020

Bayesian Semi-Supervised Keyphrase Extraction And Jackknife Empirical Likelihood For Assessing Heterogeneity In Meta-Analysis, Guanshen Wang

Statistical Science Theses and Dissertations

This dissertation investigates: (1) A Bayesian Semi-supervised Approach to Keyphrase Extraction with Only Positive and Unlabeled Data, (2) Jackknife Empirical Likelihood Confidence Intervals for Assessing Heterogeneity in Meta-analysis of Rare Binary Events.

In the big data era, people are blessed with a huge amount of information. However, the availability of information may also pose great challenges. One big challenge is how to extract useful yet succinct information in an automated fashion. As one of the first few efforts, keyphrase extraction methods summarize an article by identifying a list of keyphrases. Many existing keyphrase extraction methods focus on the unsupervised setting, …


Improved Statistical Methods For Time-Series And Lifetime Data, Xiaojie Zhu Dec 2020

Improved Statistical Methods For Time-Series And Lifetime Data, Xiaojie Zhu

Statistical Science Theses and Dissertations

In this dissertation, improved statistical methods for time-series and lifetime data are developed. First, an improved trend test for time series data is presented. Then, robust parametric estimation methods based on system lifetime data with known system signatures are developed.

In the first part of this dissertation, we consider a test for the monotonic trend in time series data proposed by Brillinger (1989). It has been shown that when there are highly correlated residuals or short record lengths, Brillinger’s test procedure tends to have significance level much higher than the nominal level. This could be related to the discrepancy between …


Confirmative Evaluation: New Cipp Evaluation Model, Tia L. Finney Dec 2020

Confirmative Evaluation: New Cipp Evaluation Model, Tia L. Finney

Journal of Modern Applied Statistical Methods

Struggling trainees often require a substantial investment of time, effort, and resources from medical educators. An emergent challenge involves developing effective ways to accurately identify struggling students and better understand the primary causal factors underlying their poor performance. Identifying the potential reasons for poor performance in medical school is a key first step in developing suitable remediation plans. The SOM Modified Program is a remediation program that aims to ensure academic success for medical students. The purpose of this study is to determine the impact of modifying the CIPP evaluation model by adding a confirmative evaluation step to the model. …


Root Stage Distributions And Their Importance In Plant-Soil Feedback Models, Tyler Poppenwimer Dec 2020

Root Stage Distributions And Their Importance In Plant-Soil Feedback Models, Tyler Poppenwimer

Doctoral Dissertations

Roots are fundamental to PSFs, being a key mediator of these feedbacks by interacting with and affecting the soil environment and soil microbial communities. However, most PSF models aggregate roots into a homogeneous component or only implicitly simulate roots via functions. Roots are not homogeneous and root traits (nutrient and water uptake, turnover rate, respiration rate, mycorrhizal colonization, etc.) vary with age, branch order, and diameter. Trait differences among a plant’s roots lead to variation in root function and roots can be disaggregated according to their function. The impact on plant growth and resource cycling of changes in the distribution …


Comparative Evaluation Of Statistical Dependence Measures, Eman Abdel Rahman Ibrahim Dec 2020

Comparative Evaluation Of Statistical Dependence Measures, Eman Abdel Rahman Ibrahim

Graduate Theses and Dissertations

Measuring and testing dependence between random variables is of great importance in many scientific fields. In the case of linearly correlated variables, Pearson’s correlation coefficient is a commonly used measure of the correlation strength. In the case of nonlinear correlation, several innovative measures have been proposed, such as distance-based correlation, rank-based correlations, and information theory-based correlation. This thesis focuses on the statistical comparison of several important correlations, including Spearman’s correlation, mutual information, maximal information coefficient, biweight midcorrelation, distance correlation, and copula correlation, under various simulation settings such as correlative patterns and the level of random noise. Furthermore, we apply those …


Statistical Approaches Of Gene Set Analysis With Quantitative Trait Loci For High-Throughput Genomic Studies., Samarendra Das Dec 2020

Statistical Approaches Of Gene Set Analysis With Quantitative Trait Loci For High-Throughput Genomic Studies., Samarendra Das

Electronic Theses and Dissertations

Recently, gene set analysis has become the first choice for gaining insights into the underlying complex biology of diseases through high-throughput genomic studies, such as Microarrays, bulk RNA-Sequencing, single cell RNA-Sequencing, etc. It also reduces the complexity of statistical analysis and enhances the explanatory power of the obtained results. Further, the statistical structure and steps common to these approaches have not yet been comprehensively discussed, which limits their utility. Hence, a comprehensive overview of the available gene set analysis approaches used for different high-throughput genomic studies is provided. The analysis of gene sets is usually carried out based on …


Incorporating Shear Resistance Into Debris Flow Triggering Model Statistics, Noah J. Lyman Dec 2020

Incorporating Shear Resistance Into Debris Flow Triggering Model Statistics, Noah J. Lyman

Master's Theses

Several regions of the Western United States utilize statistical binary classification models to predict and manage debris flow initiation probability after wildfires. As the occurrence of wildfires and large intensity rainfall events increase, so has the frequency in which development occurs in the steep and mountainous terrain where these events arise. This resulting intersection brings with it an increasing need to derive improved results from existing models, or develop new models, to reduce the economic and human impacts that debris flows may bring. Any development or change to these models could also theoretically increase the ease of collection, processing, and …


Quantifying The Simultaneous Effect Of Socio-Economic Predictors And Build Environment On Spatial Crime Trends, Alfieri Daniel Ek Dec 2020

Quantifying The Simultaneous Effect Of Socio-Economic Predictors And Build Environment On Spatial Crime Trends, Alfieri Daniel Ek

Graduate Theses and Dissertations

Proper allocation of law enforcement agencies falls under the umbrella of risk terrainmodeling (Caplan et al., 2011, 2015; Drawve, 2016) that primarily focuses on crime prediction and prevention by spatially aggregating response and predictor variables of interest. Although mental health incidents demand resource allocation from law enforcement agencies and the city, relatively less emphasis has been placed on building spatial models for mental health incidents events. Analyzing spatial mental health events in Little Rock, AR over 2015 to 2018, we found evidence of spatial heterogeneity via Moran’s I statistic. A spatial modeling framework is then built using generalized linear models, …


Aspects Of Causal Inference., John A. Craycroft Dec 2020

Aspects Of Causal Inference., John A. Craycroft

Electronic Theses and Dissertations

Observational studies differ from experimental studies in that assignment of subjects to treatments is not randomized but rather occurs due to natural mechanisms, which are usually hidden from researchers. Yet objectives of the two studies are frequently the same: identify the causal – rather than merely associational – relationship between some treatment or exposure and an outcome. The statistical issues that arise in properly analyzing observational data for this goal are numerous and fascinating, and these issues are encompassed in the domain of causal inference. The research presented in this dissertation explores several distinct aspects of causal inference. This dissertation …


Developing A Tourism Opportunity Index Regarding The Prospective Of Overtourism In Nepal, Susan Phuyal Dec 2020

Developing A Tourism Opportunity Index Regarding The Prospective Of Overtourism In Nepal, Susan Phuyal

MSU Graduate Theses

This research explores Nepal's overtourism scenario based on the capacity of a locality to manage sustainable tourism practices. Environmental degradation, local infrastructure degradation, negative tourist experience and local resident responses regarding visitors are the four main variables used in this study to analyze overtourism. In order to analyze the case study of overtourism, we select the three top touristic cities of Nepal, Kathmandu, Pokhara, and Chitwan based on the number of annual visitors. Nepal's case analysis of overtourism conditions reviews the overall threat of over-tourism and establishes a metric by which tourism can be viewed as potentially detrimental to sustainability. …


Statistical Methods With A Focus On Joint Outcome Modeling And On Methods For Fire Science, Da Zhong Xi Nov 2020

Statistical Methods With A Focus On Joint Outcome Modeling And On Methods For Fire Science, Da Zhong Xi

Electronic Thesis and Dissertation Repository

Understanding the dynamics of wildfires contributes significantly to the development of fire science. Challenges in the analysis of historical fire data include defining fire dynamics within existing statistical frameworks, modeling the duration and size of fires as joint outcomes, identifying the how fires are grouped into clusters of subpopulations, and assessing the effect of environmental variables in different modeling frameworks. We develop novel statistical methods to consider outcomes related to fire science jointly. These methods address these challenges by linking univariate models for separate outcomes through shared random effects, an approach referred to as joint modeling. Comparisons with existing …


Stochastic Analysis And Statistical Inference For Seir Models Of Infectious Diseases, Andrés Ríos-Gutiérrez, Viswanathan Arunachalam, Anuj Mubayi Nov 2020

Stochastic Analysis And Statistical Inference For Seir Models Of Infectious Diseases, Andrés Ríos-Gutiérrez, Viswanathan Arunachalam, Anuj Mubayi

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


Pharmacokinetics And Pharmacodynamics Models Of Tumor Growth And Anticancer Effects In Discrete Time, Ngoc Nguyen, Ferhan M. Atici Nov 2020

Pharmacokinetics And Pharmacodynamics Models Of Tumor Growth And Anticancer Effects In Discrete Time, Ngoc Nguyen, Ferhan M. Atici

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


Direct Questioning Of Sensitive Topics In Public Health Studies: A Simulation Study, Jessica K. Fox, Evrim Oral Nov 2020

Direct Questioning Of Sensitive Topics In Public Health Studies: A Simulation Study, Jessica K. Fox, Evrim Oral

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


A Study Of Sentiment Of Covid-19 Related Tweets In The Usa, Jack Luu, Rosangela Follmann Nov 2020

A Study Of Sentiment Of Covid-19 Related Tweets In The Usa, Jack Luu, Rosangela Follmann

Annual Symposium on Biomathematics and Ecology Education and Research

No abstract provided.


Applying The Data: Predictive Analytics In Sport, Anthony Teeter, Margo Bergman Nov 2020

Applying The Data: Predictive Analytics In Sport, Anthony Teeter, Margo Bergman

Access*: Interdisciplinary Journal of Student Research and Scholarship

The history of wagering predictions and their impact on wide reaching disciplines such as statistics and economics dates to at least the 1700’s, if not before. Predicting the outcomes of sports is a multibillion-dollar business that capitalizes on these tools but is in constant development with the addition of big data analytics methods. Sportsline.com, a popular website for fantasy sports leagues, provides odds predictions in multiple sports, produces proprietary computer models of both winning and losing teams, and provides specific point estimates. To test likely candidates for inclusion in these prediction algorithms, the authors developed a computer model, and test …


Cash Flow Forecasting Using Probabilistic Neural Networks, Marwan Ashour Nov 2020

Cash Flow Forecasting Using Probabilistic Neural Networks, Marwan Ashour

Journal of the Arab American University مجلة الجامعة العربية الامريكية للبحوث

This paper aimed to compare the modern methods of cash flow forecasting with the traditional ones. In other words, the researcher compared between the Probabilistic Neural Networks and Transfer Function. It is worth mentioning that cash flow forecasting , nowadays, is very important and helps the upper management plan, control, assess the performance and make decisions. More specifically, in this paper, the Artificial Neural networks were used to diagnose the nature of the cash flow for the next period of time and then forecast the cash flow. The experiment was conducted in The General company for Electricity Distribution in Baghdad. …


Interval Estimation Of Proportion Of Second-Level Variance In Multi-Level Modeling, Steven Svoboda Oct 2020

Interval Estimation Of Proportion Of Second-Level Variance In Multi-Level Modeling, Steven Svoboda

The Nebraska Educator: A Student-Led Journal

Physical, behavioral and psychological research questions often relate to hierarchical data systems. Examples of hierarchical data systems include repeated measures of students nested within classrooms, nested within schools and employees nested within supervisors, nested within organizations. Applied researchers studying hierarchical data structures should have an estimate of the intraclass correlation coefficient (ICC) for every nested level in their analyses because ignoring even relatively small amounts of interdependence is known to inflate Type I error rate in single-level models. Traditionally, researchers rely upon the ICC as a point estimate of the amount of interdependency in their data. Recent methods utilizing an …


Robustness Of The Ewma Sampling Plan To Non-Normality, Uttama Mishra, S. Siddiqui, J. R. Singh Oct 2020

Robustness Of The Ewma Sampling Plan To Non-Normality, Uttama Mishra, S. Siddiqui, J. R. Singh

Journal of Modern Applied Statistical Methods

The effect of non-normality on the OC function of the sampling plan under EWMA is studied by deriving the OC function for a non-normal population represented by the first four terms of an Edgeworth series.


Misguided Opposition To Multiplicity Adjustment Remains A Problem, Andrew V. Frane Oct 2020

Misguided Opposition To Multiplicity Adjustment Remains A Problem, Andrew V. Frane

Journal of Modern Applied Statistical Methods

Fallacious arguments against multiplicity adjustment have been cited with increasing frequency to defend unadjusted tests. These arguments and their enduring impact are discussed in this paper.


Task Interrupted By A Poisson Process, Jarrett Christopher Nantais Oct 2020

Task Interrupted By A Poisson Process, Jarrett Christopher Nantais

Major Papers

We consider a task which has a completion time T (if not interrupted), which is a random variable with probability density function (pdf) f(t), t>0. Before it is complete, the task may be interrupted by a Poisson process with rate lambda. If that happens, then the task must begin again, with the same completion time random variable T, but with a potentially different realization. These interruptions can reoccur, until eventually the task is finished, with a total time of W. In this paper, we will find the Laplace Transform of W in several special cases.


Economic Design Of X̅ Control Chart Under Double Ewma, Manzoor A. Khanday, J. R. Singh Oct 2020

Economic Design Of X̅ Control Chart Under Double Ewma, Manzoor A. Khanday, J. R. Singh

Journal of Modern Applied Statistical Methods

Designing of parameters plays an important role in economic design of control charts for lowering the cost and time. Manipulating sample size (n) and sampling interval (h), the effect of double exponentially weighted moving average (DEWMA) model was studied for the Economic Design (ED) of control chart. Optimum sizes and level were obtained when the characteristics of an item possesses DEWMA model. When shifts are uncertain the optimal design for DEWMA chart should be more conservative and should be implemented for benefiting the consumers as well as producers.


An Investigation Of Chi-Square And Entropy Based Methods Of Item-Fit Using Item Level Contamination In Item Response Theory, William R. Dardick, Brandi A. Weiss Oct 2020

An Investigation Of Chi-Square And Entropy Based Methods Of Item-Fit Using Item Level Contamination In Item Response Theory, William R. Dardick, Brandi A. Weiss

Journal of Modern Applied Statistical Methods

New variants of entropy as measures of item-fit in item response theory are investigated. Monte Carlo simulation(s) examine aberrant conditions of item-level misfit to evaluate relative (compare EMRj, X2, G2, S-X2, and PV-Q1) and absolute (Type I error and empirical power) performance. EMRj has utility in discovering misfit.


A Monte Carlo Analysis Of Ordinary Least Squares Versus Equal Weights, James Brewer Ayres Oct 2020

A Monte Carlo Analysis Of Ordinary Least Squares Versus Equal Weights, James Brewer Ayres

Masters Theses & Specialist Projects

Equal weights are an alternative weighting procedure to the optimal weights offered by ordinary least squares regression analysis. Also called units weights, equal weights are formed by standardizing scores on the predictor variables and averaging these standardized scores to create a composite score. Research is limited regarding the conditions under which equal weights result in cross-validated 𝑅𝑅2 values that meet or exceed optimal weights. In this study, I explored the effect of various predictor-criterion correlations, predictor intercorrelations, and sample sizes to determine the relative performance of equal and optimal weighting schemes upon cross-validation. Results indicated that optimally weighted predictors explained …


Logistic Regression Under Sparse Data Conditions, David A. Walker, Thomas J. Smith Sep 2020

Logistic Regression Under Sparse Data Conditions, David A. Walker, Thomas J. Smith

Journal of Modern Applied Statistical Methods

The impact of sparse data conditions was examined among one or more predictor variables in logistic regression and assessed the effectiveness of the Firth (1993) procedure in reducing potential parameter estimation bias. Results indicated sparseness in binary predictors introduces bias that is substantial with small sample sizes, and the Firth procedure can effectively correct this bias.


Estimating A Multilevel Model With Complex Survey Data: Demonstration Using Timss, Julie Lorah Sep 2020

Estimating A Multilevel Model With Complex Survey Data: Demonstration Using Timss, Julie Lorah

Journal of Modern Applied Statistical Methods

Analysis of complex survey data is demonstrated for the multilevel model. Description of specific aspects of analysis, including plausible values, sampling weights, and replicate weights is provided. Following this, example TIMSS data and models are described and results are presented.


Concomitant Of Order Statistics From New Bivariate Gompertz Distribution, Sumit Kumar, M. J. S. Khan, Surinder Kumar Sep 2020

Concomitant Of Order Statistics From New Bivariate Gompertz Distribution, Sumit Kumar, M. J. S. Khan, Surinder Kumar

Journal of Modern Applied Statistical Methods

For the new bivariate Gompertz distribution, the expression for probability density function (pdf) of rth order statistics and pdf of concomitant arising from rth order statistics are derived. The properties of concomitant arising from the corresponding order statistics are used to derive these results. The exact expression for moment generating function (mgf) of concomitant of order rth statistics is derived. Also, the mean of concomitant arising from rth order statistics is computed using the mgf of concomitant of rth order statistics, and the exact expression for joint density of concomitant of two non-adjacent order statistics …