Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Other Statistics and Probability

2021

Institution
Keyword
Publication
Publication Type
File Type

Articles 1 - 30 of 30

Full-Text Articles in Physical Sciences and Mathematics

Approximate Likelihood Based Estimations For Joint Models With Intractable Likelihoods, Karl Stessy M. Bisselou Dec 2021

Approximate Likelihood Based Estimations For Joint Models With Intractable Likelihoods, Karl Stessy M. Bisselou

Theses & Dissertations

This dissertation focuses on the development of approximation approaches for the joint modeling (JM) of repeated measures data and time-to-event data in the presence of analytically or numerically intractable likelihoods. Current likelihood-based inferences for JMs show several limitations including (i) intractability of integrals during marginal likelihood derivations due to the complexity in computations, and (ii) the large number of nuisance parameters (unobserved) posing a problem with convergence. The h-likelihood (HL) and synthetic likelihood (SL) are two computationally efficient estimation approaches that overcome these challenges.

In the presence of extremely high censoring rates, the HL can produce bias parameter estimates. We …


Confidence Interval For The Mean Of A Beta Distribution, Sean Rangel Dec 2021

Confidence Interval For The Mean Of A Beta Distribution, Sean Rangel

Electronic Theses and Dissertations

Statistical inference for the mean of a beta distribution has become increasingly popular in various fields of academic research. In this study, we developed a novel statistical model from likelihood-based techniques to evaluate various confidence interval techniques for the mean of a beta distribution. Simulation studies will be implemented to compare the performance of the confidence intervals. In addition to the development and study involving confidence intervals, we will also apply the confidence intervals to real biological data that was gathered by the Department of Biology at Stephen F. Austin State University and provide recommendations on the best practice.


Faculty Versus Student Repeatability On Evaluating Translucency Of The Anterior Dentition, James L. Sheets, David B. Marx, Nina Ariani, Valentim A. R. Barão, Alvin G. Wee Oct 2021

Faculty Versus Student Repeatability On Evaluating Translucency Of The Anterior Dentition, James L. Sheets, David B. Marx, Nina Ariani, Valentim A. R. Barão, Alvin G. Wee

Department of Statistics: Faculty Publications

The objective was to compare the repeatability between dental faculty, whose clinical practice was primarily restorative dentistry, and final year dental students in categorizing the inherent translucency of images selected at random using either a 3- or 7-point scale (translucent to opaque). Digital images of anterior dentition were randomly selected based on inherent translucency. Thirty images (five were repeated) were randomized and categorized by 20 dental students and 20 faculty on their inherent translucency. Statistical analysis was performed using an F test for analysis of variance at 95% confidence interval. A covariance parameter estimate (CPE) was accomplished to compare the …


Incorporating Molecular Markers And Causal Structure Among Traits Using A Smith-Hazel Index And Structural Equation Models, Juan Valente Hidalgo-Contreras, Josafhat Salinas-Ruiz, Kent M. Eskridge, Stephen P. Baenziger Sep 2021

Incorporating Molecular Markers And Causal Structure Among Traits Using A Smith-Hazel Index And Structural Equation Models, Juan Valente Hidalgo-Contreras, Josafhat Salinas-Ruiz, Kent M. Eskridge, Stephen P. Baenziger

Department of Statistics: Faculty Publications

The goal in breeding programs is to choose candidates that produce offspring with the best phenotypes. In conventional selection, the best candidate is selected with high genotypic values (unobserved), in the assumption that this is related to the observed phenotypic values for several traits. Multi-trait selection indices are used to identify superior genotypes when a number of traits are to be considered simultaneously. Often, the causal relationship among the traits is well known. Structural equation models (SEM) have been used to describe the causal relationships among variables in many biological systems. We present a method for multi-trait genomic selection that …


Novel Statistical Modeling Methods For Traffic Video Analysis, Hang Shi Aug 2021

Novel Statistical Modeling Methods For Traffic Video Analysis, Hang Shi

Dissertations

Video analysis is an active and rapidly expanding research area in computer vision and artificial intelligence due to its broad applications in modern society. Many methods have been proposed to analyze the videos, but many challenging factors remain untackled. In this dissertation, four statistical modeling methods are proposed to address some challenging traffic video analysis problems under adverse illumination and weather conditions.

First, a new foreground detection method is presented to detect the foreground objects in videos. A novel Global Foreground Modeling (GFM) method, which estimates a global probability density function for the foreground and applies the Bayes decision rule …


Posterior Propriety Of An Objective Prior For Generalized Hierarchical Normal Linear Models, Cong Lin, Dongchu Sun, Chengyuan Song Aug 2021

Posterior Propriety Of An Objective Prior For Generalized Hierarchical Normal Linear Models, Cong Lin, Dongchu Sun, Chengyuan Song

Department of Statistics: Faculty Publications

Bayesian Hierarchical models has been widely used in modern statistical application. To deal with the data having complex structures, we propose a generalized hierarchical normal linear (GHNL) model which accommodates arbitrarily many levels, usual design matrices and ‘vanilla’ covariance matrices. Objective hyperpriors can be employed for the GHNL model to express ignorance or match frequentist properties, yet the common objective Bayesian approaches are infeasible or fraught with danger in hierarchical modelling. To tackle this issue, [Berger, J., Sun, D., & Song, C. (2020b). An objective prior for hyperparameters in normal hierarchical models. Journal of Multivariate Analysis, 178, 104606. https://doi.org/10.1016/j.jmva.2020.104606] …


Factors Influencing Student Outcomes In A Large, Online Simulation-Based Introductory Statistics Course, Ella M. Burnham Aug 2021

Factors Influencing Student Outcomes In A Large, Online Simulation-Based Introductory Statistics Course, Ella M. Burnham

Department of Statistics: Dissertations, Theses, and Student Work

The demand for statistical knowledge and skills is growing in many disciplines, so more students are enrolling in introductory statistics courses (Blair, Kirkman, & Maxwell, 2018). At the same time, institutions are seeking course delivery methods that allow for greater flexibility for students, especially following the onset of the COVID-19 pandemic; therefore, there is more interest in the development and delivery of online introductory statistics courses.

To address this, I collaboratively designed an online introductory statistics course which focuses on simulation-based inference for the University of Nebraska-Lincoln. The course design was informed by the Community of Inquiry framework (Garrison, Anderson, …


Prediction Intervals: The Effects And Identification Of Sparse Regions For Nonparametric Regression Methods, Jackson Faires Aug 2021

Prediction Intervals: The Effects And Identification Of Sparse Regions For Nonparametric Regression Methods, Jackson Faires

Electronic Theses and Dissertations

In this work, we provide an overview of different nonparametric methods for prediction interval estimation and investigate how well they perform when making predictions in sparse regions of the predictor space. This sparsity is an extension to the more common concept of extrapolation in linear regression settings. Using simulation studies, we show that coverage probabilities using prediction intervals from quantile k-nearest neighbors and quantile random forest can be biased to low or too high from the nominal level under various situations of sparsity. We also introduce a test that can be used to see if a new data point lies …


Fully Bayesian Analysis Of Relevance Vector Machine Classification With Probit Link Function For Imbalanced Data Problem, Wenyang Wang, Dongchu Sun, Peng Shao, Haibo Kuang, Cong Sui Jun 2021

Fully Bayesian Analysis Of Relevance Vector Machine Classification With Probit Link Function For Imbalanced Data Problem, Wenyang Wang, Dongchu Sun, Peng Shao, Haibo Kuang, Cong Sui

Department of Statistics: Faculty Publications

The original RVM classification model uses the logistic link function to build the likelihood function making the model hard to be conducted since the posterior of the weight parameter has no closed-form solution. This article proposes the probit link function approach instead of the logistic one for the likelihood function in the RVM classification model, namely PRVM (RVM with the probit link function). We show that the posterior of the weight parameter in PRVM follows the Multivariate Normal distribution and achieves a closed-form solution. A latent variable is needed in our algorithms to simplify the Bayesian computation greatly, and its …


Application Of Randomness In Finance, Jose Sanchez, Daanial Ahmad, Satyanand Singh May 2021

Application Of Randomness In Finance, Jose Sanchez, Daanial Ahmad, Satyanand Singh

Publications and Research

Brownian Motion which is also considered to be a Wiener process and can be thought of as a random walk. In our project we had briefly discussed the fluctuations of financial indices and related it to Brownian Motion and the modeling of Stock prices.


Association Between Stream Impairment By Mercury And Superfund Sites In The Conterminous Usa, Karessa L. Manning May 2021

Association Between Stream Impairment By Mercury And Superfund Sites In The Conterminous Usa, Karessa L. Manning

Masters Theses

Mercury is a natural element that can cause harm to the brain, heart, kidneys, lungs, and immune system, especially to fetuses developing in the womb. Many natural and anthropogenic factors contribute to mercury in the environment, such as geologic deposits, landfills, gold and silver mining operations, cement production, and atmospheric deposition. Mercury has been identified as a contaminant of concern at many National Priority List (NPL) sites, however, studies on contamination at NPL sites are often only conducted on a local level. This study was to analyze the potential connection between mercury-contaminated NPL sites and the presence of mercury impaired …


Does Defense Actually Win Championships? Using Statistics To Examine One Of The Greatest Stereotypes In Sports, Thomas Burkett Apr 2021

Does Defense Actually Win Championships? Using Statistics To Examine One Of The Greatest Stereotypes In Sports, Thomas Burkett

Senior Theses

A common saying in sports is that “defense wins championships.” However, the past decade of play in the modern NBA has seen a rise and focus in offensive efficiency and 3-pointers. This thesis tests whether defense can truly predict a championship winning team in today’s NBA through two-sample hypothesis testing and multiple logistic regression models. The results found that both defensive and offensive statistics were significant predictors of championship teams, meaning that a balanced team, rather than one specialized in defense alone, is a more accurate predictor of championship success.


The Fundamental Limit Theorem Of Countable Markov Chains, Nathanael Gentry Apr 2021

The Fundamental Limit Theorem Of Countable Markov Chains, Nathanael Gentry

Senior Honors Theses

In 1906, the Russian probabilist A.A. Markov proved that the independence of a sequence of random variables is not a necessary condition for a law of large numbers to exist on that sequence. Markov's sequences -- today known as Markov chains -- touch several deep results in dynamical systems theory and have found wide application in bibliometrics, linguistics, artificial intelligence, and statistical mechanics. After developing the appropriate background, we prove a modern formulation of the law of large numbers (fundamental theorem) for simple countable Markov chains and develop an elementary notion of ergodicity. Then, we apply these chain convergence results …


Adventuring Into Complexity By Exploring Data: From Complicity To Sustainability, Tim Lutz Mar 2021

Adventuring Into Complexity By Exploring Data: From Complicity To Sustainability, Tim Lutz

Northeast Journal of Complex Systems (NEJCS)

Problems of sustainability are typically represented by major present-day challenges such as climate change, biodiversity loss, and environmental and social injustice. Framed this way, sustainable lives and societies depend on finding solutions to each problem. From another perspective, there is only one problem behind them all, stated by Gregory Bateson as: “…the difference between how nature works and the way people think,” and complexity provides a way to define and approach this problem. I extend Edgar Morin’s conceptions of restricted and general complexity into pedagogy to address problems of simplicity and reductionist teaching. The proposed pedagogy is based on long …


Entropic Dynamics Of Networks, Felipe Xavier Costa, Pedro Pessoa Mar 2021

Entropic Dynamics Of Networks, Felipe Xavier Costa, Pedro Pessoa

Northeast Journal of Complex Systems (NEJCS)

Here we present the entropic dynamics formalism for networks. That is, a framework for the dynamics of graphs meant to represent a network derived from the principle of maximum entropy and the rate of transition is obtained taking into account the natural information geometry of probability distributions. We apply this framework to the Gibbs distribution of random graphs obtained with constraints on the node connectivity. The information geometry for this graph ensemble is calculated and the dynamical process is obtained as a diffusion equation. We compare the steady state of this dynamics to degree distributions found on real-world networks.


Sars-Cov-2 Pandemic Analytical Overview With Machine Learning Predictability, Anthony Tanaydin, Jingchen Liang, Daniel W. Engels Jan 2021

Sars-Cov-2 Pandemic Analytical Overview With Machine Learning Predictability, Anthony Tanaydin, Jingchen Liang, Daniel W. Engels

SMU Data Science Review

Understanding diagnostic tests and examining important features of novel coronavirus (COVID-19) infection are essential steps for controlling the current pandemic of 2020. In this paper, we study the relationship between clinical diagnosis and analytical features of patient blood panels from the US, Mexico, and Brazil. Our analysis confirms that among adults, the risk of severe illness from COVID-19 increases with pre-existing conditions such as diabetes and immunosuppression. Although more than eight months into pandemic, more data have become available to indicate that more young adults were getting infected. In addition, we expand on the definition of COVID-19 test and discuss …


Review Of Social Workers Count: Numbers And Social Issues By Michael Anthony Lewis, Michael T. Catalano Jan 2021

Review Of Social Workers Count: Numbers And Social Issues By Michael Anthony Lewis, Michael T. Catalano

Numeracy

Lewis, Michael Anthony. 2017. Social Workers Count: Numbers and Social Issues. 2019. New York: Oxford University Press. 223 pp. ISBN 978-019046713-5

The numeracy movement, although largely birthed within the mathematics community, is an outside-the-box endeavor which has always sought to break down or at least transgress traditional disciplinary boundaries. Michael Anthony Lewis’s book is a testament that this effort is succeeding. Lewis is a social worker and sociologist with an impressive resume, author of Economics for Social Workers, co-editor of The Ethics and Economics of the Basic Income Guarantee, and member of the faculty at the Silberman School …


Analyzing And Creating Playing Card Cryptosystems, Isaac A. Reiter Jan 2021

Analyzing And Creating Playing Card Cryptosystems, Isaac A. Reiter

Honors Student Research

Before computers, military tacticians and government agents had to rely on pencil-and-paper methods to encrypt information. For agents that want to use low-tech options in order to minimize their digital footprint, non-computerized ciphers are an essential component of their toolbox. Still, the presence of computers limits the pool of effective hand ciphers. If a cipher is not unpredictable enough, then a computer will easily be able to break it. There are 52! ≈ 2^225.58 ways to mix a deck of cards. If each deck order is a key, this means that there are 52! ≈ 2^225.58 different ways to encrypt …


Moving Past ‘One Size Fits All’: Developing A Trajectory Deviance Index For Dynamic Measurement Modeling, Yixiao Dong Jan 2021

Moving Past ‘One Size Fits All’: Developing A Trajectory Deviance Index For Dynamic Measurement Modeling, Yixiao Dong

Electronic Theses and Dissertations

Dynamic Measurement Modeling (DMM) is a recently developed measurement framework for gauging developing constructs (e.g., learning capacity) that conventional single-timepoint tests cannot assess. Like most measurement models, overall model fit indices of DMM do not indicate the measurement appropriateness for each included student. For this reason, other measurement modeling paradigms (e.g., Item-Response Theory; IRT) utilize person-fit or model appropriateness statistics to indicate whether a measurement model appropriately describes the data from each individual student. However, within the extant DMM framework, no statistical index has yet been developed for this purpose. Thus, the current project advanced a person-specific DMM Trajectory Deviance …


Statistical Modeling Of Positive Peer Support On Longitudinal Adolescent Substance Use, Kady Rost Jan 2021

Statistical Modeling Of Positive Peer Support On Longitudinal Adolescent Substance Use, Kady Rost

Electronic Theses and Dissertations

To evaluate this study’s research question of ”Does the latent construct of Positive Peer Support (PPS) relate to the construct of Adolescent Substance Use (ASU) over time, controlling for neighborhood safety, race, and sex?”, Structural Equation (SEM) and Latent Growth Curve Modeling (LGCM) were used to investigate trajectories. Secondary longitudinal data from Zimmerman (2014) of 604 students enrolled for four consecutive years in public schools located in Flint, Michigan. In the secondary data resource, students who participated were declared “at risk” by GPA. Significant relationships were found in SEM: Positive Peer Support to Adolescent Substance Use, All Control Variables to …


Assessing The Variations Of Educational Attainment At National And Subnational Levels Using Hierarchical Linear Models, Bingxin Qi Jan 2021

Assessing The Variations Of Educational Attainment At National And Subnational Levels Using Hierarchical Linear Models, Bingxin Qi

Electronic Theses and Dissertations

Education is a human right, and equal access to education is not only crucial for an individual’s well-being, but also essential for eradicating poverty, ensuring long-term prosperity for all, transforming the society, and achieving sustainable development. Measuring education development, especially the variations of educational attainment, in a timely and accurate manner can help educators, practitioners, scientists, and policymakers compare and evaluate various education indicators at both subnational and national levels. This research presents an approach that combines multi-source and multidimensional data including population distribution, human settlement, and education data to assess and explore educational attainment trajectories at both national and …


A Grounded Theory Inquiry Into The Pedagogical Socialization Of Graduate Students Within Graduate Quantitative Methods Courses, Amanda Kay Thomas Jan 2021

A Grounded Theory Inquiry Into The Pedagogical Socialization Of Graduate Students Within Graduate Quantitative Methods Courses, Amanda Kay Thomas

Electronic Theses and Dissertations

Quantitative methods are one of the most highly technical fields of study within social sciences graduate programs. Although classroom pedagogy is an important factor connected to student success within graduate quantitative methods courses little is known on the pedagogical socialization experiences of masters and doctoral students. The purpose of this grounded theory inquiry was to discover graduate students perspectives on their pedagogical socialization experiences and the norms, values and role expectations transmitted during the teaching and learning of quantitative methods. Narrative data was collected from in-depth interviews among a theoretical sample of 31 masters and doctoral students enrolled in introductory, …


Development Of A Multiplex Real-Time Pcr Assay For Predicting Macrolide And Tetracycline Resistance Associated With Bacterial Pathogens Of Bovine Respiratory Disease, Enakshy Dutta, John Loy, Caitlyn A. Deal, Emily L. Wynn, Michael L. Clawson, Jennifer Clarke, Bing Wang Jan 2021

Development Of A Multiplex Real-Time Pcr Assay For Predicting Macrolide And Tetracycline Resistance Associated With Bacterial Pathogens Of Bovine Respiratory Disease, Enakshy Dutta, John Loy, Caitlyn A. Deal, Emily L. Wynn, Michael L. Clawson, Jennifer Clarke, Bing Wang

Department of Statistics: Faculty Publications

Antimicrobial resistance (AMR) in bovine respiratory disease (BRD) is an emerging concern that may threaten both animal and public health. Rapid and accurate detection of AMR is essential for prudent drug therapy selection during BRD outbreaks. This study aimed to develop a multiplex quantitative real-time polymerase chain reaction assay (qPCR) to provide culture-independent information regarding the phenotypic AMR status of BRD cases and an alternative to the gold-standard, culture-dependent test. Bovine clinical samples (297 lung and 111 nasal) collected in Nebraska were subjected to qPCR quantification of macrolide (MAC) and tetracycline (TET) resistance genes and gold-standard determinations of AMR of …


A Review Of Spatial Causal Inference Methods For Environmental And Epidemiological Applications, Brian J. Reich, Shu Yang, Yawen Guan, Andrew B. Giffin, Matthew J. Miller, Ana Rappold Jan 2021

A Review Of Spatial Causal Inference Methods For Environmental And Epidemiological Applications, Brian J. Reich, Shu Yang, Yawen Guan, Andrew B. Giffin, Matthew J. Miller, Ana Rappold

Department of Statistics: Faculty Publications

The scientific rigor and computational methods of causal inference have had great impacts on many disciplines but have only recently begun to take hold in spatial applications. Spatial causal inference poses analytic challenges due to complex correlation structures and interference between the treatment at one location and the outcomes at others. In this paper, we review the current literature on spatial causal inference and identify areas of future work. We first discuss methods that exploit spatial structure to account for unmeasured confounding variables. We then discuss causal analysis in the presence of spatial interference including several common assumptions used to …


Treatment Of Inconclusive Results In Firearms Error Rate Studies, Heike Hofmann, Susan Vanderplas, Alicia L. Carriquiry Jan 2021

Treatment Of Inconclusive Results In Firearms Error Rate Studies, Heike Hofmann, Susan Vanderplas, Alicia L. Carriquiry

Department of Statistics: Faculty Publications

★ Defining error rates for firearms evidence ★ Impact of inconclusive decisions on error rates ★ Predictive probabilities and errors


Computational Modeling For Decision-Making Under Climate Change Uncertainty: Reservoir Simulation Game, Julianne Quinn Jan 2021

Computational Modeling For Decision-Making Under Climate Change Uncertainty: Reservoir Simulation Game, Julianne Quinn

All ECSTATIC Materials

Almost every decision you make is under uncertainty. Will I need a rain jacket in the afternoon? Will they say yes if I ask them out? Is 1 hour enough time to finish this assignment? Oftentimes, we can use computational modeling to simulate different scenarios of what might happen in the future to inform what decisions are best on average, or what decisions minimize the worst case outcome. For example, you could decide what player to draft for your Fantasy Football team by simulating player performance. In this activity, we will simulate how much water to release from a dam …


Use Of Research Tradition And Design In Program Evaluation: An Explanatory Mixed Methods Study Of Practitioners’ Methodological Choices, Margaret Schultz Patel Jan 2021

Use Of Research Tradition And Design In Program Evaluation: An Explanatory Mixed Methods Study Of Practitioners’ Methodological Choices, Margaret Schultz Patel

Electronic Theses and Dissertations

The goal of this explanatory sequential mixed method study was to assess whether there were observable trends, associations, or group differences in evaluation methodology by settings and content area in published evaluations from the past ten years (quantitative), to illuminate how evaluation practitioners selected these methodologies (qualitative), and assess how emergent findings from each phase fit together or helped contextualize each other. In this study, methodology was operationalized as research tradition and method was operationalized as research design. For phase one (quantitative), a systematized ten-year review of five peer-reviewed evaluation journals was conducted and coded by journal, research tradition, research …


Writing At The Horizon: How Producing Imagined Narratives Affects Mood, David Yu-Zhong Liang Jan 2021

Writing At The Horizon: How Producing Imagined Narratives Affects Mood, David Yu-Zhong Liang

Senior Projects Fall 2021

The present study explores the effect of three different writing activities and their subsequent effects on participant mood. Writing has been of particular interest for psychologists due to its use in interventions aimed at working through traumatic or stressful periods, and recent research has begun to explore the use of narrative in placing traumatic events and experiences in greater context. However, purely therapeutic, intervention-based writing exercises exclude a large amount of more expressive, imagined creations and narratives, which may have the capacity to reorient, contextualize, and otherwise positively affect a person’s mood. This study investigates whether employing the imagination may …


Neither “Post-War” Nor Post-Pregnancy Paranoia: How America’S War On Drugs Continues To Perpetuate Disparate Incarceration Outcomes For Pregnant, Substance-Involved Offenders, Becca S. Zimmerman Jan 2021

Neither “Post-War” Nor Post-Pregnancy Paranoia: How America’S War On Drugs Continues To Perpetuate Disparate Incarceration Outcomes For Pregnant, Substance-Involved Offenders, Becca S. Zimmerman

Pitzer Senior Theses

This thesis investigates the unique interactions between pregnancy, substance involvement, and race as they relate to the War on Drugs and the hyper-incarceration of women. Using ordinary least square regression analyses and data from the Bureau of Justice Statistics’ 2016 Survey of Prison Inmates, I examine if (and how) pregnancy status, drug use, race, and their interactions influence two length of incarceration outcomes: sentence length and amount of time spent in jail between arrest and imprisonment. The results collectively indicate that pregnancy decreases length of incarceration outcomes for those offenders who are not substance-involved but not evenhandedly -- benefitting white …


Parametric, Nonparametric, And Semiparametric Linear Regression In Classical And Bayesian Statistical Quality Control, Chelsea L. Jones Jan 2021

Parametric, Nonparametric, And Semiparametric Linear Regression In Classical And Bayesian Statistical Quality Control, Chelsea L. Jones

Theses and Dissertations

Statistical process control (SPC) is used in many fields to understand and monitor desired processes, such as manufacturing, public health, and network traffic. SPC is categorized into two phases; in Phase I historical data is used to inform parameter estimates for a statistical model and Phase II implements this statistical model to monitor a live ongoing process. Within both phases, profile monitoring is a method to understand the functional relationship between response and explanatory variables by estimating and tracking its parameters. In profile monitoring, control charts are often used as graphical tools to visually observe process behaviors. We construct a …