Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability Commons

Open Access. Powered by Scholars. Published by Universities.®

Modeling

Discipline
Institution
Publication Year
Publication
Publication Type
File Type

Articles 1 - 30 of 55

Full-Text Articles in Statistics and Probability

Defensive Impact Wins: Developing A New Method To Rate Individual Defense In Nba Games, Dylan J. Stiles Jan 2024

Defensive Impact Wins: Developing A New Method To Rate Individual Defense In Nba Games, Dylan J. Stiles

Honors Theses and Capstones

With the analytics revolution in sports in the past 20 years, it seems that everything that can be quantified is. In basketball though, trying to break the game down into a set of numbers comes with a unique problem. While we've come up with a good set of advanced numbers to measure offensive efficiency, defense is fundamentally harder to quantify. The game is played five on five, but it has often been popular or convenient to model defense as a set of five one on one games. As defenses became more complex into the 2010s, this methodology became more insignificant. …


Traditional Vs Machine Learning Approaches: A Comparison Of Time Series Modeling Methods, Miguel E. Bonilla Jr., Jason Mcdonald, Tamas Toth, Bivin Sadler Aug 2023

Traditional Vs Machine Learning Approaches: A Comparison Of Time Series Modeling Methods, Miguel E. Bonilla Jr., Jason Mcdonald, Tamas Toth, Bivin Sadler

SMU Data Science Review

In recent years, various new Machine Learning and Deep Learning algorithms have been introduced, claiming to offer better performance than traditional statistical approaches when forecasting time series. Studies seeking evidence to support the usage of ML/DL over statistical approaches have been limited to comparing the forecasting performance of univariate, linear time series data. This research compares the performance of traditional statistical-based and ML/DL methods for forecasting multivariate and nonlinear time series.


Identifying Advantages To Teaching Linear Regression In A Modeling And Simulation Introductory Statistics Curriculum, Kit Harris Clement Jun 2023

Identifying Advantages To Teaching Linear Regression In A Modeling And Simulation Introductory Statistics Curriculum, Kit Harris Clement

Dissertations and Theses

Statistical association is a key facet of statistical literacy: claims based on relationships between variables or ideas rooted in data are found everywhere in media and discourse. A key development in introductory statistics curricula is the use of simulation-based inference, which has shown positive outcomes for students, especially in regards to statistical literacy and conceptual understanding. In this dissertation project, I investigate students from the Change Agents for the Teaching and Learning of STatistics (CATALST) curriculum in activities I designed for learning statistical association and linear regression. First, I analyzed the informal line fitting strategies of CATALST students. Findings suggest …


Modeling And Fitting Two-Way Tables Containing Outliers, David L. Farnsworth Feb 2023

Modeling And Fitting Two-Way Tables Containing Outliers, David L. Farnsworth

Articles

A model is proposed for two-way tables of measurement data containing outliers. The two independent variables are categorical and error free. Neither missing values nor replication are present. The model consists of the sum of a customary additive part that can be fit using least squares and a part that is composed of outliers. Recommendations are made for methods for identifying cells containing outliers and for fitting the model. A graph of the observations is used to determine the outliers’ locations. For all cells containing an outlier, replacement values are determined simultaneously using a classical missing-data tool. The result is …


Mathematical Modeling Suggests Cooperation Of Plant-Infecting Viruses, Joshua Miller, Vitaly V. Ganusov, Tessa Burch-Smith May 2022

Mathematical Modeling Suggests Cooperation Of Plant-Infecting Viruses, Joshua Miller, Vitaly V. Ganusov, Tessa Burch-Smith

Chancellor’s Honors Program Projects

No abstract provided.


Statistical Theory For Specialized Linear Regression Adjustment Methods Compared To Multiple Linear Regression In The Presence And Absence Of Interaction Effects, Leon Su Jan 2022

Statistical Theory For Specialized Linear Regression Adjustment Methods Compared To Multiple Linear Regression In The Presence And Absence Of Interaction Effects, Leon Su

Theses and Dissertations--Statistics

When building models to investigate outcomes and variables of interest, researchers often want to adjust for other variables. There is a variety of ways that these adjustments are performed. In this work, we will consider four approaches to adjustment utilized by researchers in various fields. We will compare the efficacy of these methods to what we call the ”true model method”, fitting a multiple linear regression model in which adjustment variables are model covariates. Our goal is to show that these adjustment methods have inferior performance to the true model method by comparing model parameter estimates, power, type I error, …


A Phenological Model For A Southern Population Of Mountain Pine Beetle, Catherine E. Wangen Aug 2021

A Phenological Model For A Southern Population Of Mountain Pine Beetle, Catherine E. Wangen

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

The mountain pine beetle (MPB, Dendroctonus ponderosae Hopkins) attacks living Pinus trees across a widespread area of western North America, causing significant ecological and economic damage. The ability to make accurate predictions of how MPB populations across this range will respond to temperatures, which affect MPB progress through life stages, is essential. Northern and southern populations of MPB are genetically different in response to temperature, requiring geographic-specific model parameters. There is not currently a predictive model for the southern MPB life cycle, despite concerns that those populations may be more susceptible to increased numbers of generations per year, which would …


Developing Prediction Models For Kidney Stone Disease, Joseph Palko Jun 2021

Developing Prediction Models For Kidney Stone Disease, Joseph Palko

Honors Theses

Kidney stone disease has become more prevalent through the years, leading to high treatment cost and associated health risks. In this study, we explore a large medical database and machine learning methods to extract features and construct models for diagnosing kidney stone disease.

Data of 46,250 patients and 58,976 hospital admissions were extracted and analyzed, including patients’ demographic information, diagnoses, vital signs, and laboratory measurements of the blood and urine. We compared the kidney stone (KDS) patients to patients with abdominal and back pain (ABP), patients diagnosed with nephritis, nephrosis, renal sclerosis, chronic kidney disease, or acute and unspecified renal …


Markov Model Composition Of Balinese Reyong Norot Improvisations, Taylor Flanagan, Robert Rovetti May 2021

Markov Model Composition Of Balinese Reyong Norot Improvisations, Taylor Flanagan, Robert Rovetti

Honors Thesis

Markov models are mathematical structures that model the transition between possible states based on the probability of moving from one state to any other. Thus, given a distribution of starting points, the model produces a chain of states that are visited in sequence. Such models have been used extensively to generate music based on probabilities, as sequences of states can represent sequences of notes and rhythms. While music generation is a common application of Markov models, most existing work attempts to reconstruct the musical style of classical Western composers. In this thesis, we produce a series of Markov chains that …


Mathematical Modeling: Instructor And Student Resources, Marnie Phipps, Patty Wagner Jul 2020

Mathematical Modeling: Instructor And Student Resources, Marnie Phipps, Patty Wagner

Mathematics Ancillary Materials

This collection of student and instructor materials for Mathematical Modeling contains lesson plans, lecture slides, homework, learning goals, and student notes for the following major topics:

  • Linear Functions
  • Quadratic Functions
  • Exponential Functions
  • Logarithmic Functions

This is a materials update for a collection of materials created for a Round Nine ALG Textbook Transformation Grant.


Structural Analysis Of The Multifunctional Spoiie Regulatory Protein Of Clostridioides Difficile., Blythe Emily Bunkers Jul 2020

Structural Analysis Of The Multifunctional Spoiie Regulatory Protein Of Clostridioides Difficile., Blythe Emily Bunkers

Graduate Theses and Dissertations

Clostridioides (formally Clostridium) difficile is a medically relevant pathogen pertinent to infectious disease research. C. difficile is distinctly known for its ability to produce two toxins, enterotoxin A and cytotoxin B, and the propensity to colonize the mammalian gastrointestinal tract. It is known that metabolism is tightly correlated with sporulation in endospore producers such as C. difficile, but an interesting and novel regulatory relationship found by the Ivey lab has yet to be understood. The relationship explored in this study is observed between the sporulation factor, SpoIIE, which represses expression of an ABC peptide transporter, app. In this study, two …


Dot: Gene-Set Analysis By Combining Decorrelated Association Statistics, Olga A. Vsevolozhskaya, Min Shi, Fengjiao Hu, Dmitri V. Zaykin Apr 2020

Dot: Gene-Set Analysis By Combining Decorrelated Association Statistics, Olga A. Vsevolozhskaya, Min Shi, Fengjiao Hu, Dmitri V. Zaykin

Biostatistics Faculty Publications

Historically, the majority of statistical association methods have been designed assuming availability of SNP-level information. However, modern genetic and sequencing data present new challenges to access and sharing of genotype-phenotype datasets, including cost of management, difficulties in consolidation of records across research groups, etc. These issues make methods based on SNP-level summary statistics particularly appealing. The most common form of combining statistics is a sum of SNP-level squared scores, possibly weighted, as in burden tests for rare variants. The overall significance of the resulting statistic is evaluated using its distribution under the null hypothesis. Here, we demonstrate that this basic …


Regression Modeling And Prediction By Individual Observations Versus Frequency, Stan Lipovetsky Feb 2020

Regression Modeling And Prediction By Individual Observations Versus Frequency, Stan Lipovetsky

Journal of Modern Applied Statistical Methods

A regression model built by a dataset could sometimes demonstrate a low quality of fit and poor predictions of individual observations. However, using the frequencies of possible combinations of the predictors and the outcome, the same models with the same parameters may yield a high quality of fit and precise predictions for the frequencies of the outcome occurrence. Linear and logistical regressions are used to make an explicit exposition of the results of regression modeling and prediction.


An Examination Of Covid-19 Statistical Modeling, Shane Vaughan Jan 2020

An Examination Of Covid-19 Statistical Modeling, Shane Vaughan

Williams Honors College, Honors Research Projects

The 2019 novel coronavirus, also known as COVID-19, is an infectious disease which was first reported in late 2019 and soon spread to become a global pandemic, prompting major action from world governments. Soon after, many institutions began attempts to analyze and predict the spread and severity of the disease via statistical modeling. Some information is not available for public consumption; however, a number of institutions have published the results of their analyses and some have made public repositories of the code used to build the models. This research paper attempts use these and other resources to examine the modeling …


The Correlation Between Sleep And Lifespan In Drosophila Melanogaster, Joshua Randall Lisse Jan 2019

The Correlation Between Sleep And Lifespan In Drosophila Melanogaster, Joshua Randall Lisse

Masters Theses

”Adequate sleep is associated with an individual’s health. Too little sleep is associated with many health problems, including cardiovascular disease, obesity, and a general increase in all-cause mortality. Yet the molecular changes that link poor sleep and changes in health are still not well understood. Individuals have a unique daily need for sleep, and deviations from the animal’s regular sleeping patterns can be indicative of, or result in, underlying changes in its health. Therefore, we hypothesize that changes in the sleep architecture in Drosophila melanogaster reflect changes in the fly’s health.

We determined sleep architecture in wild-type male flies over …


Monitoring And Evaluating The Influences Of Class V Injection Wells On Urban Karst Hydrology, James Adam Shelley Oct 2018

Monitoring And Evaluating The Influences Of Class V Injection Wells On Urban Karst Hydrology, James Adam Shelley

Masters Theses & Specialist Projects

The response of a karst aquifer to storm events is often faster and more severe than that of a non-karst aquifer. This distinction is often problematic for planners and municipalities, because karst flooding does not typically occur along perennial water courses; thus, traditional flood management strategies are usually ineffective. The City of Bowling Green (CoBG), Kentucky is a representative example of an area plagued by karst flooding. The CoBG, is an urban karst area (UKA), that uses Class V Injection Wells to lessen the severity of flooding. The overall effectiveness, siting, and flooding impact of Injection Wells in UKA’s is …


Essentials Of Structural Equation Modeling, Mustafa Emre Civelek Mar 2018

Essentials Of Structural Equation Modeling, Mustafa Emre Civelek

Zea E-Books Collection

Structural Equation Modeling is a statistical method increasingly used in scientific studies in the fields of Social Sciences. It is currently a preferred analysis method, especially in doctoral dissertations and academic researches. However, since many universities do not include this method in the curriculum of undergraduate and graduate courses, students and scholars try to solve the problems they encounter by using various books and internet resources.

This book aims to guide the researcher who wants to use this method in a way that is free from math expressions. It teaches the steps of a research program using structured equality modeling …


Learning About Modeling In Teacher Preparation Programs, Hyunyi Jung, Eryn Stehr, Jia He, Sharon L. Senk Dec 2017

Learning About Modeling In Teacher Preparation Programs, Hyunyi Jung, Eryn Stehr, Jia He, Sharon L. Senk

Hyunyi Jung

This study explores opportunities that secondary mathematics teacher preparation programs provide to learn about modeling in algebra. Forty-eight course instructors and ten focus groups at five universities were interviewed to answer questions related to modeling. With the analysis of the interview transcripts and related course materials, we found few opportunities for PSTs to engage with the full modeling cycle. Examples of opportunities to learn about algebraic modeling and the participants’ perspectives on the opportunities can contribute to the study of modeling and algebra in teacher education.


Detecting And Evaluating Therapy Induced Changes In Radiomics Features Measured From Non-Small Cell Lung Cancer To Predict Patient Outcomes, Xenia J. Fave May 2017

Detecting And Evaluating Therapy Induced Changes In Radiomics Features Measured From Non-Small Cell Lung Cancer To Predict Patient Outcomes, Xenia J. Fave

Dissertations & Theses (Open Access)

The purpose of this study was to investigate whether radiomics features measured from weekly 4-dimensional computed tomography (4DCT) images of non-small cell lung cancers (NSCLC) change during treatment and if those changes are prognostic for patient outcomes or dependent on treatment modality. Radiomics features are quantitative metrics designed to evaluate tumor heterogeneity from routine medical imaging. Features that are prognostic for patient outcome could be used to monitor tumor response and identify high-risk patients for adaptive treatment. This would be especially valuable for NSCLC due to the high prevalence and mortality of this disease.

A novel process was designed to …


A Realistic Meteorological Assessment Of Perennial Biofuel Crop Deployment: A Southern Great Plains Perspective, Melissa Wagner, Meng Wang, Gonzalo Miguez-Macho, Jesse Miller, Andy Vanloocke, Justin E. Bagley, Carl J. Bernacchi, Matei Georgescu Jan 2017

A Realistic Meteorological Assessment Of Perennial Biofuel Crop Deployment: A Southern Great Plains Perspective, Melissa Wagner, Meng Wang, Gonzalo Miguez-Macho, Jesse Miller, Andy Vanloocke, Justin E. Bagley, Carl J. Bernacchi, Matei Georgescu

Andy VanLoocke

Utility of perennial bioenergy crops (e.g., switchgrass and miscanthus) offers unique opportunities to transition toward a more sustainable energy pathway due to their reduced carbon footprint, averted competition with food crops, and ability to grow on abandoned and degraded farmlands. Studies that have examined biogeophysical impacts of these crops noted a positive feedback between near-surface cooling and enhanced evapotranspiration (ET), but also potential unintended consequences of soil moisture and groundwater depletion. To better understand hydrometeorological effects of perennial bioenergy crop expansion, this study conducted high-resolution (2-km grid spacing) simulations with a state-of-the-art atmospheric model (Weather Research and Forecasting system) dynamically …


Nature Of Mathematical Modeling Tasks For Secondary Mathematics Preservice Teachers, Jia He, Eryn Stehr, Hyunyi Jung Jan 2017

Nature Of Mathematical Modeling Tasks For Secondary Mathematics Preservice Teachers, Jia He, Eryn Stehr, Hyunyi Jung

Mathematics, Statistics and Computer Science Faculty Research and Publications

This study investigated the nature of written modeling tasks reported by instructors of required courses in five secondary mathematics teacher education programs. These tasks were analyzed based on a framework addressing potential cognitive orientation (simple procedures, complex procedures, and rich tasks) and purpose (epistemological, educational, contextual, and socio-critical modeling) of the tasks. Our analysis suggests that most tasks included questions of more than one cognitive orientation and more than half of the tasks were coded as contextual modeling. We also found that tasks that were coded as contextual modeling offered opportunities for future teachers to engage with questions at all …


On The Quantification Of Complexity And Diversity From Phenotypes To Ecosystems, Zachary Harrison Marion Dec 2016

On The Quantification Of Complexity And Diversity From Phenotypes To Ecosystems, Zachary Harrison Marion

Doctoral Dissertations

A cornerstone of ecology and evolution is comparing and explaining the complexity of natural systems, be they genomes, phenotypes, communities, or entire ecosystems. These comparisons and explanations then beget questions about how complexity should be quantified in theory and estimated in practice. Here I embrace diversity partitioning using Hill or effective numbers to move the empirical side of the field regarding the quantification of biological complexity.

First, at the level of phenotypes, I show that traditional multivariate analyses ignore individual complexity and provide relatively abstract representations of variation among individuals. I then suggest using well-known diversity indices from community ecology …


Automated Sea State Classification From Parameterization Of Survey Observations And Wave-Generated Displacement Data, Jason A. Teichman May 2016

Automated Sea State Classification From Parameterization Of Survey Observations And Wave-Generated Displacement Data, Jason A. Teichman

University of New Orleans Theses and Dissertations

Sea state is a subjective quantity whose accuracy depends on an observer’s ability to translate local wind waves into numerical scales. It provides an analytical tool for estimating the impact of the sea on data quality and operational safety. Tasks dependent on the characteristics of local sea surface conditions often require accurate and immediate assessment. An attempt to automate sea state classification using eleven years of ship motion and sea state observation data is made using parametric modeling of distribution-based confidence and tolerance intervals and a probabilistic model using sea state frequencies. Models utilizing distribution intervals are not able to …


Learning About Modeling In Teacher Preparation Programs, Hyunyi Jung, Eryn Stehr, Jia He, Sharon L. Senk Jan 2016

Learning About Modeling In Teacher Preparation Programs, Hyunyi Jung, Eryn Stehr, Jia He, Sharon L. Senk

Mathematics, Statistics and Computer Science Faculty Research and Publications

This study explores opportunities that secondary mathematics teacher preparation programs provide to learn about modeling in algebra. Forty-eight course instructors and ten focus groups at five universities were interviewed to answer questions related to modeling. With the analysis of the interview transcripts and related course materials, we found few opportunities for PSTs to engage with the full modeling cycle. Examples of opportunities to learn about algebraic modeling and the participants’ perspectives on the opportunities can contribute to the study of modeling and algebra in teacher education.


Niche-Based Modeling Of Japanese Stiltgrass (Microstegium Vimineum) Using Presence-Only Information, Nathan Bush Nov 2015

Niche-Based Modeling Of Japanese Stiltgrass (Microstegium Vimineum) Using Presence-Only Information, Nathan Bush

Masters Theses

The Connecticut River watershed is experiencing a rapid invasion of aggressive non-native plant species, which threaten watershed function and structure. Volunteer-based monitoring programs such as the University of Massachusetts’ OutSmart Invasives Species Project, Early Detection Distribution Mapping System (EDDMapS) and the Invasive Plant Atlas of New England (IPANE) have gathered valuable invasive plant data. These programs provide a unique opportunity for researchers to model invasive plant species utilizing citizen-sourced data. This study took advantage of these large data sources to model invasive plant distribution and to determine environmental and biophysical predictors that are most influential in dispersion, and to identify …


Interdisciplinary Modeling For Water-Related Issues Graduate Course, Laurel Saito, Alexander Fernald, Timothy Link Jul 2015

Interdisciplinary Modeling For Water-Related Issues Graduate Course, Laurel Saito, Alexander Fernald, Timothy Link

All ECSTATIC Materials

The science and management of aquatic ecosystems is inherently interdisciplinary, with issues associated with hydrology, atmospheric science, water quality, geochemistry, sociology, economics, environmental science, and ecology. Addressing water resources issues in any one discipline invariably involves effects that concern other disciplines, and attempts to address one issue often have consequences that exacerbate existing issues or concerns, or create new ones (Jørgensen et al. 1992; Lackey et al. 1975; Straskraba 1994) due to the strongly interactive nature of key processes (Christensen et al. 1996). Thus, research and management of aquatic ecosystems must be interdisciplinary to be most effective, but such truly …


Assessing The Probability That A Finding Is Genuine For Large-Scale Genetic Association Studies, Chia-Ling Kuo, Olga A. Vsevolozhskaya, Dmitri V. Zaykin May 2015

Assessing The Probability That A Finding Is Genuine For Large-Scale Genetic Association Studies, Chia-Ling Kuo, Olga A. Vsevolozhskaya, Dmitri V. Zaykin

Olga A. Vsevolozhskaya

Genetic association studies routinely involve massive numbers of statistical tests accompanied by P-values. Whole genome sequencing technologies increased the potential number of tested variants to tens of millions. The more tests are performed, the smaller P-value is required to be deemed significant. However, a small P-value is not equivalent to small chances of a spurious finding and significance thresholds may fail to serve as efficient filters against false results. While the Bayesian approach can provide a direct assessment of the probability that a finding is spurious, its adoption in association studies has been slow, due in part to the ubiquity …


Using Capture-Mark-Recapture Techniques To Estimate Detection Probabilities & Fidelity Of Expression For The Critically Endangered James Spinymussel (Pleurobema Collina)., Alaina C. Esposito May 2015

Using Capture-Mark-Recapture Techniques To Estimate Detection Probabilities & Fidelity Of Expression For The Critically Endangered James Spinymussel (Pleurobema Collina)., Alaina C. Esposito

Masters Theses, 2010-2019

The critically endangered James Spinymussel (Pleurobema collina) is a species of freshwater mussel endemic to Virginia’s James and Dan River basins. In the last 20 years, P. collina has experienced a substantial decline in numbers and currently occupies approximately 10% of its original habitat; however, little information is known about this species to assist in conservation. A 230-meter reach of transitional habitat in Swift Run was selected for repeat observations to estimate detection probabilities using a Capture-Mark-Recapture framework. In June 2014, visual scouting began to locate and tag P. collina (including other mussels in the community) with PIT …


Dependency-Topic-Affects-Sentiment-Lda Model For Sentiment Analysis, Shunshun Yin, Jun Han, Yu Huang, Kuldeep Kumar Mar 2015

Dependency-Topic-Affects-Sentiment-Lda Model For Sentiment Analysis, Shunshun Yin, Jun Han, Yu Huang, Kuldeep Kumar

Kuldeep Kumar

Sentiment analysis tends to use automated approaches to mine the sentiment information expressed in text, such as reviews, blogs and forum discussions. As most traditional approaches for sentiment analysis are based on supervised learning models and need many labeled corpora as their training data which are not always easily obtained, various unsupervised models based on Latent Dirichlet Allocation (LDA) have been proposed for sentiment classification. In this paper, we propose a novel probabilistic modeling framework based on LDA, called Dependency-Topic-Affects-Sentiment-LDA (DTAS) model, which drops the ”bag of words” assumption and assumes that the topics of sentences in a document form …


A Predictive Modeling System: Early Identification Of Students At-Risk Enrolled In Online Learning Programs, Mary L. Fonti Jan 2015

A Predictive Modeling System: Early Identification Of Students At-Risk Enrolled In Online Learning Programs, Mary L. Fonti

CCE Theses and Dissertations

Predictive statistical modeling shows promise in accurately predicting academic performance for students enrolled in online programs. This approach has proven effective in accurately identifying students who are at-risk enabling instructors to provide instructional intervention. While the potential benefits of statistical modeling is significant, implementations have proven to be complex, costly, and difficult to maintain. To address these issues, the purpose of this study is to develop a fully integrated, automated predictive modeling system (PMS) that is flexible, easy to use, and portable to identify students who are potentially at-risk for not succeeding in a course they are currently enrolled in. …