Open Access. Powered by Scholars. Published by Universities.®

Statistical Models Commons

Open Access. Powered by Scholars. Published by Universities.®

1,356 Full-Text Articles 2,006 Authors 853,222 Downloads 156 Institutions

All Articles in Statistical Models

Faceted Search

1,356 full-text articles. Page 1 of 53.

Using The History Of Statistics To Teach Introductory Statistics, Melissa Hansen 2024 Utah State University

Using The History Of Statistics To Teach Introductory Statistics, Melissa Hansen

All Graduate Reports and Creative Projects, Fall 2023 to Present

While often taught in high school and required as part of a college degree, statistics classes are sometimes viewed by students as an obstacle rather than a support for their overall goals. One way to increase student engagement in a statistics course is to use the history of statistics. Within the literature review, the advantages to using the history of statistics are discussed as well as the more extensive research on using the history of mathematics in mathematics courses. Included are instructional strategies for using the context around the development of mathematical ideas in math classrooms which can be extended …


Code For Care: Hypertension Prediction In Women Aged 18-39 Years, Kruti Sheth 2024 California State University, San Bernardino

Code For Care: Hypertension Prediction In Women Aged 18-39 Years, Kruti Sheth

Electronic Theses, Projects, and Dissertations

The longstanding prevalence of hypertension, often undiagnosed, poses significant risks of severe chronic and cardiovascular complications if left untreated. This study investigated the causes and underlying risks of hypertension in females aged between 18-39 years. The research questions were: (Q1.) What factors affect the occurrence of hypertension in females aged 18-39 years? (Q2.) What machine learning algorithms are suited for effectively predicting hypertension? (Q3.) How can SHAP values be leveraged to analyze the factors from model outputs? The findings are: (Q1.) Performing Feature selection using binary classification Logistic regression algorithm reveals an array of 30 most influential factors at an …


Comparing North American Professional Sports League Season Formats Using Monte Carlo Simulation, Lathan Gregg 2024 University of Arkansas, Fayetteville

Comparing North American Professional Sports League Season Formats Using Monte Carlo Simulation, Lathan Gregg

Industrial Engineering Undergraduate Honors Theses

Each NFL, NBA, and MLB season consists of a regular season, in which teams play a set number of scheduled games and a playoff, in which qualifying teams compete for a championship. At the conclusion of each season, teams are ranked based on their performance throughout the season. This study aims to investigate the ability of each league's season format to accurately rank teams using Monte Carlo simulation. Matches between two teams are simulated by using the team’s assigned strength ranks to calculate a winning probability for each team. The winning probabilities are simulated with different skill values, dictating how …


Evaluating The Effect Of Skipping Ticagrelor Doses And Need For Bolus Doses Upon Treatment Resumption Through Population Pk/Pd Simulation, Hiroyoshi Matsui, Le Thien Truc Pham, Eyob D. Adane 2024 Ohio Northern University

Evaluating The Effect Of Skipping Ticagrelor Doses And Need For Bolus Doses Upon Treatment Resumption Through Population Pk/Pd Simulation, Hiroyoshi Matsui, Le Thien Truc Pham, Eyob D. Adane

ONU Student Research Colloquium

Ticagrelor (Brilinta (R)) is the first reversibly binding oral P2Y12 receptor antagonist. It is used, mostly in combination with aspirin, in patients with acute coronary syndromes to reduce thrombosis. The manufacturer of ticagrelor recommends discontinuing it at least 5 days before any surgery when possible. While the effect of dose interruptions on the risk of thrombosis is not directly studied, it is important to understand the impact of skipping doses on ticagrelor's PK/PD profile for clinical-decision making. The objectives of the current study were to simulate the impact of therapy interruption on the PK/PD of ticagrelor and examine the need …


Research On Chinese Data Sovereignty Policy Based On Lda Model And Policy Instruments, Han QIAO, Junru XU 2024 School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190, China MOE Social Science Laboratory of Digital Economic Forecasts and Policy Simulation, University of Chinese Academy of Sciences, Beijing 100190, China

Research On Chinese Data Sovereignty Policy Based On Lda Model And Policy Instruments, Han Qiao, Junru Xu

Bulletin of Chinese Academy of Sciences (Chinese Version)

Data sovereignty has become an important component of national sovereignty in the dual context of the digital economy development and the overall national security concept. Major countries and regions are actively carrying out data sovereignty strategic deployment and engaging in fierce competition in data resources, data technology, and data rules. This work adopts the policy text analysis method to study China’s data sovereignty policy, and employs the LDA model and policy instruments to quantitatively analyze the process evolution and thematic characteristics of China’s data sovereignty policy. Drawing on these findings, this study comprehensively considers the global data sovereignty policy and …


Assessment Of Method Effects Of Keying And Wording In Instruments: A Mixed-Methods Explanatory Sequential Study, Lin Ma 2024 University of Denver

Assessment Of Method Effects Of Keying And Wording In Instruments: A Mixed-Methods Explanatory Sequential Study, Lin Ma

Electronic Theses and Dissertations

This dissertation presents an innovative approach to examining the keying method, wording method, and construct validity on psychometric instruments. By employing a mixed methods explanatory sequential design, the effects of keying and wording in two psychometric assessments were examined and validated. Those two self-report psychometric assessments were the Effortful Control assessment (Ellis & Rothbart, 2001) and the Grit assessment (Duckworth & Quinn, 2009). Moreover, the quantitative phase utilized structural equation modeling to analyze 2,104 students’ responses and assess the construct of keying and wording. Various hypothetical models were investigated and evaluated. The reliability of each construct in each method was …


Predicting Crop Yield Using Remote Sensing Data, Mary Row, Jung-Han Kimn, Hossein Moradi 2024 Saint Mary's University of Minnesota

Predicting Crop Yield Using Remote Sensing Data, Mary Row, Jung-Han Kimn, Hossein Moradi

SDSU Data Science Symposium

Accurate crop yield predictions can help farmers make adjustments or changes in their farming practices to optimize their harvest. Remote sensing data is an inexpensive approach to collecting massive amounts of data that could be utilized for predicting crop yield. This study employed linear regression and spatial linear models were used to predict soybean yield with data from Landsat 8 OLI. Each model was built using only spectral bands of the satellite, only vegetation indices, and both spectral bands and vegetation indices. All analysis was based on data collected from two fields in South Dakota from the 2019 and 2021 …


Session 6: The Size-Biased Lognormal Mixture With The Entropy Regularized Algorithm, Tatjana Miljkovic, Taehan Bae 2024 Miami University - Oxford

Session 6: The Size-Biased Lognormal Mixture With The Entropy Regularized Algorithm, Tatjana Miljkovic, Taehan Bae

SDSU Data Science Symposium

A size-biased left-truncated Lognormal (SB-ltLN) mixture is proposed as a robust alternative to the Erlang mixture for modeling left-truncated insurance losses with a heavy tail. The weak denseness property of the weighted Lognormal mixture is studied along with the tail behavior. Explicit analytical solutions are derived for moments and Tail Value at Risk based on the proposed model. An extension of the regularized expectation–maximization (REM) algorithm with Shannon's entropy weights (ewREM) is introduced for parameter estimation and variability assessment. The left-truncated internal fraud data set from the Operational Riskdata eXchange is used to illustrate applications of the proposed model. Finally, …


Modeling Of Covid-19 Clinical Outcomes In Mexico: An Analysis Of Demographic, Clinical, And Chronic Disease Factors, Livia Clarete 2024 The Graduate Center, City University of New York

Modeling Of Covid-19 Clinical Outcomes In Mexico: An Analysis Of Demographic, Clinical, And Chronic Disease Factors, Livia Clarete

Dissertations, Theses, and Capstone Projects

This study explores COVID-19 clinical outcomes in Mexico, focusing on demographic, clinical, and chronic disease variables to develop predictive models. In the binary classification task, the Ada Boost Classifier distinguishes survivors from non-survivors, with age, sex, ethnicity, and chronic medical conditions influencing outcomes. In multiclass classification, the Gradient Boosting Classifier categorizes patients into outcome groups.

Demographic variables, especially age, are crucial for predicting COVID-19 outcomes for both the binary and multiclass classification tasks. Clinical information about previous conditions, including chronic diseases, also holds relevance, especially diabetes, immunocompromise, and cardiovascular diseases. These insights inform public health measures and healthcare strategies, emphasizing …


Making Sense Of Making Parole In New York, Alexandra McGlinchy 2024 The Graduate Center, City University of New York

Making Sense Of Making Parole In New York, Alexandra Mcglinchy

Dissertations, Theses, and Capstone Projects

For many individuals incarcerated in New York, the initial step toward freedom begins with an interview with the Board of Parole. This process, however, is frequently a complex and challenging one, characterized by repeated denials and extended incarcerations. The disparity in outcomes – where one individual may receive over 20 denials and another is granted parole on their first attempt – highlights the ambiguity and inconsistency in the parole decision-making process. This project aims to clarify the factors that influence parole decisions by concentrating on measurable variables. These include age, race, duration of sentence served, proportion of sentence served, type …


Model Selection Through Cross-Validation For Supervised Learning Tasks With Manifold Data, Derek Brown 2024 Purdue University Fort Wayne

Model Selection Through Cross-Validation For Supervised Learning Tasks With Manifold Data, Derek Brown

The Journal of Purdue Undergraduate Research

No abstract provided.


Sensitivity Analysis Of Prior Distributions In Regression Model Estimation, AYOADE I ADEWOLE, OLUWATOYIN K. BODUNWA 2024 Department of Mathematics, Tai Solarin University of Education Ijagun Ogun State Nigeria.

Sensitivity Analysis Of Prior Distributions In Regression Model Estimation, Ayoade I Adewole, Oluwatoyin K. Bodunwa

Al-Bahir Journal for Engineering and Pure Sciences

Bayesian inferences depend solely on specification and accuracy of likelihoods and prior distributions of the observed data. The research delved into Bayesian estimation method of regression models to reduce the impact of some of the problems, posed by convectional method of estimating regression models, such as handling complex models, availability of small sample sizes and inclusion of background information in the estimation procedure. Posterior distributions are based on prior distributions and the data accuracy, which is the fundamental principles of Bayesian statistics to produce accurate final model estimates. Sensitivity analysis is an essential part of mathematical model validation in obtaining …


Machine Learning Approaches For Cyberbullying Detection, Roland Fiagbe 2024 University of Central Florida

Machine Learning Approaches For Cyberbullying Detection, Roland Fiagbe

Data Science and Data Mining

Cyberbullying refers to the act of bullying using electronic means and the internet. In recent years, this act has been identifed to be a major problem among young people and even adults. It can negatively impact one’s emotions and lead to adverse outcomes like depression, anxiety, harassment, and suicide, among others. This has led to the need to employ machine learning techniques to automatically detect cyberbullying and prevent them on various social media platforms. In this study, we want to analyze the combination of some Natural Language Processing (NLP) algorithms (such as Bag-of-Words and TFIDF) with some popular machine learning …


Predicting Superconducting Critical Temperature Using Regression Analysis, Roland Fiagbe 2024 University of Central Florida

Predicting Superconducting Critical Temperature Using Regression Analysis, Roland Fiagbe

Data Science and Data Mining

This project estimates a regression model to predict the superconducting critical temperature based on variables extracted from the superconductor’s chemical formula. The regression model along with the stepwise variable selection gives a reasonable and good predictive model with a lower prediction error (MSE). Variables extracted based on atomic radius, valence, atomic mass and thermal conductivity appeared to have the most contribution to the predictive model.


A Bayesian Inversion For Emissions And Export Productivity Across The End-Cretaceous Boundary, Alexander A. Cox 2024 Dartmouth College

A Bayesian Inversion For Emissions And Export Productivity Across The End-Cretaceous Boundary, Alexander A. Cox

Dartmouth College Master’s Theses

The end-Cretaceous mass extinction was marked by both the Chicxulub impact and the ongoing emplacement of the Deccan Traps flood basalt province. Both of these events perturbed the environment by the emission of climate-active volatiles, primarily CO2 and SO2. To understand the mechanism of extinction, we must disentangle the timing, duration, and intensity of volcanic and meteoritic environmental forcings. In this thesis, we used a parallel Markov chain Monte Carlo approach to invert for the aforementioned volatile emissions, export productivity, and remineralization from 67 to 65 million years ago using the LOSCAR (Long-term Ocean-atmosphere-Sediment CArbon cycle Reservoir) model. The parallel …


Multiscale Modelling Of Brain Networks And The Analysis Of Dynamic Processes In Neurodegenerative Disorders, Hina Shaheen 2024 Wilfrid Laurier University

Multiscale Modelling Of Brain Networks And The Analysis Of Dynamic Processes In Neurodegenerative Disorders, Hina Shaheen

Theses and Dissertations (Comprehensive)

The complex nature of the human brain, with its intricate organic structure and multiscale spatio-temporal characteristics ranging from synapses to the entire brain, presents a major obstacle in brain modelling. Capturing this complexity poses a significant challenge for researchers. The complex interplay of coupled multiphysics and biochemical activities within this intricate system shapes the brain's capacity, functioning within a structure-function relationship that necessitates a specific mathematical framework. Advanced mathematical modelling approaches that incorporate the coupling of brain networks and the analysis of dynamic processes are essential for advancing therapeutic strategies aimed at treating neurodegenerative diseases (NDDs), which afflict millions of …


Utility In Time Description In Priority Best-Worst Discrete Choice Models: An Empirical Evaluation Using Flynn's Data, Sasanka Adikari, Norou Diawara 2024 Old Dominion University

Utility In Time Description In Priority Best-Worst Discrete Choice Models: An Empirical Evaluation Using Flynn's Data, Sasanka Adikari, Norou Diawara

Mathematics & Statistics Faculty Publications

Discrete choice models (DCMs) are applied in many fields and in the statistical modelling of consumer behavior. This paper focuses on a form of choice experiment, best-worst scaling in discrete choice experiments (DCEs), and the transition probability of a choice of a consumer over time. The analysis was conducted by using simulated data (choice pairs) based on data from Flynn's (2007) 'Quality of Life Experiment'. Most of the traditional approaches assume the choice alternatives are mutually exclusive over time, which is a questionable assumption. We introduced a new copula-based model (CO-CUB) for the transition probability, which can handle the dependent …


Simulation Of Wave Propagation In Granular Particles Using A Discrete Element Model, SYED TAHMID HUSSAN 2024 Georgia Southern University

Simulation Of Wave Propagation In Granular Particles Using A Discrete Element Model, Syed Tahmid Hussan

Electronic Theses and Dissertations

The understanding of Bender Element mechanism and utilization of Particle Flow Code (PFC) to simulate the seismic wave behavior is important to test the dynamic behavior of soil particles. Both discrete and finite element methods can be used to simulate wave behavior. However, Discrete Element Method (DEM) is mostly suitable, as the micro scaled soil particle cannot be fully considered as continuous specimen like a piece of rod or aluminum. Recently DEM has been widely used to study mechanical properties of soils at particle level considering the particles as balls. This study represents a comparative analysis of Voigt and Best …


Imputation Strategies For Different Categories Of Missing Data, Karthik Chalumuri 2024 University of New Hampshire, Durham

Imputation Strategies For Different Categories Of Missing Data, Karthik Chalumuri

Honors Theses and Capstones

Addressing missing data in research is crucial for ensuring the reliability and validity of study findings, yet it remains a significant challenge. This study investigates the impact of missing data on research outcomes and explores the underutilization of existing tools for managing missingness, potentially leading to gaps in critical information with tangible implications for decision-making processes (Dziura et al.).

Focusing on the different categories of missing data—Missing Completely At Random (MCAR), Missing At Random (MAR), and Missing Not At Random (MNAR)—this research examines various imputation strategies tailored to each category. Specifically, we compare the efficacy of several model-based imputation methods, …


Reducing Food Scarcity: The Benefits Of Urban Farming, S.A. Claudell, Emilio Mejia 2023 Brigham Young University

Reducing Food Scarcity: The Benefits Of Urban Farming, S.A. Claudell, Emilio Mejia

Journal of Nonprofit Innovation

Urban farming can enhance the lives of communities and help reduce food scarcity. This paper presents a conceptual prototype of an efficient urban farming community that can be scaled for a single apartment building or an entire community across all global geoeconomics regions, including densely populated cities and rural, developing towns and communities. When deployed in coordination with smart crop choices, local farm support, and efficient transportation then the result isn’t just sustainability, but also increasing fresh produce accessibility, optimizing nutritional value, eliminating the use of ‘forever chemicals’, reducing transportation costs, and fostering global environmental benefits.

Imagine Doris, who is …


Digital Commons powered by bepress