Open Access. Powered by Scholars. Published by Universities.®

Statistical Methodology Commons

Open Access. Powered by Scholars. Published by Universities.®

Series

Discipline
Institution
Keyword
Publication Year
Publication

Articles 1 - 30 of 493

Full-Text Articles in Statistical Methodology

Identifying Rural Health Clinics Within The Transformed Medicaid Statistical Information System (T-Msis) Analytic Files, Katherine Ahrens Mph, Phd, Zachariah Croll, Yvonne Jonk Phd, John Gale Ms, Heidi O'Connor Ms Mar 2024

Identifying Rural Health Clinics Within The Transformed Medicaid Statistical Information System (T-Msis) Analytic Files, Katherine Ahrens Mph, Phd, Zachariah Croll, Yvonne Jonk Phd, John Gale Ms, Heidi O'Connor Ms

Rural Health Clinics

Researchers at the Maine Rural Health Research Center describe a methodology for identifying Rural Health Clinic encounters within the Medicaid claims data using Transformed Medicaid Statistical Information System (T-MSIS) Analytic Files.

Background: There is limited information on the extent to which Rural Health Clinics (RHC) provide pediatric and pregnancy-related services to individuals enrolled in state Medicaid/CHIP programs. In part this is because methods to identify RHC encounters within Medicaid claims data are outdated.

Methods: We used a 100% sample of the 2018 Medicaid Demographic and Eligibility and Other Services Transformed Medicaid Statistical Information System (T-MSIS) Analytic Files for 20 states …


Machine Learning Approaches For Cyberbullying Detection, Roland Fiagbe Jan 2024

Machine Learning Approaches For Cyberbullying Detection, Roland Fiagbe

Data Science and Data Mining

Cyberbullying refers to the act of bullying using electronic means and the internet. In recent years, this act has been identifed to be a major problem among young people and even adults. It can negatively impact one’s emotions and lead to adverse outcomes like depression, anxiety, harassment, and suicide, among others. This has led to the need to employ machine learning techniques to automatically detect cyberbullying and prevent them on various social media platforms. In this study, we want to analyze the combination of some Natural Language Processing (NLP) algorithms (such as Bag-of-Words and TFIDF) with some popular machine learning …


Predicting Superconducting Critical Temperature Using Regression Analysis, Roland Fiagbe Jan 2024

Predicting Superconducting Critical Temperature Using Regression Analysis, Roland Fiagbe

Data Science and Data Mining

This project estimates a regression model to predict the superconducting critical temperature based on variables extracted from the superconductor’s chemical formula. The regression model along with the stepwise variable selection gives a reasonable and good predictive model with a lower prediction error (MSE). Variables extracted based on atomic radius, valence, atomic mass and thermal conductivity appeared to have the most contribution to the predictive model.


Is The Declining Birthrate Really An Issue For The Economy?, Harsh Ramesh Pednekar, Theodore Lee, Darrion Chin Dec 2023

Is The Declining Birthrate Really An Issue For The Economy?, Harsh Ramesh Pednekar, Theodore Lee, Darrion Chin

Introduction to Research Methods RSCH 202

This study aims to explore the complex implications of declining birth rates on the economy, focusing on GDP per capita as a crucial metric, and aims to uncover both potential opportunities and challenges stemming from this demographic transformation using regression analysis. Using a quantitative methodology and secondary data from OECD.stat, World Population Review, and World Bank, the study explores the relationship between declining birth rates and economic impacts. GDP per capita serves as an essential dependent variable, and it accounts for control variables such as labour force participation, literacy, and education levels, child dependence ratio, and physical capital. Past studies …


Statistical And Machine Learning Approaches To Describe Factors Affecting Preweaning Mortality Of Piglets, Md Towfiqur Rahman, Tami M. Brown-Brandl, Gary A. Rohrer, Sudhendu R. Sharma, Vamsi Manthena, Yeyin Shi Oct 2023

Statistical And Machine Learning Approaches To Describe Factors Affecting Preweaning Mortality Of Piglets, Md Towfiqur Rahman, Tami M. Brown-Brandl, Gary A. Rohrer, Sudhendu R. Sharma, Vamsi Manthena, Yeyin Shi

Biological Systems Engineering: Papers and Publications

High preweaning mortality (PWM) rates for piglets are a significant concern for the worldwide pork industries, causing economic loss and well-being issues. This study focused on identifying the factors affecting PWM, overlays, and predicting PWM using historical production data with statistical and machine learning models. Data were collected from 1,982 litters from the United States Meat Animal Research Center, Nebraska, over the years 2016 to 2021. Sows were housed in a farrowing building with three rooms, each with 20 farrowing crates, and taken care of by well-trained animal caretakers. A generalized linear model was used to analyze the various sow, …


A Classical Fall Statistics Problem, Timothy L. Meyer Oct 2023

A Classical Fall Statistics Problem, Timothy L. Meyer

Cornhusker Economics

An evaluation of traditional baseball measures and suggestions for alternatives, centering on statistics related to the offensive quality of a player.


Movie Recommender System Using Matrix Factorization, Roland Fiagbe May 2023

Movie Recommender System Using Matrix Factorization, Roland Fiagbe

Data Science and Data Mining

Recommendation systems are a popular and beneficial field that can help people make informed decisions automatically. This technique assists users in selecting relevant information from an overwhelming amount of available data. When it comes to movie recommendations, two common methods are collaborative filtering, which compares similarities between users, and content-based filtering, which takes a user’s specific preferences into account. However, our study focuses on the collaborative filtering approach, specifically matrix factorization. Various similarity metrics are used to identify user similarities for recommendation purposes. Our project aims to predict movie ratings for unwatched movies using the MovieLens rating dataset. We developed …


A Monte Carlo Analysis Of Nonprobability Sampling & Post Hoc Corrections, Julia Hong May 2023

A Monte Carlo Analysis Of Nonprobability Sampling & Post Hoc Corrections, Julia Hong

Masters Theses & Specialist Projects

Nonprobability samples are often used in place of probability samples because the former are less trouble and less expensive. Unfortunately, it is difficult to determine how well a sample represents population parameters when using nonprobability samples. Researchers attempt to mitigate the disadvantages of nonprobability sampling by performing post hoc corrections, but this adjustment may not successfully undo the effects of nonprobability sampling. To examine these effects, a Monte Carlo simulation was conducted to create a pseudo-population from which samples were drawn. Forty-one conditions were replicated 10,000 times each, with each sample consisting of 100 observations. A post-stratification adjustment was made …


Prevalence Of Sars-Cov-2 Antibodies In Liberty University Student Population, Emily Bonus Apr 2023

Prevalence Of Sars-Cov-2 Antibodies In Liberty University Student Population, Emily Bonus

Senior Honors Theses

In 2020, the virus SARS-CoV-2 gained attention as it spread around the world. Its antibodies are poorly understood, and little research focuses on those with few COVID-19 complications yet large numbers of close contacts: university students. This longitudinal study recorded SARS-CoV-2 antibody presence in 107 undergraduate Liberty University students twice during early 2021. After extensive data cleaning and the application of various statistical tests and ANOVAs, the data seems to show that in the case of COVID-19 infections, SARS-CoV-2 IgM antibodies are immediately produced, and then IgG antibodies follow later. However, the COVID-19 vaccine causes the production of both IgM …


Forecasting Remission Time Of A Treatment Method For Leukemia As An Application To Statistical Inference Approach, Mahmoud Mansour, Rashad El-Sagheer, Ahmed Galal Attia, Beha S. El-Desouky Prof. Feb 2023

Forecasting Remission Time Of A Treatment Method For Leukemia As An Application To Statistical Inference Approach, Mahmoud Mansour, Rashad El-Sagheer, Ahmed Galal Attia, Beha S. El-Desouky Prof.

Basic Science Engineering

In this paper, Weibull-Linear Exponential distribution (WLED) has been investigated whether being it is a well-fit distribution to a clinical real data. These data represent the duration of remission achieved by a certain drug used in the treatment of leukemia for a group of patients. The statistical inference approach is used to estimate the parameters of the WLED through the set of the fitted data. The estimated parameters are utilized to evaluate the survival and hazard functions and hence assessing the treatment method through forecasting the duration of remission times of patients. A two-sample prediction approach has been applied to …


Modeling And Fitting Two-Way Tables Containing Outliers, David L. Farnsworth Feb 2023

Modeling And Fitting Two-Way Tables Containing Outliers, David L. Farnsworth

Articles

A model is proposed for two-way tables of measurement data containing outliers. The two independent variables are categorical and error free. Neither missing values nor replication are present. The model consists of the sum of a customary additive part that can be fit using least squares and a part that is composed of outliers. Recommendations are made for methods for identifying cells containing outliers and for fitting the model. A graph of the observations is used to determine the outliers’ locations. For all cells containing an outlier, replacement values are determined simultaneously using a classical missing-data tool. The result is …


Classification Of Adult Income Using Decision Tree, Roland Fiagbe Jan 2023

Classification Of Adult Income Using Decision Tree, Roland Fiagbe

Data Science and Data Mining

Decision tree is a commonly used data mining methodology for performing classification tasks. It is a tree-based supervised machine learning algorithm that is used to classify or make predictions in a path of how previous questions are answered. Generally, the decision tree algorithm categorizes data into branch-like segments that develop into a tree that contains a root, nodes, and leaves. This project seeks to explore the decision tree methodology and apply it to the Adult Income dataset from the UCI Machine Learning Repository, to determine whether a person makes over 50K per year and determine the necessary factors that improve …


A Bayesian Programming Approach To Car-Following Model Calibration And Validation Using Limited Data, Franklin Abodo Jun 2022

A Bayesian Programming Approach To Car-Following Model Calibration And Validation Using Limited Data, Franklin Abodo

FIU Electronic Theses and Dissertations

Traffic simulation software is used by transportation researchers and engineers to design and evaluate changes to roadway networks. Underlying these simulators are mathematical models of microscopic driver behavior from which macroscopic measures of flow and congestion can be recovered. Many models are intended to apply to only a subset of possible traffic scenarios and roadway configurations, while others do not have any explicit constraint on their applicability. Work zones on highways are one scenario for which no model invented to date has been shown to accurately reproduce realistic driving behavior. This makes it difficult to optimize for safety and other …


The Short-Term Effects Of Fine Airborne Particulate Matter And Climate On Covid-19 Disease Dynamics, El Hussain Shamsa, Kezhong Zhang Jun 2022

The Short-Term Effects Of Fine Airborne Particulate Matter And Climate On Covid-19 Disease Dynamics, El Hussain Shamsa, Kezhong Zhang

Medical Student Research Symposium

Background: Despite more than 60% of the United States population being fully vaccinated, COVID-19 cases continue to spike in a temporal pattern. These patterns in COVID-19 incidence and mortality may be linked to short-term changes in environmental factors.

Methods: Nationwide, county-wise measurements for COVID-19 cases and deaths, fine-airborne particulate matter (PM2.5), and maximum temperature were obtained from March 20, 2020 to March 20, 2021. Multivariate Linear Regression was used to analyze the association between environmental factors and COVID-19 incidence and mortality rates in each season. Negative Binomial Regression was used to analyze daily fluctuations of COVID-19 cases …


Relationship Between Higher Education And Health Insurance Coverage And Healthcare Utilization For Black Or African American Individuals, Emma Revoir Apr 2022

Relationship Between Higher Education And Health Insurance Coverage And Healthcare Utilization For Black Or African American Individuals, Emma Revoir

Psychology Student Works

Higher education may decrease mortality due to higher income and health insurance availability (Buckles et al., 2016). Education can increase understanding and utilizing health insurance (Gallo et al., 2020). There is a gap in how this relationship affects racial minorities. This study aims to understand how health insurance is affected by education level for Black or African American individuals.


The Association Between Sibling Relationships And Personality, Rebecca Ramsey Apr 2022

The Association Between Sibling Relationships And Personality, Rebecca Ramsey

Psychology Student Works

Differences in self perception (specifically aggression) was discovered in participants with siblings, and no difference of intelligence, outgoing personalities, creativity, competitiveness and family orientation (Van Volkom, Guerguis, & Kramer, 2017). Sisters were found to have more empathy than the brothers along with higher intimacy, knowledge, and emotional support (Walęcka-Matyja, 2017). There is a gap in my research between sibling jealousy and depression, sibling negativity, and intimacy in young adulthood (Hamwey & Whiteman, 2020). My study examines the similarity of personality to siblings. Some personality traits may include jealousy, outgoingness, competitiveness, and creativity.


How Quality Of Paternal Relationship Impacts Depression Development In Adulthood, Rachel Lane Apr 2022

How Quality Of Paternal Relationship Impacts Depression Development In Adulthood, Rachel Lane

Psychology Student Works

Previous studies show that paternal involvement directly correlates to a decrease in the child’s probability of developing a mental illness (O’Gara, et.al. 2019). Research shows, the more supportive and the more affection a parent gives, the less likely a child is to develop depressive tendencies (Del Barrio, et. al. 2016) There is a gap in investigating whether or not having a loving father postpones the age at which someone is diagnosed with depression. This study provides insight into paternal closeness and the age at which someone is diagnosed with depression.


The Association Between Promiscuity And Marital Satisfaction, Shelby Kate Christopher Apr 2022

The Association Between Promiscuity And Marital Satisfaction, Shelby Kate Christopher

Psychology Student Works

Research has suggested that premarital cohabitation is linked to a heightened risk of divorce and dissatisfaction if it occurs with someone other than the marriage partner (Teachman, 2004). The number of partners is linked to marital dissatisfaction for both men and women (Legkauskas & Stankevičienė, 2008). Involvement in church groups is linked to a 5x increase in likelihood of abstinence (Paul et al, 2000). Previous studies have used small sample sizes with little diversity and few men. However, my research will use a large, diverse sample size with both men and women.


The Association Between Religiosity And Alcohol Use In Adolescents, Daphny Vang Apr 2022

The Association Between Religiosity And Alcohol Use In Adolescents, Daphny Vang

Psychology Student Works

Research has found that the age and strength of religion directly correlate to the level of alcohol consumption(Mason & Windle, 2002). Previous studies show that adult women with stronger religious beliefs consume less alcohol compared to those with weaker religious beliefs (Mattila & et al., 2001). There is a gap in investigating the relationship between religiosity and alcohol use in adolescents. This study provides insight into the relationship between religiosity and alcohol consumption with adolescents.


The Relationship Between Adults Education Level And Their Mental Health Status, Kelsey Bergan Apr 2022

The Relationship Between Adults Education Level And Their Mental Health Status, Kelsey Bergan

Psychology Student Works

Research has shown that those with a higher education are more likely to seek out sources to help them with mental disorders (Ibrahim, et al., 2019; Neimeyer, et al., 2011). Previous studies have found that students with a lower GPA show signs of depression, anxiety and eating disorders (Eisenberg, Globerstein & Hunt, 2009). There is a gap in investigating the relationship between education level and an adult’s mental health experiences. This study will provide insight to the relationship between an adults education level and their mental health.


The Relationship Between Adolescents’ Participation In Religious/Spiritual Activities And Mental Health As An Adult, Tiara Lamb Apr 2022

The Relationship Between Adolescents’ Participation In Religious/Spiritual Activities And Mental Health As An Adult, Tiara Lamb

Psychology Student Works

Many receiving mental health treatment across the U.S. see spirituality and religion as very helpful in supporting their mental state (Oxhandler et al., 2021). How often black Americans attended church in adolescence is related to how as a young adult they could spiritually handle mentally stressing situations (Alexander, 2017). There’s a gap in previous studies because most of them study specific groups rather than using a representative sample of the U.S. In addition, there is a lack of studies investigating the relationship between adolescent religiosity/spirituality and adult mental health. This study uses a representative sample and investigates the association between …


The Association Of Maternal Relationships And Acts Of Violence In Adults, Cameron Shores Apr 2022

The Association Of Maternal Relationships And Acts Of Violence In Adults, Cameron Shores

Psychology Student Works

Through research it has been found that people who do not feel close to their parents are more likely to react with violence (Cano-Lozano et al., 2020). Studies have also shown that people who reported adverse relationships as children, tend to have more violence-related outcomes (Duke, 2010). There is currently a gap in explaining the association between maternal closeness and violent tendencies. This study is meant to help fill that gap and bring cognizance to the relationship between maternal closeness and violence.


Examining The Effects Of Individual And Neighborhood Factors On Hiv Transmission Risk Potential Among People With Hiv, Semiu Olatunde Gbadamosi Mar 2022

Examining The Effects Of Individual And Neighborhood Factors On Hiv Transmission Risk Potential Among People With Hiv, Semiu Olatunde Gbadamosi

FIU Electronic Theses and Dissertations

HIV transmission risk significantly increases in late-diagnosed HIV and at HIV viral load (VL) >1500 copies/mL. The objective of this dissertation was to examine factors associated with HIV transmission risk potential for persons with HIV (PWH) using measures of time from HIV infection to diagnosis and trajectories of VL suppression. Additionally, we sought to determine whether a single yearly VL measure—the current standard to track the HIV epidemic in the United States—is reliable in assessing viral suppression for PWH. The first study estimated the distribution of time from HIV infection to diagnosis in Florida using a CD4 depletion model and …


Identifying Patterns In The Structural Drivers Of Intrastate Conflict, Jonathan D. Moyer, Austin S. Matthews, Mickey Rafa, Yutang Xiong Jan 2022

Identifying Patterns In The Structural Drivers Of Intrastate Conflict, Jonathan D. Moyer, Austin S. Matthews, Mickey Rafa, Yutang Xiong

International Studies: Faculty Scholarship

Quantitative methods have been used to: (1) better predict civil conflict onset; and (2) understand causal mechanisms to inform policy intervention and theory. However, an exploration of individual conflict onset cases illustrates great variation in the characteristics describing the outbreak of civil war, suggesting that there is not one single set of factors that lead to intrastate war. In this article, we use descriptive statistics to explore persistent clusters in the drivers of civil war onset, finding evidence that some arrangements of structural drivers cluster robustly across multiple model specifications (such as young, poorly developed states with anocratic regimes). Additionally, …


Exploring The Relationship Between Mandatory Helmet Use Regulations And Adult Cyclists’ Behavior In California Using Hybrid Machine Learning Models, Fatemeh Davoudi Kakhki, Maria Chierichetti Oct 2021

Exploring The Relationship Between Mandatory Helmet Use Regulations And Adult Cyclists’ Behavior In California Using Hybrid Machine Learning Models, Fatemeh Davoudi Kakhki, Maria Chierichetti

Mineta Transportation Institute Publications

In California, bike fatalities increased by 8.1% from 2015 to 2016. Even though the benefits of wearing helmets in protecting cyclists against trauma in cycling crash has been determined, the use of helmets is still limited, and there is opposition against mandatory helmet use, particularly for adults. Therefore, exploring perceptions of adult cyclists regarding mandatory helmet use is a key element in understanding cyclists’ behavior, and determining the impact of mandatory helmet use on their cycling rate. The goal of this research is to identify sociodemographic characteristics and cycling behaviors that are associated with the use and non-use of bicycle …


From Mathematics To Medicine: A Practical Primer On Topological Data Analysis (Tda) And The Development Of Related Analytic Tools For The Functional Discovery Of Latent Structure In Fmri Data, Andrew Salch, Adam Regalski, Hassan Abdallah, Raviteja Suryadevara, Michael J. Catanzaro, Vaibhav A. Diwadkar Aug 2021

From Mathematics To Medicine: A Practical Primer On Topological Data Analysis (Tda) And The Development Of Related Analytic Tools For The Functional Discovery Of Latent Structure In Fmri Data, Andrew Salch, Adam Regalski, Hassan Abdallah, Raviteja Suryadevara, Michael J. Catanzaro, Vaibhav A. Diwadkar

Mathematics Faculty Research Publications

fMRI is the preeminent method for collecting signals from the human brain in vivo, for using these signals in the service of functional discovery, and relating these discoveries to anatomical structure. Numerous computational and mathematical techniques have been deployed to extract information from the fMRI signal. Yet, the application of Topological Data Analyses (TDA) remain limited to certain sub-areas such as connectomics (that is, with summarized versions of fMRI data). While connectomics is a natural and important area of application of TDA, applications of TDA in the service of extracting structure from the (non-summarized) fMRI data itself are heretofore nonexistent. …


On The Use Of Minimum Penalties In Statistical Learning, Ben Sherwood, Bradley S. Price Jul 2021

On The Use Of Minimum Penalties In Statistical Learning, Ben Sherwood, Bradley S. Price

Faculty & Staff Scholarship

Modern multivariate machine learning and statistical methodologies estimate parameters of interest while leveraging prior knowledge of the association between outcome variables. The methods that do allow for estimation of relationships do so typically through an error covariance matrix in multivariate regression which does not scale to other types of models. In this article we proposed the MinPEN framework to simultaneously estimate regression coefficients associated with the multivariate regression model and the relationships between outcome variables using mild assumptions. The MinPen framework utilizes a novel penalty based on the minimum function to exploit detected relationships between responses. An iterative algorithm that …


Pivot Points In Bivariate Linear Regression, David L. Farnsworth, Carl V. Lutzer Jun 2021

Pivot Points In Bivariate Linear Regression, David L. Farnsworth, Carl V. Lutzer

Articles

There are little-noticed points in the plane, which are artifacts of linear regression. The points, which are called pivot points, are the intersections of sets of regression lines. We derive the coordinates of the pivot point and explain its sources. We show how a pivot point arises in a certain notable data set, which has been analyzed often for points of high leverage. We obtain the application of pivot points that shortens calculations when updating a set of bivariate observations by adding a new point.


A Geometric Approach To Conditioning And The Search For Minimum Variance Unbiased Estimators, David L. Farnsworth, James E. Marengo Jun 2021

A Geometric Approach To Conditioning And The Search For Minimum Variance Unbiased Estimators, David L. Farnsworth, James E. Marengo

Articles

Our purpose is twofold: to present a prototypical example of the conditioning technique to obtain the best estimator of a parameter and to show that this technique resides in the structure of an inner product space. The technique uses conditioning of an unbiased estimator on a sufficient statistic. This procedure is founded upon the conditional variance formula, which leads to an inner product space and a geometric interpretation. The example clearly illustrates the dependence on the sampling methodology. These advantages show the power and centrality of this process.


Compare And Contrast Maximum Likelihood Method And Inverse Probability Weighting Method In Missing Data Analysis, Scott Sun May 2021

Compare And Contrast Maximum Likelihood Method And Inverse Probability Weighting Method In Missing Data Analysis, Scott Sun

Mathematical Sciences Technical Reports (MSTR)

Data can be lost for different reasons, but sometimes the missingness is a part of the data collection process. Unbiased and efficient estimation of the parameters governing the response mean model requires the missing data to be appropriately addressed. This paper compares and contrasts the Maximum Likelihood and Inverse Probability Weighting estimators in an Outcome-Dependendent Sampling design that deliberately generates incomplete observations. WE demonstrate the comparison through numerical simulations under varied conditions: different coefficient of determination, and whether or not the mean model is misspecified.