Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Statistical Models

Series

2020

Institution
Keyword
Publication
File Type

Articles 1 - 25 of 25

Full-Text Articles in Physical Sciences and Mathematics

Development Of A Statistical Model To Predict Materials’ Unit Prices For Future Maintenance And Rehabilitation In Highway Life Cycle Cost Analysis, Changmo Kim, Ghazan Khan, Brent Nguyen, Emily L. Hoang Dec 2020

Development Of A Statistical Model To Predict Materials’ Unit Prices For Future Maintenance And Rehabilitation In Highway Life Cycle Cost Analysis, Changmo Kim, Ghazan Khan, Brent Nguyen, Emily L. Hoang

Mineta Transportation Institute

The main objectives of this study are to investigate the trends in primary pavement materials’ unit price over time and to develop statistical models and guidelines for using predictive unit prices of pavement materials instead of uniform unit prices in life cycle cost analysis (LCCA) for future maintenance and rehabilitation (M&R) projects. Various socio-economic data were collected for the past 20 years (1997–2018) in California, including oil price, population, government expenditure in transportation, vehicle registration, and other key variables, in order to identify factors affecting pavement materials’ unit price. Additionally, the unit price records of the popular pavement materials were …


Statistical Modeling Of Private Sector Participation In Disaster Risk Reduction Data, Wupeng Yin Nov 2020

Statistical Modeling Of Private Sector Participation In Disaster Risk Reduction Data, Wupeng Yin

FIU Electronic Theses and Dissertations

The impacts of disaster on the private sector are inevitable, but their risks can be managed and reduced by preventively evaluative measures. Disaster risk reduction index (DRRI) and Disaster Experience (DE) variables were investigated in a survey study in six Western Hemisphere cities within the private sector of various business sizes. Our thesis built and evaluated 16 predictive models of DRRI with 36 categorical predictors and N = 1162 observations. Four statistical methods for linear regression and five for classification as well as seven machine learning methods were utilized. We also used stepwise selection and regulation methods for variable selection. …


Interval Estimation Of Proportion Of Second-Level Variance In Multi-Level Modeling, Steven Svoboda Oct 2020

Interval Estimation Of Proportion Of Second-Level Variance In Multi-Level Modeling, Steven Svoboda

The Nebraska Educator: A Student-Led Journal

Physical, behavioral and psychological research questions often relate to hierarchical data systems. Examples of hierarchical data systems include repeated measures of students nested within classrooms, nested within schools and employees nested within supervisors, nested within organizations. Applied researchers studying hierarchical data structures should have an estimate of the intraclass correlation coefficient (ICC) for every nested level in their analyses because ignoring even relatively small amounts of interdependence is known to inflate Type I error rate in single-level models. Traditionally, researchers rely upon the ICC as a point estimate of the amount of interdependency in their data. Recent methods utilizing an …


Cost Estimating Using A New Learning Curve Theory For Non-Constant Production Rates, Dakotah Hogan, John J. Elshaw, Clay M. Koschnick, Jonathan D. Ritschel, Adedeji B. Badiru, Shawn M. Valentine Oct 2020

Cost Estimating Using A New Learning Curve Theory For Non-Constant Production Rates, Dakotah Hogan, John J. Elshaw, Clay M. Koschnick, Jonathan D. Ritschel, Adedeji B. Badiru, Shawn M. Valentine

Faculty Publications

Traditional learning curve theory assumes a constant learning rate regardless of the number of units produced. However, a collection of theoretical and empirical evidence indicates that learning rates decrease as more units are produced in some cases. These diminishing learning rates cause traditional learning curves to underestimate required resources, potentially resulting in cost overruns. A diminishing learning rate model, namely Boone’s learning curve, was recently developed to model this phenomenon. This research confirms that Boone’s learning curve systematically reduced error in modeling observed learning curves using production data from 169 Department of Defense end-items. However, high amounts of variability in …


Statistical Methodology To Establish A Benchmark For Evaluating Antimicrobial Resistance Genes Through Real Time Pcr Assay, Enakshy Dutta Jul 2020

Statistical Methodology To Establish A Benchmark For Evaluating Antimicrobial Resistance Genes Through Real Time Pcr Assay, Enakshy Dutta

Department of Statistics: Dissertations, Theses, and Student Work

Novel diagnostic tests are usually compared with gold standard tests for evaluating diagnostic accuracy. For assessing antimicrobial resistance (AMR) to bovine respiratory disease (BRD) pathogens, phenotypic broth microdilution method is used as gold standard (GS). The objective of the thesis is to evaluate the optimal cycle threshold (Ct) generated by real-time polymerase chain reaction (rtPCR) to genes that confer resistance that will translate to the phenotypic classification of AMR. Data from two different methodologies are assessed to identify Ct that will discriminate between resistance (R) and susceptibility (S). First, the receiver operating characteristic (ROC) curve was used to determine the …


Working Children On Java Island 2017, Yuniarti Jul 2020

Working Children On Java Island 2017, Yuniarti

English Language Institute

Children's wellbeing has currently become a global concern as many of them are engaged in the labor force. A small area estimation (SAE) technique, EBLUP under Fey Herriot model, is employed to reveal their number in regencies of Java Island. Statistics have been disaggregated by geographical location (urban/rural) and gender. These statistics are required by the government as the basis for policy making.


Mathematical Modeling: Instructor And Student Resources, Marnie Phipps, Patty Wagner Jul 2020

Mathematical Modeling: Instructor And Student Resources, Marnie Phipps, Patty Wagner

Mathematics Ancillary Materials

This collection of student and instructor materials for Mathematical Modeling contains lesson plans, lecture slides, homework, learning goals, and student notes for the following major topics:

  • Linear Functions
  • Quadratic Functions
  • Exponential Functions
  • Logarithmic Functions

This is a materials update for a collection of materials created for a Round Nine ALG Textbook Transformation Grant.


Italian Sociologists: A Community Of Disconnected Groups, Aliakbar Akbaritabar, Vincent Traag, Alberto Caimo, Flaminio Squazzoni Jul 2020

Italian Sociologists: A Community Of Disconnected Groups, Aliakbar Akbaritabar, Vincent Traag, Alberto Caimo, Flaminio Squazzoni

Articles

Examining coauthorship networks is key to study scientific collaboration patterns and structural characteristics of scientific communities. Here, we studied coauthorship networks of sociologists in Italy, using temporal and multi-level quantitative analysis. By looking at publications indexed in Scopus, we detected research communities among Italian sociologists. We found that Italian sociologists are fractured in many disconnected groups. The giant connected component of the Italian sociology could be split into five main groups with a mixture of three main disciplinary topics: sociology of culture and communication (present in two groups), economic sociology (present in three groups) and general sociology (present in three …


Constitutive Model Of Lateral Unloading Creep Of Soft Soil Under Excess Pore Water Pressure, Wei Huang, Kejun Wen, Xiaojia Deng, Junjie Li, Zhijian Jiang, Yang Li, Lin Li, Farshad Amini Jun 2020

Constitutive Model Of Lateral Unloading Creep Of Soft Soil Under Excess Pore Water Pressure, Wei Huang, Kejun Wen, Xiaojia Deng, Junjie Li, Zhijian Jiang, Yang Li, Lin Li, Farshad Amini

Civil and Architectural Engineering Faculty Research

Presented in this paper is a study on the lateral unloading creep tests under different excess pore water pressures. The marine sedimentary soft soil in Shenzhen, China, was selected in this study. The results show that the excess pore water pressure plays a significant role in enhancing the unloading creep of soft soil. Higher excess pore water pressure brings more obvious creep deformation of soft soil and lower ultimate failure load. Meanwhile, the viscoelastic and the viscoplastic modulus of soft soil were found to exponentially decline with creep time. A modified merchant model and a combined model of the modified …


Decision Tree For Predicting The Party Of Legislators, Afsana Mimi May 2020

Decision Tree For Predicting The Party Of Legislators, Afsana Mimi

Publications and Research

The motivation of the project is to identify the legislators who voted frequently against their party in terms of their roll call votes using Office of Clerk U.S. House of Representatives Data Sets collected in 2018 and 2019. We construct a model to predict the parties of legislators based on their votes. The method we used is Decision Tree from Data Mining. Python was used to collect raw data from internet, SAS was used to clean data, and all other calculations and graphical presentations are performed using the R software.


Analyzing Competitive Balance In Professional Sport, Kevin Alwell May 2020

Analyzing Competitive Balance In Professional Sport, Kevin Alwell

Honors Scholar Theses

In this paper we review several measures to statistically analyze competitive balance and report which leagues have a wider variance of performance amongst its competitors. Each league seeks to maintain high levels of parity, making matches and overall season more unpredictable and appealing to the general audience. Here we quantify competitive advantage across major sports leagues in numbers using several statistical methods in order for leagues to optimize their revenue.


Interdependence Across Foreign Exchange Rate Markets- A Mixed Copula Approach, Richard Adjei-Boateng Apr 2020

Interdependence Across Foreign Exchange Rate Markets- A Mixed Copula Approach, Richard Adjei-Boateng

Masters Theses & Specialist Projects

The purpose of this thesis is to study the dependence structure of exchange rate pairs using a mixture of copula as opposed to a single copula approach. Mixed copula models have the ability to generate dependence structures that do not belong to existing copula families. The flexibility in choosing component copulas in this mixture model aids the construction of a system that is simultaneously parsimonious and flexible enough to generate most dependence patterns in exchange rate data. Furthermore, the method of mixture copulas facilitates the separation of both the structure and degree of dependence, concepts that are respectively embodied in …


A Monte Carlo Analysis Of Standard Error-Based Methods For Computing Confidence Intervals, Elayna Wichert Apr 2020

A Monte Carlo Analysis Of Standard Error-Based Methods For Computing Confidence Intervals, Elayna Wichert

Masters Theses & Specialist Projects

The objective of this study is to empirically test existing techniques to calculate the likely range of values for a Classical Test Theory true score given an observed score. The traditional method for forming these confidence intervals has used the standard error of measurement (SEM) as the basis for this confidence interval. An alternate equation, the standard error of estimate (SEE), has been recommended in place of the SEM for this purpose, yet it remains overlooked in the field of psychometrics. It is important that the correct equation be used in various applications in personnel psychology. Monte Carlo analyses were …


Development Of Gaussian Learning Algorithms For Early Detection Of Alzheimer's Disease, Chen Fang Mar 2020

Development Of Gaussian Learning Algorithms For Early Detection Of Alzheimer's Disease, Chen Fang

FIU Electronic Theses and Dissertations

Alzheimer’s disease (AD) is the most common form of dementia affecting 10% of the population over the age of 65 and the growing costs in managing AD are estimated to be $259 billion, according to data reported in the 2017 by the Alzheimer's Association. Moreover, with cognitive decline, daily life of the affected persons and their families are severely impacted. Taking advantage of the diagnosis of AD and its prodromal stage of mild cognitive impairment (MCI), an early treatment may help patients preserve the quality of life and slow the progression of the disease, even though the underlying disease cannot …


Inferences For Weibull-Gamma Distribution In Presence Of Partially Accelerated Life Test, Mahmoud Mansour, M A W Mahmoud Prof., Rashad El-Sagheer Mar 2020

Inferences For Weibull-Gamma Distribution In Presence Of Partially Accelerated Life Test, Mahmoud Mansour, M A W Mahmoud Prof., Rashad El-Sagheer

Basic Science Engineering

In this paper, the point at issue is to deliberate point and interval estimations for the parameters of Weibull-Gamma distribution (WGD) using progressively Type-II censored (PROG-II-C) sample under step stress partially accelerated life test (SSPALT) model. The maximum likelihood (ML), Bayes, and four parametric bootstrap methods are used to obtain the point estimations for the distribution parameters and the acceleration factor. Furthermore, the approximate confidence intervals (ACIs), four bootstrap confidence intervals and credible intervals of the estimators have been gotten. The results of Bayes estimators are computed under the squared error loss (SEL) function using Markov Chain Monte Carlo (MCMC) …


Measuring Localization Confidence For Quantifying Accuracy And Heterogeneity In Single-Molecule Super-Resolution Microscopy, Hesam Mazidi, Tianben Ding, Arye Nehorai, Matthew D. Lew Feb 2020

Measuring Localization Confidence For Quantifying Accuracy And Heterogeneity In Single-Molecule Super-Resolution Microscopy, Hesam Mazidi, Tianben Ding, Arye Nehorai, Matthew D. Lew

Electrical & Systems Engineering Publications and Presentations

We present a computational method, termed Wasserstein-induced flux (WIF), to robustly quantify the accuracy of individual localizations within a single-molecule localization microscopy (SMLM) dataset without ground- truth knowledge of the sample. WIF relies on the observation that accurate localizations are stable with respect to an arbitrary computational perturbation. Inspired by optimal transport theory, we measure the stability of individual localizations and develop an efficient optimization algorithm to compute WIF. We demonstrate the advantage of WIF in accurately quantifying imaging artifacts in high-density reconstruction of a tubulin network. WIF represents an advance in quantifying systematic errors with unknown and complex distributions, …


An Automatic Interaction Detection Hybrid Model For Bankcard Response Classification, Yan Wang, Sherry Ni, Brian Stone Jan 2020

An Automatic Interaction Detection Hybrid Model For Bankcard Response Classification, Yan Wang, Sherry Ni, Brian Stone

Published and Grey Literature from PhD Candidates

Data mining techniques have numerous applications in bankcard response modeling. Logistic regression has been used as the standard modeling tool in the financial industry because of its almost always desirable performance and its interpretability. In this paper, we propose a hybrid bankcard response model, which integrates decision tree-based chi-square automatic interaction detection (CHAID) into logistic regression. In the first stage of the hybrid model, CHAID analysis is used to detect the possible potential variable interactions. Then in the second stage, these potential interactions are served as the additional input variables in logistic regression. The motivation of the proposed hybrid model …


A Two-Stage Hybrid Model By Using Artificial Neural Networks As Feature Construction Algorithms, Yan Wang, Sherry Ni, Brian Stone Jan 2020

A Two-Stage Hybrid Model By Using Artificial Neural Networks As Feature Construction Algorithms, Yan Wang, Sherry Ni, Brian Stone

Published and Grey Literature from PhD Candidates

We propose a two-stage hybrid approach with neural networks as the new feature construction algorithms for bankcard response classifications. The hybrid model uses a very simple neural network structure as the new feature construction tool in the first stage, then the newly created features are used as the additional input variables in logistic regression in the second stage. The model is compared with the traditional one-stage model in credit customer response classification. It is observed that the proposed two-stage model outperforms the one-stage model in terms of accuracy, the area under the ROC curve, and KS statistic. By creating new …


Predicting Class-Imbalanced Business Risk Using Resampling, Regularization, And Model Ensembling Algorithms, Yan Wang, Sherry Ni Jan 2020

Predicting Class-Imbalanced Business Risk Using Resampling, Regularization, And Model Ensembling Algorithms, Yan Wang, Sherry Ni

Published and Grey Literature from PhD Candidates

We aim at developing and improving the imbalanced business risk modeling via jointly using proper evaluation criteria, resampling, cross-validation, classifier regularization, and ensembling techniques. Area Under the Receiver Operating Characteristic Curve (AUC of ROC) is used for model comparison based on 10-fold cross-validation. Two undersampling strategies including random undersampling (RUS) and cluster centroid undersampling (CCUS), as well as two oversampling methods including random oversampling (ROS) and Synthetic Minority Oversampling Technique (SMOTE), are applied. Three highly interpretable classifiers, including logistic regression without regularization (LR), L1-regularized LR (L1LR), and decision tree (DT) are implemented. Two ensembling techniques, including Bagging and Boosting, are …


A Xgboost Risk Model Via Feature Selection And Bayesian Hyper-Parameter Optimization, Yan Wang, Sherry Ni Jan 2020

A Xgboost Risk Model Via Feature Selection And Bayesian Hyper-Parameter Optimization, Yan Wang, Sherry Ni

Published and Grey Literature from PhD Candidates

This paper aims to explore models based on the extreme gradient boosting (XGBoost) approach for business risk classification. Feature selection (FS) algorithms and hyper-parameter optimizations are simultaneously considered during model training. The five most commonly used FS methods including weight by Gini, weight by Chi-square, hierarchical variable clustering, weight by correlation, and weight by information are applied to alleviate the effect of redundant features. Two hyper-parameter optimization approaches, random search (RS) and Bayesian tree-structuredParzen Estimator (TPE), are applied in XGBoost. The effect of different FS and hyper-parameter optimization methods on the model performance are investigated by the Wilcoxon Signed Rank …


Shrinkage Priors For Isotonic Probability Vectors And Binary Data Modeling, Philip S. Boonstra, Daniel R. Owen, Jian Kang Jan 2020

Shrinkage Priors For Isotonic Probability Vectors And Binary Data Modeling, Philip S. Boonstra, Daniel R. Owen, Jian Kang

The University of Michigan Department of Biostatistics Working Paper Series

This paper outlines a new class of shrinkage priors for Bayesian isotonic regression modeling a binary outcome against a predictor, where the probability of the outcome is assumed to be monotonically non-decreasing with the predictor. The predictor is categorized into a large number of groups, and the set of differences between outcome probabilities in consecutive categories is equipped with a multivariate prior having support over the set of simplexes. The Dirichlet distribution, which can be derived from a normalized cumulative sum of gamma-distributed random variables, is a natural choice of prior, but using mathematical and simulation-based arguments, we show that …


Multi-Variable Theme Park Analysis, Timothy Johnson Jan 2020

Multi-Variable Theme Park Analysis, Timothy Johnson

Research and Scholarship Symposium Posters

The purpose of this data analysis is to better understand how analytics can be used to look at different problems such as the one indicated in this multi-variable question. When is the best time of year to visit the theme parks at Walt Disney World? The analysis aims to look at multiple factors that will influence the answer to the question. The goal is to look at these multiple factors and run some probability and statistics tests in order to come to a conclusion on the data. This data along with other key factors can lead to the best possible …


Modelling Interactions Among Offenders: A Latent Space Approach For Interdependent Ego-Networks, Isabella Gollini, Alberto Caimo, Paolo Campana Jan 2020

Modelling Interactions Among Offenders: A Latent Space Approach For Interdependent Ego-Networks, Isabella Gollini, Alberto Caimo, Paolo Campana

Articles

Illegal markets are notoriously difficult to study. Police data offer an increasingly exploited source of evidence. However, their secondary nature poses challenges for researchers. A key issue is that researchers often have to deal with two sets of actors: targeted and non-targeted. This work develops a latent space model for interdependent ego-networks purposely created to deal with the targeted nature of police evidence. By treating targeted offenders as egos and their contacts as alters, the model (a) leverages on the full information available and (b) mirrors the specificity of the data collection strategy. The paper then applies this approach to …


Enhancing Models And Measurements Of Traffic-Related Air Pollutants For Health Studies Using Dispersion Modeling And Bayesian Data Fusion, Stuart A. Batterman, Veronica J. Berrocal, Chad Milando, Owais Gilani, Saravanan Arunachalam, K. Max Zhang Jan 2020

Enhancing Models And Measurements Of Traffic-Related Air Pollutants For Health Studies Using Dispersion Modeling And Bayesian Data Fusion, Stuart A. Batterman, Veronica J. Berrocal, Chad Milando, Owais Gilani, Saravanan Arunachalam, K. Max Zhang

Faculty Journal Articles

Research Report 202 describes a study led by Dr. Stuart Batterman at the University of Michigan, Ann Arbor and colleagues. The investigators evaluated the ability to predict traffic-related air pollution using a variety of methods and models, including a line source air pollution dispersion model and sophisticated spatiotemporal Bayesian data fusion methods. Exposure assessment for traffic-related air pollution is challenging because the pollutants are a complex mixture and vary greatly over space and time. Because extensive direct monitoring is difficult and expensive, a number of modeling approaches have been developed, but each model has its own limitations and errors.

Dr. …


Sex And Age Differences In Prevalence And Risk Factors For Prediabetes In Mexican-Americans, Kristina Vatcheva, Belinda M. Reininger, Susan P. Fisher-Hoch, Joseph B. Mccormick Jan 2020

Sex And Age Differences In Prevalence And Risk Factors For Prediabetes In Mexican-Americans, Kristina Vatcheva, Belinda M. Reininger, Susan P. Fisher-Hoch, Joseph B. Mccormick

School of Mathematical and Statistical Sciences Faculty Publications and Presentations

AIMS:

Over 1/3 of Americans have prediabetes, while 9.4% have type 2 diabetes. The aim of our study was to estimate the prevalence of prediabetes in Mexican Americans, with known 28.2% prevalence of type 2 diabetes, by age and sex and to identify critical socio-demographic and clinical factors associated with prediabetes.

METHODS:

Data were collected between 2004 and 2017 from the Cameron County Hispanic Cohort in Texas. Weighted crude and sex- and age- stratified prevalences were calculated. Survey weighted logistic regression analyses were conducted to identify risk factors for prediabetes.

RESULTS:

The prevalence of prediabetes (32%) was slightly higher than …