Open Access. Powered by Scholars. Published by Universities.®

Statistical Models Commons

Open Access. Powered by Scholars. Published by Universities.®

Masters Theses

Discipline
Institution
Keyword
Publication Year

Articles 1 - 14 of 14

Full-Text Articles in Statistical Models

Forecasting Covid-19 With Temporal Hierarchies And Ensemble Methods, Li Shandross Aug 2023

Forecasting Covid-19 With Temporal Hierarchies And Ensemble Methods, Li Shandross

Masters Theses

Infectious disease forecasting efforts underwent rapid growth during the COVID-19 pandemic, providing guidance for pandemic response and about potential future trends. Yet despite their importance, short-term forecasting models often struggled to produce accurate real-time predictions of this complex and rapidly changing system. This gap in accuracy persisted into the pandemic and warrants the exploration and testing of new methods to glean fresh insights.

In this work, we examined the application of the temporal hierarchical forecasting (THieF) methodology to probabilistic forecasts of COVID-19 incident hospital admissions in the United States. THieF is an innovative forecasting technique that aggregates time-series data into …


Statistical Improvements For Ecological Learning About Spatial Processes, Gaetan L. Dupont Oct 2021

Statistical Improvements For Ecological Learning About Spatial Processes, Gaetan L. Dupont

Masters Theses

Ecological inquiry is rooted fundamentally in understanding population abundance, both to develop theory and improve conservation outcomes. Despite this importance, estimating abundance is difficult due to the imperfect detection of individuals in a sample population. Further, accounting for space can provide more biologically realistic inference, shifting the focus from abundance to density and encouraging the exploration of spatial processes. To address these challenges, Spatial Capture-Recapture (“SCR”) has emerged as the most prominent method for estimating density reliably. The SCR model is conceptually straightforward: it combines a spatial model of detection with a point process model of the spatial distribution of …


Real-Time Dengue Forecasting In Thailand: A Comparison Of Penalized Regression Approaches Using Internet Search Data, Caroline Kusiak Oct 2018

Real-Time Dengue Forecasting In Thailand: A Comparison Of Penalized Regression Approaches Using Internet Search Data, Caroline Kusiak

Masters Theses

Dengue fever affects over 390 million people annually worldwide and is of particu- lar concern in Southeast Asia where it is one of the leading causes of hospitalization. Modeling trends in dengue occurrence can provide valuable information to Public Health officials, however many challenges arise depending on the data available. In Thailand, reporting of dengue cases is often delayed by more than 6 weeks, and a small fraction of cases may not be reported until over 11 months after they occurred. This study shows that incorporating data on Google Search trends can improve dis- ease predictions in settings with severely …


Developing Leading And Lagging Indicators To Enhance Equipment Reliability In A Lean System, Dhanush Agara Mallesh Dec 2017

Developing Leading And Lagging Indicators To Enhance Equipment Reliability In A Lean System, Dhanush Agara Mallesh

Masters Theses

With increasing complexity in equipment, the failure rates are becoming a critical metric due to the unplanned maintenance in a production environment. Unplanned maintenance in manufacturing process is created issues with downtimes and decreasing the reliability of equipment. Failures in equipment have resulted in the loss of revenue to organizations encouraging maintenance practitioners to analyze ways to change unplanned to planned maintenance. Efficient failure prediction models are being developed to learn about the failures in advance. With this information, failures predicted can reduce the downtimes in the system and improve the throughput.

The goal of this thesis is to predict …


Juvenile River Herring In Freshwater Lakes: Sampling Approaches For Evaluating Growth And Survival, Matthew T. Devine Oct 2017

Juvenile River Herring In Freshwater Lakes: Sampling Approaches For Evaluating Growth And Survival, Matthew T. Devine

Masters Theses

River herring, collectively alewives (Alosa pseudoharengus) and blueback herring (A. aestivalis), have experienced substantial population declines over the past five decades due in large part to overfishing, combined with other sources of mortality, and disrupted access to critical freshwater spawning habitats. Anadromous river herring populations are currently assessed by counting adults in rivers during upstream spawning migrations, but no field-based assessment methods exist for estimating juvenile densities in freshwater nursery habitats. Counts of 4-year-old migrating adults are variable and prevent understanding about how mortality acts on different life stages prior to returning to spawn (e.g., juveniles …


Modelling Bird Migration With Motus Data And Bayesian State-Space Models, Justin Baldwin Oct 2017

Modelling Bird Migration With Motus Data And Bayesian State-Space Models, Justin Baldwin

Masters Theses

Bird migration is a poorly-known yet important phenomenon, as understanding movement patterns of birds can inform conservation strategies and public health policy for animal-borne diseases. Recent advances in wildlife tracking technology, in particular the Motus system, have allowed researchers to track even small flying birds and insects with radio transmitters that weigh fractions of a gram. This system relies on a community-based distributed sensor network that detects tagged animals as they move through the detection nodes on journeys that range from small local movements to intercontinental migrations. The quantity of data generated by the Motus system is unprecedented, is on …


Niche-Based Modeling Of Japanese Stiltgrass (Microstegium Vimineum) Using Presence-Only Information, Nathan Bush Nov 2015

Niche-Based Modeling Of Japanese Stiltgrass (Microstegium Vimineum) Using Presence-Only Information, Nathan Bush

Masters Theses

The Connecticut River watershed is experiencing a rapid invasion of aggressive non-native plant species, which threaten watershed function and structure. Volunteer-based monitoring programs such as the University of Massachusetts’ OutSmart Invasives Species Project, Early Detection Distribution Mapping System (EDDMapS) and the Invasive Plant Atlas of New England (IPANE) have gathered valuable invasive plant data. These programs provide a unique opportunity for researchers to model invasive plant species utilizing citizen-sourced data. This study took advantage of these large data sources to model invasive plant distribution and to determine environmental and biophysical predictors that are most influential in dispersion, and to identify …


Modelling Supercomputer Maintenance Interrupts: Maintenance Policy Recommendations, Jagadish Cherukuri Aug 2015

Modelling Supercomputer Maintenance Interrupts: Maintenance Policy Recommendations, Jagadish Cherukuri

Masters Theses

A supercomputer is a repairable system with large number of compute nodes interconnected to work in harmony to achieve superior computational performance. Reliability of such a complex system depends on an effective maintenance strategy that involves both emergency and preventive maintenance. This thesis analyzes the maintenance records of four supercomputers operational at The National Institute of Computational Science located at Oak Ridge National Laboratory. We propose to use the generalized proportional intensities model (GPIM) to model the maintenance interrupts as it can capture both the reliability parameters and maintenance parameters and allows the inclusion of both emergency and preventive maintenance. …


Identifying The Spatial Distribution Of Three Plethodontid Salamanders In Great Smoky Mountains National Park Using Two Habitat Modeling Methods, Matthew Stephen Kookogey May 2012

Identifying The Spatial Distribution Of Three Plethodontid Salamanders In Great Smoky Mountains National Park Using Two Habitat Modeling Methods, Matthew Stephen Kookogey

Masters Theses

The main objective was to create habitat models of three plethodontid salamander species (Desmognathus conanti, D. ocoee, and Plethodon jordani) in GSMNP. To investigate the relationships between salamanders and their habitats, I used three models—logistic regression with use-availability sampling, logistic regression with case-control sampling, and Mahalanobis distance (D2)—for each species to gain a robust view of the relationships. The secondary objective was to compare the different modeling methods within and across the three species. Elevation was the dominant variable for all three species.

D2 for D. conanti predicted low elevations, close proximity …


Mixture Model Cluster Analysis Under Different Covariance Structures Using Information Complexity, Bahar Erar Aug 2011

Mixture Model Cluster Analysis Under Different Covariance Structures Using Information Complexity, Bahar Erar

Masters Theses

In this thesis, a mixture-model cluster analysis technique under different covariance structures of the component densities is developed and presented, to capture the compactness, orientation, shape, and the volume of component clusters in one expert system to handle Gaussian high dimensional heterogeneous data sets to achieve flexibility in currently practiced cluster analysis techniques. Two approaches to parameter estimation are considered and compared; one using the Expectation-Maximization (EM) algorithm and another following a Bayesian framework using the Gibbs sampler. We develop and score several forms of the ICOMP criterion of Bozdogan (1994, 2004) as our fitness function; to choose the number …


A Study Of Missing Data Imputation And Predictive Modeling Of Strength Properties Of Wood Composites, Yan Zeng Aug 2011

A Study Of Missing Data Imputation And Predictive Modeling Of Strength Properties Of Wood Composites, Yan Zeng

Masters Theses

Problem: Real-time process and destructive test data were collected from a wood composite manufacturer in the U.S. to develop real-time predictive models of two key strength properties (Modulus of Rupture (MOR) and Internal Bound (IB)) of a wood composite manufacturing process. Sensor malfunction and data “send/retrieval” problems lead to null fields in the company’s data warehouse which resulted in information loss. Many manufacturers attempt to build accurate predictive models excluding entire records with null fields or using summary statistics such as mean or median in place of the null field. However, predictive model errors in validation may be higher …


Bayesian Logistic Regression Model For Siting Biomass-Using Facilities, Xia Huang Dec 2010

Bayesian Logistic Regression Model For Siting Biomass-Using Facilities, Xia Huang

Masters Theses

Key sources of oil for western markets are located in complex geopolitical environments that increase economic and social risk. The amalgamation of economic, environmental, social and national security concerns for petroleum-based economies have created a renewed emphasis on alternative sources of energy which include biomass. The stability of sustainable biomass markets hinges on improved methods to predict and visualize business risk and cost to the supply chain.

This thesis develops Bayesian logistic regression models, with comparisons of classical maximum likelihood models, to quantify significant factors that influence the siting of biomass-using facilities and predict potential locations in the 13-state Southeastern …


A Monte Carlo Model Of Uncertainty In A Deterministic Hazardous Waste Transportation Risk Assessment, Michael A. Cowen Jan 1997

A Monte Carlo Model Of Uncertainty In A Deterministic Hazardous Waste Transportation Risk Assessment, Michael A. Cowen

Masters Theses

This thesis is aimed at developing and applying advanced modeling tools in the prediction of risk to the general public from transportation of chemical waste on public highways. The modeling tools developed can then be used to compare alternative waste management scenarios. The application considered is related to the transport of hazardous waste generated by the United States Department of Energy (DOE) to current treatment, storage, and disposal facilities. DOE is currently considering four different scenarios.

The application considered can be more specifically defined as an analysis of the risk to the general public from transporting the 63 shipments of …


A Statistical Approach To The Study Of The Ecosystems Of Seven Ponds In East-Central Illinois, Gregory Lee Orr Jan 1975

A Statistical Approach To The Study Of The Ecosystems Of Seven Ponds In East-Central Illinois, Gregory Lee Orr

Masters Theses

Gross primary productivity, heterotrophic bacterial numbers, and net phytoplankton densities of seven ponds in Coles County, Illinois, were studied in relation to physical, chemical, and biological habitat variables (light intensity and duration, turbidity, water temperature, pH, dissolved oxygen, sulfur, nitrogen, phosphorus, production, bacteria, and phytoplankton). Ten observations were made for each pond (except where otherwise noted) from 17 June through 25 August 1974. Stepwise multiple linear regression analyses of the data were used in order to determine those environmental factors which were important in predicting (i.e., significantly correlated with) bacterial and phytoplankton densities, and production. A multiple linear regression equation …