Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Theses/Dissertations

Statistics and Probability

2017

Articles 1 - 30 of 243

Full-Text Articles in Physical Sciences and Mathematics

Of Rats And Men, Thomas S. Walsh Dec 2017

Capstones

This capstone is a data-driven investigation into New York City's rat problem. By using publicly available government data to map rat activity in NYC, I identified several socio-economic variables that correlate with rat populations at the community district, borough, and city scales. I used these findings (mainly that rat problems are linked to lower incomes) as the basis of an investigation, which includes interviews with residents, experts, and city officials. Prof. Bobby Corrigan, an urban rodentologist formerly with the NYC Department of Health, criticizes the city's efforts for the first time on the record.

https://thomasseiyawalsh.wixsite.com/ratstone


Seasonal Resource Selection And Habitat Treatment Use By A Fringe Population Of Greater Sage-Grouse, Rhett Boswell Dec 2017

All Graduate Plan B and other Reports, Spring 1920 to Spring 2023

Movement and habitat selection by Greater Sage-grouse (Centrocercus urophasianus) are of great interest to wildlife managers tasked with applying conservation measures for this iconic western species. Current technology has produced small and lightweight GPS (Global Positioning System) transmitters that can be attached to sage-grouse. Using GIS software and statistical programs such as Program R, land managers can analyze GPS location data to assess how sage-grouse are geospatially interacting with their habitats. Within the Panguitch Sage-Grouse Management Area (SGMA), thousands of acres of land have been restored or manipulated to enhance sage-grouse habitat; this usually involves removal of pinyon pine …


Cellulose Nanofiber-Reinforced Impact Modified Polypropylene: Assessing Material Properties From Fused Layer Modeling And Injection Molding Processing, Jordan Elliott Sanders Dec 2017

Electronic Theses and Dissertations

The purpose of this research was to investigate the use of cellulose nanofibers (CNF) compounded into an impact modified polypropylene (IMPP) matrix. An IMPP was used because it shrinks less than a PP homopolymer during FLM processing. An assessment of material properties from fused layer modeling (FLM), an additive manufacturing (AM) method, and injection molding (IM) was conducted. Results showed that material property measurements in neat PP were statistically similar between IM and FLM for density, strain at yield, and flexural stiffness. Additionally, PP plus the coupling agent maleic anhydride (MA) showed statistically similar results in comparison of IM and …


Statistical Analysis Of Momentum In Basketball, Mackenzi Stump Dec 2017

Honors Projects

The “hot hand” in sports has been debated for as long as sports have been around. The debate involves whether streaks and slumps in sports are true phenomena or just simply perceptions in the mind of the human viewer. This statistical analysis of momentum in basketball analyzes the distribution of time between scoring events for the BGSU Women’s Basketball team from 2011-2017. We discuss how the distribution of time between scoring events changes with normal game factors such as location of the game, game outcome, and several other factors. If scoring events during a game were always randomly distributed, or …


Sample Size Calculations And Normalization Methods For RNA-Seq Data., Xiaohong Li Dec 2017

Electronic Theses and Dissertations

High-throughput RNA sequencing (RNA-seq) has become the preferred choice for transcriptomics and gene expression studies. With the rapid growth of RNA-seq applications, sample size calculation methods for RNA-seq experiment design and data normalization methods for DEG analysis are important issues to be explored and discussed. The underlying theme of this dissertation is to develop novel sample size calculation methods in RNA-seq experiment design using test statistics. I have also proposed two novel normalization methods for analysis of RNA-seq data. In chapter one, I present the test statistical methods including Wald’s test, log-transformed Wald’s test and likelihood ratio test statistics for …


Developing Leading And Lagging Indicators To Enhance Equipment Reliability In A Lean System, Dhanush Agara Mallesh Dec 2017

Masters Theses

As equipment grows more complex, failure rates are becoming a critical metric because of the unplanned maintenance they cause in a production environment. Unplanned maintenance in a manufacturing process creates downtime and decreases the reliability of equipment. Equipment failures have resulted in lost revenue for organizations, encouraging maintenance practitioners to look for ways to turn unplanned maintenance into planned maintenance. Efficient failure prediction models are being developed to learn about failures in advance. With this information, predicted failures can be used to reduce downtime in the system and improve throughput.

The goal of this thesis is to predict …


Statistical And Clinical Equivalence Of Measurements, Puntipa Wanitjirattikal Dec 2017

Dissertations

This study proposes a test for statistical equivalence of two measurements. Typically, a new measurement process Y is compared to an existing or standard measurement process X. We are assuming that X and Y are measurements on the same scale. The paired t-test may be used to check for significant difference between (X, Y) pairs. However, the paired t-test is intended to detect shift-type relationships of the form Y = X + δ₁ and may have low power for scale-type relations of the form Y = δ₂X.

We propose a test that has reasonable power to …


Diagnostics For Choosing Between Stratified Logrank And Stratified Wilcoxon, Jhoanne Marsh C. Gatpatan Dec 2017

Dissertations

Martinez and Naranjo (2010) proposed a pretest for choosing between the Logrank and Wilcoxon tests in the two-sample case. However, in the presence of covariates, comparing two populations without adjusting for covariates would yield misleading results. In this study, we propose several pretests that help the analyst decide whether to use the stratified Logrank or the stratified Wilcoxon test when comparing two survival curves after covariates have been taken into account. The power of each adaptive test was assessed through simulations under PH and non-PH cases.


Bayesian Model For Detection Of Outliers In Linear Regression With Application To Longitudinal Data, Zahraa Al-Sharea Dec 2017

Graduate Theses and Dissertations

Outlier detection is one of the most important challenges in many present-day applications. Outliers can occur due to uncertainty in data generating mechanisms or due to an error in data recording/processing. Outliers can drastically change a study's results and make predictions less reliable. Detecting outliers in longitudinal studies is quite challenging because this kind of study works with observations that change over time. Therefore, the same subject can produce an outlier at one point in time and produce regular observations at all other time points. Bayesian hierarchical modeling assigns parameters that can quantify whether each observation is an outlier …


Reassessment Of The Red Drum Stock In Mississippi Coastal Waters: The Role Of Ages 3-5 Year-Class Fish, Emily Satterfield Dec 2017

Master's Theses

Red Drum, Sciaenops ocellatus, are highly sought after by sport fishermen in Mississippi coastal waters. In 2016, Mississippi anglers made over 180,000 fishing trips targeting Red Drum, making it the second most targeted marine species. The current Fishery Management Plan of the Gulf of Mexico Fishery Management Council prohibits harvest of Red Drum in federal waters. Monitoring of the stock in Mississippi state waters occurs at sites that are almost exclusively estuarine, using gear types selective for juvenile fish. Additional samples come from the for-hire industry, which typically targets larger Red Drum. This project’s goal was to target age three …


Novel Statistical Models For Quantitative Shape-Gene Association Selection, Xiaotian Dai Dec 2017

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Prior research has reported that genetic mechanisms play a major role in the development of biological shapes. The primary goal of this dissertation is to develop novel statistical models to investigate the quantitative relationships between biological shapes and genetic variants. However, these problems can be extremely challenging for traditional statistical models for a number of reasons: 1) the biological phenotypes cannot be effectively represented by single-valued traits, while traditional regression only handles one dependent variable; 2) in real-life genetic data, the number of candidate genes to be investigated is extremely large, and the signal-to-noise ratio of candidate genes is expected …


Application Of Machine Learning And Statistical Learning Methods For Prediction In A Large-Scale Vegetation Map, Carla M. Brookey Dec 2017

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Original analyses of a large vegetation cover dataset from Roosevelt National Forest in northern Colorado were carried out by Blackard (1998) and Blackard and Dean (1998; 2000). They compared the classification accuracies of linear and quadratic discriminant analysis (LDA and QDA) with artificial neural networks (ANN) and obtained an overall classification accuracy of 70.58% for a tuned ANN compared to 58.38% for LDA and 52.76% for QDA.

Because there has been tremendous development of machine learning classification methods over the last 35 years in both computer science and statistics, as well as substantial improvements in the speed of computer hardware, …


Extracting And Visualizing Data From Mobile And Static Eye Trackers In R And Matlab, Chunyang Li Dec 2017

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Eye tracking is the process of measuring where people are looking with an eye tracker device. Eye tracking has been used in many scientific fields, such as education, usability research, sports, psychology, and marketing. Eye tracking data are often obtained from a static eye tracker or are manually extracted from a mobile eye tracker. Visualization usually plays an important role in the analysis of eye tracking data. So far, there has been no software package that contains a whole collection of eye tracking data processing and visualization tools. In this dissertation, we review the eye tracking technology, the eye tracking …


Exact Approaches For Bias Detection And Avoidance With Small, Sparse, Or Correlated Categorical Data, Sarah E. Schwartz Dec 2017

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Every day, traditional statistical methodologies are used worldwide to study a variety of topics and provide insight into countless subjects. Each technique is based on a distinct set of assumptions to ensure valid results. Additionally, many statistical approaches rely on large-sample behavior and may collapse or degenerate in the presence of small, sparse, or correlated data. This dissertation details several advancements to detect these conditions, avoid their consequences, and analyze data in a different way to yield trustworthy results.

One of the most commonly used modeling techniques for outcomes with only two possible categorical values (e.g., live/die, pass/fail, …


The Role Of Pre-Existing Type 2 Diabetes Mellitus In Colorectal Cancer Stage And Survival In Elderly Americans: A SEER-Medicare Population-Based Study 2002-2011, Sanae El Ibrahimi Dec 2017

UNLV Theses, Dissertations, Professional Papers, and Capstones

Diabetes is a common comorbid condition among colorectal cancer (CRC) patients, yet its effects on CRC outcomes, particularly stage at diagnosis, risk of death, and variations by diabetes severity (complications vs. no complications) and Hispanic ethnicity, have not been adequately studied. The purpose of this study was to investigate the association between pre-existing T2DM and advanced stage at diagnosis in elderly patients with CRC; to examine whether diabetes is an independent predictor of poor survival from all-cause and CRC-specific mortality; to assess whether variations exist by diabetes severity; and to analyze the outcomes for the Hispanic group.

The Surveillance Epidemiology …


The Impact Of Economic Recession On The Health Of Adult Nevadans, Ariana Goertz Dec 2017

UNLV Theses, Dissertations, Professional Papers, and Capstones

Recessions are generally considered to cause negative consequences, but recent studies have provided evidence that some health outcomes improve as the economy deteriorates. The relationship between economic downturns and health is not straightforward; it is important to look at how health has been impacted in one of the areas hit hardest by the recession. Las Vegas, Nevada was previously considered recession-proof, seemingly unaffected by earlier economic downturns experienced by the rest of the country. However, during the Great Recession of 2007-2009, Las Vegas led the country in rates of unemployment and foreclosures. This was quite a collapse for a …


Functional Data Analysis Methods For Predicting Disease Status., Sarah Kendrick Dec 2017

Electronic Theses and Dissertations

Introduction: Differential scanning calorimetry (DSC) is used to determine thermally-induced conformational changes of biomolecules within a blood plasma sample. Recent research has indicated that DSC curves (or thermograms) may have different characteristics based on disease status and, thus, may be useful as a monitoring and diagnostic tool for some diseases. Since thermograms are curves measured over a range of temperature values, they are often considered as functional data. In this dissertation we propose and apply functional data analysis (FDA) techniques to analyze DSC data from the Lupus Family Registry and Repository (LFRR). The aim is to develop FDA methods to …


Survival Analysis: A Modified Kaplan-Meir Estimator, Justin A. Bancroft Dec 2017

MSU Graduate Theses

The popular Kaplan-Meier estimator has traditionally been used to great effect as a survival function estimator. However, the Kaplan-Meier estimator is dependent upon a maximum likelihood parameter estimator which may not be the best estimator in all cases. We modify the Kaplan-Meier estimator, based on a Bayes parameter estimation, in hopes of providing a more accurate survival estimator for small sample sizes. Core elements of survival analysis are presented, acting as a foundation from which to construct and compare our modified Kaplan-Meier estimator. It is hypothesized that our modified Kaplan-Meier estimator is generally more accurate than the standard Kaplan-Meier estimator …
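
For readers unfamiliar with the baseline being modified, the following is a minimal sketch of the standard (unmodified) Kaplan-Meier product-limit estimator in plain Python. It is not the modified estimator proposed in the thesis; the function name `kaplan_meier` and the toy data are hypothetical and chosen only for illustration.

```python
from collections import Counter


def kaplan_meier(times, events):
    """Standard product-limit estimate of the survival function.

    times  : observed follow-up time for each subject
    events : 1 if the event was observed at that time, 0 if censored
    Returns a list of (t, S(t)) pairs, one per distinct event time.
    """
    # Number of observed events at each distinct event time.
    deaths = Counter(t for t, e in zip(times, events) if e == 1)
    survival = 1.0
    curve = []
    for t in sorted(deaths):
        # Subjects still under observation just before time t.
        at_risk = sum(1 for u in times if u >= t)
        # Multiply by the conditional probability of surviving past t.
        survival *= 1.0 - deaths[t] / at_risk
        curve.append((t, survival))
    return curve


# Hypothetical toy data: five subjects, two of them censored.
times = [1, 2, 2, 3, 4]
events = [1, 1, 0, 1, 0]
km_curve = kaplan_meier(times, events)
```

Each factor `1 - d/n` is the maximum likelihood estimate of the conditional survival probability at an event time, which is the component the abstract describes replacing with a Bayes estimate.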


Making Models With Bayes, Pilar Olid Dec 2017

Electronic Theses, Projects, and Dissertations

Bayesian statistics is an important approach to modern statistical analyses. It allows us to use our prior knowledge of the unknown parameters to construct a model for our data set. The foundation of Bayesian analysis is Bayes' Rule, which in its proportional form indicates that the posterior is proportional to the prior times the likelihood. We will demonstrate how we can apply Bayesian statistical techniques to fit a linear regression model and a hierarchical linear regression model to a data set. We will show how to apply different distributions to Bayesian analyses and how the use of a prior affects …
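
The proportional form of Bayes' Rule mentioned above (posterior proportional to prior times likelihood) can be illustrated with a small grid approximation. This is a sketch under assumed conditions, not a method taken from the thesis: the binomial data (7 successes in 10 trials), the flat prior, and the helper name `grid_posterior` are all hypothetical.

```python
def grid_posterior(prior, likelihood):
    """Multiply prior by likelihood pointwise, then normalize to sum to 1."""
    unnormalized = [p * l for p, l in zip(prior, likelihood)]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]


# Candidate values for an unknown success probability theta.
grid = [i / 100 for i in range(101)]

# Assumed flat prior: every candidate value is equally plausible a priori.
prior = [1 / len(grid)] * len(grid)

# Assumed data: 7 successes in 10 trials (binomial likelihood up to a constant).
successes, trials = 7, 10
likelihood = [t**successes * (1 - t) ** (trials - successes) for t in grid]

posterior = grid_posterior(prior, likelihood)

# Grid value with the highest posterior probability.
posterior_mode = grid[max(range(len(grid)), key=posterior.__getitem__)]
```

Swapping in a non-flat `prior` list shifts the posterior toward the values the prior favors, which is one simple way to see how the choice of prior affects the result.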


Bayesian Inference On Quantile Regression-Based Mixed-Effects Joint Models For Longitudinal-Survival Data From AIDS Studies, Hanze Zhang Nov 2017

USF Tampa Graduate Theses and Dissertations

In HIV/AIDS studies, viral load (the number of copies of HIV-1 RNA) and CD4 cell counts are important biomarkers of the severity of viral infection, disease progression, and treatment evaluation. Recently, joint models, which can reduce bias and improve the efficiency of estimates, have been developed to assess the longitudinal process, survival process, and the relationship between them simultaneously. However, the majority of the joint models are based on mean regression, which concentrates only on the mean effect of the outcome variable conditional on certain covariates. In fact, in HIV/AIDS research, the mean effect may not always be of …


The Performance Of Multilevel Structural Equation Modeling (MSEM) In Comparison To Multilevel Modeling (MLM) In Multilevel Mediation Analysis With Non-Normal Data, Thanh Vinh Pham Nov 2017

USF Tampa Graduate Theses and Dissertations

Mediation analysis is used to test whether the effect of one variable on another is mediated by a third variable. It answers the question of how a predictor influences an outcome variable. Such information helps in understanding the mechanism underlying variation in the outcome. When mediation analysis is conducted on hierarchical data, the structure of the data needs to be taken into account. Krull and MacKinnon (1999) recommended using Multilevel Modeling (MLM) with nested data and showed that the MLM approach has more power and flexibility than the standard Ordinary Least Squares (OLS) …


Which Factors Influence Student Success In Intermediate Algebra, Math 101-102-103?, Linh T. Ward Nov 2017

Mathematics & Statistics ETDs

At The University of New Mexico (UNM), Intermediate Algebra (MATH 120 and MATH 101-102-103) has historically been a so-called “killer course”, with very low pass rates: approximately 40% in Fall 2009 to Spring 2011 and about 50% from Fall 2011 to Spring 2013. Furthermore, many students failed the class multiple times. Since 2013, a computer system called ALEKS has been used to teach the course and, along with some additional interventions, on the Albuquerque/Main campus success rates for MATH 101 have increased to roughly 80% and MATH 102 to about 70%. This thesis provides a strategy to identify those 20-30% at-risk …


Statistical Analysis And Modeling Of Stomach Cancer Data, Chao Gao Nov 2017

USF Tampa Graduate Theses and Dissertations

The objective of this study is to address some important questions associated with stomach cancer patients using the data from the Surveillance Epidemiology and End Results (SEER) program of the United States. To better understand the behavior of stomach cancer, we first perform parametric analysis for each patient group (white male, white female, African American male, African American female, other male and female) to identify the probability distribution function which can best characterize the behavior of the malignant stomach tumor sizes. We evaluate the effects of patients’ age, gender and race on the malignant stomach tumor sizes by developing quantile …


Improving Service Level Of Free-Floating Bike Sharing Systems, Aritra Pal Nov 2017

USF Tampa Graduate Theses and Dissertations

Bike sharing is a sustainable mode of urban mobility, not only for regular commuters but also for casual users and tourists. Free-floating bike sharing (FFBS) is an innovative bike sharing model, which saves on start-up cost, prevents bike theft, and offers significant opportunities for smart management by tracking bikes in real-time with built-in GPS. Efficient management of an FFBS requires: 1) analyzing its mobility patterns and spatio-temporal imbalance of supply and demand of bikes, 2) developing strategies to mitigate such imbalances, and 3) understanding the causes of a bike getting damaged and developing strategies to minimize them. All of these …


Statistical Modelling, Optimal Strategies And Decisions In Two-Period Economies, Jiang Wu Nov 2017

Electronic Thesis and Dissertation Repository

Motivated by some real problems, our thesis puts forward two general two-period pricing models and explores optimal buying and selling strategies in two states of the two-period decision, when the buyer's or seller's decisions in the two periods are uncertain: commodity valuations may or may not be independent, may or may not follow the same distribution, be heavily or just lightly influenced by exogenous economic conditions, and so on. For both the example of buying laptops and the example of selling houses, the connections between each example and the two-envelope paradox encourage us to explore optimal strategies based on the works of McDonnell …


An Enhanced Bridge Weigh-In-Motion Methodology And A Bayesian Framework For Predicting Extreme Traffic Load Effects Of Bridges, Yang Yu Nov 2017

LSU Doctoral Dissertations

In the past few decades, the rapid growth of traffic volume and weight, and the aging of transportation infrastructures have raised serious concerns over transportation safety. Under these circumstances, vehicle overweight enforcement and bridge condition assessment through structural health monitoring (SHM) have become critical to the protection of the safety of the public and transportation infrastructures. The main objectives of this dissertation are to: (1) develop an enhanced bridge weigh-in-motion (BWIM) methodology that can be integrated into the SHM system for overweight enforcement and monitoring traffic loading; (2) present a Bayesian framework to predict the extreme traffic load effects (LEs) …


Data-Adaptive Kernel Support Vector Machine, Xin Liu Nov 2017

Electronic Thesis and Dissertation Repository

In this thesis, we propose the data-adaptive kernel Support Vector Machine (SVM), a new method with a data-driven scaling kernel function based on real data sets. This two-stage approach of kernel function scaling can enhance the accuracy of a support vector machine, especially when the data are imbalanced. Following the standard SVM procedure in the first stage, the proposed method locally adapts the kernel function to data locations based on the skewness of the class outcomes. In the second stage, the decision rule is constructed with the data-adaptive kernel function and is used as the classifier. This process enlarges …


Deep Energy-Based Models For Structured Prediction, David Belanger Nov 2017

Doctoral Dissertations

We introduce structured prediction energy networks (SPENs), a flexible framework for structured prediction. A deep architecture is used to define an energy function over candidate outputs and predictions are produced by gradient-based energy minimization. This deep energy captures dependencies between labels that would lead to intractable graphical models, and allows us to automatically discover discriminative features of the structured output. Furthermore, practitioners can explore a wide variety of energy function architectures without having to hand-design prediction and learning methods for each model. This is because all of our prediction and learning methods interact with the energy …


Investigation Of Neutron Induced Ternary Fission With The NIFFTE Time Projection Chamber, Alex C. Kemnitz Nov 2017

Physics

Ternary fission is a rare occurrence in which three particles are produced from a single fission event. This analysis uses tracked fission event data recorded by NIFFTE’s time projection chamber with a series of refined cuts to isolate all possible ternary events. The experiment used two targets, each consisting of two isotopes; one target was Pu-239 and U-235, and the other was U-238 and U-235. The data were used to measure the ternary/binary fission ratios for each isotope. The ratios found for the Pu-239 and U-235 target are shown to be too high due to alpha contamination. The …


Juvenile River Herring In Freshwater Lakes: Sampling Approaches For Evaluating Growth And Survival, Matthew T. Devine Oct 2017

Masters Theses

River herring, collectively alewives (Alosa pseudoharengus) and blueback herring (A. aestivalis), have experienced substantial population declines over the past five decades due in large part to overfishing, combined with other sources of mortality, and disrupted access to critical freshwater spawning habitats. Anadromous river herring populations are currently assessed by counting adults in rivers during upstream spawning migrations, but no field-based assessment methods exist for estimating juvenile densities in freshwater nursery habitats. Counts of 4-year-old migrating adults are variable and prevent understanding about how mortality acts on different life stages prior to returning to spawn (e.g., juveniles …