Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability

Theses/Dissertations

2020

Institution
Keyword
Publication

Articles 241 - 258 of 258

Full-Text Articles in Physical Sciences and Mathematics

Public Perception Of Different Planting Techniques Using Augmented Reality, Sultana Quader Tania Jan 2020

Public Perception Of Different Planting Techniques Using Augmented Reality, Sultana Quader Tania

Electronic Theses and Dissertations

The objective of this study was to measure public perception of the different planting techniques (block and matrix), which are used at visitor information centers (VICs) and other rights of way (ROW) areas. The main factors that affect public perception of planting techniques were identified through an extensive literature review and qualitative survey from four welcome centers in the state of Georgia. The ranking of those indicators, based on public preferences, was discovered through a quantitative survey. During the first phase of the quantitative survey, images of block and matrix were used. An iOS-based user-friendly and cost-effective augmented reality (AR) …


Generalization Of Kullback-Leibler Divergence For Multi-Stage Diseases: Application To Diagnostic Test Accuracy And Optimal Cut-Points Selection Criterion, Chen Mo Jan 2020

Generalization Of Kullback-Leibler Divergence For Multi-Stage Diseases: Application To Diagnostic Test Accuracy And Optimal Cut-Points Selection Criterion, Chen Mo

Electronic Theses and Dissertations

The Kullback-Leibler divergence (KL), which captures the disparity between two distributions, has been considered as a measure for determining the diagnostic performance of an ordinal diagnostic test. This study applies KL and further generalizes it to comprehensively measure the diagnostic accuracy test for multi-stage (K > 2) diseases, named generalized total Kullback-Leibler divergence (GTKL). Also, GTKL is proposed as an optimal cut-points selection criterion for discriminating subjects among different disease stages. Moreover, the study investigates a variety of applications of GTKL on measuring the rule-in/out potentials in the single-stage and multi-stage levels. Intensive simulation studies are conducted to compare the performance …


Multiple Imputation Using Influential Exponential Tilting In Case Of Non-Ignorable Missing Data, Kavita Gohil Jan 2020

Multiple Imputation Using Influential Exponential Tilting In Case Of Non-Ignorable Missing Data, Kavita Gohil

Electronic Theses and Dissertations

Modern research strategies rely predominantly on three steps, data collection, data analysis, and inference. In research, if the data is not collected as designed, researchers may face challenges of having incomplete data, especially when it is non-ignorable. These situations affect the subsequent steps of evaluation and make them difficult to perform. Inference with incomplete data is a challenging task in data analysis and clinical trials when missing data related to the condition under the study. Moreover, results obtained from incomplete data are prone to biases. Parameter estimation with non-ignorable missing data is even more challenging to handle and extract useful …


Applications Of Dynamic Linear Models To Random Allocation Models, Albert H. Lee Iii Jan 2020

Applications Of Dynamic Linear Models To Random Allocation Models, Albert H. Lee Iii

Theses and Dissertations

Although advances in modern computational algorithms have provided researchers the ability to work problems which were once too computationally complex to solve, problems with high computation or large parameter spaces still remain. Problems such as those involving Time Series can be such problems. Chapter 1 looks at the the use of Exponentially Weighted Moving Averages developed by \citep{holt2004forecasting, winters1960forecasting} which were thought to provide sufficient solutions to these Time Series. A discussion is provided which illustrates the shortcomings of the EWMA and how its infinite number of possible starting values provides the modeler with an endless number of possible solutions …


The Application Of Machine Learning Models In The Concussion Diagnosis Process, Sujit Subhash Jan 2020

The Application Of Machine Learning Models In The Concussion Diagnosis Process, Sujit Subhash

Masters Theses

“Concussions represent a growing health concern and are challenging to diagnose and manage. Roughly four million concussions are diagnosed every year in the United States. Although research into the application of advanced metrics such as neuroimages and blood biomarkers has shown promise, they are yet to be implemented at a clinical level due to cost and reliability concerns. Therefore, concussion diagnosis is still reliant on clinical evaluations of symptoms, balance, and neurocognitive status and function. The lack of a universal threshold on these assessments makes the diagnosis process entirely reliant on a physician’s interpretation of these assessment scores. This study …


Nonparametric Analysis Of Clustered And Multivariate Data, Yue Cui Jan 2020

Nonparametric Analysis Of Clustered And Multivariate Data, Yue Cui

Theses and Dissertations--Statistics

In this dissertation, we investigate three distinct but interrelated problems for nonparametric analysis of clustered data and multivariate data in pre-post factorial design.

In the first project, we propose a nonparametric approach for one-sample clustered data in pre-post intervention design. In particular, we consider the situation where for some clusters all members are only observed at either pre or post intervention but not both. This type of clustered data is referred to us as partially complete clustered data. Unlike most of its parametric counterparts, we do not assume specific models for data distributions, intra-cluster dependence structure or variability, in effect …


Cancer Phylogenetic Analysis Based On Rna-Seq Data, Tingting Zhai Jan 2020

Cancer Phylogenetic Analysis Based On Rna-Seq Data, Tingting Zhai

Theses and Dissertations--Statistics

Studying tumor evolution is a major task to understand the biological mechanism of carcinogenesis, develop new cancer therapies, and prevent drug resistance. We focus on two important questions in tumor evolution. The first question is to quantify intra-tumor heterogeneity, where multiple subclones of tumor cells with distinct transcriptomic profiles. Another question is to estimate the temporal order of alteration of key cancer pathways during tumor evolution. We present a new statistical method to 1) reconstruct the evolutionary history and population frequency of the subclonal lineages of tumor cells and 2) infer temporal order of pathway alterations in tumor evolution for …


Semiparametric And Nonparametric Methods For Comparing Biomarker Levels Between Groups, Yuntong Li Jan 2020

Semiparametric And Nonparametric Methods For Comparing Biomarker Levels Between Groups, Yuntong Li

Theses and Dissertations--Statistics

Comparing the distribution of biomarker measurements between two groups under either an unpaired or paired design is a common goal in many biomarker studies. However, analyzing biomarker data is sometimes challenging because the data may not be normally distributed and contain a large fraction of zero values or missing values. Although several statistical methods have been proposed, they either require data normality assumption, or are inefficient. We proposed a novel two-part semiparametric method for data under an unpaired setting and a nonparametric method for data under a paired setting. The semiparametric method considers a two-part model, a logistic regression for …


Estimation Of The Treatment Effect With Bayesian Adjustment For Covariates, Li Xu Jan 2020

Estimation Of The Treatment Effect With Bayesian Adjustment For Covariates, Li Xu

Theses and Dissertations--Statistics

The Bayesian adjustment for confounding (BAC) is a Bayesian model averaging method to select and adjust for confounding factors when evaluating the average causal effect of an exposure on a certain outcome. We extend the BAC method to time-to-event outcomes. Specifically, the posterior distribution of the exposure effect on a time-to-event outcome is calculated as a weighted average of posterior distributions from a number of candidate proportional hazards models, weighing each model by its ability to adjust for confounding factors. The Bayesian Information Criterion based on the partial likelihood is used to compare different models and approximate the Bayes factor. …


Measuring Change: Prediction Of Early Onset Sepsis, Aric Schadler Jan 2020

Measuring Change: Prediction Of Early Onset Sepsis, Aric Schadler

Theses and Dissertations--Statistics

Sepsis occurs in a patient when an infection enters into the blood stream and spreads throughout the body causing a cascading response from the immune system. Sepsis is one of the leading causes of morbidity and mortality in today’s hospitals. This is despite published and accepted guidelines for timely and appropriate interventions for septic patients. The largest barrier to applying these interventions is the early identification of septic patients. Early identification and treatment leads to better outcomes, shorter lengths of stay, and financial savings for healthcare institutions. In order to increase the lead time in recognizing patients trending towards septicemia …


Utilizing Design Structure For Improving Design Selection And Analysis, Ahlam Ali Alzharani Jan 2020

Utilizing Design Structure For Improving Design Selection And Analysis, Ahlam Ali Alzharani

Theses and Dissertations

Recent work has shown that the structure for design plays a role in the simplicity or complexity of data analysis. To increase the knowledge of research in these areas, this dissertation aims to utilize design structure for improving design selection and analysis. In this regard, minimal dependent sets and block diagonal structure are both important concepts that are relevant to the orthogonality of the columns of a design. We are interested in finding ways to improve the data analysis especially for active effect detection by utilizing minimal dependent sets and block diagonal structure for design.

We introduce a new classification …


Fuzzy Logistic Regression For Detecting Differential Dna Methylation Regions, Tarek M. Bubaker Bennaser Jan 2020

Fuzzy Logistic Regression For Detecting Differential Dna Methylation Regions, Tarek M. Bubaker Bennaser

Doctoral Dissertations

“Epigenetics is the study of changes in gene activity or function that are not related to a change in the DNA sequence. DNA methylation is one of the main types of epigenetic modifications, that occur when a methyl chemical group attaches to a cytosine on the DNA sequence. Although the sequence does not change, the addition of a methyl group can change the way genes are expressed and produce different phenotypes. DNA methylation is involved in many biological processes and has important implications in the fields of biomedicine and agriculture.

Statistical methods have been developed to compare DNA methylation at …


A Multinational Study Of The Etiology And Clinical Teleology Of Moral Evaluations Of Patient Behaviors, Anna Yu Lee Jan 2020

A Multinational Study Of The Etiology And Clinical Teleology Of Moral Evaluations Of Patient Behaviors, Anna Yu Lee

CGU Theses & Dissertations

This dissertation is a collection of four studies which collectively explore a hypothesized construct of ‘moral evaluation of patient behaviors’ (MEPB) as a driver of health professionals’ readiness to interact humanistically with their patients. In these studies, ‘humanistic interactions’ refer to the non-technical, intangible skills and factors of clinical competence; the factors specifically explored in these studies were compassion toward patients, self-efficacy for treating patients, and optimism toward patient treatment. For the purpose of specificity, all factors were examined as they pertained to patients with substance use disorders. Survey data from a convenience sample of 524 health professionals (i.e. physicians, …


How Machine Learning And Probability Concepts Can Improve Nba Player Evaluation, Harrison Miller Jan 2020

How Machine Learning And Probability Concepts Can Improve Nba Player Evaluation, Harrison Miller

CMC Senior Theses

In this paper I will be breaking down a scholarly article, written by Sameer K. Deshpande and Shane T. Jensen, that proposed a new method to evaluate NBA players. The NBA is the highest level professional basketball league in America and stands for the National Basketball Association. They proposed to build a model that would result in how NBA players impact their teams chances of winning a game, using machine learning and probability concepts. I preface that by diving into these concepts and their mathematical backgrounds. These concepts include building a linear model using ordinary least squares method, the bias …


Phenotype Extraction: Estimation And Biometrical Genetic Analysis Of Individual Dynamics, Kevin L. Mckee Jan 2020

Phenotype Extraction: Estimation And Biometrical Genetic Analysis Of Individual Dynamics, Kevin L. Mckee

Theses and Dissertations

Within-person data can exhibit a virtually limitless variety of statistical patterns, but it can be difficult to distinguish meaningful features from statistical artifacts. Studies of complex traits have previously used genetic signals like twin-based heritability to distinguish between the two. This dissertation is a collection of studies applying state-space modeling to conceptualize and estimate novel phenotypic constructs for use in psychiatric research and further biometrical genetic analysis. The aims are to: (1) relate control theoretic concepts to health-related phenotypes; (2) design statistical models that formally define those phenotypes; (3) estimate individual phenotypic values from time series data; (4) consider hierarchical …


Nonparametric Misclassification Simulation And Extrapolation Method And Its Application, Congjian Liu Jan 2020

Nonparametric Misclassification Simulation And Extrapolation Method And Its Application, Congjian Liu

Electronic Theses and Dissertations

The misclassification simulation extrapolation (MC-SIMEX) method proposed by Küchenho et al. is a general method of handling categorical data with measurement error. It consists of two steps, the simulation and extrapolation steps. In the simulation step, it simulates observations with varying degrees of measurement error. Then parameter estimators for varying degrees of measurement error are obtained based on these observations. In the extrapolation step, it uses a parametric extrapolation function to obtain the parameter estimators for data with no measurement error. However, as shown in many studies, the parameter estimators are still biased as a result of the parametric extrapolation …


Aggregate Loss Model With Poisson-Tweedie Loss Frequency, Si Chen Jan 2020

Aggregate Loss Model With Poisson-Tweedie Loss Frequency, Si Chen

Theses and Dissertations (Comprehensive)

The aggregate loss model has applications in various areas such as financial risk management and actuarial science. The aggregate loss is the summation of all random losses occurred in a period, and it is governed by both the loss severity and the loss frequency. While the impact of the loss severity on aggregate loss is well studied, less focus is paid on the influence of loss frequency on aggregate loss, which motivates our study. In this thesis, we enrich the aggregate loss framework by introducing the Poisson-Tweedie distribution as a candidate for modelling loss frequency, prove the closedness of Poisson-Tweedie …


Elucidating The Properties And Mechanism For Cellulose Dissolution In Tetrabutylphosphonium-Based Ionic Liquids Using High Concentrations Of Water, Brad Crawford Jan 2020

Elucidating The Properties And Mechanism For Cellulose Dissolution In Tetrabutylphosphonium-Based Ionic Liquids Using High Concentrations Of Water, Brad Crawford

Graduate Theses, Dissertations, and Problem Reports

The structural, transport, and thermodynamic properties related to cellulose dissolution by tetrabutylphosphonium chloride (TBPCl) and tetrabutylphosphonium hydroxide (TBPH)-water mixtures have been calculated via molecular dynamics simulations. For both ionic liquid (IL)-water solutions, water veins begin to form between the TBPs interlocking arms at 80 mol % water, opening a pathway for the diffusion of the anions, cations, and water. The water veins allow for a diffusion regime shift in the concentration region from 80 to 92.5 mol % water, providing a higher probability of solvent interaction with the dissolving cellulose strand. The hydrogen bonding was compared between small and large …