Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 14 of 14

Full-Text Articles in Statistics and Probability

Using Deep Neural Networks To Analyze Precision Agriculture Data, Stephanie Liebl Jan 2022

Using Deep Neural Networks To Analyze Precision Agriculture Data, Stephanie Liebl

Electronic Theses and Dissertations

As the population of the Earth increases, there is a growing need for food to feed the inhabitants. Precision agriculture offers techniques and tools that can be used to help accommodate the growing population. One specific precision agriculture tool is remote sensing data, which can be used to image fields as an effort to better predict or understand the crops. In this thesis, deep neural networks are used to evaluate various spatial, spectral, and temporal resolutions of three different satellite images to determine which best predicts corn yield. The main metrics we used to evaluate the models were R-squared (R2), …


Evaluation Of The Effect Of The Clinical-Decision-Support Systems On Diabetes Management: A Multivariate Meta-Analysis Comparison With Univariate Meta-Analysis, Abdelfattah Elbarsha Jan 2021

Evaluation Of The Effect Of The Clinical-Decision-Support Systems On Diabetes Management: A Multivariate Meta-Analysis Comparison With Univariate Meta-Analysis, Abdelfattah Elbarsha

Electronic Theses and Dissertations

The advantage of using meta-analysis lies in its ability in providing a quantitative summary of the findings from multiple studies. The aim of this dissertation was first to conduct a simulation study in order to understand what factors (sample size, between-study correlation, and percent of missing data) have a significant effect on meta-analysis estimates and whether using univariate or multivariate meta-analysis would produce different estimates.

The second goal of this study was to evaluate the effect of clinical decision support systems CDSS on diabetes care management by conducting three separate univariate meta-analyses and one multivariate meta-analysis. CDSS are health information …


The Combined Impact Of Continuous And Ordinal Auxiliary Variables On Missing Data Imputation In Sem, Salina Wu Whitaker Jan 2021

The Combined Impact Of Continuous And Ordinal Auxiliary Variables On Missing Data Imputation In Sem, Salina Wu Whitaker

Electronic Theses and Dissertations

“Modern” methods of addressing missing data using full-information maximum-likelihood (FIML) have become mainstays in SEM analyses. FIML allows the inclusion of auxiliary variables which carry information that is related to missing values and can reduce bias in parameter estimates. Past research has illustrated the benefits of auxiliary variable inclusion under different missingness conditions (MCAR and MNAR; e.g., Enders, 2008), missingness proportions (e.g., Collins et al., 2001), and although limited, missingness patterns (e.g., Yoo, 2009) in FIML analyses. While past studies have focused on the effects of either continuous or ordinal auxiliary variables, no study has included both types in their …


Is Corequisite Developmental Math Effective At East Tennessee State University?, Christine Padden Aug 2019

Is Corequisite Developmental Math Effective At East Tennessee State University?, Christine Padden

Electronic Theses and Dissertations

This thesis looks at the corequisite developmental math program at East Tennessee State University (ETSU) and compares the effectiveness to the previous developmental math program by comparing the student outcomes in MATH 1530. MATH 1530 is a non-calculus based statistic and probability course that satisfies most majors’ general education math requirements. ETSU sees approximately 1,000 students a year pass through MATH 1530 which is around 6.7% of the total enrollment at ETSU[9]. We are interested in the last five years of the developmental math program before it was changed to corequisite developmental math and the first five years of corequisite …


Cramer Type Moderate Deviations For Random Fields And Mutual Information Estimation For Mixed-Pair Random Variables, Aleksandr Beknazaryan Jan 2019

Cramer Type Moderate Deviations For Random Fields And Mutual Information Estimation For Mixed-Pair Random Variables, Aleksandr Beknazaryan

Electronic Theses and Dissertations

In this dissertation we first study Cramer type moderate deviation for partial sums of random fields by applying the conjugate method. In 1938 Cramer published his results on large deviations of sums of i.i.d. random variables after which a lot of research has been done on establishing Cramer type moderate and large deviation theorems for different types of random variables and for various statistics. In particular results have been obtained for independent non-identically distributed random variables for the sum of independent random to estimate the mutual information between two random variables. The estimates enjoy a central limit theorem under some …


Bayesian Analytical Approaches For Metabolomics : A Novel Method For Molecular Structure-Informed Metabolite Interaction Modeling, A Novel Diagnostic Model For Differentiating Myocardial Infarction Type, And Approaches For Compound Identification Given Mass Spectrometry Data., Patrick J. Trainor Aug 2018

Bayesian Analytical Approaches For Metabolomics : A Novel Method For Molecular Structure-Informed Metabolite Interaction Modeling, A Novel Diagnostic Model For Differentiating Myocardial Infarction Type, And Approaches For Compound Identification Given Mass Spectrometry Data., Patrick J. Trainor

Electronic Theses and Dissertations

Metabolomics, the study of small molecules in biological systems, has enjoyed great success in enabling researchers to examine disease-associated metabolic dysregulation and has been utilized for the discovery biomarkers of disease and phenotypic states. In spite of recent technological advances in the analytical platforms utilized in metabolomics and the proliferation of tools for the analysis of metabolomics data, significant challenges in metabolomics data analyses remain. In this dissertation, we present three of these challenges and Bayesian methodological solutions for each. In the first part we develop a new methodology to serve a basis for making higher order inferences in metabolomics, …


Quasi-Random Action Selection In Markov Decision Processes, Samuel D. Walker Jan 2017

Quasi-Random Action Selection In Markov Decision Processes, Samuel D. Walker

Electronic Theses and Dissertations

In Markov decision processes an operator exploits known data regarding the environment it inhabits. The information exploited is learned from random exploration of the state-action space. This paper proposes to optimize exploration through the implementation of quasi-random sequences in both discrete and continuous state-action spaces. For the discrete case a permutation is applied to the indices of the action space to avoid repetitive behavior. In the continuous case sequences of low discrepancy, such as Halton sequences, are utilized to disperse the actions more uniformly.


A Multi-Indexed Logistic Model For Time Series, Xiang Liu Dec 2016

A Multi-Indexed Logistic Model For Time Series, Xiang Liu

Electronic Theses and Dissertations

In this thesis, we explore a multi-indexed logistic regression (MILR) model, with particular emphasis given to its application to time series. MILR includes simple logistic regression (SLR) as a special case, and the hope is that it will in some instances also produce significantly better results. To motivate the development of MILR, we consider its application to the analysis of both simulated sine wave data and stock data. We looked at well-studied SLR and its application in the analysis of time series data. Using a more sophisticated representation of sequential data, we then detail the implementation of MILR. We compare …


Mahalanobis Kernel-Based Support Vector Data Description For Detection Of Large Shifts In Mean Vector, Vu Nguyen Jan 2015

Mahalanobis Kernel-Based Support Vector Data Description For Detection Of Large Shifts In Mean Vector, Vu Nguyen

Electronic Theses and Dissertations

Statistical process control (SPC) applies the science of statistics to various process control in order to provide higher-quality products and better services. The K chart is one among the many important tools that SPC offers. Creation of the K chart is based on Support Vector Data Description (SVDD), a popular data classifier method inspired by Support Vector Machine (SVM). As any methods associated with SVM, SVDD benefits from a wide variety of choices of kernel, which determines the effectiveness of the whole model. Among the most popular choices is the Euclidean distance-based Gaussian kernel, which enables SVDD to obtain a …


Revising Common Core Georgia Performance Standards Statistics Lesson Plans To Better Align With Statistical Practice, Rachel Bonilla Jan 2013

Revising Common Core Georgia Performance Standards Statistics Lesson Plans To Better Align With Statistical Practice, Rachel Bonilla

Electronic Theses and Dissertations

In this thesis, lesson plans provided by the Georgia Department of Education are revised to give students better exposure and practice working with real-life data. Three learning tasks and a performance task are presented covering a unit lesson on statistical regression. The development of Georgia statistics curriculum standards are reviewed and presented.


A Comparative Study Between The Standards Of Learning And In-Class Grades., Randetta Lynn Fuller Aug 2010

A Comparative Study Between The Standards Of Learning And In-Class Grades., Randetta Lynn Fuller

Electronic Theses and Dissertations

We examined the Standards of Learning mathematics scores and in-class grades for a rural Virginia county public school system. We looked at third, fourth, fifth, sixth, and seventh grades as well as Algebra I, Algebra II, and Geometry classes. The purpose of this was to determine whether or not there is a strong correlation between the Standards of Learning and the students' in-class grades. Had a strong enough correlation between the Standards of Learning and in-class grades been found we would have used only the in-class grades to predict the Standard of Learning test scores. However, we found that the …


Comparison Of Career Statistics And Season Statistics In Major League Baseball, Mark Joseph Ammons Jan 2008

Comparison Of Career Statistics And Season Statistics In Major League Baseball, Mark Joseph Ammons

Electronic Theses and Dissertations

This is a comparison of statistics for some of the best seasons and careers of players from Major League Baseball; using data collected on batting average, at bat to homerun ratio, and earned run average. Two teams were created, composed of season leaders and career leaders, chosen for their outstanding offensive and pitching abilities, and were pitted against one another to determine superiority. These two teams also compared against a team from each era of major league baseball. The season and career leaders challenged, the 1918 Boston Red Sox, 1927 New York Yankees, 1955 Brooklyn Dodgers, 1961 New York Yankees, …


Probability And Statistics For Third Through Fifth Grade Classrooms., Melissa Taylor Mckinnon Dec 2007

Probability And Statistics For Third Through Fifth Grade Classrooms., Melissa Taylor Mckinnon

Electronic Theses and Dissertations

This document contains a variety of lesson plans that can be readily used by a teacher of intermediate students. This thesis contains two units in Probability and one unit in Statistics. Any educator can supplement this document with any curriculum to teach lessons from vocabulary to concept.


Modeling And Characterizations Of New Notions In Life Testing With Statistical Applications, Mohammad Sepehrifar Jan 2006

Modeling And Characterizations Of New Notions In Life Testing With Statistical Applications, Mohammad Sepehrifar

Electronic Theses and Dissertations

Knowing the class to which a life distribution belongs gives us an idea about the aging of the device or system the life distribution represents, and enables us to compare the aging properties of different systems. This research intends to establish several new nonparametric classes of life distributions defined by the concept of inactivity time of a unit with a guaranteed minimum life length. These classes play an important role in the study of reliability theory, survival analysis, maintenance policies, economics, actuarial sciences and many other applied areas.