Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 30 of 67

Full-Text Articles in Physical Sciences and Mathematics

Analyses Of Effect Indices Across Single-Case Research Designs In Counseling, Cian L. Brown Dec 2023

Analyses Of Effect Indices Across Single-Case Research Designs In Counseling, Cian L. Brown

Graduate Theses and Dissertations

Single case research design (SCRD) is a common methodology used across clinical disciplines to determine treatments effectiveness by comparing treatment conditions to baseline conditions in individual cases, usually among researchers working with smaller samples. Although popular within behavioral disciplines such as special education and behavioral analysis, studies have begun to emerge in counseling. However, guidance and current understanding of the use of SCRD in counseling is limited. A content analysis of counseling journals from 2003 to 2014 yielded only 7 studies using SCRD. In 2015, the flagship counseling journal, Journal of Counseling and Development, published a special issue on the …


Bayesian Learning Of Spatiotemporal Source Distribution For Beached Microplastic In The Gulf Of Mexico, David Pojunas Dec 2023

Bayesian Learning Of Spatiotemporal Source Distribution For Beached Microplastic In The Gulf Of Mexico, David Pojunas

Graduate Theses and Dissertations

Over the last several decades, plastic waste has gradually accumulated while slowly degrading in terrestrial and oceanic environments. Recently, there has been an increased effort to identify the possible sources of plastic to understand how they affect vulnerable beaches. This issue is of particular concern in the Gulf of Mexico due to the presence of oil, natural gas, and plastic production. In this thesis, we expand upon existing Bayesian plastic attribution models and develop a rigorous statistical framework to map observed beached microplastics to their sources. Within this framework, we combine Lagrangian backtracking simulations of floating particles using nurdle beaching …


Comparative Analysis Of Teacher Effects Parameters In Models Used For Assessing School Effectiveness: Value-Added Models & Persistence, Merlin J. Kamgue Dec 2023

Comparative Analysis Of Teacher Effects Parameters In Models Used For Assessing School Effectiveness: Value-Added Models & Persistence, Merlin J. Kamgue

Graduate Theses and Dissertations

Longitudinal measures for students have become increasingly popular to estimate the effects of individual teachers and schools. Value-added models are one of the approaches using longitudinal data to evaluate teachers and schools. In the value-added model (VAM) literature, many statistical approaches have been developed and used to estimate teacher or school effects on student learning. This study opted to use a Bayesian multivariate model for evaluating teacher effects. The generalized persistence models can handle longitudinal data, not vertically scaled, allowing for a below-par teacher’s effects correlation across test administrations. This study first generated longitudinal students’ test score data and used …


A Comparative Study Of Techniques For Non-Monotonic Dependence With Emphasis On Sensitivity To Sample Size, Noise Level And Computational Attributes, Fariha Tasnim Aug 2023

A Comparative Study Of Techniques For Non-Monotonic Dependence With Emphasis On Sensitivity To Sample Size, Noise Level And Computational Attributes, Fariha Tasnim

Graduate Theses and Dissertations

Evaluating association between variables is often of interest by many researchers. To serve this purpose, different association measures have been developed. However, type of relation between variables affects the degree of relationship. Hence, detection of the rela- tionship between variables is germane to measuring the correlation coefficient. With that mindset, here we explored six non-monotonic measure of association techniques and com- pared them with three classical approaches. Due to inconsistency in definition and range of different techniques, it is not feasible to compare the correlation estimates as their nature of variability differ. Therefore, we used permutation test based on Monte …


Comparing Predictive Performance Of Garch And Stochastic Volatility Models, Swapnaneel Nath Aug 2023

Comparing Predictive Performance Of Garch And Stochastic Volatility Models, Swapnaneel Nath

Graduate Theses and Dissertations

This paper compares the predictive performance of two commonly used financial models, the Generalized Auto-Regressive Conditional Heteroskedasticity (GARCH) model, and the Stochastic Volatility model. Both techniques are used in the finance literature to model returns on an asset; the main difference between the two is that the former holds volatility as deterministic, whereas the latter treats it as a stochastic component. Three 10-year periods (2006-15, 2008-17, and 2010-19) of returns of the S&P-500 Index are used to train the two models. The parameter estimation is done using Hamiltonian Monte Carlo. Then, using Sequential Monte Carlo updates, returns for 2016, 2018, …


Effects Of Land Use On Soil Microbial Communities In Tropical Montane Forests Of Malaysian Borneo, Yang Kai Tang May 2023

Effects Of Land Use On Soil Microbial Communities In Tropical Montane Forests Of Malaysian Borneo, Yang Kai Tang

Graduate Theses and Dissertations

Land use, such as logging and forest conversion to agriculture, can modify soil physicochemical and biological properties, and affect soil health. To understand how land use change can impact soil properties and canopy structure, we used a land use gradient in Malaysian Borneo consisting of six sites, including old growth forests, mixed forests, and agriculture fields. Specifically, we aimed to answer the following questions: (1) How do soil physicochemical properties vary across land use types? (2) Does bacterial diversity and composition vary across different land use types? (3) Does fungal diversity and composition vary across different land use types? We …


Mle And Eap Methods For Estimating Ability Scores For Data Of Varying Sample Size And Item Length, Sahar Taji Dec 2022

Mle And Eap Methods For Estimating Ability Scores For Data Of Varying Sample Size And Item Length, Sahar Taji

Graduate Theses and Dissertations

In this research, the performance of two popular estimators, Maximum Likelihood Estimator(MLE) and Bayesian Expected a Posteriori (EAP) is studied and compared in estimating the latent ability score in an Item Response Theory (IRT) model. The 2-Parameter Logistic (2PL) IRT model which is characterized by difficulty and discrimination item parameters is used to estimate the latent ability scores. Several datasets are generated for variety of sample size and item length values. The Monte-Carlo simulation is used to analyze the performance of the estimators. Results show that MLE produces reliable results with low root mean square error (RMSE) across all datasets. …


Ensemble Tree-Based Machine Learning For Imaging Data, Reza Iranzad Aug 2022

Ensemble Tree-Based Machine Learning For Imaging Data, Reza Iranzad

Graduate Theses and Dissertations

In particular medical imaging data, such as positron emission tomography (PET), computed tomography (CT), and fluorescence intravital microscopy (IVM), have become prevalent for use in a wide variety of applications, from diagnostic purposes, tracking diseases' progress, and monitoring the effectiveness of treatments to decision-making processes. The detailed information generated by medical imaging has enabled physicians to provide more comprehensive care. Although numerous machine learning algorithms, especially those used for imaging data, have been developed, dealing with unique structures in imaging data remained a big challenge. In this dissertation, we are proposing novel statistical tree-based methods with more efficient and more …


Improving Computation For Hierarchical Bayesian Spatial Gaussian Mixture Models With Application To The Analysis Of Thz Image Of Breast Tumor, Jean Remy Habimana Aug 2022

Improving Computation For Hierarchical Bayesian Spatial Gaussian Mixture Models With Application To The Analysis Of Thz Image Of Breast Tumor, Jean Remy Habimana

Graduate Theses and Dissertations

In the first chapter of this dissertation we give a brief introduction to Markov chain Monte Carlo methods (MCMC) and their application in Bayesian inference. In particular, we discuss the Metropolis-Hastings and conjugate Gibbs algorithms and explore the computational underpinnings of these methods. The second chapter discusses how to incorporate spatial autocorrelation in linear a regression model with an emphasis on the computational framework for estimating the spatial correlation patterns.

The third chapter starts with an overview of Gaussian mixture models (GMMs). However, because in the GMM framework the observations are assumed to be independent, GMMs are less effective when …


Hiding In Plain Sight: Accounting For Rate Heterogeneity In Trait Evolution Models, James Boyko Aug 2022

Hiding In Plain Sight: Accounting For Rate Heterogeneity In Trait Evolution Models, James Boyko

Graduate Theses and Dissertations

Within the last four decades, phylogenetic comparative methods have become the defacto method of analysis for comparative biologists. The availability of high-quality comparative datasets has been matched by an explosion of possible phylogenetic models. In large part, the efforts to increase the realism of phylogenetic comparative methods has been successful as evidenced by their widespread use. To this extensive literature, my contributions are modest. I have focused my dissertation work on two main themes. First, most phenotypic evolution is not independent of other phenotypes. Changes in a particular character may influence changes in another and modeling these characters in isolation …


Aberrant Responding With Underlying Dominance And Unfolding Response Processes: Examining Model Fit And Performance Of Person-Fit Statistics, Jennifer A. Reimers May 2022

Aberrant Responding With Underlying Dominance And Unfolding Response Processes: Examining Model Fit And Performance Of Person-Fit Statistics, Jennifer A. Reimers

Graduate Theses and Dissertations

Researchers have recognized that respondents may not answer items in a way that accurately reflects their attitude or trait level being measured. The resulting response data that deviates from what would be expected has been shown to have significant effects on the psychometric properties of a scale and analytical results. However, many studies that have investigated the detection of aberrant data and its effects have done so using dominance item response theory (IRT) models. It is unknown whether the impacts of aberrant data and the methodology used to identify aberrant responding when using dominance IRT models apply similarly when scales …


Posterior Predictive Model Checking Of The Hierarchical Rater Model, Nnamdi Chika Ezike May 2022

Posterior Predictive Model Checking Of The Hierarchical Rater Model, Nnamdi Chika Ezike

Graduate Theses and Dissertations

Fitting wrongly specified models to observed data may lead to invalid inferences about the model parameters of interest. The current study investigated the performance of the posterior predictive model checking (PPMC) approach in detecting model-data misfit of the hierarchical rater model (HRM). The HRM is a rater-mediated model that incorporates components of the polytomous item response theory (IRT) model, such as the partial credit model (PCM) and generalized partial credit model (GPCM), at the second level of the hierarchy, to model examinees’ responses to performance assessments. To date, the HRM has not been rigorously evaluated using PPMC techniques. Monte Carlo …


Multi-Trophic Biodiversity Increases With Increasing Structural Complexity Of Forest Canopy, Ayanna St. Rose May 2022

Multi-Trophic Biodiversity Increases With Increasing Structural Complexity Of Forest Canopy, Ayanna St. Rose

Graduate Theses and Dissertations

Understanding the effects of forest canopy structural complexity on multi-trophic diversity is critical for conserving biodiversity and managing land sustainably. But multi-trophic diversity is often ignored when making decisions about land management due to lack of cost- and time-effective methods to evaluate it. Here, we explored a new method based on widely available remote sensing data to quantify canopy structural complexity and its relationships with multi-trophic biodiversity at landscape scale using 32 forested sites of the National Ecological Observatory Network. We investigated the influence of vertical and horizontal structural complexity of forest canopy on multi-trophic (primary producers, herbivores (beetles), omnivores …


The Effects Of Metronomic And Maximum-Tolerated Dose Chemotherapy In Colorectal Cancer Angiogenesis: A Combined Approach Using Endoscopic Diffuse Reflectance Spectroscopy And Mrna Expression, Ariel Isaac Mundo Ortiz May 2022

The Effects Of Metronomic And Maximum-Tolerated Dose Chemotherapy In Colorectal Cancer Angiogenesis: A Combined Approach Using Endoscopic Diffuse Reflectance Spectroscopy And Mrna Expression, Ariel Isaac Mundo Ortiz

Graduate Theses and Dissertations

Colorectal cancer (CRC) continues to be one of the most incident and deadliest types of cancer worldwide. Chemotherapy has proven effective to reduce tumor burden for CRC patients, but there are several disadvantages associated with the use of mainstay maximtolerated dose (MTD) chemotherapeutic strategies. Metronomic chemotherapy (MET) has been developed as an alternative that addresses the shortcomings of maximum-tolerated dose chemotherapy but so far its effectiveness as a neoadjuvant strategy for CRC has not been explored.

This dissertation uses a combined optics and molecular biology approach (using diffuse reflectance spectroscopy and mRNA expression) to study the changes in angiogenesis and …


Statistical Modeling, Learning And Computing For Stochastic Dynamics Of Complex Systems, Mohammadmahdi Hajiha Dec 2021

Statistical Modeling, Learning And Computing For Stochastic Dynamics Of Complex Systems, Mohammadmahdi Hajiha

Graduate Theses and Dissertations

With the recent advances in sensor technology, it is much easier to collect and store streams of system operational and environmental (SOE) data. These data can be used as input to model the underlying behavior of complex engineered systems and phenomenons if appropriate algorithms with well-defined assumptions are developed. This dissertation is comprised of the research work to show the applicability of SOE data when fed into proposed tailored algorithms. The first purposes of these algorithms are to estimate and analyze the reliability of a system as elaborated in Chapter 2. This chapter provides the derivation of closed-form expressions that …


Data-Driven Statin Initiation Evaluation And Optimization For Prediabetes Population, Muhenned A. Abdulsahib Dec 2021

Data-Driven Statin Initiation Evaluation And Optimization For Prediabetes Population, Muhenned A. Abdulsahib

Graduate Theses and Dissertations

This dissertation develops quantitative models to support medical decision making of statininitiation considering the uncertainty in disease progression for prediabetes patients. A mathematical model is built to help medical decision-makers take action of statin initiation under uncertainty in future prediabetes progressions. The association between cholesterol drug use, such as statin, and elevating glucose level attracted considerable amounts of attention in the literature. Statin effects on glucose vary with respect to different levels of glucose. The first chapter of this dissertation introduces the problem and an overview of the tools that will be used to solve it. In the second chapter …


Knowledge Discovery From Complex Event Time Data With Covariates, Samira Karimi Jul 2021

Knowledge Discovery From Complex Event Time Data With Covariates, Samira Karimi

Graduate Theses and Dissertations

In particular engineering applications, such as reliability engineering, complex types of data are encountered which require novel methods of statistical analysis. Handling covariates properly while managing the missing values is a challenging task. These type of issues happen frequently in reliability data analysis. Specifically, accelerated life testing (ALT) data are usually conducted by exposing test units of a product to severer-than-normal conditions to expedite the failure process. The resulting lifetime and/or censoring data are often modeled by a probability distribution along with a life-stress relationship. However, if the probability distribution and life-stress relationship selected cannot adequately describe the underlying failure …


Statistical Modeling For High-Dimensional Compositional Data With Applications To The Human Microbiome, Thy Dao Jul 2021

Statistical Modeling For High-Dimensional Compositional Data With Applications To The Human Microbiome, Thy Dao

Graduate Theses and Dissertations

Compositional data refer to the data that lie on a simplex, which are common in many scientific domains such as genomics, geology, and economics. As the components in a composition must sum to one, traditional tests based on unconstrained data become inappropriate, and new statistical methods are needed to analyze this special type of data. This dissertation is motivated by some statistical problems arising in the analysis of compositional data. In particular, we focus on the high-dimensional and over-dispersed setting, where the dimensionality of compositions is greater than the sample size and the dispersion parameter is moderate or large. In …


Evaluating The Efficiency Of Markov Chain Monte Carlo Algorithms, Thuy Scanlon Jul 2021

Evaluating The Efficiency Of Markov Chain Monte Carlo Algorithms, Thuy Scanlon

Graduate Theses and Dissertations

Markov chain Monte Carlo (MCMC) is a simulation technique that produces a Markov chain designed to converge to a stationary distribution. In Bayesian statistics, MCMC is used to obtain samples from a posterior distribution for inference. To ensure the accuracy of estimates using MCMC samples, the convergence to the stationary distribution of an MCMC algorithm has to be checked. As computation time is a resource, optimizing the efficiency of an MCMC algorithm in terms of effective sample size (ESS) per time unit is an important goal for statisticians. In this paper, we use simulation studies to demonstrate how the Gibbs …


Privacy-Preserving Cloud-Assisted Data Analytics, Wei Bao Jul 2021

Privacy-Preserving Cloud-Assisted Data Analytics, Wei Bao

Graduate Theses and Dissertations

Nowadays industries are collecting a massive and exponentially growing amount of data that can be utilized to extract useful insights for improving various aspects of our life. Data analytics (e.g., via the use of machine learning) has been extensively applied to make important decisions in various real world applications. However, it is challenging for resource-limited clients to analyze their data in an efficient way when its scale is large. Additionally, the data resources are increasingly distributed among different owners. Nonetheless, users' data may contain private information that needs to be protected.

Cloud computing has become more and more popular in …


Comparative Evaluation Of Statistical Dependence Measures, Eman Abdel Rahman Ibrahim Dec 2020

Comparative Evaluation Of Statistical Dependence Measures, Eman Abdel Rahman Ibrahim

Graduate Theses and Dissertations

Measuring and testing dependence between random variables is of great importance in many scientific fields. In the case of linearly correlated variables, Pearson’s correlation coefficient is a commonly used measure of the correlation strength. In the case of nonlinear correlation, several innovative measures have been proposed, such as distance-based correlation, rank-based correlations, and information theory-based correlation. This thesis focuses on the statistical comparison of several important correlations, including Spearman’s correlation, mutual information, maximal information coefficient, biweight midcorrelation, distance correlation, and copula correlation, under various simulation settings such as correlative patterns and the level of random noise. Furthermore, we apply those …


Gene Set Testing By Distance Correlation, Sho-Hsien Su Dec 2020

Gene Set Testing By Distance Correlation, Sho-Hsien Su

Graduate Theses and Dissertations

Pathways are the functional building blocks of complex diseases such as cancers. Pathway-level studies may provide insights on some important biological processes. Gene set test is an important tool to study the differential expression of a gene set between two groups, e.g., cancer vs normal. The differential expression of a gene set could be due to the difference in mean, variability, or both. However, most existing gene set tests only target the mean difference but overlook other types of differential expression. In this thesis, we propose to use the recently developed distance correlation for gene set testing. To assess the …


Development Of An Effect Size To Classify The Magnitude Of Dif In Dichotomous And Polytomous Items, James D. Weese Dec 2020

Development Of An Effect Size To Classify The Magnitude Of Dif In Dichotomous And Polytomous Items, James D. Weese

Graduate Theses and Dissertations

A standardized effect size for the SIBTEST/POLYSIBTEST procedure is proposed, allowing for Differential Item Functioning (DIF) to be classified with a single set of DIF heuristics regardless of whether data are dichotomous or polytomous. This proposed standardized effect size accounts for both variability in responses and whether participants are included in the SIBTEST/POLYSIBTEST calculations. First, a new set of unstandardized effect size heuristics are established for dichotomous data that are more aligned with Educational Testing Service (ETS) standards using two and three parameter logistic (2PL and 3PL) models. Second, a standardized effect size is proposed and compared to other DIF …


Conditional Distance Correlation Test For Gene Expression Level, Dna Methylation Level And Copy Number, Shanshan Zhang Dec 2020

Conditional Distance Correlation Test For Gene Expression Level, Dna Methylation Level And Copy Number, Shanshan Zhang

Graduate Theses and Dissertations

Over the past years, efforts have been devoted to the genome-wide analysis of genetic and epigenetic profiles to better understand the underlying biological mechanisms of complex diseases such as cancer. It is of great importance to unravel the complex dependence structure between biological factors, and many conditional dependence tests have been developed to meet this need. The traditional partial correlation method can only capture the linear partial correlation, but not the nonlinear correlation. To overcome this limitation, we propose to use the innovative conditional distance correlation (CDC), which measures the conditional dependence between random vectors and detect nonlinear relations. In …


Quantifying The Simultaneous Effect Of Socio-Economic Predictors And Build Environment On Spatial Crime Trends, Alfieri Daniel Ek Dec 2020

Quantifying The Simultaneous Effect Of Socio-Economic Predictors And Build Environment On Spatial Crime Trends, Alfieri Daniel Ek

Graduate Theses and Dissertations

Proper allocation of law enforcement agencies falls under the umbrella of risk terrainmodeling (Caplan et al., 2011, 2015; Drawve, 2016) that primarily focuses on crime prediction and prevention by spatially aggregating response and predictor variables of interest. Although mental health incidents demand resource allocation from law enforcement agencies and the city, relatively less emphasis has been placed on building spatial models for mental health incidents events. Analyzing spatial mental health events in Little Rock, AR over 2015 to 2018, we found evidence of spatial heterogeneity via Moran’s I statistic. A spatial modeling framework is then built using generalized linear models, …


Models For Data Analysis In Accelerated Reliability Growth, Cesar Alexander Ruiz Torres Jul 2020

Models For Data Analysis In Accelerated Reliability Growth, Cesar Alexander Ruiz Torres

Graduate Theses and Dissertations

This work develops new methodologies for analyzing accelerated testing data in the context of a reliability growth program for a complex multi-component system. Each component has multiple failure modes and the growth program consists of multiple test-fix stages with corrective actions applied at the end of each stage. The first group of methods considers time-to-failure data and test covariates for predicting the final reliability of the system. The time-to-failure of each failure mode is assumed to follow a Weibull distribution with rate parameter proportional to an acceleration factor. Acceleration factors are specific to each failure mode and test covariates. We …


Measuring Sexual Excitation And Sexual Inhibition In A Dutch-Speaking Sample, Malachi Willis Jul 2020

Measuring Sexual Excitation And Sexual Inhibition In A Dutch-Speaking Sample, Malachi Willis

Graduate Theses and Dissertations

Background: Individual differences in sexual excitation and sexual inhibition are important predictors of sexual functioning. Psychometric instruments for these aspects of sexual response were originally developed separately for men (Sexual Inhibition /Sexual Excitation Scales [SIS/SES]) and women (Sexual Excitation/Sexual Inhibition Inventory for Women [SESII-W]). These measures were then adapted to function similarly in samples comprising both men and women (Sexual Inhibition/Sexual Excitation Scales-Short Form [SIS/SES-SF] and Sexual Excitation/Sexual Inhibition Inventory for Women and Men [SESII-W/M], respectively). No published study to our knowledge has administered the SIS/SES and SESII-W/M questionnaires to a sample of both women and men. In the present …


Learning Networks With Categorical Data Using Distance Correlation, And A Novel Graph-Based Multivariate Test, Jian Tinker Jul 2020

Learning Networks With Categorical Data Using Distance Correlation, And A Novel Graph-Based Multivariate Test, Jian Tinker

Graduate Theses and Dissertations

We study the use of distance correlation for statistical inference on categorical data, especially the induction of probability networks. Szekely et al. first defined distance correlation for continuous variables in [42], and Zhang translated the concept into the categorical setting in [57] by defining dCor(X,Y) for categorical variables X = (x1,...,xI) and Y = (y1,...,yJ) where P(X=xi)=[pi]i and P(Y=yi)=[pi]j with the formula [Please open the document]

Part I of the dissertation covers the background we need to understand this formula, and prepares us to analyze the properties and performance of its applications.

Part II then presents the main results of …


Structural Analysis Of The Multifunctional Spoiie Regulatory Protein Of Clostridioides Difficile., Blythe Emily Bunkers Jul 2020

Structural Analysis Of The Multifunctional Spoiie Regulatory Protein Of Clostridioides Difficile., Blythe Emily Bunkers

Graduate Theses and Dissertations

Clostridioides (formally Clostridium) difficile is a medically relevant pathogen pertinent to infectious disease research. C. difficile is distinctly known for its ability to produce two toxins, enterotoxin A and cytotoxin B, and the propensity to colonize the mammalian gastrointestinal tract. It is known that metabolism is tightly correlated with sporulation in endospore producers such as C. difficile, but an interesting and novel regulatory relationship found by the Ivey lab has yet to be understood. The relationship explored in this study is observed between the sporulation factor, SpoIIE, which represses expression of an ABC peptide transporter, app. In this study, two …


Effect Of Predictor Dependence On Variable Selection For Linear And Log-Linear Regression, Apu Chandra Das Jul 2020

Effect Of Predictor Dependence On Variable Selection For Linear And Log-Linear Regression, Apu Chandra Das

Graduate Theses and Dissertations

We propose a Bayesian approach to the Dirichlet-Multinomial (DM) regression model, which uses horseshoe, Laplace, and horseshoe plus priors for shrinkage and selection. The Dirichlet-Multinomial model can be used to find the significant association between a set of available covariates and taxa for a microbiome sample. We incorporate the covariates in a log-linear regression framework. We design a simulation study to make a comparison among the performance of the three shrinkage priors in terms of estimation accuracy and the ability to detect true signals. Our results have clearly separated the performance of the three priors and indicated that the horseshoe …