Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 21 of 21

Full-Text Articles in Physical Sciences and Mathematics

Statistical Approaches Of Gene Set Analysis With Quantitative Trait Loci For High-Throughput Genomic Studies., Samarendra Das Dec 2020

Statistical Approaches Of Gene Set Analysis With Quantitative Trait Loci For High-Throughput Genomic Studies., Samarendra Das

Electronic Theses and Dissertations

Recently, gene set analysis has become the first choice for gaining insights into the underlying complex biology of diseases through high-throughput genomic studies, such as Microarrays, bulk RNA-Sequencing, single cell RNA-Sequencing, etc. It also reduces the complexity of statistical analysis and enhances the explanatory power of the obtained results. Further, the statistical structure and steps common to these approaches have not yet been comprehensively discussed, which limits their utility. Hence, a comprehensive overview of the available gene set analysis approaches used for different high-throughput genomic studies is provided. The analysis of gene sets is usually carried out based on …


Modified-Half-Normal Distribution And Different Methods To Estimate Average Treatment Effect., Jingchao Sun Dec 2020

Modified-Half-Normal Distribution And Different Methods To Estimate Average Treatment Effect., Jingchao Sun

Electronic Theses and Dissertations

This dissertation consists of three projects related to Modified-Half-Normal distribution and causal inference. In my first project, a new distribution called Modified-Half-Normal distribution was introduced. I explored a few of its distributional properties, the procedures for generating random samples based on Bayesian approaches, and the parameter estimation based on the method of moments. The second project deals with the problem of selection bias of average treatment effect (ATE) if we use the observational data. I combined the propensity score based inverse probability of treatment weighting (IPTW) method and the directed acyclic graph (DAG) to solve this problem. The third project …


Aspects Of Causal Inference., John A. Craycroft Dec 2020

Aspects Of Causal Inference., John A. Craycroft

Electronic Theses and Dissertations

Observational studies differ from experimental studies in that assignment of subjects to treatments is not randomized but rather occurs due to natural mechanisms, which are usually hidden from researchers. Yet objectives of the two studies are frequently the same: identify the causal – rather than merely associational – relationship between some treatment or exposure and an outcome. The statistical issues that arise in properly analyzing observational data for this goal are numerous and fascinating, and these issues are encompassed in the domain of causal inference. The research presented in this dissertation explores several distinct aspects of causal inference. This dissertation …


The Influence Of Environmental Variables On The Height Growth Of Loblolly Pine (Pinus Taeda) In The Western Gulf, Osakpamwan Edo-Iyasere Aug 2020

The Influence Of Environmental Variables On The Height Growth Of Loblolly Pine (Pinus Taeda) In The Western Gulf, Osakpamwan Edo-Iyasere

Electronic Theses and Dissertations

Understanding the effects of environmental factors on stand growth is important in optimizing forest management plans. This study investigated the effects of soil and climate factors on the height growth (site index) of loblolly pine (Pinus Taeda L.) using data collected from permanent plots established in intensively-managed plantations across East Texas and Western Louisiana. The Chapman-Richards model was selected as the base model to describe the height-age relationships and important soil and climate variables were incorporated into the models as model parameter coefficient adjustors. Our results showed that the most important factors for predicting site index were nitrogen …


Marginal Methods And Software For Clustered Data With Cluster- And Group-Size Informativeness., Mary Elizabeth Gregg Aug 2020

Marginal Methods And Software For Clustered Data With Cluster- And Group-Size Informativeness., Mary Elizabeth Gregg

Electronic Theses and Dissertations

Clustered data result when observations have some natural organizational association. In such data, cluster size is defined as the number of observations belonging to a cluster. A phenomenon termed informative cluster size (ICS) occurs when observation outcomes vary in a systematic way related to the cluster size. An additional form of informativeness, termed informative within-cluster group size (IWCGS), arises when the distribution of group-defining categorical covariates within clusters similarly carries information related to outcomes. Standard methods for the marginal analysis of clustered data can produce biased estimates and inference when data have informativeness. A reweighting methodology has been developed that …


Linear Methods For Regression With Small Sample Sizes Relative To The Number Of Variables., Rajesh Sikder Aug 2020

Linear Methods For Regression With Small Sample Sizes Relative To The Number Of Variables., Rajesh Sikder

Electronic Theses and Dissertations

In data sets where there are a small number of observations but a large number of variables observed for each observation, ordinary least squares estimation cannot be used for regression models. There are many alternative including stepwise regression, penalized methods such as ridge regression and the LASSO, and methods based on derived inputs such as principal components regression and partial least squares regression. In this thesis, these five methods are described. K-fold cross validation is also discussed as a way for determining regularization parameters for each method. The performance of these methods in estimation and prediction is also examined through …


Chemostratigraphy Of Carbonate Gravity Flows Of The Wolfcamp Formation In Crockett County, Midland Basin, Texas, Alex Blizzard, Julie Bloxson Jun 2020

Chemostratigraphy Of Carbonate Gravity Flows Of The Wolfcamp Formation In Crockett County, Midland Basin, Texas, Alex Blizzard, Julie Bloxson

Electronic Theses and Dissertations

Sediment gravity flows into deep-water environments are important stratigraphic traps in lithologically diverse reservoirs generating multiple plays for hydrocarbon exploration. These highly heterogeneous deposits can be studied by utilizing chemostratigraphy and higher-order sequence stratigraphy; being an accurate method for reservoir characterization. Studying these gravity flows along a carbonate platform’s slope can further expand an understanding of the stratigraphy that is filling adjacent basins. The application of elemental analyses can support in identifying mineralogy that impact reservoir quality, especially when conventional testing cannot be applied.

This study utilizes five cores containing the Wolfcamp Formation from the southeastern slope of the Central …


First-Year Computer Science Students: Pathways And Perceptions In Introductory Computer Science Courses, Christina A. Leblanc May 2020

First-Year Computer Science Students: Pathways And Perceptions In Introductory Computer Science Courses, Christina A. Leblanc

Electronic Theses and Dissertations

This study examined student perceptions and experiences of an introductory Computer Science course at the University of Maine; COS 125: Introduction to Problem Solving Using Computer Programs. It also explored the pathways that students pursue after taking COS 125, depending on their success in the course, and their motivation to persist. Through characterizing student populations and their performance in their first semester in the Computer Science program, they can be placed into one of three categories that explain their path; a “continuer” (passed COS 125 and decided to stay in the major), a “persister” (did not pass COS 125 and …


Using Saddlepoint Approximations And Likelihood-Based Methods To Conduct Statistical Inference For The Mean Of The Beta Distribution, Bryn Brakefield May 2020

Using Saddlepoint Approximations And Likelihood-Based Methods To Conduct Statistical Inference For The Mean Of The Beta Distribution, Bryn Brakefield

Electronic Theses and Dissertations

The prevalence of conducting statistical inference for the mean of the beta distribution has been rising in various fields of academic research, such as in immunology that analyzes proportions of rare cell population subsets. For our purposes, we will address this statistical inference problem by using likelihood-based applications to hypothesis testing, along with a relatively new statistical method called saddlepoint approximations. Through simulation work, we will compare the performance of these statistical procedures and provide both the statistical and scientific communities with recommendations on best practices.


Novel Bayesian Methodology For The Analysis Of Single-Cell Rna Sequencing Data., Michael Sekula May 2020

Novel Bayesian Methodology For The Analysis Of Single-Cell Rna Sequencing Data., Michael Sekula

Electronic Theses and Dissertations

With single-cell RNA sequencing (scRNA-seq) technology, researchers are able to gain a better understanding of health and disease through the analysis of gene expression data at the cellular-level; however, scRNA-seq data tend to have high proportions of zero values, increased cell-to-cell variability, and overdispersion due to abnormally large expression counts, which create new statistical problems that need to be addressed. This dissertation includes three research projects that propose Bayesian methodology suitable for scRNA-seq analysis. In the first project, a hurdle model for identifying differentially expressed genes across cell types in scRNA-seq data is presented. This model incorporates a correlated random …


Measuring The Connective Action Of Black Lives Matter Activists: A Psychometric Investigation Into Twitter Data, Paige Alfonzo Jan 2020

Measuring The Connective Action Of Black Lives Matter Activists: A Psychometric Investigation Into Twitter Data, Paige Alfonzo

Electronic Theses and Dissertations

Many protest movements from the last twenty-first century have become increasingly networked and personalized. Several scholars have tapped into this change coining terms such as participatory action, digitally mediated action, computer-mediated communication, issue-based organization, and what I focus on in this project, connective action. Building on the ideas percolating across the literary landscape at the time, Bennett and Segerberg (2012) introduced the logic of connective action based on emergent characteristics they observed in post-2010 large-scale social movements. Both the logic of connective action and related work have become deeply ingrained in today's social movement scholarship. As such, I felt it …


Assessing Robustness Of The Rasch Mixture Model To Detect Differential Item Functioning - A Monte Carlo Simulation Study, Jinjin Huang Jan 2020

Assessing Robustness Of The Rasch Mixture Model To Detect Differential Item Functioning - A Monte Carlo Simulation Study, Jinjin Huang

Electronic Theses and Dissertations

Measurement invariance is crucial for an effective and valid measure of a construct. Invariance holds when the latent trait varies consistently across subgroups; in other words, the mean differences among subgroups are only due to true latent ability differences. Differential item functioning (DIF) occurs when measurement invariance is violated. There are two kinds of traditional tools for DIF detection: non-parametric methods and parametric methods. Mantel Haenszel (MH), SIBTEST, and standardization are examples of non-parametric DIF detection methods. The majority of parametric DIF detection methods are item response theory (IRT) based. Both non-parametric methods and parametric methods compare differences among subgroups …


Is The Reliability Of Objective Originality Scores Confounded By Elaboration?, Shannon Marie Maio Jan 2020

Is The Reliability Of Objective Originality Scores Confounded By Elaboration?, Shannon Marie Maio

Electronic Theses and Dissertations

The increased use of text-mining models as a scoring mechanism for divergent thinking (DT) tasks has sparked concerns about the ways in which automated Originality scores may be influenced by other dimensions of DT, especially Elaboration. The debate centers around the question of whether too much variance in automated Originality scores is accounted for by the number of words a participant uses in a response (i.e., Elaboration), and, thus, how the influence of Elaboration can affect the reliability of Originality scores. Here, a partial correlation analysis, in conjunction with text-mining and psychometric modeling, is conducted to test the degree to …


The Effects Of Adverse Childhood Experiences On Behavioral Outcomes, Jennifer Thomas Jan 2020

The Effects Of Adverse Childhood Experiences On Behavioral Outcomes, Jennifer Thomas

Electronic Theses and Dissertations

This study intends to explore the intersection of two vulnerable populations, early childhood development and risks associated with exposure to adverse childhood experiences (ACEs). This study examines how age plays a role in the long-term relationship between ACEs and internal and external behaviors. This study seeks to answer the question of: How does age influence the relationship between number of ACEs and internal and external behaviors? The participants in this study include those aged 0 – 16 from the National Survey of Child and adolescent Well-Being (NSCAW) dataset. The NSCAW study consists of five waves of data where Wave I …


The Experiences Of Ncaa Student-Athletes With An Eating Disorder Or Disordered Eating, Rachel E. Taylor Jan 2020

The Experiences Of Ncaa Student-Athletes With An Eating Disorder Or Disordered Eating, Rachel E. Taylor

Electronic Theses and Dissertations

The purpose of this study was to explore the experiences of student-athletes who had an eating disorder or disordered eating (ED/DE) while competing for the National Collegiate Athletic Association (NCAA). Integrating criticism and connoisseurship and critical evocative portraiture, four post-collegiate women who participated in cross country and track, who were either clinically diagnosed with an ED/DE or who self-diagnosed, participated in two interviews to describe their experiences with and the impact of ED/DE on their athletic pursuits, academic pursuits, as well as their relationships with coaches, teammates, and family. The analysis of these interviews showed the complexity of this topic. …


How 6-12th Grade Staff Support Students With Depression: A Pilot Study To Develop Measures Of Implicit Associations, Explicit Attitudes And Helping Behavior, Paul M. Thompson Jan 2020

How 6-12th Grade Staff Support Students With Depression: A Pilot Study To Develop Measures Of Implicit Associations, Explicit Attitudes And Helping Behavior, Paul M. Thompson

Electronic Theses and Dissertations

Students with emotional disabilities are disproportionately suspended and expelled in K-12 schools. Attribution theory suggests individuals are less likely to provide assistance to others if they believe the individuals are responsible for their own difficulties. To test attribution theory, this study created new measures of explicit attitudes and implicit associations of licensed 6-12th grade staff regarding students with depression as well as a helping behavior measure of staff toward students with depression. The survey was distributed within a single school district in the western United States. A majority of the sample (N = 52) held a mental health license (60%), …


Multiple Imputation Using Influential Exponential Tilting In Case Of Non-Ignorable Missing Data, Kavita Gohil Jan 2020

Multiple Imputation Using Influential Exponential Tilting In Case Of Non-Ignorable Missing Data, Kavita Gohil

Electronic Theses and Dissertations

Modern research strategies rely predominantly on three steps, data collection, data analysis, and inference. In research, if the data is not collected as designed, researchers may face challenges of having incomplete data, especially when it is non-ignorable. These situations affect the subsequent steps of evaluation and make them difficult to perform. Inference with incomplete data is a challenging task in data analysis and clinical trials when missing data related to the condition under the study. Moreover, results obtained from incomplete data are prone to biases. Parameter estimation with non-ignorable missing data is even more challenging to handle and extract useful …


Artificial Neural Network Models For Pattern Discovery From Ecg Time Series, Mehakpreet Kaur Jan 2020

Artificial Neural Network Models For Pattern Discovery From Ecg Time Series, Mehakpreet Kaur

Electronic Theses and Dissertations

Artificial Neural Network (ANN) models have recently become de facto models for deep learning with a wide range of applications spanning from scientific fields such as computer vision, physics, biology, medicine to social life (suggesting preferred movies, shopping lists, etc.). Due to advancements in computer technology and the increased practice of Artificial Intelligence (AI) in medicine and biological research, ANNs have been extensively applied not only to provide quick information about diseases, but also to make diagnostics accurate and cost-effective. We propose an ANN-based model to analyze a patient's electrocardiogram (ECG) data and produce accurate diagnostics regarding possible heart diseases …


Nonparametric Misclassification Simulation And Extrapolation Method And Its Application, Congjian Liu Jan 2020

Nonparametric Misclassification Simulation And Extrapolation Method And Its Application, Congjian Liu

Electronic Theses and Dissertations

The misclassification simulation extrapolation (MC-SIMEX) method proposed by Küchenho et al. is a general method of handling categorical data with measurement error. It consists of two steps, the simulation and extrapolation steps. In the simulation step, it simulates observations with varying degrees of measurement error. Then parameter estimators for varying degrees of measurement error are obtained based on these observations. In the extrapolation step, it uses a parametric extrapolation function to obtain the parameter estimators for data with no measurement error. However, as shown in many studies, the parameter estimators are still biased as a result of the parametric extrapolation …


Generalization Of Kullback-Leibler Divergence For Multi-Stage Diseases: Application To Diagnostic Test Accuracy And Optimal Cut-Points Selection Criterion, Chen Mo Jan 2020

Generalization Of Kullback-Leibler Divergence For Multi-Stage Diseases: Application To Diagnostic Test Accuracy And Optimal Cut-Points Selection Criterion, Chen Mo

Electronic Theses and Dissertations

The Kullback-Leibler divergence (KL), which captures the disparity between two distributions, has been considered as a measure for determining the diagnostic performance of an ordinal diagnostic test. This study applies KL and further generalizes it to comprehensively measure the diagnostic accuracy test for multi-stage (K > 2) diseases, named generalized total Kullback-Leibler divergence (GTKL). Also, GTKL is proposed as an optimal cut-points selection criterion for discriminating subjects among different disease stages. Moreover, the study investigates a variety of applications of GTKL on measuring the rule-in/out potentials in the single-stage and multi-stage levels. Intensive simulation studies are conducted to compare the performance …


Public Perception Of Different Planting Techniques Using Augmented Reality, Sultana Quader Tania Jan 2020

Public Perception Of Different Planting Techniques Using Augmented Reality, Sultana Quader Tania

Electronic Theses and Dissertations

The objective of this study was to measure public perception of the different planting techniques (block and matrix), which are used at visitor information centers (VICs) and other rights of way (ROW) areas. The main factors that affect public perception of planting techniques were identified through an extensive literature review and qualitative survey from four welcome centers in the state of Georgia. The ranking of those indicators, based on public preferences, was discovered through a quantitative survey. During the first phase of the quantitative survey, images of block and matrix were used. An iOS-based user-friendly and cost-effective augmented reality (AR) …