Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 8 of 8

Full-Text Articles in Entire DC Network

Statistical Methods For Meta-Analysis In Large-Scale Genomic Experiments, Wimarsha Thathsarani Jayanetti Dec 2022

Statistical Methods For Meta-Analysis In Large-Scale Genomic Experiments, Wimarsha Thathsarani Jayanetti

Mathematics & Statistics Theses & Dissertations

Recent developments in high throughput genomic assays have opened up the possibility of testing hundreds and thousands of genes simultaneously. With the availability of vast amounts of public databases, researchers tend to combine genomic analysis results from multiple studies in the form of a meta-analysis. Meta-analysis methods can be broadly classified into two main categories. The first approach is to combine the statistical significance (pvalues) of the genes from each individual study, and the second approach is to combine the statistical estimates (effect sizes) from the individual studies. In this dissertation, we will discuss how adherence to the standard null …


A Copula Model Approach To Identify The Differential Gene Expression, Prasansha Liyanaarachchi Dec 2021

A Copula Model Approach To Identify The Differential Gene Expression, Prasansha Liyanaarachchi

Mathematics & Statistics Theses & Dissertations

Deoxyribonucleic acid, more commonly known as DNA, is a complex double helix-shaped molecule present in all living organisms and hosts thousands of genes. However, only a few genes exhibit differential expression and play a vital role in a particular disease such as breast cancer. Microarray technology is one of the modern technologies developed to study these gene expressions. There are two major microarray technologies available for expression analysis: Spotted cDNA array and oligonucleotide array. The focus of our research is the statistical analysis of data that arises from the spotted cDNA microarray. Numerous models have been proposed in the literature …


Copula-Based Zero-Inflated Count Time Series Models, Mohammed Sulaiman Alqawba Jul 2019

Copula-Based Zero-Inflated Count Time Series Models, Mohammed Sulaiman Alqawba

Mathematics & Statistics Theses & Dissertations

Count time series data are observed in several applied disciplines such as in environmental science, biostatistics, economics, public health, and finance. In some cases, a specific count, say zero, may occur more often than usual. Additionally, serial dependence might be found among these counts if they are recorded over time. Overlooking the frequent occurrence of zeros and the serial dependence could lead to false inference. In this dissertation, we propose two classes of copula-based time series models for zero-inflated counts with the presence of covariates. Zero-inflated Poisson (ZIP), zero-inflated negative binomial (ZINB), and zero-inflated Conway-Maxwell-Poisson (ZICMP) distributed marginals of the …


Spatio-Temporal Cluster Detection And Local Moran Statistics Of Point Processes, Jennifer L. Matthews Apr 2019

Spatio-Temporal Cluster Detection And Local Moran Statistics Of Point Processes, Jennifer L. Matthews

Mathematics & Statistics Theses & Dissertations

Moran's index is a statistic that measures spatial dependence, quantifying the degree of dispersion or clustering of point processes and events in some location/area. Recognizing that a single Moran's index may not give a sufficient summary of the spatial autocorrelation measure, a local indicator of spatial association (LISA) has gained popularity. Accordingly, we propose extending LISAs to time after partitioning the area and computing a Moran-type statistic for each subarea. Patterns between the local neighbors are unveiled that would not otherwise be apparent. We consider the measures of Moran statistics while incorporating a time factor under simulated multilevel Palm distribution, …


Zero-Inflated Models To Identify Transcription Factor Binding Sites In Chip-Seq Experiments, Sameera Dhananjaya Viswakula Apr 2015

Zero-Inflated Models To Identify Transcription Factor Binding Sites In Chip-Seq Experiments, Sameera Dhananjaya Viswakula

Mathematics & Statistics Theses & Dissertations

It is essential to determine the protein-DNA binding sites to understand many biological processes. A transcription factor is a particular type of protein that binds to DNA and controls gene regulation in living organisms. Chromatin immunoprecipitation followed by highthroughput sequencing (ChIP-seq) is considered the gold standard in locating these binding sites and programs use to identify DNA-transcription factor binding sites are known as peak-callers. ChIP-seq data are known to exhibit considerable background noise and other biases. In this study, we propose a negative binomial model (NB), a zero-inflated Poisson model (ZIP) and a zero-inflated negative binomial model (ZINB) for peak-calling. …


Efficient Unbiased Estimating Equations For Analyzing Structured Correlation Matrices, Yihao Deng Jul 2006

Efficient Unbiased Estimating Equations For Analyzing Structured Correlation Matrices, Yihao Deng

Mathematics & Statistics Theses & Dissertations

Analysis of dependent continuous and discrete data has become an active area of research. For normal data, correlations fully quantify the dependence. And historically, maximum likelihood method has been very successful to estimate the correlations and unbiased estimating equation approach has become a popular alternative when there may be a departure from normality. In this thesis we show that the optimal unbiased estimating equation coincides with the likelihood equations for normal data. We then introduce a general class of weighted unbiased estimating equations to estimate parameters in a structured correlation matrix. We derive expressions for asymptotic covariance of the estimates, …


Estimation Of Parameters In Replicated Time Series Regression Models, Genming Shi Jul 2003

Estimation Of Parameters In Replicated Time Series Regression Models, Genming Shi

Mathematics & Statistics Theses & Dissertations

The time series regression model was widely studied in the literature by several authors. However, statistical analysis of replicated time series regression models has received little attention. In this thesis, we study the application of quasi-least squares, a relatively new method, to estimate the parameters in replicated time series models with general ARMA( p, q) correlation structure. We also study several established methods for estimating the parameters in those models, including the maximum likelihood, method of moments, and the GEE method. Asymptotic comparisons of the methods are made bV fixing the number of repeated measurements in each series, and …


Mathematical Models Of Prevascular Tumor Growth By Diffusion, Sophia A. Maggelakis Jul 1989

Mathematical Models Of Prevascular Tumor Growth By Diffusion, Sophia A. Maggelakis

Mathematics & Statistics Theses & Dissertations

A study of several complementary mathematical models that describe the early, prevascular stages of solid tumor growth by diffusion under various simplifying assumptions is presented. The advantage of these models is that their degree of complexity is relatively low, which ensures fairly straightforward comparisons with experimental or clinical data (as it becomes available), yet they are mathematically sophisticated enough to capture the main biological phenomena of interest.

The tumor growth and cell proliferation rate are assumed to depend on the local concentrations of nutrients and inhibitory factors. The effects of geometry and spatially non-uniform inhibitor production and non-uniform nutrient consumption …