Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 12 of 12

Full-Text Articles in Physical Sciences and Mathematics

Time Series Analysis Of Weather Data In South Carolina, Geophrey Odero Oct 2019

Time Series Analysis Of Weather Data In South Carolina, Geophrey Odero

Theses and Dissertations

This thesis discusses time series analysis of weather data in South Carolina for the last fifteen years (January 2003 to December 2017) for Columbia, Greenville and North Myrtle Beach. The first part presents a brief overview of different variables that are used in the analysis. That is, temperature, dew point, humidity and sea level pressure. A short discussion of time series data is also introduced. The second part is about modeling the variables. The models of choice are presented, fitted and model diagnostics is carried out. In the third part, we discuss background on climates of the cities and model …


Statistical Analysis Of Interval-Censored Data Subject To Additional Complications, Qiang Zheng Jul 2019

Statistical Analysis Of Interval-Censored Data Subject To Additional Complications, Qiang Zheng

Theses and Dissertations

Survival analysis is an important branch of statistics that studies time to event data (or survival data), in which the response variable is time to a certain event of interest. The most prominent feature of survival data is that the response is not exactly observed due to limits of the study design or nature of the event of interest. Interval-censored data are a common type of survival data and occur frequently in real life studies where subjects are examined at periodical follow ups. The response time is usually not observed, but the status of the event of interest is known …


Estimation Problems For Pooled Data, Xichen Mou Jul 2019

Estimation Problems For Pooled Data, Xichen Mou

Theses and Dissertations

In epidemiological applications, individual specimens (e.g., blood, urine, etc.) are often pooled together to detect the presence of disease or to measure the concentration level of a specific biomarker. Due to the advantage of cost efficiency, pooled data are also seen in diverse areas such as genetics, animal ecology, and environmental science. With pooled data, individual observations are masked and new statistical methods are needed to estimate characteristics such as disease prevalence, the underlying density function of a biomarker, etc. We focus on three estimation problems for pooled data. Chapters 2 and 3 propose nonparametric estimators for the density function …


Multivariate Probit Models For Interval-Censored Failure Time Data, Yifan Zhang Jul 2019

Multivariate Probit Models For Interval-Censored Failure Time Data, Yifan Zhang

Theses and Dissertations

Survival analysis is an important branch of statistics that analyzes the time to event data. The events of interest can be death, disease occurrence, the failure of a machine part, etc.. One important feature of this type of data is censoring: information on time to event is not observed exactly due to loss to follow-up or non-occurrence of interested event before the trial ends. Censored data are commonly observed in clinical trials and epidemiological studies, since monitoring a person’s health over time after treatment is often required in medical or health studies. In this dissertation we focus on studying multivariate …


Extension Of Risk-Based Measure Of Time-Varying Prognostic Discrimination For Survival Models, Shujie Chen Jul 2019

Extension Of Risk-Based Measure Of Time-Varying Prognostic Discrimination For Survival Models, Shujie Chen

Theses and Dissertations

The Cox proportional hazards (PH) model and time dependent PH model are the most popular survival models in survival analysis. The hazard discrimination summary HDS(t) proposed by Liang and Heagerty [2017] is used to evaluate the mean hazard difference between cases and controls at time t. Liang and Heagerty [2017] evaluated the discrimination performance under the PH model and time dependent PH model with right censoring.

In this thesis, first, we further investigate their method via comprehensive simulations including 1) We extend the simulation in Liang and Heagerty [2017] under the PH model by adding more scenarios such as different …


Investigations On Multiple Interval Estimators, Taeho Kim Jul 2019

Investigations On Multiple Interval Estimators, Taeho Kim

Theses and Dissertations

Multiple interval estimation for a set of parameters is investigated. To begin, a strategy of optimization for a multiple interval estimator (MIE) is introduced. This approach allocates distinct optimized levels to individual interval estimators so that the global expected content can be minimized while the global coverage probability is still maintained at a global level. This optimal allocation is achieved by a decision theoretic procedure which consists of two global risk functions. The major part of this manuscript is devoted to two multiple interval estimation procedures. Both procedures adopt prior information added to the classical setting, but these procedures do …


Randomization Analysis Driven Software, Steph-Yves Louis Apr 2019

Randomization Analysis Driven Software, Steph-Yves Louis

Theses and Dissertations

The application of a method of randomization for a clinical trial frequently summarizes to using Simple Randomization. Even though the latter method provides favorable characteristics, if the collected sample is not large enough, it still presents the highest chance of imbalance both marginally in the treatment groups and locally in terms of the covariates. Methods of Permuted Block Randomization, Urn Randomization, Stratified Permuted Block Randomization, and Minimization represent popular alternative methods that one should consider depending on the goal of the study. A comparison of the previously mentioned methods is carried to evaluate their performance with samples that are not …


Cluster Analysis Of Mixed-Mode Data, Yawei Liang Apr 2019

Cluster Analysis Of Mixed-Mode Data, Yawei Liang

Theses and Dissertations

In the modern world, data have become increasingly more complex and often contain different types of features. Two very common types of features are continuous and discrete variables. Clustering mixed-mode data, which include both continuous and discrete variables, can be done in various ways. Furthermore, a continuous variable can take any value between its minimum and maximum. Types of continuous vari- ables include bounded or unbounded normal variables, uniform variables, circular variables, etc. Discrete variables include types other than continuous variables, such as binary variables, categorical (nominal) variables, Poisson variables, etc. Difficulties in clustering mixed-mode data include handling the association …


Regression For Pooled Testing Data With Biomedical Applications, Juexin Lin Apr 2019

Regression For Pooled Testing Data With Biomedical Applications, Juexin Lin

Theses and Dissertations

Since first introduced by Dorfman in 1943, pooled testing has been widely used as a cost and time effective testing protocol in the variety of applications. This dis- sertation consists of three projects that reveal the use of pooling techniques in the disease prevention from the perspective of regression. For disease monitoring and control, individual covariates information are often of practical interest and yield meaningful interpretations. It is natural to model the outcome of interest, which can be either a disease status (binary) or a biomarker concentration index (continuous), with individual-specific covariates through a regression analysis. Chapter 2 focuses on …


Spatio-Temporal Analysis Of Precipitation And Flood Data From South Carolina, Haigang Liu Apr 2019

Spatio-Temporal Analysis Of Precipitation And Flood Data From South Carolina, Haigang Liu

Theses and Dissertations

Spatio-temporal data are everywhere: we encounter them on TV, in newspapers, on computer screens, on tablets, and on plain paper maps. As a result, researchers in di- verse areas are increasingly faced with the task of modeling geographically-referenced and temporally-correlated data. In this dissertation, we propose two different spa- tiotemporal models to capture the behavior of rainfall and flood data in the state of South Carolina.

Both models are built using a Bayesian hierarchical framework, which involves specifying the true underlying process in the first level and the spatio-temporal ran- dom effect in the second level of the hierarchy. The …


Inflated Standard Errors Of Mcmc Estimates In Irt, Dongho Shin Apr 2019

Inflated Standard Errors Of Mcmc Estimates In Irt, Dongho Shin

Theses and Dissertations

Two widely used algorithms for estimating item response theory (IRT) parameters are Markov chain Monte Carlo (MCMC) and the EM algorithm. In general, the MCMC algorithm has advantages over the EM algorithm - for example, the MCMC algorithm allows one to estimate the desired posterior distribution and also works more straightforwardly with complex IRT models. This ease of use, allows one to implement the MCMC algorithm without carefully consideration. Previous studies, Hendrix (2011) and Lee (2016), noted that the estimated standard errors from the MCMC algorithm are larger than those from the EM algorithm. Therefore, this study investigate the reason …


Mle And Bayesian Methods To Analyze Data With Missing Values Below The Limit Of Detection, Xinxin Hu Apr 2019

Mle And Bayesian Methods To Analyze Data With Missing Values Below The Limit Of Detection, Xinxin Hu

Theses and Dissertations

As pesticides are widely used in agriculture, more and more people who work at places like farm are exposed to the pesticides. According to enviroment re- searches [Villarejo; 2003; Reigart and Roberts; 1999], being exposed to some kind of pesticides like Organophosphorus (OP) insecticides has significantly effected the health of farmworkers and their family. The actual level of pesticides can be detected with some limitation for now. However, it is hard to detect when the level is below the limit of detection (LOD). Therefore, the goal of our research is to propose several different methods to analyze data …