Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Keyword
-
- Physical Sciences and Mathematics, Statistics and Probability (7)
- Physical Sciences and Mathematics, Statistics and Probability, Biostatistics (4)
- Bayesian (2)
- Physical Sciences and Mathematics, Statistics and Probability, Biostatistics (2)
- Semiparametric (2)
-
- Analyzing (1)
- Application (1)
- Application to Copula Functions (1)
- Application to HIV (1)
- Arts and Humanities, Statistics and Probability, Statistical Models (1)
- Bayesian Ensemble (1)
- Biostatistics, Longitudinal Data Analysis and Time Series (1)
- Bivariate Probability (1)
- Calculation (1)
- Clustered (1)
- Clustering (1)
- Complex Accelerated (1)
- Count Data (1)
- Cure Model (1)
- Density estimation (1)
- Dependent Data (1)
- Development (1)
- Dietary inflammatory index (1)
- Distribution (1)
- Double Poisson Distribution (1)
- Estimation Problems (1)
- Failure Time Data (1)
- Failure Time Model (1)
- Fraily Probit Models (1)
- Functional Observations (1)
- Publication Type
Articles 1 - 18 of 18
Full-Text Articles in Physical Sciences and Mathematics
Inflammatory Properties Of Diet And Glucose-Insulin Homeostasis In A Cohort Of Iranian Adults, Nazanin Moslehi, Behnaz Ehsani, Parvin Mirmiran, Nitin Shivappa, Maryam Tohidi, James R. Hébert, Fereidoun Azizi
Inflammatory Properties Of Diet And Glucose-Insulin Homeostasis In A Cohort Of Iranian Adults, Nazanin Moslehi, Behnaz Ehsani, Parvin Mirmiran, Nitin Shivappa, Maryam Tohidi, James R. Hébert, Fereidoun Azizi
Faculty Publications
We aimed to investigate associations of the dietary inflammatory index (DII) with glucose-insulin homeostasis markers, and the risk of glucose intolerance. This cross-sectional study included 2975 adults from the Tehran Lipid and Glucose Study. Fasting plasma glucose (FPG), 2-h post-load glucose (2h-PG), and fasting serum insulin were measured. Homeostatic model assessment of insulin resistance index (HOMA-IR) and β-cell function (HOMA-B), and the quantitative insulin sensitivity check index (QUICKI) were calculated. Glucose tolerance abnormalities included impaired fasting glucose (IFG), impaired glucose tolerance (IGT), and type 2 diabetes (T2DM). DII scores were positively associated with 2h-PG (β = 0.04; p = 0.05). …
Development And Application Of Bayesian Semiparametric Models For Dependent Data, Junshu Bao
Development And Application Of Bayesian Semiparametric Models For Dependent Data, Junshu Bao
Theses and Dissertations
Dependent data are very common in many research fields, such as medicine (repeated measures), finance (time series), traffic (clustered), etc. Effective control/modeling of the dependency among data can enhance the performance of the models and result in better prediction. In many cases, the correlation itself may be of great interest. In this dissertation, we develop novel Bayesian semi-/nonparametric regression models to analyze data with various dependence structures. In Chapter 2, a Bayesian non- parametric multivariate ordinal regression model is proposed to fit drinking behavior survey data from DWI offenders. The responses are two-dimensional ordinal data, drinking frequency and drinking quantity …
Bayesian Nonparametric Approaches To Multiple Testing, Density Estimation, And Supervised Learning, William Cipolli Iii
Bayesian Nonparametric Approaches To Multiple Testing, Density Estimation, And Supervised Learning, William Cipolli Iii
Theses and Dissertations
This dissertation presents methods for several applications of Polya tree models. These novel nonparametric approaches to the problems of multiple testing, density estimation and supervised learning provide an alternative to other parametric and nonparametric models. In Chapter 2, the proposed approximate finite Polya tree multiple testing procedure is very successful in correctly classifying the observations with non-zero mean in a computationally efficient manner; this holds even when the non-zero means are simulated from a mean-zero distribution. Further, the model is capable of this for “interestingly different” observations in the cases where that is of interest. Chapter 3 proposes discrete, and …
Novel Methods For Analyzing Longitudinal Data With Measurement Error In The Time Variable, Caroline Munindi Mulatya
Novel Methods For Analyzing Longitudinal Data With Measurement Error In The Time Variable, Caroline Munindi Mulatya
Theses and Dissertations
In some longitudinal studies, the observed time points are often confounded with measurement error due to the sampling conditions, resulting into data with measurement error in the time variable. This type of data occurs mainly in observational studies when the onset of a longitudinal process is unknown or in clinical trials when individual visits do not take place as specified by the study protocol, but are often rounded to coincide with the study protocol. Methodological and inferential implications of error in time varying covariates for both linear and nonlinear models have been studied widely. In this dissertation, we shift attention …
Spatio-Temporal Analysis Of The Occupational Fatal Victimization Of Law Enforcement Officers In The Us, Xueyi Xing
Spatio-Temporal Analysis Of The Occupational Fatal Victimization Of Law Enforcement Officers In The Us, Xueyi Xing
Theses and Dissertations
The models with constant coefficients of the covariates across space and time are commonly used in spatio-temporal analyses. However, the associations between risk factors and the outcome could have locally differential temporal trends in many cases. In this study, a Bayesian latent cluster modeling strategy is employed to identify potential spatial clusters in which locally specific sets of temporally varying coefficients of covariates are allowed. A state-level panel data of police officers occupational fatal victimization for the years 1979-2010 is used. To accommodate overdisperson and excess zeros, a negative binomial model and zero-inflated Poisson/negative binomial models are also utilized. A …
Score Test Derivations And Implementations For Bivariate Probability Mass And Density Functions With An Application To Copula Functions, Roy Bower
Theses and Dissertations
This dissertation is comprised and grounded in statistical theory with an application to solving real world problems. In particular, the development and implementation of multiple score tests under a variety of scenarios are derived, applied, and interpreted. In chapter 2, I propose a score test for independence of the marginals based on Lakshminarayana’s bivariate Poisson distribution. Each marginal distribution of the bivariate model is a univariate Poisson distribution, and the parameters of the bivariate distribution can be estimated using maximum likelihood methods. The simulation study shows that the score test maintains size close to the nominal level. To assess the …
Semiparametric Estimation Methods For Complex Accelerated Failure Time Model, Yinding Wang
Semiparametric Estimation Methods For Complex Accelerated Failure Time Model, Yinding Wang
Theses and Dissertations
The proportional hazards (PH) model and the accelerated failure time (AFT) model are the two most popular survival models in fitting the right-censored data. The AFT model is a useful alternative to the PH model, particularly when the PH assumption is not satisfied. Usually, the linear association is assumed with logarithm of survival time in the AFT model. However, the nonlinear association may exist in practice. The first project aims to handle the nonlinear component in the AFT model, which is called the semiparametric additive partial accelerated failure time (AP-AFT) model. Two estimation methods based on the rank-smooth method and …
The Reflected-Shifted-Truncated-Gamma Distribution For Negatively Skewed Survival Data With Application To Pediatric Nephrotic Syndrome, Sophia D. Waymyers
The Reflected-Shifted-Truncated-Gamma Distribution For Negatively Skewed Survival Data With Application To Pediatric Nephrotic Syndrome, Sophia D. Waymyers
Theses and Dissertations
Negatively skewed survival data arise occasionally in public health fields and in statistical research. Standard distributions such as the exponential, generalized F, generalized gamma, Gompertz, log-logistic, lognormal, Rayleigh, and Weibull distributions are not always well suited to this data. The primary goal of this dissertation is to find a viable alternative for modeling negatively skewed survival data such as the time to first remission for pediatric patients with frequently relapsing or steroid dependent nephrotic syndrome.
We begin with a brief introduction of survival analysis and the nature of pediatric nephrotic syndrome. A meta-analysis on atopy and pediatric nephrotic syndrome using …
Sample Size Calculation For Ph Mixture Cure Model, Yihong Zhan
Sample Size Calculation For Ph Mixture Cure Model, Yihong Zhan
Theses and Dissertations
With the development of advanced medical technology, a significant proportion of patients can be cured of many chronic diseases. Because a substantial fraction of patients have censored information, the standard survival model, such as the proportional hazards (PH) model cannot capture the cured information of patients. Thus PH mixture cure model is developed to handle the survival data with potential cured information. A corresponding sample size formula based on log rank test has been proposed by Wang et al. (2012) and the probability of death in their formula is only contributed by the control arm. However, to calculate the sample …
Modern Estimation Problems In Group Testing, Md Shamim Sarker
Modern Estimation Problems In Group Testing, Md Shamim Sarker
Theses and Dissertations
In the simplest form of group testing, pools are formed by compositing a fixed number of individual specimens (e.g., blood, urine, swab, etc.) and then the pools are tested for a binary characteristic, such as presence or absence of a disease. Group testing is commonly used to screen for a variety of sexually transmitted diseases in epidemiological applications where the main goal is to increase testing efficiency. In this dissertation, we study three estimation problems that are motivated by real-life applications. We propose new methods to model group testing data for both single and multiple infections. In the first problem, …
Parametric Reversed Hazards Model For Left Censored Data With Application To Hiv, Farahnaz Islam
Parametric Reversed Hazards Model For Left Censored Data With Application To Hiv, Farahnaz Islam
Theses and Dissertations
Left censoring is generally a rare type of censoring in time-to-event data, however there are some fields such as HIV related studies where it commonly occurs. Currently, there is no clear recommendation in the literature on the optimal model and distribution to analyze left-censored data. Recommendations can help researchers apply more accurate models for this type of censoring. This study derives the Parametric Reversed Hazards (PRH) Model for a variety of distributions which may be appropriate for left censored data. The performance of these derived PRH models to analyze HIV viral load data are compared using extensive simulations and a …
Semiparametric Regression Analysis Of Panel Count Data And Interval-Censored Failure Time Data, Bin Yao
Semiparametric Regression Analysis Of Panel Count Data And Interval-Censored Failure Time Data, Bin Yao
Theses and Dissertations
This dissertation discusses three important research topics on semiparametric regression analysis of panel count data and interval-censored data. Both types of data arise commonly in real-life studies in many fields such as epidemiology, social science, and medical research. In these studies, subjects are usually examined multiple times at periodical or irregular follow-up examinations. For panel count data, the response variable is the counts of some recurrent events, whose exact occurrence times are usually unknown. For interval-censored data, the response variable is the time to some events of interest, often called survival time or failure time, and the exact response time …
Registration And Clustering Of Functional Observations, Zizhen Wu
Registration And Clustering Of Functional Observations, Zizhen Wu
Theses and Dissertations
As an important exploratory analysis, curves of similar shape are often classified into groups, which we call clustering of functional data. Phase variations or time distortions are often encountered in the biological processes, such as growth patterns or gene profiles. As a result of time distortion, curves of similar shape may not be aligned. Regular clustering methods for functional data usually ignore the presence of phase variations, which may result in low clustering accuracy. However, it is difficult to account for phase variation without knowing the cluster structure.
In this dissertation, we first propose a Bayesian method that simultaneously clusters …
Semiparametric Joint Dynamic Modeling Of A Longitudinal Marker, Recurrent Competing Risks, And A Terminal Event, Piaomu Liu
Theses and Dissertations
The joint modeling framework has found extensive applications in cancer and other biomedical research. For example, recent initiatives and developments in precision medicine call for appropriate prognostic tools to assist individualized or personalized approaches in cancer diagnosis and treatment. Data generated by clinical trials and medical research often include correlated longitudinal marker measurements and time- to-event information, which are possibly a recurrent event, competing risks, and a survival outcome. Primary interests of joint modeling include the association between the longitudinal marker measurements and time-to-event data, as well as predictions of survival probabilities of new observational units from the same population. …
Regression Models For Count Data Based On The Double Poisson Distribution, Rebecca Wardrop
Regression Models For Count Data Based On The Double Poisson Distribution, Rebecca Wardrop
Theses and Dissertations
This paper explores the double Poisson distribution. The probability mass function and the difficulties associated with derivative-based optimization for this distribution are discussed. Stata software developed for estimation of double Poisson regression is detailed. Simulations are used to test the software. Data which are over-, under-, and equidispersed relative to the Poisson are generated and the software is utilized to estimate a regression model, a zero-inflated model, and a marginalized zero-inflated model all based on the double Poisson distribution. The estimated power of the test for φ = 1 for the double Poisson models are compared to the power of …
Frailty Probit Models For Clustered Interval-Censored Failure Time Data, Haifeng Wu
Frailty Probit Models For Clustered Interval-Censored Failure Time Data, Haifeng Wu
Theses and Dissertations
Survival analysis is an important branch of statistics that deals with time to event data or survival data. An important feature of such data is that the survival time of interest is usually not completely known but is censored due to the design of the study or an early dropout. In this dissertation we focus on studying clustered interval-censored data, a special type of survival data. Interval-censored data arise in many epidemiological, social science, and medical studies, in which subjects are examined at periodical follow-up visits. The survival (or failure) time of interest is never exactly observed but is known …
Some Issues In Markov Chain Monte Carlo Estimation For Item Response Theory, Han Kil Lee
Some Issues In Markov Chain Monte Carlo Estimation For Item Response Theory, Han Kil Lee
Theses and Dissertations
Both the marginalized Bayesian modal estimation (MBME) and Metropolis-Hasting within Gibbs (MH/Gibbs) are the popular estimation methods for Item Response Theory (IRT). However, predictions from MBME and MH/Gibbs are not directly comparable because of two problems. First, the examinees with the same response pattern do not produce the same ability estimates from MH/Gibbs while MBME provides identical estimates. This problem can be handled by updating each response pattern instead of updating each examinee. Second, standard errors from MBME are smaller than standard error estimates from MH/Gibbs. This pattern occurs because of two speculated reasons; correlation between item parameter estimations and …
Bayesian Ensemble Of Regression Trees For Multinomial Probit And Quantile Regression, Bereket P. Kindo
Bayesian Ensemble Of Regression Trees For Multinomial Probit And Quantile Regression, Bereket P. Kindo
Theses and Dissertations
This dissertation proposes multinomial probit Bayesian additive regression trees (MPBART), ordered multiclass Bayesian additive classification trees (O-MBACT) and Bayesian quantile additive regression trees (BayesQArt) as extensions of BART - Bayesian additive regression trees for tackling multinomial choice, multiclass classification, ordinal regression and quantile regression problems. The proposed models exhibit very good predictive performances. In particular, ranking among the top performing procedures when non-linear relationships exist between the response and the predictors. The proposed procedures can readily be applied on data sets with the number of predictors larger than the number of observations.
MPBART is sufficiently flexible to allow inclusion of …