Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
- Discipline
- Statistical Models (9)
- Statistical Methodology (8)
- Statistical Theory (8)
- Biostatistics (5)
- Medicine and Health Sciences (5)
- Survival Analysis (5)
- Applied Mathematics (3)
- Epidemiology (3)
- Numerical Analysis and Computation (3)
- Public Health (3)
- Categorical Data Analysis (2)
- Disease Modeling (2)
- Diseases (2)
- Genetics (2)
- Genetics and Genomics (2)
- Life Sciences (2)
- Longitudinal Data Analysis and Time Series (2)
- Clinical Trials (1)
- Laboratory and Basic Science Research (1)
- Multivariate Analysis (1)
- Keyword
- 0.632 resampling; Bootstrap; K-fold cross-validation; Model and variable selections; Perturbation-resampling; Prediction (1)
- B-splines (1)
- Backfitting algorithm; CAR model; collapsibility; epidemiology; Gauss-Seidel algorithm; iterative weighted least squares algorithm (1)
- Bayesian statistics; Fourier basis; FFT; generalized linear mixed model; geostatistics; spatial statistics (1)
- Bootstrap; Median regression; Metropolis algorithm (1)
- Censored linear regression; Partial linear model; Resampling method; Rank estimation (1)
- Clustered/longitudinal data; Generalized estimating equations; Generalized linear mixed models; Kernel method (1)
- Conditional power; frailty model; adaptive design (1)
- Cumulative Residual (1)
- Diagnostic Accuracy (1)
- Enterococcus (1)
- Fourier series (1)
- Generalized Linear Model (1)
- Model Checking (1)
- Normal approximation; Resampling (1)
- Orthogonal Basis (1)
- Penalized spline (1)
- Poisson-gamma (1)
- ROC Regression (1)
Articles 1 - 15 of 15
Full-Text Articles in Physical Sciences and Mathematics
Model Checking For ROC Regression Analysis, Tianxi Cai, Yingye Zheng
Harvard University Biostatistics Working Paper Series
The Receiver Operating Characteristic (ROC) curve is a prominent tool for characterizing the accuracy of a continuous diagnostic test. To account for factors that might influence the test accuracy, various ROC regression methods have been proposed. However, as in any regression analysis, when the assumed models do not fit the data well, these methods may render invalid and misleading results. To date, practical model checking techniques suitable for validating existing ROC regression models are not yet available. In this paper, we develop cumulative residual based procedures to graphically and numerically assess the goodness-of-fit for some commonly used ROC regression models, and …
Model Evaluation Based On The Distribution Of Estimated Absolute Prediction Error, Lu Tian, Tianxi Cai, Els Goetghebeur, L. J. Wei
Harvard University Biostatistics Working Paper Series
The construction of a reliable, practically useful prediction rule for future response is heavily dependent on the "adequacy" of the fitted regression model. In this article, we consider the absolute prediction error, the expected value of the absolute difference between the future and predicted responses, as the model evaluation criterion. This prediction error is easier to interpret than the average squared error and is equivalent to the mis-classification error for the binary outcome. We show that the distributions of the apparent error and its cross-validation counterparts are approximately normal even under a misspecified fitted model. When the prediction rule is …
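The criterion described in this abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the simulated data, the straight-line working model, and the helper names `fit_line`, `apparent_error`, and `kfold_error` are all assumptions made for illustration. The sketch computes the apparent absolute prediction error (fit and evaluate on the same data) and a K-fold cross-validation counterpart.

```python
import random


def fit_line(xs, ys):
    """Ordinary least squares fit of y = a + b*x; returns (a, b)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b


def apparent_error(xs, ys):
    """Mean absolute error when the model is fit and evaluated on the same data."""
    a, b = fit_line(xs, ys)
    return sum(abs(y - (a + b * x)) for x, y in zip(xs, ys)) / len(xs)


def kfold_error(xs, ys, k=5, seed=0):
    """K-fold cross-validated mean absolute prediction error."""
    idx = list(range(len(xs)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]  # k disjoint held-out sets
    total = 0.0
    for fold in folds:
        held = set(fold)
        train = [i for i in idx if i not in held]
        a, b = fit_line([xs[i] for i in train], [ys[i] for i in train])
        # Accumulate absolute errors on observations not used for fitting.
        total += sum(abs(ys[i] - (a + b * xs[i])) for i in fold)
    return total / len(xs)


# Hypothetical simulated data: linear signal plus standard normal noise.
rng = random.Random(1)
xs = [rng.uniform(0, 10) for _ in range(200)]
ys = [1.0 + 2.0 * x + rng.gauss(0, 1) for x in xs]

app = apparent_error(xs, ys)
cv = kfold_error(xs, ys)
print("apparent error:", app)
print("5-fold CV error:", cv)
```

Because the working model here happens to be correctly specified, both estimates should land near the mean absolute value of the noise; the abstract's point is that their distributions remain approximately normal even when the fitted model is misspecified.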
Gauss-Seidel Estimation Of Generalized Linear Mixed Models With Application To Poisson Modeling Of Spatially Varying Disease Rates, Subharup Guha, Louise Ryan
Harvard University Biostatistics Working Paper Series
Generalized linear mixed models (GLMMs) provide an elegant framework for the analysis of correlated data. Due to the non-closed form of the likelihood, GLMMs are often fit by computational procedures like penalized quasi-likelihood (PQL). Special cases of these models are generalized linear models (GLMs), which are often fit using algorithms like iterative weighted least squares (IWLS). High computational costs and memory space constraints often make it difficult to apply these iterative procedures to data sets with a very large number of cases.
This paper proposes a computationally efficient strategy based on the Gauss-Seidel algorithm that iteratively fits sub-models of the GLMM …
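For readers unfamiliar with the underlying idea, the following is a minimal sketch of the classical Gauss-Seidel iteration for a linear system, not the authors' GLMM fitting strategy. The function name `gauss_seidel` and the example system are assumptions chosen for illustration. It shows the core pattern of updating one component at a time while reusing the most recent values of the others.

```python
def gauss_seidel(A, b, tol=1e-10, max_iter=1000):
    """Solve A x = b by Gauss-Seidel sweeps.

    Convergence is guaranteed when A is strictly diagonally dominant
    (as in the example below) or symmetric positive definite.
    """
    n = len(b)
    x = [0.0] * n
    for _ in range(max_iter):
        max_change = 0.0
        for i in range(n):
            # Use the latest values of all other components, including
            # those already updated earlier in this sweep.
            s = sum(A[i][j] * x[j] for j in range(n) if j != i)
            new = (b[i] - s) / A[i][i]
            max_change = max(max_change, abs(new - x[i]))
            x[i] = new
        if max_change < tol:
            break
    return x


# Hypothetical strictly diagonally dominant system with solution [1, 2, 3].
A = [[4.0, 1.0, 0.0],
     [1.0, 5.0, 2.0],
     [0.0, 2.0, 6.0]]
b = [6.0, 17.0, 22.0]
solution = gauss_seidel(A, b)
print(solution)
```

Each sweep touches only one row at a time, which is what makes this style of iteration attractive when the full problem is too large to handle in a single step.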
Designed Extension Of Survival Studies: Application To Clinical Trials With Unrecognized Heterogeneity, Yi Li, Mei-Chiung Shih, Rebecca A. Betensky
Harvard University Biostatistics Working Paper Series
It is well known that unrecognized heterogeneity among patients, such as is conferred by genetic subtype, can undermine the power of a randomized trial, designed under the assumption of homogeneity, to detect a truly beneficial treatment. We consider the conditional power approach to allow for recovery of power under unexplained heterogeneity. While Proschan and Hunsberger (1995) confined the application of conditional power design to normally distributed observations, we consider more general and difficult settings in which the data are in the framework of continuous time and are subject to censoring. In particular, we derive a procedure appropriate for the analysis of …
Computational Techniques For Spatial Logistic Regression With Large Datasets, Christopher J. Paciorek, Louise Ryan
Harvard University Biostatistics Working Paper Series
In epidemiological work, outcomes are frequently non-normal, sample sizes may be large, and effects are often small. To relate health outcomes to geographic risk factors, fast and powerful methods for fitting spatial models, particularly for non-normal data, are required. We focus on binary outcomes, with the risk surface a smooth function of space. We compare penalized likelihood models, including the penalized quasi-likelihood (PQL) approach, and Bayesian models based on fit, speed, and ease of implementation.
A Bayesian model using a spectral basis representation of the spatial surface provides the best tradeoff of sensitivity and specificity in simulations, detecting real spatial …
Feature-Specific Penalized Latent Class Analysis For Genomic Data, E. Andres Houseman, Brent A. Coull, Rebecca A. Betensky
Harvard University Biostatistics Working Paper Series
No abstract provided.
A Pseudolikelihood Approach For Simultaneous Analysis Of Array Comparative Genomic Hybridizations (aCGH), David A. Engler, Gayatry Mohapatra, David N. Louis, Rebecca Betensky
Harvard University Biostatistics Working Paper Series
DNA sequence copy number has been shown to be associated with cancer development and progression. Array-based Comparative Genomic Hybridization (aCGH) is a recent development that seeks to identify the copy number ratio at large numbers of markers across the genome. Due to experimental and biological variations across chromosomes and across hybridizations, current methods are limited to analyses of single chromosomes. We propose a more powerful approach that borrows strength across chromosomes and across hybridizations. We assume a Gaussian mixture model, with a hidden Markov dependence structure, and with random effects to allow for intertumoral variation, as well as intratumoral clonal …
A Nonstationary Negative Binomial Time Series With Time-Dependent Covariates: Enterococcus Counts In Boston Harbor, E. Andres Houseman, Brent Coull, James P. Shine
Harvard University Biostatistics Working Paper Series
Boston Harbor has had a history of poor water quality, including contamination by enteric pathogens. We conduct a statistical analysis of data collected by the Massachusetts Water Resources Authority (MWRA) between 1996 and 2002 to evaluate the effects of court-mandated improvements in sewage treatment. Motivated by the ineffectiveness of standard Poisson mixture models and their zero-inflated counterparts, we propose a new negative binomial model for time series of Enterococcus counts in Boston Harbor, where nonstationarity and autocorrelation are modeled using a nonparametric smooth function of time in the predictor. Without further restrictions, this function is not identifiable in the presence …
Semiparametric Estimation In General Repeated Measures Problems, Xihong Lin, Raymond J. Carroll
Harvard University Biostatistics Working Paper Series
This paper considers a wide class of semiparametric problems with a parametric part for some covariate effects and repeated evaluations of a nonparametric function. Special cases in our approach include marginal models for longitudinal/clustered data, conditional logistic regression for matched case-control studies, multivariate measurement error models, generalized linear mixed models with a semiparametric component, and many others. We propose profile-kernel and backfitting estimation methods for these problems, derive their asymptotic distributions, and show that in likelihood problems the methods are semiparametric efficient. While generally not true, with our methods profiling and backfitting are asymptotically equivalent. We also consider pseudolikelihood methods …
Mixture Cure Survival Models With Dependent Censoring, Yi Li, Ram C. Tiwari, Subharup Guha
Harvard University Biostatistics Working Paper Series
A number of authors have studied the mixture survival model to analyze survival data with nonnegligible cure fractions. A key assumption made by these authors is the independence between the survival time and the censoring time. To our knowledge, no one has studied the mixture cure model in the presence of dependent censoring. To account for such dependence, we propose a more general cure model which allows for dependent censoring. In particular, we derive the cure models from the perspective of competing risks and model the dependence between the censoring time and the survival time using a class of Archimedean …
Semiparametric Normal Transformation Models For Spatially Correlated Survival Data, Yi Li, Xihong Lin
Harvard University Biostatistics Working Paper Series
There is an emerging interest in modeling spatially correlated survival data in biomedical and epidemiological studies. In this paper, we propose a new class of semiparametric normal transformation models for right censored spatially correlated survival data. This class of models assumes that survival outcomes marginally follow a Cox proportional hazards model with unspecified baseline hazard, and their joint distribution is obtained by transforming survival outcomes to normal random variables, whose joint distribution is assumed to be multivariate normal with a spatial correlation structure. A key feature of the class of semiparametric normal transformation models is that it provides a rich …
Inference On Survival Data With Covariate Measurement Error - An Imputation-Based Approach, Yi Li, Louise Ryan
Harvard University Biostatistics Working Paper Series
We propose a new method for fitting proportional hazards models with error-prone covariates. Regression coefficients are estimated by solving an estimating equation that is the average of the partial likelihood scores based on imputed true covariates. For the purpose of imputation, a linear spline model is assumed on the baseline hazard. We discuss consistency and asymptotic normality of the resulting estimators, and propose a stochastic approximation scheme to obtain the estimates. The algorithm is easy to implement, and reduces to the ordinary Cox partial likelihood approach when the measurement error has a degenerate distribution. Simulations indicate high efficiency and robustness. …
The Sensitivity And Specificity Of Markers For Event Times, Tianxi Cai, Margaret S. Pepe, Thomas Lumley, Yingye Zheng, Nancy Swords Jenny
Harvard University Biostatistics Working Paper Series
No abstract provided.
Implementation Of Estimating-Function Based Inference Procedures With MCMC Sampler, Lu Tian, Jun S. Liu, L. J. Wei
Harvard University Biostatistics Working Paper Series
No abstract provided.
Robust Inferences For Covariate Effects On Survival Time With Censored Linear Regression Models, Larry Leon, Tianxi Cai, L. J. Wei
Harvard University Biostatistics Working Paper Series
Various inference procedures for linear regression models with censored failure times have been studied extensively. Recent developments on efficient algorithms to implement these procedures enhance the practical usage of such models in survival analysis. In this article, we present robust inferences for certain covariate effects on the failure time in the presence of "nuisance" confounders under a semiparametric, partial linear regression setting. Specifically, the estimation procedures for the regression coefficients of interest are derived from a working linear model and are valid even when the function of the confounders in the model is not correctly specified. The new proposals are …