Open Access. Powered by Scholars. Published by Universities.®
- Discipline
-
- Biostatistics (40)
- Survival Analysis (35)
- Statistical Theory (32)
- Bioinformatics (30)
- Life Sciences (30)
-
- Statistical Models (9)
- Medicine and Health Sciences (8)
- Epidemiology (6)
- Longitudinal Data Analysis and Time Series (6)
- Microarrays (6)
- Public Health (6)
- Categorical Data Analysis (4)
- Multivariate Analysis (4)
- Applied Mathematics (3)
- Applied Statistics (3)
- Business (3)
- Computational Biology (3)
- Genetics and Genomics (3)
- Management Sciences and Quantitative Methods (3)
- Disease Modeling (2)
- Diseases (2)
- Numerical Analysis and Computation (2)
- Design of Experiments and Sample Surveys (1)
- Genetics (1)
- Genomics (1)
- Medical Specialties (1)
- Psychology (1)
- Institution
- Keyword
-
- Regularized Analysis of Large p, Small n Data (5)
- Conference (3)
- Statistical Methodology (3)
- Computational Biology/Bioinformatics (2)
- Statistical Models (2)
-
- Working Papers (2)
- Order Statistics Model. (1)
- measurement error. (1)
- Annual percent change (APC) (1)
- AUC; Cox's proportional hazards model; Framingham risk score; ROC (1)
- Age-adjusted cancer rates (1)
- Area under the receiver operating characteristic curve; C-statistic; Cox's regression; Gaussian process; Integrated discriminiation improvement; Improvement in the area under the curve; Risk prediction (1)
- Asthma; Cumuluative Residuals; Repeated Measured; Spatial Cluster Detection; Wheeze (1)
- Asymptotic linearity; coarsening at random; causal effect; censored data; cross-validation; collaborative double robust; efficient influence curve (1)
- Bayesian analysis (1)
- Bayesian inference; Markov chain; Monte Carlo; Clustering (1)
- Bayesian method (1)
- Bayesian methods; design-based inference; sampling weights; regression; robustness; survey sampling (1)
- Bioinformatics/Cancer Genomics (1)
- Body mass index; Cumulative residuals; Generalized estimating equations; Socioeconomic status; Spatial cluster detection; Weighted linear regression (1)
- Bootstrap null distribution (1)
- Cancer genomics; Integrative analysis; Microarray (1)
- ChIP-Seq (1)
- Child Psychology (1)
- Classical method (1)
- Classification; Diagnosis; Prediction; Prognosis; Risk models (1)
- Cross-validation; HIV-infection; Nonparametric function estimation; Personalized medicine; Subgroup analysis (1)
- Crossed random effects (1)
- Data traffic (1)
- Diffusion tensor imaging; random matrix; likelihood ratio test; manifold-valued data; Satterthwaite approximation; multiple testing (1)
- Publication
-
- Shuangge Ma (27)
- Harvard University Biostatistics Working Paper Series (13)
- U.C. Berkeley Division of Biostatistics Working Paper Series (6)
- Chongzhi Di (4)
- COBRA Preprint Series (3)
-
- Sally Wood (3)
- Johns Hopkins University, Dept. of Biostatistics Working Papers (2)
- UW Biostatistics Working Paper Series (2)
- Department of Statistics: Dissertations, Theses, and Student Work (1)
- Mark R Segal (1)
- Psychology: Faculty Publications and Other Works (1)
- René Schenkendorf (1)
- Sunduz Keles (1)
- The University of Michigan Department of Biostatistics Working Paper Series (1)
- UNLV Theses, Dissertations, Professional Papers, and Capstones (1)
- UPenn Biostatistics Working Papers (1)
- Publication Type
- File Type
Articles 1 - 30 of 68
Full-Text Articles in Statistical Methodology
Fully Exponential Laplace Approximation Em Algorithm For Nonlinear Mixed Effects Models, Meijian Zhou
Fully Exponential Laplace Approximation Em Algorithm For Nonlinear Mixed Effects Models, Meijian Zhou
Department of Statistics: Dissertations, Theses, and Student Work
Nonlinear mixed effects models provide a flexible and powerful platform for the analysis of clustered data that arise in numerous fields, such as pharmacology, biology, agriculture, forestry, and economics. This dissertation focuses on fitting parametric nonlinear mixed effects models with single- and multi-level random effects. A new, efficient, and accurate method that gives an error of order O(1/n2), fully exponential Laplace approximation EM algorithm (FELA-EM), for obtaining restricted maximum likelihood (REML) estimates in nonlinear mixed effects models is developed. Sample codes for implementing FELA-EM algorithm in R are given. Simulation studies have been conducted to evaluate …
Pragmatic Estimation Of A Spatio-Temporal Air Quality Model With Irregular Monitoring Data, Paul D. Sampson, Adam A. Szpiro, Lianne Sheppard, Johan Lindström, Joel D. Kaufman
Pragmatic Estimation Of A Spatio-Temporal Air Quality Model With Irregular Monitoring Data, Paul D. Sampson, Adam A. Szpiro, Lianne Sheppard, Johan Lindström, Joel D. Kaufman
UW Biostatistics Working Paper Series
Statistical analyses of the health effects of air pollution have increasingly used GIS-based covariates for prediction of ambient air quality in “land-use” regression models. More recently these regression models have accounted for spatial correlation structure in combining monitoring data with land-use covariates. The current paper builds on these concepts to address spatio-temporal prediction of ambient concentrations of particulate matter with aerodynamic diameter less than 2.5 μm (PM2.5) on the basis of a model representing spatially varying seasonal trends and spatial correlation structures. Our hierarchical methodology provides a pragmatic approach that fully exploits regulatory and other supplemental monitoring data which jointly …
On The Behaviour Of Marginal And Conditional Akaike Information Criteria In Linear Mixed Models, Sonja Greven, Thomas Kneib
On The Behaviour Of Marginal And Conditional Akaike Information Criteria In Linear Mixed Models, Sonja Greven, Thomas Kneib
Johns Hopkins University, Dept. of Biostatistics Working Papers
In linear mixed models, model selection frequently includes the selection of random effects. Two versions of the Akaike information criterion (AIC) have been used, based either on the marginal or on the conditional distribution. We show that the marginal AIC is no longer an asymptotically unbiased estimator of the Akaike information, and in fact favours smaller models without random effects. For the conditional AIC, we show that ignoring estimation uncertainty in the random effects covariance matrix, as is common practice, induces a bias that leads to the selection of any random effect not predicted to be exactly zero. We derive …
Survival Analysis With Error-Prone Time-Varying Covariates: A Risk Set Calibration Approach, Xiaomei Liao, David M. Zucker, Yi Li, Donna Spiegelman
Survival Analysis With Error-Prone Time-Varying Covariates: A Risk Set Calibration Approach, Xiaomei Liao, David M. Zucker, Yi Li, Donna Spiegelman
Harvard University Biostatistics Working Paper Series
No abstract provided.
A Statistical Framework For The Analysis Of Chip-Seq Data, Pei Fen Kuan, Dongjun Chung, Guangjin Pan, James A. Thomson, Ron Stewart, Sunduz Keles
A Statistical Framework For The Analysis Of Chip-Seq Data, Pei Fen Kuan, Dongjun Chung, Guangjin Pan, James A. Thomson, Ron Stewart, Sunduz Keles
Sunduz Keles
Chromatin immunoprecipitation followed by sequencing (ChIP-Seq) has revolutionalized experiments for genome-wide profiling of DNA-binding proteins, histone modifications, and nucleosome occupancy. As the cost of sequencing is decreasing, many researchers are switching from microarray-based technologies (ChIP-chip) to ChIP-Seq for genome-wide study of transcriptional regulation. Despite its increasing and well-deserved popularity, there is little work that investigates and accounts for sources of biases in the ChIP-Seq technology. These biases typically arise from both the standard pre-processing protocol and the underlying DNA sequence of the generated data.
We study data from a naked DNA sequencing experiment, which sequences non-cross-linked DNA after deproteinizing and …
A New Class Of Minimum Power Divergence Estimators With Applications To Cancer Surveillance, Nirian Martin, Yi Li
A New Class Of Minimum Power Divergence Estimators With Applications To Cancer Surveillance, Nirian Martin, Yi Li
Harvard University Biostatistics Working Paper Series
No abstract provided.
Quasi-Least Squares With Mixed Linear Correlation Structures, Jichun Xie, Justine Shults, Jon Peet, Dwight Stambolian, Mary F. Cotch
Quasi-Least Squares With Mixed Linear Correlation Structures, Jichun Xie, Justine Shults, Jon Peet, Dwight Stambolian, Mary F. Cotch
UPenn Biostatistics Working Papers
Quasi-least squares (QLS) is a two-stage computational approach for estimation of the correlation parameters in the framework of generalized estimating equations (GEE). We prove two general results for the class of mixed linear correlation structures: namely, that the stage one QLS estimate of the correlation parameter always exists and is feasible (yields a positive definite estimated correlation matrix) for any correlation structure, while the stage two estimator exists and is unique (and therefore consistent) with probability one, for the class of mixed linear correlation structures. Our general results justify the implementation of QLS for particular members of the class of …
Readings In Targeted Maximum Likelihood Estimation, Mark J. Van Der Laan, Sherri Rose, Susan Gruber
Readings In Targeted Maximum Likelihood Estimation, Mark J. Van Der Laan, Sherri Rose, Susan Gruber
U.C. Berkeley Division of Biostatistics Working Paper Series
This is a compilation of current and past work on targeted maximum likelihood estimation. It features the original targeted maximum likelihood learning paper as well as chapters on super (machine) learning using cross validation, randomized controlled trials, realistic individualized treatment rules in observational studies, biomarker discovery, case-control studies, and time-to-event outcomes with censored data, among others. We hope this collection is helpful to the interested reader and stimulates additional research in this important area.
Causal Inference For Nested Case-Control Studies Using Targeted Maximum Likelihood Estimation, Sherri Rose, Mark J. Van Der Laan
Causal Inference For Nested Case-Control Studies Using Targeted Maximum Likelihood Estimation, Sherri Rose, Mark J. Van Der Laan
U.C. Berkeley Division of Biostatistics Working Paper Series
A nested case-control study is conducted within a well-defined cohort arising out of a population of interest. This design is often used in epidemiology to reduce the costs associated with collecting data on the full cohort; however, the case control sample within the cohort is a biased sample. Methods for analyzing case-control studies have largely focused on logistic regression models that provide conditional and not marginal causal estimates of the odds ratio. We previously developed a Case-Control Weighted Targeted Maximum Likelihood Estimation (TMLE) procedure for case-control study designs, which relies on the prevalence probability q0. We propose the use of …
Integrative Analysis Of Cancer Genomic Data, Shuangge Ma
Integrative Analysis Of Cancer Genomic Data, Shuangge Ma
Shuangge Ma
In the past decade, we have witnessed a period of unparallel development in the field of cancer genomics. To address the same or similar biomedical questions, multiple cancer genomic studies have been independently designed and conducted. Cancer gene signatures identified from analysis of individual datasets often have low reproducibility. A cost-effective way of improving reproducibility is to conduct integrative analysis of datasets from multiple studies with comparable designs. To properly integrate multiple studies and conduct integrative analysis, we need to access various public data warehouses, retrieve experiment protocols and raw data, evaluate individual studies and select those with comparable designs, …
Targeted Maximum Likelihood Estimation: A Gentle Introduction, Susan Gruber, Mark J. Van Der Laan
Targeted Maximum Likelihood Estimation: A Gentle Introduction, Susan Gruber, Mark J. Van Der Laan
U.C. Berkeley Division of Biostatistics Working Paper Series
This paper provides a concise introduction to targeted maximum likelihood estimation (TMLE) of causal effect parameters. The interested analyst should gain sufficient understanding of TMLE from this introductory tutorial to be able to apply the method in practice. A program written in R is provided. This program implements a basic version of TMLE that can be used to estimate the effect of a binary point treatment on a continuous or binary outcome.
Comparing Risk Scoring Systems Beyond The Roc Paradigm In Survival Analysis, Hajime Uno, Lu Tian, Tianxi Cai, Isaac S. Kohane, L. J. Wei
Comparing Risk Scoring Systems Beyond The Roc Paradigm In Survival Analysis, Hajime Uno, Lu Tian, Tianxi Cai, Isaac S. Kohane, L. J. Wei
Harvard University Biostatistics Working Paper Series
No abstract provided.
Combinational Mixtures Of Multiparameter Distributions, Valeria Edefonti, Giovanni Parmigiani
Combinational Mixtures Of Multiparameter Distributions, Valeria Edefonti, Giovanni Parmigiani
Johns Hopkins University, Dept. of Biostatistics Working Papers
We introduce combinatorial mixtures - a flexible class of models for inference on mixture distributions whose component have multidimensional parameters. The key idea is to allow each element of the component-specific parameter vectors to be shared by a subset of other components. This approach allows for mixtures that range from very flexible to very parsimonious, and unifies inference on component-specific parameters with inference on the number of components. We develop Bayesian inference and computation approaches for this class of distributions, and illustrate them in an application. This work was originally motivated by the analysis of cancer subtypes: in terms of …
Shrinkage Estimation Of Expression Fold Change As An Alternative To Testing Hypotheses Of Equivalent Expression, Zahra Montazeri, Corey M. Yanofsky, David R. Bickel
Shrinkage Estimation Of Expression Fold Change As An Alternative To Testing Hypotheses Of Equivalent Expression, Zahra Montazeri, Corey M. Yanofsky, David R. Bickel
COBRA Preprint Series
Research on analyzing microarray data has focused on the problem of identifying differentially expressed genes to the neglect of the problem of how to integrate evidence that a gene is differentially expressed with information on the extent of its differential expression. Consequently, researchers currently prioritize genes for further study either on the basis of volcano plots or, more commonly, according to simple estimates of the fold change after filtering the genes with an arbitrary statistical significance threshold. While the subjective and informal nature of the former practice precludes quantification of its reliability, the latter practice is equivalent to using a …
Identification Of Cancer-Associated Gene Pathways From Analysis Of Expression Data, Shuangge Ma
Identification Of Cancer-Associated Gene Pathways From Analysis Of Expression Data, Shuangge Ma
Shuangge Ma
No abstract provided.
The Effect Of Correlation In False Discovery Rate Estimation, Armin Schwartzman, Xihong Lin
The Effect Of Correlation In False Discovery Rate Estimation, Armin Schwartzman, Xihong Lin
Harvard University Biostatistics Working Paper Series
No abstract provided.
Lecture 5, Shuangge Ma
Final Project, Shuangge Ma
Lecture 4, Shuangge Ma
Lecture 4, Shuangge Ma
Computer Intensive Methods Lecture 13, Shuangge Ma
Spatial Cluster Detection For Repeatedly Measured Outcomes While Accounting For Residential History, Andrea J. Cook, Diane Gold, Yi Li
Spatial Cluster Detection For Repeatedly Measured Outcomes While Accounting For Residential History, Andrea J. Cook, Diane Gold, Yi Li
Harvard University Biostatistics Working Paper Series
No abstract provided.
Marginalized Frailty Models For Multivariate Survival Data, Megan Othus, Yi Li
Marginalized Frailty Models For Multivariate Survival Data, Megan Othus, Yi Li
Harvard University Biostatistics Working Paper Series
No abstract provided.
Spatial Cluster Detection For Weighted Outcomes Using Cumulative Geographic Residuals, Andrea J. Cook, Yi Li, David Arterburn, Ram C. Tiwari
Spatial Cluster Detection For Weighted Outcomes Using Cumulative Geographic Residuals, Andrea J. Cook, Yi Li, David Arterburn, Ram C. Tiwari
Harvard University Biostatistics Working Paper Series
No abstract provided.
Final Project (Description), Shuangge Ma
Final Project (Data), Shuangge Ma
Lecture 3, Shuangge Ma
Lecture 2, Shuangge Ma
Reference: Multiple Imputation, Shuangge Ma
Reference: Weighted Bootstrap, Shuangge Ma