Open Access. Powered by Scholars. Published by Universities.®

Biostatistics Commons

Open Access. Powered by Scholars. Published by Universities.®

PDF

2013

Selected Works

Discipline
Institution
Keyword
Publication

Articles 1 - 28 of 28

Full-Text Articles in Biostatistics

Causal Mediation In A Survival Setting With Time-Dependent Mediators, Wenjing Zheng, Mark J. Van Der Laan Dec 2013

Causal Mediation In A Survival Setting With Time-Dependent Mediators, Wenjing Zheng, Mark J. Van Der Laan

Wenjing Zheng

The effect of an expsore on an outcome of interest is often mediated by intermediate variables. The goal of causal mediation analysis is to evaluate the role of these intermediate variables (mediators) in the causal effect of the exposure on the outcome. In this paper, we consider causal mediation of a baseline exposure on a survival (or time-to-event) outcome, when the mediator is time-dependent. The challenge in this setting lies in that the event process takes places jointly with the mediator process; in particular, the length of the mediator history depends on the survival time. As a result, we argue …


Hierarchical Vector Auto-Regressive Models And Their Applications To Multi-Subject Effective Connectivity, Cristina Gorrostieta, Mark Fiecas, Hernando Ombao, Erin Burke, Steven Cramer Oct 2013

Hierarchical Vector Auto-Regressive Models And Their Applications To Multi-Subject Effective Connectivity, Cristina Gorrostieta, Mark Fiecas, Hernando Ombao, Erin Burke, Steven Cramer

Mark Fiecas

Vector auto-regressive (VAR) models typically form the basis for constructing directed graphical models for investigating connectivity in a brain network with brain regions of interest (ROIs) as nodes. There are limitations in the standard VAR models. The number of parameters in the VAR model increases quadratically with the number of ROIs and linearly with the order of the model and thus due to the large number of parameters, the model could pose serious estimation problems. Moreover, when applied to imaging data, the standard VAR model does not account for variability in the connectivity structure across all subjects. In this paper, …


Use Of P-Values To Evaluate The Probability Of A Genuine Finding In Large-Scale Genetic Association Studies, Olga A. Vsevolozhskaya, Qing Lu, Chia-Ling Kuo, Dmitri V. Zaykin Oct 2013

Use Of P-Values To Evaluate The Probability Of A Genuine Finding In Large-Scale Genetic Association Studies, Olga A. Vsevolozhskaya, Qing Lu, Chia-Ling Kuo, Dmitri V. Zaykin

Olga A. Vsevolozhskaya

To claim the existence of an association in modern genome-wide association studies (GWAS), a nominal P-value has to exceed a stringent Bonferroni-adjusted significance level. Despite strictness of the correction, a significant P-value does not indicate high probability that the claimed association is genuine. A simple Bayesian solution -- the False Positive Report Probability (FPRP) -- was previously proposed to convert the observed P-value to the corresponding probability of no true association. Although the FPRP solution is highly popular, it does not reflect probability that a particular finding is false. Here, we offer a simple POFIG method -- a Probability that …


Designing The Search Trial: Ph250b In Practice, Laura Balzer Sep 2013

Designing The Search Trial: Ph250b In Practice, Laura Balzer

Laura B. Balzer

No abstract provided.


Estimating Effects On Rare Outcomes: Knowledge Is Power, Laura B. Balzer, Mark J. Van Der Laan May 2013

Estimating Effects On Rare Outcomes: Knowledge Is Power, Laura B. Balzer, Mark J. Van Der Laan

Laura B. Balzer

Many of the secondary outcomes in observational studies and randomized trials are rare. Methods for estimating causal effects and associations with rare outcomes, however, are limited, and this represents a missed opportunity for investigation. In this article, we construct a new targeted minimum loss-based estimator (TMLE) for the effect of an exposure or treatment on a rare outcome. We focus on the causal risk difference and statistical models incorporating bounds on the conditional risk of the outcome, given the exposure and covariates. By construction, the proposed estimator constrains the predicted outcomes to respect this model knowledge. Theoretically, this bounding provides …


A Study Of Mexican Free-Tailed Bat Chirp Syllables: Bayesian Functional Mixed Modeling Of Nonstationary Time Series Data With Time-Dependent Spectra, Josue G. Martinez, Kirsten M. Bohn, Raymond J. Carroll, Jeffrey S. Morris Feb 2013

A Study Of Mexican Free-Tailed Bat Chirp Syllables: Bayesian Functional Mixed Modeling Of Nonstationary Time Series Data With Time-Dependent Spectra, Josue G. Martinez, Kirsten M. Bohn, Raymond J. Carroll, Jeffrey S. Morris

Jeffrey S. Morris

We describe a new approach to analyze chirp syllables of free-tailed bats from two regions of Texas in which they are predominant: Austin and College Station. Our goal is to characterize any systematic regional differences in the mating chirps and assess whether individual bats have signature chirps. The data are analyzed by modeling spectrograms of the chirps as responses in a Bayesian functional mixed model. Given the variable chirp lengths, we compute the spectrograms on a relative time scale interpretable as the relative chirp position, using a variable window overlap based on chirp length. We use 2D wavelet transforms to …


Global Quantitative Assessment Of The Colorectal Polyp Burden In Familial Adenomatous Polyposis Using A Web-Based Tool, Patrick M. Lynch, Jeffrey S. Morris, William A. Ross, Miguel A. Rodriguez-Bigas, Juan Posadas, Rossa Khalaf, Diane M. Weber, Valerie O. Sepeda, Bernard Levin, Imad Shureiqi Jan 2013

Global Quantitative Assessment Of The Colorectal Polyp Burden In Familial Adenomatous Polyposis Using A Web-Based Tool, Patrick M. Lynch, Jeffrey S. Morris, William A. Ross, Miguel A. Rodriguez-Bigas, Juan Posadas, Rossa Khalaf, Diane M. Weber, Valerie O. Sepeda, Bernard Levin, Imad Shureiqi

Jeffrey S. Morris

Background: Accurate measures of the total polyp burden in familial adenomatous polyposis (FAP) are lacking. Current assessment tools include polyp quantitation in limited-field photographs and qualitative total colorectal polyp burden by video.

Objective: To develop global quantitative tools of the FAP colorectal adenoma burden.

Design: A single-arm, phase II trial.

Patients: Twenty-seven patients with FAP.

Intervention: Treatment with celecoxib for 6 months, with before-treatment and after-treatment videos posted to an intranet with an interactive site for scoring.

Main Outcome Measurements: Global adenoma counts and sizes (grouped into categories: less than 2 mm, 2-4 mm, and greater than 4 mm) were …


Interactions Between Serotypes Of Dengue Highlight Epidemiological Impact Of Cross-Immunity, Nicholas Reich, Sourya Shrestha, Aaron King, Pejman Rohani, Justin Lessler, Siripen Kalayanarooj, In-Kyu Yoon, Robert Gibbons, Donald Burke, Derek Cummings Jan 2013

Interactions Between Serotypes Of Dengue Highlight Epidemiological Impact Of Cross-Immunity, Nicholas Reich, Sourya Shrestha, Aaron King, Pejman Rohani, Justin Lessler, Siripen Kalayanarooj, In-Kyu Yoon, Robert Gibbons, Donald Burke, Derek Cummings

Nicholas G Reich

Dengue, a mosquito-borne virus of humans, infects over 50 million people annually. Infection with any of the four dengue serotypes induces protective immunity to that serotype, but does not confer long-term protection against infection by other serotypes. The immunological interactions between sero- types are of central importance in understanding epidemiological dynamics and anticipating the impact of dengue vaccines. We analysed a 38-year time series with 12 197 serotyped dengue infections from a hospital in Bangkok, Thailand. Using novel mechanistic models to represent different hypothesized immune interactions between serotypes, we found strong evidence that infec- tion with dengue provides substantial short-term …


Dose-Response And Finding In Phase Ii Clinical Studies — Mcp-Mod Methodologies, Zhao Yang Jan 2013

Dose-Response And Finding In Phase Ii Clinical Studies — Mcp-Mod Methodologies, Zhao Yang

Zhao (Tony) Yang, Ph.D.

This presentation give an overall introduction to the MCP-Mod methodology with detailed step-by-step demonstration.


Phase I Design For Multiple Treatment Schedules, Nolan A. Wages Jan 2013

Phase I Design For Multiple Treatment Schedules, Nolan A. Wages

Nolan A. Wages

No abstract provided.


Early-Phase Dose-Finding Design For Oncology Trials Of Molecularly Targeted Agents, Nolan A. Wages Jan 2013

Early-Phase Dose-Finding Design For Oncology Trials Of Molecularly Targeted Agents, Nolan A. Wages

Nolan A. Wages

No abstract provided.


Bayesian Inferences For Beta Semiparametric Mixed Models To Analyze Longitudinal Neuroimaging Data, Xiaofeng Wang, Yingxing Li Jan 2013

Bayesian Inferences For Beta Semiparametric Mixed Models To Analyze Longitudinal Neuroimaging Data, Xiaofeng Wang, Yingxing Li

Xiaofeng Wang

Diffusion tensor imaging (DTI) is a quantitative magnetic resonance imaging technique that measures the three-dimensional diffusion of water molecules within tissue through the application of multiple diffusion gradients. This technique is rapidly increasing in popularity for studying white matter properties and structural connectivity in the living human brain. The major measure derived from the DTI process is known as fractional anisotropy, a continuous measure restricted on the interval (0,1). Motivated from a DTI study of multiple sclerosis, we use a beta semiparametric mixed-effect regression model for the longitudinal neuroimaging data. This work extends the generalized additive model methodology with beta …


Bayesian Nonparametric Regression And Density Estimation Using Integrated Nested Laplace Approximations, Xiaofeng Wang Jan 2013

Bayesian Nonparametric Regression And Density Estimation Using Integrated Nested Laplace Approximations, Xiaofeng Wang

Xiaofeng Wang

Integrated nested Laplace approximations (INLA) are a recently proposed approximate Bayesian approach to fit structured additive regression models with latent Gaussian field. INLA method, as an alternative to Markov chain Monte Carlo techniques, provides accurate approximations to estimate posterior marginals and avoid time-consuming sampling. We show here that two classical nonparametric smoothing problems, nonparametric regression and density estimation, can be achieved using INLA. Simulated examples and \texttt{R} functions are demonstrated to illustrate the use of the methods. Some discussions on potential applications of INLA are made in the paper.


Sberia: Set Based Gene Environment Interaction Test For Rare And Common Variants In Complex Diseases, Shuo Jiao, Li Hsu, Stéphane Bézieau, Hermann Brenner, Andrew T. Chan, Jenny Chang-Claude, Loic Le Marchand, Mathieu Lemire, Polly A. Newcomb, Martha L. Slattery, Ulrike Peters Jan 2013

Sberia: Set Based Gene Environment Interaction Test For Rare And Common Variants In Complex Diseases, Shuo Jiao, Li Hsu, Stéphane Bézieau, Hermann Brenner, Andrew T. Chan, Jenny Chang-Claude, Loic Le Marchand, Mathieu Lemire, Polly A. Newcomb, Martha L. Slattery, Ulrike Peters

Shuo Jiao

Identification of gene-environment interaction (GxE) is important in understanding the etiology of complex diseases. However, partially due to the lack of power, there have been very few replicated GxE findings compared to the success in marginal association studies. The existing GxE testing methods mainly focus on improving the power for individual markers. In this paper, we took a different strategy and proposed a Set Based gene EnviRonment InterAction test (SBERIA), which can improve the power by reducing the multiple testing burdens and aggregating signals within a set. The major challenge of the signal aggregation within a set is how to …


Mixtures Of Receiver Operating Characteristic Curves, Mithat Gonen Jan 2013

Mixtures Of Receiver Operating Characteristic Curves, Mithat Gonen

Mithat Gönen

Rationale and Objectives: ROC curves are ubiquitous in the analysis of imaging metrics as markers of both diagnosis and prognosis. While empirical estimation of ROC curves remains the most popular method, there are several reasons to consider smooth estimates based on a parametric model.

Materials and Methods: A mixture model is considered for modeling the distribution of the marker in the diseased population motivated by the biological observation that here is more heterogeneity in the diseased population than there is in the normal one. It is shown that this model results in an analytically tractable ROC curve which is itself …


Penalized Regression Procedures For Variable Selection In The Potential Outcomes Framework, Debashis Ghosh, Yeying Zhu, Donna L. Coffman Jan 2013

Penalized Regression Procedures For Variable Selection In The Potential Outcomes Framework, Debashis Ghosh, Yeying Zhu, Donna L. Coffman

Debashis Ghosh

A recent topic of much interest in causal inference is model selection. In this article, we describe a framework in which to consider penalized regression approaches to variable selection for causal effects. The framework leads to a simple `impute, then select' class of procedures that is agnostic to the type of imputation algorithm as well as penalized regression used. It also clarifies how model selection involves a multivariate regression model, and that these methods can be applied for identifying subgroups in which treatment effects are homogeneous. Analogies and links with the literature on machine learning methods, missing data and imputation …


Using Methods From The Data-Mining And Machine-Learning Literature For Disease Classification And Prediction: A Case Study Examining Classification Of Heart Failure Subtypes, Peter C. Austin Jan 2013

Using Methods From The Data-Mining And Machine-Learning Literature For Disease Classification And Prediction: A Case Study Examining Classification Of Heart Failure Subtypes, Peter C. Austin

Peter Austin

OBJECTIVE: Physicians classify patients into those with or without a specific disease. Furthermore, there is often interest in classifying patients according to disease etiology or subtype. Classification trees are frequently used to classify patients according to the presence or absence of a disease. However, classification trees can suffer from limited accuracy. In the data-mining and machine-learning literature, alternate classification schemes have been developed. These include bootstrap aggregation (bagging), boosting, random forests, and support vector machines.

STUDY DESIGN AND SETTING: We compared the performance of these classification methods with that of conventional classification trees to classify patients with heart failure (HF) …


Predictive Accuracy Of Risk Factors And Markers: A Simulation Study Of The Effect Of Novel Markers On Different Performance Measures For Logistic Regression Models, Peter C. Austin Jan 2013

Predictive Accuracy Of Risk Factors And Markers: A Simulation Study Of The Effect Of Novel Markers On Different Performance Measures For Logistic Regression Models, Peter C. Austin

Peter Austin

The change in c-statistic is frequently used to summarize the change in predictive accuracy when a novel risk factor is added to an existing logistic regression model. We explored the relationship between the absolute change in the c-statistic, Brier score, generalized R(2) , and the discrimination slope when a risk factor was added to an existing model in an extensive set of Monte Carlo simulations. The increase in model accuracy due to the inclusion of a novel marker was proportional to both the prevalence of the marker and to the odds ratio relating the marker to the outcome but inversely …


On The Exact Size Of Multiple Comparison Tests, Chris Lloyd Dec 2012

On The Exact Size Of Multiple Comparison Tests, Chris Lloyd

Chris J. Lloyd

No abstract provided.


On The Size Accuracy Of Combination Tests, Chris Lloyd Dec 2012

On The Size Accuracy Of Combination Tests, Chris Lloyd

Chris J. Lloyd

One element of the analysis of adaptive clinical trials is combining the evidence from several (often two) stages. When the endpoint is binary, standard single stage tests statistics do not control size well. Yet the combined test might not be valid if the single stage tests are not. The purpose of this paper is to numerically and theoretically examine the extent to which combining basic tests statistics mitigates or magnifies the size violation of the final test.


A Targeted Confounder Selection Strategy For Propensity Score Estimation, Susan Gruber Dec 2012

A Targeted Confounder Selection Strategy For Propensity Score Estimation, Susan Gruber

Susan Gruber

These slides provide an introduction to data-adaptive propensity score estimation, and the collaborative targeted maximum likelihood estimator (C-TMLE) of van der Laan and Gruber. The notation has been greatly simplified, which makes the work accessible to a more general audience, but loses a little in the translation.


An Overview Of Targeted Maximum Likelihood Estimation, Susan Gruber Dec 2012

An Overview Of Targeted Maximum Likelihood Estimation, Susan Gruber

Susan Gruber

These slides provide an introduction to targeted maximum likelihood estimation in a point treatment setting.


Bayesian Methods For Expression-Based Integration, Elizabeth M. Jennings, Jeffrey S. Morris, Raymond J. Carroll, Ganiraju C. Manyam, Veera Baladandayuthapani Dec 2012

Bayesian Methods For Expression-Based Integration, Elizabeth M. Jennings, Jeffrey S. Morris, Raymond J. Carroll, Ganiraju C. Manyam, Veera Baladandayuthapani

Jeffrey S. Morris

We propose methods to integrate data across several genomic platforms using a hierarchical Bayesian analysis framework that incorporates the biological relationships among the platforms to identify genes whose expression is related to clinical outcomes in cancer. This integrated approach combines information across all platforms, leading to increased statistical power in finding these predictive genes, and further provides mechanistic information about the manner in which the gene affects the outcome. We demonstrate the advantages of the shrinkage estimation used by this approach through a simulation, and finally, we apply our method to a Glioblastoma Multiforme dataset and identify several genes potentially …


A Case-Control Study Of Physical Activity Patterns And Risk Of Non-Fatal Myocardial Infarction, Jian Gong, Hannia Campos, Mark Fiecas, Stephen Mcgarvey, Robert Goldberg, Caroline Richardson, Ana Baylin Dec 2012

A Case-Control Study Of Physical Activity Patterns And Risk Of Non-Fatal Myocardial Infarction, Jian Gong, Hannia Campos, Mark Fiecas, Stephen Mcgarvey, Robert Goldberg, Caroline Richardson, Ana Baylin

Mark Fiecas

Background The interactive effects of different types of physical activity on cardiovascular disease (CVD) risk have not been fully considered in previous studies. We aimed to identify physical activity patterns that take into account combinations of physical activities and examine the association between derived physical activity patterns and risk of acute myocardial infarction (AMI). Methods We examined the relationship between physical activity patterns, identified by principal component analysis (PCA), and AMI risk in a case-control study of myocardial infarction in Costa Rica (N=4172), 1994-2004. The component scores derived from PCA and total METS were used in natural cubic spline models …


Varying-Smoother Models For Functional Responses, Philip T. Reiss, Lei Huang, Huaihou Chen, Stan Colcombe Dec 2012

Varying-Smoother Models For Functional Responses, Philip T. Reiss, Lei Huang, Huaihou Chen, Stan Colcombe

Philip T. Reiss

This paper studies estimation of a smooth function f(x,v) when we are given functional responses of the form f(x, ·) + error, but scientific interest centers on the collection of functions f(·,v) for different v. The motivation comes from studies of human brain development, in which x denotes age whereas v refers to brain locations. Analogously to varying-coefficient models, in which the mean response is linear in x, the “varying-smoother” models that we consider exhibit nonlinear dependence on x that varies smoothly with v. We discuss three approaches to estimating varying-smoother models: (a) methods that employ a tensor product penalty; …


Progression From New Methicillin-Resistant Staphylococcus Aureus Colonisation To Infection: An Observational Study In A Hospital Cohort, Michelle Nd Balm, Andrew A. Lover, Sharon Salmon, Paul A. Tambyah, Dale A. Fisher Dec 2012

Progression From New Methicillin-Resistant Staphylococcus Aureus Colonisation To Infection: An Observational Study In A Hospital Cohort, Michelle Nd Balm, Andrew A. Lover, Sharon Salmon, Paul A. Tambyah, Dale A. Fisher

Andrew Lover

Background
Patients newly colonised with methicillin-resistant Staphylococcus aureus (MRSA) are at higher risk of clinical MRSA infection. At present, there are limited data on the duration or magnitude of this risk in a hospital population with a known time of MRSA acquisition.
Methods
A retrospective cohort study of 909 adult patients known to have newly identified MRSA colonisation during admission to National University Hospital, Singapore between 1 July 2007 and 30 June 2011 was undertaken. Patients were excluded if they had history of previous MRSA colonisation or infection, or if they had been a hospital inpatient in the preceding 12 …


Quantifying Effect Of Geographic Location On Epidemiology Of Plasmodium Vivax Malaria, Andrew A. Lover, Richard J. Coker Dec 2012

Quantifying Effect Of Geographic Location On Epidemiology Of Plasmodium Vivax Malaria, Andrew A. Lover, Richard J. Coker

Andrew Lover

Recent autochthonous transmission of Plasmodium vivax malaria in previously malaria-free temperate regions has generated renewed interest in the epidemiology of this disease. Accurate estimates of the incubation period and time to relapse are required for effective malaria surveillance; however, this information is currently lacking. By using historical data from experimental human infections with diverse P. vivax strains, survival analysis models were used to obtain quantitative estimates of the incubation period and time to first relapse for P. vivax malaria in broad geographic regions. Results show that Eurasian strains from temperate regions have longer incubation periods, and Western Hemisphere strains from …


Methods For Evaluating Prediction Performance Of Biomarkers And Tests, Margaret S. Pepe Phd, Holly Janes Phd Dec 2012

Methods For Evaluating Prediction Performance Of Biomarkers And Tests, Margaret S. Pepe Phd, Holly Janes Phd

Margaret S Pepe PhD

This chapter describes and critiques methods for evaluating the performance of markers to predict risk of a current or future clinical outcome. We consider three criteria that are important for evaluating a risk model: calibration, benefit for decision making and accurate classification. We also describe and discuss a variety of summary measures in common use for quantifying predictive information such as the area under the ROC curve and R-squared. The roles and problems with recently proposed risk reclassification approaches are discussed in detail.