Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 19 of 19

Full-Text Articles in Statistics and Probability

Online Variational Bayes Inference For High-Dimensional Correlated Data, Sylvie T. Kabisa, Jeffrey S. Morris, David Dunson Jan 2016

Online Variational Bayes Inference For High-Dimensional Correlated Data, Sylvie T. Kabisa, Jeffrey S. Morris, David Dunson

Jeffrey S. Morris

High-dimensional data with hundreds of thousands of observations are becoming commonplace in many disciplines. The analysis of such data poses many computational challenges, especially when the observations are correlated over time and/or across space. In this paper we propose exible hierarchical regression models for analyzing such data that accommodate serial and/or spatial correlation. We address the computational challenges involved in fitting these models by adopting an approximate inference framework. We develop an online variational Bayes algorithm that works by incrementally reading the data into memory one portion at a time. The performance of the method is assessed through simulation studies. …


Functional Car Models For Spatially Correlated Functional Datasets, Lin Zhang, Veerabhadran Baladandayuthapani, Hongxiao Zhu, Keith A. Baggerly, Tadeusz Majewski, Bogdan Czerniak, Jeffrey S. Morris Jan 2016

Functional Car Models For Spatially Correlated Functional Datasets, Lin Zhang, Veerabhadran Baladandayuthapani, Hongxiao Zhu, Keith A. Baggerly, Tadeusz Majewski, Bogdan Czerniak, Jeffrey S. Morris

Jeffrey S. Morris

We develop a functional conditional autoregressive (CAR) model for spatially correlated data for which functions are collected on areal units of a lattice. Our model performs functional response regression while accounting for spatial correlations with potentially nonseparable and nonstationary covariance structure, in both the space and functional domains. We show theoretically that our construction leads to a CAR model at each functional location, with spatial covariance parameters varying and borrowing strength across the functional domain. Using basis transformation strategies, the nonseparable spatial-functional model is computationally scalable to enormous functional datasets, generalizable to different basis functions, and can be used on …


Bayesian Function-On-Function Regression For Multi-Level Functional Data, Mark J. Meyer, Brent A. Coull, Francesco Versace, Paul Cinciripini, Jeffrey S. Morris Jan 2015

Bayesian Function-On-Function Regression For Multi-Level Functional Data, Mark J. Meyer, Brent A. Coull, Francesco Versace, Paul Cinciripini, Jeffrey S. Morris

Jeffrey S. Morris

Medical and public health research increasingly involves the collection of more and more complex and high dimensional data. In particular, functional data|where the unit of observation is a curve or set of curves that are finely sampled over a grid -- is frequently obtained. Moreover, researchers often sample multiple curves per person resulting in repeated functional measures. A common question is how to analyze the relationship between two functional variables. We propose a general function-on-function regression model for repeatedly sampled functional data, presenting a simple model as well as a more extensive mixed model framework, along with multiple functional posterior …


Functional Regression, Jeffrey S. Morris Jan 2015

Functional Regression, Jeffrey S. Morris

Jeffrey S. Morris

Functional data analysis (FDA) involves the analysis of data whose ideal units of observation are functions defined on some continuous domain, and the observed data consist of a sample of functions taken from some population, sampled on a discrete grid. Ramsay and Silverman's 1997 textbook sparked the development of this field, which has accelerated in the past 10 years to become one of the fastest growing areas of statistics, fueled by the growing number of applications yielding this type of data. One unique characteristic of FDA is the need to combine information both across and within functions, which Ramsay and …


Ordinal Probit Wavelet-Based Functional Models For Eqtl Analysis, Mark J. Meyer, Jeffrey S. Morris, Craig P. Hersh, Jarret D. Morrow, Christoph Lange, Brent A. Coull Jan 2015

Ordinal Probit Wavelet-Based Functional Models For Eqtl Analysis, Mark J. Meyer, Jeffrey S. Morris, Craig P. Hersh, Jarret D. Morrow, Christoph Lange, Brent A. Coull

Jeffrey S. Morris

Current methods for conducting expression Quantitative Trait Loci (eQTL) analysis are limited in scope to a pairwise association testing between a single nucleotide polymorphism (SNPs) and expression probe set in a region around a gene of interest, thus ignoring the inherent between-SNP correlation. To determine association, p-values are then typically adjusted using Plug-in False Discovery Rate. As many SNPs are interrogated in the region and multiple probe-sets taken, the current approach requires the fitting of a large number of models. We propose to remedy this by introducing a flexible function-on-scalar regression that models the genome as a functional outcome. The …


A Study Of Mexican Free-Tailed Bat Chirp Syllables: Bayesian Functional Mixed Modeling Of Nonstationary Time Series Data With Time-Dependent Spectra, Josue G. Martinez, Kirsten M. Bohn, Raymond J. Carroll, Jeffrey S. Morris Feb 2013

A Study Of Mexican Free-Tailed Bat Chirp Syllables: Bayesian Functional Mixed Modeling Of Nonstationary Time Series Data With Time-Dependent Spectra, Josue G. Martinez, Kirsten M. Bohn, Raymond J. Carroll, Jeffrey S. Morris

Jeffrey S. Morris

We describe a new approach to analyze chirp syllables of free-tailed bats from two regions of Texas in which they are predominant: Austin and College Station. Our goal is to characterize any systematic regional differences in the mating chirps and assess whether individual bats have signature chirps. The data are analyzed by modeling spectrograms of the chirps as responses in a Bayesian functional mixed model. Given the variable chirp lengths, we compute the spectrograms on a relative time scale interpretable as the relative chirp position, using a variable window overlap based on chirp length. We use 2D wavelet transforms to …


Statistical Methods For Proteomic Biomarker Discovery Based On Feature Extraction Or Functional Modeling Approaches, Jeffrey S. Morris Jan 2012

Statistical Methods For Proteomic Biomarker Discovery Based On Feature Extraction Or Functional Modeling Approaches, Jeffrey S. Morris

Jeffrey S. Morris

In recent years, developments in molecular biotechnology have led to the increased promise of detecting and validating biomarkers, or molecular markers that relate to various biological or medical outcomes. Proteomics, the direct study of proteins in biological samples, plays an important role in the biomarker discovery process. These technologies produce complex, high dimensional functional and image data that present many analytical challenges that must be addressed properly for effective comparative proteomics studies that can yield potential biomarkers. Specific challenges include experimental design, preprocessing, feature extraction, and statistical analysis accounting for the inherent multiple testing issues. This paper reviews various computational …


Wavelet-Based Functional Linear Mixed Models: An Application To Measurement Error–Corrected Distributed Lag Models, Elizabeth J. Malloy, Jeffrey S. Morris, Sara D. Adar, Helen Suh, Diane R. Gold, Brent A. Coull Jan 2010

Wavelet-Based Functional Linear Mixed Models: An Application To Measurement Error–Corrected Distributed Lag Models, Elizabeth J. Malloy, Jeffrey S. Morris, Sara D. Adar, Helen Suh, Diane R. Gold, Brent A. Coull

Jeffrey S. Morris

Frequently, exposure data are measured over time on a grid of discrete values that collectively define a functional observation. In many applications, researchers are interested in using these measurements as covariates to predict a scalar response in a regression setting, with interest focusing on the most biologically relevant time window of exposure. One example is in panel studies of the health effects of particulate matter (PM), where particle levels are measured over time. In such studies, there are many more values of the functional data than observations in the data set so that regularization of the corresponding functional regression coefficient …


Bayesian Random Segmentationmodels To Identify Shared Copy Number Aberrations For Array Cgh Data, Veerabhadran Baladandayuthapani, Yuan Ji, Rajesh Talluri, Luis E. Nieto-Barajas, Jeffrey S. Morris Jan 2010

Bayesian Random Segmentationmodels To Identify Shared Copy Number Aberrations For Array Cgh Data, Veerabhadran Baladandayuthapani, Yuan Ji, Rajesh Talluri, Luis E. Nieto-Barajas, Jeffrey S. Morris

Jeffrey S. Morris

Array-based comparative genomic hybridization (aCGH) is a high-resolution high-throughput technique for studying the genetic basis of cancer. The resulting data consists of log fluorescence ratios as a function of the genomic DNA location and provides a cytogenetic representation of the relative DNA copy number variation. Analysis of such data typically involves estimation of the underlying copy number state at each location and segmenting regions of DNA with similar copy number states. Most current methods proceed by modeling a single sample/array at a time, and thus fail to borrow strength across multiple samples to infer shared regions of copy number aberrations. …


Statistical Issues In Proteomic Research, Jeffrey S. Morris Dec 2007

Statistical Issues In Proteomic Research, Jeffrey S. Morris

Jeffrey S. Morris

No abstract provided.


Wavelet-Based Functional Mixed Models To Characterize Population Heterogeneity In Accelerometer Profiles: A Case Study. , Jeffrey S. Morris, Cassandra Arroyo, Brent A. Coull, Louise M. Ryan, Steven L. Gortmaker Dec 2006

Wavelet-Based Functional Mixed Models To Characterize Population Heterogeneity In Accelerometer Profiles: A Case Study. , Jeffrey S. Morris, Cassandra Arroyo, Brent A. Coull, Louise M. Ryan, Steven L. Gortmaker

Jeffrey S. Morris

We present a case study illustrating the challenges of analyzing accelerometer data taken from a sample of children participating in an intervention study designed to increase physical activity. An accelerometer is a small device worn on the hip that records the minute-by-minute activity levels of the child throughout the day for each day it is worn. The resulting data are irregular functions characterized by many peaks representing short bursts of intense activity. We model these data using the wavelet-based functional mixed model. This approach incorporates multiple fixed effects and random effect functions of arbitrary form, the estimates of which are …


Wavelet-Based Functional Mixed Model Analysis: Computational Considerations, Richard C. Herrick, Jeffrey S. Morris Aug 2006

Wavelet-Based Functional Mixed Model Analysis: Computational Considerations, Richard C. Herrick, Jeffrey S. Morris

Jeffrey S. Morris

Wavelet-based Functional Mixed Models is a new Bayesian method extending mixed models to irregular functional data (Morris and Carroll, JRSS-B, 2006). These data sets are typically very large and can quickly run into memory and time constraints unless these issues are carefully dealt with in the software. We reduce runtime by 1.) identifying and optimizing hotspots, 2.) using wavelet compression to do less computation with minimal impact on results, and 3.) dividing the code into multiple executables to be run in parallel using a grid computing resource. We discuss rules of thumb for estimating memory requirements and computation times in …


Wavelet-Based Functional Mixed Models, Jeffrey S. Morris, Raymond J. Carroll Apr 2006

Wavelet-Based Functional Mixed Models, Jeffrey S. Morris, Raymond J. Carroll

Jeffrey S. Morris

Increasingly, Increasingly, scientific studies yield functional data, in which the ideal units of observation are curves and the observed data consist of sets of curves that are sampled on a fine grid. We present new methodology that generalizes the linear mixed model to the functional mixed model framework, with model fitting done by using a Bayesian wavelet-based approach. This method is flexible, allowing functions of arbitrary formand the full range of fixed effects structures and between-curve covariance structures that are available in the mixed model framework. It yields nonparametric estimates of the fixed and random-effects functions as well as the …


Analysis Of Mass Spectrometry Data Using Bayesian Wavelet-Based Functional Mixed Models, Jeffrey S. Morris, Philip J. Brown, Keith A. Baggerly, Kevin R. Coombes Mar 2006

Analysis Of Mass Spectrometry Data Using Bayesian Wavelet-Based Functional Mixed Models, Jeffrey S. Morris, Philip J. Brown, Keith A. Baggerly, Kevin R. Coombes

Jeffrey S. Morris

In this chapter, we demonstrate how to analyze MALDI-TOF/SELDITOF mass spectrometry data using the wavelet-based functional mixed model introduced by Morris and Carroll (2006), which generalizes the linear mixed models to the case of functional data. This approach models each spectrum as a function, and is very general, accommodating a broad class of experimental designs and allowing one to model nonparametric functional effects for various factors, which can be conditions of interest (e.g. cancer/normal) or experimental factors (blocking factors). Inference on these functional effects allows us to identify protein peaks related to various outcomes of interest, including dichotomous outcomes, categorical …


Improved Peak Detection And Quantification Of Mass Spectrometry Data Acquired From Surface-Enhanced Laser Desorption And Ionization By Denoising Spectra With The Undecimated Discrete Wavelet Transform, Kevin R. Coombes, Spiros Tsavachidis, Jeffrey S. Morris, Keith A. Baggerly, Henry M. Kuerer Dec 2005

Improved Peak Detection And Quantification Of Mass Spectrometry Data Acquired From Surface-Enhanced Laser Desorption And Ionization By Denoising Spectra With The Undecimated Discrete Wavelet Transform, Kevin R. Coombes, Spiros Tsavachidis, Jeffrey S. Morris, Keith A. Baggerly, Henry M. Kuerer

Jeffrey S. Morris

Background: Mass spectrometry, especially surface enhanced laser desorption and ionization (SELDI) is increasingly being used to find disease-related proteomic patterns in complex mixtures of proteins derived from tissue samples or from easily obtained biological fluids such as serum, urine, or nipple aspirate fluid. Questions have been raised about the reproducibility and reliability of peak quantifications using this technology. For example, Yasui and colleagues opted to replace continuous measures of the size of a peak by a simple binary indicator of its presence or absence in their analysis of a set of spectra from prostate cancer patients.

Methods: We collected nipple …


Rejoinder To "“Wavelet-Based Nonparametric Modeling Of Hierarchical Functions In Colon Carcinogenesis.”, Jeffrey S. Morris, Marina Vannucci, Philip J. Brown, Raymond J. Carroll Oct 2003

Rejoinder To "“Wavelet-Based Nonparametric Modeling Of Hierarchical Functions In Colon Carcinogenesis.”, Jeffrey S. Morris, Marina Vannucci, Philip J. Brown, Raymond J. Carroll

Jeffrey S. Morris

No abstract provided.


Wavelet-Based Nonparametric Modeling Of Hierarchical Functions In Colon Carcinogenesis., Jeffrey S. Morris, Marina Vannucci, Philip J. Brown, Raymond J. Carroll Sep 2003

Wavelet-Based Nonparametric Modeling Of Hierarchical Functions In Colon Carcinogenesis., Jeffrey S. Morris, Marina Vannucci, Philip J. Brown, Raymond J. Carroll

Jeffrey S. Morris

In this article we develop new methods for analyzing the data from an experiment using rodent models to investigate the effect of type of dietary fat on O6-methylguanine-DNA-methyltransferase (MGMT), an important biomarker in early colon carcinogenesis. The data consist of observed profiles over a spatial variable contained within a two-stage hierarchy, a structure that we dub hierarchical functional data. We present a new method providing a unified framework for modeling these data, simultaneously yielding estimates and posterior samples for mean, individual, and subsample-level profiles, as well as covariance parameters at the various hierarchical levels. Our method is nonparametric in that …


A Bayesian Analysis Involving Colonic Crypt Structure And Coordinated Response To Carcinogens Incorporating Missing Crypts, Jeffrey S. Morris, Naisyin Wang, Joanne R. Lupton, Robert S. Chapkin, Nancy D. Turner, Mee-Young Hong, Raymond J. Carroll Sep 2002

A Bayesian Analysis Involving Colonic Crypt Structure And Coordinated Response To Carcinogens Incorporating Missing Crypts, Jeffrey S. Morris, Naisyin Wang, Joanne R. Lupton, Robert S. Chapkin, Nancy D. Turner, Mee-Young Hong, Raymond J. Carroll

Jeffrey S. Morris

This paper is concerned with modeling the architecture of colonic crypts and the implications of this modeling for understanding possible coordinated response of carcinogen–induced DNA damage between various regions of the colon. The methods we develop to address these two issues are applied to a particular important example in colon carcinogenesis. We cast the problem as an unusual and not previously studied hierarchical mixed-effects model characterized by completely missing covariates in units at a structurally base level, except for some randomly selected units. Information concerning the missing covariates is available through certain known ordering constraints and surrogate measures. Our methods …


Parametric And Nonparametric Methods For Understanding The Relationship Between Carcinogen-Induced Dna Adduct Levels In Distal And Proximal Regions Of The Colon., Jeffrey S. Morris, Naisyin Wang, Joanne R. Lupton, Robert S. Chapkin, Nancy D. Turner, Mee-Young Hong, Raymond J. Carroll Sep 2001

Parametric And Nonparametric Methods For Understanding The Relationship Between Carcinogen-Induced Dna Adduct Levels In Distal And Proximal Regions Of The Colon., Jeffrey S. Morris, Naisyin Wang, Joanne R. Lupton, Robert S. Chapkin, Nancy D. Turner, Mee-Young Hong, Raymond J. Carroll

Jeffrey S. Morris

An important problem in studying the etiology of colon cancer is understanding the relationship between DNA adduct levels (broadly, DNA damage) in cells within colonic crypts in distal and proximal parts of the colon, following treatment with a carcinogen and different types of diet. In particular, it is important to understand whether rats who have elevated adduct levels in particular positions in distal region crypts also have elevated levels in the same positions of the crypts in proximal regions, and whether this relationship depends on diet. We cast this problem as estimating the correlation function of two responses as a …