Open Access. Powered by Scholars. Published by Universities.®

Medicine and Health Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Statistics and Probability

PDF

2005

Institution
Keyword
Publication
Publication Type

Articles 1 - 30 of 36

Full-Text Articles in Medicine and Health Sciences

Model Checking For Roc Regression Analysis, Tianxi Cai, Yingye Zheng Dec 2005

Model Checking For Roc Regression Analysis, Tianxi Cai, Yingye Zheng

Harvard University Biostatistics Working Paper Series

The Receiver Operating Characteristic (ROC) curve is a prominent tool for characterizing the accuracy of continuous diagnostic test. To account for factors that might invluence the test accuracy, various ROC regression methods have been proposed. However, as in any regression analysis, when the assumed models do not fit the data well, these methods may render invalid and misleading results. To date practical model checking techniques suitable for validating existing ROC regression models are not yet available. In this paper, we develop cumulative residual based procedures to graphically and numerically assess the goodness-of-fit for some commonly used ROC regression models, and …


Improved Peak Detection And Quantification Of Mass Spectrometry Data Acquired From Surface-Enhanced Laser Desorption And Ionization By Denoising Spectra With The Undecimated Discrete Wavelet Transform, Kevin R. Coombes, Spiros Tsavachidis, Jeffrey S. Morris, Keith A. Baggerly, Henry M. Kuerer Dec 2005

Improved Peak Detection And Quantification Of Mass Spectrometry Data Acquired From Surface-Enhanced Laser Desorption And Ionization By Denoising Spectra With The Undecimated Discrete Wavelet Transform, Kevin R. Coombes, Spiros Tsavachidis, Jeffrey S. Morris, Keith A. Baggerly, Henry M. Kuerer

Jeffrey S. Morris

Background: Mass spectrometry, especially surface enhanced laser desorption and ionization (SELDI) is increasingly being used to find disease-related proteomic patterns in complex mixtures of proteins derived from tissue samples or from easily obtained biological fluids such as serum, urine, or nipple aspirate fluid. Questions have been raised about the reproducibility and reliability of peak quantifications using this technology. For example, Yasui and colleagues opted to replace continuous measures of the size of a peak by a simple binary indicator of its presence or absence in their analysis of a set of spectra from prostate cancer patients.

Methods: We collected nipple …


Pooling Information Across Different Studies And Oligonucleotide Microarray Chip Types To Identify Prognostic Genes For Lung Cancer., Jeffrey S. Morris, Guosheng Yin, Keith A. Baggerly, Chunlei Wu, Li Zhang Dec 2005

Pooling Information Across Different Studies And Oligonucleotide Microarray Chip Types To Identify Prognostic Genes For Lung Cancer., Jeffrey S. Morris, Guosheng Yin, Keith A. Baggerly, Chunlei Wu, Li Zhang

Jeffrey S. Morris

Our goal in this work is to pool information across microarray studies conducted at different institutions using two different versions of Affymetrix chips to identify genes whose expression levels offer information on lung cancer patients’ survival above and beyond the information provided by readily available clinical covariates. We combine information across chip types by identifying “matching probes” present on both chips, and then assembling them into new probesets based on Unigene clusters. This method yields comparable expression level quantifications across chips without sacrificing much precision or significantly altering the relative ordering of the samples. We fit a series of multivariable …


Accounting For Missing Data In End-Of-Life Research, Paula Diehr, Laura Lee Johnson Dec 2005

Accounting For Missing Data In End-Of-Life Research, Paula Diehr, Laura Lee Johnson

Paula Diehr

End-of-life studies are likely to have missing data because sicker persons are less likely to provide information and because measurements cannot be made after death. Ignoring missing data may result in data that are too favorable, because the sickest persons are effectively dropped from the analysis. In a comparison of two groups, the group with the most deaths and missing data will tend to have the most favorable data, which is not desirable. Results based on only the available data may not be generalizable to the original study population. If most of the missing data are absent because of death, …


Autism And Parental Marital Satisfaction: The Role Of Adequacy Of Resources, Geneeta Kaliah Chambers Dec 2005

Autism And Parental Marital Satisfaction: The Role Of Adequacy Of Resources, Geneeta Kaliah Chambers

Loma Linda University Electronic Theses, Dissertations & Projects

The goal of the present study was to expand on the existing literature exploring families with children who have developmental disabilities, particularly autism. Previous studies have been constrained by univariate approaches that have failed to adequately capture the nuances of family functioning. Using an ecological/context approach, stemming from an ongoing research program conducted within a university-based treatment center, the present study attempted to improve on the conceptualization of interrelationships among family members and the role that contextual factors play within that dynamic. Specifically, the present study explored the influence of children’s level of autism on parents’ reports of their marital …


Accuracy Of The Newtom 3g™ In Measuring The Angle Of The Articular Eminence, Rehana Khan Dec 2005

Accuracy Of The Newtom 3g™ In Measuring The Angle Of The Articular Eminence, Rehana Khan

Loma Linda University Electronic Theses, Dissertations & Projects

The purpose of this study was to determine the accuracy of the Newtom 3G™ in determining the angulation of the articular eminence. The benefits of conducting this study were to provide additional uses for the standard records that are taken for the purposes of orthodontic treatment, as well as evaluate the Newtom 3G™ for accuracy in measuring the anatomy of the glenoid fossa. This study required 20 participants that volunteered to allow their records to be used. Records evaluated were the Newtom 3G™, impressions, and wax check bite registrations. The wax record was taken using the 'forced bite' technique to …


Expert Testimony In Capital Sentencing: Juror Responses, John H. Montgomery, J. Richard Ciccone, Stephen P. Garvey, Theodore Eisenberg Dec 2005

Expert Testimony In Capital Sentencing: Juror Responses, John H. Montgomery, J. Richard Ciccone, Stephen P. Garvey, Theodore Eisenberg

Cornell Law Faculty Publications

The U.S. Supreme Court, in Furman v. Georgia (1972), held that the death penalty is constitutional only when applied on an individualized basis. The resultant changes in the laws in death penalty states fostered the involvement of psychiatric and psychologic expert witnesses at the sentencing phase of the trial, to testify on two major issues: (1) the mitigating factor of a defendant’s abnormal mental state and (2) the aggravating factor of a defendant’s potential for future violence. This study was an exploration of the responses of capital jurors to psychiatric/psychologic expert testimony during capital sentencing. The Capital Jury Project is …


Biosecurity And The Role Of Statisticians, Ron Brookmeyer Nov 2005

Biosecurity And The Role Of Statisticians, Ron Brookmeyer

Ron Brookmeyer

No abstract provided.


Methods For Incorporating Death Into Health-Related Variables In Longitudinal Studies, Paula Diehr Nov 2005

Methods For Incorporating Death Into Health-Related Variables In Longitudinal Studies, Paula Diehr

Paula Diehr

BACKGROUND AND OBJECTIVES: Longitudinal studies of health over time may be misleading if some people die. Self-rated health (excellent to poor) and the SF-36 profile scores have been transformed to incorporate death. We applied the same approaches to incorporate death into activities of daily living difficulties (ADLs), IADLs, mini-mental state examination, depressive symptoms, blocks walked per week, bed days, the timed walk, body mass index and blood pressure. STUDY DESIGN AND SETTING: The Cardiovascular Health Study of 5,888 older adults, was followed up to 9 years. Mean age was 73 at baseline, and 658 had an incident stroke during follow-up. …


Population Intervention Models In Causal Inference, Alan E. Hubbard, Mark J. Van Der Laan Oct 2005

Population Intervention Models In Causal Inference, Alan E. Hubbard, Mark J. Van Der Laan

U.C. Berkeley Division of Biostatistics Working Paper Series

Marginal structural models (MSM) provide a powerful tool for estimating the causal effect of a] treatment variable or risk variable on the distribution of a disease in a population. These models, as originally introduced by Robins (e.g., Robins (2000a), Robins (2000b), van der Laan and Robins (2002)), model the marginal distributions of treatment-specific counterfactual outcomes, possibly conditional on a subset of the baseline covariates, and its dependence on treatment. Marginal structural models are particularly useful in the context of longitudinal data structures, in which each subject's treatment and covariate history are measured over time, and an outcome is recorded at …


Gauss-Seidel Estimation Of Generalized Linear Mixed Models With Application To Poisson Modeling Of Spatially Varying Disease Rates, Subharup Guha, Louise Ryan Oct 2005

Gauss-Seidel Estimation Of Generalized Linear Mixed Models With Application To Poisson Modeling Of Spatially Varying Disease Rates, Subharup Guha, Louise Ryan

Harvard University Biostatistics Working Paper Series

Generalized linear mixed models (GLMMs) provide an elegant framework for the analysis of correlated data. Due to the non-closed form of the likelihood, GLMMs are often fit by computational procedures like penalized quasi-likelihood (PQL). Special cases of these models are generalized linear models (GLMs), which are often fit using algorithms like iterative weighted least squares (IWLS). High computational costs and memory space constraints often make it difficult to apply these iterative procedures to data sets with very large number of cases.

This paper proposes a computationally efficient strategy based on the Gauss-Seidel algorithm that iteratively fits sub-models of the GLMM …


Computational Techniques For Spatial Logistic Regression With Large Datasets, Christopher J. Paciorek, Louise Ryan Oct 2005

Computational Techniques For Spatial Logistic Regression With Large Datasets, Christopher J. Paciorek, Louise Ryan

Harvard University Biostatistics Working Paper Series

In epidemiological work, outcomes are frequently non-normal, sample sizes may be large, and effects are often small. To relate health outcomes to geographic risk factors, fast and powerful methods for fitting spatial models, particularly for non-normal data, are required. We focus on binary outcomes, with the risk surface a smooth function of space. We compare penalized likelihood models, including the penalized quasi-likelihood (PQL) approach, and Bayesian models based on fit, speed, and ease of implementation.

A Bayesian model using a spectral basis representation of the spatial surface provides the best tradeoff of sensitivity and specificity in simulations, detecting real spatial …


Is The Number Of Sick Persons In A Cohort Constant Over Time?, Paula Diehr, Ann Derleth, Anne Newman, Liming Cai Oct 2005

Is The Number Of Sick Persons In A Cohort Constant Over Time?, Paula Diehr, Ann Derleth, Anne Newman, Liming Cai

UW Biostatistics Working Paper Series

Objectives: To estimate the number of persons in a cohort who are sick, over time.

Methods: We calculated the number of sick persons in the Cardiovascular Health Study (CHS), a cohort study of older adults followed up to 14 years, using eight definitions of “healthy” and “sick”. We projected the number in each health state over time for a birth cohort.

Results: The number of sick persons in CHS was approximately constant for 14 years, for all definitions of “sick”. The estimated number of sick persons in the birth cohort was approximately constant from ages 55-75, after which it decreased. …


A Nonstationary Negative Binomial Time Series With Time-Dependent Covariates: Enterococcus Counts In Boston Harbor, E. Andres Houseman, Brent Coull, James P. Shine Sep 2005

A Nonstationary Negative Binomial Time Series With Time-Dependent Covariates: Enterococcus Counts In Boston Harbor, E. Andres Houseman, Brent Coull, James P. Shine

Harvard University Biostatistics Working Paper Series

Boston Harbor has had a history of poor water quality, including contamination by enteric pathogens. We conduct a statistical analysis of data collected by the Massachusetts Water Resources Authority (MWRA) between 1996 and 2002 to evaluate the effects of court-mandated improvements in sewage treatment. Motivated by the ineffectiveness of standard Poisson mixture models and their zero-inflated counterparts, we propose a new negative binomial model for time series of Enterococcus counts in Boston Harbor, where nonstationarity and autocorrelation are modeled using a nonparametric smooth function of time in the predictor. Without further restrictions, this function is not identifiable in the presence …


The Outcome Of Mta As A Root End Filling Material: A Long Term Evaluation, Christopher M. Sechrist Sep 2005

The Outcome Of Mta As A Root End Filling Material: A Long Term Evaluation, Christopher M. Sechrist

Loma Linda University Electronic Theses, Dissertations & Projects

Periradicular surgery is a viable option to save natural teeth when non-surgical treatment fails or when endodontic retreatment is not feasible or contraindicated. Laboratory and animal studies have demonstrated that MTA is biocompatible, provides an excellent seal against penetrating bacteria, and promotes hard tissue healing. The purpose of this study was to provide long term (>3 years) clinical evidence for its use as a root-end filling material in endodontics. The clinical records of 294 patients who had MTA used during endodontic treatment from 1996 to 2001 were reviewed. From these, 75 patients whose root end cavities had been filled …


Laser And Led Effects On The Proliferation Rate Of Periodontal Ligament Fibroblasts, Allen J. Job Sep 2005

Laser And Led Effects On The Proliferation Rate Of Periodontal Ligament Fibroblasts, Allen J. Job

Loma Linda University Electronic Theses, Dissertations & Projects

PURPOSE: To compare the effectiveness of a Gallium Aluminum Arsenide (GaAlAs) diode laser and a light emitting diode (LED) on periodontal ligament fibroblast cell proliferative rates.

METHODS and MATERIALS: PDLF obtained from freshly extracted permanent teeth were cultured under standard conditions until a subconfluent monolayer was present. The next section took 5 days to complete. On day 1, the initial cell concentration of 700 uL/cm2 was plated on 96-well assay plates and placed in a CO2 incubator at 37° C for 24 hours. On day 2, cell counts were first verified using hemocytometry then were irradiated using an …


Direct Effect Models, Mark J. Van Der Laan, Maya L. Petersen Aug 2005

Direct Effect Models, Mark J. Van Der Laan, Maya L. Petersen

U.C. Berkeley Division of Biostatistics Working Paper Series

The causal effect of a treatment on an outcome is generally mediated by several intermediate variables. Estimation of the component of the causal effect of a treatment that is mediated by a given intermediate variable (the indirect effect of the treatment), and the component that is not mediated by that intermediate variable (the direct effect of the treatment) is often relevant to mechanistic understanding and to the design of clinical and public health interventions. Under the assumption of no-unmeasured confounders for treatment and the intermediate variable, Robins & Greenland (1992) define an individual direct effect as the counterfactual effect of …


Attributable Risk Function In The Proportional Hazards Model, Ying Qing Chen, Chengcheng Hu, Yan Wang May 2005

Attributable Risk Function In The Proportional Hazards Model, Ying Qing Chen, Chengcheng Hu, Yan Wang

UW Biostatistics Working Paper Series

As an epidemiological parameter, the population attributable fraction is an important measure to quantify the public health attributable risk of an exposure to morbidity and mortality. In this article, we extend this parameter to the attributable fraction function in survival analysis of time-to-event outcomes, and further establish its estimation and inference procedures based on the widely used proportional hazards models. Numerical examples and simulations studies are presented to validate and demonstrate the proposed methods.


Prognosis Of Stage Ii Colon Cancer By Non-Neoplastic Mucosa Gene Expresssion Profiling, Alain Barrier, Sandrine Dudoit, Et Al. May 2005

Prognosis Of Stage Ii Colon Cancer By Non-Neoplastic Mucosa Gene Expresssion Profiling, Alain Barrier, Sandrine Dudoit, Et Al.

U.C. Berkeley Division of Biostatistics Working Paper Series

Aims. This study assessed the possibility to build a prognosis predictor, based on non-neoplastic mucosa microarray gene expression measures, in stage II colon cancer patients. Materials and Methods. Non-neoplastic colonic mucosa mRNA samples from 24 patients (10 with a metachronous metastasis, 14 with no recurrence) were profiled using the Affymetrix HGU133A GeneChip. The k-nearest neighbor method was used for prognosis prediction using microarray gene expression measures. Leave-one-out cross-validation was used to select the number of neighbors and number of informative genes to include in the predictor. Based on this information, a prognosis predictor was proposed and its accuracy estimated by …


Colon Cancer Prognosis Prediction By Gene Expression Profiling, Alain Barrier, Sandrine Dudoit, Et Al. May 2005

Colon Cancer Prognosis Prediction By Gene Expression Profiling, Alain Barrier, Sandrine Dudoit, Et Al.

U.C. Berkeley Division of Biostatistics Working Paper Series

Aims. This study assessed the possibility to build a prognosis predictor, based on microarray gene expression measures, in stage II and III colon cancer patients. Materials and Methods. Tumour (T) and non-neoplastic mucosa (NM) mRNA samples from 18 patients (9 with a recurrence, 9 with no recurrence) were profiled using the Affymetrix HGU133A GeneChip. The k-nearest neighbour method was used for prognosis prediction using T and NM gene expression measures. Six-fold cross-validation was applied to select the number of neighbours and the number of informative genes to include in the predictors. Based on this information, one T-based and one NM-based …


Causal Inference In Longitudinal Studies With History-Restricted Marginal Structural Models, Romain Neugebauer, Mark J. Van Der Laan, Ira B. Tager Apr 2005

Causal Inference In Longitudinal Studies With History-Restricted Marginal Structural Models, Romain Neugebauer, Mark J. Van Der Laan, Ira B. Tager

U.C. Berkeley Division of Biostatistics Working Paper Series

Causal Inference based on Marginal Structural Models (MSMs) is particularly attractive to subject-matter investigators because MSM parameters provide explicit representations of causal effects. We introduce History-Restricted Marginal Structural Models (HRMSMs) for longitudinal data for the purpose of defining causal parameters which may often be better suited for Public Health research. This new class of MSMs allows investigators to analyze the causal effect of a treatment on an outcome based on a fixed, shorter and user-specified history of exposure compared to MSMs. By default, the latter represents the treatment causal effect of interest based on a treatment history defined by the …


The Sensitivity And Specificity Of Markers For Event Times, Tianxi Cai, Margaret S. Pepe, Thomas Lumley, Yingye Zheng, Nancy Swords Jenny Apr 2005

The Sensitivity And Specificity Of Markers For Event Times, Tianxi Cai, Margaret S. Pepe, Thomas Lumley, Yingye Zheng, Nancy Swords Jenny

Harvard University Biostatistics Working Paper Series

No abstract provided.


The Importance Of Experimental Design In Proteomic Mass Spectrometry Experiments: Some Cautionary Tales, Jeffrey S. Morris, Jianhua Hu, Kevin R. Coombes, Keith A. Baggerly Mar 2005

The Importance Of Experimental Design In Proteomic Mass Spectrometry Experiments: Some Cautionary Tales, Jeffrey S. Morris, Jianhua Hu, Kevin R. Coombes, Keith A. Baggerly

Jeffrey S. Morris

Proteomic expression patterns derived from mass spectrometry have been put forward as potential biomarkers for the early diagnosis of cancer and other diseases. This approach has generated much excitement and has led to a large number of new experiments and vast amounts of new data. The data, derived at great expense, can have very little value if careful attention is not paid to the experimental design and analysis. Using examples from surfaceenhanced laser desorption/ionisation time-of-flight (SELDI-TOF) and matrix-assisted laser desorption–ionisation/time-of-flight (MALDI-TOF) experiments, we describe several experimental design issues that can corrupt a dataset. Fortunately, the problems we identify can be …


Acute Toxicity Testing Without Animals: More Scientific And Less Of A Gamble, Gillian R. Langley Mar 2005

Acute Toxicity Testing Without Animals: More Scientific And Less Of A Gamble, Gillian R. Langley

Application of Alternative Methods Collection

In this report, we argue specifically that acute toxicity data should not be sought from animal tests. The underlying principle of such tests on rats and mice is that the results can be effectively extrapolated to humans. In fact, after nearly 80 years of use of these tests, the predictivity of rodent data for human acute toxic effects has been disputed but never proven.


Serum Proteomics Profiling: A Young Technology Begins To Mature, Kevin R. Coombes, Jeffrey S. Morris, Jianhua Hu, Sarah R. Edmondson, Keith A. Baggerly Mar 2005

Serum Proteomics Profiling: A Young Technology Begins To Mature, Kevin R. Coombes, Jeffrey S. Morris, Jianhua Hu, Sarah R. Edmondson, Keith A. Baggerly

Jeffrey S. Morris

No abstract provided.


Against 'Individual Risk': A Sympathetic Critique Of Risk Assessment, Matthew D. Adler Mar 2005

Against 'Individual Risk': A Sympathetic Critique Of Risk Assessment, Matthew D. Adler

All Faculty Scholarship

"Individual risk" currently plays a major role in risk assessment and in the regulatory practices of the health and safety agencies that employ risk assessment, such as EPA, FDA, OSHA, NRC, CPSC, and others. Risk assessors use the term "population risk" to mean the number of deaths caused by some hazard. By contrast, "individual risk" is the incremental probability of death that the hazard imposes on some particular person. Regulatory decision procedures keyed to individual risk are widespread. This is true both for the regulation of toxic chemicals (the heartland of risk assessment), and for other health hazards, such as …


Signal In Noise: Evaluating Reported Reproducibility Of Serum Proteomic Tests For Ovarian Cancer, Keith A. Baggerly, Jeffrey S. Morris, Sarah R. Edmonson, Kevin R. Coombes Feb 2005

Signal In Noise: Evaluating Reported Reproducibility Of Serum Proteomic Tests For Ovarian Cancer, Keith A. Baggerly, Jeffrey S. Morris, Sarah R. Edmonson, Kevin R. Coombes

Jeffrey S. Morris

Proteomic profi ling of serum initially appeared to be dramatically effective for diagnosis of early-stage ovarian cancer, but these results have proven diffi cult to reproduce. A recent publication reported good classifi cation in one dataset using results from training on a much earlier dataset, but the authors have since reported that they did not perform the analysis as described. We examined the reproducibility of the proteomic patterns across datasets in more detail. Our analysis reveals that the pattern that enabled successful classifi cation is biologically implausible and that the method, properly applied, does not classify the data accurately. We …


Reliability, Effect Size, And Responsiveness Of Health Status Measures In The Design Of Randomized And Cluster-Randomized Trials, Paula Diehr Feb 2005

Reliability, Effect Size, And Responsiveness Of Health Status Measures In The Design Of Randomized And Cluster-Randomized Trials, Paula Diehr

Paula Diehr

BACKGROUND: New health status survey instruments are often described by their psychometric (measurement) properties, such as Validity, Reliability, Effect Size, and Responsiveness. For cluster-randomized trials, another important statistic is the Intraclass Correlation (ICC) for the instrument within clusters. Studies using better instruments can be performed with smaller sample sizes, but better instruments may be more expensive in terms of dollars, opportunity cost, or poorer data quality due to the response burden of longer instruments. METHODS: We defined the psychometric statistics in terms of a mathematical model, and examined the power of a two-sample test as a function of the test-retest …


Insights Into Latent Class Analysis, Margaret S. Pepe, Holly Janes Jan 2005

Insights Into Latent Class Analysis, Margaret S. Pepe, Holly Janes

UW Biostatistics Working Paper Series

Latent class analysis is a popular statistical technique for estimating disease prevalence and test sensitivity and specificity. It is used when a gold standard assessment of disease is not available but results of multiple imperfect tests are. We derive analytic expressions for the parameter estimates in terms of the raw data, under the conditional independence assumption. These expressions indicate explicitly how observed two- and three-way associations between test results are used to infer disease prevalence and test operating characteristics. Although reasonable if the conditional independence model holds, the estimators have no basis when it fails. We therefore caution against using …


Standardizing Markers To Evaluate And Compare Their Performances, Margaret S. Pepe, Gary M. Longton Jan 2005

Standardizing Markers To Evaluate And Compare Their Performances, Margaret S. Pepe, Gary M. Longton

UW Biostatistics Working Paper Series

Introduction: Markers that purport to distinguish subjects with a condition from those without a condition must be evaluated rigorously for their classification accuracy. A single approach to statistically evaluating and comparing markers is not yet established.

Methods: We suggest a standardization that uses the marker distribution in unaffected subjects as a reference. For an affected subject with marker value Y, the standardized placement value is the proportion of unaffected subjects with marker values that exceed Y.

Results: We apply the standardization to two illustrative datasets. In patients with pancreatic cancer placement values calculated for the CA 19-9 marker are smaller …