Statistical Models Commons

Open Access. Powered by Scholars. Published by Universities.

25 Institutions 440 Full-Text Articles 570 Authors 71,630 Downloads

Recent Articles in Statistical Models

Targeted Maximum Likelihood Estimation For Dynamic And Static Longitudinal Marginal Structural Working Models, Maya L. Petersen, Joshua Schwab, Susan Gruber, Nello Blaser, Michael Schomaker, Mark J. van der Laan COBRA

Targeted Maximum Likelihood Estimation For Dynamic And Static Longitudinal Marginal Structural Working Models, Maya L. Petersen, Joshua Schwab, Susan Gruber, Nello Blaser, Michael Schomaker, Mark J. Van Der Laan

U.C. Berkeley Division of Biostatistics Working Paper Series

This paper presents a novel targeted maximum likelihood estimator (TMLE) estimator for the parameters of longitudinal static and dynamic marginal structural models.We consider a longitudinal data structure consisting of baseline covariates, time-dependent intervention nodes, intermediate time-dependent covariates, and a possibly time dependent outcome. The intervention nodes at each time point can include a binary treatment as well as a right-censoring indicator. Given a class of dynamic or static interventions, a marginal structural model is used to model the mean of the intervention specific counterfactual outcome as a function of the intervention and time point.Because the true shape of ...


Does The Sat Predict Academic Achievement And Academic Choices At Macalester College?, Jing Wen Macalester College

Does The Sat Predict Academic Achievement And Academic Choices At Macalester College?, Jing Wen

Honors Projects

This paper examines the predictive power of the Scholastic Aptitude Test (SAT) for Macalester students’ college success and academic choices. We use linear regression to study whether the SAT can predict students’ first year or four-year grades. Using Kullback-Leibler divergence and classification trees, we also examine the SAT’s predictive ability for other aspects of students’ academic experience, for example, major selection, or academic division of study. After controlling for major and course level, we find that the SAT does not explain a large proportion of the variability in Macalester students’ college success. However, the SAT does provide some useful ...


Analysis Of Spatial Data, Xiang Zhang University of Kentucky

Analysis Of Spatial Data, Xiang Zhang

Theses and Dissertations--Statistics

In many areas of the agriculture, biological, physical and social sciences, spatial lattice data are becoming increasingly common. In addition, a large amount of lattice data shows not only visible spatial pattern but also temporal pattern (see, Zhu et al. 2005). An interesting problem is to develop a model to systematically model the relationship between the response variable and possible explanatory variable, while accounting for space and time effect simultaneously.

Spatial-temporal linear model and the corresponding likelihood-based statistical inference are important tools for the analysis of spatial-temporal lattice data. We propose a general asymptotic framework for spatial-temporal linear models and ...


Estimating Effects On Rare Outcomes: Knowledge Is Power, Laura B. Balzer, Mark J. van der Laan COBRA

Estimating Effects On Rare Outcomes: Knowledge Is Power, Laura B. Balzer, Mark J. Van Der Laan

U.C. Berkeley Division of Biostatistics Working Paper Series

Understanding the etiology of rare cancers, perinatal mortality, international conflicts or natural disasters can have profound impacts on population health and well-being. However, when the outcome of interest occurs in 5% or less of the population, effect estimation can be particularly challenging. To increase statistical power and the stability of results, researchers commonly oversample cases or events. However, the study of rare outcomes need not be limited to case-control settings. Building on the work of Gruber and van der Laan (2010), we construct a new targeted minimum loss-based estimator (TMLE) for estimating the effect of an exposure or treatment on ...


Modeling The Relationship Between Coal Mining And Respiratory Health In West Virginia, Jessica Welch University of Tennessee, Knoxville

Modeling The Relationship Between Coal Mining And Respiratory Health In West Virginia, Jessica Welch

University of Tennessee Honors Thesis Projects

No abstract provided.


Statistical Mechanics Of Temporal And Interacting Networks, Kun Zhao Northeastern University

Statistical Mechanics Of Temporal And Interacting Networks, Kun Zhao

Physics Dissertations

In the last ten years important breakthroughs in the understanding of the topology of complexity have been made in the framework of network science. Indeed it has been found that many networks belong to the universality classes called small-world networks or scale-free networks. Moreover it was found that the complex architecture of real world networks strongly affects the critical phenomena defined on these structures. Nevertheless the main focus of the research has been the characterization of single and static networks.

Recently, temporal networks and interacting networks have attracted large interest. Indeed many networks are interacting or formed by a multilayer ...


Is Obesity Socially Contagious?, Ciani Jean Sparks California Polytechnic State University

Is Obesity Socially Contagious?, Ciani Jean Sparks

Statistics

The main objective of this paper is to analyze three different articles that discuss whether obesity could be socially contagious. According to the World Health Organization in 2013, obesity is the fifth leading risk for deaths around the world. This disease has dramatically increased in the last decade, which has led scientists to believe there are other factors contributing to the epidemic besides genetics. The first article I analyzed, written by Nicholas Christakis and James Fowler, provided a logistic regression model to estimate the odds of a person becoming obese. The model included the explanatory variables: age, sex, education, smoking ...


Significant Themes In 19th-Century Literature, Matthew L. Jockers, David Mimno University of Nebraska - Lincoln

Significant Themes In 19th-Century Literature, Matthew L. Jockers, David Mimno

Faculty Publications -- Department of English

External factors such as author gender, author nationality, and date of publication affect both the choice of literary themes in novels and the expression of those themes, but the extent of this association is difficult to quantify. In this work, we apply statistical methods to identify and extract hundreds of "topics" from a corpus of 3,346 works of 19th-century British, Irish, and American fiction. We use these topics as a measurable, data-driven proxy for literary themes. External factors may predict fluctuations in the use of themes and the individual word choices within themes. We use topics to measure the ...


A Bayesian Regression Tree Approach To Identify The Effect Of Nanoparticles Properties On Toxicity Profiles, Cecile Low-Kam, Haiyuan Zhang, Zhaoxia Ji, Tian Xia, Jeffrey I. Zinc, Andre Nel, Donatello Telesca COBRA

A Bayesian Regression Tree Approach To Identify The Effect Of Nanoparticles Properties On Toxicity Profiles, Cecile Low-Kam, Haiyuan Zhang, Zhaoxia Ji, Tian Xia, Jeffrey I. Zinc, Andre Nel, Donatello Telesca

COBRA Preprint Series

We introduce a Bayesian multiple regression tree model to characterize relationships between physico-chemical properties of nanoparticles and their in-vitro toxicity over multiple doses and times of exposure. Unlike conventional models that rely on data summaries, our model solves the low sample size issue and avoids arbitrary loss of information by combining all measurements from a general exposure experiment across doses, times of exposure, and replicates. The proposed technique integrates Bayesian trees for modeling threshold effects and interactions, and penalized B-splines for dose and time-response surfaces smoothing. The resulting posterior distribution is sampled via a Markov Chain Monte Carlo algorithm. This ...


Consistency Under Sampling Of Exponential Random Graph Models, Cosma R. Shalizi, Alessandro Rinaldo Carnegie Mellon University

Consistency Under Sampling Of Exponential Random Graph Models, Cosma R. Shalizi, Alessandro Rinaldo

Department of Statistics

The growing availability of network data and of scientific interest in distributed systems has led to the rapid development of statistical models of network structure. Typically, however, these are models for the entire network, while the data consists only of a sampled sub-network. Parameters for the whole network, which is what is of interest, are estimated by applying the model to the sub-network. This assumes that the model is consistent under sampling, or, in terms of the theory of stochastic processes, that it defines a projective family. Focussing on the popular class of exponential random graph models (ERGMs), we show ...


Generalized Fiducial Inference For Normal Linear Mixed Models, Jessica Cisewski, Jan Hannig Carnegie Mellon University

Generalized Fiducial Inference For Normal Linear Mixed Models, Jessica Cisewski, Jan Hannig

Department of Statistics

While linear mixed modeling methods are foundational concepts introduced in any statistical education, adequate general methods for interval estimation involving models with more than a few variance components are lacking, especially in the unbalanced setting. Generalized fiducial inference provides a possible framework that accommodates this absence of methodology. Under the fabric of generalized fiducial inference along with sequential Monte Carlo methods, we present an approach for interval estimation for both balanced and unbalanced Gaussian linear mixed models. We compare the proposed method to classical and Bayesian results in the literature in a simulation study of two-fold nested models and two-factor ...


Global Quantitative Assessment Of The Colorectal Polyp Burden In, Patrick M. Lynch, Jeffrey S. Morris, William A. Ross, Miguel A. Rodriguez-Bigas, Juan Posadas, Rossa Khalaf, Diane M. Weber, Valerie O. Sepeda, Bernard Levin, Imad Shureiqi The University of Texas

Global Quantitative Assessment Of The Colorectal Polyp Burden In, Patrick M. Lynch, Jeffrey S. Morris, William A. Ross, Miguel A. Rodriguez-Bigas, Juan Posadas, Rossa Khalaf, Diane M. Weber, Valerie O. Sepeda, Bernard Levin, Imad Shureiqi

Jeffrey S. Morris

Background: Accurate measures of the total polyp burden in familial adenomatous polyposis (FAP) are lacking. Current assessment tools include polyp quantitation in limited-field photographs and qualitative total colorectal polyp burden by video.

Objective: To develop global quantitative tools of the FAP colorectal adenoma burden.

Design: A single-arm, phase II trial.

Patients: Twenty-seven patients with FAP.

Intervention: Treatment with celecoxib for 6 months, with before-treatment and after-treatment videos posted to an intranet with an interactive site for scoring.

Main Outcome Measurements: Global adenoma counts and sizes (grouped into categories: less than 2 mm, 2-4 mm, and greater than 4 mm) were ...


Asymmetric Empirical Similarity, Joshua C. Teitelbaum Georgetown University Law Center

Asymmetric Empirical Similarity, Joshua C. Teitelbaum

Georgetown Law Faculty Publications and Other Works

The paper suggests a similarity function for applications of empirical similarity theory in which the notion of similarity is asymmetric. I propose defining similarity in terms of a quasimetric. I suggest a particular quasimetric and explore the properties of the empirical similarity model given this function. The proposed function belongs to the class of quasimetrics induced by skewed norms. Finally, I provide a skewness axiom that, when imposed in lieu of the symmetry axiom in the main result of Billot et al. (2008), characterizes an exponential similarity function based on a skewed norm.


A Regionalized National Universal Kriging Model Using Partial Least Squares Regression For Estimating Annual Pm2.5 Concentrations In Epidemiology, Paul D. Sampson, Mark Richards, Adam A. Szpiro, Silas Bergen, Lianne Sheppard, Timothy V. Larson, Joel Kaufman COBRA

A Regionalized National Universal Kriging Model Using Partial Least Squares Regression For Estimating Annual Pm2.5 Concentrations In Epidemiology, Paul D. Sampson, Mark Richards, Adam A. Szpiro, Silas Bergen, Lianne Sheppard, Timothy V. Larson, Joel Kaufman

UW Biostatistics Working Paper Series

Many cohort studies in environmental epidemiology require accurate modeling and prediction of fine scale spatial variation in ambient air quality across the U.S. This modeling requires the use of small spatial scale geographic or “land use” regression covariates and some degree of spatial smoothing. Furthermore, the details of the prediction of air quality by land use regression and the spatial variation in ambient air quality not explained by this regression should be allowed to vary across the continent due to the large scale heterogeneity in topography, climate, and sources of air pollution. This paper introduces a regionalized national universal ...


General Recognition Theory Extended To Include Response Times: Predictions For A Class Of Parallel Systems, Joseph W. Houpt, James T. Townsend, Noah H. Silbert Wright State University

General Recognition Theory Extended To Include Response Times: Predictions For A Class Of Parallel Systems, Joseph W. Houpt, James T. Townsend, Noah H. Silbert

Joseph W. Houpt

No abstract provided.


Capacity Coefficient Variations, Joseph W. Houpt, Andrew Heathcote, Ami Eidels, Nathan Medeiros-Ward, Jason Watson, David Strayer Wright State University

Capacity Coefficient Variations, Joseph W. Houpt, Andrew Heathcote, Ami Eidels, Nathan Medeiros-Ward, Jason Watson, David Strayer

Joseph W. Houpt

The capacity coefficient has become an increasingly popular measure of efficiency under changes in workload. It has been used in applications ranging from psychophysical detection tasks to complex cognitive tasks, as well as in addressing questions in social and clinical psychology. The basic formulation compares response times to each stimulus property (or task) in isolation to response times with all stimulus properties (or tasks) at the same time. A number of variations on the basic capacity coefficient have been used, both in the experimental design and in the calculations, and many more are possible. Here we outline the theoretical reasons ...


Statistical And Methodological Issues On Covariate Adjustment In Clinical Trials, Rong Chu McMaster University

Statistical And Methodological Issues On Covariate Adjustment In Clinical Trials, Rong Chu

Open Access Dissertations and Theses

Background and objectives

We investigate three issues related to the adjustment for baseline covariates in late phase clinical trials: (1) the analysis of correlated outcomes in multicentre RCTs, (2) the assessment of the probability and implication of prognostic imbalance in RCTs, and (3) the adjustment for baseline confounding in cohort studies.

Methods

Project 1: We investigated the properties of six statistical methods for analyzing continuous outcomes in multicentre randomized controlled trials (RCTs) where within-centre clustering was possible. We simulated studies over various intraclass correlation (ICC) values with several centre combinations.

Project 2: We simulated data from RCTs evaluating a binary ...


Approximate Methods For Dynamic Portfolio Allocation Under Transaction Costs, Nabeel Butt Western University

Approximate Methods For Dynamic Portfolio Allocation Under Transaction Costs, Nabeel Butt

Electronic Thesis and Dissertation Repository

The thesis provides robust and efficient lattice based algorithms for solving dynamic portfolio allocation problems under transaction costs. The early part of the thesis concentrates upon developing a toolbox based on multinomial trees. The multinomial trees are shown to provide a reasonable approximation for most popular transaction cost models in the academic literature. The tool, once forged, is implemented in the powerful Mathematica based parallel computing environment. In the second part of the thesis we provide applications of our framework to real world problems. We show re-balancing portfolios is more valuable in an investment environment where the growth and volatility ...


Retrieval Of Sub-Pixel-Based Fire Intensity And Its Application For Characterizing Smoke Injection Heights And Fire Weather In North America, David Peterson University of Nebraska - Lincoln

Retrieval Of Sub-Pixel-Based Fire Intensity And Its Application For Characterizing Smoke Injection Heights And Fire Weather In North America, David Peterson

Dissertations & Theses in Earth and Atmospheric Sciences

For over two decades, satellite sensors have provided the locations of global fire activity with ever-increasing accuracy. However, the ability to measure fire intensity, know as fire radiative power (FRP), and its potential relationships to meteorology and smoke plume injection heights, are currently limited by the pixel resolution. This dissertation describes the development of a new, sub-pixel-based FRP calculation (FRPf) for fire pixels detected by the MODerate Resolution Imaging Spectroradiometer (MODIS) fire detection algorithm (Collection 5), which is subsequently applied to several large wildfire events in North America. The methodology inherits an earlier bi-spectral algorithm for retrieving sub-pixel fire ...


Modeling Hydrogen-Bonding In Diblock Copolymer/Homopolymer Blends, Ashkan Dehghan Kooshkghazi McMaster University

Modeling Hydrogen-Bonding In Diblock Copolymer/Homopolymer Blends, Ashkan Dehghan Kooshkghazi

Open Access Dissertations and Theses

The phase behavior of AB diblock copolymers mixed with C homopolymers (AB/C), in which A and C are capable of forming hydrogen-bonds, is examined using self-consistent field theory. The study focuses on the modeling of hydrogen-bonding in polymers. Specifically, we examine two models for the formation of hydrogen-bonds between polymer chains. The first commonly used model assumes a large attractive interaction parameter between the A/C monomers. This model reproduces correct phase transition sequences as compared with experiments, but it fails to correctly describe the change of lamellar spacing induced by the addition of the C homopolymers. The second ...