Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Applied Statistics

Selected Works

2010

Articles 1 - 28 of 28

Full-Text Articles in Physical Sciences and Mathematics

Modeling Longitudinal Data Using A Pair-Copula Decomposition Of Serial Dependence, Michael S. Smith, Aleksey Min, Carlos Almeida, Claudia Czado Nov 2010

Michael Stanley Smith

Copulas have proven to be very successful tools for the flexible modelling of cross-sectional dependence. In this paper we express the dependence structure of continuous-valued time series data using a sequence of bivariate copulas. This corresponds to a type of decomposition recently called a ‘vine’ in the graphical models literature, where each copula is entitled a ‘pair-copula’. We propose a Bayesian approach for the estimation of this dependence structure for longitudinal data. Bayesian selection ideas are used to identify any independence pair-copulas, with the end result being a parsimonious representation of a time-inhomogeneous Markov process of varying order. Estimates are …


The Positive Solutions Of The Matukuma Equation And The Problem Of Finite Radius And Finite Mass, Jurgen Batt, Yi Li Nov 2010

Yi Li

This work is an extensive study of the three different types of positive solutions of the Matukuma equation $\frac{1}{r^2}\left(r^2\phi'\right)' = -\frac{r^{\lambda-2}}{(1+r^2)^{\lambda/2}}\,\phi^p$, $p>1$, $\lambda>0$: the E-solutions (regular at r = 0), the M-solutions (singular at r = 0) and the F-solutions (whose existence begins away from r = 0). An essential tool is a transformation of the equation into a 2-dimensional asymptotically autonomous system, whose limit sets (by a theorem of H. R. Thieme) are the limit sets of Emden–Fowler systems, and serve to characterize the different solutions. The emphasis lies on the study of the M-solutions. …


Men In Black: The Impact Of New Contracts On Football Referees’ Performances, Babatunde Buraimo, Alex Bryson, Rob Simmons Oct 2010

Dr Babatunde Buraimo

No abstract provided.


Estimating Confidence Intervals For Eigenvalues In Exploratory Factor Analysis, Ross Larsen, Russell Warne Jul 2010

Russell T Warne

Exploratory factor analysis (EFA) has become a common procedure in educational and psychological research. In the course of performing an EFA, researchers often base the decision of how many factors to retain on the eigenvalues for the factors. However, many researchers do not realize that eigenvalues, like all sample statistics, are subject to sampling error, which means that confidence intervals (CIs) can be estimated for each eigenvalue. In the present article, we demonstrate two methods of estimating CIs for eigenvalues: one based on the mathematical properties of the central limit theorem, and the other based on bootstrapping. References to appropriate …


The 1905 Einstein Equation In A General Mathematical Analysis Model Of Quasars, Byron E. Bell May 2010

Byron E. Bell

The 1905 wave equation of Albert Einstein is a model that can be used in many areas, such as physics, applied mathematics, statistics, quantum chaos and financial mathematics. I will give a proof based on the equation in A. Einstein’s paper “Zur Elektrodynamik bewegter Körper”. This will be done by removing the time variable (t) and the constant (c), the speed of light, from the above equation and looking at the factors that affect the model in a real analysis framework. The model is tested with the SDSS-DR5 Quasar Catalog (Schneider +, 2007). Keywords: direction cosine, apparent magnitudes of optical light; ultraviolet …


Conference Proceedings 3rd International Scientific Conference On “Energy Systems With IT” At Alvsjö Fair In Association With Energitinget March 16-17 2010, Dr. Erik Dahlquist, Dr. Jenny Palm Mar 2010

Dr. Erik Dahlquist

In 2010, “The Energiting” is held for the 12th time. The International Scientific Conference is arranged for the 3rd time. The organisers are the Swedish Energy Agency, Mälardalen University and the Research School for Energy Systems with LiU, KTH, UU and CTH. The first topic will be “Energy systems”, covering use of renewable energy sources, energy conversion and process efficiency improvement with new technologies, as well as societal aspects of the introduction of new technologies. The second topic is “Energy and IT”. This covers energy and load management, interaction between production, distribution and “consumption”, usage of data for decision support and control, …


On Simulating Univariate And Multivariate Burr Type III And Type XII Distributions, Todd C. Headrick, Mohan D. Pant, Yanyan Sheng Mar 2010

Mohan Dev Pant

This paper describes a method for simulating univariate and multivariate Burr Type III and Type XII distributions with specified correlation matrices. The methodology is based on the derivation of the parametric forms of a pdf and cdf for this family of distributions. The paper shows how shape parameters can be computed for specified values of skew and kurtosis. It is also demonstrated how to compute percentage points and other measures of central tendency such as the mode, median, and trimmed mean. Examples are provided to demonstrate how this Burr family can be used in the context of distribution fitting using …
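As a rough illustration of what simulating from this family involves, the sketch below draws univariate Burr Type XII and Type III variates by the standard inverse-cdf transform. This is a swapped-in textbook technique, not the method of the paper, and it does not address specified skew, kurtosis or correlation matrices. The parameterisation (shape parameters c, k > 0, with cdf 1 - (1 + x^c)^(-k) for Type XII and (1 + x^(-c))^(-k) for Type III) and all function names are assumptions made for this example.

```python
import random


def burr12_cdf(x, c, k):
    """Burr Type XII cdf, assumed form F(x) = 1 - (1 + x^c)^(-k) for x > 0."""
    return 1.0 - (1.0 + x ** c) ** (-k)


def burr3_cdf(x, c, k):
    """Burr Type III cdf, assumed form F(x) = (1 + x^(-c))^(-k) for x > 0."""
    return (1.0 + x ** (-c)) ** (-k)


def burr12_quantile(u, c, k):
    """Inverse of burr12_cdf for 0 < u < 1."""
    return ((1.0 - u) ** (-1.0 / k) - 1.0) ** (1.0 / c)


def burr3_quantile(u, c, k):
    """Inverse of burr3_cdf for 0 < u < 1."""
    return (u ** (-1.0 / k) - 1.0) ** (-1.0 / c)


def simulate(quantile, n, c, k, seed=0):
    """Draw n variates by pushing uniforms through the given quantile function."""
    rng = random.Random(seed)
    draws = []
    while len(draws) < n:
        u = rng.random()
        if u == 0.0:  # random() can return exactly 0; keep u strictly inside (0, 1)
            continue
        draws.append(quantile(u, c, k))
    return draws


if __name__ == "__main__":
    sample = simulate(burr12_quantile, 5, c=2.0, k=3.0)
    print(sample)
```

The quantile functions also give percentage points directly; for example, the Type XII median under this parameterisation is (2^(1/k) - 1)^(1/c).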


Manifest Greatness The Final Original Version By Emmanuel Mario B Santos Aka Marc Guerrero, Emmanuel Mario B. Santos Aka Marc Guerrero Jan 2010

Emmanuel Mario B Santos aka Marc Guerrero

MANIFEST GREATNESS vf24jan2010 WE COME TOGETHER THERE OUGHT TO BE NO POOR WE TAKE CHARGE.


Errata Negative Binomial Regression 1st Edition 1st Print, Joseph Hilbe Jan 2010

Joseph M Hilbe

Errata for the first edition and printing of Negative Binomial Regression, August 2007. Many of the items listed here were corrected in the 2008 second printing.


International Diversification: A Copula Approach, Lorán Chollete, Victor De La Pena, Ching-Chih Lu Jan 2010

Lorán Chollete

No abstract provided.


Wavelet-Based Functional Linear Mixed Models: An Application To Measurement Error–Corrected Distributed Lag Models, Elizabeth J. Malloy, Jeffrey S. Morris, Sara D. Adar, Helen Suh, Diane R. Gold, Brent A. Coull Jan 2010

Jeffrey S. Morris

Frequently, exposure data are measured over time on a grid of discrete values that collectively define a functional observation. In many applications, researchers are interested in using these measurements as covariates to predict a scalar response in a regression setting, with interest focusing on the most biologically relevant time window of exposure. One example is in panel studies of the health effects of particulate matter (PM), where particle levels are measured over time. In such studies, there are many more values of the functional data than observations in the data set so that regularization of the corresponding functional regression coefficient …


Members’ Discoveries: Fatal Flaws In Cancer Research, Jeffrey S. Morris Jan 2010

Jeffrey S. Morris

A recent article published in The Annals of Applied Statistics (AOAS) by two MD Anderson researchers—Keith Baggerly and Kevin Coombes—dissects results from a highly-influential series of medical papers involving genomics-driven personalized cancer therapy, and outlines a series of simple yet fatal flaws that raises serious questions about the veracity of the original results. Having immediate and strong impact, this paper, along with related work, is providing the impetus for new standards of reproducibility in scientific research.


Statistical Contributions To Proteomic Research, Jeffrey S. Morris, Keith A. Baggerly, Howard B. Gutstein, Kevin R. Coombes Jan 2010

Jeffrey S. Morris

Proteomic profiling has the potential to impact the diagnosis, prognosis, and treatment of various diseases. A number of different proteomic technologies are available that allow us to look at many proteins at once, and all of them yield complex data that raise significant quantitative challenges. Inadequate attention to these quantitative issues can prevent these studies from achieving their desired goals, and can even lead to invalid results. In this chapter, we describe various ways the involvement of statisticians or other quantitative scientists in the study team can contribute to the success of proteomic research, and we outline some of the …


Informatics And Statistics For Analyzing 2-D Gel Electrophoresis Images, Andrew W. Dowsey, Jeffrey S. Morris, Howard G. Gutstein, Guang Z. Yang Jan 2010

Jeffrey S. Morris

Whilst recent progress in ‘shotgun’ peptide separation by integrated liquid chromatography and mass spectrometry (LC/MS) has enabled its use as a sensitive analytical technique, proteome coverage and reproducibility are still limited and obtaining enough replicate runs for biomarker discovery is a challenge. For these reasons, recent research demonstrates the continuing need for protein separation by two-dimensional gel electrophoresis (2-DE). However, with traditional 2-DE informatics, the digitized images are reduced to symbolic data through spot detection and quantification before proteins are compared for differential expression by spot matching. Recently, a more robust and automated paradigm has emerged where gels are directly …


Bayesian Random Segmentation Models To Identify Shared Copy Number Aberrations For Array CGH Data, Veerabhadran Baladandayuthapani, Yuan Ji, Rajesh Talluri, Luis E. Nieto-Barajas, Jeffrey S. Morris Jan 2010

Jeffrey S. Morris

Array-based comparative genomic hybridization (aCGH) is a high-resolution high-throughput technique for studying the genetic basis of cancer. The resulting data consists of log fluorescence ratios as a function of the genomic DNA location and provides a cytogenetic representation of the relative DNA copy number variation. Analysis of such data typically involves estimation of the underlying copy number state at each location and segmenting regions of DNA with similar copy number states. Most current methods proceed by modeling a single sample/array at a time, and thus fail to borrow strength across multiple samples to infer shared regions of copy number aberrations. …


Participation And Engagement In Sport: A Double Hurdle Approach For The United Kingdom, Babatunde Buraimo, Brad Humphreys, Rob Simmons Jan 2010

Dr Babatunde Buraimo

This paper uses pooled cross-section data from four waves of the United Kingdom’s Taking Part Survey, 2005 to 2009, in order to investigate determinants of probability of participation and levels of engagement in sports. The two rival modelling approaches considered here are the double-hurdle approach and the Heckman sample selection model. The Heckman model proves to be deficient in several key respects. The double-hurdle approach offers more reliable estimates than the Heckman sample selection model, at least for this particular survey. The distinction is more than just statistical nuance as there are substantive differences in qualitative results from the two …


Creation Of Synthetic Discrete Response Regression Models, Joseph Hilbe Jan 2010

Joseph M Hilbe

The development and use of synthetic regression models has proven to assist statisticians in better understanding bias in data, as well as how to best interpret various statistics associated with a modeling situation. In this article I present code that can be easily amended for the creation of synthetic binomial, count, and categorical response models. Parameters may be assigned to any number of predictors (which are shown as continuous, binary, or categorical), negative binomial heterogeneity parameters may be assigned, and the number of levels or cut points and values may be specified for ordered and unordered categorical response models. I …


Statistical Criteria For Selecting The Optimal Number Of Untreated Subjects Matched To Each Treated Subject When Using Many-To-One Matching On The Propensity Score, Peter C. Austin Jan 2010

Peter Austin

Propensity-score matching is increasingly being used to estimate the effects of treatments using observational data. In many-to-one (M:1) matching on the propensity score, M untreated subjects are matched to each treated subject using the propensity score. The authors used Monte Carlo simulations to examine the effect of the choice of M on the statistical performance of matched estimators. They considered matching 1–5 untreated subjects to each treated subject using both nearest-neighbor matching and caliper matching in 96 different scenarios. Increasing the number of untreated subjects matched to each treated subject tended to increase the bias in the estimated treatment effect; …
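To make the M:1 matching idea concrete, here is a minimal sketch of greedy nearest-neighbor matching without replacement, with an optional caliper. It assumes the propensity scores have already been estimated, and it applies the caliper on the raw score scale. The greedy processing order, matching without replacement, and the names match_m_to_one, treated and untreated are all assumptions of this example and are not taken from the paper.

```python
def match_m_to_one(treated, untreated, m=1, caliper=None):
    """Greedy M:1 nearest-neighbor matching on the propensity score.

    treated, untreated: dicts mapping subject id -> estimated propensity score.
    m: maximum number of untreated subjects matched to each treated subject.
    caliper: if given, only accept matches within this absolute score distance.
    Returns a dict mapping each treated id to its list of matched untreated ids.
    Untreated subjects are used at most once (matching without replacement).
    """
    available = dict(untreated)  # copy so the caller's data is not modified
    matches = {}
    for t_id, t_score in treated.items():
        # Rank the remaining untreated subjects by distance to this treated subject.
        ranked = sorted(available, key=lambda u_id: abs(available[u_id] - t_score))
        chosen = []
        for u_id in ranked:
            if len(chosen) == m:
                break
            if caliper is not None and abs(available[u_id] - t_score) > caliper:
                break  # all later candidates are at least as far away
            chosen.append(u_id)
        for u_id in chosen:
            del available[u_id]
        matches[t_id] = chosen
    return matches


if __name__ == "__main__":
    treated = {"t1": 0.30, "t2": 0.70}
    untreated = {"u1": 0.28, "u2": 0.35, "u3": 0.66, "u4": 0.90, "u5": 0.10}
    print(match_m_to_one(treated, untreated, m=2))
    print(match_m_to_one(treated, untreated, m=2, caliper=0.1))
```

With a caliper, some treated subjects may receive fewer than M matches, which is one way the choice of M and the matching rule interact.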


The Performance Of Different Propensity-Score Methods For Estimating Differences In Proportions (Risk Differences Or Absolute Risk Reductions) In Observational Studies, Peter C. Austin Jan 2010

Peter Austin

Propensity score methods are increasingly being used to estimate the effects of treatments on health outcomes using observational data. There are four methods for using the propensity score to estimate treatment effects: covariate adjustment using the propensity score, stratification on the propensity score, propensity-score matching, and inverse probability of treatment weighting (IPTW) using the propensity score. When outcomes are binary, the effect of treatment on the outcome can be described using odds ratios, relative risks, risk differences, or the number needed to treat. Several clinical commentators suggested that risk differences and numbers needed to treat are more meaningful for clinical …
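The four effect measures named above follow from simple arithmetic on two outcome probabilities. The sketch below computes them for a treated and a control group. The function name, argument names and example probabilities are hypothetical, and the code says nothing about how the paper estimates these quantities from propensity-score methods.

```python
import math


def effect_measures(p_treated, p_control):
    """Summarise a binary-outcome treatment effect from two outcome probabilities.

    Returns a dict with the risk difference, relative risk, odds ratio and
    number needed to treat (1 / |risk difference|, infinite if there is no effect).
    Both probabilities must lie strictly between 0 and 1.
    """
    for p in (p_treated, p_control):
        if not 0.0 < p < 1.0:
            raise ValueError("probabilities must lie strictly between 0 and 1")
    risk_difference = p_treated - p_control
    relative_risk = p_treated / p_control
    odds_ratio = (p_treated / (1.0 - p_treated)) / (p_control / (1.0 - p_control))
    nnt = math.inf if risk_difference == 0 else 1.0 / abs(risk_difference)
    return {
        "risk_difference": risk_difference,
        "relative_risk": relative_risk,
        "odds_ratio": odds_ratio,
        "number_needed_to_treat": nnt,
    }


if __name__ == "__main__":
    # Hypothetical example: 10% event rate under treatment, 20% under control.
    print(effect_measures(0.10, 0.20))
```

In this hypothetical example the relative risk is 0.5 while the risk difference is -0.10, so about 10 subjects would need to be treated to prevent one event.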


Existence Of Traveling Wave Solutions For A Nonlocal Reaction-Diffusion Model Of Influenza A Drift, Joaquin Riviera, Yi Li Jan 2010

Yi Li

In this paper we discuss the existence of traveling wave solutions for a nonlocal reaction-diffusion model of Influenza A proposed in Lin et al. (2003). The proof for the existence of the traveling wave takes advantage of the different time scales between the evolution of the disease and the progress of the disease in the population. Under this framework we are able to use the techniques from geometric singular perturbation theory to prove the existence of the traveling wave.


Imputation Procedures For American Community Survey Group Quarters Small Area Estimation, Chandra Erdman, Chaitra Nagaraja Dec 2009

Chaitra H Nagaraja

No abstract provided.


The Effect Of Salvage Therapy On Survival In A Longitudinal Study With Treatment By Indication, Edward Kennedy, Jeremy Taylor, Douglas Schaubel, Scott Williams Dec 2009

Edward H. Kennedy

We consider using observational data to estimate the effect of a treatment on disease recurrence, when the decision to initiate treatment is based on longitudinal factors associated with the risk of recurrence. The effect of salvage androgen deprivation therapy (SADT) on the risk of recurrence of prostate cancer is inadequately described by the existing literature. Furthermore, standard Cox regression yields biased estimates of the effect of SADT, since it is necessary to adjust for prostate-specific antigen (PSA), which is a time-dependent confounder and an intermediate variable. In this paper, we describe and compare two methods which appropriately adjust for PSA …


Fast Function-On-Scalar Regression With Penalized Basis Expansions, Philip T. Reiss, Lei Huang, Maarten Mennes Dec 2009

Lei Huang

Regression models for functional responses and scalar predictors are often fitted by means of basis functions, with quadratic roughness penalties applied to avoid overfitting. The fitting approach described by Ramsay and Silverman in the 1990s amounts to a penalized ordinary least squares (P-OLS) estimator of the coefficient functions. We recast this estimator as a generalized ridge regression estimator, and present a penalized generalized least squares (P-GLS) alternative. We describe algorithms by which both estimators can be implemented, with automatic selection of optimal smoothing parameters, in a more computationally efficient manner than has heretofore been available. We discuss pointwise confidence intervals …


The 1905 Einstein Equation In A General Mathematical Analysis Model Of Quasars, Byron E. Bell Dec 2009

Byron E. Bell

No abstract provided.


Bayesian Inference For A Periodic Stochastic Volatility Model Of Intraday Electricity Prices, Michael S. Smith Dec 2009

Michael Stanley Smith

The Gaussian stochastic volatility model is extended to allow for periodic autoregressions (PAR) in both the level and log-volatility process. Each PAR is represented as a first order vector autoregression for a longitudinal vector of length equal to the period. The periodic stochastic volatility model is therefore expressed as a multivariate stochastic volatility model. Bayesian posterior inference is computed using a Markov chain Monte Carlo scheme for the multivariate representation. A circular prior that exploits the periodicity is suggested for the log-variance of the log-volatilities. The approach is applied to estimate a periodic stochastic volatility model for half-hourly electricity prices …


Bayesian Skew Selection For Multivariate Models, Michael S. Smith, Anastasios Panagiotelis Dec 2009

Michael Stanley Smith

We develop a Bayesian approach for the selection of skew in multivariate skew t distributions constructed through hidden conditioning in the manners suggested by either Azzalini and Capitanio (2003) or Sahu, Dey and Branco (2003). We show that the skew coefficients for each margin are the same for the standardized versions of both distributions. We introduce binary indicators to denote whether there is symmetry, or skew, in each dimension. We adopt a proper beta prior on each non-zero skew coefficient, and derive the corresponding prior on the skew parameters. In both distributions we show that as the degrees of freedom increases, …


Using Twitter Hash Tags To Demonstrate Basic Concepts From Network Analysis, Matt Bogard Dec 2009

Matt Bogard

Social Network Analysis focuses on finding patterns in interactions between people or entities. These patterns may be described in the form of a network. Network analysis in general has many applications including models of student integration and persistence, business to business supply chains, terrorist cells, or analysis of social media such as Facebook and Twitter. This presentation provides a reference for basic concepts from social network analysis with examples using tweets from Twitter.


Fast Function-On-Scalar Regression With Penalized Basis Expansions, Philip T. Reiss, Lei Huang, Maarten Mennes Dec 2009

Philip T. Reiss

Regression models for functional responses and scalar predictors are often fitted by means of basis functions, with quadratic roughness penalties applied to avoid overfitting. The fitting approach described by Ramsay and Silverman in the 1990s amounts to a penalized ordinary least squares (P-OLS) estimator of the coefficient functions. We recast this estimator as a generalized ridge regression estimator, and present a penalized generalized least squares (P-GLS) alternative. We describe algorithms by which both estimators can be implemented, with automatic selection of optimal smoothing parameters, in a more computationally efficient manner than has heretofore been available. We discuss pointwise confidence intervals …