Open Access. Powered by Scholars. Published by Universities.®

Selected Works

Discipline
Institution
Keyword
Publication Year
Publication
File Type

Articles 1 - 26 of 26

Full-Text Articles in Other Statistics and Probability

Greek New Testament For Data Analysis, Keith L. Yoder Dec 2016

Greek New Testament For Data Analysis, Keith L. Yoder

Keith L. Yoder

Updated: 18 December 2017
This Excel data file (compatible with Excel 2007 and later versions) is an extract of my working Greek New Testament database which I use for statistical and data analysis. It originated in the early 2000's from UBS3 data files in beta code I obtained from CCAT, and has since been evolving through countless changes and corrections. A flat-file table display such as Excel 2007+ is the best format suitable for Autofilter and VBA applications, without involving a more complex XML format. The file itself may be opened with Excel 2007 or later versions, or with the …


Performance-Constrained Binary Classification Using Ensemble Learning: An Application To Cost-Efficient Targeted Prep Strategies, Wenjing Zheng, Laura Balzer, Maya L. Petersen, Mark J. Van Der Laan Oct 2016

Performance-Constrained Binary Classification Using Ensemble Learning: An Application To Cost-Efficient Targeted Prep Strategies, Wenjing Zheng, Laura Balzer, Maya L. Petersen, Mark J. Van Der Laan

Laura B. Balzer

Binary classifications problems are ubiquitous in health and social science applications. In many cases, one wishes to balance two conflicting criteria for an optimal binary classifier. For instance, in resource-limited settings, an HIV prevention program based on offering Pre-Exposure Prophylaxis (PrEP) to select high-risk individuals must balance the sensitivity of the binary classifier in detecting future seroconverters (and hence offering them PrEP regimens) with the total number of PrEP regimens that is financially and logistically feasible for the program to deliver. In this article, we consider a general class of performance-constrained binary classification problems wherein the objective function and the …


The Expected Total Curvature Of Random Polygons, Jason Cantarella, Alexander Y. Grosberg, Robert Kusner, Clayton Shonkwiler Apr 2015

The Expected Total Curvature Of Random Polygons, Jason Cantarella, Alexander Y. Grosberg, Robert Kusner, Clayton Shonkwiler

Robert Kusner

We consider the expected value for the total curvature of a random closed polygon. Numerical experiments have suggested that as the number of edges becomes large, the difference between the expected total curvature of a random closed polygon and a random open polygon with the same number of turning angles approaches a positive constant. We show that this is true for a natural class of probability measures on polygons, and give a formula for the constant in terms of the moments of the edgelength distribution.

We then consider the symmetric measure on closed polygons of fixed total length constructed by …


Depicting Estimates Using The Intercept In Meta-Regression Models: The Moving Constant Technique, Blair T. Johnson Dr., Tania B. Huedo-Medina Dr. Aug 2014

Depicting Estimates Using The Intercept In Meta-Regression Models: The Moving Constant Technique, Blair T. Johnson Dr., Tania B. Huedo-Medina Dr.

Blair T. Johnson

In any scientific discipline, the ability to portray research patterns graphically often aids greatly in interpreting a phenomenon. In part to depict phenomena, the statistics and capabilities of meta-analytic models have grown increasingly sophisticated. Accordingly, this article details how to move the constant in weighted meta-analysis regression models (viz. “meta-regression”) to illuminate the patterns in such models across a range of complexities. Although it is commonly ignored in practice, the constant (or intercept) in such models can be indispensible when it is not relegated to its usual static role. The moving constant technique makes possible estimates and confidence intervals at …


A Comparison Of Periodic Autoregressive And Dynamic Factor Models In Intraday Energy Demand Forecasting, Thomas Mestekemper, Goeran Kauermann, Michael Smith Dec 2012

A Comparison Of Periodic Autoregressive And Dynamic Factor Models In Intraday Energy Demand Forecasting, Thomas Mestekemper, Goeran Kauermann, Michael Smith

Michael Stanley Smith

We suggest a new approach for forecasting energy demand at an intraday resolution. Demand in each intraday period is modeled using semiparametric regression smoothing to account for calendar and weather components. Residual serial dependence is captured by one of two multivariate stationary time series models, with dimension equal to the number of intraday periods. These are a periodic autoregression and a dynamic factor model. We show the benefits of our approach in the forecasting of district heating demand in a steam network in Germany and aggregate electricity demand in the state of Victoria, Australia. In both studies, accounting for weather …


Bayesian Approaches To Copula Modelling, Michael S. Smith Dec 2012

Bayesian Approaches To Copula Modelling, Michael S. Smith

Michael Stanley Smith

Copula models have become one of the most widely used tools in the applied modelling of multivariate data. Similarly, Bayesian methods are increasingly used to obtain efficient likelihood-based inference. However, to date, there has been only limited use of Bayesian approaches in the formulation and estimation of copula models. This article aims to address this shortcoming in two ways. First, to introduce copula models and aspects of copula theory that are especially relevant for a Bayesian analysis. Second, to outline Bayesian approaches to formulating and estimating copula models, and their advantages over alternative methods. Copulas covered include Archimedean, copulas constructed …


The Dirty “S” Word: Innovative Teaching Techniques For Counselor Educators Facilitating Learning In Statistics And Research, Rebecca L. Tadlock-Marlo, Megan Michalak Jan 2012

The Dirty “S” Word: Innovative Teaching Techniques For Counselor Educators Facilitating Learning In Statistics And Research, Rebecca L. Tadlock-Marlo, Megan Michalak

Rebecca L Tadlock-Marlo

Innovative pedagogy will be presented and discussed to help make research a less painful class to both teach and learn. Foci include teaching methods, potential assignments, and suggestions for activities to help facilitate a more fluid learning process for counselors. Attendees will explore aspects of helping students overcome their fear of both statistics and research.


Modeling Dependence Using Skew T Copulas: Bayesian Inference And Applications, Michael S. Smith, Quan Gan, Robert Kohn Dec 2011

Modeling Dependence Using Skew T Copulas: Bayesian Inference And Applications, Michael S. Smith, Quan Gan, Robert Kohn

Michael Stanley Smith

[THIS IS AN AUGUST 2010 REVISION THAT REPLACES ALL PREVIOUS VERSIONS.]

We construct a copula from the skew t distribution of Sahu, Dey & Branco (2003). This copula can capture asymmetric and extreme dependence between variables, and is one of the few copulas that can do so and still be used in high dimensions effectively. However, it is difficult to estimate the copula model by maximum likelihood when the multivariate dimension is high, or when some or all of the marginal distributions are discrete-valued, or when the parameters in the marginal distributions and copula are estimated jointly. We therefore propose …


Estimation Of Copula Models With Discrete Margins Via Bayesian Data Augmentation, Michael S. Smith, Mohamad A. Khaled Dec 2011

Estimation Of Copula Models With Discrete Margins Via Bayesian Data Augmentation, Michael S. Smith, Mohamad A. Khaled

Michael Stanley Smith

Estimation of copula models with discrete margins is known to be difficult beyond the bivariate case. We show how this can be achieved by augmenting the likelihood with latent variables, and computing inference using the resulting augmented posterior. To evaluate this we propose two efficient Markov chain Monte Carlo sampling schemes. One generates the latent variables as a block using a Metropolis-Hasting step with a proposal that is close to its target distribution, the other generates them one at a time. Our method applies to all parametric copulas where the conditional copula functions can be evaluated, not just elliptical copulas …


Statistical Trends In Women's Participation In Science: Commentary On Valla And Ceci, Cathy Kessel, Donna J. Nelson Jan 2011

Statistical Trends In Women's Participation In Science: Commentary On Valla And Ceci, Cathy Kessel, Donna J. Nelson

Cathy Kessel

“Can Sex Differences in Science Be Tied to the Long Reach of Prenatal Hormones?” describes women's participation in science and claims that the increases in quantitative fields (e.g., mathematics, engineering, and physical sciences) are an exception to the statement that “Women’s growth in the scientific workforce has been meteoric over the past 40 years.” We disagree and present statistics demonstrating a more positive view of the increase of the number of women in quantitative fields.


Rejoinder: Estimation Issues For Copulas Applied To Marketing Data, Peter Danaher, Michael Smith Dec 2010

Rejoinder: Estimation Issues For Copulas Applied To Marketing Data, Peter Danaher, Michael Smith

Michael Stanley Smith

Estimating copula models using Bayesian methods presents some subtle challenges, ranging from specification of the prior to computational tractability. There is also some debate about what is the most appropriate copula to employ from those available. We address these issues here and conclude by discussing further applications of copula models in marketing.


Forecasting Television Ratings, Peter Danaher, Tracey Dagger, Michael Smith Dec 2010

Forecasting Television Ratings, Peter Danaher, Tracey Dagger, Michael Smith

Michael Stanley Smith

Despite the state of flux in media today, television remains the dominant player globally for advertising spend. Since television advertising time is purchased on the basis of projected future ratings, and ad costs have skyrocketed, there is increasing pressure to forecast television ratings accurately. Previous forecasting methods are not generally very reliable and many have not been validated, but more distressingly, none have been tested in today’s multichannel environment. In this study we compare 8 different forecasting models, ranging from a naïve empirical method to a state-of-the-art Bayesian model-averaging method. Our data come from a recent time period, 2004-2008 in …


Windows Executable For Gaussian Copula With Nbd Margins, Michael S. Smith Dec 2010

Windows Executable For Gaussian Copula With Nbd Margins, Michael S. Smith

Michael Stanley Smith

This is an example Windows 32bit program to estimate a Gaussian copula model with NBD margins. The margins are estimated first using MLE, and the copula second using Bayesian MCMC. The model was discussed in Danaher & Smith (2011; Marketing Science) as example 4 (section 4.2).


Bicycle Commuting In Melbourne During The 2000s Energy Crisis: A Semiparametric Analysis Of Intraday Volumes, Michael S. Smith, Goeran Kauermann Dec 2010

Bicycle Commuting In Melbourne During The 2000s Energy Crisis: A Semiparametric Analysis Of Intraday Volumes, Michael S. Smith, Goeran Kauermann

Michael Stanley Smith

Cycling is attracting renewed attention as a mode of transport in western urban environments, yet the determinants of usage are poorly understood. In this paper we investigate some of these using intraday bicycle volumes collected via induction loops located at ten bike paths in the city of Melbourne, Australia, between December 2005 and June 2008. The data are hourly counts at each location, with temporal and spatial disaggregation allowing for the impact of meteorology to be measured accurately for the first time. Moreover, during this period petrol prices varied dramatically and the data also provide a unique opportunity to assess …


Modeling Longitudinal Data Using A Pair-Copula Decomposition Of Serial Dependence, Michael S. Smith, Aleksey Min, Carlos Almeida, Claudia Czado Nov 2010

Modeling Longitudinal Data Using A Pair-Copula Decomposition Of Serial Dependence, Michael S. Smith, Aleksey Min, Carlos Almeida, Claudia Czado

Michael Stanley Smith

Copulas have proven to be very successful tools for the flexible modelling of cross-sectional dependence. In this paper we express the dependence structure of continuous-valued time series data using a sequence of bivariate copulas. This corresponds to a type of decomposition recently called a ‘vine’ in the graphical models literature, where each copula is entitled a ‘pair-copula’. We propose a Bayesian approach for the estimation of this dependence structure for longitudinal data. Bayesian selection ideas are used to identify any independence pair-copulas, with the end result being a parsimonious representation of a time-inhomogeneous Markov process of varying order. Estimates are …


Bayesian Inference For A Periodic Stochastic Volatility Model Of Intraday Electricity Prices, Michael S. Smith Dec 2009

Bayesian Inference For A Periodic Stochastic Volatility Model Of Intraday Electricity Prices, Michael S. Smith

Michael Stanley Smith

The Gaussian stochastic volatility model is extended to allow for periodic autoregressions (PAR) in both the level and log-volatility process. Each PAR is represented as a first order vector autoregression for a longitudinal vector of length equal to the period. The periodic stochastic volatility model is therefore expressed as a multivariate stochastic volatility model. Bayesian posterior inference is computed using a Markov chain Monte Carlo scheme for the multivariate representation. A circular prior that exploits the periodicity is suggested for the log-variance of the log-volatilities. The approach is applied to estimate a periodic stochastic volatility model for half-hourly electricity prices …


Bayesian Skew Selection For Multivariate Models, Michael S. Smith, Anastasios Panagiotelis Dec 2009

Bayesian Skew Selection For Multivariate Models, Michael S. Smith, Anastasios Panagiotelis

Michael Stanley Smith

We develop a Bayesian approach for the selection of skew in multivariate skew t distributions constructed through hidden conditioning in the manners suggested by either Azzalini and Capitanio (2003) or Sahu, Dey and Branco~(2003). We show that the skew coefficients for each margin are the same for the standardized versions of both distributions. We introduce binary indicators to denote whether there is symmetry, or skew, in each dimension. We adopt a proper beta prior on each non-zero skew coefficient, and derive the corresponding prior on the skew parameters. In both distributions we show that as the degrees of freedom increases, …


Time Series Analysis, Yogendra Sisodia Jul 2009

Time Series Analysis, Yogendra Sisodia

Yogendra Sisodia

Time Series Analysis using Project R. ARMA Model on Nifty Spot, Nifty Futures and MCX Comdex.


A Plotless Density Estimator Based On The Asymptotic Limit Of Ordered Distance Estimation Values, Barry J. Kronenfeld Jan 2009

A Plotless Density Estimator Based On The Asymptotic Limit Of Ordered Distance Estimation Values, Barry J. Kronenfeld

Barry J. Kronenfeld

Estimation of tree density from point-tree distances is an attractive option for quick inventory of new sites, but estimators that are unbiased in clustered and dispersed situations have not been found. Noting that bias of an estimator derived from distances to the kth nearest neighbor from a random point tends to decrease with increasing k, a method is proposed for estimating the limit of an asymptotic function through a set of ordered distance estimators. A standard asymptotic model is derived from the limiting case of a clustered distribution. The proposed estimator is evaluated against 13 types of simulated generating processes, …


Notes On The Two-Body Problem, Cathy Kessel Jan 2009

Notes On The Two-Body Problem, Cathy Kessel

Cathy Kessel

No abstract provided.


Technical Note: Comparative Static Analysis Of Information Value In A Canonical Decision Problem, Jeffrey Keisler Sep 2004

Technical Note: Comparative Static Analysis Of Information Value In A Canonical Decision Problem, Jeffrey Keisler

Jeffrey Keisler

To gain insight into the behavior of the value of information, this paper identifies specific rules for a canonical decision problem: the two-act linear loss decision with normal prior probability distributions. Conditions are derived for which the expected value of perfect information increases when mean and standard deviation are both linear functions of an exogenous variable. A variety of richer decision problems can be adapted to the problem, so that the general results obtained here can be immediately applied to understand drivers of information value.


Additive Nonparametric Regression With Autocorrelated Errors, Michael S. Smith, C Wong, Robert Kohn Dec 1997

Additive Nonparametric Regression With Autocorrelated Errors, Michael S. Smith, C Wong, Robert Kohn

Michael Stanley Smith

A Bayesian approach is presented for nonparametric estimation of an additive regression model with autocorrelated errors. Each of the potentially nonlinear components is modelled as a regression spline using many knots, while the errors are modelled by a high order stationary autoregressive process parameterised in terms of its autocorrelations. The distribution of significant knots and partial autocorrelations is accounted for using subset selection. Our approach also allows the selection of a suitable transformation of the dependent variable. All aspects of the model are estimated simultaneously using Markov chain Monte Carlo. It is shown empirically that the proposed approach works well …


A Bayesian Approach To Bivariate Nonparametric Regression, Michael Smith, Robert Kohn Dec 1996

A Bayesian Approach To Bivariate Nonparametric Regression, Michael Smith, Robert Kohn

Michael Stanley Smith

No abstract provided.


Nonparametric Regression Using Bayesian Variable Selection, Michael Smith, Robert Kohn Dec 1995

Nonparametric Regression Using Bayesian Variable Selection, Michael Smith, Robert Kohn

Michael Stanley Smith

No abstract provided.


Finite Sample Performance Of Robust Bayesian Regression, Michael Smith, Sheather Simon, Kohn Robert Dec 1995

Finite Sample Performance Of Robust Bayesian Regression, Michael Smith, Sheather Simon, Kohn Robert

Michael Stanley Smith

No abstract provided.


A Bayesian Approach To Additive Nonparametric Regression, Michael S. Smith, Robert Kohn Dec 1993

A Bayesian Approach To Additive Nonparametric Regression, Michael S. Smith, Robert Kohn

Michael Stanley Smith

This proceedings paper was the first to suggest using a Gaussian g-prior combined with a point mass to undertake Bayesian variable selection in a Gaussian linear regression model. It also was the first to suggest integrating out the regression parameters and variance in closed form, resulting in an efficient Gibbs sampling scheme. The idea was applied to estimate regression functions in an additive model by using a linear basis expansion for each component function in an additive model. The conference proceeding was eventually published in a slightly tighter form in Journal of Econometrics (1996).