Multiple Random Oracles Are Better Than One, 2010 University of Pennsylvania

#### Multiple Random Oracles Are Better Than One, Jan Arpe, Elchanan Mossel

*Statistics Papers*

We study the problem of learning *k*-juntas given access to examples drawn from a number of different product distributions. Thus we wish to learn a function *f*: {−1, 1}^{n} → {−1, 1} that depends on *k* (unknown) coordinates. While the best-known algorithms for the general problem of learning a *k*-junta require running times of *n ^{k}* poly(

*n*, 2

^{k}), we show that, given access to

*k*different product distributions with biases separated by γ > 0, the functions may be learned in time poly(

*n*, 2

^{k}, γ

^{−k}). More generally, given access to

*t*≤

*k*different product ...

Software Internationalization: A Framework Validated Against Industry Requirements For Computer Science And Software Engineering Programs, 2010 California Polytechnic State University - San Luis Obispo

#### Software Internationalization: A Framework Validated Against Industry Requirements For Computer Science And Software Engineering Programs, John Huân Vũ

*Master's Theses and Project Reports*

View John Huân Vũ's thesis presentation at http://youtu.be/y3bzNmkTr-c.

In 2001, the ACM and IEEE Computing Curriculum stated that it was necessary to address "the need to develop implementation models that are international in scope and could be practiced in universities around the world." With increasing connectivity through the internet, the move towards a global economy and growing use of technology places software internationalization as a more important concern for developers. However, there has been a "clear shortage in terms of numbers of trained persons applying for entry-level positions" in this area. Eric Brechner, Director of Microsoft ...

Dynamic Model Pooling Methodology For Improving Aberration Detection Algorithms, 2010 University of Massachusetts Amherst

#### Dynamic Model Pooling Methodology For Improving Aberration Detection Algorithms, Brenton J. Sellati

*Masters Theses 1911 - February 2014*

Syndromic surveillance is defined generally as the collection and statistical analysis of data which are believed to be leading indicators for the presence of deleterious activities developing within a system. Conceptually, syndromic surveillance can be applied to any discipline in which it is important to know when external influences manifest themselves in a system by forcing it to depart from its baseline. Comparing syndromic surveillance systems have led to mixed results, where models that dominate in one performance metric are often sorely deficient in another. This results in a zero-sum trade off where one performance metric must be afforded greater ...

Principal Component Analysis And Biochemical Characterization Of Protein And Starch Reveal Primary Targets For Improving Sorghum Grain, 2010 University of California - Berkeley

#### Principal Component Analysis And Biochemical Characterization Of Protein And Starch Reveal Primary Targets For Improving Sorghum Grain, Joshua H. Wong, D. B. Marx, Jeff D. Wilson, Bob B. Buchanan, Peggy G. Lemaux, Jeffrey F. Pedersen

*Faculty Publications, Department of Statistics*

Limited progress has been made on genetic improvement of the digestibility of sorghum grain because of variability among different varieties. In this study, we applied multiple techniques to assess digestibility of grain from 18 sorghum lines to identify major components responsible for variability. We also identified storage proteins and enzymes as potential targets for genetic modification to improve digestibility. Results from principal component analysis revealed that content of amylose and total starch, together with protein digestibility (PD), accounted for 94% of variation in digestibility. Control of amylose content is understood and manageable. Up-regulation of genes associated with starch accumulation is ...

Reference Priors For Exponential Families With Increasing Dimension, 2010 University of Nebraska-Lincoln

#### Reference Priors For Exponential Families With Increasing Dimension, Bertrand S. Clarke, Subhashis Ghosal

*Faculty Publications, Department of Statistics*

In this article, we establish the asymptotic normality of the posterior distribution for the natural parameter in an exponential family based on independent and identically distributed data. The mode of convergence is expected Kullback-Leibler distance and the number of parameters p is increasing with the sample size n. Using this, we give an asymptotic expansion of the Shannon mutual information valid when *p* = *pn* increases at a sufficiently slow rate. The second term in the asymptotic expansion is the largest term that depends on the prior and can be optimized to give Jeffreys’ prior as the reference prior in the ...

Bayesian Inference For A Periodic Stochastic Volatility Model Of Intraday Electricity Prices, 2009 Melbourne Business School

#### Bayesian Inference For A Periodic Stochastic Volatility Model Of Intraday Electricity Prices, Michael S. Smith

*Michael Stanley Smith*

The Gaussian stochastic volatility model is extended to allow for periodic autoregressions (PAR) in both the level and log-volatility process. Each PAR is represented as a first order vector autoregression for a longitudinal vector of length equal to the period. The periodic stochastic volatility model is therefore expressed as a multivariate stochastic volatility model. Bayesian posterior inference is computed using a Markov chain Monte Carlo scheme for the multivariate representation. A circular prior that exploits the periodicity is suggested for the log-variance of the log-volatilities. The approach is applied to estimate a periodic stochastic volatility model for half-hourly electricity prices ...

Bayesian Skew Selection For Multivariate Models, 2009 Melbourne Business School

#### Bayesian Skew Selection For Multivariate Models, Michael S. Smith, Anastasios Panagiotelis

*Michael Stanley Smith*

We develop a Bayesian approach for the selection of skew in multivariate skew t distributions constructed through hidden conditioning in the manners suggested by either Azzalini and Capitanio (2003) or Sahu, Dey and Branco~(2003). We show that the skew coefficients for each margin are the same for the standardized versions of both distributions. We introduce binary indicators to denote whether there is symmetry, or skew, in each dimension. We adopt a proper beta prior on each non-zero skew coefficient, and derive the corresponding prior on the skew parameters. In both distributions we show that as the degrees of freedom ...