Open Access. Powered by Scholars. Published by Universities.®

Johns Hopkins University, Dept. of Biostatistics Working Papers

2009

Articles 1 - 2 of 2

Full-Text Articles in Numerical Analysis and Computation

Caching And Visualizing Statistical Analyses, Roger D. Peng, Duncan Temple Lang Jun 2009

Caching And Visualizing Statistical Analyses, Roger D. Peng, Duncan Temple Lang

Johns Hopkins University, Dept. of Biostatistics Working Papers

We present the cacher and CodeDepends packages for R, which provide tools for (1) caching and analyzing the code for statistical analyses and (2) distributing these analyses to others in an efficient manner over the web. The cacher package takes objects created by evaluating R expressions and stores them in key-value databases. These databases of cached objects can subsequently be assembled into “cache packages” for distribution over the web. The cacher package also provides tools to help readers examine the data and code in a statistical analysis and reproduce, modify, or improve upon the results. In addition, readers can easily …


Efficient Evaluation Of Ranking Procedures When The Number Of Units Is Large With Application To Snp Identification, Thomas A. Louis, Ingo Ruczinski Feb 2009

Efficient Evaluation Of Ranking Procedures When The Number Of Units Is Large With Application To Snp Identification, Thomas A. Louis, Ingo Ruczinski

Johns Hopkins University, Dept. of Biostatistics Working Papers

Simulation-based assessment is a popular and frequently necessary approach to evaluation of statistical procedures. Sometimes overlooked is the ability to take advantage of underlying mathematical relations and we focus on this aspect. We show how to take advantage of large-sample theory when conducting a simulation using the analysis of genomic data as a motivating example. The approach uses convergence results to provide an approximation to smaller-sample results, results that are available only by simulation. We consider evaluating and comparing a variety of ranking-based methods for identifying the most highly associated SNPs in a genome-wide association study, derive integral equation representations …