Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 5 of 5

Full-Text Articles in Physical Sciences and Mathematics

Penalized Functional Regression For Next-Generation Sequencing Studies, Olga A. Vsevolozhskaya Aug 2015

Penalized Functional Regression For Next-Generation Sequencing Studies, Olga A. Vsevolozhskaya

Olga A. Vsevolozhskaya

Recent technological advances equipped researchers with capabilities that go beyond traditional genotyping of loci known to be polymorphic in a general population. Genetic sequences of study participants can now be assessed directly. This capability removed technology-driven bias toward scoring predominantly common polymorphisms and let researchers reveal a wealth of rare and sample-specific variants. While the relative contributions of rare and common polymorphisms to trait variation are being debated, researchers are faced with the need for new statistical tools for simultaneous evaluation of all variants within a region. Several research groups demonstrated flexibility and good statistical power of the functional linear …


Estimated Probability Of Becoming A Case Of Drug Dependence In Relation To Duration Of Drug-Taking Experience: A Function Approach, Olga A. Vsevolozhskaya, James C. Anthony Jun 2015

Estimated Probability Of Becoming A Case Of Drug Dependence In Relation To Duration Of Drug-Taking Experience: A Function Approach, Olga A. Vsevolozhskaya, James C. Anthony

Olga A. Vsevolozhskaya

Measured as elapsed time from first use to dependence syndrome onset, the estimated 'induction interval' for cocaine clearly is short relative to the cannabis interval, but little is known about risk of becoming dependent when use persists. Published estimates for this facet of drug dependence epidemiology are from life histories elicited years after first use. To improve estimation, we turn to new data from nationally representative samples of newly incident drug users identified via probability sampling and confidential computer-assisted self-interviews for the National Surveys on Drug Use and Health, 2004-2013. Standardized modules assess first and most recent use, and dependence …


Assessing The Probability That A Finding Is Genuine For Large-Scale Genetic Association Studies, Chia-Ling Kuo, Olga A. Vsevolozhskaya, Dmitri V. Zaykin May 2015

Assessing The Probability That A Finding Is Genuine For Large-Scale Genetic Association Studies, Chia-Ling Kuo, Olga A. Vsevolozhskaya, Dmitri V. Zaykin

Olga A. Vsevolozhskaya

Genetic association studies routinely involve massive numbers of statistical tests accompanied by P-values. Whole genome sequencing technologies increased the potential number of tested variants to tens of millions. The more tests are performed, the smaller P-value is required to be deemed significant. However, a small P-value is not equivalent to small chances of a spurious finding and significance thresholds may fail to serve as efficient filters against false results. While the Bayesian approach can provide a direct assessment of the probability that a finding is spurious, its adoption in association studies has been slow, due in part to the ubiquity …


Functional Analysis Of Variance For Association Studies, Olga A. Vsevolozhskaya, Dmitri V. Zaykin, Mark C. Greenwood, Changshuai Wei, Qing Lu Sep 2014

Functional Analysis Of Variance For Association Studies, Olga A. Vsevolozhskaya, Dmitri V. Zaykin, Mark C. Greenwood, Changshuai Wei, Qing Lu

Olga A. Vsevolozhskaya

While progress has been made in identifying common genetic variants associated with human diseases, for most of common complex diseases, the identified genetic variants only account for a small proportion of heritability. Challenges remain in finding additional unknown genetic variants predisposing to complex diseases. With the advance in next-generation sequencing technologies, sequencing studies have become commonplace in genetic research. The ongoing exome-sequencing and whole-genome-sequencing studies generate a massive amount of sequencing variants and allow researchers to comprehensively investigate their role in human diseases. The discovery of new disease-associated variants can be enhanced by utilizing powerful and computationally efficient statistical methods. …


Use Of P-Values To Evaluate The Probability Of A Genuine Finding In Large-Scale Genetic Association Studies, Olga A. Vsevolozhskaya, Qing Lu, Chia-Ling Kuo, Dmitri V. Zaykin Oct 2013

Use Of P-Values To Evaluate The Probability Of A Genuine Finding In Large-Scale Genetic Association Studies, Olga A. Vsevolozhskaya, Qing Lu, Chia-Ling Kuo, Dmitri V. Zaykin

Olga A. Vsevolozhskaya

To claim the existence of an association in modern genome-wide association studies (GWAS), a nominal P-value has to exceed a stringent Bonferroni-adjusted significance level. Despite strictness of the correction, a significant P-value does not indicate high probability that the claimed association is genuine. A simple Bayesian solution -- the False Positive Report Probability (FPRP) -- was previously proposed to convert the observed P-value to the corresponding probability of no true association. Although the FPRP solution is highly popular, it does not reflect probability that a particular finding is false. Here, we offer a simple POFIG method -- a Probability that …