Open Access. Powered by Scholars. Published by Universities.®

Bioinformatics Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 11 of 11

Full-Text Articles in Bioinformatics

Evaluating The Impact Of Genotype Errors On Rare Variant Tests Of Association, Kaitlyn Cook, Alejandra Benitez, Casey Fu, Nathan L. Tintle Apr 2014

Evaluating The Impact Of Genotype Errors On Rare Variant Tests Of Association, Kaitlyn Cook, Alejandra Benitez, Casey Fu, Nathan L. Tintle

Faculty Work Comprehensive List

The new class of rare variant tests has usually been evaluated assuming perfect genotype information. In reality, rare variant genotypes may be incorrect, and so rare variant tests should be robust to imperfect data. Errors and uncertainty in SNP genotyping are already known to dramatically impact statistical power for single marker tests on common variants and, in some cases, inflate the type I error rate. Recent results show that uncertainty in genotype calls derived from sequencing reads are dependent on several factors, including read depth, calling algorithm, number of alleles present in the sample, and the frequency at which an …


Assessing Methods For Assigning Snps To Genes In Gene-Based Tests Of Association Using Common Variants, Ashley Petersen, Carolina Alvarez, Scott Declaire, Nathan L. Tintle May 2013

Assessing Methods For Assigning Snps To Genes In Gene-Based Tests Of Association Using Common Variants, Ashley Petersen, Carolina Alvarez, Scott Declaire, Nathan L. Tintle

Faculty Work Comprehensive List

Gene-based tests of association are frequently applied to common SNPs (MAF>5%) as an alternative to single-marker tests. In this analysis we conduct a variety of simulation studies applied to five popular gene-based tests investigating general trends related to their performance in realistic situations. In particular, we focus on the impact of non-causal SNPs and a variety of LD structures on the behavior of these tests. Ultimately, we find that non-causal SNPs can significantly impact the power of all gene-based tests. On average, we find that the “noise” from 6–12 non-causal SNPs will cancel out the “signal” of one causal …


Geometric Framework For Evaluating Rare Variant Tests Of Association, Keli Liu, Shannon Fast, Matthew Zawistowski, Nathan L. Tintle May 2013

Geometric Framework For Evaluating Rare Variant Tests Of Association, Keli Liu, Shannon Fast, Matthew Zawistowski, Nathan L. Tintle

Faculty Work Comprehensive List

The wave of next-generation sequencing data has arrived. However, many questions still remain about how to best analyze sequence data, particularly the contribution of rare genetic variants to human disease. Numerous statistical methods have been proposed to aggregate association signals across multiple rare variant sites in an effort to increase statistical power; however, the precise relation between the tests is often not well understood. We present a geometric representation for rare variant data in which rare allele counts in case and control samples are treated as vectors in Euclidean space. The geometric framework facilitates a rigorous classification of existing rare …


Optimal Methods For Using Posterior Probabilities In Association Testing, Keli Liu, Alexander Luedtke, Nathan L. Tintle May 2013

Optimal Methods For Using Posterior Probabilities In Association Testing, Keli Liu, Alexander Luedtke, Nathan L. Tintle

Faculty Work Comprehensive List

Objective: The use of haplotypes to impute the genotypes of unmeasured single nucleotide variants continues to rise in popularity. Simulation results suggest that the use of the dosage as a one-dimensional summary statistic of imputation posterior probabilities may be optimal both in terms of statistical power and computational efficiency; however, little theoretical understanding is available to explain and unify these simulation results. In our analysis, we provide a theoretical foundation for the use of the dosage as a one-dimensional summary statistic of genotype posterior probabilities from any technology. Methods: We analytically evaluate the dosage, mode and the more general set …


Assessing The Impact Of Differential Genotyping Errors On Rare Variant Tests Of Association, Morgan Mayer-Jochimsen, Shannon Fast, Nathan L. Tintle Mar 2013

Assessing The Impact Of Differential Genotyping Errors On Rare Variant Tests Of Association, Morgan Mayer-Jochimsen, Shannon Fast, Nathan L. Tintle

Faculty Work Comprehensive List

Genotyping errors are well-known to impact the power and type I error rate in single marker tests of association. Genotyping errors that happen according to the same process in cases and controls are known as non-differential genotyping errors, whereas genotyping errors that occur with different processes in the cases and controls are known as differential genotype errors. For single marker tests, non-differential genotyping errors reduce power, while differential genotyping errors increase the type I error rate. However, little is known about the behavior of the new generation of rare variant tests of association in the presence of genotyping errors. In …


Assessing The Impact Of Non-Differential Genotyping Errors On Rare Variant Tests Of Association, Scott Powers, Shyam Gopalakrishnan, Nathan L. Tintle Nov 2011

Assessing The Impact Of Non-Differential Genotyping Errors On Rare Variant Tests Of Association, Scott Powers, Shyam Gopalakrishnan, Nathan L. Tintle

Faculty Work Comprehensive List

Background/Aims: We aim to quantify the effect of non-differential genotyping errors on the power of rare variant tests and identify those situations when genotyping errors are most harmful. Methods: We simulated genotype and phenotype data for a range of sample sizes, minor allele frequencies, disease relative risks and numbers of rare variants. Genotype errors were then simulated using five different error models covering a wide range of error rates. Results: Even at very low error rates, misclassifying a common homozygote as a heterozygote translates into a substantial loss of power, a result that is exacerbated even further as the minor …


Identifying Rare Variants From Exome Scans: The Gaw17 Experience, Saurabh Ghosh, Heike Bickeboller, Julia Bailey, Joan E. Bailey-Wilson, Rita Cantor, Robert Culverhouse, Warwick Daw, Anita L. Destefano, Corinne D. Engelman, Anthony Hinrichs, Jeanine Houwing-Duistermaat, Inke R. Konig, Jack Kent, Nan Laird, Nathan Pankratz, Andrew Paterson, Elizabeth Pugh, Brian Suarez, Yan Sun, Alun Thomas, Nathan L. Tintle, Xiaofeng Zhu, Andreas Ziegler, Jean W. Maccluer, Laura Almasy Jan 2011

Identifying Rare Variants From Exome Scans: The Gaw17 Experience, Saurabh Ghosh, Heike Bickeboller, Julia Bailey, Joan E. Bailey-Wilson, Rita Cantor, Robert Culverhouse, Warwick Daw, Anita L. Destefano, Corinne D. Engelman, Anthony Hinrichs, Jeanine Houwing-Duistermaat, Inke R. Konig, Jack Kent, Nan Laird, Nathan Pankratz, Andrew Paterson, Elizabeth Pugh, Brian Suarez, Yan Sun, Alun Thomas, Nathan L. Tintle, Xiaofeng Zhu, Andreas Ziegler, Jean W. Maccluer, Laura Almasy

Faculty Work Comprehensive List

Genetic Analysis Workshop 17 (GAW17) provided a platform for evaluating existing statistical genetic methods and for developing novel methods to analyze rare variants that modulate complex traits. In this article, we present an overview of the 1000 Genomes Project exome data and simulated phenotype data that were distributed to GAW17 participants for analyses, the different issues addressed by the participants, and the process of preparation of manuscripts resulting from the discussions during the workshop


Evaluating Methods For The Analysis Of Rare Variants In Sequence Data, Alexander Luedtke, Scott Powers, Ashley Petersen, Alexandra Sitarik, Airat Bekmetjev, Nathan L. Tintle Jan 2011

Evaluating Methods For The Analysis Of Rare Variants In Sequence Data, Alexander Luedtke, Scott Powers, Ashley Petersen, Alexandra Sitarik, Airat Bekmetjev, Nathan L. Tintle

Faculty Work Comprehensive List

A number of rare variant statistical methods have been proposed for analysis of the impending wave of next-generation sequencing data. To date, there are few direct comparisons of these methods on real sequence data. Furthermore, there is a strong need for practical advice on the proper analytic strategies for rare variant analysis. We compare four recently proposed rare variant methods (combined multivariate and collapsing, weighted sum, proportion regression, and cumulative minor allele test) on simulated phenotype and next-generation sequencing data as part of Genetic Analysis Workshop 17. Overall, we find that all analyzed methods have serious practical limitations on identifying …


Inflated Type I Error Rates When Using Aggregation Methods To Analyze Rare Variants In The 1000 Genomes Project Exon Sequencing Data In Unrelated Individuals: Summary Results From Group 7 At Genetic Analysis Workshop 17, Nathan L. Tintle, Hugues Aschard, Inchi Hu, Nora Nock, Haitian Wang, Elizabeth Pugh Jan 2011

Inflated Type I Error Rates When Using Aggregation Methods To Analyze Rare Variants In The 1000 Genomes Project Exon Sequencing Data In Unrelated Individuals: Summary Results From Group 7 At Genetic Analysis Workshop 17, Nathan L. Tintle, Hugues Aschard, Inchi Hu, Nora Nock, Haitian Wang, Elizabeth Pugh

Faculty Work Comprehensive List

As part of Genetic Analysis Workshop 17 (GAW17), our group considered the application of novel and standard approaches to the analysis of genotype-phenotype association in next-generation sequencing data. Our group identified a major issue in the analysis of the GAW17 next-generation sequencing data: type I error and false-positive report probability rates higher than those expected based on empirical type I error levels (as high as 90%). Two main causes emerged: population stratification and long-range correlation (gametic phase disequilibrium) between rare variants. Population stratification was expected because of the diverse sample. Correlation between rare variants was attributable to both random causes …


Identification Of Genetic Association Of Multiple Rare Variants Using Collapsing Methods, Yan V. Sun, Yun Ju Sung, Nathan L. Tintle, Andreas Ziegler Jan 2011

Identification Of Genetic Association Of Multiple Rare Variants Using Collapsing Methods, Yan V. Sun, Yun Ju Sung, Nathan L. Tintle, Andreas Ziegler

Faculty Work Comprehensive List

Next-generation sequencing technology allows investigation of both common and rare variants in humans. Exomes are sequenced on the population level or in families to further study the genetics of human diseases. Genetic Analysis Workshop 17 (GAW17) provided exomic data from the 1000 Genomes Project and simulated phenotypes. These data enabled evaluations of existing and newly developed statistical methods for rare variant sequence analysis for which standard statistical methods fail because of the rareness of the alleles. Various alternative approaches have been proposed that overcome the rareness problem by combining multiple rare variants within a gene. These approaches are termed collapsing …


Evaluating Methods For Combining Rare Variant Data In Pathway-Based Tests Of Genetic Association, Ashley Petersen, Alexandra Sitarik, Alexander Luedtke, Scott Powers, Airat Bekmetjev, Nathan L. Tintle Jan 2011

Evaluating Methods For Combining Rare Variant Data In Pathway-Based Tests Of Genetic Association, Ashley Petersen, Alexandra Sitarik, Alexander Luedtke, Scott Powers, Airat Bekmetjev, Nathan L. Tintle

Faculty Work Comprehensive List

Analyzing sets of genes in genome-wide association studies is a relatively new approach that aims to capitalize on biological knowledge about the interactions of genes in biological pathways. This approach, called pathway analysis or gene set analysis, has not yet been applied to the analysis of rare variants. Applying pathway analysis to rare variants offers two competing approaches. In the first approach rare variant statistics are used to generate p-values for each gene (e.g., combined multivariate collapsing [CMC] or weighted-sum [WS]) and the gene-level p-values are combined using standard pathway analysis methods (e.g., gene set enrichment analysis or …