Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Genetics and Genomics

University of Kentucky

Articles 1 - 30 of 54

Full-Text Articles in Physical Sciences and Mathematics

Genomics Of Postprandial Lipidomics In The Genetics Of Lipid-Lowering Drugs And Diet Network Study, Marguerite R. Irvin, May E. Montasser, Tobias Kind, Sili Fan, Dinesh K. Barupal, Amit Patki, Rikki M. Tanner, Nicole D. Armstrong, Kathleen A. Ryan, Steven A. Claas, Jeffrey R. O’Connell, Hemant K. Tiwari, Donna K. Arnett Nov 2021

Epidemiology and Environmental Health Faculty Publications

Postprandial lipemia (PPL) is an important risk factor for cardiovascular disease. Inter-individual variation in the dietary response to a meal is known to be influenced by genetic factors, yet genes that dictate variation in postprandial lipids are not completely characterized. Genetic studies of the plasma lipidome can help to better understand postprandial metabolism by isolating lipid molecular species which are more closely related to the genome. We measured the plasma lipidome at fasting and 6 h after a standardized high-fat meal in 668 participants from the Genetics of Lipid-Lowering Drugs and Diet Network study (GOLDN) using ultra-performance liquid chromatography coupled …


Epigenome-Wide Association Study Of Kidney Function Identifies Trans-Ethnic And Ethnic-Specific Loci, Charles E. Breeze, Anna Batorsky, Mi Kyeong Lee, Mindy D. Szeto, Xiaoguang Xu, Daniel L. McCartney, Rong Jiang, Amit Patki, Holly J. Kramer, James M. Eales, Laura Raffield, Leslie Lange, Ethan Lange, Peter Durda, Yongmei Liu, Russ P. Tracy, David Van Den Berg, NHLBI Trans-Omics For Precision Medicine (TOPMed) Consortium, TOPMed MESA Multi-Omics Working Group, Kathryn L. Evans, William E. Kraus, Donna K. Arnett Apr 2021

Epidemiology and Environmental Health Faculty Publications

BACKGROUND: DNA methylation (DNAm) is associated with gene regulation and estimated glomerular filtration rate (eGFR), a measure of kidney function. Decreased eGFR is more common among US Hispanics and African Americans. The causes for this are poorly understood. We aimed to identify trans-ethnic and ethnic-specific differentially methylated positions (DMPs) associated with eGFR using an agnostic, genome-wide approach.

METHODS: The study included up to 5428 participants from multi-ethnic studies for discovery and 8109 participants for replication. We tested the associations between whole blood DNAm and eGFR using beta values from Illumina 450K or EPIC arrays. Ethnicity-stratified analyses were performed using linear …


An Ensemble Of The iCluster Method To Analyze Longitudinal lncRNA Expression Data For Psoriasis Patients, Suyan Tian, Chi Wang Apr 2021

Internal Medicine Faculty Publications

BACKGROUND: Psoriasis is an immune-mediated, inflammatory disorder of the skin with chronic inflammation and hyper-proliferation of the epidermis. Since psoriasis has genetic components and the diseased tissue of psoriasis is very easily accessible, it is natural to use high-throughput technologies to characterize psoriasis and thus seek targeted therapies. Transcriptional profiles change correspondingly after an intervention. Unlike cross-sectional gene expression data, longitudinal gene expression data can capture the dynamic changes and thus facilitate causal inference.

METHODS: Using the iCluster method as a building block, an ensemble method was proposed and applied to a longitudinal gene expression dataset for psoriasis, with the …


Chromosome Xq23 Is Associated With Lower Atherogenic Lipid Concentrations And Favorable Cardiometabolic Indices, Pradeep Natarajan, Akhil Pampana, Sarah E. Graham, Sanni E. Ruotsalainen, James A. Perry, Paul S. De Vries, Jai G. Broome, James P. Pirruccello, Michael C. Honigberg, Krishna Aragam, Brooke Wolford, Jennifer A. Brody, Lucinda Antonacci-Fulton, Moscati Arden, Stella Aslibekyan, Themistocles L. Assimes, Christie M. Ballantyne, Lawrence F. Bielak, Joshua C. Bis, Brian E. Cade, Donna K. Arnett Apr 2021

Epidemiology and Environmental Health Faculty Publications

Autosomal genetic analyses of blood lipids have yielded key insights for coronary heart disease (CHD). However, X chromosome genetic variation is understudied for blood lipids in large sample sizes. We now analyze genetic and blood lipid data in a high-coverage whole X chromosome sequencing study of 65,322 multi-ancestry participants and perform replication among 456,893 European participants. Common alleles on chromosome Xq23 are strongly associated with reduced total cholesterol, LDL cholesterol, and triglycerides (min P = 8.5 × 10^−72), with similar effects for males and females. Chromosome Xq23 lipid-lowering alleles are associated with reduced odds for CHD among 42,545 …


Whole-Exome Sequencing And hiPSC Cardiomyocyte Models Identify MYRIP, TRAPPC11, And SLC27A6 Of Potential Importance To Left Ventricular Hypertrophy In An African Ancestry Population, Marguerite R. Irvin, Praful Aggarwal, Steven A. Claas, Lisa De Las Fuentes, Anh N. Do, C. Charles Gu, Andrea Matter, Benjamin S. Olson, Amit Patki, Karen Schwander, Joshua D. Smith, Vinodh Srinivasasainagendra, Hemant K. Tiwari, Amy J. Turner, Deborah A. Nickerson, Dabeeru C. Rao, Ulrich Broeckel, Donna K. Arnett Feb 2021

Epidemiology and Environmental Health Faculty Publications

Background: Indices of left ventricular (LV) structure and geometry represent useful intermediate phenotypes related to LV hypertrophy (LVH), a predictor of cardiovascular (CV) disease (CVD) outcomes.

Methods and Results: We conducted an exome-wide association study of LV mass (LVM) adjusted to height^2.7, LV internal diastolic dimension (LVIDD), and relative wall thickness (RWT) among 1,364 participants of African ancestry (AAs) in the Hypertension Genetic Epidemiology Network (HyperGEN). Both single-variant and gene-based sequence kernel association tests were performed to examine whether common and rare coding variants contribute to variation in echocardiographic traits in AAs. We then used a data-driven …


Root Hair Single Cell Type Specific Profiles Of Gene Expression And Alternative Polyadenylation Under Cadmium Stress, Jingyi Cao, Congting Ye, Guijie Hao, Carole Dabney-Smith, Arthur G. Hunt, Qingshun Q. Li May 2019

Plant and Soil Sciences Faculty Publications

Transcriptional networks are tightly controlled in plant development and stress responses. Alternative polyadenylation (APA) has been found to regulate gene expression under abiotic stress by increasing the heterogeneity at mRNA 3′-ends. Heavy metals like cadmium pollute water and soil due to mining and industry applications. Understanding how plants cope with heavy metal stress remains an interesting question. The Arabidopsis root hair was chosen as a single cell model to investigate the functional role of APA in cadmium stress response. Primary root growth inhibition and defective root hair morphotypes were observed. Poly(A) tag (PAT) libraries from single cell types, i.e., root …


Incorporating Pathway Information Into Feature Selection Towards Better Performed Gene Signatures, Suyan Tian, Chi Wang, Bing Wang Apr 2019

Biostatistics Faculty Publications

To analyze gene expression data with sophisticated grouping structures and to extract hidden patterns from such data, feature selection is of critical importance. It is well known that genes do not function in isolation but rather work together within various metabolic, regulatory, and signaling pathways. If the biological knowledge contained within these pathways is taken into account, the resulting method is a pathway-based algorithm. Studies have demonstrated that a pathway-based method usually outperforms its gene-based counterpart in which no biological knowledge is considered. In this article, pathway-based feature selection is first divided into three major categories, namely, pathway-level selection, …


Feature Selection For Longitudinal Data By Using Sign Averages To Summarize Gene Expression Values Over Time, Suyan Tian, Chi Wang Mar 2019

Biostatistics Faculty Publications

With the rapid evolution of high-throughput technologies, time series/longitudinal high-throughput experiments have become possible and affordable. However, the development of statistical methods dealing with gene expression profiles across time points has not kept up with the explosion of such data. The feature selection process is of critical importance for longitudinal microarray data. In this study, we proposed aggregating a gene’s expression values across time into a single value using the sign average method, thereby degrading a longitudinal feature selection process into a classic one. Regularized logistic regression models with pseudogenes (i.e., the sign average of genes across time as predictors) …


Substantial Red Wolf Genetic Ancestry Persists In Wild Canids Of Southwestern Louisiana, Jennifer R. Adams, John J. Cox, Lisette P. Waits Mar 2019

Forestry and Natural Resources Faculty Publications

Concerns over red wolf (Canis rufus) extinction caused by hybridization with coyotes (C. latrans) led to the capture and removal of remnant wild wolves from southwestern Louisiana and southeastern Texas, United States, during the 1970s. Here we show that despite decades of unmitigated hybridization, and declaration of endangered red wolves as functionally extinct in the wild, red wolf mitochondrial or nuclear DNA ancestry persists in ∼55% of contemporary wild canids sampled in southwestern Louisiana. Surprisingly, one individual had 78–100% red wolf ancestry, which is within the range for 75% red wolf, red wolf backcross, or putative …


Genome-Wide Association Studies For Yield-Related Traits In Soft Red Winter Wheat Grown In Virginia, Brian P. Ward, Gina Brown-Guedira, Frederic L. Kolb, David A. Van Sanford, Priyanka Tyagi, Clay H. Sneller, Carl A. Griffey Feb 2019

Plant and Soil Sciences Faculty Publications

Grain yield is a trait of paramount importance in the breeding of all cereals. In wheat (Triticum aestivum L.), yield has steadily increased since the Green Revolution, though the current rate of increase is not forecasted to keep pace with demand due to growing world population and increasing affluence. While several genome-wide association studies (GWAS) on yield and related component traits have been performed in wheat, the previous lack of a reference genome has made comparisons between studies difficult. In this study, a GWAS for yield and yield-related traits was carried out on a population of 322 soft red …


Discerning Novel Splice Junctions Derived From RNA-Seq Alignment: A Deep Learning Approach, Yi Zhang, Xinan Liu, James N. MacLeod, Jinze Liu Dec 2018

Computer Science Faculty Publications

Background: Exon splicing is a regulated cellular process in the transcription of protein-coding genes. Technological advancements and cost reductions in RNA sequencing have made quantitative and qualitative assessments of the transcriptome both possible and widely available. RNA-seq provides unprecedented resolution to identify gene structures and resolve the diversity of splicing variants. However, currently available ab initio aligners are vulnerable to spurious alignments due to random sequence matches and sample-reference genome discordance. As a consequence, a significant set of false positive exon junction predictions would be introduced, which will further confuse downstream analyses of splice variant discovery and abundance estimation.

Results: …


Large-Scale Genome-Wide Meta-Analysis Of Polycystic Ovary Syndrome Suggests Shared Genetic Architecture For Different Diagnosis Criteria, Felix Day, Tugce Karaderi, Michelle R. Jones, Cindy Meun, Chunyan He, Alex Drong, Peter Kraft, Nan Lin, Hongyan Huang, Linda Broer, Reedik Magi, Richa Saxena, Triin Laisk, Margrit Urbanek, M. Geoffrey Hayes, Gudmar Thorleifsson, Juan Fernandez-Tajes, Anubha Mahajan, Benjamin H. Mullin, Bronwyn G. A. Stuckey, Timothy D. Spector, Scott G. Wilson, Mark O. Goodarzi, Lea Davis, Barbara Obermayer-Pietsch, André G. Uitterlinden, Verneri Anttila, Benjamin M. Neale, Marjo-Riitta Jarvelin, Bart Fauser Dec 2018

Internal Medicine Faculty Publications

Polycystic ovary syndrome (PCOS) is a disorder characterized by hyperandrogenism, ovulatory dysfunction and polycystic ovarian morphology. Affected women frequently have metabolic disturbances including insulin resistance and dysregulation of glucose homeostasis. PCOS is diagnosed with two different sets of diagnostic criteria, resulting in a phenotypic spectrum of PCOS cases. The genetic similarities between cases diagnosed based on the two criteria have been largely unknown. Previous studies in Chinese and European subjects have identified 16 loci associated with risk of PCOS. We report a fixed-effect, inverse-weighted-variance meta-analysis from 10,074 PCOS cases and 103,164 controls of European ancestry and characterisation of PCOS related …
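
As a rough illustration of the fixed-effect, inverse-variance weighting named in this abstract, the sketch below pools per-study effect estimates into one summary estimate. The function name, variable names, and the two example studies are hypothetical and are not taken from the paper, whose actual analysis pipeline is not described in this listing.

```python
import math


def fixed_effect_meta(betas, standard_errors):
    """Pool per-study effects with inverse-variance weights (fixed-effect model).

    betas: per-study effect estimates (e.g. log odds ratios)
    standard_errors: matching per-study standard errors
    Returns (pooled_beta, pooled_se, z_score).
    """
    if len(betas) != len(standard_errors) or not betas:
        raise ValueError("need equally sized, non-empty inputs")
    # Each study is weighted by the inverse of its sampling variance.
    weights = [1.0 / se ** 2 for se in standard_errors]
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    # The variance of the pooled estimate is the inverse of the summed weights.
    pooled_se = math.sqrt(1.0 / total_weight)
    return pooled_beta, pooled_se, pooled_beta / pooled_se


# Hypothetical example: two studies with equal precision.
beta, se, z = fixed_effect_meta([0.10, 0.20], [0.1, 0.1])
print(beta, se, z)
```

More precise studies (smaller standard errors) dominate the pooled estimate, which is why the fixed-effect model suits settings where every study is assumed to estimate the same underlying effect.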


A Longitudinal Feature Selection Method Identifies Relevant Genes To Distinguish Complicated Injury And Uncomplicated Injury Over Time, Suyan Tian, Chi Wang, Howard H. Chang Dec 2018

Biostatistics Faculty Publications

Background: Feature selection and gene set analysis are of increasing interest in the field of bioinformatics. While these two approaches have been developed for different purposes, we describe how some gene set analysis methods can be utilized to conduct feature selection.

Methods: We adopted a gene set analysis method, the significance analysis of microarray gene set reduction (SAMGSR) algorithm, to carry out feature selection for longitudinal gene expression data.

Results: Using a real-world application and simulated data, it is demonstrated that the proposed SAMGSR extension outperforms other relevant methods. In this study, we illustrate that a gene’s expression profiles over …


Breeding For Resilience To Increasing Temperatures: A Field Trial Assessing Genetic Variation In Soft Red Winter Wheat, Kathleen Russell, David Van Sanford Dec 2018

Plant and Soil Sciences Faculty Publications

Breeding for resilience to climate change is a daunting prospect. Crop and climate models tell us that global wheat yields are likely to decline as the climate warms, causing a significant risk to global food security. High temperatures are known to affect crop development yet breeding for tolerance to heat stress is difficult to achieve in field environments. We conducted an active warming study over two years to quantify the effects of heat stress on genetic variation of soft red winter (SRW) wheat (Triticum aestivum L.). Forty SRW cultivars and breeding lines were chosen based on marker genotypes at …


Integrating Multiple Genetic Detection Methods To Estimate Population Density Of Social And Territorial Carnivores, Sean M. Murphy, Ben C. Augustine, Jennifer R. Adams, Lisette P. Waits, John J. Cox Oct 2018

Forestry and Natural Resources Faculty Publications

Spatial capture–recapture models can produce unbiased estimates of population density, but sparse detection data often plague studies of social and territorial carnivores. Integrating multiple types of detection data can improve estimation of the spatial scale parameter (σ), activity center locations, and density. Noninvasive genetic sampling is effective for detecting carnivores, but social structure and territoriality could cause differential detectability among population cohorts for different detection methods. Using three observation models, we evaluated the integration of genetic detection data from noninvasive hair and scat sampling of the social and territorial coyote (Canis latrans). Although precision of estimated density was …


SeqOthello: Querying RNA-Seq Experiments At Scale, Ye Yu, Jinpeng Liu, Xinan Liu, Yi Zhang, Eamonn Magner, Erik Lehnert, Chen Qian, Jinze Liu Oct 2018

Computer Science Faculty Publications

We present SeqOthello, an ultra-fast and memory-efficient indexing structure to support arbitrary sequence query against large collections of RNA-seq experiments. It takes SeqOthello only 5 min and 19.1 GB memory to conduct a global survey of 11,658 fusion events against 10,113 TCGA Pan-Cancer RNA-seq datasets. The query recovers 92.7% of tier-1 fusions curated by TCGA Fusion Gene Database and reveals 270 novel occurrences, all of which are present as tumor-specific. By providing a reference-free, alignment-free, and parameter-free sequence search system, SeqOthello will enable large-scale integrative studies using sequence-level data, an undertaking not previously practicable for many individual labs.


iac Gene Expression In The Indole-3-Acetic Acid-Degrading Soil Bacterium Enterobacter soli LF7, Isaac V. Greenhut, Beryl L. Slezak, Johan H. J. Leveau Oct 2018

Plant and Soil Sciences Faculty Publications

We show for soil bacterium Enterobacter soli LF7 that the possession of an indole-3-acetic acid catabolic (iac) gene cluster is causatively linked to the ability to utilize the plant hormone indole-3-acetic acid (IAA) as a carbon and energy source. Genome-wide transcriptional profiling by mRNA sequencing revealed that these iac genes, chromosomally arranged as iacHABICDEFG and coding for the transformation of IAA to catechol, were the most highly induced (>29-fold) among the relatively few (iac cluster were genes for a major facilitator superfamily protein (mfs) and enzymes of the β-ketoadipate pathway (pcaIJD-catBCA), which channels …


Association Analyses Of Repeated Measures On Triglyceride And High-Density Lipoprotein Levels: Insights From GAW20, Saurabh Ghosh, David W. Fardo Sep 2018

Biostatistics Faculty Publications

Background: The GAW20 group formed on the theme of methods for association analyses of repeated measures comprised 4 sets of investigators. The provided “real” data set included genotypes obtained from a human whole-genome association study based on longitudinal measurements of triglycerides (TGs) and high-density lipoprotein in addition to methylation levels before and after administration of fenofibrate. The simulated data set contained 200 replications of methylation levels and posttreatment TGs, mimicking the real data set.

Results: The different investigators in the group focused on the statistical challenges unique to family-based association analyses of phenotypes measured longitudinally and applied a wide spectrum of …


GAW20: Methods And Strategies For The New Frontiers Of Epigenetics And Pharmacogenomics, Nathan L. Tintle, David W. Fardo, Marzia De Andrade, Stella Aslibekyan, Julia N. Bailey, Justo Lorenzo Bermejo, Rita M. Cantor, Saurabh Ghosh, Philip Melton, Xuexua Wang, Jean W. MacCluer, Laura Almasy Sep 2018

Biostatistics Faculty Publications

GAW20 provided a platform for developing and evaluating statistical methods to analyze human lipid-related phenotypes, DNA methylation, and single-nucleotide markers in a study involving a pharmaceutical intervention. In this article, we present an overview of the data sets and the contributions analyzing these data. The data, donated by the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN) investigators, included data from 188 families (N = 1105) which included genome-wide DNA methylation data before and after a 3-week treatment with fenofibrate, single-nucleotide polymorphisms, metabolic syndrome components before and after treatment, and a variety of covariates. The contributions from individual …


Longitudinal Data Methods For Evaluating Genome-By-Epigenome Interactions In Families, Justin C. Strickland, I-Chen Chen, Chanung Wang, David W. Fardo Sep 2018

Psychology Faculty Publications

Background: Longitudinal measurement is commonly employed in health research and provides numerous benefits for understanding disease and trait progression over time. More broadly, it allows for proper treatment of correlated responses within clusters. We evaluated 3 methods for analyzing genome-by-epigenome interactions with longitudinal outcomes from family data.

Results: Linear mixed-effect models, generalized estimating equations, and quadratic inference functions were used to test a pharmacoepigenetic effect in 200 simulated posttreatment replicates. Adjustment for baseline outcome provided greater power and more accurate control of Type I error rates than computation of a pre-to-post change score.

Conclusions: Comparison of all modeling approaches indicated …


Characterization Of mRNA Polyadenylation In The Apicomplexa, Ashley T. Stevens, Daniel K. Howe, Arthur G. Hunt Aug 2018

Plant and Soil Sciences Faculty Publications

Messenger RNA polyadenylation is a universal aspect of gene expression in eukaryotes. In well-established model organisms, this process is mediated by a conserved complex of 15–20 subunits. To better understand this process in apicomplexans, a group of unicellular parasites that causes serious disease in humans and livestock, a computational and high throughput sequencing study of the polyadenylation complex and poly(A) sites in several species was conducted. BLAST-based searches for orthologs of the human polyadenylation complex yielded clear matches to only two—poly(A) polymerase and CPSF73—of the 19 proteins used as queries in this analysis. As the human subunits that recognize the …


Transcriptional Response Of Honey Bee (Apis mellifera) To Differential Nutritional Status And Nosema Infection, Farida Azzouz-Olden, Arthur G. Hunt, Gloria DeGrandi-Hoffman Aug 2018

Plant and Soil Sciences Faculty Publications

Background: Bees are confronting several environmental challenges, including the intermingled effects of malnutrition and disease. Intuitively, pollen is the healthiest nutritional choice; however, commercial substitutes, such as Bee-Pro and MegaBee, are widely used. Herein we examined how feeding natural and artificial diets shapes transcription in the abdomen of the honey bee, and how transcription shifts in combination with Nosema parasitism.

Results: Gene ontology enrichment revealed that, compared with poor diet (carbohydrates [C]), bees fed pollen (P > C), Bee-Pro (B > C), and MegaBee (M > C) showed a broad upregulation of metabolic processes, especially lipids; however, pollen feeding promoted more functions, and …


iMapSplice: Alleviating Reference Bias Through Personalized RNA-Seq Alignment, Xinan Liu, James N. MacLeod, Jinze Liu Aug 2018

Computer Science Faculty Publications

Genomic variants in both coding and non-coding sequences can have functionally important and sometimes deleterious effects on exon splicing of gene transcripts. For transcriptome profiling using RNA-seq, the accurate alignment of reads across exon junctions is a critical step. Existing algorithms that utilize a standard reference genome as a template sometimes have difficulty in mapping reads that carry genomic variants. These problems can lead to allelic ratio biases and the failure to detect splice variants created by splice site polymorphisms. To improve RNA-seq read alignment, we have developed a novel approach called iMapSplice that enables personalized mRNA transcriptome profiling. The …


Genome-Wide Atlas Of Alternative Polyadenylation In The Forage Legume Red Clover, Manohar Chakrabarti, Randy D. Dinkins, Arthur G. Hunt Jul 2018

Plant and Soil Sciences Faculty Publications

Studies on the prevalence and significance of alternative polyadenylation (APA) in plants have so far been limited mostly to model plants. Here, a genome-wide analysis of APA was carried out in different tissue types in the non-model forage legume red clover (Trifolium pratense L.). A profile of poly(A) sites in different tissue types was generated using the so-called ‘poly(A)-tag sequencing’ (PATseq) approach. Our analysis revealed tissue-wise dynamics of usage of poly(A) sites located at different genomic locations. We also identified poly(A) sites and underlying genes displaying APA in different tissues. Functional categories enriched in groups of genes manifesting APA between …


A Model Species For Agricultural Pest Genomics: The Genome Of The Colorado Potato Beetle, Leptinotarsa decemlineata (Coleoptera: Chrysomelidae), Sean D. Schoville, Yolanda H. Chen, Martin N. Andersson, Joshua B. Benoit, Anita Bhandari, Julia H. Bowsher, Kristian Brevik, Kaat Cappelle, Mei-Ju M. Chen, Anna K. Childers, Christopher Childers, Olivier Christiaens, Justin Clements, Elise M. Didion, Elena N. Elpidina, Patamarerk Engsontia, Markus Friedrich, Inmaculada García-Robles, Richard A. Gibbs, Chandan Goswami, Alessandro Grapputo, Kristina Gruden, Marcin Grynberg, Bernard Henrissat, Emily C. Jennings, Jeffery W. Jones, Megha Kalsi, Sher A. Khan, Abhishek Kumar, Fei Li, Vincent Lombard, Subba Reddy Palli, June-Sun Yoon Jan 2018

Entomology Faculty Publications

The Colorado potato beetle is one of the most challenging agricultural pests to manage. It has shown a spectacular ability to adapt to a variety of solanaceous plants and variable climates during its global invasion, and, notably, to rapidly evolve insecticide resistance. To examine evidence of rapid evolutionary change, and to understand the genetic basis of herbivory and insecticide resistance, we tested for structural and functional genomic changes relative to other arthropod species using genome sequencing, transcriptomics, and community annotation. Two factors that might facilitate rapid evolutionary change include transposable elements, which comprise at least 17% of the genome and …


Recurrent Neural Networks And Their Applications To RNA Secondary Structure Inference, Devin Willmott Jan 2018

Theses and Dissertations--Mathematics

Recurrent neural networks (RNNs) are state-of-the-art sequential machine learning tools, but have difficulty learning sequences with long-range dependencies due to the exponential growth or decay of gradients backpropagated through the RNN. Some methods overcome this problem by modifying the standard RNN architecture to force the recurrent weight matrix W to remain orthogonal throughout training. The first half of this thesis presents a novel orthogonal RNN architecture that enforces orthogonality of W by parametrizing with a skew-symmetric matrix via the Cayley transform. We present rules for backpropagation through the Cayley transform, show how to deal with the Cayley …
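
As a rough illustration of the parametrization mentioned in this abstract, the sketch below builds a skew-symmetric matrix A from free parameters and maps it to an orthogonal matrix with the Cayley transform. It assumes the common form W = (I + A)^{-1}(I - A); the thesis itself may use a different sign convention or an additional scaling factor. The helper names (`skew_from_params`, `solve`, `cayley`) are hypothetical, and the code is plain Python for checking the algebra, not training code.

```python
def skew_from_params(n, params):
    """Build an n x n skew-symmetric matrix from n*(n-1)/2 free parameters."""
    if len(params) != n * (n - 1) // 2:
        raise ValueError("expected n*(n-1)/2 parameters")
    A = [[0.0] * n for _ in range(n)]
    values = iter(params)
    for i in range(n):
        for j in range(i + 1, n):
            v = next(values)
            A[i][j] = v      # upper triangle holds the parameter
            A[j][i] = -v     # lower triangle is its negative, so A^T = -A
    return A


def solve(M, B):
    """Solve M X = B by Gauss-Jordan elimination with partial pivoting."""
    n = len(M)
    aug = [list(M[i]) + list(B[i]) for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [a - f * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def cayley(A):
    """Assumed convention: W = (I + A)^{-1} (I - A) for skew-symmetric A."""
    n = len(A)
    i_plus = [[(1.0 if i == j else 0.0) + A[i][j] for j in range(n)] for i in range(n)]
    i_minus = [[(1.0 if i == j else 0.0) - A[i][j] for j in range(n)] for i in range(n)]
    return solve(i_plus, i_minus)


def orthogonality_error(W):
    """Largest absolute entry of W^T W - I."""
    n = len(W)
    worst = 0.0
    for i in range(n):
        for j in range(n):
            dot = sum(W[k][i] * W[k][j] for k in range(n))
            worst = max(worst, abs(dot - (1.0 if i == j else 0.0)))
    return worst


# Arbitrary illustrative parameters, not values from the thesis.
A = skew_from_params(3, [0.5, -1.2, 2.0])
W = cayley(A)
err = orthogonality_error(W)
```

Because the eigenvalues of a real skew-symmetric matrix are purely imaginary, I + A is always invertible, so every choice of parameters yields a valid orthogonal W under this convention.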


Bayesian Prediction Intervals For Assessing P-Value Variability In Prospective Replication Studies, Olga A. Vsevolozhskaya, Gabriel Ruiz, Dmitri Zaykin Dec 2017

Biostatistics Faculty Publications

Increased availability of data and accessibility of computational tools in recent years have created an unprecedented upsurge of scientific studies driven by statistical analysis. Limitations inherent to statistics impose constraints on the reliability of conclusions drawn from data, so misuse of statistical methods is a growing concern. Hypothesis and significance testing, and the accompanying P-values, are being scrutinized as representing the most widely applied and abused practices. One line of critique is that P-values are inherently unfit to fulfill their ostensible role as measures of credibility for scientific hypotheses. It has also been suggested that while P-values …


Analysis Of Complex Vertebrate Genomes: Computational Challenges And Solutions, Jeramiah J. Smith Oct 2017

Commonwealth Computational Summit

No abstract provided.


Discovery Of Sex-Specific Regions In A Salamander Genome, Nataliya Y. Timoshevskaya, Melissa C. Keinath, Jeramiah J. Smith Oct 2017

Commonwealth Computational Summit

Biological Aspects:

The salamander (Ambystoma mexicanum) has a gigantic genome: ~32,000,000,000 bases (about 10 times the size of the human genome)

Sex is determined by a pair of morphologically identical chromosomes:

  • ZZ in males
  • ZW in females

Objective:

  • Identify genomic differences, if any, between the W and Z chromosomes

Workflow:

  1. Sequencing and de novo assembly of the reference salamander genome
  2. Alignment of short sequences from male and female genomes to the reference
  3. Coverage analysis
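
The coverage-analysis step of this workflow can be sketched as follows, under stated assumptions. In a ZZ/ZW system, sequence found only on W should be covered by female reads but not male reads, while sequence found only on Z should show roughly half the normalized coverage in females. The function name, the pseudocount, and the cutoffs below are hypothetical choices for illustration, and the depth values are invented toy numbers, not data from the study.

```python
import math
from statistics import median


def classify_contigs(female_depth, male_depth, pseudo=0.05,
                     w_min=2.0, z_range=(-1.5, -0.5)):
    """Label contigs by the log2 ratio of normalized female to male depth.

    Depths are mean read depths per contig. Each sample is normalized by its
    median depth, which assumes most contigs are autosomal. All thresholds
    are illustrative assumptions, not values from the original work.
    """
    f_med = median(female_depth.values())
    m_med = median(male_depth.values())
    if f_med <= 0 or m_med <= 0:
        raise ValueError("median depth must be positive for normalization")

    calls = {}
    for contig in female_depth:
        f = female_depth[contig] / f_med
        m = male_depth[contig] / m_med
        # The pseudocount keeps the ratio finite when male depth is zero.
        ratio = math.log2((f + pseudo) / (m + pseudo))
        if ratio >= w_min:
            label = "W-candidate"   # present in females, absent in males
        elif z_range[0] <= ratio <= z_range[1]:
            label = "Z-candidate"   # about half the female dose
        else:
            label = "shared"
        calls[contig] = (round(ratio, 2), label)
    return calls


# Invented toy depths for five hypothetical contigs.
female = {"contig_1": 30, "contig_2": 30, "contig_3": 30,
          "contig_W": 15, "contig_Z": 15}
male = {"contig_1": 20, "contig_2": 20, "contig_3": 20,
        "contig_W": 0, "contig_Z": 20}

calls = classify_contigs(female, male)
```

In practice the per-contig depths would come from the alignments in step 2, and candidate regions would need validation, since repeats and mis-assemblies can also distort coverage.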


The Lamprey Genome: Illuminating Genomic Change Across Eons And Embryogenesis, Jeramiah J. Smith, Courtney K. M. Waterbury, Melissa C. Keinath, Cody B. Saraceno, Vladimir A. Timoshevskiy, Nataliya Y. Timoshevskaya Oct 2017

Commonwealth Computational Summit

The lamprey genome provides unique insights into both the deep evolutionary history of vertebrate genomes and the maintenance of genome structure and integrity over development. The lamprey lineage diverged from all other vertebrates approximately 500 million years ago. As such, comparisons between lamprey and other vertebrates permit reconstruction of ancient duplication and rearrangement events that defined the fundamental architecture and gene content of all extant vertebrate genomes. Lamprey also undergoes programmatic changes in genome structure that result in the physical elimination of ~20% of its genomic DNA (~0.5 Gb from a ~2 Gb genome) from all somatic cell lineages during early embryonic development. Here, …