Open Access. Powered by Scholars. Published by Universities.®
- Institution
- Western University (9)
- The Texas Medical Center Library (8)
- University of Tennessee, Knoxville (6)
- New Jersey Institute of Technology (5)
- Purdue University (3)
- Wayne State University (3)
- Rochester Institute of Technology (2)
- University of Texas at El Paso (2)
- Virginia Commonwealth University (2)
- California Polytechnic State University, San Luis Obispo (1)
- Clemson University (1)
- Louisiana Tech University (1)
- Michigan Technological University (1)
- Syracuse University (1)
- University of Arkansas, Fayetteville (1)
- University of Kentucky (1)
- University of Louisville (1)
- University of Mississippi (1)
- University of Nevada, Las Vegas (1)
- University of New Hampshire (1)
- University of New Orleans (1)
- University of Wisconsin Milwaukee (1)
- Wilfrid Laurier University (1)
- Keyword
- Bioinformatics (4)
- Biological sciences (3)
- Pure sciences (3)
- Algorithm (2)
- Comparative genomics (2)
- Data mining (2)
- Genotyping (2)
- Germination (2)
- Microarray (2)
- Mutation (2)
- Next-generation sequencing (2)
- RNA-Seq (2)
- Rice (2)
- SNPs (2)
- 16S tag sequencing (1)
- Abiotic and biotic stresses (1)
- Abscisic acid (1)
- Age-structured model (1)
- Alcohol (1)
- Alternative oxidase (1)
- Alternative polyadenylation (APA) of mRNA (1)
- Anti-Toxin (1)
- Apolipoprotein E (1)
- Applied sciences (1)
- Arabidopsis thaliana (1)
- Association study (1)
- Bacterial vaginosis (1)
- Basic reproduction number (1)
- Biological pathway analysis (1)
- Biomarker (1)
- Publication
- Electronic Thesis and Dissertation Repository (9)
- Dissertations & Theses (Open Access) (8)
- Theses (7)
- Doctoral Dissertations (4)
- Masters Theses (3)
- Open Access Dissertations (3)
- Theses and Dissertations (3)
- Wayne State University Dissertations (3)
- Electronic Theses and Dissertations (2)
- Open Access Theses & Dissertations (2)
- All Dissertations (1)
- Dissertations - ALL (1)
- Dissertations, Master's Theses and Master's Reports - Open (1)
- Graduate Theses and Dissertations (1)
- Honors Theses and Capstones (1)
- Master's Theses (1)
- Theses and Dissertations (Comprehensive) (1)
- Theses and Dissertations--Computer Science (1)
- UNLV Theses, Dissertations, Professional Papers, and Capstones (1)
- University of New Orleans Theses and Dissertations (1)
Articles 1 - 30 of 54
Full-Text Articles in Life Sciences
Statistical And Comparative Phylogeography Of Mexican Freshwater Taxa In Extreme Aquatic Environments, Lyndon M. Coghill
University of New Orleans Theses and Dissertations
Phylogeography aims to understand the processes that underlie the distribution of genetic variation within and among closely related species. Although the means by which this goal might be achieved differ considerably from those that spawned the field some thirty years ago, the foundation and conceptual breakthroughs made by Avise are nonetheless the same and are as relevant today as they were two decades ago. Namely, patterns of neutral genetic variation among individuals carry the signature of a species’ demographic past, and the spatial and temporal environmental heterogeneity across a species’ geographic range can influence patterns of evolutionary change. Aquatic systems …
Computational Molecular Coevolution, Russell J. Dickson
Electronic Thesis and Dissertation Repository
A major goal in computational biochemistry is to obtain three-dimensional structure information from protein sequence. Coevolution represents a biological mechanism through which structural information can be obtained from a family of protein sequences. Evolutionary relationships within a family of protein sequences are revealed through sequence alignment. Statistical analyses of these sequence alignments reveal positions in the protein family that covary, and thus appear to be dependent on one another throughout the evolution of the protein family. These covarying positions are inferred to be coevolving via one of two biological mechanisms, both of which imply that coevolution is facilitated by inter-residue …
Characterizing The Human Vaginal Microbiome Using High-Throughput Sequencing, Jean Megan E. Macklaim
Electronic Thesis and Dissertation Repository
The human vaginal microbiome undoubtedly has a significant role in reproductive health and for protection from infectious organisms. Recent efforts to characterize the bacterial species of the vagina using molecular techniques have uncovered an unexpected diversity. Using high-throughput sequencing I sought to describe the structure and function of the vaginal microbiome under different physiological states including healthy, bacterial vaginosis (BV), post-menopausal vaginal atrophy, and acute vulvovaginal candidiasis (VVC).
Partial 16S rRNA gene sequencing revealed that healthy, asymptomatic women most often have vaginal biotas dominated by Lactobacillus iners or L. crispatus. In contrast, BV is a heterogeneous, highly diversified condition …
Discovering Driver Somatic Mutations, Copy Number Alterations And Methylation Changes Using Markov Chain Monte Carlo, Bokhari Yahya
Theses and Dissertations
We now have a tremendous amount of genetic data that needs to be interpreted. Somatic mutations, copy number variations and methylation are examples of the genetic data we are dealing with. Discovering driver mutations from these combined data types is challenging. Mutations are unpredictable and have broad heterogeneity, which makes our goal hard to accomplish. Many methods have been proposed to unravel the genetics of cancer. In this project we work with the above-mentioned genetic data types and modify an existing method that uses Markov Chain Monte Carlo (MCMC). The method introduced two properties, coverage and exclusivity. …
Attributing Meaning To Online Social Network Analysis For Tailored Socio-Behavioral Support Systems, Sahiti Myneni
Dissertations & Theses (Open Access)
Ubiquitous online social networks provide us with a unique opportunity to deliver scalable interventions for the support of lifestyle modifications in order to change behaviors that predispose toward cancer and other diseases. At the same time these networks act as rich data sources to inform our understanding of end-user needs. Traditionally, social network analysis is based on communication frequency among members. In this work, I introduce communication content as a complementary frame for studying these networks.
QuitNet, an online social network developed to provide smoking cessation support, is considered for analysis. Qualitative coding, automated content analysis, and network analysis were …
Filter-Based Multiscale Entropy Analysis Of Complex Physiological Time Series, Liang Zhao
Dissertations - ALL
The multiscale entropy (MSE) has been widely and successfully used in analyzing the complexity of physiologic time series. In this thesis, we re-interpret the averaging process in MSE as filtering a time series by a filter of a piecewise constant type. From this viewpoint, we introduce the filter-based multiscale entropy (FME), which filters a time series to generate its multiple frequency components and then computes the blockwise entropy of the resulting components. By choosing filters adapted to the features of a given time series, FME is able to better capture its multiscale information and to provide …
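The re-interpretation of MSE averaging as filtering can be illustrated with a minimal sketch. It assumes the standard MSE coarse-graining step (non-overlapping block means at scale `tau`) and shows that the same series results from a piecewise-constant (box) filter followed by downsampling. The function names are hypothetical and are not taken from the thesis, and the FME filters themselves are not reproduced here.

```python
import numpy as np


def coarse_grain(x, tau):
    """Standard MSE coarse-graining: means of non-overlapping blocks of length tau."""
    x = np.asarray(x, dtype=float)
    n_blocks = len(x) // tau
    return x[: n_blocks * tau].reshape(n_blocks, tau).mean(axis=1)


def coarse_grain_by_filter(x, tau):
    """Same result, written as a box (piecewise-constant) filter plus downsampling."""
    x = np.asarray(x, dtype=float)
    box = np.ones(tau) / tau                      # piecewise-constant filter
    smoothed = np.convolve(x, box, mode="valid")  # moving average of width tau
    return smoothed[::tau]                        # keep every tau-th sample
```

Both functions return the same coarse-grained series; replacing `box` with a different filter is the kind of generalization the abstract describes, although the specific filters used in the thesis are not stated in this excerpt.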
Introducing A Novel Method For Genetic Analysis Of Autism Spectrum Disorder, Sepideh Nouri
Dissertations & Theses (Open Access)
Autism is a spectrum of neurological disorders that is characterized by repetitive and stereotyped behaviors, lack of social skills in verbal and non-verbal communications, and intellectual disability. Recent statistics show that 1 out of every 88 children in the US is affected by autism.
In this thesis, I first review previous studies on genetic association analyses of autism spectrum disorder. A large number of these studies fall into two categories: Genome Wide Association Studies (GWAS) and sequencing studies. Although GWAS are able to identify multiple common risk variants associated with different diseases, these common variants explain only a small portion …
Demonstration Of A Targeted Proteome Characterization Approach For Examining Specific Metabolic Pathways In Complex Bacterial Systems, Adam Justin Martin
Masters Theses
Multiple Reaction Monitoring (MRM) is a powerful tandem mass spectrometry (MS/MS) tool frequently implemented in proteomic studies to provide targeted analysis of proteins and peptides. The selectivity that MRM delivers is so strong that it gives the quadrupole mass spectrometers (QQQ), on which it is commonly employed, a relevance to proteomic studies that they would otherwise lack because of their relatively low resolution. Additionally, this increased level of selectivity is sufficient to supplant complicated fractionation techniques, additional dimensions of chromatography, and 24-hour-long MS/MS experiments in simple biological samples. But there is a deficiency of evidence to determine the …
Development And Evaluation Of An Ontology-Based Quality Metrics Extraction System, Sina Madani
Dissertations & Theses (Open Access)
The Institute of Medicine reports a growing demand in recent years for quality improvement within the healthcare industry. In response, numerous organizations have been involved in the development and reporting of quality measurement metrics. However, disparate data models from such organizations shift the burden of accurate and reliable metrics extraction and reporting to healthcare providers. Furthermore, manual abstraction of quality metrics and diverse implementation of Electronic Health Record (EHR) systems deepen the complexity of consistent, valid, explicit, and comparable quality measurement reporting within healthcare provider organizations.
The main objective of this research is to evaluate an ontology-based information extraction framework …
Estimation Of Variation For High-Throughput Molecular Biological Experiments With Small Sample Size, Danni Yu
Open Access Dissertations
Motivation: In the quantification of molecular components, a large variation can affect and even potentially mislead the biological conclusions. Meanwhile, the high-throughput experiments often involve a small number of samples due to the limitation of cost and time. In such cases, the stochastic information may dominate the outcome of an experiment because there may not be enough samples to present the true biological information. It is challenging to distinguish the changes in phenotype from the stochastic variation.
Methods: Since the biological molecules have been quantified with different technologies, different statistical methods are required. Focusing on three types of important high-throughput …
Statistical Models For Gene And Transcripts Quantification And Identification Using RNA-Seq Technology, Han Wu
Open Access Dissertations
RNA-Seq has emerged as a powerful technique for transcriptome study. Along with improved sensitivity and coverage, RNA-Seq also brings challenges for data analysis. The massive amount of sequence read data, excessive variability, uncertainties, and bias and noise stemming from multiple sources all make the analysis of RNA-Seq data difficult. Despite much progress, RNA-Seq data analysis still has much room for improvement, especially on the quantification of gene and transcript expression levels. The quantification of gene expression level is a direct inference problem, whereas the quantification of the transcript expression level is an indirect problem, because the label of …
Development Of Tyrosine Kinase Peptide Biosensors And Methods For Detection, Andrew Michael Lipchik
Open Access Dissertations
New methods to monitor tyrosine kinase activity are critical for studying kinases in cell biology, drug discovery and the clinic. Peptide-based biosensors for detection of kinase activity utilize a kinase-specific artificial peptide substrate, which can report intercellular kinase activity through the incorporation of phosphate.
An artificial Syk substrate peptide was developed and incorporated with other functional modules to produce a Syk biosensor. These modules included a biotin tag for affinity capture, a photo-cleavable amino acid to allow release of the substrate from the delivery module, and the cell-penetrating peptide TAT. A live cell kinase assay utilizing this biosensor was …
Identifying Chromosome Rearrangements In The Allopolyploid Brassica Napus Using Pyrosequencing, Alexandra R. Barbella
Master's Theses
Allopolyploids form through the hybridization of two or more diploid genomes. A challenge to reproduction in allopolyploids is that pairing can occur between homologous chromosomes or homeologous chromosomes (i.e., chromosomes from different subgenomes). Crossover between homeologous chromosomes can result in chromosome rearrangements that lower fertility and overall fitness. Rearrangements can alter the dosage of either entire chromosomes or just parts of chromosomes. Understanding the frequency and extent of rearrangements will help to explain the evolution and genome stabilization of agriculturally important allopolyploid species. Pyrosequencing is a useful tool in the study of dosage changes in allopolyploids because it allows quantification of the relative contribution …
Quantifying Mutational Impacts On Intrinsic DNA Flexibility In Prokaryotic Genomes, Mohammed Alawad
Theses
The existence of synonymous codon biases across all taxonomic groups is a long-standing problem in biology. While codon bias seems to be adequately explained by the maintenance of translation efficiency and accuracy in some organisms, there is still no adequate explanation of why codon biases universally track the intergenic GC content, as these regions of the genome would not be under selection pressures affecting translation. One part of the story may come from the triplet nature of codons, in which each third position defines the minor groove width and thus affects the basic structure of the DNA by altering …
Role Of Branched-Chain Amino Acid Transporters In Staphylococcus Aureus Virulence, Sameha Omer
Electronic Thesis and Dissertation Repository
Branched-chain amino acids (BCAAs) act as effector molecules that signal a global transcriptional regulator, CodY, to regulate virulence factors in nutrient depleted environments. Staphylococcus aureus contains three putative BCAA transporters (BrnQ1, BrnQ2, BrnQ3) whose role in BCAA uptake is unknown. We hypothesize that BrnQ transporters are involved in BCAA uptake and contribute to virulence in S. aureus by modulating CodY activity. Results from radioactive uptake assays indicate that BrnQ1 is the predominant BrnQ transporter of isoleucine, valine and leucine. Meanwhile, BrnQ2 is more specific for isoleucine. Furthermore, only the lack of BrnQ1 hinders growth of S. aureus in chemically-defined media …
Modeling Leafhopper Populations And Their Role In Transmitting Plant Diseases, Ji Ruan
Electronic Thesis and Dissertation Repository
This M.Sc. thesis focuses on the interactions between crops and leafhoppers.
Firstly, a general delay differential equations system is proposed, based on the infection age structure, to investigate disease dynamics when disease latencies are considered. To further the understanding of the subject, a specific model is then introduced. The basic reproduction numbers $\mathcal{R}_0$ and $\mathcal{R}_1$ are identified and their threshold properties are discussed. When $\mathcal{R}_0 < 1$, the insect-free equilibrium is globally asymptotically stable. When $\mathcal{R}_0 > 1$ and $\mathcal{R}_1 < 1$, the disease-free equilibrium exists and is locally asymptotically stable. When $\mathcal{R}_1 > 1$, the disease will persist.
Secondly, we derive another general delay differential equations system to examine how different life stages of leafhoppers affect crops. The basic reproduction number $\mathcal{R}_0$ is determined: when …
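The threshold behavior stated for the first model can be summarized in a small lookup, sketched below. It only encodes the three cases given in the abstract. The function name is hypothetical, the persistence case is assumed to require the first reproduction number to exceed 1 as well (the abstract does not say so explicitly), and boundary values equal to 1 are not covered by the abstract, so they are reported as such. How the two reproduction numbers are computed from the delay differential equations is not given in this excerpt.

```python
def classify_regime(r0, r1):
    """Map the two basic reproduction numbers to the regime named in the abstract."""
    if r0 < 1:
        return "insect-free equilibrium is globally asymptotically stable"
    if r0 > 1 and r1 < 1:
        return "disease-free equilibrium exists and is locally asymptotically stable"
    if r0 > 1 and r1 > 1:
        # Assumption: persistence is read as the case where both numbers exceed 1.
        return "disease persists"
    return "boundary case not covered by the abstract"
```

For example, `classify_regime(1.5, 0.8)` returns the disease-free case, while `classify_regime(1.0, 2.0)` falls into the boundary branch because the abstract gives no statement for a value of exactly 1.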
Development And Integration Of Informatic Tools For Qualitative And Quantitative Characterization Of Proteomic Datasets Generated By Tandem Mass Spectrometry, Rachel Michelle Adams
Doctoral Dissertations
Shotgun proteomic experiments provide qualitative and quantitative analytical information from biological samples ranging in complexity from simple bacterial isolates to higher eukaryotes such as plants and humans and even to communities of microbial organisms. Improvements to instrument performance, sample preparation, and informatic tools are increasing the scope and volume of data that can be analyzed by mass spectrometry (MS). To accommodate these advances, it is becoming increasingly essential to choose and/or create tools that not only scale well but also make more informed decisions using additional features within the data. Incorporating novel and existing tools into …
Chromatin Insulators: Master Regulators Of The Eukaryotic Genome, Todd Andrew Schoborg
Doctoral Dissertations
Proper organization of the chromatin fiber within the three dimensional space of the eukaryotic nucleus relies on a number of DNA elements and their interacting proteins whose structural and functional consequences exert significant influence on genome behavior. Chromatin insulators are one such example, where it is thought that these elements assist in the formation of higher order chromatin loop structures by mediating long-range contacts between distant sites scattered throughout the genome. Such looping serves a dual role, helping to satisfy both the physical constraints needed to package the linear DNA polymer within the small volume of the nucleus while simultaneously …
Alcohol Biomarkers As Predictive Factors Of Rearrest In High Risk Repeat Offense Drunk Drivers, Brian Charles Kay
Theses and Dissertations
Alcohol biomarkers, naturally occurring molecules produced in response to alcohol consumption, are proving to be a valuable tool for objectively monitoring a person's alcohol consumption. Coupling this assessment tool with advances in computing power makes new and powerful predictions increasingly possible. In this retrospective study, data were first collected from a sample of 249 drivers who were convicted of driving under the influence and monitored over the course of a year by biomarker blood tests. These data were then analyzed using machine learning methods. TwoStep cluster analysis showed distinct drinking groups within the drivers who …
RNA-Sequencing Applications: Gene Expression Quantification And Methylator Phenotype Identification, Guoshuai Cai
Dissertations & Theses (Open Access)
My dissertation focuses on two aspects of RNA sequencing technology. The first is the methodology for modeling the overdispersion inherent in RNA-seq data for differential expression analysis. This aspect is addressed in three sections. The second aspect is the application of RNA-seq data to identify the CpG island methylator phenotype (CIMP) by integrating datasets of mRNA expression level and DNA methylation status.
Section 1: The cost of DNA sequencing has fallen dramatically in the past decade. Consequently, genomic research increasingly depends on sequencing technology. However, it remains unclear how the sequencing capacity influences the accuracy of mRNA expression measurement. We …
Single-Nucleotide Polymorphisms Associated With Performance Traits In Beef Cattle Grazing Endophyte-Infected Tall Fescue, Bryan Christopher Bastin
Masters Theses
Tall fescue (Lolium arundinaceum Schreb.) is the most prevalent forage in the Midsouth United States due in part to the presence of the endophytic fungus Neotyphodium coenophialum. The fungus, while conferring hardiness to tall fescue, contributes to decreased production efficiency in cow-calf operations. A previous genome-wide association study was performed using the Illumina 50k bovine SNP chip. Twenty-four SNP were found to be associated (P < 0.05) with adjusted birth weight and adjusted 205-day weights of calves from 48 beef cows at Ames Plantation. The first objective was to validate each SNP by testing associations with several additional phenotypes. Custom Taqman genotyping assays (Applied Biosystems, Foster City, CA) were subsequently designed to genotype each SNP in beef cattle located at Tennessee Tech University (n = 654), to validate associations in a large, independent herd. The results yielded 15 associations that were significant (P < 0.05) with 6 phenotypes linked to those affected by fescue toxicosis. The second objective investigated the link between fescue toxicosis and the XK, Kell blood group complex subunit-related, member 4 …
Array-Based Genomic Diversity Measures Portray Mus Musculus Phylogenetic And Genealogical Relationships, And Detect Genetic Variation Among C57BL/6J Mice And Between Tissues Of The Same Mouse, Susan T. Eitutis
Electronic Thesis and Dissertation Repository
Mouse models lack affordable genomic technologies, slowing the identification of candidate variants contributing to complex phenotypes. The Mouse Diversity Genotyping Array (MDGA) is a low-cost, high-resolution platform permitting genomic diversity assessment. Using a validated list of >500,000 single nucleotide polymorphisms (SNPs), we applied the first comprehensive analysis of SNP differences to detect genetic distance across 362 Mus musculus samples. Genetic distance measured between distantly and closely related mice correlates with known phylogeny and genealogy. Variation detected between C57BL/6J mice is consistent with previous reports of variants within this strain. Putative genetic variation detected between and within tissues indicates somatic …
Identification Of Cyclophilin Gene Family In Soybean And Characterization Of GmCYP1, Hemanta Raj Mainali
Electronic Thesis and Dissertation Repository
I identified members of the Cyclophilin (CYP) gene family in soybean (Glycine max) and characterized GmCYP1, one member of the soybean CYP family. CYPs belong to the immunophilin superfamily with peptidyl-prolyl cis-trans isomerase (PPIase) activity. PPIase catalyzes the interconversion of the cis- and trans-rotamers of the peptidyl-prolyl amide bond of peptides. After extensive data mining, I identified 62 different CYP genes in soybean (GmCYP1 to GmCYP62), of which 8 are multi-domain proteins and 54 are single-domain proteins. At least 25% of the GmCYP genes are expressed in soybean. GmCYP1 …
A Mathematical Model And Numerical Method For Thermoelectric DNA Sequencing, Liwei Shi
Doctoral Dissertations
DNA sequencing is the process of determining the precise order of nucleotide bases, adenine, guanine, cytosine, and thymine within a DNA molecule. It includes any method or technology that is used to determine the order of the four bases in a strand of DNA. The advent of rapid DNA sequencing methods has greatly accelerated biological and medical research and discovery. Thermoelectric DNA sequencing is a novel method to sequence DNA by measuring the heat that is released when DNA polymerase inserts a deoxyribonucleoside triphosphate into a growing DNA strand. The thermoelectric device for this project is composed of four parts: …
Genetic Approaches To Studying Complex Human Disease, Joseph B. Dube
Electronic Thesis and Dissertation Repository
Common, complex diseases such as cardiovascular disease (CVD) represent an intricate interaction between environmental and genetic factors and now account for the leading causes of mortality in western society. By investigating the genetic component of complex disease etiology, we have gained a better understanding of the biological pathways underlying complex disease and the heterogeneity of complex disease risk. However, the development of high throughput genomic technologies and large well-phenotyped multi-ethnic cohorts has opened the door towards more in-depth and trans-disciplinary approaches to studying the genetics of complex disease pathogenesis. Accordingly, we sought to investigate select complex traits and diseases using …
Genome Wide Search For Pseudo Knotted Non-Coding RNAs, Meghana S. Vasavada
Theses
Non-coding RNAs (ncRNAs) are functional RNA molecules that are involved in many biological processes including gene regulation, chromosome replication and RNA modification. Searching genomes using computational methods has become an important asset for prediction and annotation of ncRNAs. To annotate an individual genome for a specific family of ncRNAs, a computational tool is used to scan through the genome and align its sequence segments to some structure model for the ncRNA family. With the recent advances in detecting an ncRNA in the genome, heuristic techniques are designed to perform an accurate search and sequence-structure alignment. This study uses a …
RNA-Sequence Analysis Of Human Melanoma Cells, Jharna Miya
Theses
RNA-sequencing refers to the use of high throughput sequencing technologies that are used to sequence cDNA in order to get the complete information of a sample’s RNA content. The objective of this study is to analyze this data in different aspects and to characterize gene expression. Besides this characterization, the data was also used to investigate the effect of sequencing depth on gene expression measurements.
This research focuses on quantitative measurement of expression levels of genes and their transcripts. In this study, complementary DNA fragments of cultured human melanoma cells are sequenced and a total of 139,501,106 200-bp reads …
Performance Comparison Of Five RNA-Seq Alignment Tools, Yuanpeng Lu
Theses
Aligning millions of short reads to a reference genome is a critical task in high throughput sequencing. In recent years, a large number of mapping algorithms have been developed, all of which have in common that they align a vast number of reads to genomic or transcriptomic sequences. RNA-Seq data is discrete in nature; therefore, with reasonable gene models and comparative metrics, RNA-Seq data can be simulated to sufficient accuracy to enable meaningful benchmarking of alignment algorithms. To provide guidance in the choice of alignment algorithms, five different alignment tools for RNA-Seq data are evaluated. In order to compare the …
PolyASeeker: A Computational Framework For Identifying Polyadenylation Cleavage Site From RNA-Seq, Xiao Ling
Theses
Alternative polyadenylation (APA) of mRNA plays a crucial role in post-transcriptional gene regulation. Recently, advances in next generation sequencing technology have made it possible to efficiently characterize the transcriptome and identify the 3′ end of polyadenylated RNAs. However, no comprehensive bioinformatic pipelines have fulfilled this goal. PolyASeeker, a computational framework for identifying polyadenylation cleavage sites from RNA-Seq data, is proposed in this thesis. Using a simulated RNA-Seq dataset, a novel method is developed to evaluate the performance of the proposed framework versus the traditional A-stretch approach, and to compute accurate precision and recall values that previous estimation could not obtain. …
A GPU Program To Compute SNP-SNP Interactions In Genome-Wide Association Studies, Srividya Ramakrishnan
Theses
With the recent advances in next generation sequencing technologies, short read sequences of the human genome have become more accessible. Paired-end sequencing of short reads is currently the most sensitive method for detecting somatic mutations that arise during tumor development. In this study, a novel approach to optimize the detection of structural variants using a new short read alignment program is presented.
Pairwise interaction effects of Single Nucleotide Polymorphisms (SNPs) have proven to uncover the underlying complex disease traits. Computing the disease risk based on the interaction effects of SNPs in a case-control study is a …