Open Access. Powered by Scholars. Published by Universities.®
Physical Sciences and Mathematics Commons™
Open Access. Powered by Scholars. Published by Universities.®
- Institution
- Keyword
-
- Statistics (2)
- Bayesian statistics (1)
- Big data (1)
- Binding Sites (1)
- Birth-death process (1)
-
- Clustering (1)
- Community detection (1)
- Cryo-electron microscopy (1)
- Crystallography (1)
- Custom CDF (1)
- Data mining (1)
- Deep learning (1)
- Exosomal RNAs (1)
- Exosomes (1)
- Expression (1)
- Fossils (1)
- Gene expression (1)
- Genes (1)
- Graph algorithms (1)
- Information Theory (1)
- Interactome (1)
- Machine Learning (1)
- Machine learning (1)
- Macroevolution (1)
- Mass spectrometry (1)
- Metabolomics (1)
- MicroRNAs (1)
- Microarrays (1)
- Motif finding (1)
- Mutation (1)
- Publication
- Publication Type
Articles 1 - 9 of 9
Full-Text Articles in Physical Sciences and Mathematics
Bayesian Analytical Approaches For Metabolomics : A Novel Method For Molecular Structure-Informed Metabolite Interaction Modeling, A Novel Diagnostic Model For Differentiating Myocardial Infarction Type, And Approaches For Compound Identification Given Mass Spectrometry Data., Patrick J. Trainor
Electronic Theses and Dissertations
Metabolomics, the study of small molecules in biological systems, has enjoyed great success in enabling researchers to examine disease-associated metabolic dysregulation and has been utilized for the discovery biomarkers of disease and phenotypic states. In spite of recent technological advances in the analytical platforms utilized in metabolomics and the proliferation of tools for the analysis of metabolomics data, significant challenges in metabolomics data analyses remain. In this dissertation, we present three of these challenges and Bayesian methodological solutions for each. In the first part we develop a new methodology to serve a basis for making higher order inferences in metabolomics, …
Region Based Gene Expression Via Reanalysis Of Publicly Available Microarray Data Sets., Ernur Saka
Region Based Gene Expression Via Reanalysis Of Publicly Available Microarray Data Sets., Ernur Saka
Electronic Theses and Dissertations
A DNA microarray is a high-throughput technology used to identify relative gene expression. One of the most widely used platforms is the Affymetrix® GeneChip® technology which detects gene expression levels based on probe sets composed of a set of twenty-five nucleotide probes designed to hybridize with specific gene targets. Given a particular Affymetrix® GeneChip® platform, the design of the probes is fixed. However, the method of analysis is dynamic in nature due to the ability to annotate and group probes into uniquely defined groupings. This is particularly important since publicly available repositories of microarray datasets, such as ArrayExpress and NCBI’s …
Efficient Reduced Bias Genetic Algorithm For Generic Community Detection Objectives, Aditya Karnam Gururaj Rao
Efficient Reduced Bias Genetic Algorithm For Generic Community Detection Objectives, Aditya Karnam Gururaj Rao
Theses
The problem of community structure identification has been an extensively investigated area for biology, physics, social sciences, and computer science in recent years for studying the properties of networks representing complex relationships. Most traditional methods, such as K-means and hierarchical clustering, are based on the assumption that communities have spherical configurations. Lately, Genetic Algorithms (GA) are being utilized for efficient community detection without imposing sphericity. GAs are machine learning methods which mimic natural selection and scale with the complexity of the network. However, traditional GA approaches employ a representation method that dramatically increases the solution space to be searched by …
Computational Modelling Of Human Transcriptional Regulation By An Information Theory-Based Approach, Ruipeng Lu
Computational Modelling Of Human Transcriptional Regulation By An Information Theory-Based Approach, Ruipeng Lu
Electronic Thesis and Dissertation Repository
ChIP-seq experiments can identify the genome-wide binding site motifs of a transcription factor (TF) and determine its sequence specificity. Multiple algorithms were developed to derive TF binding site (TFBS) motifs from ChIP-seq data, including the entropy minimization-based Bipad that can derive both contiguous and bipartite motifs. Prior studies applying these algorithms to ChIP-seq data only analyzed a small number of top peaks with the highest signal strengths, biasing their resultant position weight matrices (PWMs) towards consensus-like, strong binding sites; nor did they derive bipartite motifs, disabling the accurate modelling of binding behavior of dimeric TFs.
This thesis presents a novel …
Prediction Of Lncrna-Disease Associations Based On Inductive Matrix Completion, Chengqian Lu, Mengyun Yang, Feng Luo, Fang-Xiang Wu, Min Li, Yi Pan, Yaohang Li, Jianxin Wang
Prediction Of Lncrna-Disease Associations Based On Inductive Matrix Completion, Chengqian Lu, Mengyun Yang, Feng Luo, Fang-Xiang Wu, Min Li, Yi Pan, Yaohang Li, Jianxin Wang
Computer Science Faculty Publications
Motivation: Accumulating evidences indicate that long non-coding RNAs (lncRNAs) play pivotal roles in various biological processes. Mutations and dysregulations of lncRNAs are implicated in miscellaneous human diseases. Predicting lncRNA–disease associations is beneficial to disease diagnosis as well as treatment. Although many computational methods have been developed, precisely identifying lncRNA–disease associations, especially for novel lncRNAs, remains challenging.
Results: In this study, we propose a method (named SIMCLDA) for predicting potential lncRNA– disease associations based on inductive matrix completion. We compute Gaussian interaction profile kernel of lncRNAs from known lncRNA–disease interactions and functional similarity of diseases based on disease–gene and gene–gene onotology …
The Fossilized Birth-Death Model For The Analysis Of Stratigraphic Range Data Under Different Speciation Modes, Tanja Stadler, Alexandra Gavryushkina, Rachel C. M. Warnock, Alexei J. Drummond, Tracy A. Heath
The Fossilized Birth-Death Model For The Analysis Of Stratigraphic Range Data Under Different Speciation Modes, Tanja Stadler, Alexandra Gavryushkina, Rachel C. M. Warnock, Alexei J. Drummond, Tracy A. Heath
Tracy Heath
A birth-death-sampling model gives rise to phylogenetic trees with samples from the past and the present. Interpreting “birth” as branching speciation, “death” as extinction, and “sampling” as fossil preservation and recovery, this model – also referred to as the fossilized birth-death (FBD) model – gives rise to phylogenetic trees on extant and fossil samples. The model has been mathematically analyzed and successfully applied to a range of datasets on different taxonomic levels, such as penguins, plants, and insects. However, the current mathematical treatment of this model does not allow for a group of temporally distinct fossil specimens to be assigned …
An Investigation Of Atomic Structures Derived From X-Ray Crystallography And Cryo-Electron Microscopy Using Distal Blocks Of Side-Chains, Lin Chen, Jing He, Salim Sazzed, Rayshawn Walker
An Investigation Of Atomic Structures Derived From X-Ray Crystallography And Cryo-Electron Microscopy Using Distal Blocks Of Side-Chains, Lin Chen, Jing He, Salim Sazzed, Rayshawn Walker
Computer Science Faculty Publications
Cryo-electron microscopy (cryo-EM) is a structure determination method for large molecular complexes. As more and more atomic structures are determined using this technique, it is becoming possible to perform statistical characterization of side-chain conformations. Two data sets were involved to characterize block lengths for each of the 18 types of amino acids. One set contains 9131 structures resolved using X-ray crystallography from density maps with better than or equal to 1.5 Å resolutions, and the other contains 237 protein structures derived from cryo-EM density maps with 2-4 Å resolutions. The results show that the normalized probability density function of block …
A Systematic Approach To Rna-Associated Motif Discovery, Tian Gao, Jiang Shu, Juan Cui
A Systematic Approach To Rna-Associated Motif Discovery, Tian Gao, Jiang Shu, Juan Cui
School of Computing: Faculty Publications
Background: Sequencing-based large screening of RNA-protein and RNA-RNA interactions has enabled the mechanistic study of post-transcriptional RNA processing and sorting, including exosome-mediated RNA secretion. The downstream analysis of RNA binding sites has encouraged the investigation of novel sequence motifs, which resulted in exceptional new challenges for identifying motifs from very short sequences (e.g., small non-coding RNAs or truncated messenger RNAs), where conventional methods tend to be ineffective. To address these challenges, we propose a novel motif-finding method and validate it on a wide range of RNA applications.
Results: We first perform motif analysis on microRNAs and longer RNA fragments from …
Recurrent Neural Networks And Their Applications To Rna Secondary Structure Inference, Devin Willmott
Recurrent Neural Networks And Their Applications To Rna Secondary Structure Inference, Devin Willmott
Theses and Dissertations--Mathematics
Recurrent neural networks (RNNs) are state of the art sequential machine learning tools, but have difficulty learning sequences with long-range dependencies due to the exponential growth or decay of gradients backpropagated through the RNN. Some methods overcome this problem by modifying the standard RNN architecure to force the recurrent weight matrix W to remain orthogonal throughout training. The first half of this thesis presents a novel orthogonal RNN architecture that enforces orthogonality of W by parametrizing with a skew-symmetric matrix via the Cayley transform. We present rules for backpropagation through the Cayley transform, show how to deal with the Cayley …