Open Access. Powered by Scholars. Published by Universities.®
Articles 1 - 4 of 4
Full-Text Articles in Other Computer Sciences
Enhancing Microbiome Host Disease Prediction With Variational Autoencoders, Celeste Manughian-Peter
Computational and Data Sciences (MS) Theses
Advancements in genetic sequencing methods for microbiomes in recent decades have permitted the collection of taxonomic and functional profiles of microbial communities, accelerating the discovery of the functional aspects of the microbiome and generating increased interest among clinicians in applying these techniques with patients. This advancement has coincided with software and hardware improvements in the field of machine learning and deep learning. Combined, these advancements suggest further potential for progress in disease diagnosis and treatment in humans. The ability to classify a human microbiome profile into a disease category, and additionally identify the differentiating factors within the profile between …
Novel Inference Methods For Generalized Linear Models Using Shrinkage Priors And Data Augmentation., Arinjita Bhattacharyya
Electronic Theses and Dissertations
Generalized linear models have broad applications in biostatistics and sociology. In a regression setup, the main target is to find a relevant set of predictors out of a large collection of covariates. Sparsity is the assumption that only a few of these covariates have a meaningful correlation with an outcome variate of interest. Sparsity is incorporated by shrinking the irrelevant slopes towards zero while leaving the relevant predictors unchanged and the resulting inferences intact. Frequentist variable selection and sparsity are addressed by popular techniques like Lasso and Elastic Net. Bayesian penalized regression can tackle the curse of …
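As a rough illustration of the shrinkage idea mentioned in this abstract, the sketch below applies the soft-thresholding rule, which is the Lasso solution for a single slope when the predictors are orthonormal. It is not the thesis's Bayesian method; the slope values and the penalty are made-up assumptions chosen only to show small slopes being set to zero.

```python
def soft_threshold(z, lam):
    """Lasso estimate of one slope under an orthonormal design (assumed setup)."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0


# Hypothetical least-squares slope estimates (illustrative values only).
ols_slopes = [2.5, -0.3, 0.1, -1.8, 0.05]
penalty = 0.5  # assumed regularization strength

# Slopes whose magnitude is at most the penalty become exactly zero.
lasso_slopes = [soft_threshold(b, penalty) for b in ols_slopes]
n_selected = sum(1 for b in lasso_slopes if b != 0.0)
print(lasso_slopes, n_selected)
```

Note that in this sketch the two large slopes are also pulled toward zero by the penalty amount, so the rule does not leave the relevant predictors fully unchanged.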
Computational Modelling Of Human Transcriptional Regulation By An Information Theory-Based Approach, Ruipeng Lu
Electronic Thesis and Dissertation Repository
ChIP-seq experiments can identify the genome-wide binding site motifs of a transcription factor (TF) and determine its sequence specificity. Multiple algorithms have been developed to derive TF binding site (TFBS) motifs from ChIP-seq data, including the entropy minimization-based Bipad, which can derive both contiguous and bipartite motifs. Prior studies applying these algorithms to ChIP-seq data analyzed only a small number of top peaks with the highest signal strengths, biasing their resultant position weight matrices (PWMs) towards consensus-like, strong binding sites; nor did they derive bipartite motifs, which prevented accurate modelling of the binding behavior of dimeric TFs.
This thesis presents a novel …
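To make the information-theoretic view of binding site motifs concrete, here is a minimal sketch that computes the per-position information content (in bits) of a set of aligned sites. It is not Bipad and not the thesis's method; it assumes a uniform background over A, C, G, T, omits any small-sample correction, and the example sites are invented.

```python
import math
from collections import Counter


def column_information(bases):
    """Information content in bits of one alignment column (uniform background assumed)."""
    counts = Counter(bases)
    total = sum(counts.values())
    entropy = 0.0
    for n in counts.values():
        p = n / total
        entropy -= p * math.log2(p)
    # Maximum entropy for four nucleotides is 2 bits.
    return 2.0 - entropy


def motif_information(sites):
    """Per-position information content for equal-length aligned sites."""
    return [column_information(col) for col in zip(*sites)]


# Hypothetical aligned binding sites (illustrative only).
example_sites = ["ACGT", "ACGA", "ACCT", "ACTC"]
per_position = motif_information(example_sites)
print(per_position)
```

Fully conserved positions score 2 bits and positions with no base preference score 0 bits, so summing the list gives a single score for how far the motif departs from random sequence under these assumptions.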
A Machine Learning Approach To Diagnosis Of Parkinson’s Disease, Sumaiya F. Hashmi
CMC Senior Theses
I will investigate applications of machine learning algorithms to medical data, adaptations to differences in data collection, and the use of ensemble techniques.
Focusing on the binary classification problem of Parkinson’s Disease (PD) diagnosis, I will apply machine learning algorithms to a primary dataset consisting of voice recordings from healthy and PD subjects. Specifically, I will use Artificial Neural Networks, Support Vector Machines, and an Ensemble Learning algorithm to reproduce results from [MS12] and [GM09].
Next, I will adapt a secondary regression dataset of PD recordings and combine it with the primary binary classification dataset, testing various techniques to consolidate …