Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Molecular Biology

Washington University in St. Louis

Theses/Dissertations

Algorithm

Robust Algorithms For Detecting Hidden Structure In Biological Data, Roman Sloutsky Aug 2017

Arts & Sciences Electronic Theses and Dissertations

Biological data, such as molecular abundance measurements and protein sequences, harbor complex hidden structure that reflects the underlying biological mechanisms. For example, high-throughput abundance measurements provide a snapshot of the global state of a living cell, while homologous protein sequences encode the residue-level logic of the proteins' function and provide a snapshot of the evolutionary trajectory of the protein family. In this work I describe algorithmic approaches and analysis software I developed for uncovering hidden structure in both kinds of data.

Clustering is an unsupervised machine learning technique commonly used to map the structure of data collected in high-throughput experiments, such …