Open Access. Powered by Scholars. Published by Universities.®

Life Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Genetics and Genomics

Arts & Sciences Electronic Theses and Dissertations

Theses/Dissertations

Machine Learning

Publication Year

Articles 1 - 2 of 2

Full-Text Articles in Life Sciences

Knowledge Driven Approaches And Machine Learning Improve The Identification Of Clinically Relevant Somatic Mutations In Cancer Genomics, Benjamin John Ainscough Dec 2017

Knowledge Driven Approaches And Machine Learning Improve The Identification Of Clinically Relevant Somatic Mutations In Cancer Genomics, Benjamin John Ainscough

Arts & Sciences Electronic Theses and Dissertations

For cancer genomics to fully expand its utility from research discovery to clinical adoption, somatic variant detection pipelines must be optimized and standardized to ensure identification of clinically relevant mutations and to reduce laborious and error-prone post-processing steps. To address the need for improved catalogues of clinically and biologically important somatic mutations, we developed DoCM, a Database of Curated Mutations in Cancer (http://docm.info), as described in Chapter 2. DoCM is an open source, openly licensed resource to enable the cancer research community to aggregate, store and track biologically and clinically important cancer variants. DoCM is currently comprised of 1,364 variants …


Application Of Genomic Technologies To Study Infertility, Nicholas Rui Yuan Ho May 2016

Application Of Genomic Technologies To Study Infertility, Nicholas Rui Yuan Ho

Arts & Sciences Electronic Theses and Dissertations

An estimated one in eight couples in the United States are diagnosed with infertility. There is a significant genetic contribution to infertility, with estimates of heritability ranging from 0.2 to 0.5. We know surprisingly little about the genetic causes, with only slightly more than a hundred genes known to cause human infertility. I have been translating recent advances in genomics to study infertility in a more efficient manner, in order to improve our knowledge of the genetic causes. By using high throughput genomics and proteomics datasets from other groups, I was able to feed that into a machine learning algorithm …