Open Access. Powered by Scholars. Published by Universities.®

Medicine and Health Sciences Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 2 of 2

Full-Text Articles in Medicine and Health Sciences

A Novel Methodology To Identify The Primary Topics Contained Within The Covid-19 Research Corpus, Allen Crane, Brock Freidrich, William Fehlman, Igor Frolow, Daniel W. Engels Aug 2020

A Novel Methodology To Identify The Primary Topics Contained Within The Covid-19 Research Corpus, Allen Crane, Brock Freidrich, William Fehlman, Igor Frolow, Daniel W. Engels

SMU Data Science Review

In this paper, we present a novel framework and system for the identification of primary research topics from within a corpus of related publications, the classification of individual publications according to these topics, and the results of the application of our framework and system to the COVID-19 Open Research Dataset (CORD-19). CORD-19 is a corpus of published peer reviewed and pre-peer reviewed articles related to the coronavirus that causes COVID-19. Using machine learning techniques, such as Non-negative Matrix Factorization for Natural Language Processing and a Bayesian classifier, we developed a novel framework and system that automatically extracts sparse and meaningful …


Predicting Premature Birth Risk With Cfrna, Jason Lin, Jonathan Marin, John Santerre Aug 2019

Predicting Premature Birth Risk With Cfrna, Jason Lin, Jonathan Marin, John Santerre

SMU Data Science Review

Identifying which genes are early indicators for preterm births using cell-free ribonucleic acid (cfRNA) from non-invasive blood tests provided by pregnant women can improve prenatal care. Currently, there are no medical tests for early detection of preterm birth risk in routine checkups for pregnant women. Recent studies have shown potential genes that can predict preterm birth. Machine learning techniques are utilized to see if the Area Under the Curve (AUC) can be improved upon when evaluating the prediction accuracy for chosen genes sequences and concentrations. Using cell-free RNA data from non-invasive blood tests in conjunction with machine learning, we improve …