Open Access. Powered by Scholars. Published by Universities.®
- Publication Type
Articles 1 - 2 of 2
Full-Text Articles in Engineering
C.A.R.E (Cohort Assessment & Retrieval Environment), Kyle Ellis, Payal Shah, Jordan Tang
C.A.R.E (Cohort Assessment & Retrieval Environment), Kyle Ellis, Payal Shah, Jordan Tang
Capstone Design Expo Posters
The purpose of clinical trials is to explore whether a medical treatment is safe and effective for humans or to enhance preexisting methods. The identification of patients who satisfy a set of predefined criteria for the trial is instrumental. However, the process of distinguishing these patients on the basis of their clinical records is a challenging task since it can have structured (ex: precise measurements) and unstructured data (ex: physician notes). One difficulty with this is data normalization; there are many ways to describe a single concept. For example, “heart attack” and “myocardial infarction” both refer to the death of …
Parsing Metamap Files In Hadoop, Amy Olex, Alberto Cano, Bridget T. Mcinnes
Parsing Metamap Files In Hadoop, Amy Olex, Alberto Cano, Bridget T. Mcinnes
Computer Science Publications
The UMLS::Association CUICollector module identifies UMLS Concept Unique Identifier bigrams and their frequencies in a biomedical text corpus. CUICollector was re-implemented in Hadoop MapReduce to improve algorithm speed, flexibility, and scalability. Evaluation of the Hadoop implementation compared to the serial module produced equivalent results and achieved a 28x speedup on a single-node Hadoop system.