Open Access. Powered by Scholars. Published by Universities.®
- Publication Type
Articles 1 - 2 of 2
Full-Text Articles in Engineering
Nlp@Vcu: Crop Characteristic Extraction Framework, Cora Lewis, Bridget Mcinnes, Getiria Onsongo
Nlp@Vcu: Crop Characteristic Extraction Framework, Cora Lewis, Bridget Mcinnes, Getiria Onsongo
Summer REU Program
We developed a crop characteristic extraction framework. Starting from a custom SpaCy named entity recognition model, we added pre-trained word embeddings and a part-of-speech based entity expansion post-processing step. Then, we implemented an evaluation framework that functioned as a 5-fold cross validation wrapper for SpaCy custom training. Preliminary results showed improvement in the extraction framework after these additions.
An Annotated Corpus With Nanomedicine And Pharmacokinetic Parameters, Nastassja Lewinski, Ivan Jimenez, Bridget Mcinnes
An Annotated Corpus With Nanomedicine And Pharmacokinetic Parameters, Nastassja Lewinski, Ivan Jimenez, Bridget Mcinnes
Chemical and Life Science Engineering Publications
A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP) approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction methods which employ machine learning. Although corpora are available for pharmaceuticals, resources for nanomedicines and nanotechnology are still limited. To foster nanotechnology text mining (NanoNLP) efforts, we have constructed a corpus of annotated drug product inserts taken from the US Food and Drug Administration’s Drugs@FDA online database. In this work, we present the development of the Engineered Nanomedicine Database corpus to support the …