Open Access. Powered by Scholars. Published by Universities.®

Engineering Commons

Physical Sciences and Mathematics

Virginia Commonwealth University

Natural language processing

Full-Text Articles in Engineering

NLP@VCU: Crop Characteristic Extraction Framework, Cora Lewis, Bridget McInnes, Getiria Onsongo, Jan 2022

Summer REU Program

We developed a crop characteristic extraction framework. Starting from a custom spaCy named entity recognition model, we added pre-trained word embeddings and a part-of-speech-based entity expansion post-processing step. We then implemented an evaluation framework that wraps spaCy custom training in 5-fold cross-validation. Preliminary results showed that these additions improved the extraction framework.
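
As an illustration of the evaluation step described above, a k-fold cross-validation wrapper around an entity extractor might look roughly like the following. This is a minimal sketch, not the authors' code: the names `k_fold_splits`, `prf`, `cross_validate`, and the `train_fn` callback are hypothetical, and the scoring shown (exact-match, micro-averaged precision, recall, and F1 per fold) is an assumption rather than the metric reported in the abstract. In a real pipeline, `train_fn` would presumably wrap spaCy training and return a function that predicts entities for a text.

```python
import random
from typing import Callable, List, Set, Tuple

# Assumed data layout (hypothetical): an entity is (start_char, end_char, label),
# and an example pairs a text with its set of gold entities.
Entity = Tuple[int, int, str]
Example = Tuple[str, Set[Entity]]
Predictor = Callable[[str], Set[Entity]]


def k_fold_splits(n_items: int, k: int = 5, seed: int = 0) -> List[Tuple[List[int], List[int]]]:
    """Return k (train_indices, test_indices) pairs; each item is tested exactly once."""
    indices = list(range(n_items))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    return [
        ([idx for j, fold in enumerate(folds) if j != i for idx in fold], folds[i])
        for i in range(k)
    ]


def prf(gold: set, predicted: set) -> Tuple[float, float, float]:
    """Exact-match precision, recall, and F1 between two sets of entities."""
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def cross_validate(
    examples: List[Example],
    train_fn: Callable[[List[Example]], Predictor],
    k: int = 5,
    seed: int = 0,
) -> List[Tuple[float, float, float]]:
    """Train on k-1 folds, score on the held-out fold, and return one score per fold."""
    scores = []
    for train_idx, test_idx in k_fold_splits(len(examples), k, seed):
        # train_fn is a hypothetical callback: it trains a model and returns a predictor.
        predict = train_fn([examples[i] for i in train_idx])
        gold, predicted = set(), set()
        for i in test_idx:
            text, entities = examples[i]
            # Prefix each entity with its document index so spans from
            # different documents never collide in the pooled sets.
            gold |= {(i, *entity) for entity in entities}
            predicted |= {(i, *entity) for entity in predict(text)}
        scores.append(prf(gold, predicted))
    return scores
```

Averaging the per-fold F1 values gives a single summary figure, while the spread across folds gives a rough sense of how sensitive the model is to the particular training split.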


An Annotated Corpus With Nanomedicine And Pharmacokinetic Parameters, Nastassja Lewinski, Ivan Jimenez, Bridget McInnes, Jan 2017

Chemical and Life Science Engineering Publications

A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP) approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction methods which employ machine learning. Although corpora are available for pharmaceuticals, resources for nanomedicines and nanotechnology are still limited. To foster nanotechnology text mining (NanoNLP) efforts, we have constructed a corpus of annotated drug product inserts taken from the US Food and Drug Administration’s Drugs@FDA online database. In this work, we present the development of the Engineered Nanomedicine Database corpus to support the …