Open Access. Powered by Scholars. Published by Universities.®

Social and Behavioral Sciences Commons

Purdue University

2019

Geographic Information Sciences

Purdue University Libraries Open Access Publishing Fund

Full-Text Articles in Social and Behavioral Sciences

GeoAnnotator: A Collaborative Semi-Automatic Platform for Constructing Geo-Annotated Text Corpora, Morteza Karimzadeh, Alan M. MacEachren, Mar 2019

Ground-truth datasets are essential for the training and evaluation of any automated algorithm. As such, gold-standard annotated corpora underlie most advances in natural language processing (NLP). However, only a few relatively small (geo-)annotated datasets are available for geoparsing, i.e., the automatic recognition and geolocation of place references in unstructured text. The creation of geoparsing corpora that include both the recognition of place names in text and the matching of those names to toponyms in a geographic gazetteer (a process we call geo-annotation) is a laborious, time-consuming, and expensive task. The field lacks efficient geo-annotation tools to support corpus building and lacks …