Recommended Citation

Karimzadeh, M.; MacEachren, A.M. GeoAnnotator: A Collaborative Semi-Automatic Platform for Constructing Geo-Annotated Text Corpora. ISPRS Int. J. Geo-Inf. 2019, 8, 161.

DOI

/10.3390/ijgi8040161

Date of this Version

3-27-2019

Keywords

Geoparsing; iterative design; design guidelines; annotation; corpus; spatial linguistics; geographic information retrieval

Abstract

Ground-truth datasets are essential for the training and evaluation of any automated algorithm. As such, gold-standard annotated corpora underlie most advances in natural language processing (NLP). However, only a few relatively small (geo-)annotated datasets are available for geoparsing, i.e., the automatic recognition and geolocation of place references in unstructured text. The creation of geoparsing corpora that include both the recognition of place names in text and matching of those names to toponyms in a geographic gazetteer (a process we call geo-annotation), is a laborious, time-consuming and expensive task. The field lacks efficient geo-annotation tools to support corpus building and lacks design guidelines for the development of such tools. Here, we present the iterative design of GeoAnnotator, a web-based, semi-automatic and collaborative visual analytics platform for geo-annotation. GeoAnnotator facilitates collaborative, multi-annotator creation of large corpora of geo-annotated text by generating computationally-generated pre-annotations that can be improved by human-annotator users. The resulting corpora can be used in improving and benchmarking geoparsing algorithms as well as various other spatial language-related methods. Further, the iterative design process and the resulting design decisions can be used in annotation platforms tailored for other application domains of NLP.

Download

Find in your library

Included in

Geographic Information Sciences Commons, Linguistics Commons, Spatial Science Commons

COinS

Purdue University Libraries Open Access Publishing Fund

GeoAnnotator: A Collaborative Semi-Automatic Platform for Constructing Geo-Annotated Text Corpora

Recommended Citation

DOI

Date of this Version

Keywords

Abstract

Included in

Search

Links

Links for Authors

Browse

Links

Purdue University Libraries Open Access Publishing Fund

GeoAnnotator: A Collaborative Semi-Automatic Platform for Constructing Geo-Annotated Text Corpora

Authors

Recommended Citation

DOI

Date of this Version

Keywords

Abstract

Included in

Share

Search

Links

Links for Authors

Browse

Links