University of Cambridge > Talks.cam > NLIP Seminar Series > Processing Geographic Language

Processing Geographic Language

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact Johanna Geiss.

Humans are able to communicate geographic information in a highly concise but vague manner, posing interesting challenges for natural language understanding. In recent years, information extraction systems have been developed to ground geographical references in text in terms of geo-coordinates, with the tags produced by such systems being used by geographical search engines and mapping tools. However, without a standard for how different types of geographical entities should be tagged, such systems are impossible to reliably evaluate. I will describe an annotation scheme called SpatialML, that has been used to accurately mark up places, their geo-coordinates, and spatial relationships in a variety of text corpora. SpatialML represents spatial relationships among geographical regions in terms of the Region Connection Calculus (RCC), and it has also been mapped to the Generalized Upper Model (GUM) ontology from the University of Bremen. SpatialML is also being used in the Cross-Language Evaluation Forum (CLEF) to assess tools to analyze geographical queries posed to search engines, and it is currently being integrated with a time markup standard (TimeML). Despite these positive trends, I will argue that a far more concerted research effort is required to address thefundamental challenges of geographic language.

Dr. Inderjeet Mani is a Visiting Fellow at Cambridge. He has been a Senior Principal Scientist at MITRE (in Boston), a Research Scholar at Brandeis University, a Research Affiliate at MIT , and a (tenured) Associate Professor at Georgetown University. His research areas in natural language processing include automatic summarization, temporal and spatial information extraction, and narrative understanding. More information can be found at www.cs.brandeis.edu/~im5.

This talk is part of the NLIP Seminar Series series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2020 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity