Putting Semantic Information Extraction on the Map: Noisy Label Models for Fact Extraction

Authors

Chris Pal

Gideon Mann

Richard Minerich

Track:

Contents

Downloads:

Download PDF

Abstract:

Geographic indexing is a powerful and effective way to organize information on the web, but the use of standardized location tags is not widespread. Therefore, there is considerable interest in using machine learning approaches to automatically obtain semantic associations involving geographic locations from processing unstructured natural language text. While it is often impractical or expensive to obtain training labels, there are often ways to obtain noisy labels. We present a novel discriminative approach using a hidden variable model suitable for learning with noisy labels and apply it to extracting location relationships from natural language. We examine the problem of associating events with locations, where simple keyword matching produces a small number of positive examples within many false positives. Compared to a state-of-the-art baseline, our method doubles the precision of extracting semantic information while maintaining the same recall.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.