Distant Supervision for Relation Extraction with Linear Attenuation Simulation and Non-IID Relevance Embedding

Authors

  • Changsen Yuan Beijing Institute of Technology
  • Heyan Huang Beijing Institute of Technology
  • Chong Feng Beijing Institute of Technology
  • Xiao Liu Beijing Institute of Technology
  • Xiaochi Wei Baidu Inc.

DOI:

https://doi.org/10.1609/aaai.v33i01.33017418

Abstract

Distant supervision for relation extraction is an efficient method to reduce labor costs and has been widely used to seek novel relational facts in large corpora, which can be identified as a multi-instance multi-label problem. However, existing distant supervision methods suffer from selecting important words in the sentence and extracting valid sentences in the bag. Towards this end, we propose a novel approach to address these problems in this paper. Firstly, we propose a linear attenuation simulation to reflect the importance of words in the sentence with respect to the distances between entities and words. Secondly, we propose a non-independent and identically distributed (non-IID) relevance embedding to capture the relevance of sentences in the bag. Our method can not only capture complex information of words about hidden relations, but also express the mutual information of instances in the bag. Extensive experiments on a benchmark dataset have well-validated the effectiveness of the proposed method.

Downloads

Published

2019-07-17

How to Cite

Yuan, C., Huang, H., Feng, C., Liu, X., & Wei, X. (2019). Distant Supervision for Relation Extraction with Linear Attenuation Simulation and Non-IID Relevance Embedding. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 7418-7425. https://doi.org/10.1609/aaai.v33i01.33017418

Issue

Section

AAAI Technical Track: Natural Language Processing