Connecting language and knowledge with heterogeneous representations for neural relation extraction

This is a paper about relationship extraction on NAACL 2019. Connecting language and knowledge with heterogeneous representations for neural relation extraction

Problem

In the process of building a knowledge base, we usually extract the relationship between entities from sentences. If the entity already exists in the knowledge base, then we can use the knowledge in the knowledge base to improve the results of relation extraction.

The usual practice is to train two models, one is the RE model, and another is the KBE knowledge model(Knowledge Base Embedding). But there is little research to properly unify these models systematically.

Contribution

In this paper, a Heterogeneous REpresentations for neural Relation Extraction(HRERE) of RE and KBE is proposed. The framework unifies the RE model and the KBE model, and the framework can effectively enhance the relationship between the two. The gap between the language representions and knoledge representions can be reduced as much as possible leading to significant improvements over the state-of-the-art in RE.

Solution

The RE model uses a Bi-LSTM with multiple levels of attention mechanism to predict the ralationship between entity pairs. The KBE model borrows from ComplEx proposed by Trouillon et al in 2016, which can nudge the language model to agree with facts in the KB.

The framework introduces three loss functions, namely the RE model language representation loss, the KBE model knowledge representation loss, and the cross entropy loss of the two distributions.
J L = − 1 N ∑ i = 1 N log ⁡ p ( r i ∣ S i ; Θ ( L ) ) J_L = - \frac{1}{N}\sum^N_{i=1}\log p(r_i|S_i;\Theta^{(L)}) JL=N1i=1Nlogp(riSi;Θ(L))
J G = − 1 N ∑ i = 1 N log ⁡ p ( r i ∣ ( h i , t i ) Θ ( L ) ) J_G = - \frac{1}{N}\sum^N_{i=1}\log p(r_i|(h_i,t_i)\Theta^{(L)}) JG=N1i=1Nlogp(ri(hi,ti)Θ(L))
J D = − 1 N ∑ i = 1 N log ⁡ p ( r i ∗ ∣ S i ; Θ ( L ) ) J_D = - \frac{1}{N}\sum^N_{i=1}\log p(r_i^*|S_i;\Theta^{(L)}) JD=N1i=1Nlogp(riSi;Θ(L))
其中 r i ∗ = a r g max ⁡ r ∈ R ∪ N A p ( r ∣ ( h i , t i ) ; Θ ( G ) ) r_i^* = arg \max_{r\in R\cup{NA}}p(r|(h_i,t_i);\Theta^{(G)}) ri=argmaxrRNAp(r(hi,ti);Θ(G))
min ⁡ Θ J = J L + J G + J D + λ ∣ ∣ Θ ∣ ∣ 2 2 \min_{\Theta} J = J_L + J_G + J_D + \lambda||\Theta||_2^2 ΘminJ=JL+JG+JD+λΘ22

Understanding

The essence of this paper is to improve the results of relation extraction RE through the existing knowledge base. By training the KBE model on the existing knowledge base to form the knowledge representation, the RE model predicts the relationship between the entity pairs through the language model, so that the prediction results can be as close as possible to existing knowledge.

This paper describes and evaluates a novel neural framework for jointly learningrepresentations for RE and KBE tasks that uses a cross-entropy loss function to ensure both representations are learned together, resulting in significant improvements over the current state-of-theart for the RE task.

Limitation

In real-life scenarios, we often want to extract entity pairs and their relationships from a sentence, rather than extracting relationship by a given entity pair.

But this paper gives me ideas on how to extract relationships in the above scenarios. We can construct a new loss function to represent entity extraction and relationship extraction,
thus reducing the error propagation of the step model.

Reference

Connecting language and knowledge with heterogeneous representations for neural relation extraction

你可能感兴趣的:(关系抽取,论文)