Re55: Reading the paper Entities as Experts: Sparse Memory Access with Entity Supervision


Paper title: Entities as Experts: Sparse Memory Access with Entity Supervision
Model name: Entities as Experts (EaE)

arXiv URL: https://arxiv.org/abs/2004.07202

This is an EMNLP 2020 paper. The authors are from Google.
True to the style of Google papers, it is very hard to read.

The core idea of EaE is to learn entity representations from text and integrate them into a language model (LM) for QA tasks.


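EaE builds an independent representation for each entity and then uses these representations for QA.
Training relies on two signals: ① predicting entities with MLM; ② retrieving the correct memory for each entity (supervised with an off-the-shelf entity recognition tool and Wikipedia hyperlinks).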


Contents

  • 1. Model formulation
  • 2. Experiments

1. Model formulation


Entity Memory Layer

Pseudo-entity representation: built from the representations of the mention's first and last tokens.

Find the K nearest neighbors of the pseudo-entity representation in the entity embedding table, then take their weighted sum.
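The lookup described above can be sketched in a few lines of plain Python. This is only an illustration, not the paper's implementation: the function names `pseudo_entity` and `memory_lookup`, the projection matrix `W`, the use of dot-product similarity, and the softmax weighting over the K retrieved entries are all assumptions made for the example.

```python
import math

def pseudo_entity(h_start, h_end, W):
    """Project the concatenated start/end token states into entity space.

    h_start, h_end: lists of floats (hidden states of the mention's first
    and last tokens). W: hypothetical projection matrix, one row per
    output dimension.
    """
    concat = h_start + h_end  # list concatenation, length 2 * hidden_dim
    return [sum(w * x for w, x in zip(row, concat)) for row in W]

def memory_lookup(query, entity_table, k):
    """Softmax-weighted sum of the k entity embeddings most similar to query.

    Similarity is the dot product (an assumption). Returns the aggregated
    vector, the indices of the selected entities, and their weights.
    """
    scores = [sum(q * e for q, e in zip(query, emb)) for emb in entity_table]
    top = sorted(range(len(entity_table)), key=lambda i: scores[i], reverse=True)[:k]
    m = max(scores[i] for i in top)  # subtract the max for numerical stability
    weights = [math.exp(scores[i] - m) for i in top]
    z = sum(weights)
    weights = [w / z for w in weights]
    dim = len(entity_table[0])
    out = [sum(w * entity_table[i][d] for w, i in zip(weights, top)) for d in range(dim)]
    return out, top, weights

# Toy usage: three 2-d entity embeddings, a 2-d hidden state per token.
entity_table = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
W = [[1.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 1.0]]
query = pseudo_entity([2.0, 0.5], [0.3, 0.0], W)
vector, selected, weights = memory_lookup(query, entity_table, k=2)
```

Only K entries of the table contribute to the output, which is what makes the memory access sparse: the cost of the weighted sum does not grow with the number of entities, although this naive sketch still scores every entity to find the top K.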

Task-Specific Heads
TokenPred and EntityPred (EntityPred picks the entity whose embedding is closest to the pseudo-entity representation)

Inference-time Mention Detection
A mention detection layer predicts entity mentions with BIO tagging.
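To make the BIO step concrete, here is a minimal sketch of turning a predicted tag sequence into mention spans. The function name `bio_to_spans` and the handling of a stray "I" tag (treated as the start of a new span) are assumptions for illustration; the paper's own decoding may differ.

```python
def bio_to_spans(tags):
    """Convert a BIO tag sequence into inclusive (start, end) token spans."""
    spans = []
    start = None  # start index of the mention currently being read, if any
    for i, tag in enumerate(tags):
        if tag == "B":
            if start is not None:      # a new mention closes the previous one
                spans.append((start, i - 1))
            start = i
        elif tag == "I":
            if start is None:          # stray "I": treat it as a mention start
                start = i
        else:                          # "O" closes any open mention
            if start is not None:
                spans.append((start, i - 1))
                start = None
    if start is not None:              # mention that runs to the end of the sequence
        spans.append((start, len(tags) - 1))
    return spans

spans = bio_to_spans(["O", "B", "I", "O", "B", "B", "I"])
```

Each resulting span supplies the first and last token positions of a mention, which are the positions needed to build the pseudo-entity representation.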

Loss function
The training loss combines (1) a mention boundary detection loss, (2) an entity linking loss, and (3) a masked language modeling loss.
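As a rough sketch of how the three terms could be combined, the example below treats each one as a mean softmax cross-entropy and adds them without weights. Both choices are assumptions made for illustration, as are the names `cross_entropy`, `mean_ce`, and `eae_loss`; the notes above only list the three terms.

```python
import math

def cross_entropy(logits, target):
    """Negative log-probability of class `target` under softmax(logits)."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]

def mean_ce(examples):
    """Average cross-entropy over a list of (logits, target) pairs."""
    return sum(cross_entropy(logits, target) for logits, target in examples) / len(examples)

def eae_loss(mention_examples, linking_examples, mlm_examples):
    """Unweighted sum of the three loss terms (the weighting is an assumption)."""
    return mean_ce(mention_examples) + mean_ce(linking_examples) + mean_ce(mlm_examples)

# Toy usage: one example per term.
loss = eae_loss(
    mention_examples=[([0.0, 0.0, 0.0], 1)],  # B/I/O logits for one token
    linking_examples=[([0.0, 0.0], 0)],       # scores over two candidate entities
    mlm_examples=[([0.0, 0.0, 0.0, 0.0], 3)], # scores over a four-word vocabulary
)
```

With uniform logits each term reduces to the log of its number of classes, which makes the toy value easy to check by hand.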

2. Experiments

Downstream tasks: cloze knowledge probes, open-domain question answering, and relation extraction
