A Roundup of Event Extraction Datasets

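Hi everyone! In my recent research I have noticed that the datasets available for event extraction are becoming more and more varied, so this post collects them in one place: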

Sentence-level EE

  • ACE2005

    • Link
  • KBP2017

    • Link
  • MAVEN

    • Time: EMNLP2020
    • Paper: MAVEN: A Massive General Domain Event Detection Dataset
    • Link: https://github.com/THU-KEG/MAVEN-dataset
  • FewED

    • Time: WSDM2020
    • Paper: Meta-Learning with Dynamic-Memory-Based Prototypical Network for Few-Shot Event Detection
    • Link: https://github.com/231sm/Low_Resource_KBP
  • FMC

    • Time: IJCAI2020
    • Paper: F-HMTC: Detecting Financial Events for Investment Decisions Based on Neural Hierarchical Multi-Label Text Classification
    • Link: https://github.com/finint/F-HMTC
  • CySecED

    • Time: EMNLP2020
    • Paper: Introducing a New Dataset for Event Detection in Cybersecurity Texts
  • CASIE

    • Time: AAAI2020
    • Paper: CASIE: Extracting Cybersecurity Event Information from Text
    • Link: https://github.com/Ebiquity/CASIE
  • Dialogue EE

    • Time: Proceedings of the First Joint Workshop on Narrative Understanding, Storylines, and Events, 2020
    • Paper: Automatic extraction of personal events from dialogue
    • Link: https://www.artie.com/data/personaleventsindialogue/
  • Commodity News Corpus for Event Extraction

    • Time: 2021
    • Paper: An Annotated Commodity News Corpus for Event Extraction
    • Link: https://github.com/meisin/Commodity-News-Event-Extraction
  • Few-shot Financial Chinese event extraction dataset

    • Link: https://github.com/TimeBurningFish/FewFC
  • DuEE

    • Time: NLPCC2020
    • Paper: DuEE: A Large-Scale Dataset for Chinese Event Extraction in Real-World Scenarios
    • Link: https://ai.baidu.com/broad/subordinate?dataset=duee
  • Genia Event Extraction (GE)

    • Time: 2011
    • Link: http://bionlp-st.dbcls.jp/GE/2011/eval-test/
  • TimeBank

    • Link: https://catalog.ldc.upenn.edu/LDC2006T08
  • LitBank

    • Time: ACL2019
    • Paper: Literary Event Detection (https://aclanthology.org/P19-1353/)
    • Link: https://github.com/dbamman/litbank

Doc-level EE

  • MUC4

    • Link
  • DCFEE

    • Time: ACL2018
    • Paper: DCFEE: A Document-level Chinese Financial Event Extraction System based on Automatically Labeled Training Data
    • Link: https://github.com/yanghang111/DCFEE
  • ChFinAnn

    • Time: EMNLP2019
    • Paper: Doc2EDAG: An End-to-End Document-level Framework for Chinese Financial Event Extraction
    • Link: https://github.com/dolphin-zs/Doc2EDAG
  • RAMS

    • Time: ACL2020
    • Paper: Multi-Sentence Argument Linking
    • Link: https://nlp.jhu.edu/rams/
  • WIKIEVENTS

    • Time: NAACL2021
    • Paper: Document-Level Event Argument Extraction by Conditional Generation
    • Link: https://github.com/raspberryice/gen-arg

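This roundup is aimed mainly at current event extraction tasks, and the same list is kept in sync on GitHub. If anything is missing or wrong, feel free to get in touch and discuss!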
