ACL2021主会议论文汇总及分类

转发自公众号:NLP工作站   知乎 刘聪NLP

主要包括10个分类,如下:(1)预训练语言模型及应用(58篇);(2)表征学习(9篇);(3)问答及检索(42篇);(4)文本生成(29篇);(5)摘要(23篇);(6)小样本(16篇);(7)对话(32篇);(8)情感及情绪分析(15篇);(9)信息抽取(60篇);(10)其他(21篇)。

一、预训练语言模型及应用

Long

(1)How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models

https://arxiv.org/abs/2012.15613

(2)Meta-KD: A Meta Knowledge Distillation Framework for Language ModelCompression across Domains

https://arxiv.org/abs/2012.01266

(3)How is BERT surprised? Layerwise detection of linguistic anomalies

https://arxiv.org/abs/2105.07452

(4)Super Tickets in Pre-Trained Language Models: From Model Compressionto Improving Generalization

https://arxiv.org/abs/2105.12002

(5)R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling

(6)IrEne: Interpretable Energy Prediction for Transformers

https://arxiv.org/abs/2106.01199

(7)GhostBERT: Generate More Features with Cheap Operations for BERT

(8)Syntax-Enhanced Pre-trained Model

https://arxiv.org/abs/2012.14116

(9)PLOME: Pre-training with Misspelled Knowledge for Chinese SpellingCorrection

(10)EnsLM: Ensemble Language Model for Data Diversity by SemanticClustering

(11)StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling

https://arxiv.org/abs/2012.00857

(12)Convolutions and Self-Attention: Re-interpreting Relative Positionsin Pre-trained Language Models

https://arxiv.org/abs/2106.05505

(13)Implicit Representations of Meaning in Neural Language Model

https://arxiv.org/abs/2106.00737

(14)ERICA: Improving Entity and RelationUnderstanding for Pre-trained Language Models via Contrastive Learning

https://arxiv.org/abs/2012.15022

(15)Improving Formality Style Transfer with Context-Aware Rule Injection

https://arxiv.org/abs/2106.00210

(16)BinaryBERT: Pushing the Limit of BERT Quantization

https://arxiv.org/abs/2012.15701

(17)Shortformer: Better Language Modeling using Shorter Inputs

https://arxiv.org/abs/2012.15832

(18)Making Pre-trained Language Models Better Few-shot Learners

https://arxiv.org/abs/2012.15723

(19)ChineseBERT: Chinese Pretraining Enhanced by Glyph and PinyinInformation

(20)Are Pretrained Convolutions Better than Pretrained Transformers?

https://arxiv.org/abs/2105.03322

(21)ERNIE-Doc: A Retrospective Long-Document Modeling Transformer

https://arxiv.org/abs/2012.15688

(22)LeeBERT: Learned Early Exit for BERT with cross-level optimization

(23)Positional Artefacts Propagate Through Masked Language Model Embeddings

https://arxiv.org/abs/2011.04393

(24)Optimizing Deeper Transformers on Small Datasets

https://arxiv.org/abs/2012.15355

(25)When Do You Need Billions of Words of Pretraining Data?

https://arxiv.org/abs/2011.04946

(26)Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases

https://arxiv.org/abs/2106.09231

(27)EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets

https://arxiv.org/abs/2101.00063

(28)SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with Structured Semantics for Medical Text Mining

(29)Structural Guidance for Transformer Language Models

(30)MPC-BERT: A Pre-Trained Language Model for Multi-Party ConversationUnderstanding

https://arxiv.org/abs/2106.01541

(31)Language Model Evaluation Beyond Perplexity

https://arxiv.org/abs/2106.00085

(32)BERTGen: Multi-task Generation through BERT

https://arxiv.org/abs/2106.03484

(33)Pre-training Universal Language Representation

https://arxiv.org/abs/2105.14478

(34)Cascaded Head-colliding Attention

https://arxiv.org/abs/2105.14850

(35)Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks

https://arxiv.org/abs/2106.04489

(36)Accelerating BERT Inference for Sequence Labeling via Early-Exit

https://arxiv.org/abs/2105.13878

(37)AutoTinyBERT: Automatic Hyper-parameter Optimization for EfficientPre-trained Language Models

(38)Lexicon Enhanced Chinese Sequence Labeling Using BERT Adapter

https://arxiv.org/abs/2105.07148

(39)On the Effectiveness of Adapter-based Tuning for Pretrained LanguageModel Adaptation

https://arxiv.org/abs/2106.03164

(40)Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation

(41)Marginal Utility Diminishes: Exploring the Minimum Knowledge for BERT Knowledge Distillation

https://arxiv.org/abs/2106.05691

(42)Obtaining Better Static Word Embeddings UsingContextual Embedding Models

https://arxiv.org/abs/2106.04302

(43)Reflective Decoding: Beyond Unidirectional Generation withOff-the-Shelf Language Models

https://arxiv.org/abs/2010.08566

(44)Reservoir Transformers

https://arxiv.org/abs/2012.15045

(45)LexFit: Lexical Fine-Tuning of Pretrained Language Models

(46)Selecting Informative Contexts Improves Language Model Fine-tuning

https://arxiv.org/abs/2005.00175

(47)BERT is to NLP what AlexNet is to CV: Can Pre-Trained LanguageModels Identify Analogies?

https://arxiv.org/abs/2105.04949

(48)Examining the Inductive Bias of Neural Language Models with Artificial Languages

https://arxiv.org/abs/2106.01044

(49)An Empirical Study on Hyperparameter Optimization for Fine-TuningPre-trained Language Models

https://arxiv.org/abs/2106.09204

(50)BERTAC: Enhancing Transformer-based Language Models withAdversarially Pretrained Convolutional Neural Networks

(51)Enabling Lightweight Fine-tuning for Pre-trained Language ModelCompression based on Matrix Product Operators

https://arxiv.org/abs/2106.02205

(52)Length-Adaptive Transformer: Train Once with Length Drop, UseAnytime with Search

https://arxiv.org/abs/2010.07003

(53)H-Transformer-1D: Fast One-Dimensional Hierarchical Attention for Sequences

Short

(54)Hi-Transformer: Hierarchical Interactive Transformer for Efficient and Effective Long Document Modeling

https://arxiv.org/abs/2106.01040

(55)Is Sparse Attention more Interpretable?

https://arxiv.org/abs/2106.01087

(56)Learning to Generate Task-Specific Adapters from Task Description

https://arxiv.org/abs/2101.00420

(57)Thank you BART! Rewarding Pre-Trained Models Improves FormalityStyle Transfer

https://arxiv.org/abs/2105.06947?context=cs

(58)Pre-training is a Hot Topic: Contextualized Document EmbeddingsImprove Topic Coherence

https://arxiv.org/abs/2004.03974

 二、表征学习

Long

(1)DeCLUTR: Deep Contrastive Learning for Unsupervised TextualRepresentations

https://arxiv.org/abs/2006.03659(对比学习)

(2)Automated Concatenation of Embeddings for Structured Prediction

https://arxiv.org/abs/2010.05006

(3)Lightweight Cross-Lingual Sentence Representation Learning

https://arxiv.org/abs/2105.13856

(4)ConSERT: A Contrastive Framework for Self-Supervised SentenceRepresentation Transfer

https://arxiv.org/abs/2105.11741

(5)Dynamic Contextualized Word Embeddings

https://arxiv.org/abs/2010.12684

(6)Self-Guided Contrastive Learning for BERT Sentence Representations

https://arxiv.org/abs/2106.07345

(7)Bootstrapped Unsupervised Sentence Representation Learning

Short

(8)Attentive Multiview Text Representation for Differential Diagnosis

Short

(9)DefSent: Sentence Embeddings using Definition Sentences

https://arxiv.org/abs/2105.04339

三、问答及检索

Long

(1)Evaluating Evaluation Measures for Ordinal Classification and Ordinal Quantification

(2)Dual Reader-Parser on Hybrid Textual and Tabular Evidence for OpenDomain Question Answering

https://www.amazon.science/publications/dual-reader-parser-on-hybrid-textual-and-tabular-evidence-for-open-domain-question-answering

(3)Explanations for CommonsenseQA: New Dataset and Models

https://zenodo.org/record/4784281#.YNngJvkzZsY

(4)Answering Ambiguous Questions through Generative Evidence Fusion andRound-Trip Prediction

https://arxiv.org/abs/2011.13137

(5)Improving Document Representations by Generating Pseudo QueryEmbeddings for Dense Retrieval

https://arxiv.org/abs/2105.03599

(6)CoSQA: 20,000+ Web Queries for Code Search and Question Answering

https://arxiv.org/abs/2105.13239

(7)Coreference Reasoning in Machine Reading Comprehension

https://arxiv.org/pdf/2012.15573.pdf

(8)End-to-End Training of Neural Retrievers for Open-Domain QuestionAnswering

https://arxiv.org/abs/2101.00408

(9)Few-Shot Question Answering by Pretraining Span Selection

https://arxiv.org/abs/2101.00438

(10)Integrating Semantics and Neighborhood Information with Graph-DrivenGenerative Models for Document Retrieval

https://arxiv.org/abs/2105.13066

(11)Robustifying Multi-hop QA through Pseudo-Evidentiality Training

(12)Learning Dense Representations of Phrases at Scale

https://arxiv.org/abs/2012.12624

(13)Generation-Augmented Retrieval for Open-Domain Question Answering

https://arxiv.org/abs/2009.08553

(14)xMoCo: Cross Momentum Contrastive Learning for Open-Domain QuestionAnswering

(15)TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance

https://arxiv.org/abs/2105.07624

(16)A Semantic-based Method for Unsupervised Commonsense QuestionAnswering

https://arxiv.org/abs/2105.14781

(17)A Neural Model for Joint Document and Snippet Ranking in QuestionAnswering for Large Document Collections

https://arxiv.org/abs/2106.08908

(18)Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval

https://arxiv.org/abs/2010.11915

(19)Question Answering Over Temporal Knowledge Graphs

https://arxiv.org/abs/2106.01515

(20)Can Generative Pre-trained Language Models Serve as Knowledge Bases for Closed-book QA?

https://arxiv.org/abs/2106.01561

(21)Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting Previously Fact-Checked Claims

(22)UnitedQA: A Hybrid Approach for Open Domain Question Answering

https://arxiv.org/abs/2101.00178

(23)ForecastQA: A Question Answering Challenge for Event Forecasting with Temporal Text Data

https://arxiv.org/abs/2005.00792

(24)On the Efficacy of Adversarial Data Collection for QuestionAnswering: Results from a Large-Scale Randomized Study

https://arxiv.org/abs/2106.00872

(25)Multi-task Retrieval for Knowledge-Intensive Tasks

https://arxiv.org/abs/2101.00117

(26)Joint Models for Answer Verification in Question Answering Systems

(27)Which Linguist Invented the Lightbulb? Presupposition Verification for Question-Answering

https://arxiv.org/abs/2101.00391

(28)Modeling Transitions of Focal Entities for Conversational KnowledgeBase Question Answering

(29)A Mutual Information Maximization Approach for the Spurious SolutionProblem in Weakly Supervised Question Answering

https://arxiv.org/abs/2106.07174

(30)Learn to Resolve Conversational Dependency: A Consistency TrainingFramework for Conversational Question Answering

(31)Learning to Perturb Word Embeddings for Out-of-distribution QA

https://arxiv.org/abs/2105.02692

Short

(32)The Curse of Dense Low-Dimensional Information Retrieval for LargeIndex Sizes

https://arxiv.org/abs/2012.14210

(33)DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications

https://arxiv.org/abs/2004.11142

(34)Towards a more Robust Evaluation for Conversational QuestionAnswering

(35)Training Adaptive Computation for Open-DomainQuestion Answering with Computational Constraints

(36)Efficient Passage Retrieval with Hashing for Open-domain QuestionAnswering

https://arxiv.org/abs/2106.00882

(37)Using Adversarial Attacks to Reveal the Statistical Bias in MachineReading Comprehension Models

https://arxiv.org/abs/2105.11136

(38)VAULT: VAriable Unified Long Text Representation for Machine Reading Comprehension

https://arxiv.org/abs/2105.03229

(39)Towards more equitable question answering systems: How much more data do you need?

https://arxiv.org/abs/2105.14115

(40)A Semantics-aware Transformer Model of Relation Linking for Knowledge Base Question Answering

(41)Neural Retrieval for Question Answering with Cross-AttentionSupervised Data Augmentation

https://arxiv.org/abs/2009.13815

(42)Addressing Semantic Drift in Generative Question Answering with Auxiliary Extraction

 四、文本生成

Long

(1)Generalising Multilingual Concept-to-Text NLG with Language AgnosticDelexicalisation

https://arxiv.org/abs/2105.03432

(2)Prefix-Tuning: Optimizing Continuous Prompts for Generation

https://arxiv.org/abs/2101.00190

(3)Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models

https://arxiv.org/abs/2101.00288

(4)Competence-based Multimodal Curriculum Learning for Medical ReportGeneration

(5)BACO: A Background Knowledge- and Content-Based Framework for CitingSentence Generation

(6)Mention Flags (MF): Constraining Transformer-based Text Generators

(7)Guiding the Growth: Difficulty-Controllable Question Generationthrough Step-by-Step Rewriting

https://arxiv.org/abs/2105.11698

(8)Improving Encoder by Auxiliary Supervision Tasks for Table-to-TextGeneration

(9)Writing by Memorizing:Hierarchical Retrieval-based Medical Report Generation

https://arxiv.org/abs/2106.06471

(10)Data Augmentation for TextGeneration Without Any Augmented Data

https://arxiv.org/abs/2105.13650

(11)Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence

https://arxiv.org/abs/2105.08963

(12)PENS: A Dataset and GenericFramework for Personalized News Headline Generation

https://www.microsoft.com/en-us/research/uploads/prod/2021/06/ACL2021_PENS_Camera_Ready_1862_Paper.pdf

(13)De-Confounded VariationalEncoder-Decoder for Logical Table-to-Text Generation

(14)Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation

https://arxiv.org/abs/2106.06125

(15)Employing ArgumentationKnowledge Graphs for Neural Argument Generation

(16)Select, Extract and Generate: Neural Keyphrase Generation with Layer-wise Coverage Attention

https://arxiv.org/abs/2008.01739

(17)DESCGEN: A DistantlySupervised Datasetfor Generating Entity Descriptions

https://arxiv.org/abs/2106.05365

(18)GTM: A Generative Triple-wise Model for Conversational Question Generation

https://arxiv.org/abs/2106.03635

(19)All That’s ‘Human’ Is NotGold: Evaluating Human Evaluation of Generated Text

(20)A Hierarchical VAE for Calibrating Attributes while Generating Text using Normalizing Flow

(21)DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Text Generation

https://arxiv.org/abs/2106.00791

(22)Controllable Open-ended Question Generation with A New Question Type Ontology

https://web.eecs.umich.edu/~wangluxy/papers/ACL2021_cao_wang.pdf

(23)DExperts: Decoding-TimeControlled Text Generation with Experts and Anti-Experts

https://arxiv.org/abs/2105.03023

(24)Towards Table-to-TextGeneration with Numerical Reasoning

(25)TGEA: An Error-AnnotatedDataset and Benchmark Tasks for TextGeneration from Pretrained Language Models

Short

(26)On Training InstanceSelection for Few-Shot Neural Text Generation

(27)How Helpful is InverseReinforcement Learning for Table-to-Text Generation?

(28)QuestionGeneration for Adaptive Education

https://arxiv.org/abs/2106.04262

(29)Avoiding Overlap in DataAugmentation for AMR-to-Text Generation

五、摘要

Long

(1)Cross-Lingual Abstractive Summarization with Limited ParallelResources

https://arxiv.org/abs/2105.13648

(2)Unsupervised Extractive Summarization-Based Representations for Accurate and Explainable Collaborative Filtering

(3)Improving Factual Consistency of Abstractive Summarization viaQuestion Answering

https://arxiv.org/abs/2105.04623

(4)Long-Span Summarization via Local Attention and Content Selection

https://arxiv.org/abs/2105.03801

(5)RepSum: Unsupervised Dialogue Summarization based on ReplacementStrategy

(6)TWAG: A Topic-Guided Wikipedia Abstract Generator

(7)Language Model as an Annotator: Exploring DialoGPT for DialogueSummarization

https://arxiv.org/abs/2105.12544

(8)BASS: Boosting Abstractive Summarization with Unified Semantic Graph

https://arxiv.org/abs/2105.12041

(9)Focus Attention: Promoting Faithfulness and Diversity inSummarization

https://arxiv.org/abs/2105.11921

(10)Deep Differential Amplifier for Extractive Summarization

(11)Generating Query Focused Summaries from Query-Free Resources

https://arxiv.org/abs/2012.14774

(12)PASS: Perturb-and-Select Summarizer for Product Reviews

https://www.amazon.science/publications/pass-perturb-and-select-summarizer-for-product-reviews

(13)ConvoSumm: Conversation Summarization Benchmark and ImprovedAbstractive Summarization with Argument Mining

https://arxiv.org/abs/2106.00829

(14)Multi-TimeLine Summarization (MTLS): Improving TimelineSummarization by Generating Multiple Summaries

(15)EmailSum: Abstractive Email Thread Summarization

(16)Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution

https://arxiv.org/abs/2106.01518

(17)A Training-free and Reference-free Summarization Evaluation Metricvia Centrality-weighted Relevance and Self-referenced Redundancy

https://arxiv.org/abs/2106.13945

(18)Generating SOAP Notes from Doctor-Patient Conversations UsingModular Summarization Techniques

https://arxiv.org/abs/2005.01795

Short

(19)WikiSum: Coherent Summarization Dataset for Efficient Human-Evaluation

https://registry.opendata.aws/wikisum/

(20)Bringing Structure into Summaries: a Faceted Summarization Datasetfor Long Scientific Documents

https://arxiv.org/abs/2106.00130

(21)Reinforcement Learning for Abstractive Question Summarization with Question-aware Semantic Rewards

(22)Demoting the Lead Bias in News Summarization via AlternatingAdversarial Learning

https://arxiv.org/abs/2105.14241

(23)SimCLS: A Simple Framework for Contrastive Learning of AbstractiveSummarization

https://arxiv.org/abs/2106.01890

六、小样本学习

Long

(1)Meta-KD: A Meta Knowledge Distillation Framework for Language ModelCompression across Domains

https://arxiv.org/abs/2012.01266

(2)Multi-Label Few-Shot Learning for Aspect Category Detection

https://arxiv.org/abs/2105.14174

(3)ProtAugment: Intent Detection Meta-Learning through UnsupervisedDiverse Paraphrasing

https://arxiv.org/abs/2105.12995

(4)Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision

https://arxiv.org/abs/2012.14862

(5)AugNLG: Few-shot Natural Language Generation using Self-trained DataAugmentation

https://arxiv.org/abs/2106.05589

(6)A Pre-training Strategy for Zero-Resource Response Selection in Knowledge-Grounded Conversations

(7)Evaluatingmorphological typology in zero-shot cross-lingual transfer

(8)LexiconLearning for Few Shot Sequence Modeling

https://arxiv.org/abs/2106.03993

(9)To POS Tagor Not to POS Tag: The Impact of POS Tags on Morphological Learning in Low-Resource Settings

(10)Meta-Learningto Compositionally Generalize

https://arxiv.org/abs/2106.04252

(11)RiskMinimization for Zero-shot Sequence Labeling

http://faculty.sist.shanghaitech.edu.cn/faculty/tukw/acl21rm.pdf

Short

(12)QA-DrivenZero-shot Slot Filling with Weak Supervision Pretraining

(13)Zero-shotFact Verification by Claim Generation

https://arxiv.org/abs/2105.14682

(14)DistinctLabel Representations for Few-Shot Text Classification

(15)Zero-shot Event Extraction via Transfer Learning: Challenges andInsights

(16)Issues with Entailment-based Zero-shot Text Classification

七、对话

Long

(1)TicketTalk: Toward human-level performance with end-to-end,transaction-based dialog systems

https://arxiv.org/abs/2012.12458

(2)SocAoG: Incremental Graph Parsing for Social Relation Inference inDialogues

https://arxiv.org/abs/2106.01006

(3)HERALD: An Annotation Efficient Method to Detect User Disengagementin Social Conversations

https://arxiv.org/abs/2106.00162

(4)Comprehensive Study: How the Context Information of DifferentGranularity Affects Dialogue State Tracking?

https://arxiv.org/abs/2105.03571

(5)Discovering Dialog Structure Graph for Coherent Dialog Generation

(6)Dialogue Response Selection with Hierarchical Curriculum Learning

https://arxiv.org/abs/2012.14756

(7)Diversifying Dialog Generation via Adaptive Label Smoothing

https://arxiv.org/abs/2105.14556

(8)BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data

https://arxiv.org/abs/2106.06169

(9)I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling

https://arxiv.org/abs/2012.13391

(10)Towards Quantifiable Dialogue Coherence Evaluation

https://arxiv.org/abs/2106.00507

(11)A Sequence-to-Sequence Approach to Dialogue State Tracking

https://arxiv.org/abs/2011.09553

(12)Dual Slot Selector via Local Reliability Verification for DialogueState Tracking

(13)Learning from Perturbations: Diverse and Informative DialogueGeneration with Inverse Adversarial Training

https://arxiv.org/abs/2105.15171

(14)Novel Slot Detection: A Benchmark for Discovering Unknown Slot Typesin the Task-Oriented Dialogue System

https://arxiv.org/abs/2105.14313

(15)RADDLE: An Evaluation Benchmark and Analysis Platform for RobustTask-oriented Dialog Systems

https://arxiv.org/abs/2012.14666

(16)Learning to Ask Conversational Questions by Optimizing LevenshteinDistance

(17)Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances https://arxiv.org/abs/2106.02227

(18)Semantic Representation for Dialogue Modeling

https://arxiv.org/abs/2105.10188

(19)Towards Emotional Support Dialog Systems

https://arxiv.org/abs/2106.01144

(20)Discovering Dialogue Slots with Weak Supervision

(21)Structural Pre-training for Dialogue Comprehension

https://arxiv.org/abs/2105.10956

(22)Transferable Dialogue Systems and User Simulators

(23)Improving Dialog Systems for Negotiation with Personality Modeling

(24)TIMEDIAL: Temporal Commonsense Reasoning in Dialog

https://arxiv.org/abs/2106.04571

(25)Increasing Faithfulness in Knowledge-Grounded Dialogue with Controllable Features

(26)GL-GIN: Fast and Accurate Non-Autoregressive Model for JointMultiple Intent Detection and Slot Filling

https://arxiv.org/abs/2106.01925

(27)DynaEval: Unifying Turn and Dialogue Level Evaluation

https://arxiv.org/abs/2106.01112

Short

(28)Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries

https://arxiv.org/abs/2012.01873

(29)Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialogue State Tracking

https://arxiv.org/abs/2106.00291

(30)Continual Learning for Task-oriented Dialogue System with IterativeNetwork Pruning, Expanding and Masking

(31)Domain-Adaptive Pretraining Methods for Dialogue Understanding

https://arxiv.org/abs/2105.13665

(32)PRAL: A Tailored Pre-Training Model for Task-Oriented DialogGeneration

https://arxiv.org/abs/2004.13835

 八、情感或情绪分析

Long

(1)Dual Graph Convolutional Networks for Aspect-based SentimentAnalysis

(2)Directed Acyclic Graph Network for Conversational EmotionRecognition

https://arxiv.org/abs/2105.12907

(3)DynaSent: A Dynamic Benchmark for Sentiment Analysis

https://arxiv.org/abs/2012.15349

(4)Position Bias Mitigation: A Knowledge-Aware Graph Model for EmotionCause Extraction

https://arxiv.org/abs/2106.03518

(5)Topic-Driven and Knowledge-Aware Transformer for Dialogue EmotionDetection

https://arxiv.org/abs/2106.01071

(6)Distributed Representations of Emotion Categories in Emotion Space

(7)DialogueCRN: Contextual Reasoning Networks for Emotion Recognition in Conversations

https://arxiv.org/abs/2106.01978

(8)Missing Modality Imagination Network for Emotion Recognition with Uncertain Missing Modalities

(9)A Unified Generative Framework for Aspect-based Sentiment Analysis

https://arxiv.org/abs/2106.04300

(10)Exploring the Efficacy of Automatically Generated Counterfactualsfor Sentiment Analysis

(11)Structured Sentiment Analysis as Dependency Graph Parsing

https://arxiv.org/abs/2105.14504

(12)Aspect-Category-Opinion-Sentiment Quadruple Extraction with ImplicitAspects and Opinions

Short

(13)Deep Context- and Relation-Aware Learning for Aspect-based SentimentAnalysis

https://arxiv.org/abs/2106.03806

(14)Towards Generative Aspect-Based Sentiment Analysis

(15)eMLM: A New Pre-training Objective for Emotion Related Tasks

九、信息抽取

Long

(1)Named Entity Recognition with Small Strongly Labeled and LargeWeakly Labeled Data

https://arxiv.org/abs/2106.08977

(2)Competence-based Multimodal Curriculum Learning for Medical ReportGeneration

https://arxiv.org/abs/2105.06804

(3)OntoED: Low-resource Event Detection with Ontology Embedding

https://arxiv.org/abs/2105.10922

(4)Subsequence Based Deep Active Learning for Named Entity Recognition

(5)BERTifying the Hidden Markov Model for Multi-Source WeaklySupervised Named Entity Recognition

https://arxiv.org/abs/2105.12848

(6)Knowledge-Enriched Event Causality Identification via LatentStructure Induction Networks

(7)Document-level Event Extraction via Heterogeneous Graph-based Interaction Model with a Tracker

https://arxiv.org/abs/2105.14924

(8)A Large-Scale Chinese Multimodal NER Dataset with Speech Clues

(9)LearnDA: Learnable Knowledge-Guided Data Augmentation for EventCausality Identification

https://arxiv.org/abs/2106.01649

(10)CIL: Contrastive Instance Learning Framework for DistantlySupervised Relation Extraction

https://arxiv.org/abs/2106.10855

(11)Few-NERD: A Few-shot Named Entity Recognition Dataset

https://arxiv.org/abs/2105.07464

(12)SENT: Sentence-level Distant Relation Extraction via NegativeTraining

https://arxiv.org/abs/2106.11566?context=cs

(13)Modularized Interaction Network for Named Entity Recognition

(14)Capturing Event Argument Interaction via A Bi-DirectionalEntity-Level Recurrent Decoder

(15)A Span-Based Model for Joint Overlapped and Discontinuous NamedEntity Recognition

(16)An End-to-End Progressive Multi-Task Learning Framework for MedicalNamed Entity Recognition and Normalization

(17)MLBiNet: A Cross-Sentence Collective Event Detection Network

https://arxiv.org/abs/2105.09458

(18)PRGC: Potential Relation and Global Correspondence Based JointRelational Triple Extraction

https://arxiv.org/abs/2106.09895

(20)Improving Named Entity Recognition by External Context Retrievingand Cooperative Learning

https://arxiv.org/abs/2105.03654

(21)Leveraging Type Descriptions for Zero-shot Named Entity Recognition and Classification

(22)Revisiting the Negative Data of Distantly Supervised RelationExtraction

https://arxiv.org/abs/2105.10158

(23)Learning from Miscellaneous Other-Class Words for Few-shot NamedEntity Recognition

(24)Joint Biomedical Entity and Relation Extraction with Knowledge-Enhanced Collective Inference

https://arxiv.org/abs/2105.13456

(25)Nested Named Entity Recognition via Explicitly Excluding the Influence of the Best Path

(27)How Knowledge Graph and Attention Help? A Qualitative Analysis into Bag-level Relation Extraction

(28)From Discourse to Narrative: Knowledge Projection for Event RelationExtraction

https://arxiv.org/abs/2106.08629

(29)Fine-grained Information Extraction from Biomedical Literature based on Knowledge-enriched Abstract Meaning Representation

(30)A Unified Generative Framework for Various NER Subtasks

https://arxiv.org/abs/2106.01223

(31)MECT: Multi-Metadata Embedding based Cross-Transformer for ChineseNamed Entity Recognition

(32)Unleash GPT-2 Power for Event Detection

(33)Trigger is Not Sufficient: Exploiting Frame-aware Knowledge for Implicit Event Argument Extraction

(34)Element Intervention for Open Relation Extraction

https://arxiv.org/abs/2106.09558

(35)Text2Event: Controllable Sequence-to-Structure Generation forEnd-to-end Event Extraction

https://arxiv.org/abs/2106.09232

(36)CLEVE: Contrastive Pre-training for Event Extraction

https://arxiv.org/abs/2105.14485

(37)MulDA: A Multilingual Data Augmentation Framework for Low-ResourceCross-Lingual NER

https://raihanjoty.github.io/papers/linlin-et-al-acl-21.html

(38)De-biasing Distantly Supervised Named Entity Recognition via CausalIntervention

https://arxiv.org/abs/2106.09233

(39)UniRE: A Unified Label Space for Entity Relation Extraction

(40)Crowdsourcing Learning as Domain Adaptation: A Case Study on NamedEntity Recognition

https://arxiv.org/abs/2105.14980

(41)Modeling Fine-Grained Entity Types with Box Embeddings

https://arxiv.org/abs/2101.00345

(42)CoRI: Collective Relation Integration with Data Augmentation for Open Information Extraction

https://arxiv.org/abs/2106.00793

(43)CitationIE: Leveraging the Citation Graph for Scientific InformationExtraction

https://arxiv.org/abs/2106.01560

(44)Dependency-driven Relation Extraction with Attentive GraphConvolutional Networks

(45)Discontinuous Named Entity Recognition as Maximal Clique Discovery

https://arxiv.org/abs/2106.00218

(46)Weakly Supervised Named Entity Tagging with Learnable Logical Rules

(47)SpanNER: Named Entity Re-/Recognition as Span Prediction

https://arxiv.org/abs/2106.00641

(48)Refining Sample Embeddings with Relation Prototypes to EnhanceContinual Relation Extraction

https://www.researchgate.net/publication/352257560_Refining_Sample_Embeddings_with_Relation_Prototypes_to_Enhance_Continual_Relation_Extraction

(49)Document-level Event Extraction via Parallel Prediction Networks

(50)Learning Span-Level Interactions for Aspect Sentiment TripletExtraction

(51)The Possible, the Plausible, and the Desirable: Event-Based ModalityDetection for Language Processing

https://arxiv.org/abs/2106.08037

(52)A Neural Transition-based Joint Model for Disease Named EntityRecognition and Normalization

Short

(53)TIMERS: Document-level Temporal Relation Extraction

(54)ROPE: Reading Order Equivariant Positional Encoding for Graph-based Document Information Extraction

https://arxiv.org/abs/2106.10786

(55)Enhancing Entity Boundary Detection for Better Chinese Named EntityRecognition

(56)Entity Enhancement for Implicit Discourse Relation Classification in the Biomedical Domain

(57)Entity Concept-enhanced Few-shot Relation Extraction

https://arxiv.org/abs/2106.02401

(58)Improving Model Generalization: A Chinese Named Entity RecognitionCase Study

(59)Explicitly Capturing Relations between Entity Mentions via GraphNeural Networks for Domain-specific Named Entity Recognition

(60)Three Sentences Are All You Need: Local Path Enhanced DocumentRelation Extraction

https://arxiv.org/abs/2106.01793

十、 其他

(1)Semi-Supervised Text Classification with Balanced DeepRepresentation Distributions

(2)Defense against Adversarial Attacks in NLP via DirichletNeighborhood Ensemble

https://arxiv.org/abs/2006.11627

(3)Inter-GPS: Interpretable Geometry Problem Solving with FormalLanguage and Symbolic Reasoning

https://arxiv.org/abs/2105.04165

(4)Improving the Faithfulness of Attention-based Explanations with Task-specific Information for Text Classification

https://arxiv.org/abs/2105.02657

(5)Concept-Based Label Embedding via Dynamic Routing for HierarchicalText Classification

(6)Joint Verification and Reranking for Open Fact Checking Over Tables

https://arxiv.org/abs/2012.15115

(7)Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor

https://arxiv.org/abs/2010.05010

(8)UnNatural Language Inference

https://arxiv.org/abs/2101.00010

(9)OoMMix: Out-of-manifold Regularization in Contextual Embedding Space for Text Classification

https://arxiv.org/abs/2105.06750

(10)Database Reasoning Over Text

https://arxiv.org/abs/2106.01074

(11)Towards Robustness of Text-to-SQL Models against SynonymSubstitution

https://arxiv.org/abs/2106.01065

(12)Determinantal Beam Search

https://arxiv.org/abs/2106.07400

(13)POS-Constrained Parallel Decoding for Non-autoregressive Generation

(14)Hierarchy-aware Label Semantics Matching Network for HierarchicalText Classification

(15)Multi-View Cross-Lingual Structured Prediction with Minimum Supervision

http://faculty.sist.shanghaitech.edu.cn/faculty/tukw/acl21mv.pdf

(16)Chase: A Large-Scale and Pragmatic Chinese Dataset for Cross-Database Context-Dependent Text-to-SQL

(17)Factoring Statutory Reasoning as Language Understanding Challenges

https://arxiv.org/abs/2105.07903

(18)HiddenCut: Simple Data Augmentation for Natural LanguageUnderstanding with Better Generalization

https://arxiv.org/abs/2106.00149

(19)KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers

https://arxiv.org/abs/2106.11455

(20)Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism

(21)Alignment Rationale for Natural Language Inference

你可能感兴趣的:(深度学习,自然语言处理,深度学习,机器学习)