BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation ...
DenoisingSequence-to-SequencePre-trainingforNaturalLanguageGeneration,Translation,andComprehensionAbstract本文提出BART(BidirectionalandAuto-RegressiveTransformers),一个用于预训练seq2seq