Weekly New Papers | 2020, Week 10 | Natural Language Processing

"Weekly New Papers" series, week 10 of 2020: natural language processing

Highlights this week:

  • Microsoft: [29], [32], [43], [59], [63]
  • Amazon: [14]
  • Google: [5], [13]
  • Others: [18], [31], [33], [34], [35], [40], [45], [47], [50]

March 6, 2020

[1]. An Empirical Accuracy Law for Sequential Machine Translation: the Case of Google Translate
Link | https://arxiv.org/abs/2003.02817
Authors | Lucas Nunes Sequeira, Bruno Moreschi, Fabio Gagliardi Cozman, Bernardo Fontes

[2]. HypoNLI: Exploring the Artificial Patterns of Hypothesis-only Bias in Natural Language Inference
Link | https://arxiv.org/abs/2003.02756
Authors | Tianyu Liu, Xin Zheng, Baobao Chang, Zhifang Sui
Affiliations | Peking University; Peng Cheng Laboratory; Beijing University of Posts and Telecommunications
Notes | LREC 2020

[3]. Zero-Shot Cross-Lingual Transfer with Meta Learning
Link | https://arxiv.org/abs/2003.02739
Authors | Farhad Nooralahzadeh, Giannis Bekoulis, Johannes Bjerva, Isabelle Augenstein

[4]. Fact Check-Worthiness Detection as Positive Unlabelled Learning
Link | https://arxiv.org/abs/2003.02736
Authors | Dustin Wright, Isabelle Augenstein

[5]. SentenceMIM: A Latent Variable Language Model
Link | https://arxiv.org/abs/2003.02645
Authors | Micha Livne, Kevin Swersky, David J. Fleet
Affiliations | University of Toronto; Vector Institute; Google Research, Toronto

[6]. RecipeGPT: Generative Pre-training Based Cooking Recipe Generation and Evaluation System
Link | https://arxiv.org/abs/2003.02498
Authors | Helena H. Lee, Ke Shu, Palakorn Achananuparp, Philips Kokoh Prasetyo, Yue Liu, Ee-Peng Lim, Lav R. Varshney
Affiliations | Singapore Management University; University of Illinois at Urbana-Champaign

[7]. Kleister: A novel task for Information Extraction involving Long Documents with Complex Layout
Link | https://arxiv.org/abs/2003.02356
Authors | Filip Graliński, Tomasz Stanisławek, Anna Wróblewska, Dawid Lipiński, Agnieszka Kaliska, Paulina Rosalska, Bartosz Topolski, Przemysław Biecek

[8]. A Study on Efficiency, Accuracy and Document Structure for Answer Sentence Selection
Link | https://arxiv.org/abs/2003.02349
Authors | Daniele Bonadiman, Alessandro Moschitti
Affiliations | Amazon Alexa

[9]. BERT as a Teacher: Contextual Embeddings for Sequence-Level Reward
Link | https://arxiv.org/abs/2003.02738
Authors | Florian Schmidt, Thomas Hofmann

[10]. Phase transitions in a decentralized graph-based approach to human language
Link | https://arxiv.org/abs/2003.02639
Authors | Javier Vera, Felipe Urbina, Wenceslao Palma

[11]. An Incremental Explanation of Inference in Hybrid Bayesian Networks for Increasing Model Trustworthiness and Supporting Clinical Decision Making
Link | https://arxiv.org/abs/2003.02599
Authors | Evangelia Kyrimi, Somayyeh Mossadegh, Nigel Tai, William Marsh

[12]. Real-time, Universal, and Robust Adversarial Attacks Against Speaker Recognition Systems
Link | https://arxiv.org/abs/2003.02301
Authors | Yi Xie, Cong Shi, Zhuohang Li, Jian Liu, Yingying Chen, Bo Yuan
Affiliations | Rutgers University

March 5, 2020

[13]. jiant: A Software Toolkit for Research on General-Purpose Text Understanding Models
Link | https://arxiv.org/abs/2003.02249
Authors | Yada Pruksachatkun, Phil Yeres, Haokun Liu, Jason Phang, Phu Mon Htut, Alex Wang, Ian Tenney, Samuel R. Bowman
Affiliations | New York University; Google Research

[14]. Data Augmentation using Pre-trained Transformer Models
Link | https://arxiv.org/abs/2003.02245
Authors | Varun Kumar, Ashutosh Choudhary, Eunah Cho
Affiliations | Amazon

[15]. Unsupervised Adversarial Domain Adaptation for Implicit Discourse Relation Classification
Link | https://arxiv.org/abs/2003.02244
Authors | Hsin-Ping Huang, Junyi Jessy Li
Affiliations | The University of Texas at Austin
Notes | CoNLL 2019

[16]. Evaluating Low-Resource Machine Translation between Chinese and Vietnamese with Back-Translation
Link | https://arxiv.org/abs/2003.02197
Authors | Hongzheng Li, Heyan Huang
Affiliations | Beijing Institute of Technology

[17]. Sequential Neural Networks for Noetic End-to-End Response Selection
Link | https://arxiv.org/abs/2003.02126
Authors | Qian Chen, Wen Wang
Affiliations | Alibaba Group

[18]. Posterior-GAN: Towards Informative and Coherent Response Generation with Posterior Generative Adversarial Network
Link | https://arxiv.org/abs/2003.02020
Authors | Shaoxiong Feng, Hongshen Chen, Kan Li, Dawei Yin
Affiliations | Beijing Institute of Technology; JD.com
Notes | Accepted by AAAI 2020

[19]. Restoration of Fragmentary Babylonian Texts Using Recurrent Neural Networks
Link | https://arxiv.org/abs/2003.01912
Authors | Ethan Fetaya, Yonatan Lifshitz, Elad Aaron, Shai Gordin

[20]. SeMemNN: A Semantic Matrix-Based Memory Neural Network for Text Classification
Link | https://arxiv.org/abs/2003.01857
Authors | Changzeng Fu, Chaoran Liu, Carlos Toshinori Ishi, Yuichiro Yoshikawa, Hiroshi Ishiguro

[21]. HyperEmbed: Tradeoffs Between Resources and Performance in NLP Tasks with Hyperdimensional Computing enabled Embedding of n-gram Statistics
Link | https://arxiv.org/abs/2003.01821
Authors | Pedro Alonso, Kumar Shridhar, Denis Kleyko, Evgeny Osipov, Marcus Liwicki

[22]. AlignTTS: Efficient Feed-Forward Text-to-Speech System without Explicit Alignment
Link | https://arxiv.org/abs/2003.01950
Authors | Zhen Zeng, Jianzong Wang, Ning Cheng, Tian Xia, Jing Xiao
Affiliations | Ping An Technology
Notes | To be presented at ICASSP 2020

[23]. GraphTTS: graph-to-sequence modelling in neural text-to-speech
Link | https://arxiv.org/abs/2003.01924
Authors | Aolan Sun, Jianzong Wang, Ning Cheng, Huayi Peng, Zhen Zeng, Jing Xiao
Affiliations | Ping An Technology
Notes | Accepted to ICASSP 2020

[24]. On Emergent Communication in Competitive Multi-Agent Teams
Link | https://arxiv.org/abs/2003.01848
Authors | Paul Pu Liang, Jeffrey Chen, Ruslan Salakhutdinov, Louis-Philippe Morency, Satwik Kottur
Affiliations | Carnegie Mellon University
Notes | AAMAS 2020

[25]. Discover Your Social Identity from What You Tweet: a Content Based Approach
Link | https://arxiv.org/abs/2003.01797
Authors | Binxuan Huang, Kathleen M. Carley
Affiliations | Carnegie Mellon University

[26]. Untangling in Invariant Speech Recognition
Link | https://arxiv.org/abs/2003.01787
Authors | Cory Stephenson, Jenelle Feather, Suchismita Padhy, Oguz Elibol, Hanlin Tang, Josh McDermott, SueYeon Chung
Affiliations | Intel AI Lab; MIT; Columbia University
Notes | Advances in Neural Information Processing Systems, 2019

[27]. Phonetic Feedback for Speech Enhancement With and Without Parallel Speech Data
Link | https://arxiv.org/abs/2003.01769
Authors | Peter Plantinga, Deblin Bagchi, Eric Fosler-Lussier
Affiliations | The Ohio State University
Notes | 4 pages + 1 page for references, accepted to ICASSP 2020

[28]. Towards Real-time Mispronunciation Detection in Kids’ Speech
Link | https://arxiv.org/abs/2003.01765
Authors | Peter Plantinga, Eric Fosler-Lussier
Affiliations | The Ohio State University
Notes | 6 pages + 1 page for references, accepted at ASRU 2019

March 4, 2020

[29]. Hybrid Generative-Retrieval Transformers for Dialogue Domain Adaptation
Link | https://arxiv.org/abs/2003.01680
Authors | Igor Shalyminov, Alessandro Sordoni, Adam Atkinson, Hannes Schulz
Affiliations | Microsoft Research
Notes | Presented at DSTC8@AAAI 2020

[30]. Improving Uyghur ASR systems with decoders using morpheme-based language models
Link | https://arxiv.org/abs/2003.01509
Authors | Zicheng Qiu, Wei Jiang, Turghunjan Mamut

[31]. Multi-Task Learning Network for Emotion Recognition in Conversation
Link | https://arxiv.org/abs/2003.01478
Authors | Jingye Li, Meishan Zhang, Donghong Ji, Yijiang Liu
Affiliations | Wuhan University; Tianjin University

[32]. XGPT: Cross-modal Generative Pre-Training for Image Captioning
Link | https://arxiv.org/abs/2003.01473
Authors | Qiaolin Xia, Haoyang Huang, Nan Duan, Dongdong Zhang, Lei Ji, Zhifang Sui, Edward Cui, Taroon Bharti, Ming Zhou
Affiliations | Peking University; Microsoft Research Asia

[33]. Meta-Embeddings Based On Self-Attention
Link | https://arxiv.org/abs/2003.01371
Authors | Qichen Li, Xiaoke Jiang, Jun Xia, Jian Li
Affiliations | SenseTime; Tsinghua University

[34]. CLUECorpus2020: A Large-scale Chinese Corpus for Pre-training Language Model
Link | https://arxiv.org/abs/2003.01355
Authors | Liang Xu, Xuanwei Zhang, Qianqian Dong

[35]. Improving Candidate Generation for Low-resource Cross-lingual Entity Linking
Link | https://arxiv.org/abs/2003.01343
Authors | Shuyan Zhou, Shruti Rijhwani, John Wieting, Jaime Carbonell, Graham Neubig
Affiliations | Carnegie Mellon University
Notes | Accepted to TACL 2020

[36]. Controllable Time-Delay Transformer for Real-Time Punctuation Prediction and Disfluency Detection
Link | https://arxiv.org/abs/2003.01309
Authors | Qian Chen, Mengzhe Chen, Bo Li, Wen Wang
Affiliations | Alibaba Group
Notes | 4 pages, 2 figures, accepted by ICASSP 2020

[37]. Transfer Learning for Context-Aware Spoken Language Understanding
Link | https://arxiv.org/abs/2003.01305
Authors | Qian Chen, Zhu Zhuo, Wen Wang, Qiuyun Xu
Affiliations | Alibaba Group
Notes | 6 pages, 3 figures, ASRU 2019

[38]. Med7: a transferable clinical natural language processing model for electronic health records
Link | https://arxiv.org/abs/2003.01271
Authors | Andrey Kormilitzin, Nemanja Vaci, Qiang Liu, Alejo Nevado-Holgado

[39]. Understanding the Prediction Mechanism of Sentiments by XAI Visualization
Link | https://arxiv.org/abs/2003.01425
Authors | Chaehan So
Notes | Author’s pre-final version, to be published in the proceedings of the 4th International Conference on Natural Language Processing and Information Retrieval, Sejong, South Korea, 26-28 June 2020, ACM

[40]. Hierarchical Context Enhanced Multi-Domain Dialogue System for Multi-domain Task Completion
Link | https://arxiv.org/abs/2003.01338
Authors | Jingyuan Yang, Guang Liu, Yuzhao Mao, Zhiwei Zhao, Weiguo Gao, Xuan Li, Haiqin Yang, Jianping Shen
Affiliations | Ping An Technology
Notes | Presented at DSTC workshop, AAAI 2020

March 3, 2020

[41]. Gated Mechanism for Attention Based Multimodal Sentiment Analysis
Link | https://arxiv.org/abs/2003.01043
Authors | Ayush Kumar, Jithendra Vepa
Notes | Accepted to appear in ICASSP 2020

[42]. Identification of primary and collateral tracks in stuttered speech
Link | https://arxiv.org/abs/2003.01018
Authors | Rachid Riad, Anne-Catherine Bachoud-Lévi, Frank Rudzicz, Emmanuel Dupoux
Notes | To be published in LREC 2020

[43]. Multi-View Learning for Vision-and-Language Navigation
Link | https://arxiv.org/abs/2003.00857
Authors | Qiaolin Xia, Xiujun Li, Chunyuan Li, Yonatan Bisk, Zhifang Sui, Yejin Choi, Noah A. Smith
Affiliations | University of Washington; Peking University; Microsoft Research

[44]. PhoBERT: Pre-trained language models for Vietnamese
Link | https://arxiv.org/abs/2003.00744
Authors | Dat Quoc Nguyen, Anh Tuan Nguyen

[45]. Style Example-Guided Text Generation using Generative Adversarial Transformers
Link | https://arxiv.org/abs/2003.00674
Authors | Kuo-Hao Zeng, Mohammad Shoeybi, Ming-Yu Liu
Affiliations | NVIDIA

[46]. Learning from Easy to Complex: Adaptive Multi-curricula Learning for Neural Dialogue Generation
Link | https://arxiv.org/abs/2003.00639
Authors | Hengyi Cai, Hongshen Chen, Cheng Zhang, Yonghao Song, Xiaofang Zhao, Yangxi Li, Dongsheng Duan, Dawei Yin
Affiliations | Chinese Academy of Sciences
Notes | AAAI 2020

[47]. StructSum: Incorporating Latent and Explicit Sentence Dependencies for Single Document Summarization
Link | https://arxiv.org/abs/2003.00576
Authors | Vidhisha Balachandran, Artidoro Pagnoni, Jay Yoon Lee, Dheeraj Rajagopal, Jaime Carbonell, Yulia Tsvetkov
Affiliations | Carnegie Mellon University

[48]. Clinical Text Summarization with Syntax-Based Negation and Semantic Concept Identification
Link | https://arxiv.org/abs/2003.00353
Authors | Wei-Hung Weng, Yu-An Chung, Schrasing Tong
Affiliations | MIT

[49]. Voice trigger detection from LVCSR hypothesis lattices using bidirectional lattice recurrent neural networks
Link | https://arxiv.org/abs/2003.00304
Authors | Woojay Jeon, Leo Liu, Henry Mason
Affiliations | Apple
Notes | Presented at IEEE ICASSP, May 2019

[50]. Depth-Adaptive Graph Recurrent Network for Text Classification
Link | https://arxiv.org/abs/2003.00166
Authors | Yijin Liu, Fandong Meng, Yufeng Chen, Jinan Xu, Jie Zhou
Affiliations | Beijing Jiaotong University; Tencent

[51]. AraBERT: Transformer-based Model for Arabic Language Understanding
Link | https://arxiv.org/abs/2003.00104
Authors | Wissam Antoun, Fady Baly, Hazem Hajj

[52]. The STEM-ECR Dataset: Grounding Scientific Entity References in STEM Scholarly Content to Authoritative Encyclopedic and Lexicographic Sources
Link | https://arxiv.org/abs/2003.01006
Authors | Jennifer D’Souza, Anett Hoppe, Arthur Brack, Mohamad Yaser Jaradeh, Sören Auer, Ralph Ewerth
Notes | To appear in LREC 2020 proceedings. 11 pages, 6 figures

[53]. Pathological speech detection using x-vector embeddings
Link | https://arxiv.org/abs/2003.00864
Authors | Catarina Botelho, Francisco Teixeira, Thomas Rolland, Alberto Abad, Isabel Trancoso
Notes | Submitted to EUSIPCO 2020

[54]. Long Short-Term Sample Distillation
Link | https://arxiv.org/abs/2003.00739
Authors | Liang Jiang, Zujie Wen, Zhongping Liang, Yafang Wang, Gerard de Melo, Zhe Li, Liangzhuang Ma, Jiaxing Zhang, Xiaolong Li, Yuan Qi
Affiliations | Ant Financial Services Group; Rutgers University
Notes | Published as a conference paper at AAAI 2020

[55]. Environment-agnostic Multitask Learning for Natural Language Grounded Navigation
Link | https://arxiv.org/abs/2003.00443
Authors | Xin Wang, Vihan Jain, Eugene Ie, William Yang Wang, Zornitsa Kozareva, Sujith Ravi
Affiliations | University of California, Santa Barbara; Google; Amazon

[56]. What Emotions Make One or Five Stars? Understanding Ratings of Online Product Reviews by Sentiment Analysis and XAI
Link | https://arxiv.org/abs/2003.00201
Authors | Chaehan So
Notes | To be published in: Lecture Notes in Artificial Intelligence, 1st International Conference on Artificial Intelligence in HCI, AI-HCI, Held as Part of HCI International 2020, Copenhagen, Denmark, July 19-24, Springer

March 2, 2020

[57]. Do all Roads Lead to Rome? Understanding the Role of Initialization in Iterative Back-Translation
Link | https://arxiv.org/abs/2002.12867
Authors | Mikel Artetxe, Gorka Labaka, Noe Casas, Eneko Agirre

[58]. Metaphoric Paraphrase Generation
Link | https://arxiv.org/abs/2002.12854
Authors | Kevin Stowe, Leonardo Ribeiro, Iryna Gurevych

[59]. UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training
Link | https://arxiv.org/abs/2002.12804
Authors | Hangbo Bao, Li Dong, Furu Wei, Wenhui Wang, Nan Yang, Xiaodong Liu, Yu Wang, Songhao Piao, Jianfeng Gao, Ming Zhou, Hsiao-Wuen Hon
Affiliations | Microsoft Research; Harbin Institute of Technology

[60]. Automatic Section Recognition in Obituaries
Link | https://arxiv.org/abs/2002.12699
Authors | Valentino Sabbatino, Laura Bostan, Roman Klinger
Notes | 9 pages, 1 figure, accepted at LREC 2020

[61]. Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis
Link | https://arxiv.org/abs/2002.12645
Authors | Jennifer Williams, Joanna Rownicka, Pilar Oplustil, Simon King
Affiliations | University of Edinburgh
Notes | Submitted to Odyssey 2020

[62]. TextBrewer: An Open-Source Knowledge Distillation Toolkit for Natural Language Processing
Link | https://arxiv.org/abs/2002.12620
Authors | Ziqing Yang, Yiming Cui, Zhipeng Chen, Wanxiang Che, Ting Liu, Shijin Wang, Guoping Hu
Affiliations | iFLYTEK Research; Harbin Institute of Technology

[63]. DC-BERT: Decoupling Question and Document for Efficient Contextual Encoding
Link | https://arxiv.org/abs/2002.12591
Authors | Yuyu Zhang, Ping Nie, Xiubo Geng, Arun Ramamurthy, Le Song, Daxin Jiang
Affiliations | Georgia Institute of Technology; Peking University; Microsoft

[64]. Modeling Future Cost for Neural Machine Translation
Link | https://arxiv.org/abs/2002.12558
Authors | Chaoqun Duan, Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, Conghui Zhu, Tiejun Zhao
Affiliations | Harbin Institute of Technology

[65]. Robust Unsupervised Neural Machine Translation with Adversarial Training
Link | https://arxiv.org/abs/2002.12549
Authors | Haipeng Sun, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita, Tiejun Zhao
Affiliations | Harbin Institute of Technology

[66]. UKARA 1.0 Challenge Track 1: Automatic Short-Answer Scoring in Bahasa Indonesia
Link | https://arxiv.org/abs/2002.12540
Authors | Ali Akbar Septiandri, Yosef Ardhito Winatmoko

[67]. Temporal Convolutional Attention-based Network For Sequence Modeling
Link | https://arxiv.org/abs/2002.12530
Authors | Hongyan Hao, Yan Wang, Yudi Xia, Jian Zhao, Furao Shen
Affiliations | Nanjing University

[68]. RP-DNN: A Tweet level propagation context based deep neural networks for early rumor detection in Social Media
Link | https://arxiv.org/abs/2002.12683
Authors | Jie Gao, Sooji Han, Xingyi Song, Fabio Ciravegna
Notes | Manuscript accepted for publication in the LREC 2020 proceedings

[69]. A multi-layer approach to disinformation detection on Twitter
Link | https://arxiv.org/abs/2002.12612
Authors | Francesco Pierri, Carlo Piccardi, Stefano Ceri

[70]. Exploring and Distilling Cross-Modal Information for Image Captioning
Link | https://arxiv.org/abs/2002.12585
Authors | Fenglin Liu, Xuancheng Ren, Yuanxin Liu, Kai Lei, Xu Sun
Affiliations | Peking University; Beijing University of Posts and Telecommunications

[71]. Learning Directly from Grammar Compressed Text
Link | https://arxiv.org/abs/2002.12570
Authors | Yoichi Sasaki, Kosuke Akimoto, Takanori Maehara
Affiliations | NEC Corporation

[72]. Comment Ranking Diversification in Forum Discussions
Link | https://arxiv.org/abs/2002.12457
Authors | Curtis G. Northcutt, Kimberly A. Leon, Naichun Chen
Affiliations | MIT
Notes | Published in Learning @ Scale, 2017


For more on the latest advances in natural language processing, technical write-ups, and tutorials, follow the WeChat official account “语言智能技术笔记簿”.