PIX2SEQ: A LANGUAGE MODELING FRAMEWORK FOR OBJECT DETECTION
文章目录ABSTRACT1INTRODUCTION2pix2seq框架2.1SEQUENCECONSTRUCTIONFROMOBJECTDESCRIPTIONS2.2ARCHITECTURE,OBJECTIVEANDINFERENCE2.3SEQUENCEAUGMENTATIONTOINTEGRATETASKPRIORS3EXPERIMENTS3.1EXPERIMENTALSETUP3.2MAIN