2020 ICML: All Papers

38 - ShapeCaptioner: Generative Caption Network for 3D Shapes by Learning a Mapping from Parts Detected in Multiple Views to Sentences

Zhizhong Han (University of Maryland, College Park); Chao Chen (Tsinghua University); Yu-Shen Liu (Tsinghua University)*; Matthias Zwicker (University of Maryland)

 

46 - VideoIC: A Video Interactive Comments Dataset and Multimodal Multitask Learning for Comments Generation

Weiying Wang (Renmin University of China)*; Jieting Chen (Renmin University of China); Qin Jin (Renmin University of China)

 

53 - Image Inpainting Based on Multi-frequency Probabilistic Inference Model

Jin Wang (Beijing University of Technology)*; Chen Wang (Beijing University of Technology); Qingming Huang (University of Chinese Academy of Sciences); Yunhui Shi (Beijing University of Technology); Jian-Feng Cai (The Hong Kong University of Science and Technology); Qing Zhu (Beijing University of Technology); Baocai Yin (Beijing University of Technology)

 

60 - Learning from the Past: Meta-Continual Learning with Knowledge Embedding for Jointly Sketch, Cartoon, and Caricature Face Recognition

Wenbo Zheng (School of Software Engineering, Xi'an Jiaotong University); Lan Yan (The State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences); Chao Gou (School of Intelligent Systems Engineering, Sun Yat-sen University)*; Fei-Yue Wang (The State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences)

 

63 - Dual Adversarial Network for Unsupervised Ground/Satellite-to-Aerial Scene Adaptation

Jianzhe Peter Lin (University of British Columbia)*; Lichao Mou (DLR&TUM); Tianze Yu (University of British Columbia); Xiaoxiang Zhu (Technical University of Munich (TUM); German Aerospace Center (DLR)); Z. Jane Wang (University of British Columbia)

 

68 - Scene-Aware Background Music Synthesis

Yujia Wang (Beijing Institute of Technology)*; Wei Liang (Beijing Institute of Technology); Wanwan Li (George Mason University); Dingzeyu Li (Adobe Research); Lap-Fai Yu (George Mason University)

 

70 - Textual Dependency Embedding for Person Search by Language

Kai Niu (National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences; University of Chinese Academy of Sciences)*; Yan Huang (Institute of Automation, Chinese Academy of Sciences); Liang Wang (NLPR, China)

 

73 - University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization

Zhedong Zheng (University of Technology Sydney)*; Yunchao Wei (UTS); Yi Yang (UTS)

 

75 - Adversarial Bipartite Graph Learning for Video Domain Adaptation

Yadan Luo (University of Queensland)*; Zi Huang (University of Queensland); Zijian Wang (University of Queensland); Zheng Zhang (Harbin Institute of Technology, Shenzhen); Mahsa Baktashmotlagh (University of Queensland)

 

76 - DIPDefend: Deep Image Prior Driven Defense against Adversarial Examples

Tao Dai (Tsinghua University)*; Yan Feng (Tsinghua University); Dongxian Wu (Tsinghua University); Bin Chen (Tsinghua University); Jian Lu (Shenzhen University); Yong Jiang (Tsinghua University); Shutao Xia (Tsinghua University)

 

77 - MRS-Net: Multi-Scale Recurrent Scalable Network for Face Quality Enhancement of Compressed Videos

Tie Liu (BUAA)*; Mai Xu (BUAA); Shengxi Li (Imperial College London); Rui Ding (Beihang University); Huaida Liu (Momo Inc.)

 

78 - TRIE: End-to-End Text Reading and Information Extraction for Document Understanding

Peng Zhang (Hikvision Research Institute); Yunlu Xu (Hikvision Research Institute); Zhanzhan Cheng (Hikvision Research Institute)*; Shiliang Pu (Hikvision Research Institute); Jing Lu (Hikvision Research Institute); Liang Qiao (Hikvision Research Institute); Yi Niu (Hikvision Research Institute); Fei Wu (Zhejiang University)

 

86 - Iterative Back Modification for Faster Image Captioning

Zhengcong Fei (Chinese Academy of Sciences, Institute of Computing Technology)*

 

87 - Visual-Semantic Graph Matching for Visual Grounding

Chenchen Jing (Beijing Institute of Technology); Mingtao Pei (Beijing Institute of Technology); Yuwei Wu (Beijing Institute of Technology (BIT), China)*; Yao Hu (Alibaba Youku Cognitive and Intelligent Lab); Yunde Jia (Beijing Institute of Technology); Qi Wu (University of Adelaide)

 

99 - Human Identification and Interaction Detection in Cross-View Multi-Person Videos with Wearable Cameras

Jiewen Zhao (College of Intelligence and Computing, Tianjin University); Ruize Han (College of Intelligence and Computing, Tianjin University); Yiyang Gan (College of Intelligence and Computing, Tianjin University); Liang Wan (College of Intelligence and Computing, Tianjin University)*; Wei Feng (College of Intelligence and Computing, Tianjin University, China); Song Wang (University of South Carolina)

 

111 - Domain Adaptive Person Re-Identification via Coupling Optimization

Xiaobin Liu (Peking University); Shiliang Zhang (Peking University)*

 

118 - Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge

Peng Wang (Northwestern Polytechnical University); Dongyang Liu (Northwestern Polytechnical University); Hui Li (the University of Adelaide)*; Qi Wu (University of Adelaide)

 

123 - Adversarial Privacy-preserving Filter

Jiaming Zhang (Beijing Jiaotong University, China)*; Jitao Sang (Beijing Jiaotong University, China); Xian Zhao (Beijing Jiaotong University, China); Xiaowen Huang (Beijing Jiaotong University, China); Yanfeng Sun (Beijing University of Technology); Yongli Hu (Beijing University of Technology)

 

135 - Deep Disturbance-disentangled Learning for Facial Expression Recognition

Delian Ruan (Xiamen University); Yan Yan (Xiamen University)*; Si Chen (Xiamen University of Technology); Jing-Hao Xue (University College London); Hanzi Wang (Xiamen University)

 

142 - Controllable Video Captioning with an Exemplar Sentence

Yitian Yuan (Tsinghua University)*; Lin Ma (Tencent AI Lab); Jingwen Wang (Tencent AI Lab); Wenwu Zhu (Tsinghua University)

 

143 - MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos

Guangyao Shen (Tsinghua University)*; Xin Wang (Tsinghua University); Xuguang Duan (Tsinghua University); Hongzhi Li (Microsoft Research); Wenwu Zhu (Tsinghua University)

 

146 - Mix Dimension in Poincaré Geometry for 3D Skeleton-based Action Recognition

Wei Peng (CMVS, University of Oulu)*; Jingang Shi (University of Oulu); Zhaoqiang Xia (Northwestern Polytechnical University); Guoying Zhao (University of Oulu)

 

151 - Online Multi-view Subspace Learning with Mixed Noise

Jinxing Li (The Chinese University of Hong Kong (Shenzhen))*; Hongwei Yong (The Hong Kong Polytechnic University); Feng Wu (University of Science and Technology of China); Mu Li (The Chinese University of Hong Kong, Shenzhen)

 

160 - Few-Shot Ensemble Learning for Video Classification with SlowFast Memory Networks

Mengshi Qi (École polytechnique fédérale de Lausanne (EPFL))*; Jie Qin (Inception Institute of Artificial Intelligence); Xiantong Zhen (University of Amsterdam); Di Huang (Beihang University, China); Yi Yang (UTS); Jiebo Luo (U. Rochester)

 

162 - Single Image De-noising via Staged Memory Network

Weijiang Yu (Sun Yat-sen University)*; Jian Liang (Nanchang University); Lu Li (Zhejiang University); Nong Xiao (Sun Yat-sen University)

 

166 - LAL: Linguistically Aware Learning for Scene Text Recognition

Yi Zheng (Boston University)*; Wenda Qin (Boston University); Derry Wijaya (Boston University); Margrit Betke (Boston University)

 

174 - Emerging Topic Detection on the Meta-data of Images from Fashion Social Media

Kunihiro Miyazaki (The University of Tokyo)*; Scarlett Young (Neural Pocket Inc.); Yuichi Sasaki (Neural Pocket); Takayuki Uchiba (Sugakubunka Co., Ltd.); Kenji Tanaka (The University of Tokyo)

 

180 - Dynamic Extension Nets for Few-shot Semantic Segmentation

Lizhao Liu (South China University of Technology); Junyi Cao (South China University of Technology); Minqian Liu (South China University of Technology); Yong Guo (South China University of Technology); Qi Chen (South China University of Technology); Mingkui Tan (South China University of Technology)*

 

187 - Interpretable Embedding for Ad-Hoc Video Search

Jiaxin Wu (City University of Hong Kong)*; Chong-Wah Ngo (City University of Hong Kong)

 

191 - Joint Attribute Manipulation and Modality Alignment Learning for Composing Text and Image to Image Retrieval

Feifei Zhang (Institute of Automation, Chinese Academy of Sciences); Mingliang Xu (Zhengzhou University); Qirong Mao (Jiangsu University)*; Changsheng Xu (CASIA)

 

192 - Leveraging QoE Heterogeneity for Large-Scale Livecast Scheduling

Ruixiao Zhang (Tsinghua University)*; Ming Ma (Beijing Kuaishou Technology Co., Ltd); Tianchi Huang (Tsinghua University); Hanyu Li (Tsinghua University); Jiangchuan Liu (Simon Fraser University); Lifeng Sun (Tsinghua University)

 

193 - Dual-Structure Disentangling Variational Generation for Data-Limited Face Parsing

Peipei Li (Institute of Automation, Chinese Academy of Sciences)*; Yinglu Liu (JD AI); Hailin Shi (JD AI); Xiang Wu (Reconova); Yibo Hu (Institute of Automation, Chinese Academy of Sciences); Ran He (Institute of Automation, Chinese Academy of Sciences); Zhenan Sun (Chinese Academy of Sciences)

 

194 - Surface Reconstruction with Unconnected Normal Maps: An Efficient Mesh-based Approach

Miaohui Wang (Shenzhen University); Wuyuan Xie (Shenzhen University)*

 

197 - A Human-Computer Duet System for Music Performance

Yuen-Jen Lin (Academia Sinica)*; Hsuan-Kai Kao (Academia Sinica); Yih-Chih Tseng (Academia Sinica); Ming Tsai (KoKo Lab); Li Su (Academia Sinica)

 

199 - LSOTB-TIR: A Large-Scale High-Diversity Thermal Infrared Object Tracking Benchmark

Qiao Liu (Harbin Institute of Technology, Shenzhen); Xin Li (Harbin Institute of Technology, Shenzhen); Zhenyu He (Harbin Institute of Technology (Shenzhen); Peng Cheng Laboratory)*; Chenglong Li (Anhui University); Jun Li (Harbin Institute of Technology, Shenzhen); Zikun Zhou (Harbin Institute of Technology, Shenzhen); Di Yuan (Harbin Institute of Technology, Shenzhen; Monash University); Jing Li (Harbin Institute of Technology, Shenzhen); Kai Yang (Harbin Institute of Technology, Shenzhen); Nana Fan (Harbin Institute of Technology, Shenzhen); Feng Zheng (SUSTech)

 

202 - Invisible: Federated Learning over Non-Informative Intermediate Updates against Multimedia Privacy Leakages

Qiushi Li (Tsinghua University)*; Wenwu Zhu (Tsinghua University); Chao Wu (Tsinghua University); Xinglin Pan (University of Electronic Science and Technology of China); Fan Yang (Tsinghua University); Yuezhi Zhou (Tsinghua University); Yaoxue Zhang (Tsinghua University)

 

204 - Cascade Reasoning Network For Text-based Visual Question Answering

Fen Liu (South China University of Technology); Guanghui Xu (South China University of Technology); Qi Wu (University of Adelaide); Qing Du (South China University of Technology); Wei Jia (CVTE Research); Mingkui Tan (South China University of Technology)*

 

208 - Fast Enhancement for Non-Uniform Illumination Images using Light-weight CNNs

Feifan Lv (Beihang University); Bo Liu (Beihang University); Feng Lu (Beihang University)*

 

209 - Animating Through Warping: an Efficient Method for High-Quality Facial Expression Animation

Zili Yi (Huawei Canada)*; Qiang Tang (Huawei Canada); Vishnu Sanjay Ramiya Srinivasan (Huawei); Zhan Xu (Huawei Canada)

 

211 - Exploiting Better Feature Aggregation for Video Object Detection

Liang Han (Stony Brook University); Pichao Wang (Alibaba Group (U.S.) Inc.)*; Zhaozheng Yin (Stony Brook University); Fan Wang (Alibaba Group); Hao Li (Alibaba Group)

 

220 - NuI-Go: Recursive Non-local Encoder-Decoder Network for Retinal Image Non-uniform Illumination Removal

Chongyi Li (Nanyang Technological University)*; Huazhu Fu (Inception Institute of Artificial Intelligence); Runmin Cong (Beijing Jiaotong University); Zechao Li (Nanjing University of Science and Technology); Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences)

 

221 - Every Moment Matters: Detail-Aware Networks to Bring a Blurry Image Alive

Kaihao Zhang (Australian National University)*; Wenhan Luo (Tencent AI Lab); Bjorn Stenger (Rakuten Institute of Technology); Wenqi Ren (Institute of Information Engineering, Chinese Academy of Sciences); Lin Ma (Tencent AI Lab); Hongdong Li (Australian National University, Australia)

 

227 - Online Filtering Training Samples for Robust Visual Tracking

Jie Zhao (Dalian University of Technology); Kenan Dai (Dalian University of Technology); Dong Wang (Dalian University of Technology)*; Huchuan Lu (Dalian University of Technology)

 

228 - Boosting Continuous Sign Language Recognition via Cross Modality Augmentation

Junfu Pu (University of Science and Technology of China)*; Hezhen Hu (University of Science and Technology of China); Wengang Zhou (University of Science and Technology of China); Houqiang Li (University of Science and Technology of China)

 

230 - Self-supervised Dance Video Synthesis Conditioned on Music

Xuanchi Ren (HKUST); Haoran Li (The Hong Kong University of Science and Technology); Zijian Huang (The Hong Kong University of Science and Technology); Qifeng Chen (HKUST)*

 

232 - Co-Attentive Lifting for Infrared-Visible Person Re-Identification

Xing Wei (Xi'an Jiaotong University)*; Diangang Li (Xi'an Jiaotong University); Xiaopeng Hong (Xi'an Jiaotong University); Wei Ke (Xi'an Jiaotong University); Yihong Gong (Xi'an Jiaotong University)

 

235 - MOR-UAV: A Benchmark Dataset and Baselines for Moving Object Recognition in UAV Videos

Murari Mandal (Malaviya National Institute of Technology Jaipur)*; Lav Kush Kumar (Malaviya National Institute of Technology Jaipur); Santosh Kumar Vipparthi (MNIT)

 

241 - Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization

Daizong Liu (Huazhong University of Science and Technology)*; Xiaoye Qu (Huazhong University of Science and Technology); Xiao-Yang Liu (Columbia University); Jianfeng Dong (Zhejiang Gongshang University); Pan Zhou (Huazhong University of Science and Technology); Zichuan Xu (Dalian University of Technology)

 

251 - Learning Tuple Compatibility for Conditional Outfit Recommendation

Xuewen Yang (Stony Brook University)*; Jiangbo Yuan (eBay Inc.); Wanying Ding (JPMorgan Chase & Co.); Pengyun Yan (Vipshop Inc); Dongliang Xie (Beijing University of Posts and Telecommunications); Xin Wang (Stony Brook University)

 

265 - ThumbNet: One Thumbnail Image Contains All You Need for Recognition

Chen Zhao (KAUST)*; Bernard Ghanem (KAUST)

 

267 - Efficient Crowd Counting via Structured Knowledge Transfer

Lingbo Liu (Sun Yat-sen University)*; Jiaqi Chen (Sun Yat-sen University); Hefeng Wu (Sun Yat-sen University); Tianshui Chen (DarkMatter AI); Guanbin Li (Sun Yat-sen University); Liang Lin (DarkMatter AI)

 

274 - Text-guided Image Inpainting

Zijian Zhang (Zhejiang University)*; Zhou Zhao (Zhejiang University); Zhu Zhang (Zhejiang University); Baoxing Huai (Huawei Technologies Co., Ltd.); Jing Yuan (Huawei Cloud BU)

 

282 - Lab2Pix: Label-Adaptive Generative Adversarial Network for Unsupervised Image Synthesis

Lianli Gao (The University of Electronic Science and Technology of China); Junchen Zhu (University of Electronic Science and Technology of China); Jingkuan Song (UESTC)*; Feng Zheng (SUSTech); Heng Tao Shen (University of Electronic Science and Technology of China (UESTC))

 

291 - Dynamic GCN: Context-enriched Topology Learning for Skeleton-based Action Recognition

Fanfan Ye (Hikvision Research Institute); Shiliang Pu (Hikvision Research Institute); Qiaoyong Zhong (Hikvision Research Institute)*; Chao Li (Hikvision Research Institute); Di Xie (Hikvision Research Institute); Huiming Tang (Zhejiang University)

 

298 - Dual Temporal Memory Network for Efficient Video Object Segmentation

Kaihua Zhang (NUIST)*; Long Wang (Nanjing University of Information Science & Technology); Dong Liu (Netflix Inc); Bo Liu (JD.com); Qingshan Liu (Nanjing University of Information Science & Technology); Zhu Li (University of Missouri-Kansas City)

 

304 - Boosting Visual Question Answering with Context-aware Knowledge Aggregation

Guohao Li (Tsinghua University)*; Xin Wang (Tsinghua University); Wenwu Zhu (Tsinghua University)

 

306 - Meta Parsing Networks: Towards Generalized Few-shot Scene Parsing with Adaptive Metric Learning

Peike Li (UTS)*; Yunchao Wei (University of Technology Sydney); Yi Yang (UTS)

 

312 - CODAN: Counting-driven Attention Network for Vehicle Detection in Congested Scenes

Wei Li (Southwest Jiaotong University); Zhenting Wang (Southwest Jiaotong University); Xiao Wu (Southwest Jiaotong University)*; Ji Zhang (Southwest Jiaotong University); Qiang Peng (Southwest Jiaotong University); Hongliang Li (University of Electronic Science and Technology of China)

 

318 - Cooperative Bi-path Metric for Few-shot Learning

Zeyuan Wang (Beihang University); Yifan Zhao (Beihang University); Jia Li (Beihang University)*; Yonghong Tian (Peking University)

 

325 - Deep Unsupervised Hybrid-similarity Hadamard Hashing

Wanqian Zhang (Institute of Information Engineering, Chinese Academy of Sciences); Dayan Wu (Institute of Information Engineering, Chinese Academy of Sciences)*; Yu Zhou (Institute of Information Engineering, CAS); Bo Li (Institute of Information Engineering, Chinese Academy of Sciences); Weiping Wang (Institute of Information Engineering, CAS, China); Dan Meng (Institute of Information Engineering, CAS)

 

330 - Semi-supervised Online Multi-Task Metric Learning for Visual Recognition and Retrieval

Yangxi Li (National Computer Network Emergency Response Technical Team/Coordination Center of China)*; Han Hu (Beijing Institute of Technology, China); Jin Li (Beijing University of Posts and Telecommunications); Yong Luo (Nanyang Technological University); Yonggang Wen (Nanyang Technological University)

 

352 - Modeling both Intra- and Inter-modal Influence for Real-Time Emotion Detection in Conversations

Dong Zhang (Soochow University)*; Weisheng Zhang (Soochow University); Shoushan Li (Soochow University); Zhu Qiaoming (Soochow University); Zhou Guodong (Soochow University)

 

355 - WIKI Food-500: A dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network

Weiqing Min (Institute of Computing Technology, Chinese Academy of Sciences)*; Linhu Liu (ICT); Zhiling Wang (Institute of Computing Technology, Chinese Academy of Sciences); Zhengdong Luo (University of Chinese Academy of Sciences); Xiaoming Wei (MeituanDianping group); Xiaolin Wei (MeituanDianping group); Shuqiang Jiang (ICT, China Academy of Science)

 

356 - RT-VENet: A Convolutional Network for Real-time Video Enhancement

Mohan Zhang (Zhejiang University); Qiqi Gao (Microsoft Research Asia); Jinglu Wang (Microsoft Research Asia); Henrik Turbell (Microsoft); David Zhao (Microsoft); Jinhui Yu (Zhejiang University); Yan Lu (Microsoft Research Asia)*

 

358 - Learning Image Classifier from Only Web Labels and Metadata: Automatic Label Correction through Graph

Jingkang Yang (SenseTime Research)*; Weirong Chen (SenseTime Research); Litong Feng (SenseTime Research); Xiaopeng Yan (SenseTime Research); Huabin Zheng (SenseTime Research); Wayne Zhang (SenseTime Research)

 

359 - From Design Draft to Real Attire: Unaligned Fashion Image Translation

Yu Han (Peking University)*; Shuai Yang (Peking University); Wenjing Wang (Peking University); Jiaying Liu (Peking University)

 

362 - Towards More Explainability: Concept Knowledge Mining Network for Event Recognition

Zhaobo Qi (University of Chinese Academy of Sciences)*; Shuhui Wang (VIPL, ICT, Chinese Academy of Sciences); Chi Su (Kingsoft Cloud); Li Su (University of Chinese Academy of Sciences); Qingming Huang (University of Chinese Academy of Sciences); Qi Tian (Huawei Cloud & AI)

 

365 - Multi-task Regression for Facial Action Unit Intensity Estimation via Differentiable Renderer

Xinhui Song (NetEase Fuxi AI Lab)*; Tianyang Shi (NetEase Fuxi AI Lab); Zunlei Feng (Zhejiang University); Mingli Song (Zhejiang University); Jackie Lin (NetEase Fuxi AI Lab); Chuanjie Lin (NetEase Fuxi AI Lab); Yi Yuan (NetEase Fuxi AI Lab); Changjie Fan (NetEase Fuxi AI Lab)

 

370 - Siamese Attentive Graph Tracking

Fei Zhao (Alibaba Group & Institute of Automation, Chinese Academy of Sciences)*; Ting Zhang (CEIEC); Chao Ma (Shanghai Jiao Tong University); Ming Tang (Chinese Academy of Sciences, China); Jinqiao Wang (Institute of Automation, Chinese Academy of Sciences); Xiaobo Wang (Alibaba Group)

 

373 - Photo Stand-Out: Photography with Virtual Character

Yujia Wang (Beijing Institute of Technology)*; Sifan Hou (Beijing Institute of Technology); Wei Liang (Beijing Institute of Technology); Bing Ning (Beijing Institute of Fashion Technology)

 

376 - DeSmoothGAN: Recovering Details of Smoothed Images via Spatial Feature-wise Transformation and Full Attention

Yifei Huang (East China Normal University)*; Chenhui Li (East China Normal University); Xiaohu Guo (The University of Texas at Dallas); Jing Liao (City University of Hong Kong); Chenxu Zhang (The University of Texas at Dallas); Changbo Wang (East China Normal University)

 

378 - Accurate UAV Tracking with Distance-Injected Overlap Maximization

Chunhui Zhang (Chinese Academy of Sciences); Shiming Ge (Chinese Academy of Sciences)*; Kangkai Zhang (Chinese Academy of Sciences); Dan Zeng (Shanghai University)

 

380 - Look Through Masks: Towards Occluded Face Recognition with Amodal Completion

Chenyu Li (Chinese Academy of Sciences); Shiming Ge (Chinese Academy of Sciences)*; Daichi Zhang (Chinese Academy of Sciences); Jia Li (Beihang University)

 

383 - Context-Aware Multi-View Summarization Network for Image-Text Matching

Leigang Qu (Shandong University); Meng Liu (Shandong Jianzhu University); Da Cao (Hunan University); Liqiang Nie (Shandong University)*; Qi Tian (Huawei Cloud & AI)

 

389 - Supervised Hierarchical Deep Hashing for Cross-Modal Retrieval

Yu-Wei Zhan (Shandong University); Xin Luo (Shandong University)*; Yongxin Wang (Shandong University); Xin-Shun Xu (Shandong University)

 

391 - PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music

Hongru Liang (Nankai University); Wenqiang Lei (National University of Singapore)*; Paul Yaozhu Chan (A*STAR); Zhenglu Yang (Nankai University); Maosong Sun (Tsinghua University); Tat-Seng Chua (National Univ. of Singapore)

 

395 - An Egocentric Action Anticipation Framework via Fusing Intuition and Analysis

Tianyu Zhang (ICT)*; Weiqing Min (Institute of Computing Technology, Chinese Academy of Sciences); Ying Zhu (University of Chinese Academy of Sciences); Yong Rui (Lenovo); Shuqiang Jiang (ICT, China Academy of Science)

 

396 - HiFaceGAN: Face Renovation via Collaborative Suppression and Replenishment

Lingbo Yang (Peking University)*; Chang Liu (University of Chinese Academy of Sciences); Pan Wang (Alibaba Group); Shanshe Wang (Peking University); Peiran Ren (Alibaba); Siwei Ma (Peking University, China); Wen Gao (PKU)

 

397 - PatchMatch based Multiview Stereo with Local Quadric Window

Hyewon Song (Yonsei University); Jaeseong Park (Yonsei University); Suwoong Heo (Yonsei University); Jiwoo Kang (Yonsei University); Sanghoon Lee (Yonsei University, Korea)*

 

403 - Regularized Two-Branch Proposal Networks for Weakly-Supervised Moment Retrieval in Videos

Zhu Zhang (Zhejiang University)*; Zhijie Lin (Zhejiang University); Zhou Zhao (Zhejiang University); Jieming Zhu (Huawei Noah's Ark Lab); Xiuqiang He (Huawei Noah's Ark Lab)

 

411 - Discernible Image Compression

Zhaohui Yang (Peking University)*; Yunhe Wang (Huawei Technologies); Chang Xu (University of Sydney); Peng Du (Hangzhou Dianzi University); Chao Xu (Peking University); Chunjing Xu (Huawei Noah's Ark Lab); Qi Tian (Huawei Cloud & AI)

 

419 - Feature Reintegration over Differential Treatment: A Top-down and Adaptive Fusion Network for RGB-D Salient Object Detection

Miao Zhang (Dalian University of Technology); Yu Zhang (Dalian University of Technology); Yongri Piao (Dalian University of Technology)*; Beiqi Hu (Dalian University of Technology); Huchuan Lu (Dalian University of Technology)

 

434 - Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation

Jialian Wu (State University of New York at Buffalo)*; Liangchen Song (University at Buffalo); Tiancai Wang (Tianjin University); Qian Zhang (Horizon Robotics); Junsong Yuan (State University of New York at Buffalo, USA)

 

436 - Label Embedding Online Hashing for Cross-Modal Retrieval

Yongxin Wang (Shandong University); Xin Luo (Shandong University); Xin-Shun Xu (Shandong University)*

 

442 - Privacy-sensitive Objects Pixelation for Live Video Streaming

Jizhe Zhou (University of Macau); Chi-Man Pun (University of Macau)*; Yu Tong (University of Macau)

 

444 - Cloze Test Helps: Effective Video Anomaly Detection via Learning to Complete Video Events

Guang Yu (National University of Defense Technology)*; Siqi Wang (National University of Defense Technology); Zhiping Cai (NUDT); En Zhu (National University of Defense Technology); Chuanfu Xu (National University of Defense Technology); Jianping Yin (National University of Defense Technology); Marius Kloft (TU Kaiserslautern)

 

446 - All-in-depth via Cross-baseline Light Field Camera

Dingjian Jin (Tsinghua University)*; Anke Zhang (Tsinghua University); Jiamin Wu (Tsinghua University); Gaochang Wu (Northeastern University); Haoqian Wang (Graduate School at Shenzhen, Tsinghua University); Lu Fang (Tsinghua University)

 

458 - Dual Path Interaction Network for Video Moment Localization

Hao Wang (University of Science and Technology of China)*; Zheng-Jun Zha (University of Science and Technology of China); Xuejin Chen (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China); Jiebo Luo (U. Rochester)

 

464 - Adv-watermark: A Novel Watermark Perturbation for Adversarial Examples

Xiaojun Jia (Institute of Information Engineering, Chinese Academy of Sciences); Xingxing Wei (Beihang University); Xiaochun Cao (Chinese Academy of Sciences)*; Xiaoguang Han (Shenzhen Research Institute of Big Data, the Chinese University of Hong Kong (Shenzhen))

 

480 - Deep Multimodal Neural Architecture Search

Zhou Yu (Hangzhou Dianzi University); Yuhao Cui (Hangzhou Dianzi University); Jun Yu (HDU)*; Meng Wang (Hefei University of Technology); Dacheng Tao (The University of Sydney); Qi Tian (Huawei Cloud & AI)

 

484 - CRSSC: Salvage Reusable Samples from Noisy Data for Robust Learning

Zeren Sun (Nanjing University of Science and Technology); Xian-Sheng Hua (Alibaba Group); Yazhou Yao (Nanjing University of Science and Technology)*; Xiu-Shen Wei (Nanjing University of Science and Technology); Guosheng Hu (AnyVision); Jian Zhang (UTS)

 

488 - Deep Local Binary Coding for Person Re-Identification by Delving into the Details

Jiaxin Chen (Inception Institute of Artificial Intelligence)*; Jie Qin (Inception Institute of Artificial Intelligence); Yichao Yan (Inception Institute of Artificial Intelligence); Lei Huang (Inception Institute of Artificial Intelligence); Li Liu (Inception Institute of Artificial Intelligence); Fan Zhu (Inception Institute of Artificial Intelligence); Ling Shao (Inception Institute of Artificial Intelligence)

 

497 - Expert Performance in the Examination of Interior Surfaces in an Automobile: Virtual Reality vs. Reality

Alexander Tesch (Volkswagen AG)*; Ralf Doerner (HS Rhein-Main)

 

508 - A Dual In-painting Model for Unsupervised Gaze Correction and Animation in the Wild

Jichao Zhang (University of Trento)*; Jingjing Chen (Shandong University); Hao Tang (University of Trento); Wei Wang (EPFL); Yan Yan (Texas State University); Enver Sangineto (University of Trento); Nicu Sebe (University of Trento)

 

519 - MMFL: Multimodal Fusion Learning for Text-Guided Image Inpainting

Qing Lin (Fudan University); Bo Yan (Fudan University)*; Jichun Li (Fudan University); Weimin Tan (Fudan University)

 

527 - Learning Hierarchical Graph for Occluded Pedestrian Detection

Gang Li (Nanjing University of Science and Technology); Jian Li (Tencent Youtu); Shanshan Zhang (Max Planck Institute for Informatics)*; Jian Yang (Nanjing University of Science and Technology)

 

531 - Vision Meets Wireless Positioning: Effective Person Re-identification with Recurrent Context Propagation

Yiheng Liu (University of Science and Technology of China)*; Wengang Zhou (University of Science and Technology of China); Mao Xi (University of Science and Technology of China); Sanjing Shen (University of Science and Technology of China); Houqiang Li (University of Science and Technology of China)

 

541 - Learning From Music to Visual Storytelling of Shots: A Deep Interactive Learning Mechanism

Jen-Chun Lin (Academia Sinica)*; Wen-Li Wei (Academia Sinica); Yen-Yu Lin (National Chiao Tung University); Tyng-Luh Liu (Academia Sinica); Hong-Yuan Mark Liao (Institute of Information Science, Academia Sinica, Taiwan)

 

543 - Adaptively-Accumulated Knowledge Transfer for Partial Domain Adaptation

Taotao Jing (Indiana University-Purdue University Indianapolis); Haifeng Xia (Indiana University-Purdue University Indianapolis); Zhengming Ding (Indiana University-Purdue University Indianapolis)*

 

549 - Multi-graph Convolutional Network for Unsupervised 3D Shape Retrieval

Weizhi Nie (Tianjin University); Yue Zhao (Tianjin University); An-An Liu (Tianjin University)*; Zan Gao (Qilu University of Technology (Shandong Academy of Sciences), Shandong Computer Science Center (National Supercomputer Center in Jinan), Shandong Artificial Intelligence Institute, China); Yu-ting Su (Tianjin University)

 

553 - Asymmetric Deep Hashing for Efficient Hash Code Compression

Shu Zhao (Institute of Information Engineering, Chinese Academy of Sciences); Dayan Wu (Institute of Information Engineering, Chinese Academy of Sciences)*; Wanqian Zhang (Institute of Information Engineering, Chinese Academy of Sciences); Yu Zhou (Institute of Information Engineering, CAS); Bo Li (Institute of Information Engineering, Chinese Academy of Sciences); Weiping Wang (Institute of Information Engineering, CAS, China)

 

554 - Box Guided Convolution for Pedestrian Detection

Jinpeng Li (Inception Institute of Artificial Intelligence); Shengcai Liao (Inception Institute of Artificial Intelligence)*; Hangzhi Jiang (CASIA); Ling Shao (Inception Institute of Artificial Intelligence)

 

580 - Cap2Seg: Inferring Semantic and Spatial Context from Captions for Zero-Shot Image Segmentation

Guiyu Tian (Peking University); Shuai Wang (BOE); Jie Feng (BOE); Li Zhou (BOE); Yadong Mu (Peking University)*

 

585 - Bottom-Up Foreground-Aware Feature Fusion for Person Search

Wenjie Yang (Institute of Automation, Chinese Academy of Sciences)*; Dangwei Li (Institute of Automation, Chinese Academy of Sciences); Xiaotang Chen (Institute of Automation, Chinese Academy of Sciences); Kaiqi Huang (Institute of Automation, Chinese Academy of Sciences)

 

588 - Quaternion-Based Knowledge Graph Network for Recommendation

Zhaopeng Li (State Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of Sciences; University of Chinese Academy of Sciences)*; Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences); Yangbangyan Jiang (Institute of Information Engineering, Chinese Academy of Sciences; University of Chinese Academy of Sciences); Xiaochun Cao (Chinese Academy of Sciences); Qingming Huang (University of Chinese Academy of Sciences)

 

601 - Multi-Person Action Recognition in Microwave Sensors

Diangang Li (Xi'an Jiaotong University); Jianquan Liu (NEC Corporation)*; Shoji Nishimura (NEC Corporation); Yuka Hayashi (NEC Corporation); Jun Suzuki (NEC Corporation); Yihong Gong (Xi'an Jiaotong University)

 

605 - "Stronger, Faster and More Explainable: A Graph Convolutional Baseline for Skeleton-based Action Recognition"

"Yi-Fan Song (University of Chinese Academy of Sciences)*; Zhang Zhang (Institute of Automation, Chinese Academy of Sciences); Caifeng Shan (CAS-AIR); Liang Wang (NLPR, China)"

 

606 - Spatial-Temporal Knowledge Integration: Robust Self-Supervised Facial Landmarks Tracking

Congcong Zhu (Shanghai University); Xiaoqiang Li (Shanghai University)*; Jide Li (Shanghai University); Guangtai Ding (Shanghai University); Weiqin Tong (Shanghai University)

 

612 - Norm-in-Norm Loss with Faster Convergence and Better Performance for Image Quality Assessment

Dingquan Li (Peking University); Tingting Jiang (Peking University)*; Ming Jiang (Peking University)

 

615 - Weakly Supervised 3D Object Detection from Point Clouds

Zengyi Qin (MIT); Jinglu Wang (Microsoft Research Asia)*; Yan Lu (Microsoft Research Asia)

 

619 - Neutral Face Game Character Auto-Creation via Poker-GAN

Tianyang Shi (NetEase Fuxi AI Lab)*; Zhengxia Zou (University of Michigan); Xinhui Song (NetEase Fuxi AI Lab); Zheng Song (NetEase Fuxi AI Lab); Changjian Gu (NetEase Fuxi AI Lab); Yi Yuan (NetEase Fuxi AI Lab); Changjie Fan (NetEase Fuxi AI Lab)

 

621 - DIMC-net: Deep Incomplete Multi-view Clustering Network

"Jie Wen (Harbin Institute of Technology, Shenzhen)*; Zheng Zhang (Harbin Institute of Technology, Shenzhen); Zhihao Wu (Harbin Institute of Technology, Shenzhen); Lunke Fei (Guangdong University of Technology); Zhao Zhang (Hefei University of Technology); Yong Xu (Harbin Institute of Technology Shenzhen Graduate School); Bob Zhang (Univerisity of Macau)"

 

630 - Adversarial Image Attacks Using Multi-Sample and Most-Likely Ensemble Methods

Xia Du (University of Macau); Chi-Man Pun (University of Macau)*

 

632 - Cross-domain Cross-modal Food Transfer

Bin Zhu (City University of Hong Kong)*; Chong-Wah Ngo (City University of Hong Kong); Jingjing Chen (Fudan University)

 

639 - Coupling deep textural and shape features for sketch retrieval

Qi Jia (Dalian University of Technology); Xin Fan (Dalian University of Technology)*; Meiyu Yu (Didi Chuxing); Yuqing Liu (Dalian University of Technology); Dingrong Wang (Dalian University of Technology); Longin Jan Latecki (Temple University)

 

647 - Memory-Augmented Relation Network for Few-Shot Learning

He Jun (Hefei University of Technology)*; Richang Hong (Hefei University of Technology); Xueliang Liu (Hefei University of Technology); Mingliang Xu (Zhengzhou University); Zheng-Jun Zha (University of Science and Technology of China); Meng Wang (Hefei University of Technology)

 

654 - Panoptic Image Annotation with a Collaborative Assistant

Jasper Uijlings (Google Research)*; Misha Andriluka (Google); Vittorio Ferrari (Google Research)

 

663 - Rethinking Generative Zero-Shot Learning: An Ensemble Learning Perspective for Recognising Visual Patches

Zhi Chen (The University of Queensland)*; Sen Wang (The University of Queensland); Jingjing Li (University of Electronic Science and Technology of China); Zi Huang (University of Queensland)

 

668 - Performance Optimization of Federated Person Re-identification via Benchmark Analysis

Weiming Zhuang (Nanyang Technological University)*; Yonggang Wen (Nanyang Technological University); Xuesen Zhang (SenseTime); Xin Gan (SenseTime); Daiying Yin (SenseTime); Dongzhan Zhou (The University of Sydney); shuai zhang (Sensetime Ltd); Shuai Yi (SenseTime Group Limited)

 

688 - Surpassing Real-World Source Training Data: Random 3D Characters for Generalizable Person Re-Identification

Yanan Wang (Inception Institute of Artificial Intelligence)*; Shengcai Liao (Inception Institute of Artificial Intelligence); Ling Shao (Inception Institute of Artificial Intelligence)

 

691 - Guided Attention Network for Object Detection and Counting on Drones

CAI YuanQiang (UCAS); Dawei Du (University of Chinese Academy of Sciences); Libo Zhang (Institute of Software Chinese Academy of Sciences)*; Longyin Wen (JD Digit); Weiqiang Wang (University of Chinese Academy of Sciences); Yanjun Wu (Institute of Software Chinese Academy of Sciences); Siwei Lyu (University at Albany)

 

696 - K-armed Bandit based Multi-Modal Network Architecture Search for Visual Question Answering

"Yiyi Zhou (Xiamen University); Rongrong Ji (Xiamen University, China)*; Xiaoshuai Sun ( Xiamen University); Gen Luo (Xiamen University); Xiaopeng Hong (Xi'an Jiaotong University); Jinsong Su (Xiamen University); Xinghao Ding (Xiamen University); Ling Shao (Inception Institute of Artificial Intelligence)"

 

700 - Simultaneous Semantic Alignment Network for Heterogeneous Domain Adaptation

Shuang Li (Beijing Institute of Technology); Binhui Xie (Beijing Institute of Technology); Jiashu Wu (University of Melbourne); Ying Zhao (Beijing Institute of Technology); Chi Harold Liu (Beijing Institute of Technology)*; Zhengming Ding (Indiana University-Purdue University Indianapolis)

 

701 - TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection

"Fangfang Wang (Zhejiang University)*; Yifeng Chen (Zhejiang University); Fei Wu (Zhejiang University, China); Xi Li (Zhejiang University)"

 

703 - Deep Cross-scale Fusion Network for Single Image Rain Removal

Cong Wang (Dalian University of Technology)*; Xiaoying Xing (Tsinghua University); Zhixun Su (Dalian University of Technology); junyang chen (University of Macau)

 

704 - Class-Aware Modality Mix and Center-Guided Metric Learning for Visible-Thermal Person Re-Identification

"Yongguo Ling (Xiamen University)*; Zhun Zhong (University of Trento); Zhiming Luo (Xiamen University); Paolo Rota (University of Trento); Shaozi Li (Xiamen University, China); Nicu Sebe (University of Trento)"

 

707 - Adversarial Graph Representation Adaptation for Cross-Domain Facial Expression Recognition

Yuan Xie (DarkMatter AI); Tianshui Chen (DarkMatter AI)*; Tao Pu (Sun Yat-sen University); Hefeng Wu (Sun Yat-sen University); Liang Lin (DarkMatter AI)

 

708 - Self-Paced Video Data Augmentation by Generative Adversarial Networks with Insufficient Samples

Yumeng Zhang (Tsinghua University); GaoGuo Jia (Tsinghua University); Li Chen (Tsinghua University)*; MingRui Zhang (Beijing University of Posts and Telecommunications); JunHai Yong (Tsinghua University)

 

710 - Weakly Supervised Real-time Image Cropping based on Aesthetic Distributions

"Peng Lu (Beijing University of Posts and Telecommunications)*; Jiahui Liu (Beijing University of Posts and Telecommunications); Xujun Peng (Information Sciences Institute, University of Southern California); Xiaojie Wang (Beijing University of Posts and Telecommunications)"

 

732 - Towards Unsupervised Crowd Counting via Regression-Detection Bi-knowledge Transfer

Yuting Liu (Sichuan University)*; Zheng Wang (National Institute of Informatics); Miaojing Shi (King's College London); Shin'ichi Satoh (National Institute of Informatics); Qijun Zhao (Sichuan University); Hongyu Yang (Sichuan University)

 

734 - KBGN: Knowledge-Bridge Graph Network for Adaptive Vision-Text Reasoning in Visual Dialogue

"Xiaoze Jiang (Intelligent Computing & Machine Learning Lab, School of ASEE, Beihang University); Siyi Du (Intelligent Computing & Machine Learning Lab, School of ASEE, Beihang University); Zengchang Qin (Intelligent Computing & Machine Learning Lab, School of ASEE, Beihang University)*; Yajing Sun (Institute of Information Engineering,Chinese Academy of Sciences); JING YU (Institute of Information Engineering, Chinese Academy of Sciences)"

 

736 - Uncertainty-based Traffic Accident Anticipation with Spatio-Temporal Relational Learning

Wentao Bao (Rochester Institute of Technology)*; Qi Yu (Rochester Institute of Technology); Yu Kong (Rochester Institute of Technology)

 

737 - Occluded Prohibited Items Detection: An X-ray Security Inspection Benchmark and De-occlusion Attention Module

Yanlu Wei (Beihang University); Renshuai Tao (Beihang University)*; Zhangjie Wu (Beihang University); Yuqing Ma (Beihang University); Libo Zhang (Institute of Software Chinese Academy of Sciences); Xianglong Liu (BUAA)

 

738 - CF-SIS: Semantic-Instance Segmentation of 3D Point Clouds by Context Fusion with Self-Attention

"Xin Wen (Tsinghua University); Zhizhong Han (University of Maryland, College Park); Geunhyuk Youk (Tsinghua University); Yu-Shen Liu (Tsinghua University)*"

 

743 - Diverter-Guider Recurrent Network for Diverse Poems Generation from Image

"liang li (Institute of Computing Technology, Chinese Academy of Sciences)*; Shijie Yang (vipl,ict,Chinese academic of science); Li Su (University of Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Chenggang Yan (Hangzhou Dianzi University); Zheng-Jun Zha (University of Science and Technology of China); Qingming Huang (University of Chinese Academy of Sciences)"

 

758 - Hybrid Resolution Network Using Edge Guided Region Mutual Information Loss for Human Parsing

Yunan Liu (Nanjing University of Science & Technology)*; Liang Zhao (Nanjing University of Science & Technology); Shanshan Zhang (Max Planck Institute for Informatics); Jian Yang (Nanjing University of Science and Technology)

 

760 - Meta-RCNN: Meta Learning for Few-Shot Object Detection

Xiongwei Wu (Singapore Management U)*; Doyen Sahoo (Salesforce); Steven Hoi (Singapore Management University)

 

762 - Texture Semantically Aligned with Visibility-aware for Partial Person Re-identification

"Lishuai Gao (Tianjin University of Technology); Hua Zhang (Tianjin University of Technology); Zan Gao (1. Shandong AI Institute, QiLU University of Technology, 2. Shandong Computer Science Center(National Supercomputer Center in Jinan), 3. Tianjing University of Technology)*; Weili Guan (Monash University); Zhiyong Cheng (Shandong Academy of Sciences); Meng Wang (Hefei University of Technology)"

 

765 - Context-aware Attention Network for Predicting Image Aesthetic Subjectivity

"Munan Xu (Shenzhen Graduate School, Peking University); Jia-Xing Zhong (School of Electronic and Computer Engineering, Peking University); Yurui Ren (Shenzhen Graduate School, Peking University); Shan Liu (Tencent America); Ge Li (SECE, Shenzhen Graduate School, Peking University)*"

 

769 - OCR: Objectness Consistent Representation for Weakly Supervised Object Detection

"Ke Yang (NUDT)*; Peng Zhang (NUDT); Peng Qiao (NUDT); Zhiyuan Wang (AIRC); Dongsheng Li (School of Computer Science, National University of Defense Technology); Yong Dou (National University of Defense Technology)"

 

774 - Bridging the Gap between Vision and Language Domains for Improved Image Captioning

Fenglin Liu (Peking University)*; Xian Wu (Tencent Medical AI Lab); Shen Ge (Tencent Medical AI Lab); Xiaoyu Zhang (Peking University); Wei Fan (Tencent); Yuexian Zou (Peking University)

 

783 - PIDNet: An Efficient Network for Dynamic Pedestrian Intrusion Detection

Jingchen Sun (Zhejiang University); Jiming Chen (Zhejiang University); Tao Chen (Fudan University); jiayuan fan (Fudan University); Shibo He (Zhejiang University)*

 

787 - ChoreoNet: Towards Music to Dance Synthesis with Choreographic Action Unit

"Zijie Ye (Tsinghua University)*; Haozhe Wu (Tsinghua University); Jia Jia (Tsinghua University); Yaohua Bu (Tsinghua University); Wei Chen (Beijing Sougou Science and Technology Development Co., Ltd); Fanbo Meng (Sogou Corporation, Beijing, China); Yanfeng Wang ( Beijing Sougou Science and Technology Development Co., Ltd)"

 

790 - Unpaired Image Enhancement with Quality-Attention Generative Adversarial Network

Zhangkai NI (City University of Hong Kong)*; Wenhan Yang (Peking University); Shiqi Wang (CityU); Lin Ma (Tencent AI Lab); Sam Kwong (City University of Hong Kong)

 

793 - STRONG: Spatio-Temporal Reinforcement Learning for Cross-Modal Video Moment Localization

Da Cao (Hunan University)*; Yawen Zeng (Hunan University); Meng Liu (Shandong Jianzhu University); Xiangnan He (University of Science and Technology of China); Meng Wang (Hefei University of Technology); Zheng Qin (Hunan University)

 

794 - Adversarial Video Moment Retrieval by Jointly Modeling Ranking and Localization

Da Cao (Hunan University)*; Yawen Zeng (Hunan University); Xiaochi Wei (Baidu Inc.); Liqiang Nie (Shandong University); Richang Hong (Hefei University of Technology); Zheng Qin (Hunan University)

 

795 - Pose-native Network Architecture Search for Multi-person Human Pose Estimation

Qian Bao (AI Research of JD.com); Wu Liu (AI Research of JD.com)*; Jun Hong (AI Research of JD.com); Lingyu Duan (Peking University); Tao Mei (AI Research of JD.com)

 

798 - ASTA-Net: Adaptive Spatio-Temporal Attention Network for Person Re-Identification in Videos

Xierong Zhu (University of Science and Technology of China)*; Jiawei Liu (University of Science and Technology of China); Haoze Wu (University of Science and Technology of China); Meng Wang (Hefei University of Technology); Zheng-Jun Zha (University of Science and Technology of China)

 

801 - Talking Face Generation with Expression-Tailored Generative Adversarial Network

Dan Zeng (Shanghai University); Han Liu (Shanghai University); Hui Lin (); Shiming Ge (Chinese Academy of Sciences)*

 

808 - Blind Natural Video Quality Prediction via Statistical Temporal Features and Deep Spatial Features

Jari Korhonen (Shenzhen University)*; Yicheng Su (Shenzhen University); Junyong You (Norwegian Research Centre)

 

812 - Cross-Modal Omni Interaction Modeling for Phrase Grounding

"Tianyu Yu (Beihang University)*; Tianrui Hui (Institute of Information Engineering, Chinese Academy of Sciences); Zhihao Yu (Beihang University); Yue Liao (Beihang University); si liu (Beihang University); Sansi Yu (Tencent); Faxi Zhang (Tencent)"

 

814 - Cascade Grouped Attention Network for Referring Expression Segmentation

"Gen Luo (Xiamen University); Rongrong Ji (Xiamen University, China)*; Yiyi Zhou (Xiamen University); Xiaoshuai Sun ( Xiamen University); Jinsong Su (Xiamen University); Chia-Wen Lin (National Tsing Hua University); Qi Tian (Huawei Cloud & AI)"

 

816 - Temporally Guided Music-to-Body-Movement Generation

Hsuan-Kai Kao (Academia Sinica); Li Su (Academia Sinica)*

 

818 - Compositional Few-Shot Recognition with Primitive Discovery and Enhancing

Yixiong Zou (Peking University)*; Shanghang Zhang (UC Berkeley); Ke Chen (South China University of Technology); José M. F. Moura (Carnegie Mellon University); Yaowei Wang (PengCheng Laboratory); Yonghong Tian (Peking University)

 

820 - Language-Aware Fine-Grained Object Representation for Referring Expression Comprehension

Heqian Qiu (University of Electronic Science and Technology of China)*; Hongliang Li (University of Electronic Science and Technology of China); Qingbo Wu (University of Electronic Science and Technology of China); Fanman Meng (University of Electronic Science and Technology of China); Hengcan Shi (University of Electronic Science and Technology of China); Taijin Zhao (University of Electronic Science and Technology of China); King Ngi Ngan (University of Electronic Science and Technology of China)

 

821 - Bridging the Web Data and Fine-Grained Visual Recognition via Alleviating Label Noise and Domain Mismatch

Yazhou Yao (Nanjing University of Science and Technology)*; Xian-Sheng Hua (Alibaba Group); Guanyu Gao (Nanjing University of Science and Technology); Zeren Sun (Nanjing University of Science and Technology); Zhibin Li (University of Technology Sydney); Jian Zhang (UTS)

 

823 - March on Data Imperfections: Domain Division and Domain Generalization for Semantic Segmentation

Hai Xu (University of Science and Technology of China); Hongtao Xie (University of Science and Technology of China)*; Zheng-Jun Zha (University of Science and Technology of China); Sun-Ao Liu (University of Science and Technology of China); Yongdong Zhang (University of Science and Technology of China)

 

825 - Aesthetic-Aware Image Style Transfer

Zhiyuan Hu (Tsinghua University); Jia Jia (Tsinghua University)*; Bei Liu (Microsoft Research); Yaohua Bu (Tsinghua University); Jianlong Fu (Microsoft Research)

 

830 - InteractGAN: Learning to Generate Human-Object Interaction

"Chen Gao (Institute of Information Engineering, CAS)*; si liu (Beihang University); Defa Zhu (Institute of Information Engineering, CAS); Quan Liu (Beihang University); Jie Cao (Institute of Automation, Chinese Academy of Sciences); Haoqian He (Beihang University); Ran He (Institute of Automation, Chinese Academy of Sciences); Shuicheng Yan (YITU Tech)"

 

831 - Is Depth Really Necessary for Salient Object Detection?

Jiawei Zhao (Beihang University); Yifan Zhao (Beihang University); Jia Li (Beihang University)*; Xiaowu Chen (Beihang University)

 

836 - Zero-Shot Multi-View Indoor Localization via Graph Location Networks

Meng-Jiun Chiou (National University of Singapore)*; Zhenguang Liu (Zhejiang Gongshang University); Yifang Yin (National University of Singapore); An-An Liu (Tianjin University); Roger Zimmermann (NUS)

 

845 - Self-Play Reinforcement Learning for Fast Image Retargeting

Nobukatsu Kajiura (The University of Tokyo)*; Satoshi Kosugi (The University of Tokyo); Xueting Wang (The University of Tokyo); Toshihiko Yamasaki (The University of Tokyo)

 

849 - Brain-media: A dual conditioned and lateralization supported GAN (DCLS-GAN) towards visualization of image-evoked brain activities

Ahmed Fares (Shenzhen University); Sheng-hua Zhong (Shenzhen University); Jianmin Jiang (Shenzhen University)*

 

850 - Hierarchical Scene Graph Encoder-Decoder for Image Paragraph Captioning

XU YANG (Nanyang Technological University)*; Chongyang Gao (Dartmouth College); Hanwang Zhang (Nanyang Technological University); Jianfei Cai (Monash University)

 

854 - Deep Concept-wise Temporal Convolutional Networks for Action Localization

"Xin Li (Baidu); Tianwei Lin (Baidu)*; Xiao Liu (Baidu); Wangmeng Zuo (Harbin Institute of Technology, China); Chao Li (Baidu); Xiang Long (Baidu); Dongliang He (Baidu); Fu Li (Baidu); Shilei Wen (Baidu Research); Chuang Gan (MIT-IBM Watson AI Lab)"

 

859 - Gait Recognition with Multiple-Temporal-Scale 3D Convolutional Neural Network

BeiBei Lin (Beijing Jiaotong University); Shunli Zhang (Beijing Jiaotong University)*; Feng Bao (Beijing Jiaotong University)

 

867 - Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos

"Jie Wu (Sun Yat-sen University)*; Guanbin Li (Sun Yat-sen University); Xiaoguang Han (Shenzhen Research Institute of Big Data, the Chinese University of Hong Kong (Shenzhen)); Liang Lin (DarkMatter AI)"

 

876 - Traffic-Aware Multi-Camera Tracking of Vehicles Based on ReID and Camera Link Model

Hung-Min Hsu (UW)*; Yizhou Wang (University of Washington); Jenq-Neng Hwang (University of Washington)

 

887 - Hierarchical Gumbel Attention Network for Text-based Person Search

Kecheng Zheng (University of Science and Technology of China); Wu Liu (AI Research of JD.com)*; Jiawei Liu (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China); Tao Mei (AI Research of JD.com)

 

892 - Mesh Guided One-shot Face Reenactment Using Graph Convolutional Networks

Guangming Yao (NetEase Fuxi AI Lab)*; Yi Yuan (NetEase Fuxi AI Lab); Tianjia Shao (Zhejiang University); Kun Zhou (Zhejiang University)

 

893 - VONAS: Network Design in Visual Odometry using Neural Architecture Search

"Xing Cai (Peking University); Lanqing Zhang (Peking University); Chengyuan Li (Peking University); Ge Li (SECE, Shenzhen Graduate School, Peking University); Thomas H Li (Advanced Institute of Information Technology, Peking University)*"

 

910 - "A Tightly-coupled Semantic SLAM System with Visual, Inertial and Surround-view Sensors for Autonomous Indoor Parking"

"Xuan Shao (Tongji University); Lin Zhang (Tongji University, China)*; Tianjun Zhang (Tongji University); Ying Shen (Tongji University); Hongyu Li (tongdun); Yicong Zhou (University of Macau)"

 

911 - Controllable Continuous Gaze Redirection

"Weihao Xia (Tsinghua University)*; Yujiu Yang (Tsinghua University); Jing-Hao Xue (University College London); Wensen Feng (College of Computer Science & Software Engineering, Shenzhen University)"

 

915 - "Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning"

Ying Cheng (Fudan University)*; Ruize Wang (Fudan University); Zhihao Pan (Fudan University); Rui Feng (Fudan University); Yuejie Zhang (Fudan University)

 

918 - SRHEN: Stepwise-Refining Homography Estimation Network via Parsing Geometric Correspondences in Deep Latent Space

"Yi Li (Harbin Institute of Technology (Shenzhen)); Wenjie Pei (Harbin Institute of Technology, Shenzhen); Zhenyu He (Harbin Institute of Technology (Shenzhen); Peng Cheng Laboratory)*"

 

921 - Category-specific Semantic Coherency Learning for Fine-grained Image Recognition

Shijie Wang (Dalian University of Technology); zhihui wang (Dalian University of Technology); Haojie Li (Dalian University of Technology)*; Wanli Ouyang (The University of Sydney)

 

922 - Preserving Global and Local Temporal Consistency for Arbitrary Video Style Transfer

Xinxiao Wu (Beijing Institute of Technology)*; Jialu Chen (Beijing Institute of Technology)

 

925 - Deep Shapely Portraits

"Qinjie Xiao (Zhejiang University)*; Xiangjun Tang (Zhejiang University); Leyang Jin (The Chinese University of Hong Kong, Shenzhen); Yu Wu (Zhejiang University); Yongliang Yang (University of Bath); Xiaogang Jin (Zhejiang University)"

 

927 - Depth Super-Resolution via Deep Controllable Slicing Network

Xinchen Ye (Dalian University of Technology)*; Baoli Sun (Dalian University of Technology); zhihui wang (Dalian University of Technology); Jingyu Yang (Tianjin University); Rui Xu (Dalian University of Technology); Haojie Li (Dalian University of Technology); Baopu Li (Baidu Research (USA))

 

931 - Efficient Joint Gradient Based Attack Against SOR Defense for 3D Point Cloud Classification

"Chengcheng Ma (Institute of Automation, Chinese Academy of Sciences)*; Weiliang Meng (Institute of Automation, Chinese Academy of Sciences); Baoyuan Wu (Tencent AI Lab); Shibiao Xu (Institute of Automation, Chinese Academy of Sciences); Xiaopeng Zhang (Institute of Automation, Chinese Academy of Sciences)"

 

933 - Discrete Haze Level Dehazing Network

Xiao-Feng Cong (Anhui University); Jie Gui (Umich); Kai-Chao Miao (Anhui Meteorological Bureau); Jun Zhang (Anhui University)*; Bing Wang (Anhui University of Technology); Peng Chen (Anhui University)

 

942 - Improving Intra- and Inter-Modality Visual Relation for Image Captioning

"Yong Wang (Aerospace Information Research Institute, Chinese Academy of Sciences;University of Chinese Academy of Sciences)*; WenKai Zhang (Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China); Qing Liu (Aerospace Information research Institute, Chinese Academy of Sciences); Zhengyuan Zhang (Aerospace Information Research Institute, Chinese Academy of Sciences); Xin Gao (Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China); Xian Sun (IECAS)"

 

950 - Dual Context-Aware Refinement Network for Person Search

Jiawei Liu (University of Science and Technology of China)*; Zheng-Jun Zha (University of Science and Technology of China); Richang Hong (Hefei University of Technology); Meng Wang (Hefei University of Technology); Yongdong Zhang (University of Science and Technology of China)

 

951 - Exploring Language Prior for Mode-Sensitive Visual Attention Modeling

"Xiaoshuai Sun ( Xiamen University)*; Xuying Zhang (Xiamen University); Liujuan Cao (Xiamen University); Yongjian Wu (Tencent Technology (Shanghai) Co.,Ltd); Feiyue Huang (Tencent); Rongrong Ji (Xiamen University, China)"

 

961 - Poet: Product-oriented Video Captioner for E-commerce

"Shengyu Zhang (Zhejiang University)*; Ziqi Tan (Zhejiang University); Jin Yu (Alibaba Group); Zhou Zhao (Zhejiang University); Kun Kuang (Zhejiang University); jie liu (Alibaba); Jingren Zhou (Alibaba Group); Hongxia Yang (Alibaba Group); Fei Wu (Zhejiang University, China)"

 

966 - Building Movie Map - A Tool for Exploring in a City - and its Evaluations

Naoki Sugimoto (University of Tokyo)*; Yoshihito Ebine (VTEC Laboratories Inc.); Kiyoharu Aizawa (The University of Tokyo)

 

970 - Searching Privately by Imperceptible Lying: A Novel Private Hashing Method with Differential Privacy

Yimu Wang (Nanjing University)*; Shiyin Lu (Nanjing University); Lijun Zhang (Nanjing University)

 

976 - Beyond the Attention: Distinguish the Discriminative and Confusable Features For Fine-grained Image Classification

"Xiruo Shi (Beijing University of Posts and Telecommunications ); Liutong Xu (Beijing University of Posts and Telecommunications); Pengfei Wang (School of Computer Science, Beijing University of Posts and Telecommunications); Yuanyuan Gao (Beihang Univeristy); Haifang Jian (Institute of Semiconductors, Chinese Academy of Sciences); Wu Liu (AI Research of JD.com)*"

 

977 - BlockMix: Meta Regularization and Self-Calibrated Inference for Metric-Based Meta-Learning

Hao Tang (Nanjing University of Science and Technology); Zechao Li (Nanjing University of Science and Technology)*; Zhimao Peng (Nanjing University of Science and Technology); Jinhui Tang (Nanjing University of Science and Technology)

 

980 - Structural Semantic Adversarial Active Learning for Image Captioning

"Beichen Zhang (University of Chinese Academy of Sciences)*; liang li (Institute of Computing Technology, Chinese Academy of Sciences); Li Su (University of Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Jincan Deng (Institute of Computing Technology, Chinese Academy of Sciences); Zheng-Jun Zha (University of Science and Technology of China); Qingming Huang (University of Chinese Academy of Sciences)"

 

983 - Topic Adaptation and Prototype Encoding for Few-Shot Visual Storytelling

"Jiacheng Li (Zhejiang University); Siliang Tang (Zhejiang University)*; Juncheng Li (Zhejiang University); Jun Xiao (Zhejiang University); Fei Wu (Zhejiang University, China); Shiliang Pu (Hikvision Research Institute); Yueting Zhuang (Zhejiang University)"

 

988 - Scene-Aware Context Reasoning for Unsupervised Abnormal Event Detection in Videos

"Che Sun (Beijing Institute of Technology); Yunde Jia (Beijing Institute of Technology); Yao Hu (Alibaba Youku Cognitive and Intelligent Lab); Yuwei WU (Beijing Institute of Technology (BIT), China)*"

 

1002 - Active Object Search

Jie Wu (Sun Yat-sen University)*; Tianshui Chen (DarkMatter AI); Lishan Huang (Sun Yat-Sen University); Hefeng Wu (Sun Yat-sen University); Guanbin Li (Sun Yat-sen University); Ling Tian (University of Electronic Science and Technology of China); Liang Lin (DarkMatter AI)

 

1009 - Deep-Modal: Real-Time Impact Sound Synthesis for Arbitrary Shapes

Xutong Jin (Peking University); Sheng Li (Peking University)*; Tianshu Qu (Peking University); Dinesh Manocha (UMD); Guoping Wang (Peking University)

 

1011 - Fine-grained Feature Alignment with Part Perspective Transformation for Vehicle ReID

"Dechao Meng (vipl,ict,Chinese academic of science)*; Liang Li (Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Xingyu Gao (Chinese Academy of Sciences); Zheng-Jun Zha (University of Science and Technology of China); Qingming Huang (University of Chinese Academy of Sciences)"

 

1013 - Deep Heterogeneous Multi-Task Metric Learning for Visual Recognition and Retrieval

"Shikang Gan (Nanyang Technological University); YONG LUO (Nanyang Technological University)*; Yonggang Wen (Nanyang Technological University); Tongliang Liu (The University of Sydney); Han Hu (Beijing Institute of Technology, China)"

 

1016 - HOSE-Net: Higher Order Structure Embedded Network for Scene Graph Generation

Meng Wei (Graduate School at Shenzhen, Tsinghua University)*; Chun Yuan (Graduate School at Shenzhen, Tsinghua University); Xiaoyu Yue (SenseTime); Kuo Zhong (Graduate School at Shenzhen, Tsinghua University)

 

1022 - ICECAP: Information Concentrated Entity-aware Image Captioning

Anwen Hu (Renmin University of China)*; Shizhe Chen (Renmin University of China); Qin Jin (Renmin University of China)

 

1035 - Transformer-based Label Set Generation for Multi-modal Multi-label Emotion Detection

xincheng Ju (Soochow University)*; Dong Zhang (Soochow University); Junhui Li (Soochow University); Zhou Guodong (Soochow University)

 

1038 - Beyond the Parts: Learning Multi-view Cross-part Correlation for Vehicle Re-identification

Xinchen Liu (AI Research of JD.com); Wu Liu (AI Research of JD.com)*; Jinkai Zheng (Hangzhou Dianzi University); Chenggang Yan (Hangzhou Dianzi University); Tao Mei (AI Research of JD.com)

 

1039 - Semi-supervised Multi-modal Emotion Recognition with Cross-Modal Distribution Matching

Jingjun Liang (Renmin University of China); Ruichen Li (Renmin University of China); Qin Jin (Renmin University of China)*

 

1043 - Attacking Image Captioning Towards Accuracy-Preserving Target Words Removal

"Jiayi Ji (Xiamen University)*; Rongrong Ji (Xiamen University, China); Yiyi Zhou (Xiamen University); Xiaoshuai Sun ( Xiamen University); Fuhai Chen (Xiamen University); Jianzhuang Liu (Huawei Noah's Ark Lab); Qi Tian (Huawei Cloud & AI)"

 

1046 - Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization

Haoming Xu (South China University of Technology); Runhao Zeng (South China University of Technology); Qingyao Wu (South China University of Technology); Mingkui Tan (South China University of Technology)*; Chuang Gan (MIT-IBM Watson AI Lab)

 

1064 - "Look, Read and Feel: Benchmarking Ads Understanding with Multimodal Multitask Learning"

"Huaizheng Zhang (Nanyang Technological University)*; YONG LUO (Nanyang Technological University); Qiming Ai (Nanyang Technological University); Han Hu (Beijing Institute of Technology, China); Yonggang Wen (Nanyang Technological University)"

 

1068 - Dual Semantic Fusion Network for Video Object Detection

"Lijian Lin (Xiamen University); Haosheng Chen (Xiamen University); Honglun Zhang (Applied Research Center, Tencent PCG); Jun Liang (Xiamen University); Yu Li (Tencent ); Ying Shan (Tencent); Hanzi Wang (Xiamen University)*"

 

1073 - Sharp Multiple Instance Learning for DeepFake Video Detection

"Xiaodan Li (Alibaba Group, China); Yining Lang (Alibaba Group); Yuefeng Chen (Alibaba Group)*; Xiaofeng Mao (Alibaba Group); Yuan He (Alibaba Group ); Shuhui Wang (VIPL,ICT,Chinese academic of science); hui xue (Alibaba); Quan Lu (Alibaba Group)"

 

1075 - Light Field Super-resolution via Attention-Guided Fusion of Hybrid Lenses

"Jing Jin (City University of Hong Kong); Junhui Hou (City University of Hong Kong, Hong Kong)*; Jie Chen (Hong Kong Baptist University); Sam Kwong (City Univeristy of Hong Kong); Jingyi Yu (Shanghai Tech University)"

 

1077 - Learning to Detect Specular Highlights from Real-world Images

"Gang Fu (Wuhan University)*; Qing Zhang ( Sun Yat-sen University); Qifeng Lin (School of Computer Science, Wuhan University); Lei Zhu (The Chinese University of Hong Kong); Chunxia Xiao (Wuhan University)"

 

1080 - Video Super-Resolution using Multi-scale Pyramid 3D Convolutional Networks

Jianping Luo (Shenzhen University); Shaofei Huang (Shenzhen University); yuan yuan (Shenzhen University)*

 

1085 - Tactile Sketch Saliency

Jianbo Jiao (University of Oxford); Ying Cao (City University of Hong Kong)*; Manfred Lau (City University of Hong Kong); Rynson W.H. Lau (City University of Hong Kong)

 

1098 - Who You Are Decides How You Tell

"Shuang Wu (National University of Singapore)*; Shaojing Fan (National University of Singapore); Zhiqi Shen (National University of Singapore); Mohan Kankanhalli (National University of Singapore,); Anthony Tung (NUS)"

 

1112 - PCA-SRGAN: Incremental Orthogonal Projection Discrimination for Face Super-resolution

"Hao Dou (Institude Of Automation,chinese Academy Of Sciences; University of Chinese Academy of Sciences)*; Chen Chen (The Chinese academy of science); Xiyuan Hu (School of Computer Science and Engineering, Nanjing University of Science and Technology); zuxing Xuan (Beijing Union University); Zhisen Hu (Beijing University of Posts and Telecommunications); Silong Peng (The Chinese academy of science)"

 

1122 - PersonalitySensing: A Multi-View Multi-Task Learning Approach for Personality Detection based on Smartphone Usage

Songcheng Gao (Nanjing University); Wenzhong Li (Nanjing University)*; Lynda J. Song (University of Leeds); Xiao Zhang (Shandong University); Mingkai Lin (Nanjing University); Sanglu Lu (NJU)

 

1132 - Exploring Font-independent Features for Scene Text Recognition

Yizhi Wang (Peking University)*; Zhouhui Lian (Peking University)

 

1133 - Context-aware Feature Generation For Zero-shot Semantic Segmentation

Zhangxuan Gu (Shanghai Jiao Tong University); Siyuan Zhou (Shanghai Jiao Tong University); Li Niu (Shanghai Jiao Tong University)*; Zihan Zhao (Shanghai Jiao Tong University); Liqing Zhang (Shanghai Jiao Tong University)

 

1141 - Gray2ColorNet: Transfer More Colors from Reference Image

"Peng Lu (Beijing University of Posts and Telecommunications)*; Jinbei Yu (Beijing University of Posts and Telecommunications); Xujun Peng (Information Sciences Institute, University of Southern California); Zhaoran Zhao (Beijing University of Posts and Telecommunications); Xiaojie Wang (Beijing University of Posts and Telecommunications)"

 

1147 - Compact Bilinear Augmented Query Structured Attention for Sport Highlights Classification

Yanbin Hao (City University of Hong Kong); Hao Zhang (City University of Hong Kong)*; Chong-Wah Ngo (City University of Hong Kong); Qiang Liu (DeepAIT (Hong Kong) Limited); Xiaojun Hu (DeepAIT (Hong Kong) Limited)

 

1152 - Leverage Social Media for Personalized Stress Detection

Xin Wang (Tsinghua University)*; Huijun Zhang (Tsinghua university); Lei Cao (Tsinghua university); Ling Feng (Tsinghua university)

 

1157 - Towards Clustering-friendly Representations: Subspace Clustering via Graph Filtering

Zhengrui Ma (University of Electronic Science and Technology); Zhao Kang (University of Electronic Science and Technology of China)*; Guangchun Luo (University of Electronic Science and Technology of China); Ling Tian (University of Electronic Science and Technology of China); Wenyu Chen (University of Electronic Science and Technology of China)

 

1173 - Heterogeneous Fusion of Semantic and Collaborative Information for Visually-Aware Food Recommendation

Lei Meng (National University of Singapore)*; Xiangnan He (University of Science and Technology of China); Fuli Feng (National University of Singapore); Xiaoyan Gao (Beijing Institute of Technology ); Tat-Seng Chua (National university of Singapore)

 

1180 - KTN: Knowledge Transfer Network for Multi-person DensePose Estimation

xuanhan wang (University of Electronic Science and Technology of China)*; Lianli Gao (The University of Electronic Science and Technology of China); Jingkuan Song (UESTC); Heng Tao Shen (University of Electronic Science and Technology of China (UESTC))

 

1189 - ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection

"Ye Liu (Wuhan University)*; Junsong Yuan (""State University of New York at Buffalo, USA""); Chang Wen Chen (The Chinese University of Hong Kong and University at Buffalo)"

 

1195 - Semantic Image Analogy with a Conditional Single-Image GAN

Jiacheng Li (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China)*; Dong Liu (University of Science and Technology of China); Xuejin Chen (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China)

 

1196 - Trajectory Prediction in Heterogeneous Environment via Attended Ecology Embedding

"Wei-Cheng Lai (National Chiao Tung University); Zi-Xiang Xia (National Chiao Tung University); Hao-Siang Lin (National Chiao Tung University); Lien-Feng Hsu (National Chiao Tung University); Hong-Han Shuai (National Chiao Tung University); I-Hong Jhuo (IBM); Wen-Huang Cheng (EE, NCTU)*"

 

1198 - A Unified Framework for Detecting Audio Adversarial Examples

"Xia Du (University of Macau); Chi-Man Pun (University of Macau)*; Zheng Zhang (Harbin Institute of Technology, Shenzhen)"

 

1200 - Defending Adversarial Examples via DNN Bottleneck Reinforcement

Wenqing Liu (Tongji University); Miaojing Shi (King's College London); Teddy Furon (Inria); Li Li (Tongji University)*

 

1205 - AU-assisted Graph Attention Convolutional Network for Micro-Expression Recognition

"Hong-Xia Xie (National Chiao Tung University)*; Ling Lo ( National Chiao Tung University); Hong-Han Shuai (National Chiao Tung University); Wen-Huang Cheng (EE, NCTU)"

 

1209 - Revealing True Identity: Detecting Makeup Attacks in Face-based Biometric Systems

Mohammad Amin Arab (Simon Fraser University)*; Puria Azadi Moghadam (Simon Fraser University); Mohamed Hussein (USC/ISI); Wael Abd-Almageed (Information Sciences Institute); Mohamed Hefeeda (Simon Fraser University)

 

1214 - A Structured Graph Attention Network for Vehicle Re-Identification

Yangchun Zhu (University of Science and Technology of China)*; Zheng-Jun Zha (University of Science and Technology of China); Tianzhu Zhang (University of Science and Technology of China); Jiawei Liu (University of Science and Technology of China); Jiebo Luo (U. Rochester)

 

1217 - Arbitrary Style Transfer via Multi-Adaptation Network

"Yingying Deng (Institute of Automation£¬Chinese Academy of Sciences); Fan Tang (Fosafer); Weiming Dong (NLPR, Institute of Automation, Chinese Academy of Sciences)*; Wen Sun (University of Chinese Academy of Sciences); Feiyue Huang (Tencent); Changsheng Xu (CASIA)"

 

1224 - Scoring High: Analysis and Prediction of Viewer Behavior and Engagement in the Context of 2018 FIFA WC Live Streaming

Nikolas Wehner (University of Würzburg)*; Michael Seufert (University of Würzburg); Sebastian Egger-Lampl (AIT Austrian Institute of Technology GmbH); Bruno Gardlo (AIT Austrian Institute of Technology GmbH); Pedro Casas (AIT Austrian Institute of Technology GmbH); Raimund Schatz (AIT)

 

1226 - Weakly-Supervised Video Object Grounding by Exploring Spatio-Temporal Contexts

Xun Yang (National University of Singapore)*; Xueliang liu (Hefei University of Technology); Meng Jian (Beijing University of Technology); Xinjian Gao (Hefei University of Technology); Meng Wang (Hefei University of Technology)

 

1228 - S^2SiamFC: Self-supervised Fully Convolutional Siamese Network for Visual Tracking

"Chon Hou Sio (National Chiao Tung University); Yu-Jen Ma (National Chiao Tung University); Hong-Han Shuai (National Chiao Tung University)*; Jun-Cheng Chen (Academia Sinica); Wen-Huang Cheng (EE, NCTU)"

 

1263 - Learnable Optimal Sequential Grouping for Video Scene Detection

Daniel Rotman (IBM Research)*; Yevgeny Yaroker (IBM Research); Elad Amrani (IBM / Technion); Udi Barzelay (IBM); Rami Ben-Ari (IBM-Research)

 

1266 - Dual-view Attention Networks for Single Image Super-Resolution

Jingcai Guo (The Hong Kong Polytechnic University)*; Shiheng Ma (Shanghai Jiao Tong University); Jie Zhang (The Hong Kong Polytechnic University); Qihua Zhou (The Hong Kong Polytechnic University); Song Guo (The Hong Kong Polytechnic University)

 

1269 - Activity-driven Weakly-Supervised Spatio-Temporal Grounding from Untrimmed Videos

Junwen Chen (Rochester Institute of Technology); Wentao Bao (Rochester Institute of Technology); Yu Kong (Rochester Institute of Technology)*

 

1275 - Text-Guided Neural Image Inpainting

"Lisai Zhang (Harbin Institute of Technology, Shenzhen)*; Qingcai Chen ( Harbin Institute of Technology, Shenzhen); Baotian Hu (Harbin Institute of Technology, Shenzhen); Shuoran Jiang (Harbin Institute of Technology, Shenzhen)"

 

1283 - One-shot Scene Graph Generation

Yuyu Guo (UESTC); Jingkuan Song (UESTC)*; Lianli Gao (The University of Electronic Science and Technology of China); Heng Tao Shen (University of Electronic Science and Technology of China (UESTC))

 

1285 - NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination

Penghao Zhou (Tencent Youtu Lab)*; Chong Zhou (Tencent Youtu Lab); Pai Peng (Tencent Youtu Lab); Junlong Du (Tencent Youtu Lab); Xing Sun (Tencent); Xiaowei Guo (Tencent Youtu Lab); Feiyue Huang (Tencent)

 

1286 - Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

"Zhaobo Qi (University of Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science)*; Chi Su (Kingsoft Cloud); Li Su (University of Chinese Academy of Sciences); Weigang Zhang (Harbin Institute of Technology, Weihai); Qingming Huang (University of Chinese Academy of Sciences)"

 

1298 - A probabilistic graphical model for analyzing the subjective visual quality assessment data from crowdsourcing

"Jing Li (Alibaba Group)*; Suiyi Ling (University of Nantes); Junle Wang (Tencent); Patrick Le Callet (""Universite de Nantes, France"")"

 

1300 - DFEW: A Large-Scale Database for Recognizing Dynamic Facial Expressions in the Wild

Xingxun Jiang (Southeast University)*; Yuan Zong (Southeast University); Wenming Zheng (Southeast University); Chuangao Tang (Southeast University); WanChuang Xia (Southeast University); Cheng Lu (Southeast University); Jiateng Liu (Southeast University)

 

1303 - Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion

Yikai Wang (Tsinghua University); Fuchun Sun (Tsinghua University); Ming Lu (Intel Labs China); Anbang Yao (Intel Labs China)*

 

1304 - Dual-Gradients Localization framework for Weakly Supervised Object Localization

Chuangchuang Tan (Beijing Jiaotong University); Guanghua Gu (Yanshan University); Tao Ruan (Beijing Jiaotong University); Shikui Wei (Beijing Jiaotong University); Yao Zhao (Beijing Jiaotong University)*

 

1307 - DualLip: A System for Joint Lip Reading and Generation

Weicong Chen (Tsinghua University); Xu Tan (Microsoft Research Asia); Yingce Xia (Microsoft Research Asia); Tao Qin (Microsoft Research Asia); Yu Wang (Tsinghua University)*; Tieyan Liu (Microsoft Research)

 

1308 - Crossing You in Style: Cross-modal Style Transfer from Music to Visual Arts

Cheng-Che Lee (MediaTek); Wan-Yi Lin (National Tsing-Hua University); Yen-Ting Shih (National Tsing-Hua University); Pei-Yi (Patricia) Kuo (National Tsing-Hua University); Li Su (Academia Sinica)*

 

1314 - Single Image Shape-from-Silhouettes

Yawen Lu (Rochester Institute of Technology); Yuxing Wang (Rochester Institute of Technology)*; Guoyu Lu (Rochester Institute of Technology)

 

1319 - Weakly-supervised Image Hashing through Masked Visual Semantic Graph Reasoning

Lu Jin (Nanjing University of Science and Technology ); Zechao Li (Nanjing University of Science and Technology)*; Yonghua Pan (Nanjing University of Science and Technology); Jinhui Tang (Nanjing University of Science and Technology)

 

1320 - "Look, Listen and Infer"

Ruijian Jia (Xi'an Jiaotong University); Xinsheng Wang (Xi'an Jiaotong University); Shanmin Pang (Xi'an Jiaotong University)*; Jihua Zhu (Xi'an Jiaotong University); Jianru Xue (Xi'an Jiaotong University)

 

1324 - How to Learn Item Representation for Cold-Start Multimedia Recommendation?

Xiaoyu Du (National University of Singapore)*; Xiang Wang (National University of Singapore); Xiangnan He (University of Science and Technology of China); Zechao Li (Nanjing University of Science and Technology); Jinhui Tang (Nanjing University of Science and Technology); Tat-Seng Chua (National university of Singapore)

 

1326 - Dual Attention GANs for Semantic Image Synthesis

Hao Tang (University of Trento)*; Song Bai (University of Oxford); Nicu Sebe (University of Trento)

 

1327 - MRI Measurement Matrix Learning via Correlation Reweighting

"Zhongnian Li (Nanjing University of Aeronautics and Astronautics,China); Tao Zhang (Nanjing University of Aeronautics and Astronautics,China); Ruoyu Chen (Nanjing University of Aeronautics and Astronautics); Daoqiang Zhang (Nanjing University of Aeronautics and Astronautics, China)*"

 

1329 - SimSwap: An Efficient Framework For High Fidelity Face Swapping

Renwang Chen (Shanghai Jiaotong University); Xuanhong Chen (Shanghai Jiao Tong University); Bingbing Ni (Shanghai Jiao Tong University)*; Yanhao Ge (Tencent)

 

1344 - Semantic Consistency Guided Instance Feature Alignment for 2D Image-Based 3D Shape Retrieval

"Heyu Zhou (Tianjin University, China); Weizhi Nie (Tianjin University)*; Dan Song (Tianjin University); Nian Hu (Tianjin University); Xuanya Li (Baidu); An-An Liu (Tianjin University)"

 

1347 - Performance over Random: A robust evaluation protocol for video summarization methods

"Evlampios Apostolidis (QMUL & CERTH-ITI)*; Eleni Adamantidou (CERTH); Alexandros I Metsai (CERTH-ITI); Vasileios Mezaris (Information Technologies Institute, Centre for Research and Technology Hellas, Greece); Ioannis Patras (Queen Mary University of London)"

 

1355 - ARSketch: Sketch-Based User Interface for Augmented Reality Glasses

"Zhaohui Zhang (Rokid); Haichao Zhu (The Chinese University of Hong Kong)*; Qian Zhang (California University, Los Angeles)"

 

1356 - Self-Mimic Learning for Small-scale Pedestrian Detection

"Jialian Wu (State University of New York at Buffalo)*; CHUNLUAN ZHOU (Wormpex AI Research); Qian Zhang (Horizon Robotics); Ming Yang (Horizon Robotics); Junsong Yuan (""State University of New York at Buffalo, USA"")"

 

1359 - Action2Motion: Conditioned Generation of 3D Human Motions

"Chuan Guo (University of Alberta)*; Xinxin Zuo (University of Alberta); Sen Wang (University of Alberta); Shihao Zou (University of Alberta); Qingyao Sun (University of Chicago); Annan Deng (Yale University); Minglun Gong (University of Guelph); Li Cheng (ECE dept., University of Alberta)"

 

1363 - ChefGAN: Food Image Generation from Recipes

Siyuan Pan (Shanghai Jiao Tong University)*; Ling Dai (Shanghai Jiao Tong University); Xuhong Hou (Shanghai Jiao Tong University); Huating Li (Shanghai Jiao Tong University); Bin Sheng (Shanghai Jiao Tong University)

 

1365 - Skin Textural Generation via Blue-noise Gabor Filtering based Generative Adversarial Network

HUI ZHANG (The University of Hong Kong)*; Chuan Wang (Face++ (Megvii)); Nenglun Chen (The University of Hong Kong); Wenping Wang (The University of Hong Kong); jue wang (Megvii Technology)

 

1367 - Text-Embedded Bilinear Model for Fine-Grained Visual Recognition

Liang Sun (University of Electronic Science and Technology of China); Xiang Guan (University of Electronic Science and Technology of China); Yang Yang (University of Electronic Science and Technology of China)*; Lei Zhang (Chongqing University)

 

1369 - VVSec: Securing Volumetric Video Streaming via Benign Use of Adversarial Perturbation

Zhongze Tang (Rutgers University)*; Xianglong Feng (Rutgers University); Yi Xie (Rutgers University); Huy Phan (Rutgers University); Tian Guo (Worcester Polytechnic Institute); bo yuan (rutgers university); Sheng Wei (Rutgers University - New Brunswick)

 

1373 - Personalized Item Recommendation for Second-hand Trading Platform

Xuzheng Yu (Shandong University)*; Tian Gan (Shandong University); Yinwei Wei (Shandong University); Zhiyong Cheng (Shandong Academy of Sciences); Liqiang Nie (Shandong University )

 

1374 - A Slow-I-Fast-P Architecture for Compressed Video Action Recognition

Jiapeng Li (Xi'an Jiaotong University); Ping Wei (Xi'an Jiaotong University)*; Yongchi Zhang (Xi'an Jiaotong University); Nanning Zheng (Xi'an Jiaotong University)

 

1384 - Learning Scales from Points: A Scale-aware Probabilistic Model for Crowd Counting

Zhiheng Ma (Xi'an Jiaotong University)*; Xing Wei (Xi'an Jiaotong University); Xiaopeng Hong (Xi'an Jiaotong University); Yihong Gong (Xi'an Jiaotong University)

 

1391 - Modeling Caricature Expressions by 3D Blendshape and Dynamic Texture

Keyu Chen (University of Science and Technology of China); Juyong Zhang (University of Science and Technology of China)*; Jianfei Cai (Monash University); Jianmin Zheng (Nanyang Technological University)

 

1394 - Learning Global Structure Consistency for Robust Object Tracking

Bi Li (Huazhong University of Science and Technology); Chengquan Zhang (Baidu Inc); Zhibin Hong (Baidu Inc.); Xu Tang (Baidu); jingtuo liu (baidu); Junyu Han (Baidu Inc.); Errui Ding (Baidu Inc.); Wenyu Liu (Huazhong University of Science and Technology)*

 

1396 - DMVOS: Discriminative Matching for Real-time Video Object Segmentation

"Peisong Wen (Nankai University); Ruolin Yang (SenseTime); Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences); Chen Qian (SenseTime); Qingming Huang (University of Chinese Academy of Sciences); Runmin Cong (Beijing Jiaotong University); Jianlou Si ( SenseTime)*"

 

1397 - Multi-Group Multi-Attention: Towards Discriminative Spatiotemporal Representation

Zhensheng Shi (Ocean University of China); Liangjie Cao (Ocean University of China); Cheng Guan (Ocean University of China); Ju Liang (Ocean University of China); Qianqian Li (Ocean University of China); Zhaorui Gu (Ocean University of China); Haiyong Zheng (Ocean University of China)*; Bing Zheng (Ocean University of China)

 

1399 - RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization

Niluthpol c Mithun (SRI International)*; Karan Sikka (SRI International); Han-Pang Chiu (SRI International); Supun Samarasekera (SRI International); Rakesh Kumar (SRI International)

 

1401 - Vaccine-style-net: Point Cloud Completion in Implicit Continuous Function Space

"Wei Yan (Peking university); Ruonan Zhang ( Peng Cheng Laboratory); Jing Wang (Artificial Intelligence Research Center Peng Cheng Laboratory); Shan Liu (Tencent America); Thomas H Li (Advanced Institute of Information Technology, Peking University); Ge Li (SECE, Shenzhen Graduate School, Peking University)*"

 

1417 - Dual Hierarchical Temporal Convolutional Network with QA-Aware Dynamic Normalization for Video Story Question Answering

"Fei Liu (National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences and University of Chinese Academy of Sciences)*; Jing Liu (National Lab of Pattern Recognition, Institute of Automation,Chinese Academy of Sciences); Xinxin Zhu (National Lab of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences); Richang Hong (Hefei University of Technology); Hanqing Lu (NLPR, Institute of Automation, CAS)"

 

1418 - Multimodal Representation with Embedded Visual Guiding Objects for Named Entity Recognition in Social Media Posts

"Zhiwei Wu (School of Software Engineering, South China University of Technology); Changmeng Zheng (South China University of Technology); Yi Cai (School of Software Engineering, South China University of Technology)*; Junying Chen (South China University of Technology); Ho-fung Leung (The Chinese University of Hong Kong); Qing Li (The Hong Kong Polytechnic University)"

 

1421 - Adaptive Wasserstein Hourglass for Weakly Supervised RGB 3D Hand Pose Estimation

Yumeng Zhang (Tsinghua University); Li Chen (Tsinghua University)*; Yufeng Liu (Kuaishou Technology); Wen Zheng (Kuaishou Technology); JunHai Yong (Tsinghua University)

 

1422 - Weakly Supervised Segmentation with Maximum Bipartite Graph Matching

WEIDE LIU (Nanyang Technological University)*; Chi Zhang (Nanyang Technological University); Guosheng Lin (Nanyang Technological University); Tzu-Yi HUNG (Delta Research Center); Chunyan Miao (NTU)

 

1429 - What Aspect Do You Like: Multi-scale Time-aware User Interest Modeling for Micro-video Recommendation

"Hao Jiang (Shandong University)*; Wenjie Wang (National University of Singapore); Yinwei Wei (Shandong University); Zan Gao (1. Shandong AI Institute, QiLU University of Technology, 2. Shandong Computer Science Center(National Supercomputer Center in Jinan), 3. Tianjing University of Technology); Yinglong Wang (Shandong Artificial Intelligence Institute); Liqiang Nie (Shandong University )"

 

1431 - Recognizing Camera Wearer from Hand Gestures in Egocentric Videos

"Daksh Thapar (Indian Institute of Technology, Mandi)*; Chetan Arora (Indian Institute of Technology Delhi); Aditya Nigam (IIT mandi)"

 

1441 - Domain-Specific Alignment Network for Multi-Domain Image-Based 3D Object Retrieval

Yu-ting Su (Tianjin University); Yuqian Li (Tianjin University); Dan Song (Tianjin University)*; Zhendong Mao (University of Science and Technology of China); Xuanya Li (Baidu); An-An Liu (Tianjin University)

 

1443 - Cross-Granularity Learning for Multi-Domain Image-to-Image Translation

Huiyuan Fu (Beijing University of Posts and Telecommunications)*; Ting Yu (Beijing University of Posts and Telecommunications); Xin Wang (Stony Brook University); Huadong Ma (Beijing University of Posts and Telecommunications)

 

1444 - Generalized Zero-Shot Learning using Generated Proxy Unseen Samples and Entropy Separation

Omkar Anil Gune (Indian Institute of Technology Bombay)*; Biplab Banerjee (Indian Institute of Technology Bombay); Subhasis Chaudhuri (Indian Institute of Technology Bombay); Fabio Cuzzolin (Oxford Brookes University)

 

1445 - Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Neural Networks

Negin Ghamsarian (Alpen-Adria University of Klagenfurt); Hadi Amirpourazarian (Alpen-Adria-Universität Klagenfurt); Christian Timmerer (Alpen-Adria-Universität Klagenfurt); Mario Taschwer (Klagenfurt University); Klaus Schöffmann (Klagenfurt University)*

 

1448 - Complementary-View Co-Interest Person Detection

"Ruize Han (College of Intelligence and Computing, Tianjin University); Jiewen Zhao (College of Intelligence and Computing, Tianjin University); Wei Feng (College of Intelligence and Computing, Tianjin University, China)*; Yiyang Gan (College of Intelligence and Computing, Tianjin University); Liang Wan (College of Intelligence and Computing, Tianjin University); Song Wang (University of South Carolina)"

 

1453 - Contextual Multi-Scale Feature Learning for Person Re-Identification

"Baoyu Fan (Inspur Electronic Information Industry Co.,Ltd.); Li Wang (inspur)*; Runze Zhang (Inspur Electronic Information Industry Co.,Ltd.); Zhenhua Guo (Inspur Electronic Information Industry Co.,Ltd.); Yaqian Zhao (Inspur); Rengang Li (Inspur); Weifeng Gong ( Inspur Electronic Information Industry Co.,Ltd.)"

 

1456 - Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor Scene

"Xinke Li (National University of Singapore); Chongshou Li (National University of Singapore)*; Zekun Tong (National University of Singapore); Andrew Lim (National University of Singapore); Junsong Yuan (""State University of New York at Buffalo, USA""); Yuwei Wu (National University of Singapore); Jing Tang (National University of Singapore); Raymond Huang (National University of Singapore)"

 

1458 - Prototype-Matching Graph Network for Heterogeneous Domain Adaptation

Zijian Wang (University of Queensland)*; Yadan Luo (University of Queensland); Zi Huang (University of Queensland); Mahsa Baktashmotlagh (University of Queensland)

 

1459 - VIMES: A Wearable Memory Assistance System for Automatic Information Retrieval

"Carlos Bermejo Fernandez (Hong Kong University of Science and Technology)*; Tristan Braud (HKUST); Ji Yang (The Hong Kong University of Science and Technology); Shayan Mirjafari (Dartmouth College ); Bowen Shi (Hong Kong University of Science and Technology); Yu Xiao (Department of Communications and Networking, Aalto University, Finland); Pan Hui (Hong Kong University of Science and Technology)"

 

1462 - Towards Lighter and Faster: Learning Wavelets Progressively for Image Super-Resolution

"Huanrong Zhang (Sun Yat-Sen University); Zhi Jin (Sun Yat-sen University)*; Xiaojun Tan (Sun Yat-sen University); Xiying Li (Research Center of ITS, Sun Yat-sen University)"

 

1464 - Memory Enhanced Embedding Learning for Cross-Modal Video-Text Retrieval

Rui Zhao (University of Science and Technology of China)*; Kecheng Zheng (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China); Hongtao Xie (University of Science and Technology of China); Jiebo Luo (U. Rochester)

 

1472 - Spatio-Temporal Inception Graph Convolutional Networks for Skeleton-Based Action Recognition

"Zhen Huang (University of Science and Technology of China); Xu Shen (Alibaba Group); Xinmei Tian (USTC)*; Houqiang Li (University of Science and Technology of China); Jianqiang Huang (Alibaba Group); Xian-Sheng Hua (Damo Academy, Alibaba Group)"

 

1473 - Space-Time Video Super-Resolution using Temporal Profiles

Zeyu Xiao (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China)*; Xueyang Fu (University of Science and Technology of China); Dong Liu (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China)

 

1475 - Answer-driven Visual State Estimator for Goal-oriented Visual Dialogue

Zipeng Xu (Beijing University of Posts and Telecommunications)*; Xiaojie Wang (Beijing University of Posts and Telecommunications); Fangxiang Feng (Beijing University of Posts and Telecommunications); Yushu Yang (Meituan-Dianping Group); Huixing Jiang (Meituan-Dianping Group); Zhongyuan Wang (Meituan-Dianping Group)

 

1482 - Dynamic Future Net: Diversified Human Motion Generation

Chen WenHeng (NetEase Fuxi AI Lab)*; He E Wang (Leeds University); Yi Yuan (NetEase Fuxi AI Lab); Tianjia Shao (Zhejiang University); Kun Zhou (Zhejiang University)

 

1491 - ATF: Towards robust face alignment via leveraging similarity and diversity across different datasets

"Xing Lan (University of Chinese Academy of Sciences;Institute of Automation,Chinese Academy of Sciences)*; Qinghao Hu (Institute of Automation, Chinese Academy of Sciences); Fangzhou Xiong (Nanjing Aritificial Intelligence Chip Research, Institute of Automation, Chinese Academy of Sciences;Nanjing University of Science and Technology); Cong Leng ( Institute of Automation,Chinese Academy of Sciences; Nanjing Aritificial Intelligence Chip Research, Institute of Automation, Chinese Academy of Sciences); Jian Cheng (""Chinese Academy of Sciences, China"")"

 

1493 - Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions

Yu-Siang Huang (Academia Sinica)*; Yi-Hsuan Yang (Academia Sinica)

 

1495 - DCNet: Dense Correspondence Neural Network for 6DoF Object Pose Estimation in Occluded Scenes

Zhi Chen (University of Science and Technology of China); Wei Yang (University of Science and Technology of China)*; Zhenbo Xu (University of Science and Technology of China); Xike Xie (University of Science and Technology of China); Liusheng Huang (University of Science and Technology of China)

 

1508 - Dual Gaussian-based Variational Subspace Disentanglement for Visible-Infrared Person Re-Identification

Nan Pu (Leiden University)*; Wei Chen (Leiden University); Yu Liu (KU Leuven); Erwin M. Bakker (Leiden University); Michael S Lew (Leiden University)

 

1517 - Region of Interest Based Graph Convolution: A Heatmap Regression Approach for Action Unit Detection

Zheng Zhang (State University of New York at Binghamton)*; Taoyue Wang (State University of New York at Binghamton); Lijun Yin (State University of New York at Binghamton)

 

1525 - DroidCloud: Scalable High Density Android Cloud Rendering

Linsheng Li (SJTU)*; bin yang (Intel); cathy bao (Intel); shuo liu (Intel); randy xu (Intel); yong yao (Intel); Haghighat Mohammad R (Intel); Jerry W Hu (Intel); Shoumeng Yan (Intel); Zhengwei Qi (SJTU)

 

1534 - Incomplete Cross-modal Retrieval with Dual-Aligned Variational Autoencoders

Mengmeng Jing (University of Electronic Science and Technology of China); Jingjing Li (University of Electronic Science and Technology of China)*; Lei Zhu (Shandong Normal University); Ke Lu (University of Electronic Science and Technology of China); Yang Yang (University of Electronic Science and Technology of China); Zi Huang (University of Queensland)

 

1538 - Transferrable Referring Expression Grounding with Concept Transfer and Context Inheritance

"Xuejing Liu (CAS)*; Liang Li (Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Zheng-Jun Zha (University of Science and Technology of China); Dechao Meng (vipl,ict,Chinese academic of science); Qingming Huang (University of Chinese Academy of Sciences)"

 

1541 - MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis

"Devamanyu Hazarika (NUS, Singapore)*; Roger Zimmermann (NUS); Soujanya Poria (Singapore University of Technology and Design)"

 

1546 - Multimodal Dialogue Systems via Capturing Context-aware Dependencies of Semantic Elements

"Weidong He (University of Science and Technology of China)*; Zhi Li (University of Science and Technology of China); Dongcai Lu (Huawei Cloud BU); Enhong Chen (University of Science and Technology of China); Tong Xu (University of Science and Technology of China); Jing Yuan (Huawei Cloud BU); Baoxing Huai (HUAWEI TECHNOLOGIES CO., LTD.)"

 

1549 - Instability of Successive Deep Image Compression

"Jun-Hyuk Kim (Yonsei University); Soobeom Jang (Yonsei University); Jun-Ho Choi (Yonsei University); Jong-Seok Lee (""Yonsei University, Korea"")*"

 

1551 - Bitrate Requirements of Non-Panoramic VR Remote Rendering

Viktor Kelkkanen (Blekinge Institute of Technology)*; Markus Fiedler (Blekinge Institute of Technology); David Lindero (Ericsson)

 

1555 - Fine-grained Iterative Attention Network for Temporal Language Localization in Videos

Xiaoye Qu (Huazhong University of Science and Technology)*; Pengwei Tang ( Huazhong University of Science and Technology); Zhikang Zou (Huazhong university of science and technology); Yu Cheng (Microsoft); Jianfeng Dong (Zhejiang Gongshang University); Pan Zhou (Huazhong University of Science and Technology); Zichuan Xu (Dalian University of Technology)

 

1562 - EyeShopper: Estimating Shoppers' Gaze using CCTV Cameras

Carlos Bermejo Fernandez (Hong Kong University of Science and Technology)*; Dimitris Chatzopoulos (Hong Kong University of Science and Technology); Pan Hui (Hong Kong University of Science and Technology)

 

1570 - DeepFacePencil: Creating Face Images from Freehand Sketches

Yuhang Li (University of Science and Technology of China); Xuejin Chen (University of Science and Technology of China)*; Binxin Yang (University of Science and Technology of China); Zihan Chen (University of Science and Technology of China); Zhihua Cheng (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China)

 

1572 - Attention Based Dual Branches Fingertip Detection Network and Virtual Key System

Chong Mou (South China University of Technology)*; Xin Zhang (South China University of Technology)

 

1576 - ALANET: Adaptive Latent Attention Network for Joint Video Deblurring and Interpolation

"Akash Gupta (University of California, Riverside)*; Abhishek Aich (University of California, Riverside); Amit K. Roy-Chowdhury (University of California, Riverside)"

 

1578 - Action Completeness Modeling with Background Aware Networks for Weakly-Supervised Temporal Action Localization

Md Moniruzzaman (Stony Brook University)*; Zhaozheng Yin (Stony Brook University); Zhihai He (University of Missouri Columbia); Ruwen Qin (MST); Ming Leu (Missouri University of Science and Technology)

 

1579 - Adversarial Knowledge Transfer from Unlabeled Data

"Akash Gupta (University of California, Riverside)*; Rameswar Panda (MIT-IBM Watson AI Lab); Sujoy Paul (UC Riverside); Jianming Zhang (Adobe Research); Amit K. Roy-Chowdhury (University of California, Riverside)"

 

1592 - Hierarchical Bi-Directional Feature Perception Network for Person Re-Identification

Zhipu Liu (Chongqing University); Lei Zhang (Chongqing University)*; Yang Yang (University of Electronic Science and Technology of China)

 

1595 - CM-BERT: Cross-Modal BERT for Text-Audio Sentiment Analysis

"Kaicheng Yang (Hebei University of Science and Technology); Hua Xu (State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China)*; Kai Gao (Hebei University of Science and Technology)"

 

1598 - Single-Shot Two-Pronged Detector with Rectified IoU Loss

Keyang Wang (Chongqing University); Lei Zhang (Chongqing University)*

 

1601 - Hard Negative Samples Emphasis Tracker without Anchors

Zhongzhou Zhang (Chongqing University)*; Lei Zhang (Chongqing University)

 

1605 - Task Decoupled Knowledge Distillation For Lightweight Face Detectors

"Xiaoqing Liang (University of Chinese Academy of Sciences)*; Xu Zhao (Chinese Academy of Sciences); Chaoyang Zhao (National Laboratory of Pattern Recognition, CASIA); Nanfei Jiang (University of Chinese Academy of Sciences); Ming Tang (Chinese Academy of Sciences, China); Jinqiao Wang (Institute of Automation, Chinese Academy of Sciences)"

 

1606 - Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework

Li Tao (The University of Tokyo)*; Xueting Wang (The University of Tokyo); Toshihiko Yamasaki (The University of Tokyo)

 

1612 - Object-level Attention for Aesthetic Rating Distribution Prediction

"Jingwen Hou (Nanyang Technological University)*; Sheng Yang (Nanyang Technological University); Weisi Lin (Nanyang Technological University, Singapore)"

 

1619 - Memory Recursive Network for Single Image Super-Resolution

"Jie Liu (Nanjing University)*; Minqiang Zou (Department of Computer Science and Technology, Nanjing University); Jie Tang (Nanjing University); Gangshan Wu (Nanjing University)"

 

1623 - A Modular Approach for Synchronized Wireless Multimodal Multisensor Data Acquisition in Highly Dynamic Social Settings

Chirag Raman (Delft University of Technology)*; Stephanie Tan (TU Delft); Hayley Hung (TU Delft)

 

1625 - Scale-aware Progressive Optimization Network

Ying Chen (Sun Yat-sen University)*; Lifeng Huang (Sun Yat-sen University); Chengying Gao (Sun Yat-sen University); Ning Liu (Sun Yat-sen University)

 

1630 - Kalman Filter-based Head Motion Prediction for Cloud-based Mixed Reality

Serhan Gül (Fraunhofer HHI)*; Sebastian Bosse (Fraunhofer HHI); Dimitri Podborski (Fraunhofer HHI); Thomas Schierl (Fraunhofer HHI); Cornelius Hellge (Fraunhofer HHI)

 

1633 - Not made for each other - Audio-Visual Dissonance-based Deepfake Detection and Localization

Komal Chugh (Indian Institute of Technology Ropar); Parul Gupta (Indian Institute of Technology Ropar); Abhinav Dhall (Monash University)*; Ramanathan Subramanian (Indian Institute of Technology Ropar)

 

1637 - Resource Efficient Domain Adaptation

"Junguang Jiang (Tsinghua University); Ximei Wang (Tsinghua University); Mingsheng Long (Tsinghua University)*; Jianmin Wang (Tsinghua University, China)"

 

1653 - A Multi-update Deep Reinforcement Learning Algorithm for Edge Computing Service Offloading

Hao Hao (Beijing University of Posts and Telecommunications); Changqiao Xu (Beijing University of Posts and Telecommunications)*; Lujie Zhong (Capital Normal University); Gabriel-Miro Muntean (Dublin City University)

 

1655 - MGAAttack: Toward more query-efficient black-box attack by microbial genetic algorithm

"Lina Wang (Computer School of Wuhan University, China)*; Kang Yang (Wuhan University); Wenqi Wang (Wuhan University); Run Wang (Nanyang Technological University); Aoshuang Ye (Wuhan University)"

 

1656 - Make your favorite music curative: music style transfer for anxiety reduction

Zhejing Hu (The Hong Kong Polytechnic University); Yan Liu (The Hong Kong Polytechnic University)*; Gong Chen (The Hong Kong Polytechnic University); Sheng-hua Zhong (Shenzhen University); Aiwei Zhang (St. Paul's Co-educational College)

 

1657 - JointFontGAN: Joint Geometry-Content GAN for Font Generation via Few-Shot Learning

Yankun Xi (Wayne State University); Guoli Yan (Wayne State University); Jing Hua (Wayne State University); Zichun Zhong (Wayne State University)*

 

1673 - Enhancing Self-supervised Monocular Depth Estimation via Incorporating Robust Constraints

Rui Li (Northwestern Polytechnical University)*; Xiantuo He (Northwestern Polytechnical University); Yu Zhu (Northwestern Polytechnical University); Xianjun Li (Northwestern Polytechnical University); Jinqiu Sun (Northwestern Polytechnical University); Yanning Zhang (Northwestern Polytechnical University)

 

1675 - DeepRhythm: Exposing DeepFakes with Attentional Visual Heartbeat Rhythms

"Hua Qi (Kyushu University); Qing Guo (Nanyang Technological University)*; Felix Juefei-Xu (Alibaba Group); Xiaofei Xie (Nanyang Technological University); Lei Ma (Kyushu University); Wei Feng (College of Intelligence and Computing, Tianjin University, China); Yang Liu (Nanyang Technological University, Singapore); Jianjun Zhao (Kyushu University)"

 

1678 - Query Twice: Dual Mixture Attention Meta Learning for Video Summarization

Junyan Wang (MeituanDianping Group); Yang Bai (Newcastle University); Yang Long (Durham University); BingZhang Hu (Newcastle University); Zhenhua Chai (MeituanDianping Group)*; Yu Guan (Newcastle University); Xiaolin Wei (MeituanDianping Group)

 

1679 - Deep Multi-modality Soft-decoding of Very Low Bit-rate Face Videos

Yanhui Guo (McMaster University); Xi Zhang (Shanghai Jiao Tong University); Xiaolin Wu (McMaster University)*

 

1685 - Hearing like Seeing: Improving Voice-Face Interactions and Associations via Adversarial Deep Semantic Matching Network

Kai Cheng (Huaqiao University); Xin Liu (Huaqiao University)*; Yiu-ming Cheung (Hong Kong Baptist University); Rui Wang (Huaqiao University); Xing Xu (University of Electronic Science and Technology of China); Bineng Zhong (Huaqiao University)

 

1696 - Multi-modal Attentive Graph Pooling Model for Community Question Answer Matching

Jun Hu (Hefei University of Technology); Quan Fang (Institute of Automation, Chinese Academy of Sciences); Shengsheng Qian (Institute of Automation, Chinese Academy of Sciences); Changsheng Xu (CASIA)*

 

1701 - Towards Viewport-dependent 6DoF 360 Video Tiled Streaming for Virtual Reality Systems

Jong-Beom Jeong (Sungkyunkwan University)*; Soonbin Lee (Sungkyunkwan University); Il-Woong Ryu (Gachon University); Tuan Thanh Le (Gachon University); Eun-Seok Ryu (Sungkyunkwan University)

 

1702 - Concept Drift Detection for Multivariate Data Streams and Temporal Segmentation of Daylong Egocentric Videos

Pravin Nagar (IIIT Delhi)*; Mansi Khemka (Columbia University); Chetan Arora (Indian Institute of Technology Delhi)

 

1706 - A Novel Graph-TCN with a Graph Structured Representation for Micro-expression Recognition

Ling Lei (Southwest University); Jianfeng Li (Southwest University)*; Tong Chen (Southwest University); Shigang Li (Hiroshima City University)

 

1708 - Dynamic Context-guided Capsule Network for Multimodal Machine Translation

Huan Lin (Xiamen University)*; Fandong Meng (Tencent WeChat AI - Pattern Recognition Center Tencent Inc.); Jinsong Su (Xiamen University); Yongjing Yin (Xiamen University); Zhengyuan Yang (University of Rochester); Yubin Ge (University of Illinois at Urbana-Champaign); Jie Zhou (Tencent); Jiebo Luo (U. Rochester)

 

1710 - DeepSonar: Towards Effective and Robust Detection of AI-Synthesized Fake Voices

"Run Wang (Nanyang Technological University)*; Felix Juefei-Xu (Alibaba Group); Yihao Huang (East China Normal University); Qing Guo (Nanyang Technological University); Xiaofei Xie (Nanyang Technological University); Lei Ma (Kyushu University); Yang Liu (Nanyang Technological University, Singapore)"

 

1717 - RIRNet: Recurrent-In-Recurrent Network for Video Quality Assessment

"Pengfei Chen (Xidian University / China University of Mining and Technology); Leida Li (Xidian University)*; Lei Ma (Hangzhou Multi-Color Optoelectronics Co., Ltd.); Jinjian Wu (Xidian University); Guangming Shi (Xidian University)"

 

1718 - Incremental facial expression recognition

Junjie Zhu (Tsinghua University)*; Bingjun Luo (Tsinghua University); Sicheng Zhao (University of California Berkeley); Shihui Ying (Shanghai University); Xibin Zhao (Tsinghua University); Yue Gao (Tsinghua University)

 

1719 - Black Re-ID: A Head-shoulder Descriptor for the Challenging Problem of Person Re-Identification

Boqiang Xu (University of Chinese Academy of Sciences; Institute of Automation, Chinese Academy of Sciences)*; Lingxiao He (AI Research of JD.com); Xingyu Liao (AI Research of JD.com); Wu Liu (AI Research of JD.com); Zhenan Sun (Chinese Academy of Sciences); Tao Mei (AI Research of JD.com)

 

1720 - SketchMan: Learning to Create Professional Sketch

Jia Li (Communication University of China)*; Nan Gao (Communication University of China); Tong Shen (JD AI Research); Wei Zhang (JD AI Research); Hui Ren (Communication University of China); Tao Mei (AI Research of JD.com)

 

1722 - PopMAG: Pop Music Accompaniment Generation

Yi Ren (Zhejiang University)*; Jinzheng He (Zhejiang University); Xu Tan (Microsoft Research Asia); Tao Qin (Microsoft Research Asia); Zhou Zhao (Zhejiang University); Tie-Yan Liu (Microsoft)

 

1729 - PCPL: Predicate-Correlation Perception Learning for Unbiased Scene Graph Generation

Shaotian Yan (Zhejiang University)*; Chen Shen (Alibaba Group); Zhongming Jin (Alibaba Group); Jianqiang Huang (Alibaba Group); Rongxin Jiang (Zhejiang University); Yaowu Chen (Zhejiang University); Xian-Sheng Hua (Alibaba Group)

 

1736 - Masked Face Recognition with Generative Data Augmentation and Domain Constrained Ranking

Mengyue Geng (Peking University)*; Peixi Peng (Peking University); Yangru Huang (Beijing University); Yonghong Tian (Peking University)

 

1737 - SST-EmotionNet: Spatial-Spectral-Temporal based Attention 3D Dense Network for EEG Emotion Recognition

Ziyu Jia (Beijing Jiaotong University); Youfang Lin (Beijing Jiaotong University); Xiyang Cai (Beijing Jiaotong University); Haobin Chen (Beijing Jiaotong University); Haijun Gou (Beijing Jiaotong University); Jing Wang (Beijing Jiaotong University)*

 

1754 - Occlusion Detection for Automatic Video Editing

"Junhua Liao (Sichuan University); Haihan Duan (The Chinese University of Hong Kong, Shenzhen); Xin Li (Sichuan University); Haoran Xu (Sichuan University); Yanbing Yang (Sichuan University); Wei Cai (The Chinese University of Hong Kong, Shenzhen); Yanru Chen (Sichuan University); Liangyin Chen (Sichuan University)*"

 

1758 - Cartoon Face Recognition: A Benchmark Dataset

"Yi Zheng (iQIYI, Inc.); Yifan Zhao (Beihang University); Mengyuan Ren (iQIYI, Inc.); He Yan (iQIYI, Inc.); Xiangju Lu (iQIYI, Inc.); Junhui Liu (iQIYI, Inc.); Jia Li (Beihang University)*"

 

1761 - Differentiable Manifold Reconstruction for Point Cloud Denoising

Shitong Luo (Peking University)*; Wei Hu (Peking University)

 

1765 - Perception-Lossless Codec of Haptic Data with Low Delay

Chaoyang Zeng (Fuzhou University)*; Tiesong Zhao (Fuzhou University); Qian Liu (Dalian University of Technology); Yiwen Xu (Fuzhou University); Kai Wang (Fuzhou University)

 

1770 - Reversible Watermarking in Deep Convolutional Neural Networks for Integrity Authentication

Xiquan Guan (University of Science and Technology of China)*; Weiming Zhang (University of Science and Technology of China); Huamin Feng (Beijing Electronic Science and Technology Institute); Hang Zhou (University of Science and Technology of China); Jie Zhang (University of Science and Technology of China); Nenghai Yu (University of Science and Technology of China)

 

1775 - Discriminative Spatial Feature Learning for Person Re-Identification

Peixi Peng (Peking University)*; Yonghong Tian (Peking University); Yangru Huang (Beijing University); Xiangqian Wang (Huawei); Huilong An (AI Application Research Center)

 

1779 - Masked Face Recognition with Latent Part Detection

Feifei Ding (Peking University)*; Peixi Peng (Peking University); Yangru Huang (Beijing University); Mengyue Geng (Peking University); Yonghong Tian (Peking University)

 

1781 - FakePolisher: Making DeepFakes More Detection-Evasive by Shallow Reconstruction

"Yihao Huang (East China Normal University)*; Felix Juefei-Xu (Alibaba Group); Run Wang (Nanyang Technological University); Qing Guo (Nanyang Technological University); Lei Ma (Kyushu University); Xiaofei Xie (Nanyang Technological University); Jianwen Li (East China Normal University); Weikai Miao (East China Normal University); Yang Liu (Nanyang Technological University, Singapore); Geguang Pu (East China Normal University)"

 

1784 - SalGCN: Saliency Prediction for 360-Degree Images Based on Spherical Graph Convolutional Networks

Haoran Lv (Shanghai Jiao Tong University)*; Qin Yang (Shanghai Jiao Tong University); Chenglin Li (Shanghai Jiao Tong University); Wenrui Dai (Shanghai Jiao Tong University); Junni Zou (Shanghai Jiao Tong University); Hongkai Xiong (Shanghai Jiao Tong University)

 

1789 - Neural3D: Light-weight Neural Portrait Scanning via Context-aware Correspondence Learning

Xin Suo (ShanghaiTech University); Minye Wu (ShanghaiTech University); Yanshun Zhang (Dgene); Yingliang Zhang (Dgene); Qiang Hu (ShanghaiTech University)*; Lan Xu (HKUST); Jingyi Yu (ShanghaiTech University)

 

1790 - PanelNet: A Novel Deep Neural Network for Predicting Collective Diagnostic Ratings by a Panel of Radiologists for Pulmonary Nodules

"Chunyan Zhang (Institute of Medical Artificial Intelligence, the Second Affiliated Hospital of Xi'an Jiaotong University); Songhua Xu (School of Mathematics and Statistics, Xi'an Jiaotong University)*; Zongfang Li (Institute of Medical Artificial Intelligence, the Second Affiliated Hospital of Xi'an Jiaotong University)"

 

1795 - Multi-modal Multi-relational Feature Aggregation Network for Medical Knowledge Representation Learning

"Yingying Zhang (Institute of Automation, Chinese Academy of Sciences; University of Chinese Academy of Sciences); Quan Fang (Institute of Automation, Chinese Academy of Sciences); Shengsheng Qian (Institute of Automation, Chinese Academy of Sciences); Changsheng Xu (CASIA)*"

 

1800 - AdaHGNN: Adaptive Hypergraph Neural Networks for Multi-Label Image Classification

"Xiangping Wu (Harbin Institute of Technology, Shenzhen); Qingcai Chen (Harbin Institute of Technology, Shenzhen)*; Wei Li (Harbin Institute of Technology, Shenzhen); Yulun Xiao (Harbin Institute of Technology, Shenzhen); Baotian Hu (University of Massachusetts)"

 

1801 - Privacy-Preserving Visual Content Tagging using Graph Transformer Networks

"Xuan-Son Vu (Umeå University)*; Duc-Trong Le (Vietnam National University); Christoffer K Edlund (Sartorius); Lili Jiang (Department of Computing Science, Umeå University, Sweden); Hoang D. Nguyen (University of Glasgow)"

 

1803 - Task-distribution-aware Meta-learning for Cold-start CTR Prediction

"Tianwei Cao (University of Chinese Academy of Sciences)*; Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences); Zhiyong Yang (SKLOIS, Institute of Information Engineering, Chinese Academy of Sciences; SCS, University of Chinese Academy of Sciences); Qingming Huang (University of Chinese Academy of Sciences)"

 

1806 - FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire

"Jinglin Liu (Zhejiang University)*; Yi Ren (Zhejiang University); Zhou Zhao (Zhejiang University); Chen Zhang (Zhejiang University); Baoxing Huai (HUAWEI TECHNOLOGIES CO., LTD.); Jing Yuan (Huawei Cloud BU)"

 

1816 - Identity-Aware Attribute Recognition via Real-Time Distributed Inference in Mobile Edge Clouds

Zichuan Xu (Dalian University of Technology); Jiangkai Wu (Dalian University of Technology); Qiufen Xia (Dalian University of Technology)*; Pan Zhou (Huazhong University of Science and Technology); Jiankang Ren (Dalian University of Technology); Huizhi Liang ()

 

1825 - A Novel Object Re-Track Framework for 3D Point Clouds

Tuo Feng (Xidian University)*; Licheng Jiao (Xidian University); Hao Zhu (Xidian University); Long Sun (Xidian University)

 

1828 - Reinforced Similarity Learning: Siamese Relation Networks for Robust Object Tracking

Dawei Zhang (Zhejiang Normal University)*; Zhonglong Zheng (Zhejiang Normal University); Minglu Li (Zhejiang Normal University); Xiaowei He (Zhejiang Normal University); Tianxiang Wang (Zhejiang Normal University); Liyuan Chen (Zhejiang Normal University); Riheng Jia (Zhejiang Normal University); Feilong Lin (Zhejiang Normal University)

 

1832 - "AffectI: A Game for Diverse, Reliable, and Efficient Affective Image Annotation"

Xingkun Zuo (University of Yamanashi); Jiyi Li (University of Yamanashi / RIKEN AIP); Qili Zhou (Hangzhou Dianzi University); Jianjun Li (Hangzhou Dianzi University); Xiaoyang Mao (University of Yamanashi)*

 

1833 - Photo Stream Question Answer

"Wenqiao Zhang (Zhejiang University)*; Siliang Tang (Zhejiang University); Yanpeng Cao (Zhejiang University); Jun Xiao (Zhejiang University); Shiliang Pu (Hikvision Research Institute); Fei Wu (Zhejiang University, China); Yueting Zhuang (Zhejiang University)"

 

1835 - Relational Graph Learning for Grounded Video Description Generation

Wenqiao Zhang (Zhejiang University)*; Xin Wang (UC Santa Barbara); Siliang Tang (Zhejiang University); Haochen Shi (Zhejiang University); Jun Xiao (Zhejiang University); Haizhou Shi (Zhejiang University); Yueting Zhuang (Zhejiang University); William Yang Wang (UC Santa Barbara)

 

1837 - Cognitive Representation Learning of Self-Media Online Article Quality

"Yiru Wang (Tencent Inc.; Tsinghua University)*; Shen Huang (Tencent Inc.); Gongfu Li (Tencent Inc.); Qiang Deng (Tencent Inc.); Dongliang Liao (Data Quality Team, WeChat, Tencent Inc., China); Pengda Si (Tsinghua University); Yujiu Yang (Tsinghua University); Jin Xu (Tencent Inc.)"

 

1841 - Exploiting Active Learning in Novel Refractive Error Detection with Smartphones

Eugene Yujun Fu (The Hong Kong Polytechnic University)*; Zhongqi Yang (The Hong Kong Polytechnic University); Hong Va Leong (The Hong Kong Polytechnic University); Grace Ngai (The Hong Kong Polytechnic University); Chi-wai Do (The Hong Kong Polytechnic University); Lily Chan (The Hong Kong Polytechnic University)

 

1852 - Describing Subjective Experiment Consistency by p-value qq-plot

Jakub Nawała (AGH University of Science and Technology)*; Lucjan Janowski (AGH University of Science and Technology); Bogdan Ćmiel (); Krzysztof Rusek (AGH University of Science and Technology)

 

1859 - Deep Structural Contour Detection

Ruoxi Deng (Central South University)*; Shengjun Liu (Central South University)

 

1862 - Low-latency FoV-adaptive Coding and Streaming for Interactive 360-Degree Video Streaming

Yixiang Mao (New York University)*; Liyang Sun (New York University); Yong Liu (NYU); Yao Wang (New York University)

 

1874 - Multimodal Multi-Task Financial Risk Forecasting

"Ramit Sawhney (Netaji Subhas Institute of Technology)*; Puneet Mathur (University of Maryland, College Park); Ayush Mangal (IIT Roorkee); Piyush Khanna (Delhi Technological University); Rajiv Ratn Shah (Indraprastha Institute of Information Technology, Delhi); Roger Zimmermann (NUS)"

 

1888 - Multimodal Attention with Image Text Spatial Relationship for OCR-Based Image Captioning

Jing Wang (Nanjing University of Science and Technology)*; Jinhui Tang (Nanjing University of Science and Technology); Jiebo Luo (U. Rochester)

 

1889 - Rotationally-Consistent Novel View Synthesis for Humans

YoungJoong Kwon (The University of North Carolina at Chapel Hill)*; Stefano Petrangeli (Adobe); Haoliang Wang (Adobe Research); Dahun Kim (KAIST); Henry Fuchs (UNC); Viswanathan Swaminathan (Adobe)

 

1890 - Language Models as Emotional Classifiers for Textual Conversation

Connor Heaton (Pennsylvania State University)*; David M Schwartz (Penn State)

 

1893 - Cross-modal Non-linear Guided Attention and Temporal Coherence in Multi-modal Deep Video Models

Saurabh Sahu (); Palash Goyal (Samsung Research); Shalini Ghosh (Samsung Research)*; Chul Lee (Samsung Research America)

 

1918 - Integrating Semantic Segmentation and Retinex Model for Low-Light Image Enhancement

Minhao Fan (Peking University)*; Wenjing Wang (Peking University); Wenhan Yang (Peking University); Jiaying Liu (Peking University)

 

1919 - Alleviating Human-level Shift: A Robust Domain Adaptation Method for Multi-person Pose Estimation

Xixia Xu (Beijing Jiaotong University)*; Qi Zou (Beijing Jiaotong University); Xue Lin (Beijing Jiaotong University)

 

1924 - Price Suggestion for Online Second-hand Items with Texts and Images

Liang Han (Stony Brook University)*; Zhaozheng Yin (Stony Brook University); Zhurong Xia (Alibaba Group); Minqian Tang (Alibaba Group); Rong Jin (Alibaba Group)

 

1927 - SpatialGAN: Progressive Image Generation Based on Spatial Recursive Adversarial Expansion

"Lei Zhao (Zhejiang University)*; Sihuan Lin (Zhejiang University); Ailin Li (College of Computer Science and Technology, Zhejiang University); Huaizhong Lin (Zhejiang University); Wei Xing (Zhejiang University); Dongming Lu (Zhejiang University)"

 

1932 - Medical Visual Question Answering via Conditional Reasoning

Li-Ming Zhan (The Hong Kong Polytechnic University)*; Bo Liu (The Hong Kong Polytechnic University); Lu Fan (The Hong Kong Polytechnic University); Jiaxin Chen (The Hong Kong Polytechnic University); Xiao-Ming Wu (PolyU Hong Kong)

 

1938 - Towards Modality Transferable Visual Information Representation with Optimal Model Compression

"Rongqun Lin (City University of Hong Kong)*; Linwei Zhu (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Shiqi Wang (CityU); Sam Kwong (City University of Hong Kong)"

 

1939 - Nighttime Dehazing with a Synthetic Benchmark

Jing Zhang (The University of Sydney)*; Yang Cao (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China); Dacheng Tao (The University of Sydney)

 

1942 - Video Relation Detection via Multiple Hypothesis Association

Zixuan Su (Fudan University)*; Xindi Shang (National University of Singapore); Jingjing Chen (Fudan University); Yu-Gang Jiang (Fudan University); Zhiyong Qiu (Tencent); Tat-Seng Chua (National Univ. of Singapore)

 

1946 - Multi-modal Cooking Workflow Construction for Food Recipes

Liang-Ming Pan (National University of Singapore)*; Jingjing Chen (Fudan University); Jianlong Wu (Fudan University); Shaoteng Liu (Xi'an Jiaotong University); Chong-Wah Ngo (City University of Hong Kong); Min-Yen Kan (National University of Singapore); Yu-Gang Jiang (Fudan University); Tat-Seng Chua (National University of Singapore)

 

1947 - Pay Attention Selectively and Comprehensively: Pyramid Gating Network for Human Pose Estimation

Chenru Jiang (XJTLU)*; Kaizhu Huang (Xi'an Jiaotong-Liverpool Univ.); Shufei Zhang (University of Liverpool); Xinheng Wang (Xi'an Jiaotong-Liverpool University); Jimin Xiao (Xi'an Jiaotong-Liverpool University)

 

1950 - Distributed Multi-agent Video Fast-forwarding

"Shuyue Lan (Northwestern University)*; Zhilu Wang (Northwestern University); Amit K. Roy-Chowdhury (University of California, Riverside); Ermin Wei (); Zhu Qi (Northwestern University)"

 

1954 - Data-driven Meta-set Based Fine-Grained Visual Recognition

Chuanyi Zhang (Nanjing University of Science and Technology); Yazhou Yao (Nanjing University of Science and Technology)*; Xiangbo Shu (Nanjing University of Science and Technology); Zechao Li (Nanjing University of Science and Technology); Zhenmin Tang (Nanjing University of Science and Technology); Qi Wu (University of Adelaide)

 

1958 - WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection

Bojia Zi (Fudan University)*; Xingjun Ma (Deakin University); Jingjing Chen (Fudan University); Minghao Chang (Fudan University); Yu-Gang Jiang (Fudan University)

 

1964 - Anisotropic Stroke Control for Multiple Artists Style Transfer

Xuanhong Chen (Shanghai Jiao Tong University); Xirui Yan (Shanghai Jiao Tong University); Naiyuan Liu (Shanghai Jiao Tong University); Ting Qiu (Shanghai Jiao Tong University); Bingbing Ni (Shanghai Jiao Tong University)*

 

1965 - LodoNet: A Deep Neural Network with Keypoint Matching for LiDAR Odometry

Ce Zheng (University of North Carolina at Charlotte)*; Yecheng Lyu (Worcester Polytechnic Institute); Ming Li (Worcester Polytechnic Institute); Ziming Zhang (Worcester Polytechnic Institute)

 

1966 - Towards Accuracy-Fairness Paradox: Adversarial Example-based Data Augmentation for Visual Debiasing

"Yi Zhang (Beijing Jiaotong University, China)*; Jitao Sang (Beijing Jiaotong University, China)"

 

1967 - Occluded Facial Expression Recognition with Step-Wise Assistance from Unpaired Non-Occluded Images

Bin Xia (University of Science and Technology of China); Shangfei Wang (University of Science and Technology of China)*

 

1968 - Learning from Macro-expression: a Micro-expression Recognition Framework

Bin Xia (University of Science and Technology of China); Weikang Wang (University of Science and Technology of China); Shangfei Wang (University of Science and Technology of China)*; Enhong Chen (University of Science and Technology of China)

 

1980 - HOT-Net: Non-Autoregressive Transformer for 3D Hand-Object Pose Estimation

"Lin Huang (University at Buffalo)*; Jianchao Tan (Kwai Inc.); Jingjing Meng (State University of New York at Buffalo); Ji Liu (Kwai Inc.); Junsong Yuan (State University of New York at Buffalo, USA)"

 

1987 - Emotion-Based End-to-End Matching Between Image and Music in Valence-Arousal Space

"Sicheng Zhao (University of California Berkeley)*; Yaxian Li (Renmin University of China); Xingxu Yao (Nankai University); Weizhi Nie (Tianjin University); Pengfei Xu (Didi Chuxing); Jufeng Yang (Nankai University); Kurt Keutzer (EECS, UC Berkeley)"

 

1988 - IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning

"Zhenhuan Liu (Institute of Computing Technology, Chinese Academy of Sciences); Liang Li (Institute of Computing Technology, Chinese Academy of Sciences)*; Shaofei Cai (Institute of Computing Technology, Chinese Academy of Sciences); Jincan Deng (Institute of Computing Technology, Chinese Academy of Sciences); Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences); Shuhui Wang (VIPL, ICT, Chinese Academy of Sciences); Qingming Huang (University of Chinese Academy of Sciences)"

 

1994 - LIGHTEN: Learning Interactions with Graph and Hierarchical TEmporal Networks for HOI in videos

"Sai Praneeth Reddy Sunkesula (Indian Institute of Technology, Bombay)*; Rishabh Dabral (IIT Bombay); Ganesh Ramakrishnan (IIT Bombay)"

 

2004 - Learning Semantic Concepts and Temporal Alignment for Narrated Video Procedural Captioning

"Botian Shi (Beijing Institute of Technology)*; Lei Ji (Microsoft); Zhendong Niu (Beijing Institute of Technology); Nan Duan (Microsoft Research); Ming Zhou (Microsoft Research); Xilin Chen (Institute of Computing Technology, Chinese Academy of Sciences)"

 

2010 - Multi-Features Fusion and Decomposition for Age-Invariant Face Recognition

"Lixuan Meng (School of Mechanical, Electrical and Information Engineering, Shandong University, China); Chenggang Yan (Hangzhou Dianzi University); Jun Li (School of Mechanical, Electrical and Information Engineering, Shandong University, China); Jian Yin (Department of Computer, Shandong University, Weihai, China)*; Wu Liu (AI Research of JD.com); Hongtao Xie (University of Science and Technology of China); Liang Li (Institute of Computing Technology, Chinese Academy of Sciences)"

 

2014 - BS-MCVR: Binary-sensing based Mobile-cloud Visual Recognition

"Hongyi Zheng (The Hong Kong Polytechnic University); Lei Zhang (Hong Kong Polytechnic University, Hong Kong, China)*"

 

2015 - Part-Aware Interactive Learning for Scene Graph Generation

Hongshuo Tian (Tianjin University); Ning Xu (Tianjin University)*; An-An Liu (Tianjin University); Yongdong Zhang (University of Science and Technology of China)

 

2017 - Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition

Yuqian Fu (Fudan University)*; Yanwei Fu (Fudan University); Junke Wang (Fudan University); Li Zhang (University of Oxford); Xing Zhang (Fudan University); Yu-Gang Jiang (Fudan University)

 

2030 - Learning Modality-Invariant Latent Representations for Generalized Zero-shot Learning

Jingjing Li (University of Electronic Science and Technology of China)*; Mengmeng Jing (University of Electronic Science and Technology of China); Lei Zhu (Shandong Normal University); Zhengming Ding (Indiana University-Purdue University Indianapolis); Ke Lu (University of Electronic Science and Technology of China); Yang Yang (University of Electronic Science and Technology of China)

 

2032 - When Bitstream Prior Meets Deep Prior: Compressed Video Super-resolution with Learning from Decoding

Peilin Chen (City University of Hong Kong)*; Wenhan Yang (City University of Hong Kong); Long Sun (Huawei); Shiqi Wang (CityU)

 

2035 - Describe What to Change: A Text-guided Unsupervised Image-to-image Translation Approach

"Yahui Liu (University of Trento); Marco De Nadai (Fondazione Bruno Kessler)*; Deng Cai (The Chinese University of Hong Kong); Huayang Li (Tencent AI Lab); Xavier Alameda-Pineda (INRIA); Nicu Sebe (University of Trento); Bruno Lepri (FBK, Trento, Italy)"

 

2042 - Exploiting Multi-Emotion Relations at Feature and Label Levels for Emotion Tagging

Zhiwei Xu (University of Science and Technology of China); Shangfei Wang (University of Science and Technology of China)*; Can Wang (USTC)

 

2043 - Memory-Based Network for Scene Graph with Unbalanced Relations

Weitao Wang (Southeast University); Ruyang Liu (Southeast University); Meng Wang (Southeast University); Sen Wang (The University of Queensland)*; Xiaojun Chang (Monash University); Yang Chen (Southeast University)

 

2052 - Increasing Video Perceptual Quality with GANs and Semantic Coding

Leonardo Galteri (University of Florence); Marco Bertini (University of Florence)*; Lorenzo Seidenari (University of Florence); Tiberio Uricchio (University of Florence); Alberto Del Bimbo (University of Florence)

 

2053 - Attentive One-Dimensional Heatmap Regression for Facial Landmark Detection and Tracking

Shi Yin (University of Science and Technology of China); Shangfei Wang (University of Science and Technology of China)*; Xiaoping Chen (University of Science and Technology of China); Enhong Chen (University of Science and Technology of China); Cong Liang (University of Science and Technology of China)

 

2071 - Fine-Grained Similarity Measurement between Educational Videos and Exercises

"Xin Wang (University of Science and Technology of China); Wei Huang (University of Science and Technology of China); Qi Liu (University of Science and Technology of China, China)*; Yu Yin (University of Science and Technology of China); Zhenya Huang (University of Science and Technology of China); Le Wu (Hefei University of Technology); Jianhui Ma (University of Science and Technology of China); Xue Wang (Nankai University)"

 

2073 - One-shot Text Field labeling using Attention and Belief Propagation for Structure information extraction

Mengli Cheng (Alibaba Group)*; Minghui Qiu (Alibaba)

 

2081 - GRAD: Learning for Overhead-aware Adaptive Video Streaming with Scalable Video Coding

Yunzhuo Liu (Shanghai Jiao Tong University); Bo Jiang (Shanghai Jiao Tong University)*; Tian Guo (Worcester Polytechnic Institute); Ramesh K. Sitaraman (UMass Amherst & Akamai Technologies); Don Towsley (University of Massachusetts Amherst); Xinbing Wang (Shanghai Jiao Tong University)

 

2084 - LGNN: A context-aware line segment detector

Quan Meng (ShanghaiTech University)*; Jiakai Zhang (ShanghaiTech University); Qiang Hu (ShanghaiTech University); Xuming He (ShanghaiTech University); Jingyi Yu (ShanghaiTech University)

 

2088 - Down to the Last Detail: Virtual Try-on with Fine-grained Details

Jiahang Wang (Huazhong University of Science and Technology)*; Tong Sha (Beihang University); Wei Zhang (JD AI Research); Zhoujun Li (Beihang University); Tao Mei (AI Research of JD.com)

 

2095 - Uncertainty-aware Cross-dataset Facial Expression Recognition via Regularized Conditional Alignment

Linyi Zhou (Nanjing Forestry University); Xijian Fan (Nanjing Forestry University)*; Yingjie Ma (Nanjing Forestry University); Dr. Tardi Tjahjadi (Warwick University); Qiaolin Ye (Nanjing Forestry University)

 

2103 - Pairwise Similarity Regularization for Adversarial Domain Adaptation

Haotian Wang (National University of Defense Technology); Wenjing Yang (National University of Defense Technology)*; Ji Wang (National University of Defense Technology); Ruxin Wang (Union Vision Innovation Co Ltd.); Long Lan (NUDT); Mingyang Geng (National University of Defense Technology)

 

2106 - Generalized Zero-Shot Video Classification via Generative Adversarial Networks

Mingyao Hong (University of Chinese Academy of Sciences)*; Guorong Li (University of Chinese Academy of Sciences); Xinfeng Zhang (University of Chinese Academy of Sciences); Qingming Huang (University of Chinese Academy of Sciences)

 

2111 - DeVLBert: Learning Deconfounded Visio-Linguistic Representations

"Shengyu Zhang (Zhejiang University)*; Tan Jiang (ZhangJiang University); Tan Wang (University of Electronic Science and Technology of China); Kun Kuang (Zhejiang University); Zhou Zhao (Zhejiang University); Jianke Zhu (Zhejiang University); Jin Yu (Alibaba Group); Hongxia Yang (Alibaba Group); Fei Wu (Zhejiang University, China)"

 

2112 - Drum Synthesis and Rhythmic Transformation with Adversarial Autoencoders

Maciej Tomczak (Birmingham City University)*; Jason Hockman (Birmingham City University); Masataka Goto (National Institute of Advanced Industrial Science and Technology (AIST))

 

2115 - "Presence, embodied interaction and motivation: distinct learning phenomena in an immersive virtual environment"

Jack Ratcliffe (QMUL)*

 

2116 - AdaP-360: User-Adaptive Area-of-Focus Projections for Bandwidth-Efficient 360-Degree Video Streaming

Chao Zhou (SUNY Binghamton); Shuoqian Wang (SUNY Binghamton); Mengbai Xiao (The Ohio State University); Sheng Wei (Rutgers University - New Brunswick); Yao Liu (SUNY Binghamton)*

 

2130 - Retrieval Guided Unsupervised Multi-domain Image to Image Translation

"Raul Gomez (Eurecat, Unitat de Tecnologies Audiovisuals - Computer Vision Centre, Universitat Aut¨°noma de Barcelona); Yahui Liu (University of Trento); Marco De Nadai (Fondazione Bruno Kessler)*; Dimosthenis Karatzas (Computer Vision Centre); Bruno Lepri (FBK, Trento, Italy); Nicu Sebe (University of Trento)"

 

2145 - MMNet: Multi-Stage and Multi-Scale Fusion Network for RGB-D Salient Object Detection

Guibiao Liao (Peking University); Wei Gao (Peking University)*; Qiuping Jiang (Ningbo University); Ronggang Wang (Peking University); Ge Li (Peking University)

 

2151 - Reduce the Influence of Stability in Content Delivery Network via Learning-Based Caching Algorithm

Gang Yan (Binghamton University-SUNY); Jian Li (Binghamton University-SUNY)*

 

2158 - Temporal Denoising Mask Synthesis Network for Learning Blind Video Temporal Consistency

Yifeng Zhou (University of Electronic Science and Technology of China); Xing Xu (University of Electronic Science and Technology of China)*; Fumin Shen (UESTC); Lianli Gao (The University of Electronic Science and Technology of China); Huimin Lu (Kyushu Institute of Technology); Heng Tao Shen (University of Electronic Science and Technology of China (UESTC))

 

2161 - Stable Video Style Transfer Based on Partial Convolution with Depth-Aware Supervision

Songhua Liu (Nanjing University); Wu Hao (Nanjing University); Shoutong Luo (Nanjing University); Zhengxing Sun (Nanjing University)*

 

2171 - Interpretable Video Synthesis via Transform-Based Tensor Reconstruction Network

Yimeng Zhang (Columbia University); Xiao-Yang Liu (Columbia University); Bo Wu (MIT-IBM Watson AI Lab)*; Anwar Walid (Bell Laboratories)

 

2174 - INCLUDE: A Large Scale Dataset for Indian Sign Language Recognition

Advaith Sridhar (IIT Madras)*; Rohith Gandhi G (IIT Madras); Pratyush Kumar (IIT Madras); Mitesh Khapra (IIT Madras)

 

2178 - Cluster Attention Contrast for Video Anomaly Detection

Ziming Wang (Peking University); Yuexian Zou (Peking University)*; Zeming Zhang (Harbin Institute of Technology)

 

2198 - Automatic Interest Recognition from Posture and Behaviour

Wolmer Bigi (University of Florence); Claudio Baecchi (University of Florence)*; Alberto Del Bimbo (University of Florence)

 

2199 - Finding Achilles' Heel: Adversarial Attack on Multi-modal Action Recognition

"Deepak Kumar (University of Massachusetts Dartmouth)*; Chetan Kumar (University of Massachusetts Dartmouth); Chun Wei Seah (University of Massachusetts); Siyu Xia (Southeast University, China); Ming Shao (University of Massachusetts Dartmouth)"

 

2205 - A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild

"Prajwal K R (International Institute of Information Technology, Hyderabad)*; Rudrabha Mukhopadhyay (IIIT Hyderabad); Vinay Namboodiri (University of Bath); C.V. Jawahar (IIIT-Hyderabad)"

 

2209 - PanoRTC: A System for Content-Adaptive Real-Time 360-Degree Video Communication

Shuoqian Wang (SUNY Binghamton); Xiaoyang Zhang (SUNY Binghamton); Mengbai Xiao (The Ohio State University); Kenneth Chiu (Binghamton University); Yao Liu (SUNY Binghamton)*

 

2225 - Fonts Like This but Happier: A New Way to Discover Fonts

Tugba Kulahcioglu (Rutgers University)*; Gerard de Melo (Hasso Plattner Institute)

 

2226 - User Centered Adaptive Streaming of Dynamic Point Clouds with Low Complexity Tiling

"Shishir Subramanyam (Centrum Wiskunde & Informatica)*; Irene Viola (CWI); Alan Hanjalic (TU Delft, Netherlands); Pablo Cesar (CWI, The Netherlands)"

 

2237 - Efficient adaptation of neural network filter for video compression

Yat-Hong Lam (Nokia Technologies)*; Alireza Zare (Nokia Technologies); Francesco Cricri (Nokia Technologies); Jani Lainema (Nokia); Miska Hannuksela (Nokia Technologies)

 

2239 - An Advanced LiDAR Point Cloud Sequence Coding Scheme for Autonomous Driving

Xuebin Sun (CUHK)*; Sukai Wang (HKUST); Miaohui Wang (Shenzhen University); Shing Shin Cheng (CUHK); Ming Liu (HKUST)

 

2242 - Adaptive Multimodal Fusion for Facial Action Units Recognition

Huiyuan Yang (Binghamton University-SUNY)*; Lijun Yin (State University of New York at Binghamton); Taoyue Wang (State University of New York at Binghamton)

 

2246 - An Analysis of Delay in Live 360° Video Streaming Systems

"Jun Yi (Georgia State University)*; Md Reazul Islam (Georgia State University); Shivang Aggarwal (University at Buffalo, The State University of New York); Dimitrios Koutsonikolas (SUNY Buffalo); Y. Charlie Hu (Purdue University); Zhisheng Yan (Georgia State University)"

 

2249 - Adaptive Temporal Triplet-loss for Cross-modal Embedding Learning

David Semedo (Universidade NOVA de Lisboa)*; Joao Magalhaes (Universidade NOVA de Lisboa)

 

2253 - CFVMNet: A Multi-branch Network for Vehicle Re-identification Based on the Common Field of View

Ziruo Sun (Shandong University); Xiushan Nie (Shandong Jianzhu University)*; Xiaoming Xi (Shandong Jianzhu University); Yilong Yin (Shandong University)

 

2257 - SonoSpace: Visual Feedback of Timbre with Unsupervised Learning

Naoki Kimura (The University of Tokyo)*; Keisuke Shiro (The University of Tokyo); Yota Takakura (Innoqua Inc.); Hiromi Nakamura (The University of Tokyo); Jun Rekimoto (The University of Tokyo)

 

2262 - Learning Optimization-based Adversarial Perturbations for Attacking Sequential Recognition Models

Xing Xu (University of Electronic Science and Technology of China)*; Jiefu Chen (University of Electronic Science and Technology of China); Jinhui Xiao (University of Electronic Science and Technology of China); Zheng Wang (UESTC); Yang Yang (University of Electronic Science and Technology of China); Heng Tao Shen (University of Electronic Science and Technology of China (UESTC))

 

2264 - Amora: Black-box Adversarial Morphing Attack

"Run Wang (Nanyang Technological University)*; Felix Juefei-Xu (Alibaba Group); Qing Guo (Nanyang Technological University); Yihao Huang (East China Normal University); Xiaofei Xie (Nanyang Technological University); Lei Ma (Kyushu University); Yang Liu (Nanyang Technology University, Singapore)"

 

2269 - PmR-QP: Prediction-Based R-QP Modeling on Bitrate Estimation

Yangfan Sun (University of Missouri-Kansas City); Li Li (University of Missouri-Kansas City); Zhu Li (University of Missouri-Kansas City)*; Shan Liu (Tencent America)

 

2272 - GangSweep: Sweep out Neural Backdoors by GAN

Liuwan Zhu (Old Dominion University)*; Rui Ning (Old Dominion University); Cong Wang (Old Dominion University); Chunsheng Xin (Old Dominion University); Michael Wu (Nil)

 

2273 - Exploiting Self-Supervised and Semi-Supervised Learning for Facial Landmark Tracking with Unlabeled Data

Shi Yin (University of Science and Technology of China); Shangfei Wang (University of Science and Technology of China)*; Xiaoping Chen (University of Science and Technology of China); Enhong Chen (University of Science and Technology of China)

 

2274 - MS^2L: Multi-task Self-supervised Learning for Skeleton Based Action Recognition

Lilang Lin (Peking University)*; Sijie Song (Peking University); Wenhan Yang (Peking University); Jiaying Liu (Peking University)

 

2275 - Exploiting Heterogeneous Composer and Listener Preference Graph for Music Genre Classification

"Chunyuan Yuan (Institute of Information Engineering, Chinese Academy of Sciences)*; Qianwen Ma ( Institute of Information Engineering, School of Cyber Security, University of Chinese Academy of Sciences); junyang chen (University of Macau); Yijun Lu (Alibaba Cloud Computing Co. Ltd.); Wei Zhou (Institute of Information Engineering, School of Cyber Security, University of Chinese Academy of Sciences); Jizhong Han ( Institute of Information Engineering,Chinese Academy of Sciences); Songlin Hu ( Institute of Information Engineering,Chinese Academy of Sciences)"

 

2292 - Tile Rate Allocation for 360-Degree Tiled Adaptive Video Streaming

Praveen Kumar Yadav (National University of Singapore)*; Wei Tsang Ooi (National University of Singapore)

 

2297 - Sequential Attention GAN for Interactive Image Editing

Yu Cheng (Microsoft)*; Zhe Gan (Microsoft); Yitong Li (Apple Inc); Jingjing Liu (Microsoft); Jianfeng Gao (Microsoft Research)

 

2298 - Cross Corpus Physiological-based Emotion Recognition Using a Learnable Visual Semantic Graph Convolutional Network

"Woan-Shiuan Chien (Department of Electrical Engineering, National Tsing Hua University ); Hao-Chun Yang (Department of Electrical Engineering, National Tsing Hua University); Chi-Chun Lee (Department of Electrical Engineering, National Tsing Hua University)*"

 

2314 - Domain-Adaptive Object Detection via Uncertainty-Aware Distribution Alignment

Dang-Khoa Nguyen (National Chiao Tung University); Wei-Lun Tseng (National Chiao Tung University); Hong-Han Shuai (National Chiao Tung University)*

 

2323 - Single Image Deraining via Scale-space Invariant Attention Neural Network

Bo Pang (Harbin Institute of Technology); Deming Zhai (Harbin Institute of Technology); Junjun Jiang (Harbin Institute of Technology); Xianming Liu (Harbin Institute of Technology)*

 

2327 - MM-Hand: 3D-Aware Multi-Modal Guided Hand Generation for 3D Hand Pose Synthesis

"Zhenyu Wu (Texas A&M University)*; Duc Hoang (Texas A&M); Shih-Yao Lin (Tencent America); Yusheng Xie (Amazon); Liangjian Chen (University of California, Irvine); Yen-Yu Lin (National Chiao Tung University); Zhangyang Wang (University of Texas at Austin); Wei Fan (Tencent)"

 

2340 - Graph-Refined Convolutional Network for Multimedia Recommendation with Implicit Feedback

Yinwei Wei (Shandong University)*; Xiang Wang (National University of Singapore); Liqiang Nie (Shandong University); Xiangnan He (University of Science and Technology of China); Tat-Seng Chua (National Univ. of Singapore)

 

2342 - Concept-based Explanation for Fine-grained Images and Its Application in Infectious Keratitis Classification

Zhengqing Fang (Zhejiang University)*; Kun Kuang (Zhejiang University); Yuxiao Lin (Zhejiang University); Fei Wu (Zhejiang University); Yufeng Yao (Zhejiang University)

 

2346 - Visually Precise Query

"Riddhiman Dasgupta (Microsoft); Francis Tom (Microsoft); Sudhir Kumar (Microsoft); Mithun Das Gupta (Microsoft,India)*; Yokesh Kumar (Microsoft); Badri Patro (IIT Kanpur); Vinay P Namboodiri (IIT Kanpur)"

 

2380 - Joint Self-Attention and Scale-Aggregation for Self-Calibrated Deraining Network

Yutong Wu (Dalian University of Technology)*; Cong Wang (Dalian University of Technology); Zhixun Su (Dalian University of Technology); Junyang Chen (University of Macau)

 

2390 - Hybrid Dynamic-static Context-aware Attention Network for Action Assessment in Long Videos

"Ling-An Zeng (Sun Yat-sen University); Fa-Ting Hong (Sun Yat-Sen University); WEI-SHI ZHENG (Sun Yat-sen University, China)*; Qizhi Yu (Zhejiang Laboratory); Wei Zeng (Peking University, China); Yaowei Wang (PengCheng Laboratory); Jian-Huang Lai (Sun Yat-sen University)"

 

2394 - F2GAN: Fusing-and-Filling GAN for Few-shot Image Generation

Yan Hong (Shanghai Jiao Tong University); Li Niu (Shanghai Jiao Tong University)*; Jianfu Zhang (Shanghai Jiao Tong University); Weijie Zhao (Versa-AI); Chen Fu (Versa-AI); Liqing Zhang (Shanghai Jiao Tong University)

 

2405 - JAFPro: Joint Appearance Fusion and Propagation for Human Video Motion Transfer from Multiple Reference Images

"Xianggang Yu (The Chinese University of Hong Kong, Shenzhen); Haolin Liu (The Chinese University of Hong Kong, Shenzhen); Xiaoguang Han (Shenzhen Research Institute of Big Data, the Chinese University of Hong Kong (Shenzhen))*; Zhen Li (Chinese University of Hong Kong, Shenzhen); Zixiang Xiong (Texas A&M University); Shuguang Cui (The Chinese University of Hong Kong, Shenzhen )"

 

2407 - A W2VV++ Case Study with Automated and Interactive Text-to-Video Retrieval

"Jakub Lokoc (Charles University in Prague); Tom¨¢_ Sou_ek (Charles University, Prague ); Patrik Vesel_ (Charles University, Prague ); Franti_ek Mejzl¨ªk (Charles University, Prague ); Jiaqi Ji (Renmin University of China); Chaoxi Xu (Renmin University of China); Xirong Li (Renmin University of China)*"

 

2415 - Attention Cube Network for Image Restoration

Yucheng Hang (Tsinghua University); Qingmin Liao (Tsinghua University); Wenming Yang (Tsinghua University)*; Yupeng Chen (Peng Cheng Laboratory); Jie Zhou (Tsinghua University)

 

2419 - CRNet: A Center-aware Representation for Detecting Text of Arbitrary Shapes

Yu Zhou (University of Science and Technology of China)*; Hongtao Xie (University of Science and Technology of China); Shancheng Fang (University of Science and Technology of China); Yan Li (Kuaishou); Yongdong Zhang (University of Science and Technology of China)

 

2448 - Visual Relation of Interest Detection

Fan Yu (Nanjing University); Haonan Wang (Nanjing University); Tongwei Ren (Nanjing University)*; Jinhui Tang (Nanjing University of Science and Technology); Gangshan Wu (Nanjing University)

 

2463 - Expressional Region Retrieval

"xiaoqian guo (Institute of Computing Technology, Chinese Academy of Sciences)*; Xiangyang Li (Institute of Computing Technology, Chinese Academy of Sciences); Shuqiang Jiang (ICT, China Academy of Science)"

 

2464 - Generalized Zero-shot Learning with Multi-source Semantic Embeddings for Scene Recognition

"Xinhang Song (ICT)*; Haitao Zeng (China University of Mining & Technology (Beijing),and ICT, Chinese Academy of Sciences); sixian zhang (Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS)); Luis Herranz (Computer Vision Center); Shuqiang Jiang (ICT, China Academy of Science)"

 

2485 - ATRW: A Benchmark for Amur Tiger Re-identification in the Wild

Shuyuan Li (Shanghai Jiao Tong University); Jianguo Li (Ant Group)*; Hanlin Tang (Intel Corporation); Rui Qian (Shanghai Jiao Tong University); Weiyao Lin (Shanghai Jiao Tong University)

 

2486 - Emotions Don't Lie: An Audio-Visual Deepfake Detection Method using Affective Cues

"Trisha Mittal (University of Maryland)*; Uttaran Bhattacharya (University of Maryland, College Park); Rohan Chandra (University of Maryland); Aniket Bera (University of Maryland, College Park); Dinesh Manocha (UMD)"
