A 3D Convolutional Approach to Spectral Object Segmentation in Space and Time
Elena Burceanu, Marius Leordeanu
A Graph-based Interactive Reasoning for Human-Object Interaction Detection
Dongming Yang, Yuexian Zou
A Similarity Inference Metric for RGB-Infrared Cross-Modality Person Re-identification
Mengxi Jia, Yunpeng Zhai, Shijian Lu, Siwei Ma, Jian Zhang
Action-Guided Attention Mining and Relation Reasoning Network for Human-Object Interaction Detection
Xue Lin, Qi Zou, Xixia Xu
AttAN: Attention Adversarial Networks for 3D Point Cloud Semantic Segmentation
Gege Zhang, Qinghua Ma, Licheng Jiao, Fang Liu, Qigong Sun
BARNet: Bilinear Attention Network with Adaptive Receptive Fields for Surgical Instrument Segmentation
Zhen-Liang Ni, Gui-Bin Bian, Guan-An Wang, Xiao-Hu Zhou, Zeng-Guang Hou, Xiao-Liang Xie, Zhen Li, Yu-Han Wang
Bi-level Probabilistic Feature Learning for Deformable Image Registration
Risheng Liu, Zi Li, Yuxi Zhang, Xin Fan, Zhongxuan Luo
Biased Feature Learning for Occlusion Invariant Face Recognition
Changbin Shao, Jing Huo, Lei Qi, Zhen-Hua Feng, Wenbin Li, Chuanqi Dong, Yang Gao
Bidirectional Adversarial Training for Semi-Supervised Domain Adaptation
Pin Jiang, Aming Wu, Yahong Han, Yunfeng Shao, Meiyu Qi, Bingshuai Li
Bottom-up and Top-down: Bidirectional Additive Net for Edge Detection
Lianli Gao, Zhilong Zhou, Heng Tao Shen, Jingkuan Song
Channel Pruning via Automatic Structure Search
Mingbao Lin, Rongrong Ji, Yuxin Zhang, Baochang Zhang, Yongjian Wu, Yonghong Tian
Channel-Level Variable Quantization Network for Deep Image Compression
Zhisheng Zhong, Hiroaki Akutsu, Kiyoharu Aizawa
Characterizing Similarity of Visual Stimulus from Associated Neuronal Response
Vikram Ravindra, Ananth Grama
Co-Saliency Spatio-Temporal Interaction Network for Person Re-Identification in Videos
Jiawei Liu, Zheng-Jun Zha, Xierong Zhu, Na Jiang
Collaborative Learning of Depth Estimation, Visual Odometry and Camera Relocalization from Monocular Videos
Haimei Zhao, Wei Bian, Bo Yuan, Dacheng Tao
Consistent Domain Structure Learning and Domain Alignment for 2D Image-Based 3D Objects Retrieval
Yuting Su, Yuqian Li, Dan Song, Weizhi Nie, Wenhui Li, An-An Liu
CP-NAS: Child-Parent Neural Architecture Search for 1-bit CNNs
Li'an Zhuo, Baochang Zhang, Hanlin Chen, Linlin Yang, Chen Chen, Yanjun Zhu, David Doermann
Cross-denoising Network against Corrupted Labels in Medical Image Segmentation with Domain Shift
Qinming Zhang, Luyan Liu, Kai Ma, Cheng Zhuo, Yefeng Zheng
DAM: Deliberation, Abandon and Memory Networks for Generating Detailed and Non-repetitive Responses in Visual Dialogue
Xiaoze Jiang, Jing Yu, Yajing Sun, Zengchang Qin, Zihao Zhu, Yue Hu, Qi Wu
Deep Interleaved Network for Single Image Super-Resolution with Asymmetric Co-Attention
Feng Li, Runmin Cong, Huihui Bai, Yifan He
Deep Polarized Network for Supervised Learning of Accurate Binary Hashing Codes
Lixin Fan, Kam Woh Ng, Ce Ju, Tianyu Zhang, Chee Seng Chan
Detecting Adversarial Attacks via Subset Scanning of Autoencoder Activations and Reconstruction Error
Celia Cintas, Skyler Speakman, Victor Akinwande, William Ogallo, Komminist Weldemariam, Srihari Sridharan, Edward McFowland
Diagnosing the Environment Bias in Vision-and-Language Navigation
Yubo Zhang, Hao Tan, Mohit Bansal
DIDFuse: Deep Image Decomposition for Infrared and Visible Image Fusion
Zixiang Zhao, Shuang Xu, Chunxia Zhang, Junmin Liu, Jiangshe Zhang, Pengfei Li
Disentangled Feature Learning Network for Vehicle Re-Identification
Yan Bai, Yihang Lou, Yongxing Dai, Jun Liu, Ziqian Chen, Ling-Yu Duan
Dress like an Internet Celebrity: Fashion Retrieval in Videos
Hongrui Zhao, Jin Yu, Yanan Li, Donghui Wang, Jie Liu, Hongxia Yang, Fei Wu
Dynamic Language Binding in Relational Visual Reasoning
Thao Minh Le, Vuong Le, Svetha Venkatesh, Truyen Tran
E3SN: Efficient End-to-End Siamese Network for Video Object Segmentation
Meng Lan, Yipeng Zhang, Qinning Xu, Lefei Zhang
EViLBERT: Learning Task-Agnostic Multimodal Sense Embeddings
Agostina Calabrese, Michele Bevilacqua, Roberto Navigli
Exploiting Visual Semantic Reasoning for Video-Text Retrieval
Zerun Feng, Zhimin Zeng, Caili Guo, Zheng Li
Feature Augmented Memory with Global Attention Network for VideoQA
Jiayin Cai, Chun Yuan, Cheng Shi, Lei Li, Yangyang Cheng, Ying Shan
Few-shot Human Motion Prediction via Learning Novel Motion Dynamics
Chuanqi Zang, Mingtao Pei, Yu Kong
Few-shot Visual Learning with Contextual Memory and Fine-grained Calibration
Yuqing Ma, Wei Liu, Shihao Bai, Qingyu Zhang, Aishan Liu, Weimin Chen, Xianglong Liu
G2RL: Geometry-Guided Representation Learning for Facial Action Unit Intensity Estimation
Yingruo Fan, Zhaojiang Lin
Generating Person Images with Appearance-aware Pose Stylizer
Siyu Huang, Haoyi Xiong, Zhi-Qi Cheng, Qingzhong Wang, Xingran Zhou, Bihan Wen, Jun Huan, Dejing Dou
GestureDet: Real-time Student Gesture Analysis with Multi-dimensional Attention-based Detector
Rui Zheng, Fei Jiang, Ruimin Shen
GSM: Graph Similarity Model for Multi-Object Tracking
Qiankun Liu, Qi Chu, Bin Liu, Nenghai Yu
HAF-SVG: Hierarchical Stochastic Video Generation with Aligned Features
Zhihui Lin, Chun Yuan, Maomao Li
Hierarchical Attention Based Spatial-Temporal Graph-to-Sequence Learning for Grounded Video Description
Kai Shen, Lingfei Wu, Fangli Xu, Siliang Tang, Jun Xiao, Yueting Zhuang
Hierarchical Instance Feature Alignment for 2D Image-Based 3D Shape Retrieval
Heyu Zhou, Weizhi Nie, Wenhui Li, Dan Song, An-An Liu
Human Consensus-Oriented Image Captioning
Ziwei Wang, Zi Huang, Yadan Luo
JPEG Artifacts Removal via Compression Quality Ranker-Guided Networks
Menglu Wang, Xueyang Fu, Zepei Sun, Zheng-Jun Zha
k-SDPP: Fixed-Size Video Summarization via Sequential Determinantal Point Processes
Jiping Zheng, Ganfeng Lu
Label-Attended Hashing for Multi-Label Image Retrieval
Yanzhao Xie, Yu Liu, Yangtao Wang, Lianli Gao, Peng Wang, Ke Zhou
Large Scale Audiovisual Learning of Sounds with Weakly Labeled Data
Haytham M. Fayek, Anurag Kumar
Latent Regularized Generative Dual Adversarial Network For Abnormal Detection
Chengwei Chen, Jing Liu, Yuan Xie, Yin Xiao Ban, Chunyun Wu, Yiqing Tao, Haichuan Song
Learning from the Scene and Borrowing from the Rich: Tackling the Long Tail in Scene Graph Generation
Tao He, Lianli Gao, Jingkuan Song, Jianfei Cai, Yuan-Fang Li
Learning Task-aware Local Representations for Few-shot Learning
Chuanqi Dong, Wenbin Li, Jing Huo, Zheng Gu, Yang Gao
Learning to Discretely Compose Reasoning Module Networks for Video Captioning
Ganchao Tan, Daqing Liu, Meng Wang, Zheng-Jun Zha
Lifelong Zero-Shot Learning
Kun Wei, Cheng Deng, Xu Yang
Meta Segmentation Network for Ultra-Resolution Medical Images
Tong Wu, Bicheng Dai, Shuxin Chen, Yanyun Qu, Yuan Xie
Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering
Zihao Zhu, Jing Yu, Yujing Wang, Yajing Sun, Yue Hu, Qi Wu
Multi-attention Meta Learning for Few-shot Fine-grained Image Recognition
Yaohui Zhu, Chenlong Liu, Shuqiang Jiang
Multi-graph Fusion for Functional Neuroimaging Biomarker Detection
Jiangzhang Gan, Xiaofeng Zhu, Rongyao Hu, Yonghua Zhu, Junbo Ma, Ziwen Peng, Guorong Wu
Multi-Scale Spatial-Temporal Integration Convolutional Tube for Human Action Recognition
Haoze Wu, Jiawei Liu, Xierong Zhu, Meng Wang, Zheng-Jun Zha
Multichannel Color Image Denoising via Weighted Schatten p-norm Minimization
Xinjian Huang, Bo Du, Weiwei Liu
Non-Autoregressive Image Captioning with Counterfactuals-Critical Multi-Agent Learning
Longteng Guo, Jing Liu, Xinxin Zhu, Xingjian He, Jie Jiang, Hanqing Lu
Object-Aware Multi-Branch Relation Networks for Spatio-Temporal Video Grounding
Zhu Zhang, Zhou Zhao, Zhijie Lin, Baoxing Huai, Jing Yuan
Overcoming Language Priors with Self-supervised Learning for Visual Question Answering
Xi Zhu, Zhendong Mao, Chunxiao Liu, Peng Zhang, Bin Wang, Yongdong Zhang
Overflow Aware Quantization: Accelerating Neural Network Inference by Low-bit Multiply-Accumulate Operations
Hongwei Xie, Yafei Song, Ling Cai, Mingyang Li
Pay Attention to Devils: A Photometric Stereo Network for Better Details
Yakun Ju, Kin-Man Lam, Yang Chen, Lin Qi, Junyu Dong
Polar Relative Positional Encoding for Video-Language Segmentation
Ke Ning, Lingxi Xie, Fei Wu, Qi Tian
Position-Aware Recalibration Module: Learning From Feature Semantics and Feature Position
Xu Ma, Song Fu
Progressive Domain-Independent Feature Decomposition Network for Zero-Shot Sketch-Based Image Retrieval
Xinxun Xu, Muli Yang, Yanhua Yang, Hao Wang
Real-World Automatic Makeup via Identity Preservation Makeup Net
Zhikun Huang, Zhedong Zheng, Chenggang Yan, Hongtao Xie, Yaoqi Sun, Jianzhong Wang, Jiyong Zhang
Recurrent Relational Memory Network for Unsupervised Image Captioning
Dan Guo, Yang Wang, Peipei Song, Meng Wang
Reference Guided Face Component Editing
Qiyao Deng, Jie Cao, Yunfan Liu, Zhenhua Chai, Qi Li, Zhenan Sun
SBAT: Video Captioning with Sparse Boundary-Aware Transformer
Tao Jin, Siyu Huang, Ming Chen, Yingming Li, Zhongfei Zhang
SceneEncoder: Scene-Aware Semantic Segmentation of Point Clouds with A Learnable Scene Descriptor
Jiachen Xu, Jingyu Gong, Jie Zhou, Xin Tan, Yuan Xie, Lizhuang Ma
SelectScale: Mining More Patterns from Images via Selective and Soft Dropout
Zhengsu Chen, Jianwei Niu, Xuefeng Liu, Shaojie Tang
Self-Supervised Gait Encoding with Locality-Aware Attention for Person Re-Identification
Haocong Rao, Siqi Wang, Xiping Hu, Mingkui Tan, Huang Da, Jun Cheng, Bin Hu
Self-supervised Monocular Depth and Visual Odometry Learning with Scale-consistent Geometric Constraints
Mingkang Xiong, Zhenghong Zhang, Weilin Zhong, Jinsheng Ji, Jiyuan Liu, Huilin Xiong
Self-Supervised Tuning for Few-Shot Segmentation
Kai Zhu, Wei Zhai, Yang Cao
Semi-Dynamic Hypergraph Neural Network for 3D Pose Estimation
Shengyuan Liu, Pei Lv, Yuzhen Zhang, Jie Fu, Junjin Cheng, Wanqing Li, Bing Zhou, Mingliang Xu
Set and Rebase: Determining the Semantic Graph Connectivity for Unsupervised Cross-Modal Hashing
Weiwei Wang, Yuming Shen, Haofeng Zhang, Yazhou Yao, Li Liu
SimPropNet: Improved Similarity Propagation for Few-shot Image Segmentation
Siddhartha Gairola, Mayur Hemani, Ayush Chopra, Balaji Krishnamurthy
Spatiotemporal Super-Resolution with Cross-Task Consistency and Its Semi-supervised Extension
Han-Yi Lin, Pi-Cheng Hsiu, Tei-Wei Kuo, Yen-Yu Lin
Super-Resolution and Inpainting with Degraded and Upgraded Generative Adversarial Networks
Yawen Huang, Feng Zheng, Danyang Wang, Junyu Jiang, Xiaoqian Wang, Ling Shao
Temporal Adaptive Alignment Network for Deep Video Inpainting
Ruixin Liu, Zhenyu Weng, Yuesheng Zhu, Bairong Li
TextFuseNet: Scene Text Detection with Richer Fused Features
Jian Ye, Zhe Chen, Juhua Liu, Bo Du
TLPG-Tracker: Joint Learning of Target Localization and Proposal Generation for Visual Tracking
Siyuan Li, Zhi Zhang, Ziyu Liu, Anna Wang, Linglong Qiu, Feng Du
Transductive Relation-Propagation Network for Few-shot Learning
Yuqing Ma, Shihao Bai, Shan An, Wei Liu, Aishan Liu, Xiantong Zhen, Xianglong Liu
TRP: Trained Rank Pruning for Efficient Deep Neural Networks
Yuhui Xu, Yuxi Li, Shuai Zhang, Wei Wen, Botao Wang, Yingyong Qi, Yiran Chen, Weiyao Lin, Hongkai Xiong
Unsupervised Scene Adaptation with Memory Regularization in vivo
Zhedong Zheng, Yi Yang
Unsupervised Vehicle Re-identification with Progressive Adaptation
Jinjia Peng, Yang Wang, Huibing Wang, Zhao Zhang, Xianping Fu, Meng Wang
Video Question Answering on Screencast Tutorials
Wentian Zhao, Seokhwan Kim, Ning Xu, Hailin Jin
Visual Encoding and Decoding of the Human Brain Based on Shared Features
Chao Li, Baolin Liu, Jianguo Wei
Weakly Supervised Few-shot Object Segmentation using Co-Attention with Visual and Semantic Embeddings
Mennatullah Siam, Naren Doraiswamy, Boris N. Oreshkin, Hengshuai Yao, Martin Jagersand
Weakly Supervised Local-Global Relation Network for Facial Expression Recognition
Haifeng Zhang, Wen Su, Jun Yu, Zengfu Wang
When Pedestrian Detection Meets Nighttime Surveillance: A New Benchmark
Xiao Wang, Jun Chen, Zheng Wang, Wu Liu, Shin'ichi Satoh, Chao Liang, Chia-Wen Lin
Zero-Shot Object Detection via Learning an Embedding from Semantic Space to Visual Space
Licheng Zhang, Xianzhi Wang, Lina Yao, Lin Wu, Feng Zheng