CVPR2017论文
作为计算机视觉领域的三大顶级会议之一,CVPR 2017 又收录了很多优秀的文章。具体可参见 CVPR 的论文官网:http://www.cvpapers.com/cvpr2017.html
Spotlight 1-1A (关注的焦点 1-1 A)
Exclusivity-Consistency Regularized Multi-View Subspace Clustering
Xiaojie Guo, Xiaobo Wang, Zhen Lei, Changqing Zhang, Stan Z. Li
Borrowing Treasures From the Wealthy: Deep Transfer Learning Through Selective Joint Fine-Tuning
Weifeng Ge, Yizhou Yu
The More You Know: Using Knowledge Graphs for Image Classification
Kenneth Marino, Ruslan Salakhutdinov, Abhinav Gupta
Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs
Martin Simonovsky, Nikos Komodakis
Convolutional Neural Network Architecture for Geometric Matching
Ignacio Rocco, Relja Arandjelović, Josef Sivic
Deep Affordance-Grounded Sensorimotor Object Recognition
Spyridon Thermos, Georgios Th. Papadopoulos, Petros Daras, Gerasimos Potamianos
Discovering Causal Signals in Images
David Lopez-Paz, Robert Nishihara, Soumith Chintala, Bernhard Schölkopf, Léon Bottou
On Compressing Deep Models by Low Rank and Sparse Decomposition
Xiyu Yu, Tongliang Liu, Xinchao Wang, Dacheng Tao
Oral 1-1A (口头汇报 1-1A)
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
Charles R. Qi, Hao Su, Kaichun Mo, Leonidas J. Guibas
Universal Adversarial Perturbations
Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi, Omar Fawzi, Pascal Frossard
Unsupervised Pixel-Level Domain Adaptation With Generative Adversarial Networks
Konstantinos Bousmalis, Nathan Silberman, David Dohan, Dumitru Erhan, Dilip Krishnan
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network (PDF,code)
Christian Ledig, Lucas Theis, Ferenc Huszár, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, Wenzhe Shi
Spotlight 1-1B(关注的焦点 1-1 B)
Context-Aware Captions From Context-Agnostic Supervision
Ramakrishna Vedantam, Samy Bengio, Kevin Murphy, Devi Parikh, Gal Chechik
Global Hypothesis Generation for 6D Object Pose Estimation (PDF)
Frank Michel, Alexander Kirillov, Eric Brachmann, Alexander Krull, Stefan Gumhold, Bogdan Savchynskyy, Carsten Rother
A Practical Method for Fully Automatic Intrinsic Camera Calibration Using Directionally Encoded Light
Mahdi Abbaspour Tehrani, Thabo Beeler, Anselm Grundhöfer
CATS: A Color and Thermal Stereo Benchmark
Wayne Treible, Philip Saponaro, Scott Sorensen, Abhishek Kolagunda, Michael O'Neal, Brian Phelan, Kelly Sherbondy, Chandra Kambhamettu
Elastic Shape-From-Template With Spatially Sparse Deforming Forces
Abed Malti, Cédric Herzet
Distinguishing the Indistinguishable: Exploring Structural Ambiguities via Geodesic Context
Qingan Yan, Long Yang, Ling Zhang, Chunxia Xiao
Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation
Dan Xu, Elisa Ricci, Wanli Ouyang, Xiaogang Wang, Nicu Sebe
Dynamic Time-Of-Flight
Michael Schober, Amit Adam, Omer Yair, Shai Mazor, Sebastian Nowozin
Oral 1-1B
Semantic Scene Completion From a Single Depth Image
Shuran Song, Fisher Yu, Andy Zeng, Angel X. Chang, Manolis Savva, Thomas Funkhouser
3DMatch: Learning Local Geometric Descriptors From RGB-D Reconstructions
Andy Zeng, Shuran Song, Matthias Nießner, Matthew Fisher, Jianxiong Xiao, Thomas Funkhouser
Multi-View Supervision for Single-View Reconstruction via Differentiable Ray Consistency (PDF,project,code)
Shubham Tulsiani,Tinghui Zhou,Alexei A. Efros,Jitendra Malik
On-The-Fly Adaptation of Regression Forests for Online Camera Relocalisation (PDF)
Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien Valentin, Luigi Di Stefano,Philip H. S. Torr
Spotlight 1-1C
Designing Effective Inter-Pixel Information Flow for Natural Image Matting
Yağiz Aksoy, Tunç Ozan Aydin, Marc Pollefeys
Deep Video Deblurring for Hand-Held Cameras
Shuochen Su, Mauricio Delbracio, Jue Wang, Guillermo Sapiro, Wolfgang Heidrich, Oliver Wang
Instance-Level Salient Object Segmentation
Guanbin Li, Yuan Xie, Liang Lin, Yizhou Yu
Deep Multi-Scale Convolutional Neural Network for Dynamic Scene Deblurring
Seungjun Nah, Tae Hyun Kim, Kyoung Mu Lee
Diversified Texture Synthesis With Feed-Forward Networks
Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang
Radiometric Calibration for Internet Photo Collections (PDF)
Zhipeng Mo, Boxin Shi, Sai-Kit Yeung, Yasuyuki Matsushita
Deeply Aggregated Alternating Minimization for Image Restoration
Youngjung Kim, Hyungjoo Jung, Dongbo Min, Kwanghoon Sohn
End-To-End Instance Segmentation With Recurrent Attention
Mengye Ren, Richard S. Zemel
分类:实例物体分割
简要:使用端到端的递归神经网络进行实例物体分割.本文针对实例分割使用递归神经网络(RNN)架构将每个物体依次定位分割出来,使用了一个注意机制模型类似人类的计算过程。
Oral 1-1C
SRN: Side-output Residual Network for Object Symmetry Detection in the Wild
Wei Ke, Jie Chen, Jianbin Jiao, Guoying Zhao, Qixiang Ye
Deep Image Matting (PDF,abstract)
Ning Xu, Brian Price, Scott Cohen, Thomas Huang
Wetness and Color From a Single Multispectral Image
Mihoko Shimano, Hiroki Okawa, Yuta Asano, Ryoma Bise, Ko Nishino, Imari Sato
FC4: Fully Convolutional Color Constancy With Confidence-Weighted Pooling
Yuanming Hu, Baoyuan Wang, Stephen Lin
3D Computer Vision
Face Normals “In-The-Wild†Using Fully Convolutional Networks
George Trigeorgis, Patrick Snape, Iasonas Kokkinos, Stefanos Zafeiriou
A Non-Convex Variational Approach to Photometric Stereo Under Inaccurate Lighting
Yvain Quéau, Tao Wu, François Lauze, Jean-Denis Durou, Daniel Cremers
A Linear Extrinsic Calibration of Kaleidoscopic Imaging System From Single 3D Point
Kosuke Takahashi, Akihiro Miyata, Shohei Nobuhara, Takashi Matsuyama
Polarimetric Multi-View Stereo
Zhaopeng Cui, Jinwei Gu, Boxin Shi, Ping Tan, Jan Kautz
An Exact Penalty Method for Locally Convergent Maximum Consensus (PDF,code)
Huu Le, Tat-Jun Chin, David Suter
Deep Supervision With Shape Concepts for Occlusion-Aware 3D Object Parsing
Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, Gregory D. Hager, Manmohan Chandraker
Amodal Detection of 3D Objects: Inferring 3D Bounding Boxes From 2D Ones in RGB-Depth Images
Zhuo Deng, Longin Jan Latecki
Analyzing Humans in Images
Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection
Guillermo Garcia-Hernando, Tae-Kyun Kim
Scene Flow to Action Map: A New Representation for RGB-D Based Action Recognition With Convolutional Neural Networks
Pichao Wang, Wanqing Li, Zhimin Gao, Yuyao Zhang, Chang Tang, Philip Ogunbona
Detecting Masked Faces in the Wild With LLE-CNNs
Shiming Ge, Jia Li, Qiting Ye, Zhao Luo
A Domain Based Approach to Social Relation Recognition
Qianru Sun, Bernt Schiele, Mario Fritz
Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition
Junwu Weng, Chaoqun Weng, Junsong Yuan
Personalizing Gesture Recognition Using Hierarchical Bayesian Neural Networks
Ajjen Joshi, Soumya Ghosh, Margrit Betke, Stan Sclaroff, Hanspeter Pfister
Applications
Real-Time 3D Model Tracking in Color and Depth on a Single CPU Core
Wadim Kehl, Federico Tombari, Slobodan Ilic, Nassir Navab
Multi-Scale FCN With Cascaded Instance Aware Segmentation for Arbitrary Oriented Word Spotting in the Wild
Dafang He, Xiao Yang, Chen Liang, Zihan Zhou, Alexander G. Ororbi II, Daniel Kifer, C. Lee Giles
Viraliency: Pooling Local Virality
Xavier Alameda-Pineda, Andrea Pilzer, Dan Xu, Nicu Sebe, Elisa Ricci
Biomedical Image/Video Analysis
A Non-Local Low-Rank Framework for Ultrasound Speckle Reduction
Lei Zhu, Chi-Wing Fu, Michael S. Brown, Pheng-Ann Heng
Image Motion & Tracking
Video Acceleration Magnification
Silvia L. Pintea, Yichao Zhang, Jan C. van Gemert
Superpixel-Based Tracking-By-Segmentation Using Markov Chains
Donghun Yeo, Jeany Son, Bohyung Han, Joon Hee Han
BranchOut: Regularization for Online Ensemble Tracking With Convolutional Neural Networks
Bohyung Han, Jack Sim, Hartwig Adam
Learning Motion Patterns in Videos
Pavel Tokmakov, Karteek Alahari, Cordelia Schmid
Low- & Mid-Level Vision
Deep Level Sets for Salient Object Detection
Ping Hu, Bing Shuai, Jun Liu, Gang Wang
Binary Constraint Preserving Graph Matching
Bo Jiang, Jin Tang, Chris Ding, Bin Luo
From Local to Global: Edge Profiles to Camera Motion in Blurred Images
Subeesh Vasu, A. N. Rajagopalan
What Is the Space of Attenuation Coefficients in Underwater Computer Vision?
Derya Akkaynak, Tali Treibitz, Tom Shlesinger, Yossi Loya, Raz Tamir, David Iluz
Robust Energy Minimization for BRDF-Invariant Shape From Light Fields
Zhengqin Li, Zexiang Xu, Ravi Ramamoorthi, Manmohan Chandraker
Boundary-Aware Instance Segmentation
Zeeshan Hayder, Xuming He, Mathieu Salzmann
Spatially-Varying Blur Detection Based on Multiscale Fused and Sorted Transform Coefficients of Gradient Magnitudes
S. Alireza Golestaneh, Lina J. Karam
Model-Based Iterative Restoration for Binary Document Image Compression With Dictionary Learning
Yandong Guo, Cheng Lu, Jan P. Allebach, Charles A. Bouman
FCSS: Fully Convolutional Self-Similarity for Dense Semantic Correspondence
Seungryong Kim, Dongbo Min, Bumsub Ham, Sangryul Jeon, Stephen Lin, Kwanghoon Sohn
Machine Learning
Learning by Association — A Versatile Semi-Supervised Training Method for Neural Networks
Philip Haeusser, Alexander Mordvintsev, Daniel Cremers
Dilated Residual Networks
Fisher Yu, Vladlen Koltun, Thomas Funkhouser
Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction
Richard Zhang, Phillip Isola, Alexei A. Efros
Nonnegative Matrix Underapproximation for Robust Multiple Model Fitting
Mariano Tepper, Guillermo Sapiro
Truncated Max-Of-Convex Models
Pankaj Pansari, M. Pawan Kumar
Additive Component Analysis
Calvin Murdock, Fernando De la Torre
Subspace Clustering via Variance Regularized Ridge Regression
Zhao Kang, Chong Peng, Qiang Cheng
The Incremental Multiresolution Matrix Factorization Algorithm
Vamsi K. Ithapu, Risi Kondor, Sterling C. Johnson, Vikas Singh
Transformation-Grounded Image Generation Network for Novel 3D View Synthesis
Eunbyung Park, Jimei Yang, Ersin Yumer, Duygu Ceylan, Alexander C. Berg
Learning Dynamic Guidance for Depth Image Enhancement (PDF)
Shuhang Gu, Wangmeng Zuo, Shi Guo, Yunjin Chen, Chongyu Chen, Lei Zhang
A-Lamp: Adaptive Layout-Aware Multi-Patch Deep Convolutional Neural Network for Photo Aesthetic Assessment (PDF)
Shuang Ma, Jing Liu, Chang Wen Chen
Teaching Compositionality to CNNs
Austin Stone, Huayan Wang, Michael Stark, Yi Liu, D. Scott Phoenix, Dileep George
Using Ranking-CNN for Age Estimation
Shixing Chen, Caojin Zhang, Ming Dong, Jialiang Le, Mike Rao
Accurate Single Stage Detector Using Recurrent Rolling Convolution
Jimmy Ren, Xiaohao Chen, Jianbo Liu, Wenxiu Sun, Jiahao Pang, Qiong Yan, Yu-Wing Tai, Li Xu
A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation
Chunpeng Wu, Wei Wen, Tariq Afzal, Yongmei Zhang, Yiran Chen, Hai (Helen) Li
The Impact of Typicality for Informative Representative Selection
Jawadul H. Bappy, Sujoy Paul, Ertem Tuncel, Amit K. Roy-Chowdhury
Infinite Variational Autoencoder for Semi-Supervised Learning
M. Ehsan Abbasnejad, Anthony Dick, Anton van den Hengel
SurfNet: Generating 3D Shape Surfaces Using Deep Residual Networks
Ayan Sinha, Asim Unmesh, Qixing Huang, Karthik Ramani
Intrinsic Grassmann Averages for Online Linear and Robust Subspace Learning
Rudrasis Chakraborty, Søren Hauberg, Baba C. Vemuri
Variational Bayesian Multiple Instance Learning With Gaussian Processes
Manuel Haußmann, Fred A. Hamprecht, Melih Kandemir
Temporal Attention-Gated Model for Robust Sequence Classification
Wenjie Pei, Tadas Baltrušaitis, David M.J. Tax, Louis-Philippe Morency
Non-Uniform Subset Selection for Active Learning in Structured Data
Sujoy Paul, Jawadul H. Bappy, Amit K. Roy-Chowdhury
Colorization as a Proxy Task for Visual Understanding
Gustav Larsson, Michael Maire, Gregory Shakhnarovich
Shading Annotations in the Wild
Balazs Kovacs, Sean Bell, Noah Snavely, Kavita Bala
LCNN: Lookup-Based Convolutional Neural Network
Hessam Bagherinezhad, Mohammad Rastegari, Ali Farhadi
Object Recognition & Scene Understanding
Physics Inspired Optimization on Semantic Transfer Features: An Alternative Method for Room Layout Estimation
Hao Zhao, Ming Lu, Anbang Yao, Yiwen Guo, Yurong Chen, Li Zhang
Pixelwise Instance Segmentation With a Dynamically Instantiated Network
Anurag Arnab, Philip H. S. Torr
Object Detection in Videos With Tubelet Proposal Networks
Kai Kang, Hongsheng Li, Tong Xiao, Wanli Ouyang, Junjie Yan, Xihui Liu, Xiaogang Wang
AMVH: Asymmetric Multi-Valued Hashing
Cheng Da, Shibiao Xu, Kun Ding, Gaofeng Meng, Shiming Xiang, Chunhong Pan
Spindle Net: Person Re-Identification With Human Body Region Guided Feature Decomposition and Fusion
Haiyu Zhao, Maoqing Tian, Shuyang Sun, Jing Shao, Junjie Yan, Shuai Yi, Xiaogang Wang, Xiaoou Tang
Deep Visual-Semantic Quantization for Efficient Image Retrieval
Yue Cao, Mingsheng Long, Jianmin Wang, Shichen Liu
Efficient Diffusion on Region Manifolds: Recovering Small Objects With Compact CNN Representations
Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Teddy Furon, Ondřej Chum
Feature Pyramid Networks for Object Detection
Tsung-Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, Serge Belongie
Mind the Class Weight Bias: Weighted Maximum Mean Discrepancy for Unsupervised Domain Adaptation
Hongliang Yan, Yukang Ding, Peihua Li, Qilong Wang, Yong Xu, Wangmeng Zuo
StyleNet: Generating Attractive Visual Captions With Styles
Chuang Gan, Zhe Gan, Xiaodong He, Jianfeng Gao, Li Deng
Fine-Grained Recognition of Thousands of Object Categories With Single-Example Training
Leonid Karlinsky, Joseph Shtok, Yochay Tzur, Asaf Tzadok
Improving Interpretability of Deep Neural Networks With Semantic Information
Yinpeng Dong, Hang Su, Jun Zhu, Bo Zhang
Video Captioning With Transferred Semantic Attributes
Yingwei Pan, Ting Yao, Houqiang Li, Tao Mei
Fast Boosting Based Detection Using Scale Invariant Multimodal Multiresolution Filtered Features
Arthur Daniel Costea, Robert Varga, Sergiu Nedevschi
Temporal Convolutional Networks for Action Segmentation and Detection
Colin Lea, Michael D. Flynn, René Vidal, Austin Reiter, Gregory D. Hager
Surveillance Video Parsing With Single Frame Supervision
Si Liu, Changhu Wang, Ruihe Qian, Han Yu, Renda Bao, Yao Sun
簡要:监视视频解析,将视频帧分成多个标签,即脸,裤子,左腿,有广泛的应用。
Weakly Supervised Actor-Action Segmentation via Robust Multi-Task Ranking
Yan Yan, Chenliang Xu, Dawen Cai, Jason J. Corso
Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos
De-An Huang, Joseph J. Lim, Li Fei-Fei, Juan Carlos Niebles
Zero-Shot Action Recognition With Error-Correcting Output Codes
Jie Qin, Li Liu, Ling Shao, Fumin Shen, Bingbing Ni, Jiaxin Chen, Yunhong Wang
Enhancing Video Summarization via Vision-Language Embedding
Bryan A. Plummer, Matthew Brown, Svetlana Lazebnik
Synthesizing Dynamic Patterns by Spatial-Temporal Generative ConvNet
Jianwen Xie, Song-Chun Zhu, Ying Nian Wu
Discriminative Bimodal Networks for Visual Localization and Detection With Natural Language Queries
Yuting Zhang, Luyao Yuan, Yijie Guo, Zhiyuan He, I-An Huang, Honglak Lee
Automatic Understanding of Image and Video Advertisements
Zaeem Hussain, Mingda Zhang, Xiaozhong Zhang, Keren Ye, Christopher Thomas, Zuha Agha, Nathan Ong, Adriana Kovashka
Deep Sketch Hashing: Fast Free-Hand Sketch-Based Image Retrieval
Li Liu, Fumin Shen, Yuming Shen, Xianglong Liu, Ling Shao
Discover and Learn New Objects From Documentaries
Kai Chen, Hang Song, Chen Change Loy, Dahua Lin
Spatial-Semantic Image Search by Visual Feature Synthesis
Long Mai, Hailin Jin, Zhe Lin, Chen Fang, Jonathan Brandt, Feng Liu
Fully-Adaptive Feature Sharing in Multi-Task Networks With Applications in Person Attribute Classification
Yongxi Lu, Abhishek Kumar, Shuangfei Zhai, Yu Cheng, Tara Javidi, Rogerio Feris
Semantic Compositional Networks for Visual Captioning
Zhe Gan, Chuang Gan, Xiaodong He, Yunchen Pu, Kenneth Tran, Jianfeng Gao, Lawrence Carin, Li Deng
Training Object Class Detectors With Click Supervision
Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari
Oral 1-2A
Deep Reinforcement Learning-Based Image Captioning With Embedding Reward
Zhou Ren, Xiaoyu Wang, Ning Zhang, Xutao Lv, Li-Jia Li
From Red Wine to Red Tomato: Composition With Context
Ishan Misra, Abhinav Gupta, Martial Hebert
Captioning Images With Diverse Objects
Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Raymond Mooney, Trevor Darrell, Kate Saenko
Self-Critical Sequence Training for Image Captioning
Steven J. Rennie, Etienne Marcheret, Youssef Mroueh, Jerret Ross, Vaibhava Goel
Spotlight 1-2B
Crossing Nets: Combining GANs and VAEs With a Shared Latent Space for Hand Pose Estimation
Chengde Wan, Thomas Probst, Luc Van Gool, Angela Yao
Predicting Behaviors of Basketball Players From First Person Videos
Shan Su, Jung Pyo Hong, Jianbo Shi, Hyun Soo Park
LCR-Net: Localization-Classification-Regression for Human Pose
Grégory Rogez, Philippe Weinzaepfel, Cordelia Schmid
Learning Residual Images for Face Attribute Manipulation
Wei Shen, Rujie Liu
Seeing What Is Not There: Learning Context to Determine Where Objects Are Missing
Jin Sun, David W. Jacobs
Deep Learning on Lie Groups for Skeleton-Based Action Recognition
Zhiwu Huang, Chengde Wan, Thomas Probst, Luc Van Gool
Harvesting Multiple Views for Marker-Less 3D Human Pose Annotations
Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis
Coarse-To-Fine Volumetric Prediction for Single-Image 3D Human Pose
Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis
Oral 1-2B
Weakly Supervised Action Learning With RNN Based Fine-To-Coarse Modeling
Alexander Richard, Hilde Kuehne, Juergen Gall
Disentangled Representation Learning GAN for Pose-Invariant Face Recognition
Luan Tran, Xi Yin, Xiaoming Liu
ArtTrack: Articulated Multi-Person Tracking in the Wild
Eldar Insafutdinov, Mykhaylo Andriluka, Leonid Pishchulin, Siyu Tang, Evgeny Levinkov, Bjoern Andres, Bernt Schiele
Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields (PDF,code)
Zhe Cao, Tomas Simon, Shih-En Wei, Yaser Sheikh
Spotlight 1-2C
Template Matching With Deformable Diversity Similarity
Itamar Talmi, Roey Mechrez, Lihi Zelnik-Manor
Beyond Triplet Loss: A Deep Quadruplet Network for Person Re-Identification
Weihua Chen, Xiaotang Chen, Jianguo Zhang, Kaiqi Huang
Agent-Centric Risk Assessment: Accident Anticipation and Risky Region Localization
Kuo-Hao Zeng, Shih-Han Chou, Fu-Hsiang Chan, Juan Carlos Niebles, Min Sun
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos
Linchao Zhu, Zhongwen Xu, Yi Yang
Action-Decision Networks for Visual Tracking With Deep Reinforcement Learning
Sangdoo Yun, Jongwon Choi, Youngjoon Yoo, Kimin Yun, Jin Young Choi
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering
Yunseok Jang, Yale Song, Youngjae Yu, Youngjin Kim, Gunhee Kim
Making 360° Video Watchable in 2D: Learning Videography for Click Free Viewing
Yu-Chuan Su, Kristen Grauman
Unsupervised Adaptive Re-Identification in Open World Dynamic Camera Networks
Rameswar Panda, Amran Bhuiyan, Vittorio Murino, Amit K. Roy-Chowdhury
Oral 1-2C
Context-Aware Correlation Filter Tracking
Matthias Mueller, Neil Smith, Bernard Ghanem
Deep 360 Pilot: Learning a Deep Agent for Piloting Through 360° Sports Videos
Hou-Ning Hu, Yen-Chen Lin, Ming-Yu Liu, Hsien-Tzu Cheng, Yung-Ju Chang, Min Sun
Slow Flow: Exploiting High-Speed Cameras for Accurate and Diverse Optical Flow Reference Data
Joel Janai, Fatma Güney, Jonas Wulff, Michael J. Black, Andreas Geiger
CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos
Zheng Shou, Jonathan Chan, Alireza Zareian, Kazuyuki Miyazawa, Shih-Fu Chang
3D Computer Vision
Exploiting 2D Floorplan for Building-Scale Panorama RGBD Alignment
Erik Wijmans, Yasutaka Furukawa
A Combinatorial Solution to Non-Rigid 3D Shape-To-Image Matching
Florian Bernard, Frank R. Schmidt, Johan Thunberg, Daniel Cremers
NID-SLAM: Robust Monocular SLAM Using Normalised Information Distance
Geoffrey Pascoe, Will Maddern, Michael Tanner, Pedro Piniés, Paul Newman
End-To-End Training of Hybrid CNN-CRF Models for Stereo
Patrick Knöbelreiter, Christian Reinbacher, Alexander Shekhovtsov, Thomas Pock
Learning Shape Abstractions by Assembling Volumetric Primitives (PDF,project,code)
Shubham Tulsiani, Hao Su, Leonidas J. Guibas, Alexei A. Efros, Jitendra Malik
Locality-Sensitive Deconvolution Networks With Gated Fusion for RGB-D Indoor Semantic Segmentation
Yanhua Cheng, Rui Cai, Zhiwei Li, Xin Zhao, Kaiqi Huang
Acquiring Axially-Symmetric Transparent Objects Using Single-View Transmission Imaging (PDF)
Jaewon Kim, Ilya Reshetouski, Abhijeet Ghosh
Regressing Robust and Discriminative 3D Morphable Models With a Very Deep Neural Network
Anh Tuấn Trần, Tal Hassner, Iacopo Masi, Gérard Medioni
End-To-End 3D Face Reconstruction With Deep Neural Networks
Pengfei Dou, Shishir K. Shah, Ioannis A. Kakadiaris
DUST: Dual Union of Spatio-Temporal Subspaces for Monocular Multiple Object 3D Reconstruction
Antonio Agudo, Francesc Moreno-Noguer
Analyzing Humans in Images
Finding Tiny Faces
Peiyun Hu, Deva Ramanan
Dynamic Facial Analysis: From Bayesian Filtering to Recurrent Neural Network
Jinwei Gu, Xiaodong Yang, Shalini De Mello, Jan Kautz
Deep Temporal Linear Encoding Networks
Ali Diba, Vivek Sharma, Luc Van Gool
Joint Registration and Representation Learning for Unconstrained Face Identification (PDF)
Munawar Hayat,Salman H. Khan,Naoufel Werghi, Roland Goecke
3D Human Pose Estimation From a Single Image via Distance Matrix Regression
Francesc Moreno-Noguer
One-Shot Metric Learning for Person Re-Identification
Slawomir BÄ…k, Peter Carr
Generalized Rank Pooling for Activity Recognition
Anoop Cherian, Basura Fernando, Mehrtash Harandi, Stephen Gould
Deep Representation Learning for Human Motion Prediction and Classification
Judith Bütepage, Michael J. Black, Danica Kragic, Hedvig Kjellström
Interspecies Knowledge Transfer for Facial Keypoint Detection
Maheen Rashid, Xiuye Gu, Yong Jae Lee
Recurrent Convolutional Neural Networks for Continuous Sign Language Recognition by Staged Optimization
Runpeng Cui, Hu Liu, Changshui Zhang
Applications
Modeling Sub-Event Dynamics in First-Person Action Recognition
Hasan F. M. Zaki, Faisal Shafait, Ajmal Mian
Computational Photography
Turning an Urban Scene Video Into a Cinemagraph
Hang Yan, Yebin Liu, Yasutaka Furukawa
Light Field Reconstruction Using Deep Convolutional Network on EPI
Gaochang Wu, Mandan Zhao, Liangyong Wang, Qionghai Dai, Tianyou Chai, Yebin Liu
Image Motion & Tracking
FlowNet 2.0: Evolution of Optical Flow Estimation With Deep Networks
Eddy Ilg, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, Alexey Dosovitskiy, Thomas Brox
Low- & Mid-Level Vision
Attention-Aware Face Hallucination via Deep Reinforcement Learning
Qingxing Cao, Liang Lin, Yukai Shi, Xiaodan Liang, Guanbin Li
Simple Does It: Weakly Supervised Instance and Semantic Segmentation
Anna Khoreva, Rodrigo Benenson, Jan Hosang, Matthias Hein, Bernt Schiele
Anti-Glare: Tightly Constrained Optimization for Eyeglass Reflection Removal
Tushar Sandhan, Jin Young Choi
Deep Joint Rain Detection and Removal From a Single Image
Wenhan Yang, Robby T. Tan, Jiashi Feng, Jiaying Liu, Zongming Guo, Shuicheng Yan
Radiometric Calibration From Faces in Images
Chen Li, Stephen Lin, Kun Zhou, Katsushi Ikeuchi
Webly Supervised Semantic Segmentation
Bin Jin, Maria V. Ortiz Segovia, Sabine Süsstrunk
Removing Rain From Single Images via a Deep Detail Network
Xueyang Fu, Jiabin Huang, Delu Zeng, Yue Huang, Xinghao Ding, John Paisley
Deep Crisp Boundaries
Yupei Wang, Xin Zhao, Kaiqi Huang
Coarse-To-Fine Segmentation With Shape-Tailored Continuum Scale Spaces
Naeemullah Khan, Byung-Woo Hong, Anthony Yezzi, Ganesh Sundaramoorthi
Large Kernel Matters — Improve Semantic Segmentation by Global Convolutional Network
Chao Peng, Xiangyu Zhang, Gang Yu, Guiming Luo, Jian Sun
Single Image Reflection Suppression
Nikolaos Arvanitopoulos, Radhakrishna Achanta, Sabine Süsstrunk
CASENet: Deep Category-Aware Semantic Edge Detection
Zhiding Yu, Chen Feng, Ming-Yu Liu, Srikumar Ramalingam
Reflectance Adaptive Filtering Improves Intrinsic Image Estimation
Thomas Nestmeyer, Peter V. Gehler
Machine Learning
Conditional Similarity Networks
Andreas Veit, Serge Belongie, Theofanis Karaletsos
Spatially Adaptive Computation Time for Residual Networks
Michael Figurnov, Maxwell D. Collins, Yukun Zhu, Li Zhang, Jonathan Huang, Dmitry Vetrov, Ruslan Salakhutdinov
Xception: Deep Learning With Depthwise Separable Convolutions
François Chollet
Feedback Networks
Amir R. Zamir, Te-Lin Wu, Lin Sun, William B. Shen, Bertram E. Shi, Jitendra Malik, Silvio Savarese
Online Summarization via Submodular and Convex Optimization
Ehsan Elhamifar, M. Clara De Paolis Kaluza
Deep MANTA: A Coarse-To-Fine Many-Task Network for Joint 2D and 3D Vehicle Analysis From Monocular Image
Florian Chabot, Mohamed Chaouch, Jaonary Rabarisoa, Céline Teulière, Thierry Chateau
Improving Pairwise Ranking for Multi-Label Image Classification
Yuncheng Li, Yale Song, Jiebo Luo
Active Convolution: Learning the Shape of Convolution for Image Classification
Yunho Jeon, Junmo Kim
Linking Image and Text With 2-Way Nets
Aviv Eisenschtat, Lior Wolf
Stacked Generative Adversarial Networks
Xun Huang, Yixuan Li, Omid Poursaeed, John Hopcroft, Serge Belongie
Image Splicing Detection via Camera Response Function Analysis
Can Chen, Scott McCloskey, Jingyi Yu
Building a Regular Decision Boundary With Deep Networks
Edouard Oyallon
More Is Less: A More Complicated Network With Less Inference Complexity
Xuanyi Dong, Junshi Huang, Yi Yang, Shuicheng Yan
Joint Graph Decomposition and Node Labeling: Problem, Algorithms, Applications
Evgeny Levinkov, Jonas Uhrig, Siyu Tang, Mohamed Omran, Eldar Insafutdinov, Alexander Kirillov, Carsten Rother, Thomas Brox, Bernt Schiele, Bjoern Andres
Scale-Aware Face Detection
Zekun Hao, Yu Liu, Hongwei Qin, Junjie Yan, Xiu Li, Xiaolin Hu
Deep Unsupervised Similarity Learning Using Partially Ordered Sets
Miguel A. Bautista, Artsiom Sanakoyeu, Björn Ommer
Generative Hierarchical Learning of Sparse FRAME Models
Jianwen Xie, Yifei Xu, Erik Nijkamp, Ying Nian Wu, Song-Chun Zhu
Object Recognition & Scene Understanding
Generating Holistic 3D Scene Abstractions for Text-Based Image Retrieval
Ang Li, Jin Sun, Joe Yue-Hei Ng, Ruichi Yu, Vlad I. Morariu, Larry S. Davis
Perceptual Generative Adversarial Networks for Small Object Detection
Jianan Li (Group: Work group, Company,... - optional),Xiaodan Liang (Group: Work group, Company,... - optional),Yunchao Wei (Group: Work group, Company,... - optional),Tingfa Xu (Group: Work group, Company,... - optional),Jiashi Feng (Group: Work group, Company,... - optional),Shuicheng Yan (Group: Work group, Company,... - optional)
Emotion Recognition in Context (PDF,supplementary material)
Ronak Kosti,Jose M. Alvarez,Adria Recasens,Agata Lapedriza
Deep Learning of Human Visual Sensitivity in Image Quality Assessment Framework
Jongyoo Kim, Sanghoon Lee
Dense Captioning With Joint Inference and Visual Context
Linjie Yang, Kevin Tang, Jianchao Yang, Li-Jia Li
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Li Fei-Fei, C. Lawrence Zitnick, Ross Girshick
Cross-View Image Matching for Geo-Localization in Urban Environments
Yicong Tian, Chen Chen, Mubarak Shah
Matrix Tri-Factorization With Manifold Regularizations for Zero-Shot Learning
Xing Xu, Fumin Shen, Yang Yang, Dongxiang Zhang, Heng Tao Shen, Jingkuan Song
Self-Supervised Learning of Visual Features Through Embedding Images Into Text Topic Spaces
Lluis Gomez, Yash Patel, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar
Learning Spatial Regularization With Image-Level Supervisions for Multi-Label Image Classification
Feng Zhu, Hongsheng Li, Wanli Ouyang, Nenghai Yu, Xiaogang Wang
Semantically Consistent Regularization for Zero-Shot Recognition
Pedro Morgado, Nuno Vasconcelos
Can Walking and Measuring Along Chord Bunches Better Describe Leaf Shapes?
Bin Wang, Yongsheng Gao, Changming Sun, Michael Blumenstein, John La Salle
Video Analytics
Self-Learning Scene-Specific Pedestrian Detectors Using a Progressive Latent Model
Qixiang Ye, Tianliang Zhang, Wei Ke, Qiang Qiu, Jie Chen, Guillermo Sapiro, Baochang Zhang
Predictive-Corrective Networks for Action Detection (project,abstract,PDF)
Achal Dave,Olga Russakovsky,Deva Ramanan
Budget-Aware Deep Semantic Video Segmentation
Behrooz Mahasseni, Sinisa Todorovic, Alan Fern
Unified Embedding and Metric Learning for Zero-Exemplar Event Detection
Noureldien Hussein, Efstratios Gavves, Arnold W.M. Smeulders
Spatiotemporal Pyramid Network for Video Action Recognition
Yunbo Wang, Mingsheng Long, Jianmin Wang, Philip S. Yu
ER3: A Unified Framework for Event Retrieval, Recognition and Recounting
Zhanning Gao, Gang Hua, Dongqing Zhang, Nebojsa Jojic, Le Wang, Jianru Xue, Nanning Zheng
FusionSeg: Learning to Combine Motion and Appearance for Fully Automatic Segmentation of Generic Objects in Videos
Suyog Dutt Jain, Bo Xiong, Kristen Grauman
Query-Focused Video Summarization: Dataset, Evaluation, and a Memory Network Based Approach
Aidean Sharghi, Jacob S. Laurel, Boqing Gong
Flexible Spatio-Temporal Networks for Video Prediction
Chaochao Lu, Michael Hirsch, Bernhard Schölkopf
Temporal Action Co-Segmentation in 3D Motion Capture Data and Videos
Konstantinos Papoutsakis, Costas Panagiotakis, Antonis A. Argyros
Spotlight 2-1A
Dual Attention Networks for Multimodal Reasoning and Matching
Hyeonseob Nam, Jung-Woo Ha, Jeonghee Kim
DESIRE: Distant Future Prediction in Dynamic Scenes With Interacting Agents
Namhoon Lee, Wongun Choi, Paul Vernaza, Christopher B. Choy, Philip H. S. Torr, Manmohan Chandraker
Interpretable Structure-Evolving LSTM
Xiaodan Liang, Liang Lin, Xiaohui Shen, Jiashi Feng, Shuicheng Yan, Eric P. Xing
ShapeOdds: Variational Bayesian Learning of Generative Shape Models
Shireen Elhabian, Ross Whitaker
Fast Video Classification via Adaptive Cascading of Deep Models
Haichen Shen, Seungyeop Han, Matthai Philipose, Arvind Krishnamurthy
Deep Metric Learning via Facility Location
Hyun Oh Song, Stefanie Jegelka, Vivek Rathod, Kevin Murphy
Semi-Supervised Deep Learning for Monocular Depth Map Prediction
Yevhen Kuznietsov, Jörg Stückler, Bastian Leibe
Weakly Supervised Semantic Segmentation Using Web-Crawled Videos
Seunghoon Hong, Donghun Yeo, Suha Kwak, Honglak Lee, Bohyung Han
Oral 2-1A
Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach
Giorgio Patrini, Alessandro Rozza, Aditya Krishna Menon, Richard Nock, Lizhen Qu
Learning From Simulated and Unsupervised Images Through Adversarial Training
Ashish Shrivastava, Tomas Pfister, Oncel Tuzel, Joshua Susskind, Wenda Wang, Russell Webb
Inverse Compositional Spatial Transformer Networks
Chen-Hsuan Lin, Simon Lucey
Densely Connected Convolutional Networks
Gao Huang, Zhuang Liu, Laurens van der Maaten, Kilian Q. Weinberger
Spotlight 2-1B
Visual Dialog
Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, José M. F. Moura, Devi Parikh, Dhruv Batra
Video Frame Interpolation via Adaptive Convolution
Simon Niklaus, Long Mai, Feng Liu
FastMask: Segment Multi-Scale Object Candidates in One Shot
Hexiang Hu, Shiyi Lan, Yuning Jiang, Zhimin Cao, Fei Sha
Reconstructing Transient Images From Single-Photon Sensors
Matthew O'Toole, Felix Heide, David B. Lindell, Kai Zang, Steven Diamond, Gordon Wetzstein
DeshadowNet: A Multi-Context Embedding Deep Network for Shadow Removal
Liangqiong Qu, Jiandong Tian, Shengfeng He, Yandong Tang, Rynson W. H. Lau
Illuminant-Camera Communication to Observe Moving Objects Under Strong External Light by Spread Spectrum Modulation
Ryusuke Sagawa, Yutaka Satoh
Photorealistic Facial Texture Inference Using Deep Neural Networks
Shunsuke Saito, Lingyu Wei, Liwen Hu, Koki Nagano, Hao Li
The Geometry of First-Returning Photons for Non-Line-Of-Sight Imaging
Chia-Yin Tsai, Kiriakos N. Kutulakos, Srinivasa G. Narasimhan, Aswin C. Sankaranarayanan
Oral 2-1B
Unrolling the Shutter: CNN to Correct Motion Distortions
Vijay Rengarajan, Yogesh Balaji, A. N. Rajagopalan
Light Field Blind Motion Deblurring
Pratul P. Srinivasan, Ren Ng, Ravi Ramamoorthi
Computational Imaging on the Electric Grid
Mark Sheinin, Yoav Y. Schechner, Kiriakos N. Kutulakos
Deep Outdoor Illumination Estimation
Yannick Hold-Geoffroy, Kalyan Sunkavalli, Sunil Hadap, Emiliano Gambaretto, Jean-François Lalonde
Spotlight 2-1C
Efficient Solvers for Minimal Problems by Syzygy-Based Reduction
Viktor Larsson, Kalle Åström, Magnus Oskarsson
HSfM: Hybrid Structure-from-Motion
Hainan Cui, Xiang Gao, Shuhan Shen, Zhanyi Hu
Efficient Global Point Cloud Alignment Using Bayesian Nonparametric Mixtures
Julian Straub, Trevor Campbell, Jonathan P. How, John W. Fisher III
A New Rank Constraint on Multi-View Fundamental Matrices, and Its Application to Camera Location Recovery
Soumyadip Sengupta, Tal Amir, Meirav Galun, Tom Goldstein, David W. Jacobs, Amit Singer, Ronen Basri
IM2CAD
Hamid Izadinia, Qi Shan, Steven M. Seitz
ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes
Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, Thomas Funkhouser, Matthias Nießner
Noise Robust Depth From Focus Using a Ring Difference Filter
Jaeheung Surh, Hae-Gon Jeon, Yunwon Park, Sunghoon Im, Hyowon Ha, In So Kweon
Group-Wise Point-Set Registration Based on Rényi's Second Order Entropy
Luis G. Sanchez Giraldo, Erion Hasanbelliu, Murali Rao, Jose C. Principe
Oral 2-1C
A Point Set Generation Network for 3D Object Reconstruction From a Single Image
Haoqiang Fan, Hao Su, Leonidas J. Guibas
3D Point Cloud Registration for Localization Using a Deep Neural Network Auto-Encoder
Gil Elbaz, Tamar Avraham, Anath Fischer
Flight Dynamics-Based Recovery of a UAV Trajectory Using Ground Cameras
Artem Rozantsev, Sudipta N. Sinha, Debadeepta Dey, Pascal Fua
DSAC - Differentiable RANSAC for Camera Localization (PDF,code,project)
Eric Brachmann, Alexander Krull, Sebastian Nowozin, Jamie Shotton, Frank Michel, Stefan Gumhold, Carsten Rother
3D Computer Vision
Scalable Surface Reconstruction From Point Clouds With Extreme Scale and Density Diversity
Christian Mostegel, Rudolf Prettenthaler, Friedrich Fraundorfer, Horst Bischof
Synthesizing 3D Shapes via Modeling Multi-View Depth Maps and Silhouettes With Deep Generative Networks
Amir Arsalan Soltani, Haibin Huang, Jiajun Wu, Tejas D. Kulkarni, Joshua B. Tenenbaum
General Models for Rational Cameras and the Case of Two-Slit Projections
Matthew Trager, Bernd Sturmfels, John Canny, Martial Hebert, Jean Ponce
Accurate Depth and Normal Maps From Occlusion-Aware Focal Stack Symmetry
Michael Strecke, Anna Alperovich, Bastian Goldluecke
A Multi-View Stereo Benchmark With High-Resolution Images and Multi-Camera Videos
Thomas Schöps, Johannes L. Schönberger, Silvano Galliani, Torsten Sattler, Konrad Schindler, Marc Pollefeys, Andreas Geiger
Non-Contact Full Field Vibration Measurement Based on Phase-Shifting
Hiroyuki Kayaba, Yuji Kokumai
A Minimal Solution for Two-View Focal-Length Estimation Using Two Affine Correspondences (PDF,code)
Daniel Barath, Tekla Toth, Levente Hajder
PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning
Alexander Krull, Eric Brachmann, Sebastian Nowozin, Frank Michel, Jamie Shotton, Carsten Rother
An Efficient Background Term for 3D Reconstruction and Tracking With Smooth Surface Models
Mariano Jaimez, Thomas J. Cashman, Andrew Fitzgibbon, Javier Gonzalez-Jimenez, Daniel Cremers
Analyzing Humans in Images
Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild
Shan Li, Weihong Deng, JunPing Du
Procedural Generation of Videos to Train Deep Action Recognition Networks
César Roberto de Souza, Adrien Gaidon, Yohann Cabon, Antonio Manuel López
BigHand2.2M Benchmark: Hand Pose Dataset and State of the Art Analysis
Shanxin Yuan, Qi Ye, Björn Stenger, Siddhant Jain, Tae-Kyun Kim
DenseReg: Fully Convolutional Dense Shape Regression In-The-Wild
Rıza Alp Güler, George Trigeorgis, Epameinondas Antonakos, Patrick Snape, Stefanos Zafeiriou, Iasonas Kokkinos
Adaptive Class Preserving Representation for Image Classification
Jian-Xun Mi, Qiankun Fu, Weisheng Li
Applications
Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval
Devraj Mandal, Kunal N. Chaudhury, Soma Biswas
EAST: An Efficient and Accurate Scene Text Detector
Xinyu Zhou, Cong Yao, He Wen, Yuzhi Wang, Shuchang Zhou, Weiran He, Jiajun Liang
VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization
Ronald Clark, Sen Wang, Andrew Markham, Niki Trigoni, Hongkai Wen
Biomedical Image/Video Analysis
Improving RANSAC-Based Segmentation Through CNN Encapsulation
Dustin Morley, Hassan Foroosh
Computational Photography
Position Tracking for Virtual Reality Using Commodity WiFi
Manikanta Kotaru, Sachin Katti
Designing Illuminant Spectral Power Distributions for Surface Classification
Henryk Blasinski, Joyce Farrell, Brian Wandell
One-Shot Hyperspectral Imaging Using Faced Reflectors
Tsuyoshi Takatani, Takahito Aoto, Yasuhiro Mukaigawa
Image Motion & Tracking
Direct Photometric Alignment by Mesh Deformation
Kaimo Lin, Nianjuan Jiang, Shuaicheng Liu, Loong-Fah Cheong, Minh Do, Jiangbo Lu
CNN-Based Patch Matching for Optical Flow With Thresholded Hinge Embedding Loss
Christian Bailer, Kiran Varanasi, Didier Stricker
Optical Flow Estimation Using a Spatial Pyramid Network
Anurag Ranjan, Michael J. Black
Deep Network Flow for Multi-Object Tracking
Manmohan Chandraker, Paul Vernaza, Wongun Choi, Samuel Schulter
Low- & Mid-Level Vision
Material Classification Using Frequency- and Depth-Dependent Time-Of-Flight Distortion
Kenichiro Tanaka, Yasuhiro Mukaigawa, Takuya Funatomi, Hiroyuki Kubo, Yasuyuki Matsushita, Yasushi Yagi
Benchmarking Denoising Algorithms With Real Photographs
Tobias Plötz, Stefan Roth
A Unified Approach of Multi-Scale Deep and Hand-Crafted Features for Defocus Estimation (PDF,project)
Jinsun Park, Yu-Wing Tai,Donghyeon Cho, In So Kweon
StyleBank: An Explicit Representation for Neural Image Style Transfer
Dongdong Chen, Lu Yuan, Jing Liao, Nenghai Yu, Gang Hua
Specular Highlight Removal in Facial Images
Chen Li, Stephen Lin, Kun Zhou, Katsushi Ikeuchi
Image Super-Resolution via Deep Recursive Residual Network
Ying Tai, Jian Yang, Xiaoming Liu
Deep Image Harmonization
Yi-Hsuan Tsai, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Xin Lu, Ming-Hsuan Yang
Learning Deep CNN Denoiser Prior for Image Restoration (PDF,code)
Kai Zhang, Wangmeng Zuo, Shuhang Gu, Lei Zhang
A Novel Tensor-Based Video Rain Streaks Removal Approach via Utilizing Discriminatively Intrinsic Priors
Tai-Xiang Jiang, Ting-Zhu Huang, Xi-Le Zhao, Liang-Jian Deng, Yao Wang
GMS: Grid-based Motion Statistics for Fast, Ultra-Robust Feature Correspondence
JiaWang Bian, Wen-Yan Lin, Yasuyuki Matsushita, Sai-Kit Yeung, Tan-Dat Nguyen, Ming-Ming Cheng
簡要:一种视频的快速搜索技术,比SIFT还厉害。基于网格的运动统计,用于快速、超鲁棒的特征匹配
Video Desnowing and Deraining Based on Matrix Decomposition
Weihong Ren, Jiandong Tian, Zhi Han, Antoni Chan, Yandong Tang
Real-Time Video Super-Resolution With Spatio-Temporal Networks and Motion Compensation
Jose Caballero, Christian Ledig, Andrew Aitken, Alejandro Acosta, Johannes Totz, Zehan Wang, Wenzhe Shi
Deep Watershed Transform for Instance Segmentation
Min Bai, Raquel Urtasun
AnchorNet: A Weakly Supervised Network to Learn Geometry-Sensitive Features for Semantic Matching
David Novotny, Diane Larlus, Andrea Vedaldi
Learning Diverse Image Colorization
Aditya Deshpande, Jiajun Lu, Mao-Chuang Yeh, Min Jin Chong, David Forsyth
Awesome Typography: Statistics-Based Text Effects Transfer
Shuai Yang, Jiaying Liu, Zhouhui Lian, Zongming Guo
Machine Learning
Unsupervised Video Summarization With Adversarial LSTM Networks
Behrooz Mahasseni, Michael Lam, Sinisa Todorovic
Deep TEN: Texture Encoding Network
Hang Zhang, Jia Xue, Kristin Dana
Order-Preserving Wasserstein Distance for Sequence Matching
Bing Su, Gang Hua
A Dual Ascent Framework for Lagrangean Decomposition of Combinatorial Problems
Paul Swoboda, Jan Kuske, Bogdan Savchynskyy
Attend in Groups: A Weakly-Supervised Deep Learning Framework for Learning From Web Data
Bohan Zhuang, Lingqiao Liu, Yao Li, Chunhua Shen, Ian Reid
Hierarchical Multimodal Metric Learning for Multimodal Classification
Heng Zhang, Vishal M. Patel, Rama Chellappa
Efficient Linear Programming for Dense CRFs
Thalaiyasingam Ajanthan, Alban Desmaison, Rudy Bunel, Mathieu Salzmann, Philip H. S. Torr, M. Pawan Kumar
Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold
YoungJoon Yoo, Sangdoo Yun, Hyung Jin Chang, Yiannis Demiris, Jin Young Choi
Learning Random-Walk Label Propagation for Weakly-Supervised Semantic Segmentation
Paul Vernaza, Manmohan Chandraker
Low-Rank-Sparse Subspace Representation for Robust Regression
Yongqiang Zhang, Daming Shi, Junbin Gao, Dansong Cheng
Object Recognition & Scene Understanding
Generating the Future With Adversarial Transformers
Carl Vondrick, Antonio Torralba
Semantic Amodal Segmentation
Yan Zhu, Yuandong Tian, Dimitris Metaxas, Piotr Dollár
Learning a Deep Embedding Model for Zero-Shot Learning
Li Zhang, Tao Xiang, Shaogang Gong
BIND: Binary Integrated Net Descriptors for Texture-Less Object Recognition
Jacob Chan, Jimmy Addison Lee, Qian Kemao
Growing a Brain: Fine-Tuning by Increasing Model Capacity
Yu-Xiong Wang, Deva Ramanan, Martial Hebert
A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection (PDF)
Xiaolong Wang, Abhinav Shrivastava, Abhinav Gupta
Multiple Instance Detection Network With Online Instance Classifier Refinement
Peng Tang, Xinggang Wang, Xiang Bai, Wenyu Liu
Kernel Pooling for Convolutional Neural Networks
Yin Cui, Feng Zhou, Jiang Wang, Xiao Liu, Yuanqing Lin, Serge Belongie
Learning Cross-Modal Embeddings for Cooking Recipes and Food Images
Amaia Salvador, Nicholas Hynes, Yusuf Aytar, Javier Marin, Ferda Ofli, Ingmar Weber, Antonio Torralba
Zero-Shot Learning - the Good, the Bad and the Ugly
Yongqin Xian, Bernt Schiele, Zeynep Akata
DeepNav: Learning to Navigate Large Cities
Samarth Brahmbhatt, James Hays
Scene Graph Generation by Iterative Message Passing
Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei
Visual Translation Embedding Network for Visual Relation Detection
Hanwang Zhang, Zawlin Kyaw, Shih-Fu Chang, Tat-Seng Chua
Unsupervised Part Learning for Visual Recognition
Ronan Sicre, Yannis Avrithis, Ewa Kijak, Frédéric Jurie
Comprehension-Guided Referring Expressions
Ruotian Luo, Gregory Shakhnarovich
Top-Down Visual Saliency Guided by Captions
Vasili Ramanishka, Abir Das, Jianming Zhang, Kate Saenko
Theory
Grassmannian Manifold Optimization Assisted Sparse Spectral Clustering
Junbin Gao, Qiong Wang, Hong Li
Video Analytics
Video Propagation Networks
Varun Jampani, Raghudeep Gadde, Peter V. Gehler
ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification
Rohit Girdhar, Deva Ramanan, Abhinav Gupta, Josef Sivic, Bryan Russell
SCC: Semantic Context Cascade for Efficient Action Detection
Fabian Caba Heilbron, Wayner Barrios, Victor Escorcia, Bernard Ghanem
Hierarchical Boundary-Aware Neural Encoder for Video Captioning
Lorenzo Baraldi, Costantino Grana, Rita Cucchiara
HOPE: Hierarchical Object Prototype Encoding for Efficient Object Instance Search in Videos
Tan Yu, Yuwei Wu, Junsong Yuan
Spatio-Temporal Vector of Locally Max Pooled Features for Action Recognition in Videos
Ionut Cosmin Duta, Bogdan Ionescu, Kiyoharu Aizawa, Nicu Sebe
Temporal Action Localization by Structured Maximal Sums
Zehuan Yuan, Jonathan C. Stroud, Tong Lu, Jia Deng
Predicting Salient Face in Multiple-Face Videos
Yufan Liu, Songyang Zhang, Mai Xu, Xuming He
Spotlight 2-2A
Graph-Structured Representations for Visual Question Answering
Damien Teney, Lingqiao Liu, Anton van den Hengel
Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning
Jiasen Lu, Caiming Xiong, Devi Parikh, Richard Socher
Learned Contextual Feature Reweighting for Image Geo-Localization
Hyo Jin Kim, Enrique Dunn, Jan-Michael Frahm
End-To-End Concept Word Detection for Video Captioning, Retrieval, and Question Answering
Youngjae Yu, Hyungjin Ko, Jongwook Choi, Gunhee Kim
Deep Cross-Modal Hashing
Qing-Yuan Jiang, Wu-Jun Li
Unambiguous Text Localization and Retrieval for Cluttered Scenes
Xuejian Rong, Chucai Yi, Yingli Tian
Bayesian Supervised Hashing
Zihao Hu, Junxuan Chen, Hongtao Lu, Tongzhen Zhang
Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors
Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, Kevin Murphy
Oral 2-2A
Detecting Visual Relationships With Deep Relational Networks
Bo Dai, Yuqi Zhang, Dahua Lin
Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes (PDF,videos,code)
Tobias Pohlen, Alexander Hermans, Markus Mathias, Bastian Leibe
Network Dissection: Quantifying Interpretability of Deep Visual Representations
David Bau, Bolei Zhou, Aditya Khosla, Aude Oliva, Antonio Torralba
AGA: Attribute-Guided Augmentation
Mandar Dixit, Roland Kwitt, Marc Niethammer, Nuno Vasconcelos
Spotlight 2-2B
A Hierarchical Approach for Generating Descriptive Image Paragraphs
Jonathan Krause, Justin Johnson, Ranjay Krishna, Li Fei-Fei
Person Re-Identification in the Wild
Liang Zheng, Hengheng Zhang, Shaoyan Sun, Manmohan Chandraker, Yi Yang, Qi Tian
Scalable Person Re-Identification on Supervised Smoothed Manifold
Song Bai, Xiang Bai, Qi Tian
Binge Watching: Scaling Affordance Learning From Sitcoms (PDF)
Xiaolong Wang, Rohit Girdhar, Abhinav Gupta
Joint Detection and Identification Feature Learning for Person Search
Tong Xiao, Shuang Li, Bochao Wang, Liang Lin, Xiaogang Wang
Synthesizing Normalized Faces From Facial Identity Features
Forrester Cole, David Belanger, Dilip Krishnan, Aaron Sarna, Inbar Mosseri, William T. Freeman
Consistent-Aware Deep Learning for Person Re-Identification in a Camera Network
Ji Lin, Liangliang Ren, Jiwen Lu, Jianjiang Feng, Jie Zhou
Level Playing Field for Million Scale Face Recognition
Aaron Nech, Ira Kemelmacher-Shlizerman
Oral 2-2B
Re-Sign: Re-Aligned End-To-End Sequence Modelling With Deep Recurrent CNN-HMMs
Oscar Koller, Sepehr Zargaran, Hermann Ney
Social Scene Understanding: End-To-End Multi-Person Action Localization and Collective Activity Recognition
Timur Bagautdinov, Alexandre Alahi, François Fleuret, Pascal Fua, Silvio Savarese
Detangling People: Individuating Multiple Close People and Their Body Parts via Region Assembly
Hao Jiang, Kristen Grauman
Lip Reading Sentences in the Wild
Joon Son Chung, Andrew Senior, Oriol Vinyals, Andrew Zisserman
Spotlight 2-2C
Deep Matching Prior Network: Toward Tighter Multi-Oriented Text Detection
Lianwen Jin, Yuliang Liu
ChestX-ray8: Hospital-Scale Chest X-Ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases
Xiaosong Wang, Yifan Peng, Le Lu, Zhiyong Lu, Mohammadhadi Bagheri, Ronald M. Summers
Attentional Push: A Deep Convolutional Network for Augmenting Image Salience With Shared Attention Modeling in Social Scenes
Siavash Gorji, James J. Clark
Detecting Oriented Text in Natural Images by Linking Segments
Baoguang Shi, Xiang Bai, Serge Belongie
Learning Video Object Segmentation From Static Images
Federico Perazzi, Anna Khoreva, Rodrigo Benenson, Bernt Schiele, Alexander Sorkine-Hornung
Seeing Invisible Poses: Estimating 3D Body Pose From Egocentric Video
Hao Jiang, Kristen Grauman
Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space
Anh Nguyen, Jeff Clune, Yoshua Bengio, Alexey Dosovitskiy, Jason Yosinski
A Joint Speaker-Listener-Reinforcer Model for Referring Expressions
Licheng Yu, Hao Tan, Mohit Bansal, Tamara L. Berg
Oral 2-2C
End-To-End Learning of Driving Models From Large-Scale Video Datasets
Huazhe Xu, Yang Gao, Fisher Yu, Trevor Darrell
Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks
Mengmi Zhang, Keng Teck Ma, Joo Hwee Lim, Qi Zhao, Jiashi Feng
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network
Zizhao Zhang, Yuanpu Xie, Fuyong Xing, Mason McGough, Lin Yang
3D Computer Vision
Surface Motion Capture Transfer With Gaussian Process Regression
Adnane Boukhayma, Jean-Sébastien Franco, Edmond Boyer
Visual-Inertial-Semantic Scene Representation for 3D Object Detection
Jingming Dong, Xiaohan Fei, Stefano Soatto
Template-Based Monocular 3D Recovery of Elastic Shapes Using Lagrangian Multipliers
Nazim Haouchine, Stephane Cotin
Learning Category-Specific 3D Shape Models From Weakly Labeled 2D Images
Dingwen Zhang, Junwei Han, Yang Yang, Dong Huang
Simultaneous Geometric and Radiometric Calibration of a Projector-Camera Pair
Marjan Shahpaski, Luis Ricardo Sapaico, Gaspard Chevassus, Sabine Süsstrunk
Learning Barycentric Representations of 3D Shapes for Sketch-Based 3D Shape Retrieval
Jin Xie, Guoxian Dai, Fan Zhu, Yi Fang
Geodesic Distance Descriptors
Gil Shamai, Ron Kimmel
Analyzing Humans in Images
Modeling Temporal Dynamics and Spatial Configurations of Actions Using Two-Stream Recurrent Neural Networks
Hongsong Wang, Liang Wang
Forecasting Human Dynamics From Static Images
Yu-Wei Chao, Jimei Yang, Brian Price, Scott Cohen, Jia Deng
Re-Ranking Person Re-Identification With k-Reciprocal Encoding (code)
Zhun Zhong,Liang Zheng, Donglin Cao, Shaozi Li
Deep Sequential Context Networks for Action Prediction
Yu Kong, Zhiqiang Tao, Yun Fu
Global Context-Aware Attention LSTM Networks for 3D Action Recognition
Jun Liu, Gang Wang, Ping Hu, Ling-Yu Duan, Alex C. Kot
Dynamic Attention-Controlled Cascaded Shape Regression Exploiting Training Data Augmentation and Fuzzy-Set Sample Weighting
Zhen-Hua Feng, Josef Kittler, William Christmas, Patrik Huber, Xiao-Jun Wu
A Deep Regression Architecture With Two-Stage Re-Initialization for High Performance Facial Landmark Detection
Jiangjing Lv, Xiaohu Shao, Junliang Xing, Cheng Cheng, Xi Zhou
Multiple People Tracking by Lifted Multicut and Person Re-Identification
Siyu Tang, Mykhaylo Andriluka, Bjoern Andres, Bernt Schiele
Towards Accurate Multi-Person Pose Estimation in the Wild
George Papandreou, Tyler Zhu, Nori Kanazawa, Alexander Toshev, Jonathan Tompson, Chris Bregler, Kevin Murphy
Applications
Towards a Quality Metric for Dense Light Fields
Vamsi Kiran Adhikarla, Marek Vinkler, Denis Sumin, Rafał K. Mantiuk, Karol Myszkowski, Hans-Peter Seidel, Piotr Didyk
Controlling Perceptual Factors in Neural Style Transfer
Leon A. Gatys, Alexander S. Ecker, Matthias Bethge, Aaron Hertzmann, Eli Shechtman
Biomedical Image/Video Analysis
Joint Sequence Learning and Cross-Modality Convolution for 3D Biomedical Segmentation
Kuan-Lun Tseng, Yen-Liang Lin, Winston Hsu, Chung-Yang Huang
LSTM Self-Supervision for Detailed Behavior Analysis
Biagio Brattoli, Uta Büchler, Anna-Sophia Wahl, Martin E. Schwab, Björn Ommer
Computational Photography
A Wide-Field-Of-View Monocentric Light Field Camera (PDF,project,project)
Donald G. Dansereau, Glenn Schuster, Joseph Ford,Gordon Wetzstein
Image Motion & Tracking
S2F: Slow-To-Fast Interpolator Flow
Yanchao Yang, Stefano Soatto
CLKN: Cascaded Lucas-Kanade Networks for Image Alignment
Che-Han Chang, Chun-Nan Chou, Edward Y. Chang
Multi-Object Tracking With Quadruplet Convolutional Neural Networks
Mooyeol Baek, Jeany Son, Minsu Cho, Bohyung Han
Low- & Mid-Level Vision
Learning to Detect Salient Objects With Image-Level Supervision
Lijun Wang, Huchuan Lu, Yifan Wang, Mengyang Feng, Dong Wang, Baocai Yin, Xiang Ruan
From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur
Dong Gong, Jie Yang, Lingqiao Liu, Yanning Zhang, Ian Reid, Chunhua Shen, Anton van den Hengel, Qinfeng Shi
Co-Occurrence Filter
Roy J. Jevnisek, Shai Avidan
Fractal Dimension Invariant Filtering and Its CNN-Based Implementation
Hongteng Xu, Junchi Yan, Nils Persson, Weiyao Lin, Hongyuan Zha
Noise-Blind Image Deblurring
Meiguang Jin, Stefan Roth, Paolo Favaro
Simultaneous Visual Data Completion and Denoising Based on Tensor Rank and Total Variation Minimization and Its Primal-Dual Splitting Algorithm
Tatsuya Yokota, Hidekata Hontani
HPatches: A Benchmark and Evaluation of Handcrafted and Learned Local Descriptors
Vassileios Balntas, Karel Lenc, Andrea Vedaldi, Krystian Mikolajczyk
Hyperspectral Image Super-Resolution via Non-Local Sparse Tensor Factorization
Renwei Dian, Leyuan Fang, Shutao Li
Reflection Removal Using Low-Rank Matrix Completion
Byeong-Ju Han, Jae-Young Sim
Object Co-Skeletonization With Co-Segmentation
Koteswar Rao Jerripothula, Jianfei Cai, Jiangbo Lu, Junsong Yuan
Machine Learning
Mining Object Parts From CNNs via Active Question-Answering
Quanshi Zhang, Ruiming Cao, Ying Nian Wu, Song-Chun Zhu
PolyNet: A Pursuit of Structural Diversity in Very Deep Networks
Xingcheng Zhang, Zhizhong Li, Chen Change Loy, Dahua Lin
The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions
Peng Wang, Qi Wu, Chunhua Shen, Anton van den Hengel
Joint Discriminative Bayesian Dictionary and Classifier Learning
Naveed Akhtar, Ajmal Mian, Fatih Porikli
A Study of Lagrangean Decompositions and Dual Ascent Solvers for Graph Matching
Paul Swoboda, Carsten Rother, Hassan Abu Alhaija, Dagmar Kainmüller, Bogdan Savchynskyy
Quad-Networks: Unsupervised Learning to Rank for Interest Point Detection
Nikolay Savinov, Akihito Seki, Ľubor Ladický, Torsten Sattler, Marc Pollefeys
Outlier-Robust Tensor PCA
Pan Zhou, Jiashi Feng
Learning Adaptive Receptive Fields for Deep Image Parsing Network
Zhen Wei, Yao Sun, Jinqiao Wang, Hanjiang Lai, Si Liu
Learning an Invariant Hilbert Space for Domain Adaptation
Samitha Herath, Mehrtash Harandi, Fatih Porikli
Fixed-Point Factorized Networks
Peisong Wang, Jian Cheng
Discriminative Optimization: Theory and Applications to Point Cloud Registration
Jayakorn Vongkulbhisal, Fernando De la Torre, João P. Costeira
Online Asymmetric Similarity Learning for Cross-Modal Retrieval
Yiling Wu, Shuhui Wang, Qingming Huang
Improving Training of Deep Neural Networks via Singular Value Bounding
Kui Jia, Dacheng Tao, Shenghua Gao, Xiangmin Xu
S3Pool: Pooling With Stochastic Spatial Sampling
Shuangfei Zhai, Hui Wu, Abhishek Kumar, Yu Cheng, Yongxi Lu, Zhongfei Zhang, Rogerio Feris
Sports Field Localization via Deep Structured Models
Namdar Homayounfar, Sanja Fidler, Raquel Urtasun
Noisy Softmax: Improving the Generalization Ability of DCNN via Postponing the Early Softmax Saturation
Binghui Chen, Weihong Deng, Junping Du
Switching Convolutional Neural Network for Crowd Counting (project,PDF)
Deepak Babu Sam*, Shiv Surya*,R. Venkatesh Babu ((* Equal Contributors) Video Analytics Lab, Indian Institute of Science)
Network Sketching: Exploiting Binary Structure in Deep CNNs (PDF)
Yiwen Guo, Anbang Yao, Hao Zhao, Yurong Chen
Multi-Task Clustering of Human Actions by Sharing Information
Shizhe Hu, Xiaoqiang Yan, Yangdong Ye
Soft-Margin Mixture of Regressions
Dong Huang, Longfei Han, Fernando De la Torre
Multigrid Neural Architectures
Tsung-Wei Ke, Michael Maire, Stella X. Yu
High-Resolution Image Inpainting Using Multi-Scale Neural Patch Synthesis
Chao Yang, Xin Lu, Zhe Lin, Eli Shechtman, Oliver Wang, Hao Li
Deep Quantization: Encoding Convolutional Activations With Deep Generative Model
Zhaofan Qiu, Ting Yao, Tao Mei
DOPE: Distributed Optimization for Pairwise Energies
Jose Dolz, Ismail Ben Ayed, Christian Desrosiers
Improved Texture Networks: Maximizing Quality and Diversity in Feed-Forward Stylization and Texture Synthesis
Dmitry Ulyanov, Andrea Vedaldi, Victor Lempitsky
Object Recognition & Scene Understanding
Polyhedral Conic Classifiers for Visual Object Detection and Classification
Hakan Cevikalp, Bill Triggs
Incremental Kernel Null Space Discriminant Analysis for Novelty Detection
Juncheng Liu, Zhouhui Lian, Yi Wang, Jianguo Xiao
Predicting Ground-Level Scene Layout From Aerial Imagery
Menghua Zhai, Zachary Bessinger, Scott Workman, Nathan Jacobs
Deep Feature Flow for Video Recognition
Xizhou Zhu, Yuwen Xiong, Jifeng Dai, Lu Yuan, Yichen Wei
Object-Aware Dense Semantic Correspondence
Fan Yang, Xin Li, Hong Cheng, Jianping Li, Leiting Chen
Semantic Regularisation for Recurrent Image Annotation
Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, Changyin Sun
Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images
Zhi-Qi Cheng, Xiao Wu, Yang Liu, Xian-Sheng Hua
Fast-At: Fast Automatic Thumbnail Generation Using Deep Neural Networks
Seyed A. Esmaeili, Bharat Singh, Larry S. Davis
Multi-Level Attention Networks for Visual Question Answering
Dongfei Yu, Jianlong Fu, Tao Mei, Yong Rui
Generating Descriptions With Grounded and Co-Referenced People
Anna Rohrbach, Marcus Rohrbach, Siyu Tang, Seong Joon Oh, Bernt Schiele
Straight to Shapes: Real-Time Detection of Encoded Shapes
Saumya Jetley, Michael Sapienza, Stuart Golodetz, Philip H. S. Torr
Simultaneous Feature Aggregating and Hashing for Large-Scale Image Search
Thanh-Toan Do, Dang-Khoa Le Tan, Trung T. Pham, Ngai-Man Cheung
Improving Facial Attribute Prediction Using Semantic Segmentation
Mahdi M. Kalayeh, Boqing Gong, Mubarak Shah
Video Analytics
Learning Cross-Modal Deep Representations for Robust Pedestrian Detection
Dan Xu, Wanli Ouyang, Elisa Ricci, Xiaogang Wang, Nicu Sebe
Spatio-Temporal Self-Organizing Map Deep Network for Dynamic Object Detection From Videos
Yang Du, Chunfeng Yuan, Bing Li, Weiming Hu, Stephen Maybank
CERN: Confidence-Energy Recurrent Network for Group Activity Recognition
Tianmin Shu, Sinisa Todorovic, Song-Chun Zhu
Understanding Traffic Density From Large-Scale Web Camera Data
Shanghang Zhang, Guanhang Wu, João P. Costeira, José M. F. Moura
Collaborative Summarization of Topic-Related Videos
Rameswar Panda, Amit K. Roy-Chowdhury
Spotlight 3-1A
Local Binary Convolutional Neural Networks
Felix Juefei-Xu, Vishnu Naresh Boddeti, Marios Savvides
Deep Self-Taught Learning for Weakly Supervised Object Localization
Zequn Jie, Yunchao Wei, Xiaojie Jin, Jiashi Feng, Wei Liu
Multi-Modal Mean-Fields via Cardinality-Based Clamping
Pierre Baqué, François Fleuret, Pascal Fua
Probabilistic Temporal Subspace Clustering
Vladimir Pavlovic, Behnam Gholami
Provable Self-Representation Based Outlier Detection in a Union of Subspaces
Chong You, Daniel P. Robinson, René Vidal
Latent Multi-View Subspace Clustering
Changqing Zhang, Qinghua Hu, Huazhu Fu, Pengfei Zhu, Xiaochun Cao
Learning to Extract Semantic Structure From Documents Using Multimodal Fully Convolutional Neural Networks
Xiao Yang, Ersin Yumer, Paul Asente, Mike Kraley, Daniel Kifer, C. Lee Giles
Age Progression/Regression by Conditional Adversarial Autoencoder
Zhifei Zhang, Yang Song, Hairong Qi
Oral 3-1A
Compact Matrix Factorization With Dependent Subspaces
Viktor Larsson, Carl Olsson
FFTLasso: Large-Scale LASSO in the Fourier Domain
Adel Bibi, Hani Itani, Bernard Ghanem
On the Global Geometry of Sphere-Constrained Sparse Blind Deconvolution
Yuqian Zhang, Yenson Lau, Han-wen Kuo, Sky Cheung, Abhay Pasupathy, John Wright
Global Optimality in Neural Network Training
Benjamin D. Haeffele, René Vidal
Spotlight 3-1B
What Is and What Is Not a Salient Object? Learning Salient Object Detector by Ensembling Linear Exemplar Regressors
Changqun Xia, Jia Li, Xiaowu Chen, Anlin Zheng, Yu Zhang
Deep Variation-Structured Reinforcement Learning for Visual Relationship and Attribute Detection
Xiaodan Liang, Lisa Lee, Eric P. Xing
Modeling Relationships in Referential Expressions With Compositional Modular Networks
Ronghang Hu, Marcus Rohrbach, Jacob Andreas, Trevor Darrell, Kate Saenko
Counting Everyday Objects in Everyday Scenes
Prithvijit Chattopadhyay, Ramakrishna Vedantam, Ramprasaath R. Selvaraju, Dhruv Batra, Devi Parikh
Fully Convolutional Instance-Aware Semantic Segmentation
Yi Li, Haozhi Qi, Jifeng Dai, Xiangyang Ji, Yichen Wei
Semantic Autoencoder for Zero-Shot Learning
Elyor Kodirov, Tao Xiang, Shaogang Gong
CityPersons: A Diverse Dataset for Pedestrian Detection
Shanshan Zhang, Rodrigo Benenson, Bernt Schiele
GuessWhat?! Visual Object Discovery Through Multi-Modal Dialogue
Harm de Vries, Florian Strub, Sarath Chandar, Olivier Pietquin, Hugo Larochelle, Aaron Courville
Oral 3-1B
Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-Grained Image Recognition
Jianlong Fu, Heliang Zheng, Tao Mei
Annotating Object Instances With a Polygon-RNN
LluÃs Castrejón, Kaustav Kundu, Raquel Urtasun, Sanja Fidler
Connecting Look and Feel: Associating the Visual and Tactile Properties of Physical Materials
Wenzhen Yuan, Shaoxiong Wang, Siyuan Dong, Edward Adelson
Deep Learning Human Mind for Automated Visual Classification
Concetto Spampinato, Simone Palazzo, Isaak Kavasidis, Daniela Giordano, Nasim Souly, Mubarak Shah
3D Computer Vision
Self-Calibration-Based Approach to Critical Motion Sequences of Rolling-Shutter Structure From Motion
Eisuke Ito, Takayuki Okatani
Semi-Calibrated Near Field Photometric Stereo
Fotios Logothetis, Roberto Mecca, Roberto Cipolla
Semantic Multi-View Stereo: Jointly Estimating Objects and Voxels
Ali Osman Ulusoy, Michael J. Black, Andreas Geiger
Learning to Predict Stereo Reliability Enforcing Local Consistency of Confidence Maps
Matteo Poggi, Stefano Mattoccia
The Misty Three Point Algorithm for Relative Pose
Tobias Palmér, Kalle Ã…ström, Jan-Michael Frahm
The Surfacing of Multiview 3D Drawings via Lofting and Occlusion Reasoning (PDF,dataset,poster)
Anil Usumezbas, Ricardo Fabbri, Benjamin B. Kimia
A New Representation of Skeleton Sequences for 3D Action Recognition
Qiuhong Ke, Mohammed Bennamoun, Senjian An, Ferdous Sohel, Farid Boussaid
A General Framework for Curve and Surface Comparison and Registration With Oriented Varifolds
Irène Kaltenmark, Benjamin Charlier, Nicolas Charon
Learning to Align Semantic Segmentation and 2.5D Maps for Geolocalization
Anil Armagan, Martin Hirzer, Peter M. Roth, Vincent Lepetit
A Generative Model for Depth-Based Robust 3D Facial Pose Tracking
Lu Sheng, Jianfei Cai, Tat-Jen Cham, Vladimir Pavlovic, King Ngi Ngan
Fast 3D Reconstruction of Faces With Glasses
Fabio Maninchedda, Martin R. Oswald, Marc Pollefeys
An Efficient Algebraic Solution to the Perspective-Three-Point Problem
Tong Ke, Stergios I. Roumeliotis
Analyzing Humans in Images
Learning From Synthetic Humans
Gül Varol, Javier Romero, Xavier Martin, Naureen Mahmood, Michael J. Black, Ivan Laptev, Cordelia Schmid
Forecasting Interactive Dynamics of Pedestrians With Fictitious Play
Wei-Chiu Ma, De-An Huang, Namhoon Lee, Kris M. Kitani
Hand Keypoint Detection in Single Images Using Multiview Bootstrapping
Tomas Simon, Hanbyul Joo, Iain Matthews, Yaser Sheikh
PoseTrack: Joint Multi-Person Pose Estimation and Tracking
Umar Iqbal, Anton Milan, Juergen Gall
Expecting the Unexpected: Training Detectors for Unusual Pedestrians With Adversarial Imposters
Shiyu Huang, Deva Ramanan
On Human Motion Prediction Using Recurrent Neural Networks
Julieta Martinez, Michael J. Black, Javier Romero
Learning and Refining of Privileged Information-Based RNNs for Action Recognition From Depth Sequences
Zhiyuan Shi, Tae-Kyun Kim
Quality Aware Network for Set to Set Recognition
Yu Liu, Junjie Yan, Wanli Ouyang
Unite the People: Closing the Loop Between 3D and 2D Human Representations
Christoph Lassner, Javier Romero, Martin Kiefel, Federica Bogo, Michael J. Black, Peter V. Gehler
Deep Multitask Architecture for Integrated 2D and 3D Human Sensing
Alin-Ionut Popa, Mihai Zanfir, Cristian Sminchisescu
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira, Andrew Zisserman
Applications
Identifying First-Person Camera Wearers in Third-Person Videos
Chenyou Fan, Jangwon Lee, Mingze Xu, Krishna Kumar Singh, Yong Jae Lee, David J. Crandall, Michael S. Ryoo
Biomedical Image/Video Analysis
Parsing Images of Overlapping Organisms With Deep Singling-Out Networks
Victor Yurchenko, Victor Lempitsky
Fine-Tuning Convolutional Neural Networks for Biomedical Image Analysis: Actively and Incrementally
Zongwei Zhou, Jae Shin, Lei Zhang, Suryakanth Gurudu, Michael Gotway, Jianming Liang
Computational Photography
Depth From Defocus in the Wild
Huixuan Tang, Scott Cohen, Brian Price, Stephen Schiller, Kiriakos N. Kutulakos
Matting and Depth Recovery of Thin Structures Using a Focal Stack
Chao Liu, Srinivasa G. Narasimhan, Artur W. Dubrawski
Image Motion & Tracking
Robust Interpolation of Correspondences for Large Displacement Optical Flow
Yinlin Hu, Yunsong Li, Rui Song
Large Margin Object Tracking With Circulant Feature Maps
Mengmeng Wang, Yong Liu, Zeyi Huang
Minimum Delay Moving Object Detection
Dong Lao, Ganesh Sundaramoorthi
Multi-Task Correlation Particle Filter for Robust Object Tracking
Tianzhu Zhang, Changsheng Xu, Ming-Hsuan Yang
Attentional Correlation Filter Network for Adaptive Visual Tracking
Jongwon Choi, Hyung Jin Chang, Sangdoo Yun, Tobias Fischer, Yiannis Demiris, Jin Young Choi
The World of Fast Moving Objects
Denys Rozumnyi, Jan Kotera, Filip Šroubek, Lukáš Novotný, Jiřà Matas
Discriminative Correlation Filter With Channel and Spatial Reliability
Alan LukežiÄ, Tomáš VojÃÅ™, Luka ÄŒehovin Zajc, Jiřà Matas, Matej Kristan
Low- & Mid-Level Vision
Learning Deep Binary Descriptor With Multi-Quantization
Yueqi Duan, Jiwen Lu, Ziwei Wang, Jianjiang Feng, Jie Zhou
One-To-Many Network for Visually Pleasing Compression Artifacts Reduction
Jun Guo, Hongyang Chao
Gated Feedback Refinement Network for Dense Image Labeling
Md Amirul Islam, Mrigank Rochan, Neil D. B. Bruce, Yang Wang
BRISKS: Binary Features for Spherical Images on a Geodesic Grid
Hao Guan, William A. P. Smith
Superpixels and Polygons Using Simple Non-Iterative Clustering
Radhakrishna Achanta, Sabine Süsstrunk
Hardware-Efficient Guided Image Filtering for Multi-Label Problem
Longquan Dai, Mengke Yuan, Zechao Li, Xiaopeng Zhang, Jinhui Tang
Alternating Direction Graph Matching (PDF)
D. Khuê Lê-Huu, Nikos Paragios
Learning Discriminative and Transformation Covariant Local Feature Detectors
Xu Zhang, Felix X. Yu, Svebor Karaman, Shih-Fu Chang
Machine Learning
Correlational Gaussian Processes for Cross-Domain Visual Recognition
Chengjiang Long, Gang Hua
DeLiGAN : Generative Adversarial Networks for Diverse and Limited Data (PDF,code)
Swaminathan Gurumurthy (CMU), Ravi Kiran Sarvadevabhatla (Video Analytics Lab, Indian Institute of Science),R. Venkatesh Babu
Oriented Response Networks
Yanzhao Zhou, Qixiang Ye, Qiang Qiu, Jianbin Jiao
Missing Modalities Imputation via Cascaded Residual Autoencoder
Luan Tran, Xiaoming Liu, Jiayu Zhou, Rong Jin
Efficient Optimization for Hierarchically-structured Interacting Segments (HINTS)
Hossam Isack, Olga Veksler, Ipek Oguz, Milan Sonka, Yuri Boykov
A Message Passing Algorithm for the Minimum Cost Multicut Problem
Paul Swoboda, Bjoern Andres
End-To-End Representation Learning for Correlation Filter Based Tracking
Jack Valmadre, Luca Bertinetto, João Henriques, Andrea Vedaldi, Philip H. S. Torr
Filter Flow Made Practical: Massively Parallel and Lock-Free
Sathya N. Ravi, Yunyang Xiong, Lopamudra Mukherjee, Vikas Singh
Online Graph Completion: Multivariate Signal Recovery in Computer Vision
Won Hwa Kim, Mona Jalal, Seongjae Hwang, Sterling C. Johnson, Vikas Singh
Point to Set Similarity Based Deep Feature Learning for Person Re-Identification
Sanping Zhou, Jinjun Wang, Jiayun Wang, Yihong Gong, Nanning Zheng
Exploiting Saliency for Object Segmentation From Image Level Labels
Seong Joon Oh, Rodrigo Benenson, Anna Khoreva, Zeynep Akata, Mario Fritz, Bernt Schiele
Consensus Maximization With Linear Matrix Inequality Constraints
Pablo Speciale, Danda Pani Paudel, Martin R. Oswald, Till Kroeger, Luc Van Gool, Marc Pollefeys
Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks
Yinda Zhang, Shuran Song, Ersin Yumer, Manolis Savva, Joon-Young Lee, Hailin Jin, Thomas Funkhouser
Deep Multimodal Representation Learning From Temporal Data
Xitong Yang, Palghat Ramesh, Radha Chitta, Sriganesh Madhvanath, Edgar A. Bernal, Jiebo Luo
All You Need Is Beyond a Good Init: Exploring Better Solution for Training Extremely Deep Convolutional Neural Networks With Orthonormality and Modulation
Di Xie, Jiang Xiong, Shiliang Pu
Hard Mixtures of Experts for Large Scale Weakly Supervised Vision
Sam Gross, Marc'Aurelio Ranzato, Arthur Szlam
A Reinforcement Learning Approach to the View Planning Problem
Mustafa Devrim Kaba, Mustafa Gokhan Uzunbas, Ser Nam Lim
Zero-Shot Classification With Discriminative Semantic Representation Learning
Meng Ye, Yuhong Guo
Adversarial Discriminative Domain Adaptation
Eric Tzeng, Judy Hoffman, Kate Saenko, Trevor Darrell
None of the above
Learning to Rank Retargeted Images
Yang Chen, Yong-Jin Liu, Yu-Kun Lai
Object Recognition & Scene Understanding
Automatic Discovery, Association Estimation and Learning of Semantic Attributes for a Thousand Categories
Ziad Al-Halah, Rainer Stiefelhagen
Scene Parsing Through ADE20K Dataset
Bolei Zhou, Hang Zhao, Xavier Puig, Sanja Fidler, Adela Barriuso, Antonio Torralba
Weakly Supervised Cascaded Convolutional Networks
Ali Diba, Vivek Sharma, Ali Pazandeh, Hamed Pirsiavash, Luc Van Gool
Discretely Coding Semantic Rank Orders for Supervised Image Hashing
Li Liu, Ling Shao, Fumin Shen, Mengyang Yu
Joint Geometrical and Statistical Alignment for Visual Domain Adaptation
Jing Zhang, Wanqing Li, Philip Ogunbona
Weakly Supervised Dense Video Captioning
Zhiqiang Shen, Jianguo Li, Zhou Su, Minjun Li, Yurong Chen, Yu-Gang Jiang, Xiangyang Xue
RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation
Guosheng Lin, Anton Milan, Chunhua Shen, Ian Reid
Semantic Segmentation via Structured Patch Prediction, Context CRF and Guidance CRF
Falong Shen, Rui Gan, Shuicheng Yan, Gang Zeng
Person Search With Natural Language Description
Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, Xiaogang Wang
Weakly Supervised Affordance Detection
Johann Sawatzky, Abhilash Srikantha, Juergen Gall
Zero-Shot Recognition Using Dual Visual-Semantic Mapping Paths
Yanan Li, Donghui Wang, Huanhang Hu, Yuetan Lin, Yueting Zhuang
Neural Aggregation Network for Video Face Recognition (PDF)
Jiaolong Yang (Microsoft Research), Peiran Ren (Microsoft Research), Dongqing Zhang (Microsoft Research), Dong Chen (Microsoft Research), Fang Wen (Microsoft Research), Hongdong Li (ANU), Gang Hua (Microsoft Research)
Relationship Proposal Networks
Ji Zhang, Mohamed Elhoseiny, Scott Cohen, Walter Chang, Ahmed Elgammal
Learning Object Interactions and Descriptions for Semantic Image Segmentation
Guangrun Wang, Ping Luo, Liang Lin, Xiaogang Wang
RON: Reverse Connection With Objectness Prior Networks for Object Detection
Tao Kong, Fuchun Sun, Anbang Yao, Huaping Liu, Ming Lu, Yurong Chen
Weakly-Supervised Visual Grounding of Phrases With Linguistic Structures
Fanyi Xiao, Leonid Sigal, Yong Jae Lee
Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects
Ting Yao, Yingwei Pan, Yehao Li, Tao Mei
Beyond Instance-Level Image Retrieval: Leveraging Captions to Learn a Global Visual Representation for Semantic Retrieval
Diane Larlus, Albert Gordo
MuCaLe-Net: Multi Categorical-Level Networks to Generate More Discriminating Features
Youssef Tamaazousti, Hervé Le Borgne, Céline Hudelot
Zero Shot Learning via Multi-Scale Manifold Regularization
Shay Deutsch, Soheil Kolouri, Kyungnam Kim, Yuri Owechko, Stefano Soatto
Theory
Deeply Supervised Salient Object Detection With Short Connections
Qibin Hou, Ming-Ming Cheng, Xiaowei Hu, Ali Borji, Zhuowen Tu, Philip H. S. Torr
A Matrix Splitting Method for Composite Function Minimization
Ganzhao Yuan, Wei-Shi Zheng, Bernard Ghanem
Video Analytics
One-Shot Video Object Segmentation (PDF,project,code,code)
Sergi Caelles,Kevis-Kokitsi Maninis,Jordi Pont-Tuset, Laura Leal-Taixé, Daniel Cremers, Luc Van Gool
Fast Person Re-Identification via Cross-Camera Semantic Binary Transformation
Jiaxin Chen, Yunhong Wang, Jie Qin, Li Liu, Ling Shao
SPFTN: A Self-Paced Fine-Tuning Network for Segmenting Objects in Weakly Labelled Videos
Dingwen Zhang, Le Yang, Deyu Meng, Dong Xu, Junwei Han
Spotlight 4-1A
Hidden Layers in Perceptual Learning
Gad Cohen, Daphna Weinshall
Few-Shot Object Recognition From Machine-Labeled Web Images
Zhongwen Xu, Linchao Zhu, Yi Yang
Hallucinating Very Low-Resolution Unaligned and Noisy Face Images by Transformative Discriminative Autoencoders
Xin Yu, Fatih Porikli
Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension
Aniruddha Kembhavi, Minjoon Seo, Dustin Schwenk, Jonghyun Choi, Ali Farhadi, Hannaneh Hajishirzi
Deep Hashing Network for Unsupervised Domain Adaptation
Hemanth Venkateswara, Jose Eusebio, Shayok Chakraborty, Sethuraman Panchanathan
Generalized Deep Image to Image Regression
Venkataraman Santhanam, Vlad I. Morariu, Larry S. Davis
Deep Learning With Low Precision by Half-Wave Gaussian Quantization
Zhaowei Cai, Xiaodong He, Jian Sun, Nuno Vasconcelos
Creativity: Generating Diverse Questions Using Variational Autoencoders
Unnat Jain, Ziyu Zhang, Alexander G. Schwing
Oral 4-1A
Geometric Deep Learning on Graphs and Manifolds Using Mixture Model CNNs
Federico Monti, Davide Boscaini, Jonathan Masci, Emanuele Rodolà , Jan Svoboda, Michael M. Bronstein
Full Resolution Image Compression With Recurrent Neural Networks
George Toderici, Damien Vincent, Nick Johnston, Sung Jin Hwang, David Minnen, Joel Shor, Michele Covell
Neural Face Editing With Intrinsic Image Disentangling
Zhixin Shu, Ersin Yumer, Sunil Hadap, Kalyan Sunkavalli, Eli Shechtman, Dimitris Samaras
Ubernet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision Using Diverse Datasets and Limited Memory
Iasonas Kokkinos
Spotlight 4-1B
3D Face Morphable Models “In-The-Wildâ€
James Booth, Epameinondas Antonakos, Stylianos Ploumpis, George Trigeorgis, Yannis Panagakis, Stefanos Zafeiriou
KillingFusion: Non-Rigid 3D Reconstruction Without Correspondences
Miroslava Slavcheva, Maximilian Baust, Daniel Cremers, Slobodan Ilic
Detailed, Accurate, Human Shape Estimation From Clothed 3D Scan Sequences
Chao Zhang, Sergi Pujades, Michael J. Black, Gerard Pons-Moll
POSEidon: Face-From-Depth for Driver Pose Estimation
Guido Borghi, Marco Venturelli, Roberto Vezzani, Rita Cucchiara
Human Shape From Silhouettes Using Generative HKS Descriptors and Cross-Modal Neural Networks
Endri Dibra, Himanshu Jain, Cengiz Öztireli, Remo Ziegler, Markus Gross
Parametric T-Spline Face Morphable Model for Detailed Fitting in Shape Subspace
Weilong Peng, Zhiyong Feng, Chao Xu, Yong Su
3D Menagerie: Modeling the 3D Shape and Pose of Animals
Silvia Zuffi, Angjoo Kanazawa, David W. Jacobs, Michael J. Black
iCaRL: Incremental Classifier and Representation Learning
Sylvestre-Alvise Rebuffi, Alexander Kolesnikov, Georg Sperl, Christoph H. Lampert
Oral 4-1B
Recurrent 3D Pose Sequence Machines
Mude Lin, Liang Lin, Xiaodan Liang, Keze Wang, Hui Cheng
Learning Detailed Face Reconstruction From a Single Image
Elad Richardson, Matan Sela, Roy Or-El, Ron Kimmel
Thin-Slicing Network: A Deep Structured Model for Pose Estimation in Videos
Jie Song, Limin Wang, Luc Van Gool, Otmar Hilliges
Dynamic FAUST: Registering Human Bodies in Motion
Federica Bogo, Javier Romero, Gerard Pons-Moll, Michael J. Black
3D Computer Vision
Semantically Coherent Co-Segmentation and Reconstruction of Dynamic Scenes
Armin Mustafa, Adrian Hilton
On the Two-View Geometry of Unsynchronized Cameras
Cenek Albl, Zuzana Kukelova, Andrew Fitzgibbon, Jan Heller, Matej Smid, Tomas Pajdla
Using Locally Corresponding CAD Models for Dense 3D Reconstructions From a Single Image
Chen Kong, Chen-Hsuan Lin, Simon Lucey
A Clever Elimination Strategy for Efficient Minimal Solvers
Zuzana Kukelova, Joe Kileel, Bernd Sturmfels, Tomas Pajdla
Convex Global 3D Registration With Lagrangian Duality
Jesus Briales, Javier Gonzalez-Jimenez
DeMoN: Depth and Motion Network for Learning Monocular Stereo (project,PDF)
Benjamin Ummenhofer, Huizhong Zhou, Jonas Uhrig, Nikolaus Mayer, Eddy Ilg, Alexey Dosovitskiy, Thomas Brox
3D Bounding Box Estimation Using Deep Learning and Geometry
Arsalan Mousavian, Dragomir Anguelov, John Flynn, Jana Košecká
A Dataset for Benchmarking Image-Based Localization
Xun Sun, Yuanfan Xie, Pei Luo, Liang Wang
Analyzing Humans in Images
Asynchronous Temporal Fields for Action Recognition
Gunnar A. Sigurdsson, Santosh Divvala, Ali Farhadi, Abhinav Gupta
Sequential Person Recognition in Photo Albums With a Recurrent Network
Yao Li, Guosheng Lin, Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Anton van den Hengel
Multi-Context Attention for Human Pose Estimation
Xiao Chu, Wei Yang, Wanli Ouyang, Cheng Ma, Alan L. Yuille, Xiaogang Wang
3D Convolutional Neural Networks for Efficient and Robust Hand Pose Estimation From Single Depth Images
Liuhao Ge, Hui Liang, Junsong Yuan, Daniel Thalmann
Lifting From the Deep: Convolutional 3D Pose Estimation From a Single Image
Denis Tome, Chris Russell, Lourdes Agapito
AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos
Amlan Kar, Nishant Rai, Karan Sikka, Gaurav Sharma
Deep Structured Learning for Facial Action Unit Intensity Estimation
Robert Walecki, Ognjen (Oggi) Rudovic, Vladimir Pavlovic, Bjöern Schuller, Maja Pantic
Simultaneous Facial Landmark Detection, Pose and Deformation Estimation Under Facial Occlusion
Yue Wu, Chao Gou, Qiang Ji
Self-Supervised Video Representation Learning With Odd-One-Out Networks
Basura Fernando, Hakan Bilen, Efstratios Gavves, Stephen Gould
Robust Joint and Individual Variance Explained
Christos Sagonas, Yannis Panagakis, Alina Leidinger, Stefanos Zafeiriou
Discriminative Covariance Oriented Representation Learning for Face Recognition With Image Sets
Wen Wang, Ruiping Wang, Shiguang Shan, Xilin Chen
3D Human Pose Estimation = 2D Pose Estimation + Matching
Ching-Hang Chen, Deva Ramanan
Applications
Joint Gap Detection and Inpainting of Line Drawings
Kazuma Sasaki, Satoshi Iizuka, Edgar Simo-Serra, Hiroshi Ishikawa
Biomedical Image/Video Analysis
Riemannian Nonlinear Mixed Effects Models: Analyzing Longitudinal Deformations in Neuroimaging
Hyunwoo J. Kim, Nagesh Adluru, Heemanshu Suri, Baba C. Vemuri, Sterling C. Johnson, Vikas Singh
Simultaneous Super-Resolution and Cross-Modality Synthesis of 3D Medical Images Using Weakly-Supervised Joint Convolutional Sparse Coding
Yawen Huang, Ling Shao, Alejandro F. Frangi
Computational Photography
Multiple-Scattering Microphysics Tomography
Aviad Levis, Yoav Y. Schechner, Anthony B. Davis
Image Motion & Tracking
Accurate Optical Flow via Direct Cost Volume Processing
Jia Xu, René Ranftl, Vladlen Koltun
Event-Based Visual Inertial Odometry
Alex Zihao Zhu, Nikolay Atanasov, Kostas Daniilidis
Robust Visual Tracking Using Oblique Random Forests
Le Zhang, Jagannadan Varadarajan, Ponnuthurai Nagaratnam Suganthan, Narendra Ahuja, Pierre Moulin
Low- & Mid-Level Vision
Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution
Wei-Sheng Lai, Jia-Bin Huang, Narendra Ahuja, Ming-Hsuan Yang
Learning Non-Lambertian Object Intrinsics Across ShapeNet Categories
Jian Shi, Yue Dong, Hao Su, Stella X. Yu
MCMLSD: A Dynamic Programming Approach to Line Segment Detection
Emilio J. Almazà n, Ron Tal, Yiming Qian, James H. Elder
Contour-Constrained Superpixels for Image and Video Processing
Se-Ho Lee, Won-Dong Jang, Chang-Su Kim
Richer Convolutional Features for Edge Detection
Yun Liu, Ming-Ming Cheng, Xiaowei Hu, Kai Wang, Xiang Bai
Non-Local Color Image Denoising With Convolutional Neural Networks
Stamatios Lefkimmiatis
Generative Face Completion
Yijun Li, Sifei Liu, Jimei Yang, Ming-Hsuan Yang
Hyper-Laplacian Regularized Unidirectional Low-Rank Tensor Recovery for Multispectral Image Denoising
Yi Chang, Luxin Yan, Sheng Zhong
Unsupervised Semantic Scene Labeling for Streaming Data
Maggie Wigness, John G. Rogers III
Why You Should Forget Luminance Conversion and Do Something Better
Rang M. H. Nguyen, Michael S. Brown
Deep Semantic Feature Matching
Nikolai Ufer, Björn Ommer
Machine Learning
Revisiting the Variable Projection Method for Separable Nonlinear Least Squares Problems
Je Hyeong Hong, Christopher Zach, Andrew Fitzgibbon
Efficient Multiple Instance Metric Learning Using Weakly Supervised Data
Marc T. Law, Yaoliang Yu, Raquel Urtasun, Richard S. Zemel, Eric P. Xing
WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation (PDF,supplementary material, code)
Thibaut Durand, Taylor Mordan, Nicolas Thome, Matthieu Cord
Image-To-Image Translation With Conditional Adversarial Networks
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros
Deep Roots: Improving CNN Efficiency With Hierarchical Filter Groups
Yani Ioannou, Duncan Robertson, Roberto Cipolla, Antonio Criminisi
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie, Ross Girshick, Piotr Dollár, Zhuowen Tu, Kaiming He
MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks With Privileged Information
Hao Yang, Joey Tianyi Zhou, Jianfei Cai, Yew Soon Ong
Low-Rank Embedded Ensemble Semantic Dictionary for Zero-Shot Learning
Zhengming Ding, Ming Shao, Yun Fu
Factorized Variational Autoencoders for Modeling Audience Reactions to Movies
Zhiwei Deng, Rajitha Navarathna, Peter Carr, Stephan Mandt, Yisong Yue, Iain Matthews, Greg Mori
Learning Features by Watching Objects Move
Deepak Pathak, Ross Girshick, Piotr Dollár, Trevor Darrell, Bharath Hariharan
What Can Help Pedestrian Detection?
Jiayuan Mao, Tete Xiao, Yuning Jiang, Zhimin Cao
DeepPermNet: Visual Permutation Learning
Rodrigo Santa Cruz, Basura Fernando, Anoop Cherian, Stephen Gould
Learning the Multilinear Structure of Visual Data
Mengjiao Wang, Yannis Panagakis, Patrick Snape, Stefanos Zafeiriou
Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies
Lena Gorelick, Yuri Boykov, Olga Veksler
Designing Energy-Efficient Convolutional Neural Networks Using Energy-Aware Pruning
Tien-Ju Yang, Yu-Hsin Chen, Vivienne Sze
Joint Multi-Person Pose Estimation and Semantic Part Segmentation (PDF,dataset)
Fangting Xia, Peng Wang, Xianjie Chen, Alan L. Yuille
Deep Feature Interpolation for Image Content Changes
Paul Upchurch, Jacob Gardner, Geoff Pleiss, Robert Pless, Noah Snavely, Kavita Bala, Kilian Weinberger
FASON: First and Second Order Information Fusion Network for Texture Recognition
Xiyang Dai, Joe Yue-Hei Ng, Larry S. Davis
Lean Crowdsourcing: Combining Humans and Machines in an Online System
Steve Branson, Grant Van Horn, Pietro Perona
Object Recognition & Scene Understanding
Supervising Neural Attention Models for Video Captioning by Human Gaze Data
Youngjae Yu, Jongwook Choi, Yeonhwa Kim, Kyung Yoo, Sang-Hun Lee, Gunhee Kim
L2-Net: Deep Learning of Discriminative Patch Descriptor in Euclidean Space
Yurun Tian, Bin Fan, Fuchao Wu
Convolutional Random Walk Networks for Semantic Image Segmentation
Gedas Bertasius, Lorenzo Torresani, Stella X. Yu, Jianbo Shi
Knowledge Acquisition for Visual Question Answering via Iterative Querying
Yuke Zhu, Joseph J. Lim, Li Fei-Fei
Memory-Augmented Attribute Manipulation Networks for Interactive Fashion Search
Bo Zhao, Jiashi Feng, Xiao Wu, Shuicheng Yan
From Zero-Shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis
Yang Long, Li Liu, Ling Shao, Fumin Shen, Guiguang Ding, Jungong Han
Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?
Torsten Sattler, Akihiko Torii, Josef Sivic, Marc Pollefeys, Hajime Taira, Masatoshi Okutomi, Tomas Pajdla
Asymmetric Feature Maps With Application to Sketch Based Retrieval
Giorgos Tolias, Ondřej Chum
Diverse Image Annotation
Baoyuan Wu, Fan Jia, Wei Liu, Bernard Ghanem
AMC: Attention guided Multi-modal Correlation Learning for Image Search
Kan Chen, Trung Bui, Chen Fang, Zhaowen Wang, Ram Nevatia
Multi-Attention Network for One Shot Learning
Peng Wang, Lingqiao Liu, Chunhua Shen, Zi Huang, Anton van den Hengel, Heng Tao Shen
Fried Binary Embedding for High-Dimensional Visual Features
Weixiang Hong, Junsong Yuan, Sreyasee Das Bhattacharjee
Pyramid Scene Parsing Network
Hengshuang Zhao, Jianping Shi, Xiaojuan Qi, Xiaogang Wang, Jiaya Jia
Learning Deep Match Kernels for Image-Set Classification
Haoliang Sun, Xiantong Zhen, Yuanjie Zheng, Gongping Yang, Yilong Yin, Shuo Li
Task-Driven Dynamic Fusion: Reducing Ambiguity in Video Description
Xishan Zhang, Ke Gao, Yongdong Zhang, Dongming Zhang, Jintao Li, Qi Tian
Learning Multifunctional Binary Codes for Both Category and Attribute Oriented Retrieval Tasks
Haomiao Liu, Ruiping Wang, Shiguang Shan, Xilin Chen
Indoor Scene Parsing With Instance Segmentation, Semantic Labeling and Support Relationship Inference
Wei Zhuo, Mathieu Salzmann, Xuming He, Miaomiao Liu
Episodic CAMN: Contextual Attention-Based Memory Networks With Iterative Feedback for Scene Labeling
Abrar H. Abdulnabi, Bing Shuai, Stefan Winkler, Gang Wang
Link the Head to the “Beakâ€: Zero Shot Learning From Noisy Text Description at Part Precision
Mohamed Elhoseiny, Yizhe Zhu, Han Zhang, Ahmed Elgammal
SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning
Long Chen, Hanwang Zhang, Jun Xiao, Liqiang Nie, Jian Shao, Wei Liu, Tat-Seng Chua
Deep Pyramidal Residual Networks (PDF,code)
Dongyoon Han, Jiwhan Kim, Junmo Kim
Product Split Trees
Artem Babenko, Victor Lempitsky
Making the v in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering
Yash Goyal, Tejas Khot, Douglas Summers-Stay, Dhruv Batra, Devi Parikh
Commonly Uncommon: Semantic Sparsity in Situation Recognition
Mark Yatskar, Vicente Ordonez, Luke Zettlemoyer, Ali Farhadi
Cross-Modality Binary Code Learning via Fusion Similarity Hashing
Hong Liu, Rongrong Ji, Yongjian Wu, Feiyue Huang, Baochang Zhang
Theory
Saliency Revisited: Analysis of Mouse Movements Versus Fixations
Hamed R. Tavakoli, Fawad Ahmed, Ali Borji, Jorma Laaksonen
InterpoNet, a Brain Inspired Neural Network for Optical Flow Dense Interpolation
Shay Zweig, Lior Wolf
Video Analytics
SST: Single-Stream Temporal Action Proposals
Shyamal Buch, Victor Escorcia, Chuanqi Shen, Bernard Ghanem, Juan Carlos Niebles
Video Segmentation via Multiple Granularity Analysis
Rui Yang, Bingbing Ni, Chao Ma, Yi Xu, Xiaokang Yang
Spatio-Temporal Alignment of Non-Overlapping Sequences From Independently Panning Cameras
Seyed Morteza Safdarnejad, Xiaoming Liu
UntrimmedNets for Weakly Supervised Action Recognition and Detection
Limin Wang, Yuanjun Xiong, Dahua Lin, Luc Van Gool
Spotlight 4-2A
Gaze Embeddings for Zero-Shot Image Classification
Nour Karessli, Zeynep Akata, Bernt Schiele, Andreas Bulling
What's in a Question: Using Visual Questions as a Form of Supervision
Siddha Ganju, Olga Russakovsky, Abhinav Gupta
Attend to You: Personalized Image Captioning With Context Sequence Memory Networks
Cesc Chunseong Park, Byeongchang Kim, Gunhee Kim
Adversarially Tuned Scene Generation
VSR Veeravasarapu, Constantin Rothkopf, Ramesh Visvanathan
Residual Attention Network for Image Classification
Fei Wang, Mengqing Jiang, Chen Qian, Shuo Yang, Cheng Li, Honggang Zhang, Xiaogang Wang, Xiaoou Tang
Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade
Xiaoxiao Li, Ziwei Liu, Ping Luo, Chen Change Loy, Xiaoou Tang
Learning Non-Maximum Suppression
Jan Hosang, Rodrigo Benenson, Bernt Schiele
The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives
Mohit Iyyer, Varun Manjunatha, Anupam Guha, Yogarshi Vyas, Jordan Boyd-Graber, Hal Daumé III, Larry S. Davis
Oral 4-2A
Object Region Mining With Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach
Yunchao Wei, Jiashi Feng, Xiaodan Liang, Ming-Ming Cheng, Yao Zhao, Shuicheng Yan
Fine-Grained Recognition as HSnet Search for Informative Image Parts
Michael Lam, Behrooz Mahasseni, Sinisa Todorovic
G2DeNet: Global Gaussian Distribution Embedding Network and Its Application to Visual Recognition
Qilong Wang, Peihua Li, Lei Zhang
YOLO9000: Better, Faster, Stronger
Joseph Redmon, Ali Farhadi
Spotlight 4-2B
Multi-View 3D Object Detection Network for Autonomous Driving
Xiaozhi Chen, Huimin Ma, Ji Wan, Bo Li, Tian Xia
UltraStereo: Efficient Learning-Based Matching for Active Stereo Systems
Sean Ryan Fanello, Julien Valentin, Christoph Rhemann, Adarsh Kowdle, Vladimir Tankovich, Philip Davidson, Shahram Izadi
Shape Completion Using 3D-Encoder-Predictor CNNs and Shape Synthesis
Angela Dai, Charles Ruizhongtai Qi, Matthias Nießner
Geometric Loss Functions for Camera Pose Regression With Deep Learning
Alex Kendall, Roberto Cipolla
CNN-SLAM: Real-Time Dense Monocular SLAM With Learned Depth Prediction
Keisuke Tateno, Federico Tombari, Iro Laina, Nassir Navab
Learning From Noisy Large-Scale Datasets With Minimal Supervision
Andreas Veit, Neil Alldrin, Gal Chechik, Ivan Krasin, Abhinav Gupta, Serge Belongie
SyncSpecCNN: Synchronized Spectral CNN for 3D Shape Segmentation
Li Yi, Hao Su, Xingwen Guo, Leonidas J. Guibas
Non-Local Deep Features for Salient Object Detection
Zhiming Luo, Akshaya Mishra, Andrew Achkar, Justin Eichel, Shaozi Li, Pierre-Marc Jodoin
Oral 4-2B
Unsupervised Monocular Depth Estimation With Left-Right Consistency
Clément Godard, Oisin Mac Aodha, Gabriel J. Brostow
Unsupervised Learning of Depth and Ego-Motion From Video
Tinghui Zhou, Matthew Brown, Noah Snavely, David G. Lowe
OctNet: Learning Deep 3D Representations at High Resolutions
Gernot Riegler, Ali Osman Ulusoy, Andreas Geiger
3D Shape Segmentation With Projective Convolutional Networks
Evangelos Kalogerakis, Melinos Averkiou, Subhransu Maji, Siddhartha Chaudhuri
3D Computer Vision
SGM-Nets: Semi-Global Matching With Neural Networks
Akihito Seki, Marc Pollefeys
Stereo-Based 3D Reconstruction of Dynamic Fluid Surfaces by Global Optimization
Yiming Qian, Minglun Gong, Yee-Hong Yang
Fine-To-Coarse Global Registration of RGB-D Scans
Maciej Halber, Thomas Funkhouser
Analyzing Computer Vision Data - The Good, the Bad and the Ugly
Oliver Zendel, Katrin Honauer, Markus Murschitz, Martin Humenberger, Gustavo Fernández DomÃnguez
Product Manifold Filter: Non-Rigid Shape Correspondence via Kernel Density Estimation in the Product Space
Matthias Vestner, Roee Litman, Emanuele Rodolà , Alex Bronstein, Daniel Cremers
Unsupervised Vanishing Point Detection and Camera Calibration From a Single Manhattan Image With Radial Distortion
Michel Antunes, João P. Barreto, Djamila Aouada, Björn Ottersten
Toroidal Constraints for Two-Point Localization Under High Outlier Ratios
Federico Camposeco, Torsten Sattler, Andrea Cohen, Andreas Geiger, Marc Pollefeys
4D Light Field Superpixel and Segmentation
Hao Zhu, Qi Zhang, Qing Wang
Exploiting Symmetry and/or Manhattan Properties for 3D Object Structure Estimation From Single and Multiple Images
Yuan Gao, Alan L. Yuille
Analyzing Humans in Images
Binary Coding for Partial Action Analysis With Limited Observation Ratios
Jie Qin, Li Liu, Ling Shao, Bingbing Ni, Chen Chen, Fumin Shen, Yunhong Wang
SphereFace: Deep Hypersphere Embedding for Face Recognition
Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, Le Song
IRINA: Iris Recognition (Even) in Inaccurately Segmented Data
Hugo Proença, João C. Neves
Look Into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing
Ke Gong, Xiaodan Liang, Dongyu Zhang, Xiaohui Shen, Liang Lin
Action Unit Detection With Region Adaptation, Multi-Labeling Learning and Optimal Temporal Fusing
Wei Li, Farnaz Abtahi, Zhigang Zhu
See the Forest for the Trees: Joint Spatial and Temporal Recurrent Neural Networks for Video-Based Person Re-Identification
Zhen Zhou, Yan Huang, Wei Wang, Liang Wang, Tieniu Tan
Joint Intensity and Spatial Metric Learning for Robust Gait Recognition
Yasushi Makihara, Atsuyuki Suzuki, Daigo Muramatsu, Xiang Li, Yasushi Yagi
Pose-Aware Person Recognition
Vijay Kumar, Anoop Namboodiri, Manohar Paluri, C. V. Jawahar
Not Afraid of the Dark: NIR-VIS Face Recognition via Cross-Spectral Hallucination and Low-Rank Embedding
José Lezama, Qiang Qiu, Guillermo Sapiro
Applications
Jointly Learning Energy Expenditures and Activities Using Egocentric Multimodal Signals
Katsuyuki Nakamura, Serena Yeung, Alexandre Alahi, Li Fei-Fei
Binarized Mode Seeking for Scalable Visual Pattern Discovery
Wei Zhang, Xiaochun Cao, Rui Wang, Yuanfang Guo, Zhineng Chen
Scribbler: Controlling Deep Image Synthesis With Sketch and Color
Patsorn Sangkloy, Jingwan Lu, Chen Fang, Fisher Yu, James Hays
Biomedical Image/Video Analysis
Multi-Way Multi-Level Kernel Modeling for Neuroimaging Classification
Lifang He, Chun-Ta Lu, Hao Ding, Shen Wang, Linlin Shen, Philip S. Yu, Ann B. Ragin
WSISA: Making Survival Prediction From Whole Slide Histopathological Images
Xinliang Zhu, Jiawen Yao, Feiyun Zhu, Junzhou Huang
Computational Photography
On the Effectiveness of Visible Watermarks
Tali Dekel, Michael Rubinstein, Ce Liu, William T. Freeman
Snapshot Hyperspectral Light Field Imaging
Zhiwei Xiong, Lizhi Wang, Huiqun Li, Dong Liu, Feng Wu
Semantic Image Inpainting With Deep Generative Models
Raymond A. Yeh, Chen Chen, Teck Yian Lim, Alexander G. Schwing, Mark Hasegawa-Johnson, Minh N. Do
Image Motion & Tracking
Fast Multi-Frame Stereo Scene Flow With Motion Segmentation
Tatsunori Taniai, Sudipta N. Sinha, Yoichi Sato
Improved Stereo Matching With Constant Highway Networks and Reflective Confidence Learning
Amit Shaked, Lior Wolf
Optical Flow in Mostly Rigid Scenes
Jonas Wulff, Laura Sevilla-Lara, Michael J. Black
Optical Flow Requires Multiple Strategies (but Only One Network) (PDF,code)
Tal Schuster, Lior Wolf, David Gadot
ECO: Efficient Convolution Operators for Tracking
Martin Danelljan, Goutam Bhat, Fahad Shahbaz Khan, Michael Felsberg
Low- & Mid-Level Vision
Differential Angular Imaging for Material Recognition
Jia Xue, Hang Zhang, Kristin Dana, Ko Nishino
Fast Fourier Color Constancy
Jonathan T. Barron, Yun-Ta Tsai
Comparative Evaluation of Hand-Crafted and Learned Local Features
Johannes L. Schönberger, Hans Hardmeier, Torsten Sattler, Marc Pollefeys
Learning Fully Convolutional Networks for Iterative Non-Blind Deconvolution
Jiawei Zhang, Jinshan Pan, Wei-Sheng Lai, Rynson W. H. Lau, Ming-Hsuan Yang
Image Deblurring via Extreme Channels Prior
Yanyang Yan, Wenqi Ren, Yuanfang Guo, Rui Wang, Xiaochun Cao
Simultaneous Stereo Video Deblurring and Scene Flow Estimation
Liyuan Pan, Yuchao Dai, Miaomiao Liu, Fatih Porikli
Deep Photo Style Transfer
Fujun Luan, Sylvain Paris, Eli Shechtman, Kavita Bala
Generative Attribute Controller With Conditional Filtered Generative Adversarial Networks
Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino
Fast Haze Removal for Nighttime Image Using Maximum Reflectance Prior
Jing Zhang, Yang Cao, Shuai Fang, Yu Kang, Chang Wen Chen
Machine Learning
Low-Rank Bilinear Pooling for Fine-Grained Classification
Shu Kong, Charless Fowlkes
Neural Scene De-Rendering
Jiajun Wu, Joshua B. Tenenbaum, Pushmeet Kohli
Real-Time Neural Style Transfer for Videos
Haozhi Huang, Hao Wang, Wenhan Luo, Lin Ma, Wenhao Jiang, Xiaolong Zhu, Zhifeng Li, Wei Liu
A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning
Shijie Yang, Liang Li, Shuhui Wang, Weigang Zhang, Qingming Huang
Collaborative Deep Reinforcement Learning for Joint Object Search
Xiangyu Kong, Bo Xin, Yizhou Wang, Gang Hua
Loss Max-Pooling for Semantic Image Segmentation
Samuel Rota Bulò, Gerhard Neuhold, Peter Kontschieder
Deep View Morphing
Dinghuang Ji, Junghyun Kwon, Max McFarland, Silvio Savarese
Unsupervised Learning of Long-Term Motion Dynamics for Videos
Zelun Luo, Boya Peng, De-An Huang, Alexandre Alahi, Li Fei-Fei
Revisiting Metric Learning for SPD Matrix Based Visual Representation
Luping Zhou, Lei Wang, Jianjia Zhang, Yinghuan Shi, Yang Gao
Expert Gate: Lifelong Learning With a Network of Experts
Rahaf Aljundi, Punarjay Chakravarty, Tinne Tuytelaars
A Gift From Knowledge Distillation: Fast Optimization, Network Minimization and Transfer Learning
Junho Yim, Donggyu Joo, Jihoon Bae, Junmo Kim
Domain Adaptation by Mixture of Alignments of Second- or Higher-Order Scatter Tensors
Piotr Koniusz, Yusuf Tas, Fatih Porikli
Deep Mixture of Linear Inverse Regressions Applied to Head-Pose Estimation
Stéphane Lathuilière, Rémi Juge, Pablo Mesejo, Rafael Muñoz-Salinas, Radu Horaud
STD2P: RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling
Yang He, Wei-Chen Chiu, Margret Keuper, Mario Fritz
Harmonic Networks: Deep Translation and Rotation Equivariance
Daniel E. Worrall, Stephan J. Garbin, Daniyar Turmukhambetov, Gabriel J. Brostow
Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer
Xin Wang, Geoffrey Oxholm, Da Zhang, Yuan-Fang Wang
Detect, Replace, Refine: Deep Structured Prediction for Pixel Wise Labeling
Spyros Gidaris, Nikos Komodakis
Weighted-Entropy-Based Quantization for Deep Neural Networks
Eunhyeok Park, Junwhan Ahn, Sungjoo Yoo
Residual Expansion Algorithm: Fast and Effective Optimization for Nonconvex Least Squares Problems
Daiki Ikami, Toshihiko Yamasaki, Kiyoharu Aizawa
Bidirectional Beam Search: Forward-Backward Inference in Neural Sequence Models for Fill-In-The-Blank Image Captioning
Qing Sun, Stefan Lee, Dhruv Batra
Newton-Type Methods for Inference in Higher-Order Markov Random Fields
Hariprasad Kannan, Nikos Komodakis, Nikos Paragios
Adaptive Relaxed ADMM: Convergence Theory and Practical Implementation
Zheng Xu, Mário A. T. Figueiredo, Xiaoming Yuan, Christoph Studer, Tom Goldstein
Object Recognition & Scene Understanding
ViP-CNN: Visual Phrase Guided Convolutional Neural Network
Yikang Li, Wanli Ouyang, Xiaogang Wang, Xiao'ou Tang
Instance-Aware Image and Sentence Matching With Selective Multimodal LSTM
Yan Huang, Wei Wang, Liang Wang
Kernel Square-Loss Exemplar Machines for Image Retrieval
Rafael S. Rezende, Joaquin Zepeda, Jean Ponce, Francis Bach, Patrick Pérez
Cognitive Mapping and Planning for Visual Navigation
Saurabh Gupta, James Davidson, Sergey Levine, Rahul Sukthankar, Jitendra Malik
Combining Bottom-Up, Top-Down, and Smoothness Cues for Weakly Supervised Image Segmentation
Anirban Roy, Sinisa Todorovic
Seeing Into Darkness: Scotopic Visual Recognition
Bo Chen, Pietro Perona
Deep Co-Occurrence Feature Learning for Visual Object Recognition
Ya-Fang Shih, Yang-Ming Yeh, Yen-Yu Lin, Ming-Fang Weng, Yi-Chang Lu, Yung-Yu Chuang
An Empirical Evaluation of Visual Question Answering for Novel Objects
Santhosh K. Ramakrishnan, Ambar Pal, Gaurav Sharma, Anurag Mittal
InstanceCut: From Edges to Instances With MultiCut
Alexander Kirillov, Evgeny Levinkov, Bjoern Andres, Bogdan Savchynskyy, Carsten Rother
Fine-Grained Image Classification via Combining Vision and Language
Xiangteng He, Yuxin Peng
Mimicking Very Efficient Network for Object Detection
Quanquan Li, Shengying Jin, Junjie Yan
Tracking by Natural Language Specification
Zhenyang Li, Ran Tao, Efstratios Gavves, Cees G. M. Snoek, Arnold W.M. Smeulders
A Dataset and Exploration of Models for Understanding Video Data Through Fill-In-The-Blank Question-Answering
Tegan Maharaj, Nicolas Ballas, Anna Rohrbach, Aaron Courville, Christopher Pal
Learning Detection With Diverse Proposals
Samaneh Azadi, Jiashi Feng, Trevor Darrell
Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition
Yufei Wang, Zhe Lin, Xiaohui Shen, Scott Cohen, Garrison W. Cottrell
Theory
A Low Power, Fully Event-Based Gesture Recognition System
Arnon Amir, Brian Taba, David Berg, Timothy Melano, Jeffrey McKinstry, Carmelo Di Nolfo, Tapan Nayak, Alexander Andreopoulos, Guillaume Garreau, Marcela Mendoza, Jeff Kusnitz, Michael Debole, Steve Esser, Tobi Delbruck, Myron Flickner, Dharmendra Modha
Video Analytics
Learning Deep Context-Aware Features Over Body and Latent Parts for Person Re-Identification
Dangwei Li, Xiaotang Chen, Zhang Zhang, Kaiqi Huang
Recurrent Modeling of Interaction Context for Collective Activity Recognition
Minsi Wang, Bingbing Ni, Xiaokang Yang
Primary Object Segmentation in Videos Based on Region Augmentation and Reduction
Yeong Jun Koh, Chang-Su Kim
ROAM: A Rich Object Appearance Model With Application to Rotoscoping
Ondrej Miksik, Juan-Manuel Pérez-Rúa, Philip H. S. Torr, Patrick Pérez
Temporal Residual Networks for Dynamic Scene Recognition
Christoph Feichtenhofer, Axel Pinz, Richard P. Wildes
Spatiotemporal Multiplier Networks for Video Action Recognition
Christoph Feichtenhofer, Axel Pinz, Richard P. Wildes
Learning to Learn From Noisy Web Videos
Serena Yeung, Vignesh Ramanathan, Olga Russakovsky, Liyue Shen, Greg Mori, Li Fei-Fei
YouTube-BoundingBoxes: A Large High-Precision Human-Annotated Data Set for Object Detection in Video
Esteban Real, Jonathon Shlens, Stefano Mazzocchi, Xin Pan, Vincent Vanhoucke
Online Video Object Segmentation via Convolutional Trident Network
Won-Dong Jang, Chang-Su Kim