CVPR2017論文匯總

CVPR2017论文
作为计算机视觉领域的三大顶级会议之一,CVPR 2017 又收录了很多优秀的文章。具体可参见 CVPR 的论文官网:http://www.cvpapers.com/cvpr2017.html

Machine Learning 1 (机器学习)

Spotlight 1-1A  (关注的焦点 1-1 A)

Exclusivity-Consistency Regularized Multi-View Subspace Clustering
    Xiaojie Guo, Xiaobo Wang, Zhen Lei, Changqing Zhang, Stan Z. Li
Borrowing Treasures From the Wealthy: Deep Transfer Learning Through Selective Joint Fine-Tuning
    Weifeng Ge, Yizhou Yu
The More You Know: Using Knowledge Graphs for Image Classification
    Kenneth Marino, Ruslan Salakhutdinov, Abhinav Gupta
Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs
    Martin Simonovsky, Nikos Komodakis
Convolutional Neural Network Architecture for Geometric Matching
    Ignacio Rocco, Relja Arandjelović, Josef Sivic
Deep Affordance-Grounded Sensorimotor Object Recognition
    Spyridon Thermos, Georgios Th. Papadopoulos, Petros Daras, Gerasimos Potamianos
Discovering Causal Signals in Images
    David Lopez-Paz, Robert Nishihara, Soumith Chintala, Bernhard Schölkopf, Léon Bottou
On Compressing Deep Models by Low Rank and Sparse Decomposition
    Xiyu Yu, Tongliang Liu, Xinchao Wang, Dacheng Tao

Oral 1-1A  (口头汇报 1-1A)

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

Charles R. Qi, Hao Su, Kaichun Mo, Leonidas J. Guibas

Universal Adversarial Perturbations

Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi, Omar Fawzi, Pascal Frossard

Unsupervised Pixel-Level Domain Adaptation With Generative Adversarial Networks

Konstantinos Bousmalis, Nathan Silberman, David Dohan, Dumitru Erhan, Dilip Krishnan

Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network (PDF,code)

Christian Ledig, Lucas Theis, Ferenc Huszár, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, Wenzhe Shi

3D Vision 1(三维视觉)

Spotlight 1-1B(关注的焦点 1-1 B)

Context-Aware Captions From Context-Agnostic Supervision

Ramakrishna Vedantam, Samy Bengio, Kevin Murphy, Devi Parikh, Gal Chechik

Global Hypothesis Generation for 6D Object Pose Estimation (PDF)

Frank Michel, Alexander Kirillov, Eric Brachmann, Alexander Krull, Stefan Gumhold, Bogdan Savchynskyy, Carsten Rother

A Practical Method for Fully Automatic Intrinsic Camera Calibration Using Directionally Encoded Light

Mahdi Abbaspour Tehrani, Thabo Beeler, Anselm Grundhöfer

CATS: A Color and Thermal Stereo Benchmark

Wayne Treible, Philip Saponaro, Scott Sorensen, Abhishek Kolagunda, Michael O'Neal, Brian Phelan, Kelly Sherbondy, Chandra Kambhamettu

Elastic Shape-From-Template With Spatially Sparse Deforming Forces

Abed Malti, Cédric Herzet

Distinguishing the Indistinguishable: Exploring Structural Ambiguities via Geodesic Context

Qingan Yan, Long Yang, Ling Zhang, Chunxia Xiao

Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation

Dan Xu, Elisa Ricci, Wanli Ouyang, Xiaogang Wang, Nicu Sebe

Dynamic Time-Of-Flight

Michael Schober, Amit Adam, Omer Yair, Shai Mazor, Sebastian Nowozin

Oral 1-1B

Semantic Scene Completion From a Single Depth Image

Shuran Song, Fisher Yu, Andy Zeng, Angel X. Chang, Manolis Savva, Thomas Funkhouser

3DMatch: Learning Local Geometric Descriptors From RGB-D Reconstructions

Andy Zeng, Shuran Song, Matthias Nießner, Matthew Fisher, Jianxiong Xiao, Thomas Funkhouser

Multi-View Supervision for Single-View Reconstruction via Differentiable Ray Consistency (PDF,project,code)

Shubham Tulsiani,Tinghui Zhou,Alexei A. Efros,Jitendra Malik

On-The-Fly Adaptation of Regression Forests for Online Camera Relocalisation (PDF)

Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien Valentin, Luigi Di Stefano,Philip H. S. Torr

Low- & Mid-Level Vision

Spotlight 1-1C

Designing Effective Inter-Pixel Information Flow for Natural Image Matting

Yağiz Aksoy, Tunç Ozan Aydin, Marc Pollefeys

Deep Video Deblurring for Hand-Held Cameras

Shuochen Su, Mauricio Delbracio, Jue Wang, Guillermo Sapiro, Wolfgang Heidrich, Oliver Wang

Instance-Level Salient Object Segmentation

Guanbin Li, Yuan Xie, Liang Lin, Yizhou Yu

Deep Multi-Scale Convolutional Neural Network for Dynamic Scene Deblurring

Seungjun Nah, Tae Hyun Kim, Kyoung Mu Lee

Diversified Texture Synthesis With Feed-Forward Networks

Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang

Radiometric Calibration for Internet Photo Collections (PDF)

Zhipeng Mo, Boxin Shi, Sai-Kit Yeung, Yasuyuki Matsushita

Deeply Aggregated Alternating Minimization for Image Restoration

Youngjung Kim, Hyungjoo Jung, Dongbo Min, Kwanghoon Sohn

  • 物体检测

End-To-End Instance Segmentation With Recurrent Attention

Mengye Ren, Richard S. Zemel

分类:实例物体分割

简要:使用端到端的递归神经网络进行实例物体分割.本文针对实例分割使用递归神经网络(RNN)架构将每个物体依次定位分割出来,使用了一个注意机制模型类似人类的计算过程。

Oral 1-1C

SRN: Side-output Residual Network for Object Symmetry Detection in the Wild

Wei Ke, Jie Chen, Jianbin Jiao, Guoying Zhao, Qixiang Ye

Deep Image Matting (PDF,abstract)

Ning Xu, Brian Price, Scott Cohen, Thomas Huang

Wetness and Color From a Single Multispectral Image

Mihoko Shimano, Hiroki Okawa, Yuta Asano, Ryoma Bise, Ko Nishino, Imari Sato

FC4: Fully Convolutional Color Constancy With Confidence-Weighted Pooling

Yuanming Hu, Baoyuan Wang, Stephen Lin

Poster 1-1

3D Computer Vision

Face Normals “In-The-Wild†Using Fully Convolutional Networks

George Trigeorgis, Patrick Snape, Iasonas Kokkinos, Stefanos Zafeiriou

A Non-Convex Variational Approach to Photometric Stereo Under Inaccurate Lighting

Yvain Quéau, Tao Wu, François Lauze, Jean-Denis Durou, Daniel Cremers

A Linear Extrinsic Calibration of Kaleidoscopic Imaging System From Single 3D Point

Kosuke Takahashi, Akihiro Miyata, Shohei Nobuhara, Takashi Matsuyama

Polarimetric Multi-View Stereo

Zhaopeng Cui, Jinwei Gu, Boxin Shi, Ping Tan, Jan Kautz

An Exact Penalty Method for Locally Convergent Maximum Consensus (PDF,code)

Huu Le, Tat-Jun Chin, David Suter

Deep Supervision With Shape Concepts for Occlusion-Aware 3D Object Parsing

Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, Gregory D. Hager, Manmohan Chandraker

Amodal Detection of 3D Objects: Inferring 3D Bounding Boxes From 2D Ones in RGB-Depth Images

Zhuo Deng, Longin Jan Latecki

Analyzing Humans in Images

Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection

Guillermo Garcia-Hernando, Tae-Kyun Kim

Scene Flow to Action Map: A New Representation for RGB-D Based Action Recognition With Convolutional Neural Networks

Pichao Wang, Wanqing Li, Zhimin Gao, Yuyao Zhang, Chang Tang, Philip Ogunbona

Detecting Masked Faces in the Wild With LLE-CNNs

Shiming Ge, Jia Li, Qiting Ye, Zhao Luo

A Domain Based Approach to Social Relation Recognition

Qianru Sun, Bernt Schiele, Mario Fritz

Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition

Junwu Weng, Chaoqun Weng, Junsong Yuan

Personalizing Gesture Recognition Using Hierarchical Bayesian Neural Networks

Ajjen Joshi, Soumya Ghosh, Margrit Betke, Stan Sclaroff, Hanspeter Pfister

Applications

Real-Time 3D Model Tracking in Color and Depth on a Single CPU Core

Wadim Kehl, Federico Tombari, Slobodan Ilic, Nassir Navab

Multi-Scale FCN With Cascaded Instance Aware Segmentation for Arbitrary Oriented Word Spotting in the Wild

Dafang He, Xiao Yang, Chen Liang, Zihan Zhou, Alexander G. Ororbi II, Daniel Kifer, C. Lee Giles

Viraliency: Pooling Local Virality

Xavier Alameda-Pineda, Andrea Pilzer, Dan Xu, Nicu Sebe, Elisa Ricci

Biomedical Image/Video Analysis

A Non-Local Low-Rank Framework for Ultrasound Speckle Reduction

Lei Zhu, Chi-Wing Fu, Michael S. Brown, Pheng-Ann Heng

Image Motion & Tracking

Video Acceleration Magnification

Silvia L. Pintea, Yichao Zhang, Jan C. van Gemert

Superpixel-Based Tracking-By-Segmentation Using Markov Chains

Donghun Yeo, Jeany Son, Bohyung Han, Joon Hee Han

BranchOut: Regularization for Online Ensemble Tracking With Convolutional Neural Networks

Bohyung Han, Jack Sim, Hartwig Adam

Learning Motion Patterns in Videos

Pavel Tokmakov, Karteek Alahari, Cordelia Schmid

Low- & Mid-Level Vision

Deep Level Sets for Salient Object Detection

Ping Hu, Bing Shuai, Jun Liu, Gang Wang

Binary Constraint Preserving Graph Matching

Bo Jiang, Jin Tang, Chris Ding, Bin Luo

From Local to Global: Edge Profiles to Camera Motion in Blurred Images

Subeesh Vasu, A. N. Rajagopalan

What Is the Space of Attenuation Coefficients in Underwater Computer Vision?

Derya Akkaynak, Tali Treibitz, Tom Shlesinger, Yossi Loya, Raz Tamir, David Iluz

Robust Energy Minimization for BRDF-Invariant Shape From Light Fields

Zhengqin Li, Zexiang Xu, Ravi Ramamoorthi, Manmohan Chandraker

Boundary-Aware Instance Segmentation

Zeeshan Hayder, Xuming He, Mathieu Salzmann

Spatially-Varying Blur Detection Based on Multiscale Fused and Sorted Transform Coefficients of Gradient Magnitudes

S. Alireza Golestaneh, Lina J. Karam

Model-Based Iterative Restoration for Binary Document Image Compression With Dictionary Learning

Yandong Guo, Cheng Lu, Jan P. Allebach, Charles A. Bouman

FCSS: Fully Convolutional Self-Similarity for Dense Semantic Correspondence

Seungryong Kim, Dongbo Min, Bumsub Ham, Sangryul Jeon, Stephen Lin, Kwanghoon Sohn

Machine Learning

Learning by Association — A Versatile Semi-Supervised Training Method for Neural Networks

Philip Haeusser, Alexander Mordvintsev, Daniel Cremers

Dilated Residual Networks

Fisher Yu, Vladlen Koltun, Thomas Funkhouser

Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction

Richard Zhang, Phillip Isola, Alexei A. Efros

Nonnegative Matrix Underapproximation for Robust Multiple Model Fitting

Mariano Tepper, Guillermo Sapiro

Truncated Max-Of-Convex Models

Pankaj Pansari, M. Pawan Kumar

Additive Component Analysis

Calvin Murdock, Fernando De la Torre

Subspace Clustering via Variance Regularized Ridge Regression

Zhao Kang, Chong Peng, Qiang Cheng

The Incremental Multiresolution Matrix Factorization Algorithm

Vamsi K. Ithapu, Risi Kondor, Sterling C. Johnson, Vikas Singh

Transformation-Grounded Image Generation Network for Novel 3D View Synthesis

Eunbyung Park, Jimei Yang, Ersin Yumer, Duygu Ceylan, Alexander C. Berg

Learning Dynamic Guidance for Depth Image Enhancement (PDF)

Shuhang Gu, Wangmeng Zuo, Shi Guo, Yunjin Chen, Chongyu Chen, Lei Zhang

A-Lamp: Adaptive Layout-Aware Multi-Patch Deep Convolutional Neural Network for Photo Aesthetic Assessment (PDF)

Shuang Ma, Jing Liu, Chang Wen Chen

Teaching Compositionality to CNNs

Austin Stone, Huayan Wang, Michael Stark, Yi Liu, D. Scott Phoenix, Dileep George

Using Ranking-CNN for Age Estimation

Shixing Chen, Caojin Zhang, Ming Dong, Jialiang Le, Mike Rao

Accurate Single Stage Detector Using Recurrent Rolling Convolution

Jimmy Ren, Xiaohao Chen, Jianbo Liu, Wenxiu Sun, Jiahao Pang, Qiong Yan, Yu-Wing Tai, Li Xu

A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation

Chunpeng Wu, Wei Wen, Tariq Afzal, Yongmei Zhang, Yiran Chen, Hai (Helen) Li

The Impact of Typicality for Informative Representative Selection

Jawadul H. Bappy, Sujoy Paul, Ertem Tuncel, Amit K. Roy-Chowdhury

Infinite Variational Autoencoder for Semi-Supervised Learning

M. Ehsan Abbasnejad, Anthony Dick, Anton van den Hengel

SurfNet: Generating 3D Shape Surfaces Using Deep Residual Networks

Ayan Sinha, Asim Unmesh, Qixing Huang, Karthik Ramani

Intrinsic Grassmann Averages for Online Linear and Robust Subspace Learning

Rudrasis Chakraborty, Søren Hauberg, Baba C. Vemuri

Variational Bayesian Multiple Instance Learning With Gaussian Processes

Manuel Haußmann, Fred A. Hamprecht, Melih Kandemir

Temporal Attention-Gated Model for Robust Sequence Classification

Wenjie Pei, Tadas Baltrušaitis, David M.J. Tax, Louis-Philippe Morency

Non-Uniform Subset Selection for Active Learning in Structured Data

Sujoy Paul, Jawadul H. Bappy, Amit K. Roy-Chowdhury

Colorization as a Proxy Task for Visual Understanding

Gustav Larsson, Michael Maire, Gregory Shakhnarovich

Shading Annotations in the Wild

Balazs Kovacs, Sean Bell, Noah Snavely, Kavita Bala

LCNN: Lookup-Based Convolutional Neural Network

Hessam Bagherinezhad, Mohammad Rastegari, Ali Farhadi

Object Recognition & Scene Understanding

Physics Inspired Optimization on Semantic Transfer Features: An Alternative Method for Room Layout Estimation

Hao Zhao, Ming Lu, Anbang Yao, Yiwen Guo, Yurong Chen, Li Zhang

Pixelwise Instance Segmentation With a Dynamically Instantiated Network

Anurag Arnab, Philip H. S. Torr

Object Detection in Videos With Tubelet Proposal Networks

Kai Kang, Hongsheng Li, Tong Xiao, Wanli Ouyang, Junjie Yan, Xihui Liu, Xiaogang Wang

AMVH: Asymmetric Multi-Valued Hashing

Cheng Da, Shibiao Xu, Kun Ding, Gaofeng Meng, Shiming Xiang, Chunhong Pan

Spindle Net: Person Re-Identification With Human Body Region Guided Feature Decomposition and Fusion

Haiyu Zhao, Maoqing Tian, Shuyang Sun, Jing Shao, Junjie Yan, Shuai Yi, Xiaogang Wang, Xiaoou Tang

Deep Visual-Semantic Quantization for Efficient Image Retrieval

Yue Cao, Mingsheng Long, Jianmin Wang, Shichen Liu

Efficient Diffusion on Region Manifolds: Recovering Small Objects With Compact CNN Representations

Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Teddy Furon, Ondřej Chum

Feature Pyramid Networks for Object Detection

Tsung-Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, Serge Belongie

Mind the Class Weight Bias: Weighted Maximum Mean Discrepancy for Unsupervised Domain Adaptation

Hongliang Yan, Yukang Ding, Peihua Li, Qilong Wang, Yong Xu, Wangmeng Zuo

StyleNet: Generating Attractive Visual Captions With Styles

Chuang Gan, Zhe Gan, Xiaodong He, Jianfeng Gao, Li Deng

Fine-Grained Recognition of Thousands of Object Categories With Single-Example Training

Leonid Karlinsky, Joseph Shtok, Yochay Tzur, Asaf Tzadok

Improving Interpretability of Deep Neural Networks With Semantic Information

Yinpeng Dong, Hang Su, Jun Zhu, Bo Zhang

Video Captioning With Transferred Semantic Attributes

Yingwei Pan, Ting Yao, Houqiang Li, Tao Mei

Fast Boosting Based Detection Using Scale Invariant Multimodal Multiresolution Filtered Features

Arthur Daniel Costea, Robert Varga, Sergiu Nedevschi

  • Video Analytics監控視頻

Temporal Convolutional Networks for Action Segmentation and Detection

Colin Lea, Michael D. Flynn, René Vidal, Austin Reiter, Gregory D. Hager

Surveillance Video Parsing With Single Frame Supervision

Si Liu, Changhu Wang, Ruihe Qian, Han Yu, Renda Bao, Yao Sun

簡要:监视视频解析,将视频帧分成多个标签,即脸,裤子,左腿,有广泛的应用。

Weakly Supervised Actor-Action Segmentation via Robust Multi-Task Ranking

Yan Yan, Chenliang Xu, Dawen Cai, Jason J. Corso

Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos

De-An Huang, Joseph J. Lim, Li Fei-Fei, Juan Carlos Niebles

Zero-Shot Action Recognition With Error-Correcting Output Codes

Jie Qin, Li Liu, Ling Shao, Fumin Shen, Bingbing Ni, Jiaxin Chen, Yunhong Wang

Enhancing Video Summarization via Vision-Language Embedding

Bryan A. Plummer, Matthew Brown, Svetlana Lazebnik

Synthesizing Dynamic Patterns by Spatial-Temporal Generative ConvNet

Jianwen Xie, Song-Chun Zhu, Ying Nian Wu

Object Recognition & Scene Understanding - Computer Vision & Language

Discriminative Bimodal Networks for Visual Localization and Detection With Natural Language Queries

Yuting Zhang, Luyao Yuan, Yijie Guo, Zhiyuan He, I-An Huang, Honglak Lee

Automatic Understanding of Image and Video Advertisements

Zaeem Hussain, Mingda Zhang, Xiaozhong Zhang, Keren Ye, Christopher Thomas, Zuha Agha, Nathan Ong, Adriana Kovashka

Deep Sketch Hashing: Fast Free-Hand Sketch-Based Image Retrieval

Li Liu, Fumin Shen, Yuming Shen, Xianglong Liu, Ling Shao

Discover and Learn New Objects From Documentaries

Kai Chen, Hang Song, Chen Change Loy, Dahua Lin

Spatial-Semantic Image Search by Visual Feature Synthesis

Long Mai, Hailin Jin, Zhe Lin, Chen Fang, Jonathan Brandt, Feng Liu

Fully-Adaptive Feature Sharing in Multi-Task Networks With Applications in Person Attribute Classification

Yongxi Lu, Abhishek Kumar, Shuangfei Zhai, Yu Cheng, Tara Javidi, Rogerio Feris

Semantic Compositional Networks for Visual Captioning

Zhe Gan, Chuang Gan, Xiaodong He, Yunchen Pu, Kenneth Tran, Jianfeng Gao, Lawrence Carin, Li Deng

Training Object Class Detectors With Click Supervision

Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari

Oral 1-2A

Deep Reinforcement Learning-Based Image Captioning With Embedding Reward

Zhou Ren, Xiaoyu Wang, Ning Zhang, Xutao Lv, Li-Jia Li

From Red Wine to Red Tomato: Composition With Context

Ishan Misra, Abhinav Gupta, Martial Hebert

Captioning Images With Diverse Objects

Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Raymond Mooney, Trevor Darrell, Kate Saenko

Self-Critical Sequence Training for Image Captioning

Steven J. Rennie, Etienne Marcheret, Youssef Mroueh, Jerret Ross, Vaibhava Goel

Analyzing Humans 1

Spotlight 1-2B

Crossing Nets: Combining GANs and VAEs With a Shared Latent Space for Hand Pose Estimation

Chengde Wan, Thomas Probst, Luc Van Gool, Angela Yao

Predicting Behaviors of Basketball Players From First Person Videos

Shan Su, Jung Pyo Hong, Jianbo Shi, Hyun Soo Park

LCR-Net: Localization-Classification-Regression for Human Pose

Grégory Rogez, Philippe Weinzaepfel, Cordelia Schmid

Learning Residual Images for Face Attribute Manipulation

Wei Shen, Rujie Liu

Seeing What Is Not There: Learning Context to Determine Where Objects Are Missing

Jin Sun, David W. Jacobs

Deep Learning on Lie Groups for Skeleton-Based Action Recognition

Zhiwu Huang, Chengde Wan, Thomas Probst, Luc Van Gool

Harvesting Multiple Views for Marker-Less 3D Human Pose Annotations

Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis

Coarse-To-Fine Volumetric Prediction for Single-Image 3D Human Pose

Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis

Oral 1-2B

Weakly Supervised Action Learning With RNN Based Fine-To-Coarse Modeling

Alexander Richard, Hilde Kuehne, Juergen Gall

Disentangled Representation Learning GAN for Pose-Invariant Face Recognition

Luan Tran, Xi Yin, Xiaoming Liu

ArtTrack: Articulated Multi-Person Tracking in the Wild

Eldar Insafutdinov, Mykhaylo Andriluka, Leonid Pishchulin, Siyu Tang, Evgeny Levinkov, Bjoern Andres, Bernt Schiele

Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields (PDF,code)

Zhe Cao, Tomas Simon, Shih-En Wei, Yaser Sheikh

Image Motion & Tracking; Video Analysis

Spotlight 1-2C

Template Matching With Deformable Diversity Similarity

Itamar Talmi, Roey Mechrez, Lihi Zelnik-Manor

Beyond Triplet Loss: A Deep Quadruplet Network for Person Re-Identification

Weihua Chen, Xiaotang Chen, Jianguo Zhang, Kaiqi Huang

Agent-Centric Risk Assessment: Accident Anticipation and Risky Region Localization

Kuo-Hao Zeng, Shih-Han Chou, Fu-Hsiang Chan, Juan Carlos Niebles, Min Sun

Bidirectional Multirate Reconstruction for Temporal Modeling in Videos

Linchao Zhu, Zhongwen Xu, Yi Yang

Action-Decision Networks for Visual Tracking With Deep Reinforcement Learning

Sangdoo Yun, Jongwon Choi, Youngjoon Yoo, Kimin Yun, Jin Young Choi

TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering

Yunseok Jang, Yale Song, Youngjae Yu, Youngjin Kim, Gunhee Kim

Making 360° Video Watchable in 2D: Learning Videography for Click Free Viewing

Yu-Chuan Su, Kristen Grauman

Unsupervised Adaptive Re-Identification in Open World Dynamic Camera Networks

Rameswar Panda, Amran Bhuiyan, Vittorio Murino, Amit K. Roy-Chowdhury

Oral 1-2C

Context-Aware Correlation Filter Tracking

Matthias Mueller, Neil Smith, Bernard Ghanem

Deep 360 Pilot: Learning a Deep Agent for Piloting Through 360° Sports Videos

Hou-Ning Hu, Yen-Chen Lin, Ming-Yu Liu, Hsien-Tzu Cheng, Yung-Ju Chang, Min Sun

Slow Flow: Exploiting High-Speed Cameras for Accurate and Diverse Optical Flow Reference Data

Joel Janai, Fatma Güney, Jonas Wulff, Michael J. Black, Andreas Geiger

CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos

Zheng Shou, Jonathan Chan, Alireza Zareian, Kazuyuki Miyazawa, Shih-Fu Chang

Poster 1-2

3D Computer Vision

Exploiting 2D Floorplan for Building-Scale Panorama RGBD Alignment

Erik Wijmans, Yasutaka Furukawa

A Combinatorial Solution to Non-Rigid 3D Shape-To-Image Matching

Florian Bernard, Frank R. Schmidt, Johan Thunberg, Daniel Cremers

NID-SLAM: Robust Monocular SLAM Using Normalised Information Distance

Geoffrey Pascoe, Will Maddern, Michael Tanner, Pedro Piniés, Paul Newman

End-To-End Training of Hybrid CNN-CRF Models for Stereo

Patrick Knöbelreiter, Christian Reinbacher, Alexander Shekhovtsov, Thomas Pock

Learning Shape Abstractions by Assembling Volumetric Primitives (PDF,project,code)

Shubham Tulsiani, Hao Su, Leonidas J. Guibas, Alexei A. Efros, Jitendra Malik

Locality-Sensitive Deconvolution Networks With Gated Fusion for RGB-D Indoor Semantic Segmentation

Yanhua Cheng, Rui Cai, Zhiwei Li, Xin Zhao, Kaiqi Huang

Acquiring Axially-Symmetric Transparent Objects Using Single-View Transmission Imaging (PDF)

Jaewon Kim, Ilya Reshetouski, Abhijeet Ghosh

Regressing Robust and Discriminative 3D Morphable Models With a Very Deep Neural Network

Anh Tuấn Trần, Tal Hassner, Iacopo Masi, Gérard Medioni

End-To-End 3D Face Reconstruction With Deep Neural Networks

Pengfei Dou, Shishir K. Shah, Ioannis A. Kakadiaris

DUST: Dual Union of Spatio-Temporal Subspaces for Monocular Multiple Object 3D Reconstruction

Antonio Agudo, Francesc Moreno-Noguer

Analyzing Humans in Images

Finding Tiny Faces

Peiyun Hu, Deva Ramanan

Dynamic Facial Analysis: From Bayesian Filtering to Recurrent Neural Network

Jinwei Gu, Xiaodong Yang, Shalini De Mello, Jan Kautz

Deep Temporal Linear Encoding Networks

Ali Diba, Vivek Sharma, Luc Van Gool

Joint Registration and Representation Learning for Unconstrained Face Identification (PDF)

Munawar Hayat,Salman H. Khan,Naoufel Werghi, Roland Goecke

3D Human Pose Estimation From a Single Image via Distance Matrix Regression

Francesc Moreno-Noguer

One-Shot Metric Learning for Person Re-Identification

Slawomir BÄ…k, Peter Carr

Generalized Rank Pooling for Activity Recognition

Anoop Cherian, Basura Fernando, Mehrtash Harandi, Stephen Gould

Deep Representation Learning for Human Motion Prediction and Classification

Judith Bütepage, Michael J. Black, Danica Kragic, Hedvig Kjellström

Interspecies Knowledge Transfer for Facial Keypoint Detection

Maheen Rashid, Xiuye Gu, Yong Jae Lee

Recurrent Convolutional Neural Networks for Continuous Sign Language Recognition by Staged Optimization

Runpeng Cui, Hu Liu, Changshui Zhang

Applications

Modeling Sub-Event Dynamics in First-Person Action Recognition

Hasan F. M. Zaki, Faisal Shafait, Ajmal Mian

Computational Photography

Turning an Urban Scene Video Into a Cinemagraph

Hang Yan, Yebin Liu, Yasutaka Furukawa

Light Field Reconstruction Using Deep Convolutional Network on EPI

Gaochang Wu, Mandan Zhao, Liangyong Wang, Qionghai Dai, Tianyou Chai, Yebin Liu

Image Motion & Tracking

FlowNet 2.0: Evolution of Optical Flow Estimation With Deep Networks

Eddy Ilg, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, Alexey Dosovitskiy, Thomas Brox

Low- & Mid-Level Vision

Attention-Aware Face Hallucination via Deep Reinforcement Learning

Qingxing Cao, Liang Lin, Yukai Shi, Xiaodan Liang, Guanbin Li

Simple Does It: Weakly Supervised Instance and Semantic Segmentation

Anna Khoreva, Rodrigo Benenson, Jan Hosang, Matthias Hein, Bernt Schiele

Anti-Glare: Tightly Constrained Optimization for Eyeglass Reflection Removal

Tushar Sandhan, Jin Young Choi

Deep Joint Rain Detection and Removal From a Single Image

Wenhan Yang, Robby T. Tan, Jiashi Feng, Jiaying Liu, Zongming Guo, Shuicheng Yan

Radiometric Calibration From Faces in Images

Chen Li, Stephen Lin, Kun Zhou, Katsushi Ikeuchi

Webly Supervised Semantic Segmentation

Bin Jin, Maria V. Ortiz Segovia, Sabine Süsstrunk

Removing Rain From Single Images via a Deep Detail Network

Xueyang Fu, Jiabin Huang, Delu Zeng, Yue Huang, Xinghao Ding, John Paisley

Deep Crisp Boundaries

Yupei Wang, Xin Zhao, Kaiqi Huang

Coarse-To-Fine Segmentation With Shape-Tailored Continuum Scale Spaces

Naeemullah Khan, Byung-Woo Hong, Anthony Yezzi, Ganesh Sundaramoorthi

Large Kernel Matters — Improve Semantic Segmentation by Global Convolutional Network

Chao Peng, Xiangyu Zhang, Gang Yu, Guiming Luo, Jian Sun

Single Image Reflection Suppression

Nikolaos Arvanitopoulos, Radhakrishna Achanta, Sabine Süsstrunk

CASENet: Deep Category-Aware Semantic Edge Detection

Zhiding Yu, Chen Feng, Ming-Yu Liu, Srikumar Ramalingam

Reflectance Adaptive Filtering Improves Intrinsic Image Estimation

Thomas Nestmeyer, Peter V. Gehler

Machine Learning

Conditional Similarity Networks

Andreas Veit, Serge Belongie, Theofanis Karaletsos

Spatially Adaptive Computation Time for Residual Networks

Michael Figurnov, Maxwell D. Collins, Yukun Zhu, Li Zhang, Jonathan Huang, Dmitry Vetrov, Ruslan Salakhutdinov

Xception: Deep Learning With Depthwise Separable Convolutions

François Chollet

Feedback Networks

Amir R. Zamir, Te-Lin Wu, Lin Sun, William B. Shen, Bertram E. Shi, Jitendra Malik, Silvio Savarese

Online Summarization via Submodular and Convex Optimization

Ehsan Elhamifar, M. Clara De Paolis Kaluza

Deep MANTA: A Coarse-To-Fine Many-Task Network for Joint 2D and 3D Vehicle Analysis From Monocular Image

Florian Chabot, Mohamed Chaouch, Jaonary Rabarisoa, Céline Teulière, Thierry Chateau

Improving Pairwise Ranking for Multi-Label Image Classification

Yuncheng Li, Yale Song, Jiebo Luo

Active Convolution: Learning the Shape of Convolution for Image Classification

Yunho Jeon, Junmo Kim

Linking Image and Text With 2-Way Nets

Aviv Eisenschtat, Lior Wolf

Stacked Generative Adversarial Networks

Xun Huang, Yixuan Li, Omid Poursaeed, John Hopcroft, Serge Belongie

Image Splicing Detection via Camera Response Function Analysis

Can Chen, Scott McCloskey, Jingyi Yu

Building a Regular Decision Boundary With Deep Networks

Edouard Oyallon

More Is Less: A More Complicated Network With Less Inference Complexity

Xuanyi Dong, Junshi Huang, Yi Yang, Shuicheng Yan

Joint Graph Decomposition and Node Labeling: Problem, Algorithms, Applications

Evgeny Levinkov, Jonas Uhrig, Siyu Tang, Mohamed Omran, Eldar Insafutdinov, Alexander Kirillov, Carsten Rother, Thomas Brox, Bernt Schiele, Bjoern Andres

Scale-Aware Face Detection

Zekun Hao, Yu Liu, Hongwei Qin, Junjie Yan, Xiu Li, Xiaolin Hu

Deep Unsupervised Similarity Learning Using Partially Ordered Sets

Miguel A. Bautista, Artsiom Sanakoyeu, Björn Ommer

Generative Hierarchical Learning of Sparse FRAME Models

Jianwen Xie, Yifei Xu, Erik Nijkamp, Ying Nian Wu, Song-Chun Zhu

Object Recognition & Scene Understanding

Generating Holistic 3D Scene Abstractions for Text-Based Image Retrieval

Ang Li, Jin Sun, Joe Yue-Hei Ng, Ruichi Yu, Vlad I. Morariu, Larry S. Davis

Perceptual Generative Adversarial Networks for Small Object Detection

Jianan Li (Group: Work group, Company,... - optional),Xiaodan Liang (Group: Work group, Company,... - optional),Yunchao Wei (Group: Work group, Company,... - optional),Tingfa Xu (Group: Work group, Company,... - optional),Jiashi Feng (Group: Work group, Company,... - optional),Shuicheng Yan (Group: Work group, Company,... - optional)

Emotion Recognition in Context (PDF,supplementary material)

Ronak Kosti,Jose M. Alvarez,Adria Recasens,Agata Lapedriza

Deep Learning of Human Visual Sensitivity in Image Quality Assessment Framework

Jongyoo Kim, Sanghoon Lee

Dense Captioning With Joint Inference and Visual Context

Linjie Yang, Kevin Tang, Jianchao Yang, Li-Jia Li

CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Li Fei-Fei, C. Lawrence Zitnick, Ross Girshick

Cross-View Image Matching for Geo-Localization in Urban Environments

Yicong Tian, Chen Chen, Mubarak Shah

Matrix Tri-Factorization With Manifold Regularizations for Zero-Shot Learning

Xing Xu, Fumin Shen, Yang Yang, Dongxiang Zhang, Heng Tao Shen, Jingkuan Song

Self-Supervised Learning of Visual Features Through Embedding Images Into Text Topic Spaces

Lluis Gomez, Yash Patel, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar

Learning Spatial Regularization With Image-Level Supervisions for Multi-Label Image Classification

Feng Zhu, Hongsheng Li, Wanli Ouyang, Nenghai Yu, Xiaogang Wang

Semantically Consistent Regularization for Zero-Shot Recognition

Pedro Morgado, Nuno Vasconcelos

Can Walking and Measuring Along Chord Bunches Better Describe Leaf Shapes?

Bin Wang, Yongsheng Gao, Changming Sun, Michael Blumenstein, John La Salle

Video Analytics

Self-Learning Scene-Specific Pedestrian Detectors Using a Progressive Latent Model

Qixiang Ye, Tianliang Zhang, Wei Ke, Qiang Qiu, Jie Chen, Guillermo Sapiro, Baochang Zhang

Predictive-Corrective Networks for Action Detection (project,abstract,PDF)

Achal Dave,Olga Russakovsky,Deva Ramanan

Budget-Aware Deep Semantic Video Segmentation

Behrooz Mahasseni, Sinisa Todorovic, Alan Fern

Unified Embedding and Metric Learning for Zero-Exemplar Event Detection

Noureldien Hussein, Efstratios Gavves, Arnold W.M. Smeulders

Spatiotemporal Pyramid Network for Video Action Recognition

Yunbo Wang, Mingsheng Long, Jianmin Wang, Philip S. Yu

ER3: A Unified Framework for Event Retrieval, Recognition and Recounting

Zhanning Gao, Gang Hua, Dongqing Zhang, Nebojsa Jojic, Le Wang, Jianru Xue, Nanning Zheng

FusionSeg: Learning to Combine Motion and Appearance for Fully Automatic Segmentation of Generic Objects in Videos

Suyog Dutt Jain, Bo Xiong, Kristen Grauman

Query-Focused Video Summarization: Dataset, Evaluation, and a Memory Network Based Approach

Aidean Sharghi, Jacob S. Laurel, Boqing Gong

Flexible Spatio-Temporal Networks for Video Prediction

Chaochao Lu, Michael Hirsch, Bernhard Schölkopf

Temporal Action Co-Segmentation in 3D Motion Capture Data and Videos

Konstantinos Papoutsakis, Costas Panagiotakis, Antonis A. Argyros

Machine Learning 2

Spotlight 2-1A

Dual Attention Networks for Multimodal Reasoning and Matching

Hyeonseob Nam, Jung-Woo Ha, Jeonghee Kim

DESIRE: Distant Future Prediction in Dynamic Scenes With Interacting Agents

Namhoon Lee, Wongun Choi, Paul Vernaza, Christopher B. Choy, Philip H. S. Torr, Manmohan Chandraker

Interpretable Structure-Evolving LSTM

Xiaodan Liang, Liang Lin, Xiaohui Shen, Jiashi Feng, Shuicheng Yan, Eric P. Xing

ShapeOdds: Variational Bayesian Learning of Generative Shape Models

Shireen Elhabian, Ross Whitaker

Fast Video Classification via Adaptive Cascading of Deep Models

Haichen Shen, Seungyeop Han, Matthai Philipose, Arvind Krishnamurthy

Deep Metric Learning via Facility Location

Hyun Oh Song, Stefanie Jegelka, Vivek Rathod, Kevin Murphy

Semi-Supervised Deep Learning for Monocular Depth Map Prediction

Yevhen Kuznietsov, Jörg Stückler, Bastian Leibe

Weakly Supervised Semantic Segmentation Using Web-Crawled Videos

Seunghoon Hong, Donghun Yeo, Suha Kwak, Honglak Lee, Bohyung Han

Oral 2-1A

Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach

Giorgio Patrini, Alessandro Rozza, Aditya Krishna Menon, Richard Nock, Lizhen Qu

Learning From Simulated and Unsupervised Images Through Adversarial Training

Ashish Shrivastava, Tomas Pfister, Oncel Tuzel, Joshua Susskind, Wenda Wang, Russell Webb

Inverse Compositional Spatial Transformer Networks

Chen-Hsuan Lin, Simon Lucey

Densely Connected Convolutional Networks

Gao Huang, Zhuang Liu, Laurens van der Maaten, Kilian Q. Weinberger

Computational Photography

Spotlight 2-1B

Visual Dialog

Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, José M. F. Moura, Devi Parikh, Dhruv Batra

Video Frame Interpolation via Adaptive Convolution

Simon Niklaus, Long Mai, Feng Liu

FastMask: Segment Multi-Scale Object Candidates in One Shot

Hexiang Hu, Shiyi Lan, Yuning Jiang, Zhimin Cao, Fei Sha

Reconstructing Transient Images From Single-Photon Sensors

Matthew O'Toole, Felix Heide, David B. Lindell, Kai Zang, Steven Diamond, Gordon Wetzstein

DeshadowNet: A Multi-Context Embedding Deep Network for Shadow Removal

Liangqiong Qu, Jiandong Tian, Shengfeng He, Yandong Tang, Rynson W. H. Lau

Illuminant-Camera Communication to Observe Moving Objects Under Strong External Light by Spread Spectrum Modulation

Ryusuke Sagawa, Yutaka Satoh

Photorealistic Facial Texture Inference Using Deep Neural Networks

Shunsuke Saito, Lingyu Wei, Liwen Hu, Koki Nagano, Hao Li

The Geometry of First-Returning Photons for Non-Line-Of-Sight Imaging

Chia-Yin Tsai, Kiriakos N. Kutulakos, Srinivasa G. Narasimhan, Aswin C. Sankaranarayanan

Oral 2-1B

Unrolling the Shutter: CNN to Correct Motion Distortions

Vijay Rengarajan, Yogesh Balaji, A. N. Rajagopalan

Light Field Blind Motion Deblurring

Pratul P. Srinivasan, Ren Ng, Ravi Ramamoorthi

Computational Imaging on the Electric Grid

Mark Sheinin, Yoav Y. Schechner, Kiriakos N. Kutulakos

Deep Outdoor Illumination Estimation

Yannick Hold-Geoffroy, Kalyan Sunkavalli, Sunil Hadap, Emiliano Gambaretto, Jean-François Lalonde

3D Vision 2

Spotlight 2-1C

Efficient Solvers for Minimal Problems by Syzygy-Based Reduction

Viktor Larsson, Kalle Åström, Magnus Oskarsson

HSfM: Hybrid Structure-from-Motion

Hainan Cui, Xiang Gao, Shuhan Shen, Zhanyi Hu

Efficient Global Point Cloud Alignment Using Bayesian Nonparametric Mixtures

Julian Straub, Trevor Campbell, Jonathan P. How, John W. Fisher III

A New Rank Constraint on Multi-View Fundamental Matrices, and Its Application to Camera Location Recovery

Soumyadip Sengupta, Tal Amir, Meirav Galun, Tom Goldstein, David W. Jacobs, Amit Singer, Ronen Basri

IM2CAD

Hamid Izadinia, Qi Shan, Steven M. Seitz

ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes

Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, Thomas Funkhouser, Matthias Nießner

Noise Robust Depth From Focus Using a Ring Difference Filter

Jaeheung Surh, Hae-Gon Jeon, Yunwon Park, Sunghoon Im, Hyowon Ha, In So Kweon

Group-Wise Point-Set Registration Based on Rényi's Second Order Entropy

Luis G. Sanchez Giraldo, Erion Hasanbelliu, Murali Rao, Jose C. Principe

Oral 2-1C

A Point Set Generation Network for 3D Object Reconstruction From a Single Image

Haoqiang Fan, Hao Su, Leonidas J. Guibas

3D Point Cloud Registration for Localization Using a Deep Neural Network Auto-Encoder

Gil Elbaz, Tamar Avraham, Anath Fischer

Flight Dynamics-Based Recovery of a UAV Trajectory Using Ground Cameras

Artem Rozantsev, Sudipta N. Sinha, Debadeepta Dey, Pascal Fua

DSAC - Differentiable RANSAC for Camera Localization (PDF,code,project)

Eric Brachmann, Alexander Krull, Sebastian Nowozin, Jamie Shotton, Frank Michel, Stefan Gumhold, Carsten Rother

Poster 2-1

3D Computer Vision

Scalable Surface Reconstruction From Point Clouds With Extreme Scale and Density Diversity

Christian Mostegel, Rudolf Prettenthaler, Friedrich Fraundorfer, Horst Bischof

Synthesizing 3D Shapes via Modeling Multi-View Depth Maps and Silhouettes With Deep Generative Networks

Amir Arsalan Soltani, Haibin Huang, Jiajun Wu, Tejas D. Kulkarni, Joshua B. Tenenbaum

General Models for Rational Cameras and the Case of Two-Slit Projections

Matthew Trager, Bernd Sturmfels, John Canny, Martial Hebert, Jean Ponce

Accurate Depth and Normal Maps From Occlusion-Aware Focal Stack Symmetry

Michael Strecke, Anna Alperovich, Bastian Goldluecke

A Multi-View Stereo Benchmark With High-Resolution Images and Multi-Camera Videos

Thomas Schöps, Johannes L. Schönberger, Silvano Galliani, Torsten Sattler, Konrad Schindler, Marc Pollefeys, Andreas Geiger

Non-Contact Full Field Vibration Measurement Based on Phase-Shifting

Hiroyuki Kayaba, Yuji Kokumai

A Minimal Solution for Two-View Focal-Length Estimation Using Two Affine Correspondences (PDF,code)

Daniel Barath, Tekla Toth, Levente Hajder

PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning

Alexander Krull, Eric Brachmann, Sebastian Nowozin, Frank Michel, Jamie Shotton, Carsten Rother

An Efficient Background Term for 3D Reconstruction and Tracking With Smooth Surface Models

Mariano Jaimez, Thomas J. Cashman, Andrew Fitzgibbon, Javier Gonzalez-Jimenez, Daniel Cremers

Analyzing Humans in Images

Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild

Shan Li, Weihong Deng, JunPing Du

Procedural Generation of Videos to Train Deep Action Recognition Networks

César Roberto de Souza, Adrien Gaidon, Yohann Cabon, Antonio Manuel López

BigHand2.2M Benchmark: Hand Pose Dataset and State of the Art Analysis

Shanxin Yuan, Qi Ye, Björn Stenger, Siddhant Jain, Tae-Kyun Kim

DenseReg: Fully Convolutional Dense Shape Regression In-The-Wild

Rıza Alp Güler, George Trigeorgis, Epameinondas Antonakos, Patrick Snape, Stefanos Zafeiriou, Iasonas Kokkinos

Adaptive Class Preserving Representation for Image Classification

Jian-Xun Mi, Qiankun Fu, Weisheng Li

Applications

Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval

Devraj Mandal, Kunal N. Chaudhury, Soma Biswas

EAST: An Efficient and Accurate Scene Text Detector

Xinyu Zhou, Cong Yao, He Wen, Yuzhi Wang, Shuchang Zhou, Weiran He, Jiajun Liang

VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization

Ronald Clark, Sen Wang, Andrew Markham, Niki Trigoni, Hongkai Wen

Biomedical Image/Video Analysis

Improving RANSAC-Based Segmentation Through CNN Encapsulation

Dustin Morley, Hassan Foroosh

Computational Photography

Position Tracking for Virtual Reality Using Commodity WiFi

Manikanta Kotaru, Sachin Katti

Designing Illuminant Spectral Power Distributions for Surface Classification

Henryk Blasinski, Joyce Farrell, Brian Wandell

One-Shot Hyperspectral Imaging Using Faced Reflectors

Tsuyoshi Takatani, Takahito Aoto, Yasuhiro Mukaigawa

Image Motion & Tracking

Direct Photometric Alignment by Mesh Deformation

Kaimo Lin, Nianjuan Jiang, Shuaicheng Liu, Loong-Fah Cheong, Minh Do, Jiangbo Lu

CNN-Based Patch Matching for Optical Flow With Thresholded Hinge Embedding Loss

Christian Bailer, Kiran Varanasi, Didier Stricker

Optical Flow Estimation Using a Spatial Pyramid Network

Anurag Ranjan, Michael J. Black

Deep Network Flow for Multi-Object Tracking

Manmohan Chandraker, Paul Vernaza, Wongun Choi, Samuel Schulter

Low- & Mid-Level Vision

Material Classification Using Frequency- and Depth-Dependent Time-Of-Flight Distortion

Kenichiro Tanaka, Yasuhiro Mukaigawa, Takuya Funatomi, Hiroyuki Kubo, Yasuyuki Matsushita, Yasushi Yagi

Benchmarking Denoising Algorithms With Real Photographs

Tobias Plötz, Stefan Roth

A Unified Approach of Multi-Scale Deep and Hand-Crafted Features for Defocus Estimation (PDF,project)

Jinsun Park, Yu-Wing Tai,Donghyeon Cho, In So Kweon

StyleBank: An Explicit Representation for Neural Image Style Transfer

Dongdong Chen, Lu Yuan, Jing Liao, Nenghai Yu, Gang Hua

Specular Highlight Removal in Facial Images

Chen Li, Stephen Lin, Kun Zhou, Katsushi Ikeuchi

Image Super-Resolution via Deep Recursive Residual Network

Ying Tai, Jian Yang, Xiaoming Liu

Deep Image Harmonization

Yi-Hsuan Tsai, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Xin Lu, Ming-Hsuan Yang

Learning Deep CNN Denoiser Prior for Image Restoration (PDF,code)

Kai Zhang, Wangmeng Zuo, Shuhang Gu, Lei Zhang

A Novel Tensor-Based Video Rain Streaks Removal Approach via Utilizing Discriminatively Intrinsic Priors

Tai-Xiang Jiang, Ting-Zhu Huang, Xi-Le Zhao, Liang-Jian Deng, Yao Wang

  • 特征点匹配

GMS: Grid-based Motion Statistics for Fast, Ultra-Robust Feature Correspondence

JiaWang Bian, Wen-Yan Lin, Yasuyuki Matsushita, Sai-Kit Yeung, Tan-Dat Nguyen, Ming-Ming Cheng

簡要:一种视频的快速搜索技术,比SIFT还厉害。基于网格的运动统计,用于快速、超鲁棒的特征匹配

Video Desnowing and Deraining Based on Matrix Decomposition

Weihong Ren, Jiandong Tian, Zhi Han, Antoni Chan, Yandong Tang

Real-Time Video Super-Resolution With Spatio-Temporal Networks and Motion Compensation

Jose Caballero, Christian Ledig, Andrew Aitken, Alejandro Acosta, Johannes Totz, Zehan Wang, Wenzhe Shi

Deep Watershed Transform for Instance Segmentation

Min Bai, Raquel Urtasun

AnchorNet: A Weakly Supervised Network to Learn Geometry-Sensitive Features for Semantic Matching

David Novotny, Diane Larlus, Andrea Vedaldi

Learning Diverse Image Colorization

Aditya Deshpande, Jiajun Lu, Mao-Chuang Yeh, Min Jin Chong, David Forsyth

Awesome Typography: Statistics-Based Text Effects Transfer

Shuai Yang, Jiaying Liu, Zhouhui Lian, Zongming Guo

Machine Learning

Unsupervised Video Summarization With Adversarial LSTM Networks

Behrooz Mahasseni, Michael Lam, Sinisa Todorovic

Deep TEN: Texture Encoding Network

Hang Zhang, Jia Xue, Kristin Dana

Order-Preserving Wasserstein Distance for Sequence Matching

Bing Su, Gang Hua

A Dual Ascent Framework for Lagrangean Decomposition of Combinatorial Problems

Paul Swoboda, Jan Kuske, Bogdan Savchynskyy

Attend in Groups: A Weakly-Supervised Deep Learning Framework for Learning From Web Data

Bohan Zhuang, Lingqiao Liu, Yao Li, Chunhua Shen, Ian Reid

Hierarchical Multimodal Metric Learning for Multimodal Classification

Heng Zhang, Vishal M. Patel, Rama Chellappa

Efficient Linear Programming for Dense CRFs

Thalaiyasingam Ajanthan, Alban Desmaison, Rudy Bunel, Mathieu Salzmann, Philip H. S. Torr, M. Pawan Kumar

Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold

YoungJoon Yoo, Sangdoo Yun, Hyung Jin Chang, Yiannis Demiris, Jin Young Choi

Learning Random-Walk Label Propagation for Weakly-Supervised Semantic Segmentation

Paul Vernaza, Manmohan Chandraker

Low-Rank-Sparse Subspace Representation for Robust Regression

Yongqiang Zhang, Daming Shi, Junbin Gao, Dansong Cheng

Object Recognition & Scene Understanding

Generating the Future With Adversarial Transformers

Carl Vondrick, Antonio Torralba

Semantic Amodal Segmentation

Yan Zhu, Yuandong Tian, Dimitris Metaxas, Piotr Dollár

Learning a Deep Embedding Model for Zero-Shot Learning

Li Zhang, Tao Xiang, Shaogang Gong

BIND: Binary Integrated Net Descriptors for Texture-Less Object Recognition

Jacob Chan, Jimmy Addison Lee, Qian Kemao

Growing a Brain: Fine-Tuning by Increasing Model Capacity

Yu-Xiong Wang, Deva Ramanan, Martial Hebert

A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection (PDF)

Xiaolong Wang, Abhinav Shrivastava, Abhinav Gupta

Multiple Instance Detection Network With Online Instance Classifier Refinement

Peng Tang, Xinggang Wang, Xiang Bai, Wenyu Liu

Kernel Pooling for Convolutional Neural Networks

Yin Cui, Feng Zhou, Jiang Wang, Xiao Liu, Yuanqing Lin, Serge Belongie

Learning Cross-Modal Embeddings for Cooking Recipes and Food Images

Amaia Salvador, Nicholas Hynes, Yusuf Aytar, Javier Marin, Ferda Ofli, Ingmar Weber, Antonio Torralba

Zero-Shot Learning - the Good, the Bad and the Ugly

Yongqin Xian, Bernt Schiele, Zeynep Akata

DeepNav: Learning to Navigate Large Cities

Samarth Brahmbhatt, James Hays

Scene Graph Generation by Iterative Message Passing

Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei

Visual Translation Embedding Network for Visual Relation Detection

Hanwang Zhang, Zawlin Kyaw, Shih-Fu Chang, Tat-Seng Chua

Unsupervised Part Learning for Visual Recognition

Ronan Sicre, Yannis Avrithis, Ewa Kijak, Frédéric Jurie

Comprehension-Guided Referring Expressions

Ruotian Luo, Gregory Shakhnarovich

Top-Down Visual Saliency Guided by Captions

Vasili Ramanishka, Abir Das, Jianming Zhang, Kate Saenko

Theory

Grassmannian Manifold Optimization Assisted Sparse Spectral Clustering

Junbin Gao, Qiong Wang, Hong Li

Video Analytics

Video Propagation Networks

Varun Jampani, Raghudeep Gadde, Peter V. Gehler

ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification

Rohit Girdhar, Deva Ramanan, Abhinav Gupta, Josef Sivic, Bryan Russell

SCC: Semantic Context Cascade for Efficient Action Detection

Fabian Caba Heilbron, Wayner Barrios, Victor Escorcia, Bernard Ghanem

Hierarchical Boundary-Aware Neural Encoder for Video Captioning

Lorenzo Baraldi, Costantino Grana, Rita Cucchiara

HOPE: Hierarchical Object Prototype Encoding for Efficient Object Instance Search in Videos

Tan Yu, Yuwei Wu, Junsong Yuan

Spatio-Temporal Vector of Locally Max Pooled Features for Action Recognition in Videos

Ionut Cosmin Duta, Bogdan Ionescu, Kiyoharu Aizawa, Nicu Sebe

Temporal Action Localization by Structured Maximal Sums

Zehuan Yuan, Jonathan C. Stroud, Tong Lu, Jia Deng

Predicting Salient Face in Multiple-Face Videos

Yufan Liu, Songyang Zhang, Mai Xu, Xuming He

Object Recognition & Scene Understanding 1

Spotlight 2-2A

Graph-Structured Representations for Visual Question Answering

Damien Teney, Lingqiao Liu, Anton van den Hengel

Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning

Jiasen Lu, Caiming Xiong, Devi Parikh, Richard Socher

Learned Contextual Feature Reweighting for Image Geo-Localization

Hyo Jin Kim, Enrique Dunn, Jan-Michael Frahm

End-To-End Concept Word Detection for Video Captioning, Retrieval, and Question Answering

Youngjae Yu, Hyungjin Ko, Jongwook Choi, Gunhee Kim

Deep Cross-Modal Hashing

Qing-Yuan Jiang, Wu-Jun Li

Unambiguous Text Localization and Retrieval for Cluttered Scenes

Xuejian Rong, Chucai Yi, Yingli Tian

Bayesian Supervised Hashing

Zihao Hu, Junxuan Chen, Hongtao Lu, Tongzhen Zhang

Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors

Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, Kevin Murphy

Oral 2-2A

Detecting Visual Relationships With Deep Relational Networks

Bo Dai, Yuqi Zhang, Dahua Lin

Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes (PDF,videos,code)

Tobias Pohlen, Alexander Hermans, Markus Mathias, Bastian Leibe

Network Dissection: Quantifying Interpretability of Deep Visual Representations

David Bau, Bolei Zhou, Aditya Khosla, Aude Oliva, Antonio Torralba

AGA: Attribute-Guided Augmentation

Mandar Dixit, Roland Kwitt, Marc Niethammer, Nuno Vasconcelos

Analyzing Humans 2

Spotlight 2-2B

A Hierarchical Approach for Generating Descriptive Image Paragraphs

Jonathan Krause, Justin Johnson, Ranjay Krishna, Li Fei-Fei

Person Re-Identification in the Wild

Liang Zheng, Hengheng Zhang, Shaoyan Sun, Manmohan Chandraker, Yi Yang, Qi Tian

Scalable Person Re-Identification on Supervised Smoothed Manifold

Song Bai, Xiang Bai, Qi Tian

Binge Watching: Scaling Affordance Learning From Sitcoms (PDF)

Xiaolong Wang, Rohit Girdhar, Abhinav Gupta

Joint Detection and Identification Feature Learning for Person Search

Tong Xiao, Shuang Li, Bochao Wang, Liang Lin, Xiaogang Wang

Synthesizing Normalized Faces From Facial Identity Features

Forrester Cole, David Belanger, Dilip Krishnan, Aaron Sarna, Inbar Mosseri, William T. Freeman

Consistent-Aware Deep Learning for Person Re-Identification in a Camera Network

Ji Lin, Liangliang Ren, Jiwen Lu, Jianjiang Feng, Jie Zhou

Level Playing Field for Million Scale Face Recognition

Aaron Nech, Ira Kemelmacher-Shlizerman

Oral 2-2B

Re-Sign: Re-Aligned End-To-End Sequence Modelling With Deep Recurrent CNN-HMMs

Oscar Koller, Sepehr Zargaran, Hermann Ney

Social Scene Understanding: End-To-End Multi-Person Action Localization and Collective Activity Recognition

Timur Bagautdinov, Alexandre Alahi, François Fleuret, Pascal Fua, Silvio Savarese

Detangling People: Individuating Multiple Close People and Their Body Parts via Region Assembly

Hao Jiang, Kristen Grauman

Lip Reading Sentences in the Wild

Joon Son Chung, Andrew Senior, Oriol Vinyals, Andrew Zisserman

Applications

Spotlight 2-2C

Deep Matching Prior Network: Toward Tighter Multi-Oriented Text Detection

Lianwen Jin, Yuliang Liu

ChestX-ray8: Hospital-Scale Chest X-Ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases

Xiaosong Wang, Yifan Peng, Le Lu, Zhiyong Lu, Mohammadhadi Bagheri, Ronald M. Summers

Attentional Push: A Deep Convolutional Network for Augmenting Image Salience With Shared Attention Modeling in Social Scenes

Siavash Gorji, James J. Clark

Detecting Oriented Text in Natural Images by Linking Segments

Baoguang Shi, Xiang Bai, Serge Belongie

Learning Video Object Segmentation From Static Images

Federico Perazzi, Anna Khoreva, Rodrigo Benenson, Bernt Schiele, Alexander Sorkine-Hornung

Seeing Invisible Poses: Estimating 3D Body Pose From Egocentric Video

Hao Jiang, Kristen Grauman

Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space

Anh Nguyen, Jeff Clune, Yoshua Bengio, Alexey Dosovitskiy, Jason Yosinski

A Joint Speaker-Listener-Reinforcer Model for Referring Expressions

Licheng Yu, Hao Tan, Mohit Bansal, Tamara L. Berg

Oral 2-2C

End-To-End Learning of Driving Models From Large-Scale Video Datasets

Huazhe Xu, Yang Gao, Fisher Yu, Trevor Darrell

Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks

Mengmi Zhang, Keng Teck Ma, Joo Hwee Lim, Qi Zhao, Jiashi Feng

MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network

Zizhao Zhang, Yuanpu Xie, Fuyong Xing, Mason McGough, Lin Yang

Poster 2-2

3D Computer Vision

Surface Motion Capture Transfer With Gaussian Process Regression

Adnane Boukhayma, Jean-Sébastien Franco, Edmond Boyer

Visual-Inertial-Semantic Scene Representation for 3D Object Detection

Jingming Dong, Xiaohan Fei, Stefano Soatto

Template-Based Monocular 3D Recovery of Elastic Shapes Using Lagrangian Multipliers

Nazim Haouchine, Stephane Cotin

Learning Category-Specific 3D Shape Models From Weakly Labeled 2D Images

Dingwen Zhang, Junwei Han, Yang Yang, Dong Huang

Simultaneous Geometric and Radiometric Calibration of a Projector-Camera Pair

Marjan Shahpaski, Luis Ricardo Sapaico, Gaspard Chevassus, Sabine Süsstrunk

Learning Barycentric Representations of 3D Shapes for Sketch-Based 3D Shape Retrieval

Jin Xie, Guoxian Dai, Fan Zhu, Yi Fang

Geodesic Distance Descriptors

Gil Shamai, Ron Kimmel

Analyzing Humans in Images

Modeling Temporal Dynamics and Spatial Configurations of Actions Using Two-Stream Recurrent Neural Networks

Hongsong Wang, Liang Wang

Forecasting Human Dynamics From Static Images

Yu-Wei Chao, Jimei Yang, Brian Price, Scott Cohen, Jia Deng

Re-Ranking Person Re-Identification With k-Reciprocal Encoding (code)

Zhun Zhong,Liang Zheng, Donglin Cao, Shaozi Li

Deep Sequential Context Networks for Action Prediction

Yu Kong, Zhiqiang Tao, Yun Fu

Global Context-Aware Attention LSTM Networks for 3D Action Recognition

Jun Liu, Gang Wang, Ping Hu, Ling-Yu Duan, Alex C. Kot

Dynamic Attention-Controlled Cascaded Shape Regression Exploiting Training Data Augmentation and Fuzzy-Set Sample Weighting

Zhen-Hua Feng, Josef Kittler, William Christmas, Patrik Huber, Xiao-Jun Wu

A Deep Regression Architecture With Two-Stage Re-Initialization for High Performance Facial Landmark Detection

Jiangjing Lv, Xiaohu Shao, Junliang Xing, Cheng Cheng, Xi Zhou

Multiple People Tracking by Lifted Multicut and Person Re-Identification

Siyu Tang, Mykhaylo Andriluka, Bjoern Andres, Bernt Schiele

Towards Accurate Multi-Person Pose Estimation in the Wild

George Papandreou, Tyler Zhu, Nori Kanazawa, Alexander Toshev, Jonathan Tompson, Chris Bregler, Kevin Murphy

Applications

Towards a Quality Metric for Dense Light Fields

Vamsi Kiran Adhikarla, Marek Vinkler, Denis Sumin, Rafał K. Mantiuk, Karol Myszkowski, Hans-Peter Seidel, Piotr Didyk

Controlling Perceptual Factors in Neural Style Transfer

Leon A. Gatys, Alexander S. Ecker, Matthias Bethge, Aaron Hertzmann, Eli Shechtman

Biomedical Image/Video Analysis

Joint Sequence Learning and Cross-Modality Convolution for 3D Biomedical Segmentation

Kuan-Lun Tseng, Yen-Liang Lin, Winston Hsu, Chung-Yang Huang

LSTM Self-Supervision for Detailed Behavior Analysis

Biagio Brattoli, Uta Büchler, Anna-Sophia Wahl, Martin E. Schwab, Björn Ommer

Computational Photography

A Wide-Field-Of-View Monocentric Light Field Camera (PDF,project,project)

Donald G. Dansereau, Glenn Schuster, Joseph Ford,Gordon Wetzstein

Image Motion & Tracking

S2F: Slow-To-Fast Interpolator Flow

Yanchao Yang, Stefano Soatto

CLKN: Cascaded Lucas-Kanade Networks for Image Alignment

Che-Han Chang, Chun-Nan Chou, Edward Y. Chang

Multi-Object Tracking With Quadruplet Convolutional Neural Networks

Mooyeol Baek, Jeany Son, Minsu Cho, Bohyung Han

Low- & Mid-Level Vision

Learning to Detect Salient Objects With Image-Level Supervision

Lijun Wang, Huchuan Lu, Yifan Wang, Mengyang Feng, Dong Wang, Baocai Yin, Xiang Ruan

From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur

Dong Gong, Jie Yang, Lingqiao Liu, Yanning Zhang, Ian Reid, Chunhua Shen, Anton van den Hengel, Qinfeng Shi

Co-Occurrence Filter

Roy J. Jevnisek, Shai Avidan

Fractal Dimension Invariant Filtering and Its CNN-Based Implementation

Hongteng Xu, Junchi Yan, Nils Persson, Weiyao Lin, Hongyuan Zha

Noise-Blind Image Deblurring

Meiguang Jin, Stefan Roth, Paolo Favaro

Simultaneous Visual Data Completion and Denoising Based on Tensor Rank and Total Variation Minimization and Its Primal-Dual Splitting Algorithm

Tatsuya Yokota, Hidekata Hontani

HPatches: A Benchmark and Evaluation of Handcrafted and Learned Local Descriptors

Vassileios Balntas, Karel Lenc, Andrea Vedaldi, Krystian Mikolajczyk

Hyperspectral Image Super-Resolution via Non-Local Sparse Tensor Factorization

Renwei Dian, Leyuan Fang, Shutao Li

Reflection Removal Using Low-Rank Matrix Completion

Byeong-Ju Han, Jae-Young Sim

Object Co-Skeletonization With Co-Segmentation

Koteswar Rao Jerripothula, Jianfei Cai, Jiangbo Lu, Junsong Yuan

Machine Learning

Mining Object Parts From CNNs via Active Question-Answering

Quanshi Zhang, Ruiming Cao, Ying Nian Wu, Song-Chun Zhu

PolyNet: A Pursuit of Structural Diversity in Very Deep Networks

Xingcheng Zhang, Zhizhong Li, Chen Change Loy, Dahua Lin

The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions

Peng Wang, Qi Wu, Chunhua Shen, Anton van den Hengel

Joint Discriminative Bayesian Dictionary and Classifier Learning

Naveed Akhtar, Ajmal Mian, Fatih Porikli

A Study of Lagrangean Decompositions and Dual Ascent Solvers for Graph Matching

Paul Swoboda, Carsten Rother, Hassan Abu Alhaija, Dagmar Kainmüller, Bogdan Savchynskyy

Quad-Networks: Unsupervised Learning to Rank for Interest Point Detection

Nikolay Savinov, Akihito Seki, Ľubor Ladický, Torsten Sattler, Marc Pollefeys

Outlier-Robust Tensor PCA

Pan Zhou, Jiashi Feng

Learning Adaptive Receptive Fields for Deep Image Parsing Network

Zhen Wei, Yao Sun, Jinqiao Wang, Hanjiang Lai, Si Liu

Learning an Invariant Hilbert Space for Domain Adaptation

Samitha Herath, Mehrtash Harandi, Fatih Porikli

Fixed-Point Factorized Networks

Peisong Wang, Jian Cheng

Discriminative Optimization: Theory and Applications to Point Cloud Registration

Jayakorn Vongkulbhisal, Fernando De la Torre, João P. Costeira

Online Asymmetric Similarity Learning for Cross-Modal Retrieval

Yiling Wu, Shuhui Wang, Qingming Huang

Improving Training of Deep Neural Networks via Singular Value Bounding

Kui Jia, Dacheng Tao, Shenghua Gao, Xiangmin Xu

S3Pool: Pooling With Stochastic Spatial Sampling

Shuangfei Zhai, Hui Wu, Abhishek Kumar, Yu Cheng, Yongxi Lu, Zhongfei Zhang, Rogerio Feris

Sports Field Localization via Deep Structured Models

Namdar Homayounfar, Sanja Fidler, Raquel Urtasun

Noisy Softmax: Improving the Generalization Ability of DCNN via Postponing the Early Softmax Saturation

Binghui Chen, Weihong Deng, Junping Du

Switching Convolutional Neural Network for Crowd Counting (project,PDF)

Deepak Babu Sam*, Shiv Surya*,R. Venkatesh Babu ((* Equal Contributors) Video Analytics Lab, Indian Institute of Science)

Network Sketching: Exploiting Binary Structure in Deep CNNs (PDF)

Yiwen Guo, Anbang Yao, Hao Zhao, Yurong Chen

Multi-Task Clustering of Human Actions by Sharing Information

Shizhe Hu, Xiaoqiang Yan, Yangdong Ye

Soft-Margin Mixture of Regressions

Dong Huang, Longfei Han, Fernando De la Torre

Multigrid Neural Architectures

Tsung-Wei Ke, Michael Maire, Stella X. Yu

High-Resolution Image Inpainting Using Multi-Scale Neural Patch Synthesis

Chao Yang, Xin Lu, Zhe Lin, Eli Shechtman, Oliver Wang, Hao Li

Deep Quantization: Encoding Convolutional Activations With Deep Generative Model

Zhaofan Qiu, Ting Yao, Tao Mei

DOPE: Distributed Optimization for Pairwise Energies

Jose Dolz, Ismail Ben Ayed, Christian Desrosiers

Improved Texture Networks: Maximizing Quality and Diversity in Feed-Forward Stylization and Texture Synthesis

Dmitry Ulyanov, Andrea Vedaldi, Victor Lempitsky

Object Recognition & Scene Understanding

Polyhedral Conic Classifiers for Visual Object Detection and Classification

Hakan Cevikalp, Bill Triggs

Incremental Kernel Null Space Discriminant Analysis for Novelty Detection

Juncheng Liu, Zhouhui Lian, Yi Wang, Jianguo Xiao

Predicting Ground-Level Scene Layout From Aerial Imagery

Menghua Zhai, Zachary Bessinger, Scott Workman, Nathan Jacobs

Deep Feature Flow for Video Recognition

Xizhou Zhu, Yuwen Xiong, Jifeng Dai, Lu Yuan, Yichen Wei

Object-Aware Dense Semantic Correspondence

Fan Yang, Xin Li, Hong Cheng, Jianping Li, Leiting Chen

Semantic Regularisation for Recurrent Image Annotation

Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, Changyin Sun

Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images

Zhi-Qi Cheng, Xiao Wu, Yang Liu, Xian-Sheng Hua

Fast-At: Fast Automatic Thumbnail Generation Using Deep Neural Networks

Seyed A. Esmaeili, Bharat Singh, Larry S. Davis

Multi-Level Attention Networks for Visual Question Answering

Dongfei Yu, Jianlong Fu, Tao Mei, Yong Rui

Generating Descriptions With Grounded and Co-Referenced People

Anna Rohrbach, Marcus Rohrbach, Siyu Tang, Seong Joon Oh, Bernt Schiele

Straight to Shapes: Real-Time Detection of Encoded Shapes

Saumya Jetley, Michael Sapienza, Stuart Golodetz, Philip H. S. Torr

Simultaneous Feature Aggregating and Hashing for Large-Scale Image Search

Thanh-Toan Do, Dang-Khoa Le Tan, Trung T. Pham, Ngai-Man Cheung

Improving Facial Attribute Prediction Using Semantic Segmentation

Mahdi M. Kalayeh, Boqing Gong, Mubarak Shah

Video Analytics

Learning Cross-Modal Deep Representations for Robust Pedestrian Detection

Dan Xu, Wanli Ouyang, Elisa Ricci, Xiaogang Wang, Nicu Sebe

Spatio-Temporal Self-Organizing Map Deep Network for Dynamic Object Detection From Videos

Yang Du, Chunfeng Yuan, Bing Li, Weiming Hu, Stephen Maybank

CERN: Confidence-Energy Recurrent Network for Group Activity Recognition

Tianmin Shu, Sinisa Todorovic, Song-Chun Zhu

Understanding Traffic Density From Large-Scale Web Camera Data

Shanghang Zhang, Guanhang Wu, João P. Costeira, José M. F. Moura

Collaborative Summarization of Topic-Related Videos

Rameswar Panda, Amit K. Roy-Chowdhury

Machine Learning 3

Spotlight 3-1A

Local Binary Convolutional Neural Networks

Felix Juefei-Xu, Vishnu Naresh Boddeti, Marios Savvides

Deep Self-Taught Learning for Weakly Supervised Object Localization

Zequn Jie, Yunchao Wei, Xiaojie Jin, Jiashi Feng, Wei Liu

Multi-Modal Mean-Fields via Cardinality-Based Clamping

Pierre Baqué, François Fleuret, Pascal Fua

Probabilistic Temporal Subspace Clustering

Vladimir Pavlovic, Behnam Gholami

Provable Self-Representation Based Outlier Detection in a Union of Subspaces

Chong You, Daniel P. Robinson, René Vidal

Latent Multi-View Subspace Clustering

Changqing Zhang, Qinghua Hu, Huazhu Fu, Pengfei Zhu, Xiaochun Cao

Learning to Extract Semantic Structure From Documents Using Multimodal Fully Convolutional Neural Networks

Xiao Yang, Ersin Yumer, Paul Asente, Mike Kraley, Daniel Kifer, C. Lee Giles

Age Progression/Regression by Conditional Adversarial Autoencoder

Zhifei Zhang, Yang Song, Hairong Qi

Oral 3-1A

Compact Matrix Factorization With Dependent Subspaces

Viktor Larsson, Carl Olsson

FFTLasso: Large-Scale LASSO in the Fourier Domain

Adel Bibi, Hani Itani, Bernard Ghanem

On the Global Geometry of Sphere-Constrained Sparse Blind Deconvolution

Yuqian Zhang, Yenson Lau, Han-wen Kuo, Sky Cheung, Abhay Pasupathy, John Wright

Global Optimality in Neural Network Training

Benjamin D. Haeffele, René Vidal

Object Recognition & Scene Understanding 2

Spotlight 3-1B

What Is and What Is Not a Salient Object? Learning Salient Object Detector by Ensembling Linear Exemplar Regressors

Changqun Xia, Jia Li, Xiaowu Chen, Anlin Zheng, Yu Zhang

Deep Variation-Structured Reinforcement Learning for Visual Relationship and Attribute Detection

Xiaodan Liang, Lisa Lee, Eric P. Xing

Modeling Relationships in Referential Expressions With Compositional Modular Networks

Ronghang Hu, Marcus Rohrbach, Jacob Andreas, Trevor Darrell, Kate Saenko

Counting Everyday Objects in Everyday Scenes

Prithvijit Chattopadhyay, Ramakrishna Vedantam, Ramprasaath R. Selvaraju, Dhruv Batra, Devi Parikh

Fully Convolutional Instance-Aware Semantic Segmentation

Yi Li, Haozhi Qi, Jifeng Dai, Xiangyang Ji, Yichen Wei

Semantic Autoencoder for Zero-Shot Learning

Elyor Kodirov, Tao Xiang, Shaogang Gong

CityPersons: A Diverse Dataset for Pedestrian Detection

Shanshan Zhang, Rodrigo Benenson, Bernt Schiele

GuessWhat?! Visual Object Discovery Through Multi-Modal Dialogue

Harm de Vries, Florian Strub, Sarath Chandar, Olivier Pietquin, Hugo Larochelle, Aaron Courville

Oral 3-1B

Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-Grained Image Recognition

Jianlong Fu, Heliang Zheng, Tao Mei

Annotating Object Instances With a Polygon-RNN

Lluís Castrejón, Kaustav Kundu, Raquel Urtasun, Sanja Fidler

Connecting Look and Feel: Associating the Visual and Tactile Properties of Physical Materials

Wenzhen Yuan, Shaoxiong Wang, Siyuan Dong, Edward Adelson

Deep Learning Human Mind for Automated Visual Classification

Concetto Spampinato, Simone Palazzo, Isaak Kavasidis, Daniela Giordano, Nasim Souly, Mubarak Shah

Poster 3-1

3D Computer Vision

Self-Calibration-Based Approach to Critical Motion Sequences of Rolling-Shutter Structure From Motion

Eisuke Ito, Takayuki Okatani

Semi-Calibrated Near Field Photometric Stereo

Fotios Logothetis, Roberto Mecca, Roberto Cipolla

Semantic Multi-View Stereo: Jointly Estimating Objects and Voxels

Ali Osman Ulusoy, Michael J. Black, Andreas Geiger

Learning to Predict Stereo Reliability Enforcing Local Consistency of Confidence Maps

Matteo Poggi, Stefano Mattoccia

The Misty Three Point Algorithm for Relative Pose

Tobias Palmér, Kalle Ã…ström, Jan-Michael Frahm

The Surfacing of Multiview 3D Drawings via Lofting and Occlusion Reasoning (PDF,dataset,poster)

Anil Usumezbas, Ricardo Fabbri, Benjamin B. Kimia

A New Representation of Skeleton Sequences for 3D Action Recognition

Qiuhong Ke, Mohammed Bennamoun, Senjian An, Ferdous Sohel, Farid Boussaid

A General Framework for Curve and Surface Comparison and Registration With Oriented Varifolds

Irène Kaltenmark, Benjamin Charlier, Nicolas Charon

Learning to Align Semantic Segmentation and 2.5D Maps for Geolocalization

Anil Armagan, Martin Hirzer, Peter M. Roth, Vincent Lepetit

A Generative Model for Depth-Based Robust 3D Facial Pose Tracking

Lu Sheng, Jianfei Cai, Tat-Jen Cham, Vladimir Pavlovic, King Ngi Ngan

Fast 3D Reconstruction of Faces With Glasses

Fabio Maninchedda, Martin R. Oswald, Marc Pollefeys

An Efficient Algebraic Solution to the Perspective-Three-Point Problem

Tong Ke, Stergios I. Roumeliotis

Analyzing Humans in Images

Learning From Synthetic Humans

Gül Varol, Javier Romero, Xavier Martin, Naureen Mahmood, Michael J. Black, Ivan Laptev, Cordelia Schmid

Forecasting Interactive Dynamics of Pedestrians With Fictitious Play

Wei-Chiu Ma, De-An Huang, Namhoon Lee, Kris M. Kitani

Hand Keypoint Detection in Single Images Using Multiview Bootstrapping

Tomas Simon, Hanbyul Joo, Iain Matthews, Yaser Sheikh

PoseTrack: Joint Multi-Person Pose Estimation and Tracking

Umar Iqbal, Anton Milan, Juergen Gall

Expecting the Unexpected: Training Detectors for Unusual Pedestrians With Adversarial Imposters

Shiyu Huang, Deva Ramanan

On Human Motion Prediction Using Recurrent Neural Networks

Julieta Martinez, Michael J. Black, Javier Romero

Learning and Refining of Privileged Information-Based RNNs for Action Recognition From Depth Sequences

Zhiyuan Shi, Tae-Kyun Kim

Quality Aware Network for Set to Set Recognition

Yu Liu, Junjie Yan, Wanli Ouyang

Unite the People: Closing the Loop Between 3D and 2D Human Representations

Christoph Lassner, Javier Romero, Martin Kiefel, Federica Bogo, Michael J. Black, Peter V. Gehler

Deep Multitask Architecture for Integrated 2D and 3D Human Sensing

Alin-Ionut Popa, Mihai Zanfir, Cristian Sminchisescu

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

João Carreira, Andrew Zisserman

Applications

Identifying First-Person Camera Wearers in Third-Person Videos

Chenyou Fan, Jangwon Lee, Mingze Xu, Krishna Kumar Singh, Yong Jae Lee, David J. Crandall, Michael S. Ryoo

Biomedical Image/Video Analysis

Parsing Images of Overlapping Organisms With Deep Singling-Out Networks

Victor Yurchenko, Victor Lempitsky

Fine-Tuning Convolutional Neural Networks for Biomedical Image Analysis: Actively and Incrementally

Zongwei Zhou, Jae Shin, Lei Zhang, Suryakanth Gurudu, Michael Gotway, Jianming Liang

Computational Photography

Depth From Defocus in the Wild

Huixuan Tang, Scott Cohen, Brian Price, Stephen Schiller, Kiriakos N. Kutulakos

Matting and Depth Recovery of Thin Structures Using a Focal Stack

Chao Liu, Srinivasa G. Narasimhan, Artur W. Dubrawski

Image Motion & Tracking

Robust Interpolation of Correspondences for Large Displacement Optical Flow

Yinlin Hu, Yunsong Li, Rui Song

Large Margin Object Tracking With Circulant Feature Maps

Mengmeng Wang, Yong Liu, Zeyi Huang

Minimum Delay Moving Object Detection

Dong Lao, Ganesh Sundaramoorthi

Multi-Task Correlation Particle Filter for Robust Object Tracking

Tianzhu Zhang, Changsheng Xu, Ming-Hsuan Yang

Attentional Correlation Filter Network for Adaptive Visual Tracking

Jongwon Choi, Hyung Jin Chang, Sangdoo Yun, Tobias Fischer, Yiannis Demiris, Jin Young Choi

The World of Fast Moving Objects

Denys Rozumnyi, Jan Kotera, Filip Šroubek, Lukáš Novotný, Jiří Matas

Discriminative Correlation Filter With Channel and Spatial Reliability

Alan LukežiÄ, Tomáš Vojíř, Luka ÄŒehovin Zajc, Jiří Matas, Matej Kristan

Low- & Mid-Level Vision

Learning Deep Binary Descriptor With Multi-Quantization

Yueqi Duan, Jiwen Lu, Ziwei Wang, Jianjiang Feng, Jie Zhou

One-To-Many Network for Visually Pleasing Compression Artifacts Reduction

Jun Guo, Hongyang Chao

Gated Feedback Refinement Network for Dense Image Labeling

Md Amirul Islam, Mrigank Rochan, Neil D. B. Bruce, Yang Wang

BRISKS: Binary Features for Spherical Images on a Geodesic Grid

Hao Guan, William A. P. Smith

Superpixels and Polygons Using Simple Non-Iterative Clustering

Radhakrishna Achanta, Sabine Süsstrunk

Hardware-Efficient Guided Image Filtering for Multi-Label Problem

Longquan Dai, Mengke Yuan, Zechao Li, Xiaopeng Zhang, Jinhui Tang

Alternating Direction Graph Matching (PDF)

D. Khuê Lê-Huu, Nikos Paragios

Learning Discriminative and Transformation Covariant Local Feature Detectors

Xu Zhang, Felix X. Yu, Svebor Karaman, Shih-Fu Chang

Machine Learning

Correlational Gaussian Processes for Cross-Domain Visual Recognition

Chengjiang Long, Gang Hua

DeLiGAN : Generative Adversarial Networks for Diverse and Limited Data (PDF,code)

Swaminathan Gurumurthy (CMU), Ravi Kiran Sarvadevabhatla (Video Analytics Lab, Indian Institute of Science),R. Venkatesh Babu

Oriented Response Networks

Yanzhao Zhou, Qixiang Ye, Qiang Qiu, Jianbin Jiao

Missing Modalities Imputation via Cascaded Residual Autoencoder

Luan Tran, Xiaoming Liu, Jiayu Zhou, Rong Jin

Efficient Optimization for Hierarchically-structured Interacting Segments (HINTS)

Hossam Isack, Olga Veksler, Ipek Oguz, Milan Sonka, Yuri Boykov

A Message Passing Algorithm for the Minimum Cost Multicut Problem

Paul Swoboda, Bjoern Andres

End-To-End Representation Learning for Correlation Filter Based Tracking

Jack Valmadre, Luca Bertinetto, João Henriques, Andrea Vedaldi, Philip H. S. Torr

Filter Flow Made Practical: Massively Parallel and Lock-Free

Sathya N. Ravi, Yunyang Xiong, Lopamudra Mukherjee, Vikas Singh

Online Graph Completion: Multivariate Signal Recovery in Computer Vision

Won Hwa Kim, Mona Jalal, Seongjae Hwang, Sterling C. Johnson, Vikas Singh

Point to Set Similarity Based Deep Feature Learning for Person Re-Identification

Sanping Zhou, Jinjun Wang, Jiayun Wang, Yihong Gong, Nanning Zheng

Exploiting Saliency for Object Segmentation From Image Level Labels

Seong Joon Oh, Rodrigo Benenson, Anna Khoreva, Zeynep Akata, Mario Fritz, Bernt Schiele

Consensus Maximization With Linear Matrix Inequality Constraints

Pablo Speciale, Danda Pani Paudel, Martin R. Oswald, Till Kroeger, Luc Van Gool, Marc Pollefeys

Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks

Yinda Zhang, Shuran Song, Ersin Yumer, Manolis Savva, Joon-Young Lee, Hailin Jin, Thomas Funkhouser

Deep Multimodal Representation Learning From Temporal Data

Xitong Yang, Palghat Ramesh, Radha Chitta, Sriganesh Madhvanath, Edgar A. Bernal, Jiebo Luo

All You Need Is Beyond a Good Init: Exploring Better Solution for Training Extremely Deep Convolutional Neural Networks With Orthonormality and Modulation

Di Xie, Jiang Xiong, Shiliang Pu

Hard Mixtures of Experts for Large Scale Weakly Supervised Vision

Sam Gross, Marc'Aurelio Ranzato, Arthur Szlam

A Reinforcement Learning Approach to the View Planning Problem

Mustafa Devrim Kaba, Mustafa Gokhan Uzunbas, Ser Nam Lim

Zero-Shot Classification With Discriminative Semantic Representation Learning

Meng Ye, Yuhong Guo

Adversarial Discriminative Domain Adaptation

Eric Tzeng, Judy Hoffman, Kate Saenko, Trevor Darrell

None of the above

Learning to Rank Retargeted Images

Yang Chen, Yong-Jin Liu, Yu-Kun Lai

Object Recognition & Scene Understanding

Automatic Discovery, Association Estimation and Learning of Semantic Attributes for a Thousand Categories

Ziad Al-Halah, Rainer Stiefelhagen

Scene Parsing Through ADE20K Dataset

Bolei Zhou, Hang Zhao, Xavier Puig, Sanja Fidler, Adela Barriuso, Antonio Torralba

Weakly Supervised Cascaded Convolutional Networks

Ali Diba, Vivek Sharma, Ali Pazandeh, Hamed Pirsiavash, Luc Van Gool

Discretely Coding Semantic Rank Orders for Supervised Image Hashing

Li Liu, Ling Shao, Fumin Shen, Mengyang Yu

Joint Geometrical and Statistical Alignment for Visual Domain Adaptation

Jing Zhang, Wanqing Li, Philip Ogunbona

Weakly Supervised Dense Video Captioning

Zhiqiang Shen, Jianguo Li, Zhou Su, Minjun Li, Yurong Chen, Yu-Gang Jiang, Xiangyang Xue

RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation

Guosheng Lin, Anton Milan, Chunhua Shen, Ian Reid

Semantic Segmentation via Structured Patch Prediction, Context CRF and Guidance CRF

Falong Shen, Rui Gan, Shuicheng Yan, Gang Zeng

Person Search With Natural Language Description

Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, Xiaogang Wang

Weakly Supervised Affordance Detection

Johann Sawatzky, Abhilash Srikantha, Juergen Gall

Zero-Shot Recognition Using Dual Visual-Semantic Mapping Paths

Yanan Li, Donghui Wang, Huanhang Hu, Yuetan Lin, Yueting Zhuang

Neural Aggregation Network for Video Face Recognition (PDF)

Jiaolong Yang (Microsoft Research), Peiran Ren (Microsoft Research), Dongqing Zhang (Microsoft Research), Dong Chen (Microsoft Research), Fang Wen (Microsoft Research), Hongdong Li (ANU), Gang Hua (Microsoft Research)

Relationship Proposal Networks

Ji Zhang, Mohamed Elhoseiny, Scott Cohen, Walter Chang, Ahmed Elgammal

Learning Object Interactions and Descriptions for Semantic Image Segmentation

Guangrun Wang, Ping Luo, Liang Lin, Xiaogang Wang

RON: Reverse Connection With Objectness Prior Networks for Object Detection

Tao Kong, Fuchun Sun, Anbang Yao, Huaping Liu, Ming Lu, Yurong Chen

Weakly-Supervised Visual Grounding of Phrases With Linguistic Structures

Fanyi Xiao, Leonid Sigal, Yong Jae Lee

Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects

Ting Yao, Yingwei Pan, Yehao Li, Tao Mei

Beyond Instance-Level Image Retrieval: Leveraging Captions to Learn a Global Visual Representation for Semantic Retrieval

Diane Larlus, Albert Gordo

MuCaLe-Net: Multi Categorical-Level Networks to Generate More Discriminating Features

Youssef Tamaazousti, Hervé Le Borgne, Céline Hudelot

Zero Shot Learning via Multi-Scale Manifold Regularization

Shay Deutsch, Soheil Kolouri, Kyungnam Kim, Yuri Owechko, Stefano Soatto

Theory

Deeply Supervised Salient Object Detection With Short Connections

Qibin Hou, Ming-Ming Cheng, Xiaowei Hu, Ali Borji, Zhuowen Tu, Philip H. S. Torr

A Matrix Splitting Method for Composite Function Minimization

Ganzhao Yuan, Wei-Shi Zheng, Bernard Ghanem

Video Analytics

One-Shot Video Object Segmentation (PDF,project,code,code)

Sergi Caelles,Kevis-Kokitsi Maninis,Jordi Pont-Tuset, Laura Leal-Taixé, Daniel Cremers, Luc Van Gool

Fast Person Re-Identification via Cross-Camera Semantic Binary Transformation

Jiaxin Chen, Yunhong Wang, Jie Qin, Li Liu, Ling Shao

SPFTN: A Self-Paced Fine-Tuning Network for Segmenting Objects in Weakly Labelled Videos

Dingwen Zhang, Le Yang, Deyu Meng, Dong Xu, Junwei Han

Machine Learning 4

Spotlight 4-1A

Hidden Layers in Perceptual Learning

Gad Cohen, Daphna Weinshall

Few-Shot Object Recognition From Machine-Labeled Web Images

Zhongwen Xu, Linchao Zhu, Yi Yang

Hallucinating Very Low-Resolution Unaligned and Noisy Face Images by Transformative Discriminative Autoencoders

Xin Yu, Fatih Porikli

Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension

Aniruddha Kembhavi, Minjoon Seo, Dustin Schwenk, Jonghyun Choi, Ali Farhadi, Hannaneh Hajishirzi

Deep Hashing Network for Unsupervised Domain Adaptation

Hemanth Venkateswara, Jose Eusebio, Shayok Chakraborty, Sethuraman Panchanathan

Generalized Deep Image to Image Regression

Venkataraman Santhanam, Vlad I. Morariu, Larry S. Davis

Deep Learning With Low Precision by Half-Wave Gaussian Quantization

Zhaowei Cai, Xiaodong He, Jian Sun, Nuno Vasconcelos

Creativity: Generating Diverse Questions Using Variational Autoencoders

Unnat Jain, Ziyu Zhang, Alexander G. Schwing

Oral 4-1A

Geometric Deep Learning on Graphs and Manifolds Using Mixture Model CNNs

Federico Monti, Davide Boscaini, Jonathan Masci, Emanuele Rodolà, Jan Svoboda, Michael M. Bronstein

Full Resolution Image Compression With Recurrent Neural Networks

George Toderici, Damien Vincent, Nick Johnston, Sung Jin Hwang, David Minnen, Joel Shor, Michele Covell

Neural Face Editing With Intrinsic Image Disentangling

Zhixin Shu, Ersin Yumer, Sunil Hadap, Kalyan Sunkavalli, Eli Shechtman, Dimitris Samaras

Ubernet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision Using Diverse Datasets and Limited Memory

Iasonas Kokkinos

Analyzing Humans with 3D Vision

Spotlight 4-1B

3D Face Morphable Models “In-The-Wildâ€

James Booth, Epameinondas Antonakos, Stylianos Ploumpis, George Trigeorgis, Yannis Panagakis, Stefanos Zafeiriou

KillingFusion: Non-Rigid 3D Reconstruction Without Correspondences

Miroslava Slavcheva, Maximilian Baust, Daniel Cremers, Slobodan Ilic

Detailed, Accurate, Human Shape Estimation From Clothed 3D Scan Sequences

Chao Zhang, Sergi Pujades, Michael J. Black, Gerard Pons-Moll

POSEidon: Face-From-Depth for Driver Pose Estimation

Guido Borghi, Marco Venturelli, Roberto Vezzani, Rita Cucchiara

Human Shape From Silhouettes Using Generative HKS Descriptors and Cross-Modal Neural Networks

Endri Dibra, Himanshu Jain, Cengiz Öztireli, Remo Ziegler, Markus Gross

Parametric T-Spline Face Morphable Model for Detailed Fitting in Shape Subspace

Weilong Peng, Zhiyong Feng, Chao Xu, Yong Su

3D Menagerie: Modeling the 3D Shape and Pose of Animals

Silvia Zuffi, Angjoo Kanazawa, David W. Jacobs, Michael J. Black

iCaRL: Incremental Classifier and Representation Learning

Sylvestre-Alvise Rebuffi, Alexander Kolesnikov, Georg Sperl, Christoph H. Lampert

Oral 4-1B

Recurrent 3D Pose Sequence Machines

Mude Lin, Liang Lin, Xiaodan Liang, Keze Wang, Hui Cheng

Learning Detailed Face Reconstruction From a Single Image

Elad Richardson, Matan Sela, Roy Or-El, Ron Kimmel

Thin-Slicing Network: A Deep Structured Model for Pose Estimation in Videos

Jie Song, Limin Wang, Luc Van Gool, Otmar Hilliges

Dynamic FAUST: Registering Human Bodies in Motion

Federica Bogo, Javier Romero, Gerard Pons-Moll, Michael J. Black

Poster 4-1

3D Computer Vision

Semantically Coherent Co-Segmentation and Reconstruction of Dynamic Scenes

Armin Mustafa, Adrian Hilton

On the Two-View Geometry of Unsynchronized Cameras

Cenek Albl, Zuzana Kukelova, Andrew Fitzgibbon, Jan Heller, Matej Smid, Tomas Pajdla

Using Locally Corresponding CAD Models for Dense 3D Reconstructions From a Single Image

Chen Kong, Chen-Hsuan Lin, Simon Lucey

A Clever Elimination Strategy for Efficient Minimal Solvers

Zuzana Kukelova, Joe Kileel, Bernd Sturmfels, Tomas Pajdla

Convex Global 3D Registration With Lagrangian Duality

Jesus Briales, Javier Gonzalez-Jimenez

DeMoN: Depth and Motion Network for Learning Monocular Stereo (project,PDF)

Benjamin Ummenhofer, Huizhong Zhou, Jonas Uhrig, Nikolaus Mayer, Eddy Ilg, Alexey Dosovitskiy, Thomas Brox

3D Bounding Box Estimation Using Deep Learning and Geometry

Arsalan Mousavian, Dragomir Anguelov, John Flynn, Jana Košecká

A Dataset for Benchmarking Image-Based Localization

Xun Sun, Yuanfan Xie, Pei Luo, Liang Wang

Analyzing Humans in Images

Asynchronous Temporal Fields for Action Recognition

Gunnar A. Sigurdsson, Santosh Divvala, Ali Farhadi, Abhinav Gupta

Sequential Person Recognition in Photo Albums With a Recurrent Network

Yao Li, Guosheng Lin, Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Anton van den Hengel

Multi-Context Attention for Human Pose Estimation

Xiao Chu, Wei Yang, Wanli Ouyang, Cheng Ma, Alan L. Yuille, Xiaogang Wang

3D Convolutional Neural Networks for Efficient and Robust Hand Pose Estimation From Single Depth Images

Liuhao Ge, Hui Liang, Junsong Yuan, Daniel Thalmann

Lifting From the Deep: Convolutional 3D Pose Estimation From a Single Image

Denis Tome, Chris Russell, Lourdes Agapito

AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos

Amlan Kar, Nishant Rai, Karan Sikka, Gaurav Sharma

Deep Structured Learning for Facial Action Unit Intensity Estimation

Robert Walecki, Ognjen (Oggi) Rudovic, Vladimir Pavlovic, Bjöern Schuller, Maja Pantic

Simultaneous Facial Landmark Detection, Pose and Deformation Estimation Under Facial Occlusion

Yue Wu, Chao Gou, Qiang Ji

Self-Supervised Video Representation Learning With Odd-One-Out Networks

Basura Fernando, Hakan Bilen, Efstratios Gavves, Stephen Gould

Robust Joint and Individual Variance Explained

Christos Sagonas, Yannis Panagakis, Alina Leidinger, Stefanos Zafeiriou

Discriminative Covariance Oriented Representation Learning for Face Recognition With Image Sets

Wen Wang, Ruiping Wang, Shiguang Shan, Xilin Chen

3D Human Pose Estimation = 2D Pose Estimation + Matching

Ching-Hang Chen, Deva Ramanan

Applications

Joint Gap Detection and Inpainting of Line Drawings

Kazuma Sasaki, Satoshi Iizuka, Edgar Simo-Serra, Hiroshi Ishikawa

Biomedical Image/Video Analysis

Riemannian Nonlinear Mixed Effects Models: Analyzing Longitudinal Deformations in Neuroimaging

Hyunwoo J. Kim, Nagesh Adluru, Heemanshu Suri, Baba C. Vemuri, Sterling C. Johnson, Vikas Singh

Simultaneous Super-Resolution and Cross-Modality Synthesis of 3D Medical Images Using Weakly-Supervised Joint Convolutional Sparse Coding

Yawen Huang, Ling Shao, Alejandro F. Frangi

Computational Photography

Multiple-Scattering Microphysics Tomography

Aviad Levis, Yoav Y. Schechner, Anthony B. Davis

Image Motion & Tracking

Accurate Optical Flow via Direct Cost Volume Processing

Jia Xu, René Ranftl, Vladlen Koltun

Event-Based Visual Inertial Odometry

Alex Zihao Zhu, Nikolay Atanasov, Kostas Daniilidis

Robust Visual Tracking Using Oblique Random Forests

Le Zhang, Jagannadan Varadarajan, Ponnuthurai Nagaratnam Suganthan, Narendra Ahuja, Pierre Moulin

Low- & Mid-Level Vision

Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution

Wei-Sheng Lai, Jia-Bin Huang, Narendra Ahuja, Ming-Hsuan Yang

Learning Non-Lambertian Object Intrinsics Across ShapeNet Categories

Jian Shi, Yue Dong, Hao Su, Stella X. Yu

MCMLSD: A Dynamic Programming Approach to Line Segment Detection

Emilio J. Almazàn, Ron Tal, Yiming Qian, James H. Elder

Contour-Constrained Superpixels for Image and Video Processing

Se-Ho Lee, Won-Dong Jang, Chang-Su Kim

Richer Convolutional Features for Edge Detection

Yun Liu, Ming-Ming Cheng, Xiaowei Hu, Kai Wang, Xiang Bai

Non-Local Color Image Denoising With Convolutional Neural Networks

Stamatios Lefkimmiatis

Generative Face Completion

Yijun Li, Sifei Liu, Jimei Yang, Ming-Hsuan Yang

Hyper-Laplacian Regularized Unidirectional Low-Rank Tensor Recovery for Multispectral Image Denoising

Yi Chang, Luxin Yan, Sheng Zhong

Unsupervised Semantic Scene Labeling for Streaming Data

Maggie Wigness, John G. Rogers III

Why You Should Forget Luminance Conversion and Do Something Better

Rang M. H. Nguyen, Michael S. Brown

Deep Semantic Feature Matching

Nikolai Ufer, Björn Ommer

Machine Learning

Revisiting the Variable Projection Method for Separable Nonlinear Least Squares Problems

Je Hyeong Hong, Christopher Zach, Andrew Fitzgibbon

Efficient Multiple Instance Metric Learning Using Weakly Supervised Data

Marc T. Law, Yaoliang Yu, Raquel Urtasun, Richard S. Zemel, Eric P. Xing

WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation (PDF,supplementary material, code)

Thibaut Durand, Taylor Mordan, Nicolas Thome, Matthieu Cord

Image-To-Image Translation With Conditional Adversarial Networks

Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros

Deep Roots: Improving CNN Efficiency With Hierarchical Filter Groups

Yani Ioannou, Duncan Robertson, Roberto Cipolla, Antonio Criminisi

Aggregated Residual Transformations for Deep Neural Networks

Saining Xie, Ross Girshick, Piotr Dollár, Zhuowen Tu, Kaiming He

MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks With Privileged Information

Hao Yang, Joey Tianyi Zhou, Jianfei Cai, Yew Soon Ong

Low-Rank Embedded Ensemble Semantic Dictionary for Zero-Shot Learning

Zhengming Ding, Ming Shao, Yun Fu

Factorized Variational Autoencoders for Modeling Audience Reactions to Movies

Zhiwei Deng, Rajitha Navarathna, Peter Carr, Stephan Mandt, Yisong Yue, Iain Matthews, Greg Mori

Learning Features by Watching Objects Move

Deepak Pathak, Ross Girshick, Piotr Dollár, Trevor Darrell, Bharath Hariharan

What Can Help Pedestrian Detection?

Jiayuan Mao, Tete Xiao, Yuning Jiang, Zhimin Cao

DeepPermNet: Visual Permutation Learning

Rodrigo Santa Cruz, Basura Fernando, Anoop Cherian, Stephen Gould

Learning the Multilinear Structure of Visual Data

Mengjiao Wang, Yannis Panagakis, Patrick Snape, Stefanos Zafeiriou

Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies

Lena Gorelick, Yuri Boykov, Olga Veksler

Designing Energy-Efficient Convolutional Neural Networks Using Energy-Aware Pruning

Tien-Ju Yang, Yu-Hsin Chen, Vivienne Sze

Joint Multi-Person Pose Estimation and Semantic Part Segmentation (PDF,dataset)

Fangting Xia, Peng Wang, Xianjie Chen, Alan L. Yuille

Deep Feature Interpolation for Image Content Changes

Paul Upchurch, Jacob Gardner, Geoff Pleiss, Robert Pless, Noah Snavely, Kavita Bala, Kilian Weinberger

FASON: First and Second Order Information Fusion Network for Texture Recognition

Xiyang Dai, Joe Yue-Hei Ng, Larry S. Davis

Lean Crowdsourcing: Combining Humans and Machines in an Online System

Steve Branson, Grant Van Horn, Pietro Perona

Object Recognition & Scene Understanding

Supervising Neural Attention Models for Video Captioning by Human Gaze Data

Youngjae Yu, Jongwook Choi, Yeonhwa Kim, Kyung Yoo, Sang-Hun Lee, Gunhee Kim

L2-Net: Deep Learning of Discriminative Patch Descriptor in Euclidean Space

Yurun Tian, Bin Fan, Fuchao Wu

Convolutional Random Walk Networks for Semantic Image Segmentation

Gedas Bertasius, Lorenzo Torresani, Stella X. Yu, Jianbo Shi

Knowledge Acquisition for Visual Question Answering via Iterative Querying

Yuke Zhu, Joseph J. Lim, Li Fei-Fei

Memory-Augmented Attribute Manipulation Networks for Interactive Fashion Search

Bo Zhao, Jiashi Feng, Xiao Wu, Shuicheng Yan

From Zero-Shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis

Yang Long, Li Liu, Ling Shao, Fumin Shen, Guiguang Ding, Jungong Han

Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization?

Torsten Sattler, Akihiko Torii, Josef Sivic, Marc Pollefeys, Hajime Taira, Masatoshi Okutomi, Tomas Pajdla

Asymmetric Feature Maps With Application to Sketch Based Retrieval

Giorgos Tolias, Ondřej Chum

Diverse Image Annotation

Baoyuan Wu, Fan Jia, Wei Liu, Bernard Ghanem

AMC: Attention guided Multi-modal Correlation Learning for Image Search

Kan Chen, Trung Bui, Chen Fang, Zhaowen Wang, Ram Nevatia

Multi-Attention Network for One Shot Learning

Peng Wang, Lingqiao Liu, Chunhua Shen, Zi Huang, Anton van den Hengel, Heng Tao Shen

Fried Binary Embedding for High-Dimensional Visual Features

Weixiang Hong, Junsong Yuan, Sreyasee Das Bhattacharjee

Pyramid Scene Parsing Network

Hengshuang Zhao, Jianping Shi, Xiaojuan Qi, Xiaogang Wang, Jiaya Jia

Learning Deep Match Kernels for Image-Set Classification

Haoliang Sun, Xiantong Zhen, Yuanjie Zheng, Gongping Yang, Yilong Yin, Shuo Li

Task-Driven Dynamic Fusion: Reducing Ambiguity in Video Description

Xishan Zhang, Ke Gao, Yongdong Zhang, Dongming Zhang, Jintao Li, Qi Tian

Learning Multifunctional Binary Codes for Both Category and Attribute Oriented Retrieval Tasks

Haomiao Liu, Ruiping Wang, Shiguang Shan, Xilin Chen

Indoor Scene Parsing With Instance Segmentation, Semantic Labeling and Support Relationship Inference

Wei Zhuo, Mathieu Salzmann, Xuming He, Miaomiao Liu

Episodic CAMN: Contextual Attention-Based Memory Networks With Iterative Feedback for Scene Labeling

Abrar H. Abdulnabi, Bing Shuai, Stefan Winkler, Gang Wang

Link the Head to the “Beakâ€: Zero Shot Learning From Noisy Text Description at Part Precision

Mohamed Elhoseiny, Yizhe Zhu, Han Zhang, Ahmed Elgammal

SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning

Long Chen, Hanwang Zhang, Jun Xiao, Liqiang Nie, Jian Shao, Wei Liu, Tat-Seng Chua

Deep Pyramidal Residual Networks (PDF,code)

Dongyoon Han, Jiwhan Kim, Junmo Kim

Product Split Trees

Artem Babenko, Victor Lempitsky

Making the v in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering

Yash Goyal, Tejas Khot, Douglas Summers-Stay, Dhruv Batra, Devi Parikh

Commonly Uncommon: Semantic Sparsity in Situation Recognition

Mark Yatskar, Vicente Ordonez, Luke Zettlemoyer, Ali Farhadi

Cross-Modality Binary Code Learning via Fusion Similarity Hashing

Hong Liu, Rongrong Ji, Yongjian Wu, Feiyue Huang, Baochang Zhang

Theory

Saliency Revisited: Analysis of Mouse Movements Versus Fixations

Hamed R. Tavakoli, Fawad Ahmed, Ali Borji, Jorma Laaksonen

InterpoNet, a Brain Inspired Neural Network for Optical Flow Dense Interpolation

Shay Zweig, Lior Wolf

Video Analytics

SST: Single-Stream Temporal Action Proposals

Shyamal Buch, Victor Escorcia, Chuanqi Shen, Bernard Ghanem, Juan Carlos Niebles

Video Segmentation via Multiple Granularity Analysis

Rui Yang, Bingbing Ni, Chao Ma, Yi Xu, Xiaokang Yang

Spatio-Temporal Alignment of Non-Overlapping Sequences From Independently Panning Cameras

Seyed Morteza Safdarnejad, Xiaoming Liu

UntrimmedNets for Weakly Supervised Action Recognition and Detection

Limin Wang, Yuanjun Xiong, Dahua Lin, Luc Van Gool

Object Recognition & Scene Understanding 3

Spotlight 4-2A

Gaze Embeddings for Zero-Shot Image Classification

Nour Karessli, Zeynep Akata, Bernt Schiele, Andreas Bulling

What's in a Question: Using Visual Questions as a Form of Supervision

Siddha Ganju, Olga Russakovsky, Abhinav Gupta

Attend to You: Personalized Image Captioning With Context Sequence Memory Networks

Cesc Chunseong Park, Byeongchang Kim, Gunhee Kim

Adversarially Tuned Scene Generation

VSR Veeravasarapu, Constantin Rothkopf, Ramesh Visvanathan

Residual Attention Network for Image Classification

Fei Wang, Mengqing Jiang, Chen Qian, Shuo Yang, Cheng Li, Honggang Zhang, Xiaogang Wang, Xiaoou Tang

Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade

Xiaoxiao Li, Ziwei Liu, Ping Luo, Chen Change Loy, Xiaoou Tang

Learning Non-Maximum Suppression

Jan Hosang, Rodrigo Benenson, Bernt Schiele

The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives

Mohit Iyyer, Varun Manjunatha, Anupam Guha, Yogarshi Vyas, Jordan Boyd-Graber, Hal Daumé III, Larry S. Davis

Oral 4-2A

Object Region Mining With Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach

Yunchao Wei, Jiashi Feng, Xiaodan Liang, Ming-Ming Cheng, Yao Zhao, Shuicheng Yan

Fine-Grained Recognition as HSnet Search for Informative Image Parts

Michael Lam, Behrooz Mahasseni, Sinisa Todorovic

G2DeNet: Global Gaussian Distribution Embedding Network and Its Application to Visual Recognition

Qilong Wang, Peihua Li, Lei Zhang

YOLO9000: Better, Faster, Stronger

Joseph Redmon, Ali Farhadi

Machine Learning for 3D Vision

Spotlight 4-2B

Multi-View 3D Object Detection Network for Autonomous Driving

Xiaozhi Chen, Huimin Ma, Ji Wan, Bo Li, Tian Xia

UltraStereo: Efficient Learning-Based Matching for Active Stereo Systems

Sean Ryan Fanello, Julien Valentin, Christoph Rhemann, Adarsh Kowdle, Vladimir Tankovich, Philip Davidson, Shahram Izadi

Shape Completion Using 3D-Encoder-Predictor CNNs and Shape Synthesis

Angela Dai, Charles Ruizhongtai Qi, Matthias Nießner

Geometric Loss Functions for Camera Pose Regression With Deep Learning

Alex Kendall, Roberto Cipolla

CNN-SLAM: Real-Time Dense Monocular SLAM With Learned Depth Prediction

Keisuke Tateno, Federico Tombari, Iro Laina, Nassir Navab

Learning From Noisy Large-Scale Datasets With Minimal Supervision

Andreas Veit, Neil Alldrin, Gal Chechik, Ivan Krasin, Abhinav Gupta, Serge Belongie

SyncSpecCNN: Synchronized Spectral CNN for 3D Shape Segmentation

Li Yi, Hao Su, Xingwen Guo, Leonidas J. Guibas

Non-Local Deep Features for Salient Object Detection

Zhiming Luo, Akshaya Mishra, Andrew Achkar, Justin Eichel, Shaozi Li, Pierre-Marc Jodoin

Oral 4-2B

Unsupervised Monocular Depth Estimation With Left-Right Consistency

Clément Godard, Oisin Mac Aodha, Gabriel J. Brostow

Unsupervised Learning of Depth and Ego-Motion From Video

Tinghui Zhou, Matthew Brown, Noah Snavely, David G. Lowe

OctNet: Learning Deep 3D Representations at High Resolutions

Gernot Riegler, Ali Osman Ulusoy, Andreas Geiger

3D Shape Segmentation With Projective Convolutional Networks

Evangelos Kalogerakis, Melinos Averkiou, Subhransu Maji, Siddhartha Chaudhuri

Poster 4-2

3D Computer Vision

SGM-Nets: Semi-Global Matching With Neural Networks

Akihito Seki, Marc Pollefeys

Stereo-Based 3D Reconstruction of Dynamic Fluid Surfaces by Global Optimization

Yiming Qian, Minglun Gong, Yee-Hong Yang

Fine-To-Coarse Global Registration of RGB-D Scans

Maciej Halber, Thomas Funkhouser

Analyzing Computer Vision Data - The Good, the Bad and the Ugly

Oliver Zendel, Katrin Honauer, Markus Murschitz, Martin Humenberger, Gustavo Fernández Domínguez

Product Manifold Filter: Non-Rigid Shape Correspondence via Kernel Density Estimation in the Product Space

Matthias Vestner, Roee Litman, Emanuele Rodolà, Alex Bronstein, Daniel Cremers

Unsupervised Vanishing Point Detection and Camera Calibration From a Single Manhattan Image With Radial Distortion

Michel Antunes, João P. Barreto, Djamila Aouada, Björn Ottersten

Toroidal Constraints for Two-Point Localization Under High Outlier Ratios

Federico Camposeco, Torsten Sattler, Andrea Cohen, Andreas Geiger, Marc Pollefeys

4D Light Field Superpixel and Segmentation

Hao Zhu, Qi Zhang, Qing Wang

Exploiting Symmetry and/or Manhattan Properties for 3D Object Structure Estimation From Single and Multiple Images

Yuan Gao, Alan L. Yuille

Analyzing Humans in Images

Binary Coding for Partial Action Analysis With Limited Observation Ratios

Jie Qin, Li Liu, Ling Shao, Bingbing Ni, Chen Chen, Fumin Shen, Yunhong Wang

SphereFace: Deep Hypersphere Embedding for Face Recognition

Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, Le Song

IRINA: Iris Recognition (Even) in Inaccurately Segmented Data

Hugo Proença, João C. Neves

Look Into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing

Ke Gong, Xiaodan Liang, Dongyu Zhang, Xiaohui Shen, Liang Lin

Action Unit Detection With Region Adaptation, Multi-Labeling Learning and Optimal Temporal Fusing

Wei Li, Farnaz Abtahi, Zhigang Zhu

See the Forest for the Trees: Joint Spatial and Temporal Recurrent Neural Networks for Video-Based Person Re-Identification

Zhen Zhou, Yan Huang, Wei Wang, Liang Wang, Tieniu Tan

Joint Intensity and Spatial Metric Learning for Robust Gait Recognition

Yasushi Makihara, Atsuyuki Suzuki, Daigo Muramatsu, Xiang Li, Yasushi Yagi

Pose-Aware Person Recognition

Vijay Kumar, Anoop Namboodiri, Manohar Paluri, C. V. Jawahar

Not Afraid of the Dark: NIR-VIS Face Recognition via Cross-Spectral Hallucination and Low-Rank Embedding

José Lezama, Qiang Qiu, Guillermo Sapiro

Applications

Jointly Learning Energy Expenditures and Activities Using Egocentric Multimodal Signals

Katsuyuki Nakamura, Serena Yeung, Alexandre Alahi, Li Fei-Fei

Binarized Mode Seeking for Scalable Visual Pattern Discovery

Wei Zhang, Xiaochun Cao, Rui Wang, Yuanfang Guo, Zhineng Chen

Scribbler: Controlling Deep Image Synthesis With Sketch and Color

Patsorn Sangkloy, Jingwan Lu, Chen Fang, Fisher Yu, James Hays

Biomedical Image/Video Analysis

Multi-Way Multi-Level Kernel Modeling for Neuroimaging Classification

Lifang He, Chun-Ta Lu, Hao Ding, Shen Wang, Linlin Shen, Philip S. Yu, Ann B. Ragin

WSISA: Making Survival Prediction From Whole Slide Histopathological Images

Xinliang Zhu, Jiawen Yao, Feiyun Zhu, Junzhou Huang

Computational Photography

On the Effectiveness of Visible Watermarks

Tali Dekel, Michael Rubinstein, Ce Liu, William T. Freeman

Snapshot Hyperspectral Light Field Imaging

Zhiwei Xiong, Lizhi Wang, Huiqun Li, Dong Liu, Feng Wu

Semantic Image Inpainting With Deep Generative Models

Raymond A. Yeh, Chen Chen, Teck Yian Lim, Alexander G. Schwing, Mark Hasegawa-Johnson, Minh N. Do

Image Motion & Tracking

Fast Multi-Frame Stereo Scene Flow With Motion Segmentation

Tatsunori Taniai, Sudipta N. Sinha, Yoichi Sato

Improved Stereo Matching With Constant Highway Networks and Reflective Confidence Learning

Amit Shaked, Lior Wolf

Optical Flow in Mostly Rigid Scenes

Jonas Wulff, Laura Sevilla-Lara, Michael J. Black

Optical Flow Requires Multiple Strategies (but Only One Network) (PDF,code)

Tal Schuster, Lior Wolf, David Gadot

ECO: Efficient Convolution Operators for Tracking

Martin Danelljan, Goutam Bhat, Fahad Shahbaz Khan, Michael Felsberg

Low- & Mid-Level Vision

Differential Angular Imaging for Material Recognition

Jia Xue, Hang Zhang, Kristin Dana, Ko Nishino

Fast Fourier Color Constancy

Jonathan T. Barron, Yun-Ta Tsai

Comparative Evaluation of Hand-Crafted and Learned Local Features

Johannes L. Schönberger, Hans Hardmeier, Torsten Sattler, Marc Pollefeys

Learning Fully Convolutional Networks for Iterative Non-Blind Deconvolution

Jiawei Zhang, Jinshan Pan, Wei-Sheng Lai, Rynson W. H. Lau, Ming-Hsuan Yang

Image Deblurring via Extreme Channels Prior

Yanyang Yan, Wenqi Ren, Yuanfang Guo, Rui Wang, Xiaochun Cao

Simultaneous Stereo Video Deblurring and Scene Flow Estimation

Liyuan Pan, Yuchao Dai, Miaomiao Liu, Fatih Porikli

Deep Photo Style Transfer

Fujun Luan, Sylvain Paris, Eli Shechtman, Kavita Bala

Generative Attribute Controller With Conditional Filtered Generative Adversarial Networks

Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino

Fast Haze Removal for Nighttime Image Using Maximum Reflectance Prior

Jing Zhang, Yang Cao, Shuai Fang, Yu Kang, Chang Wen Chen

Machine Learning

Low-Rank Bilinear Pooling for Fine-Grained Classification

Shu Kong, Charless Fowlkes

Neural Scene De-Rendering

Jiajun Wu, Joshua B. Tenenbaum, Pushmeet Kohli

Real-Time Neural Style Transfer for Videos

Haozhi Huang, Hao Wang, Wenhan Luo, Lin Ma, Wenhao Jiang, Xiaolong Zhu, Zhifeng Li, Wei Liu

A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning

Shijie Yang, Liang Li, Shuhui Wang, Weigang Zhang, Qingming Huang

Collaborative Deep Reinforcement Learning for Joint Object Search

Xiangyu Kong, Bo Xin, Yizhou Wang, Gang Hua

Loss Max-Pooling for Semantic Image Segmentation

Samuel Rota Bulò, Gerhard Neuhold, Peter Kontschieder

Deep View Morphing

Dinghuang Ji, Junghyun Kwon, Max McFarland, Silvio Savarese

Unsupervised Learning of Long-Term Motion Dynamics for Videos

Zelun Luo, Boya Peng, De-An Huang, Alexandre Alahi, Li Fei-Fei

Revisiting Metric Learning for SPD Matrix Based Visual Representation

Luping Zhou, Lei Wang, Jianjia Zhang, Yinghuan Shi, Yang Gao

Expert Gate: Lifelong Learning With a Network of Experts

Rahaf Aljundi, Punarjay Chakravarty, Tinne Tuytelaars

A Gift From Knowledge Distillation: Fast Optimization, Network Minimization and Transfer Learning

Junho Yim, Donggyu Joo, Jihoon Bae, Junmo Kim

Domain Adaptation by Mixture of Alignments of Second- or Higher-Order Scatter Tensors

Piotr Koniusz, Yusuf Tas, Fatih Porikli

Deep Mixture of Linear Inverse Regressions Applied to Head-Pose Estimation

Stéphane Lathuilière, Rémi Juge, Pablo Mesejo, Rafael Muñoz-Salinas, Radu Horaud

STD2P: RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling

Yang He, Wei-Chen Chiu, Margret Keuper, Mario Fritz

Harmonic Networks: Deep Translation and Rotation Equivariance

Daniel E. Worrall, Stephan J. Garbin, Daniyar Turmukhambetov, Gabriel J. Brostow

Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer

Xin Wang, Geoffrey Oxholm, Da Zhang, Yuan-Fang Wang

Detect, Replace, Refine: Deep Structured Prediction for Pixel Wise Labeling

Spyros Gidaris, Nikos Komodakis

Weighted-Entropy-Based Quantization for Deep Neural Networks

Eunhyeok Park, Junwhan Ahn, Sungjoo Yoo

Residual Expansion Algorithm: Fast and Effective Optimization for Nonconvex Least Squares Problems

Daiki Ikami, Toshihiko Yamasaki, Kiyoharu Aizawa

Bidirectional Beam Search: Forward-Backward Inference in Neural Sequence Models for Fill-In-The-Blank Image Captioning

Qing Sun, Stefan Lee, Dhruv Batra

Newton-Type Methods for Inference in Higher-Order Markov Random Fields

Hariprasad Kannan, Nikos Komodakis, Nikos Paragios

Adaptive Relaxed ADMM: Convergence Theory and Practical Implementation

Zheng Xu, Mário A. T. Figueiredo, Xiaoming Yuan, Christoph Studer, Tom Goldstein

Object Recognition & Scene Understanding

ViP-CNN: Visual Phrase Guided Convolutional Neural Network

Yikang Li, Wanli Ouyang, Xiaogang Wang, Xiao'ou Tang

Instance-Aware Image and Sentence Matching With Selective Multimodal LSTM

Yan Huang, Wei Wang, Liang Wang

Kernel Square-Loss Exemplar Machines for Image Retrieval

Rafael S. Rezende, Joaquin Zepeda, Jean Ponce, Francis Bach, Patrick Pérez

Cognitive Mapping and Planning for Visual Navigation

Saurabh Gupta, James Davidson, Sergey Levine, Rahul Sukthankar, Jitendra Malik

Combining Bottom-Up, Top-Down, and Smoothness Cues for Weakly Supervised Image Segmentation

Anirban Roy, Sinisa Todorovic

Seeing Into Darkness: Scotopic Visual Recognition

Bo Chen, Pietro Perona

Deep Co-Occurrence Feature Learning for Visual Object Recognition

Ya-Fang Shih, Yang-Ming Yeh, Yen-Yu Lin, Ming-Fang Weng, Yi-Chang Lu, Yung-Yu Chuang

An Empirical Evaluation of Visual Question Answering for Novel Objects

Santhosh K. Ramakrishnan, Ambar Pal, Gaurav Sharma, Anurag Mittal

InstanceCut: From Edges to Instances With MultiCut

Alexander Kirillov, Evgeny Levinkov, Bjoern Andres, Bogdan Savchynskyy, Carsten Rother

Fine-Grained Image Classification via Combining Vision and Language

Xiangteng He, Yuxin Peng

Mimicking Very Efficient Network for Object Detection

Quanquan Li, Shengying Jin, Junjie Yan

Tracking by Natural Language Specification

Zhenyang Li, Ran Tao, Efstratios Gavves, Cees G. M. Snoek, Arnold W.M. Smeulders

A Dataset and Exploration of Models for Understanding Video Data Through Fill-In-The-Blank Question-Answering

Tegan Maharaj, Nicolas Ballas, Anna Rohrbach, Aaron Courville, Christopher Pal

Learning Detection With Diverse Proposals

Samaneh Azadi, Jiashi Feng, Trevor Darrell

Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition

Yufei Wang, Zhe Lin, Xiaohui Shen, Scott Cohen, Garrison W. Cottrell

Theory

A Low Power, Fully Event-Based Gesture Recognition System

Arnon Amir, Brian Taba, David Berg, Timothy Melano, Jeffrey McKinstry, Carmelo Di Nolfo, Tapan Nayak, Alexander Andreopoulos, Guillaume Garreau, Marcela Mendoza, Jeff Kusnitz, Michael Debole, Steve Esser, Tobi Delbruck, Myron Flickner, Dharmendra Modha

Video Analytics

Learning Deep Context-Aware Features Over Body and Latent Parts for Person Re-Identification

Dangwei Li, Xiaotang Chen, Zhang Zhang, Kaiqi Huang

Recurrent Modeling of Interaction Context for Collective Activity Recognition

Minsi Wang, Bingbing Ni, Xiaokang Yang

Primary Object Segmentation in Videos Based on Region Augmentation and Reduction

Yeong Jun Koh, Chang-Su Kim

ROAM: A Rich Object Appearance Model With Application to Rotoscoping

Ondrej Miksik, Juan-Manuel Pérez-Rúa, Philip H. S. Torr, Patrick Pérez

Temporal Residual Networks for Dynamic Scene Recognition

Christoph Feichtenhofer, Axel Pinz, Richard P. Wildes

Spatiotemporal Multiplier Networks for Video Action Recognition

Christoph Feichtenhofer, Axel Pinz, Richard P. Wildes

Learning to Learn From Noisy Web Videos

Serena Yeung, Vignesh Ramanathan, Olga Russakovsky, Liyue Shen, Greg Mori, Li Fei-Fei

YouTube-BoundingBoxes: A Large High-Precision Human-Annotated Data Set for Object Detection in Video

Esteban Real, Jonathon Shlens, Stefano Mazzocchi, Xin Pan, Vincent Vanhoucke

Online Video Object Segmentation via Convolutional Trident Network

Won-Dong Jang, Chang-Su Kim

你可能感兴趣的:(杂记)