今日CS.CV计算机视觉论文速览
Tue, 1 Jan 2019
Totally 52 papers
图片快速视觉效果增强算法,基于Ignatov的算法提高图像的感知质量,利用了轻量级的模型得到了6.3倍的提速。主要就超分辨、色彩校正和去模糊等方面提出了内容损失、纹理损失、色彩损失以及全局变化损失等。(from 苏黎世理工)
网络架构如下:
一些结果:
数据集DPED dataset,包含手机照片和对应的单反照片。
Code
GAN生成图像的指纹,研究从自相关性、残差相关性以及混淆矩阵等方面进行了度量,发现GAN具有独特的数字指纹可以被识别出来。(那不勒斯费迪南德||大学)
数据集:RAISE dataset
结合残差和稀疏卷积编码实现图像超分辨RL-CSC,其中卷积稀疏编码可以迭代的学习出输入特征并稀疏编码,同时残差可以再网络加深时保持训练的稳定。在工作Learned Iterative Shrinkage Threshold Algorithm (LISTA)的基础上改进,利用全卷积的方式实现,增加了模型的可解释性。
一些相关的循环结构:
code
Dataset: Berkeley Segmentation Dataset,BSD100,Urban100
检测蜜蜂运动轨迹,实现了对于致密、迅速无规则对象的轨迹跟踪。主要方法是利用基于分割的检测方法获得短距离的局部轨迹,随后利用目标识别模型来融合这些轨迹。主要的贡献在于建立一种称为像素识别(Pixel personality)的机制来从进行轨迹融合。
(from 冲绳科技)
一些蜜蜂的轨迹跟总结果
蜜蜂Dataset,密集慎入
多种背景光照变化下的目标检测,在140个网络摄像头的5M张照片上测评了yolo算法的应对不同光照变化的能力。研究表明算法无法适应光照变化和夜间环境,并建议未来的目标检测算法应该在相关数据集上进行训练才能保证各个时间段的有效性。(from普渡大学)
不同场景下的光照变化:
不同时间下的变化:
一种准确高效的字符识别方法,(from 艾斯尤特大学 Egypt)
阿拉伯手写字符KFUPM Handwritten Arabic TexT (KHATT):http://khatt.ideas2serve.net/index.php
利用简单快速的线性求解器方法解决了相机卷帘快门带来的绝对定位问题, 使用了6点求解器达到了R6P求解器的效果。(from 日本国立情报研究所)
ref:https://www.nii.ac.jp/en/
基于RGB-D点云的三维卷积无模型位姿估计,与通常需要目标三维模型的位姿估计问题不同的是,这一工作使用了两个步骤,通过3D卷积处理了RGB-D点云信息进行点云估计。实现了1cm的定位精度和5度的角度精度。并在真实的机器人抓取任务中获得90%的准确率。此外在研究过程中,还利用运动捕捉系统实现了点云的精确标注,能为训练提供大量高精度标记的点云数据。(from 南洋理工)
三维卷积网络模型:
在ros中的流程,点云>体素化>旋转/平移>轨迹规划
通过变化分解实现快速全局点云刚性配准, 为了解决点云的刚性配准问题,避免BnB方法庞大的计算量和低效的约束评价方法,研究人员提出了一种具有不变性的矢量,将6D刚体变换分解为了3D旋转和平移的搜索。在减小计算维度的条件下提高了效率,并利用新的数据结果3D Integral Volume来加速Bound过程。(from 复旦 tum)
选择和平移的空间限制Bound:
Synthetic Data:Stanford 3D Scanning Repository [50], Chicken,Rhino and T-rex from Mians dataset [51], [52],Camera from the Stefan Hinterstoissers dataset [53] and Hand from the Large Geometric Models Archive at Georgia Tech [54]。
Real data:Stanford Scanning Models, Indoor Scan Data(from matlab), Clinical Data(3D MRI 数据)
http://graphics.stanford.edu/data/3Dscanrep/
http://campar.in.tum.de/Main/StefanHinterstoisser
https://www.cc.gatech.edu/projects/
[1] Title: Mid-Level Visual Representations Improve Generalization and Sample Efficiency for Learning Active Tasks
Authors:Alexander Sax, Bradley Emi, Amir R. Zamir, Leonidas Guibas, Silvio Savarese, Jitendra Malik
[2] Title: The role of visual saliency in the automation of seismic interpretation
Authors:Muhammad Amir Shafiq, Tariq Alshawi, Zhiling Long, Ghassan AlRegib
[3] Title: Image Super-Resolution via RL-CSC: When Residual Learning Meets Convolutional Sparse Coding
Authors:Menglei Zhang, Zhou Liu, Lei Yu
[4] Title: High Quality Monocular Depth Estimation via Transfer Learning
Authors:Ibraheem Alhashim, Peter Wonka
[5] Title: Large-Scale Object Detection of Images from Network Cameras in Variable Ambient Lighting Conditions
Authors:Caleb Tung, Matthew R. Kelleher, Ryan J. Schlueter, Binhan Xu, Yung-Hsiang Lu, George K. Thiruvathukal, Yen-Kuang Chen, Yang Lu
[6] Title: Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks
Authors:Mohamed Yousef, Khaled F. Hussain, Usama S. Mohammed
[7] Title: Fast Perceptual Image Enhancement
Authors:Etienne de Stoutz, Andrey Ignatov, Nikolay Kobyshev, Radu Timofte, Luc Van Gool
[8] Title: Do GANs leave artificial fingerprints?
Authors:Francesco Marra, Diego Gragnaniello, Luisa Verdoliva, Giovanni Poggi
[9] Title: Sequential Gating Ensemble Network for Noise Robust Multi-Scale Face Restoration
Authors:Zhibo Chen, Jianxin Lin, Tiankuang Zhou, Feng Wu
[10] Title: Pixel personality for dense object tracking in a 2D honeybee hive
Authors:Katarzyna Bozek, Laetitia Hebert, Alexander S Mikheyev, Greg J Stephens
[11] Title: PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation
Authors:Sida Peng, Yuan Liu, Qixing Huang, Hujun Bao, Xiaowei Zhou
[12] Title: Predicting Group Cohesiveness in Images
Authors:Shreya Ghosh, Abhinav Dhall, Nicu Sebe
[13] Title: The meaning of “most” for visual question answering models
Authors:Alexander Kuhnle, Ann Copestake
[14] Title: Total Variation with Overlapping Group Sparsity and Lp Quasinorm for Infrared Image Deblurring under Salt-and-Pepper Noise
Authors:Xingguo Liua, Yinping Chena, Zhenming Penga, Juan Wu
[15] Title: SiamRPN++: Evolution of Siamese Visual Tracking with Very Deep Networks
Authors:Bo Li, Wei Wu, Qiang Wang, Fangyi Zhang, Junliang Xing, Junjie Yan
[16] Title: Sex-Classification from Cell-Phones Periocular Iris Images
Authors:Juan Tapia, Claudia Arellano, Ignacio Viedma
[17] Title: Unsupervised monocular stereo matching
Authors:Zhimin Zhang, Jianzhong Qiao, Shukuan Lin
[18] Title: Path-Invariant Map Networks
Authors:Zaiwei Zhang, Zhenxiao Liang, Lemeng Wu, Xiaowei Zhou, Qixing Huang
[19] Title: Actor Conditioned Attention Maps for Video Action Detection
Authors:Oytun Ulutan, Swati Rallapalli, Mudhakar Srivatsa, B.S. Manjunath
[20] Title: Solar Potential Analysis of Rooftops Using Satellite Imagery
Authors:Akash Kumar, S. Indu
[21] Title: Cascaded V-Net using ROI masks for brain tumor segmentation
Authors:Adrià Casamitjana, Marcel Catà, Irina Sánchez, Marc Combalia, Verónica Vilaplana
[22] Title: Leishmaniasis Parasite Segmentation and Classification using Deep Learning
Authors:Marc Górriz, Albert Aparicio, Berta Raventós, Verónica Vilaplana, Elisa Sayrol, Daniel López-Codina
[23] Title: Fingerprint Presentation Attack Detection: Generalization and Efficiency
Authors:Tarang Chugh, Anil K. Jain
[24] Title: Monte-Carlo Sampling applied to Multiple Instance Learning for Histological Image Classification
Authors:Marc Combalia, Veronica Vilaplana
[25] Title: Linear solution to the minimal absolute pose rolling shutter problem
Authors:Zuzana Kukelova, Cenek Albl, Akihiro Sugimoto, Tomas Pajdla
[26] Title: CoSpace: Common Subspace Learning from Hyperspectral-Multispectral Correspondences
Authors:Danfeng Hong, Naoto Yokoya, Jocelyn Chanussot, Xiao Xiang Zhu
[27] Title: A High-Performance CNN Method for Offline Handwritten Chinese Character Recognition and Visualization
Authors:Pavlo Melnyk, Zhiqiang You, Keqin Li
[28] Title: DART: Domain-Adversarial Residual-Transfer Networks for Unsupervised Cross-Domain Image Classification
Authors:Xianghong Fang, Haoli Bai, Ziyi Guo, Bin Shen, Steven Hoi, Zenglin Xu
[29] Title: Brain MRI super-resolution using 3D generative adversarial networks
Authors:Irina Sanchez, Veronica Vilaplana
[30] Title: Feature Preserving and Uniformity-controllable Point Cloud Simplification on Graph
Authors:Junkun Qi, Wei Hu, Zongming Guo
[31] Title: EANet: Enhancing Alignment for Cross-Domain Person Re-identification
Authors:Houjing Huang, Wenjie Yang, Xiaotang Chen, Xin Zhao, Kaiqi Huang, Jinbin Lin, Guan Huang, Dalong Du
[32] Title: Rendu basé image avec contraintes sur les gradients
Authors:Grégoire Nieto (LJK), Frédéric Devernay (PRIMA), James Crowley (PERVASIVE)
[33] Title: Skeleton Transformer Networks: 3D Human Pose and Skinned Mesh from Single RGB Image
Authors:Yusuke Yoshiyasu, Ryusuke Sagawa, Ko Ayusawa, Akihiko Murai
[34] Title: A Deep Learning based Framework to Detect and Recognize Humans using Contactless Palmprints in the Wild
Authors:Yang Liu, Ajay Kumar
[35] Title: Support Vector Guided Softmax Loss for Face Recognition
Authors:Xiaobo Wang, Shuo Wang, Shifeng Zhang, Tianyu Fu, Hailin Shi, Tao Mei
[36] Title: Fast and Globally Optimal Rigid Registration of 3D Point Sets by Transformation Decomposition
Authors:Xuechen Li, Yinlong Liu, Yiru Wang, Chen Wang, Manning Wang, Zhijian Song
[37] Title: Annotation-cost Minimization for Medical Image Segmentation using Suggestive Mixed Supervision Fully Convolutional Networks
Authors:Yash Bhalgat, Meet Shah, Suyash Awate
[38] Title: Monocular 3D Pose Recovery via Nonconvex Sparsity with Theoretical Analysis
Authors:Jianqiao Wangni, Dahua Lin, Ji Liu, Kostas Daniilidis, Jianbo Shi
[39] Title: CamLoc: Pedestrian Location Detection from Pose Estimation on Resource-constrained Smart-cameras
Authors:Adrian Cosma, Ion Emilian Radoi, Valentin Radu
[40] Title: CFA Bayer image sequence denoising and demosaicking chain
Authors:Antoni Buades, Joan Duran
[41] Title: Class-Aware Adversarial Lung Nodule Synthesis in CT Images
Authors:Jie Yang, Siqi Liu, Sasa Grbic, Arnaud Arindra Adiyoso Setio, Zhoubing Xu, Eli Gibson, Guillaume Chabin, Bogdan Georgescu, Andrew F. Laine, Dorin Comaniciu
[42] Title: Epipolar Geometry based Learning of Multi-view Depth and Ego-Motion from Monocular Sequences
Authors:Vignesh Prasad, Dipanjan Das, Brojeshwar Bhowmick
[43] Title: Towards a topological-geometrical theory of group equivariant non-expansive operators for data analysis and machine learning
Authors:Mattia G. Bergomi, Patrizio Frosini, Daniela Giorgi, Nicola Quercioli
[44] Title: An introduction to domain adaptation and transfer learning
Authors:Wouter M. Kouw
[45] Title: BNN+: Improved Binary Network Training
Authors:Sajad Darabi, Mouloud Belbahri, Matthieu Courbariaux, Vahid Partovi Nia
[46] Title: Cluster-Based Active Learning
Authors:Fábio Perez, Rémi Lebret, Karl Aberer
[47] Title: Deep Residual Learning in the JPEG Transform Domain
Authors:Max Ehrlich, Larry Davis
[48] Title: ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Authors:Ao Ren, Tianyun Zhang, Shaokai Ye, Jiayu Li, Wenyao Xu, Xuehai Qian, Xue Lin, Yanzhi Wang
[49] Title: Machine learning in resting-state fMRI analysis
Authors:Meenakshi Khosla, Keith Jamison, Gia H. Ngo, Amy Kuceyeski, Mert R. Sabuncu
[50] Title: Quantized Guided Pruning for Efficient Hardware Implementations of Convolutional Neural Networks
Authors:Ghouthi Boukli Hacene (ELEC), Vincent Gripon, Matthieu Arzel (ELEC), Nicolas Farrugia (ELEC), Yoshua Bengio (DIRO)
[51] Title: 3D Convolution on RGB-D Point Clouds for Accurate Model-free Object Pose Estimation
Authors:Zhongang Cai, Cunjun Yu, Quang-Cuong Pham
[52] Title: Kymatio: Scattering Transforms in Python
Authors:Mathieu Andreux, Tomás Angles, Georgios Exarchakis, Roberto Leonarduzzi, Gaspar Rochette, Louis Thiry, John Zarka, Stéphane Mallat, Joakim andén, Eugene Belilovsky, Joan Bruna, Vincent Lostanlen, Matthew J. Hirn, Edouard Oyallon, Sixhin Zhang, Carmine Cella, Michael Eickenberg
Papers from arxiv.org
更多精彩请移步主页