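Preface
Amusi previously compiled PDF download resources for all 1,467 CVPR 2020 papers; see the post "They're all here!"
This is a collection of open-source projects for CVPR 2020 papers. Issues sharing CVPR 2020 open-source projects are welcome.
For top CV conference papers from previous years (e.g., CVPR 2019, ICCV 2019, ECCV 2018), other high-quality CV papers, and roundups, see: https://github.com/amusi/daily-paper-computer-vision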
Exploring Self-attention for Image Recognition
Paper: https://hszhao.github.io/papers/cvpr20_san.pdf
Code: https://github.com/hszhao/SAN
Improving Convolutional Networks with Self-Calibrated Convolutions
Homepage: https://mmcheng.net/scconv/
Paper: http://mftp.mmcheng.net/Papers/20cvprSCNet.pdf
Code: https://github.com/backseason/SCNet
Rethinking Depthwise Separable Convolutions: How Intra-Kernel Correlations Lead to Improved MobileNets
Compositional Convolutional Neural Networks: A Deep Architecture with Innate Robustness to Partial Occlusion
Paper: https://arxiv.org/abs/2003.04490
Code: https://github.com/AdamKortylewski/CompositionalNets
Spatially Attentive Output Layer for Image Classification
Paper: https://arxiv.org/abs/2004.07570
Code (appears to have been deleted by the original author): https://github.com/ildoonet/spatially-attentive-output-layer
Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax
AugFPN: Improving Multi-scale Feature Learning for Object Detection
Noise-Aware Fully Webly Supervised Object Detection
Learning a Unified Sample Weighting Network for Object Detection
D2Det: Towards High Quality Object Detection and Instance Segmentation
Paper: http://openaccess.thecvf.com/content_CVPR_2020/papers/Cao_D2Det_Towards_High_Quality_Object_Detection_and_Instance_Segmentation_CVPR_2020_paper.pdf
Code: https://github.com/JialeCao001/D2Det
Dynamic Refinement Network for Oriented and Densely Packed Object Detection
Paper: https://arxiv.org/abs/2005.09973
Code and dataset: https://github.com/Anymake/DRN_CVPR2020
Scale-Equalizing Pyramid Convolution for Object Detection
Paper: https://arxiv.org/abs/2005.03101
Code: https://github.com/jshilong/SEPC
Revisiting the Sibling Head in Object Detector
Paper: https://arxiv.org/abs/2003.07540
Code: https://github.com/Sense-X/TSD
Detection in Crowded Scenes: One Proposal, Multiple Predictions
Instance-aware, Context-focused, and Memory-efficient Weakly Supervised Object Detection
Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection
BiDet: An Efficient Binarized Object Detector
Harmonizing Transferability and Discriminability for Adapting Object Detectors
CentripetalNet: Pursuing High-quality Keypoint Pairs for Object Detection
Hit-Detector: Hierarchical Trinity Architecture Search for Object Detection
EfficientDet: Scalable and Efficient Object Detection
SESS: Self-Ensembling Semi-Supervised 3D Object Detection
Paper: https://arxiv.org/abs/1912.11803
Code: https://github.com/Na-Z/sess
Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud Object Detection
Paper: https://arxiv.org/abs/2006.04356
Code: https://github.com/dleam/Associate-3Ddet
What You See is What You Get: Exploiting Visibility for 3D Object Detection
Homepage: https://www.cs.cmu.edu/~peiyunh/wysiwyg/
Paper: https://arxiv.org/abs/1912.04986
Code: https://github.com/peiyunh/wysiwyg
Learning Depth-Guided Convolutions for Monocular 3D Object Detection
Structure Aware Single-stage 3D Object Detection from Point Cloud
Paper: http://openaccess.thecvf.com/content_CVPR_2020/html/He_Structure_Aware_Single-Stage_3D_Object_Detection_From_Point_Cloud_CVPR_2020_paper.html
Code: https://github.com/skyhehe123/SA-SSD
IDA-3D: Instance-Depth-Aware 3D Object Detection from Stereo Vision for Autonomous Driving
Paper: http://openaccess.thecvf.com/content_CVPR_2020/papers/Peng_IDA-3D_Instance-Depth-Aware_3D_Object_Detection_From_Stereo_Vision_for_Autonomous_CVPR_2020_paper.pdf
Code: https://github.com/swords123/IDA-3D
Train in Germany, Test in The USA: Making 3D Object Detectors Generalize
Paper: https://arxiv.org/abs/2005.08139
Code: https://github.com/cxy1997/3D_adapt_auto_driving
MLCVNet: Multi-Level Context VoteNet for 3D Object Detection
3DSSD: Point-based 3D Single Stage Object Detector
CVPR 2020 Oral
Paper: https://arxiv.org/abs/2002.10187
Code: https://github.com/tomztyang/3DSSD
Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation
Paper: https://arxiv.org/abs/2004.03572
Code: https://github.com/zju3dv/disprcn
End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection
Paper: https://arxiv.org/abs/2004.03080
Code: https://github.com/mileyan/pseudo-LiDAR_e2e
DSGN: Deep Stereo Geometry Network for 3D Object Detection
LiDAR-based Online 3D Video Object Detection with Graph-based Message Passing and Spatiotemporal Transformer Attention
PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
Paper: https://arxiv.org/abs/1912.13192
Code: https://github.com/sshaoshuai/PV-RCNN
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
Memory Enhanced Global-Local Aggregation for Video Object Detection
Paper: https://arxiv.org/abs/2003.12063
Code: https://github.com/Scalsol/mega.pytorch
SiamCAR: Siamese Fully Convolutional Classification and Regression for Visual Tracking
D3S – A Discriminative Single Shot Segmentation Tracker
ROAM: Recurrently Optimizing Tracking Model
Paper: https://arxiv.org/abs/1907.12006
Code: https://github.com/skyoung/ROAM
Siam R-CNN: Visual Tracking by Re-Detection
Cooling-Shrinking Attack: Blinding the Tracker with Imperceptible Noises
High-Performance Long-Term Tracking with Meta-Updater
Paper: https://arxiv.org/abs/2004.00305
Code: https://github.com/Daikenan/LTMU
AutoTrack: Towards High-Performance Visual Tracking for UAV with Automatic Spatio-Temporal Regularization
Paper: https://arxiv.org/abs/2003.12949
Code: https://github.com/vision4robotics/AutoTrack
Probabilistic Regression for Visual Tracking
MAST: A Memory-Augmented Self-supervised Tracker
Siamese Box Adaptive Network for Visual Tracking
3D-ZeF: A 3D Zebrafish Tracking Benchmark Dataset
Super-BPD: Super Boundary-to-Pixel Direction for Fast Image Segmentation
Paper: not yet available
Code: https://github.com/JianqiangWan/Super-BPD
Single-Stage Semantic Segmentation from Image Labels
Paper: https://arxiv.org/abs/2005.08104
Code: https://github.com/visinf/1-stage-wseg
Learning Texture Invariant Representation for Domain Adaptation of Semantic Segmentation
MSeg: A Composite Dataset for Multi-domain Semantic Segmentation
CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement
Unsupervised Intra-domain Adaptation for Semantic Segmentation through Self-Supervision
Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation
Temporally Distributed Networks for Fast Video Segmentation
Paper: https://arxiv.org/abs/2004.01800
Code: https://github.com/feinanshan/TDNet
Context Prior for Scene Segmentation
Paper: https://arxiv.org/abs/2004.01547
Code: https://git.io/ContextPrior
Strip Pooling: Rethinking Spatial Pooling for Scene Parsing
Paper: https://arxiv.org/abs/2003.13328
Code: https://github.com/Andrew-Qibin/SPNet
Cars Can’t Fly up in the Sky: Improving Urban-Scene Segmentation via Height-driven Attention Networks
Learning Dynamic Routing for Semantic Segmentation
Paper: https://arxiv.org/abs/2003.10401
Code: https://github.com/yanwei-li/DynamicRouting
D2Det: Towards High Quality Object Detection and Instance Segmentation
Paper: http://openaccess.thecvf.com/content_CVPR_2020/papers/Cao_D2Det_Towards_High_Quality_Object_Detection_and_Instance_Segmentation_CVPR_2020_paper.pdf
Code: https://github.com/JialeCao001/D2Det
PolarMask: Single Shot Instance Segmentation with Polar Representation
CenterMask : Real-Time Anchor-Free Instance Segmentation
BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation
Deep Snake for Real-Time Instance Segmentation
Mask Encoding for Single Shot Instance Segmentation
Paper: https://arxiv.org/abs/2003.11712
Code: https://github.com/aim-uofa/AdelaiDet
Pixel Consensus Voting for Panoptic Segmentation
BANet: Bidirectional Aggregation Network with Occlusion Handling for Panoptic Segmentation
Paper: https://arxiv.org/abs/2003.14031
Code: https://github.com/Mooonside/BANet
A Transductive Approach for Video Object Segmentation
Paper: https://arxiv.org/abs/2004.07193
Code: https://github.com/microsoft/transductive-vos.pytorch
State-Aware Tracker for Real-Time Video Object Segmentation
Paper: https://arxiv.org/abs/2003.00482
Code: https://github.com/MegviiDetection/video_analyst
Learning Fast and Robust Target Models for Video Object Segmentation
Learning Video Object Segmentation from Unlabeled Videos
Superpixel Segmentation with Fully Convolutional Networks
AOWS: Adaptive and optimal network width search with latency constraints
Densely Connected Search Space for More Flexible Neural Architecture Search
Paper: https://arxiv.org/abs/1906.09607
Code: https://github.com/JaminFong/DenseNAS
MTL-NAS: Task-Agnostic Neural Architecture Search towards General-Purpose Multi-Task Learning
Paper: https://arxiv.org/abs/2003.14058
Code: https://github.com/bhpfelix/MTLNAS
FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions
Paper: https://arxiv.org/abs/2004.05565
Code: https://github.com/facebookresearch/mobile-vision
Neural Architecture Search for Lightweight Non-Local Networks
Rethinking Performance Estimation in Neural Architecture Search
CARS: Continuous Evolution for Efficient Neural Architecture Search
Distribution-induced Bidirectional Generative Adversarial Network for Graph Representation Learning
PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer
Semantically Multi-modal Image Synthesis
Unpaired Portrait Drawing Generation via Asymmetric Cycle Mapping
Learning to Cartoonize Using White-box Cartoon Representations
Paper: https://github.com/SystemErrorWang/White-box-Cartoonization/blob/master/paper/06791.pdf
Homepage: https://systemerrorwang.github.io/White-box-Cartoonization/
Code: https://github.com/SystemErrorWang/White-box-Cartoonization
Explainer: https://zhuanlan.zhihu.com/p/117422157
Demo video: https://www.bilibili.com/video/av56708333
GAN Compression: Efficient Architectures for Interactive Conditional GANs
Paper: https://arxiv.org/abs/2003.08936
Code: https://github.com/mit-han-lab/gan-compression
Watch your Up-Convolution: CNN Based Generative Deep Neural Networks are Failing to Reproduce Spectral Distributions
COCAS: A Large-Scale Clothes Changing Person Dataset for Re-identification
Paper: https://arxiv.org/abs/2005.07862
Dataset: not yet available
Transferable, Controllable, and Inconspicuous Adversarial Attacks on Person Re-identification With Deep Mis-Ranking
Paper: https://arxiv.org/abs/2004.04199
Code: https://github.com/whj363636/Adversarial-attack-on-Person-ReID-With-Deep-Mis-Ranking
Pose-guided Visible Part Matching for Occluded Person ReID
Weakly supervised discriminative feature learning with state information for person identification
PointASNL: Robust Point Clouds Processing using Nonlocal Neural Networks with Adaptive Sampling
Global-Local Bidirectional Reasoning for Unsupervised Representation Learning of 3D Point Clouds
Paper: https://arxiv.org/abs/2003.12971
Code: https://github.com/raoyongming/PointGLR
Grid-GCN for Fast and Scalable Point Cloud Learning
Paper: https://arxiv.org/abs/1912.02984
Code: https://github.com/Xharlie/Grid-GCN
FPConv: Learning Local Flattening for Point Convolution
PointAugment: an Auto-Augmentation Framework for Point Cloud Classification
RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds
Paper: https://arxiv.org/abs/1911.11236
Code: https://github.com/QingyongHu/RandLA-Net
Explainer: https://zhuanlan.zhihu.com/p/105433460
Weakly Supervised Semantic Point Cloud Segmentation: Towards 10X Fewer Labels
Paper: https://arxiv.org/abs/2004.0409
Code: https://github.com/alex-xun-xu/WeakSupPointCloudSeg
PolarNet: An Improved Grid Representation for Online LiDAR Point Clouds Semantic Segmentation
Learning to Segment 3D Point Clouds in 2D Image Space
Paper: https://arxiv.org/abs/2003.05593
Code: https://github.com/WPI-VISLab/Learning-to-Segment-3D-Point-Clouds-in-2D-Image-Space
PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation
D3Feat: Joint Learning of Dense Detection and Description of 3D Local Features
RPM-Net: Robust Point Matching using Learned Features
Cascaded Refinement Network for Point Cloud Completion
P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds
An Efficient PointLSTM for Point Clouds Based Gesture Recognition
CurricularFace: Adaptive Curriculum Learning Loss for Deep Face Recognition
Paper: https://arxiv.org/abs/2004.00288
Code: https://github.com/HuangYG123/CurricularFace
Learning Meta Face Recognition in Unseen Domains
Searching Central Difference Convolutional Networks for Face Anti-Spoofing
Paper: https://arxiv.org/abs/2003.04092
Code: https://github.com/ZitongYu/CDCN
Suppressing Uncertainties for Large-Scale Facial Expression Recognition
Paper: https://arxiv.org/abs/2002.10392
Code (to be released soon): https://github.com/kaiwang960112/Self-Cure-Network
Rotate-and-Render: Unsupervised Photorealistic Face Rotation from Single-View Images
AvatarMe: Realistically Renderable 3D Facial Reconstruction "in-the-wild"
FaceScape: a Large-scale High Quality 3D Face Dataset and Detailed Riggable 3D Face Prediction
HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation
The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation
Distribution-Aware Coordinate Representation for Human Pose Estimation
Homepage: https://ilovepose.github.io/coco/
Paper: https://arxiv.org/abs/1910.06278
Code: https://github.com/ilovepose/DarkPose
Fusing Wearable IMUs with Multi-View Images for Human Pose Estimation: A Geometric Approach
Homepage: https://www.zhe-zhang.com/cvpr2020
Paper: https://arxiv.org/abs/2003.11163
Code: https://github.com/CHUNYUWANG/imu-human-pose-pytorch
Bodies at Rest: 3D Human Pose and Shape Estimation from a Pressure Image using Synthetic Data
Paper: https://arxiv.org/abs/2004.01166
Code: https://github.com/Healthcare-Robotics/bodies-at-rest
Dataset: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/KOA4ML
Self-Supervised 3D Human Pose Estimation via Part Guided Novel Image Synthesis
Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation
VIBE: Video Inference for Human Body Pose and Shape Estimation
Back to the Future: Joint Aware Temporal Deep Learning 3D Human Pose Estimation
Cross-View Tracking for Multi-Human 3D Pose Estimation at over 100 FPS
Correlating Edge, Pose with Parsing
Paper: https://arxiv.org/abs/2005.01431
Code: https://github.com/ziwei-zh/CorrPM
ContourNet: Taking a Further Step Toward Accurate Arbitrary-Shaped Scene Text Detection
UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World
ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network
Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection
Paper: https://arxiv.org/abs/2003.07493
Code: https://github.com/GXYM/DRRG
SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition
UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World
ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network
Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition
Paper: https://arxiv.org/abs/2003.06606
Code: https://github.com/Canjie-Luo/Text-Image-Augmentation
SuperGlue: Learning Feature Matching with Graph Neural Networks
Closed-Loop Matters: Dual Regression Networks for Single Image Super-Resolution
Learning Texture Transformer Network for Image Super-Resolution
Paper: https://arxiv.org/abs/2006.04139
Code: https://github.com/FuzhiYang/TTSR
Image Super-Resolution with Cross-Scale Non-Local Attention and Exhaustive Self-Exemplars Mining
Structure-Preserving Super Resolution with Gradient Guidance
Paper: https://arxiv.org/abs/2003.13081
Code: https://github.com/Maclory/SPSR
Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Analysis and a New Strategy
Paper: https://arxiv.org/abs/2004.00448
Code: https://github.com/clovaai/cutblur
TDAN: Temporally-Deformable Alignment Network for Video Super-Resolution
Space-Time-Aware Multi-Resolution Video Enhancement
Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video Super-Resolution
DMCP: Differentiable Markov Channel Pruning for Neural Networks
Forward and Backward Information Retention for Accurate Binary Neural Networks
Paper: https://arxiv.org/abs/1909.10788
Code: https://github.com/htqin/IR-Net
Towards Efficient Model Compression via Learned Global Ranking
HRank: Filter Pruning using High-Rank Feature Map
GAN Compression: Efficient Architectures for Interactive Conditional GANs
Paper: https://arxiv.org/abs/2003.08936
Code: https://github.com/mit-han-lab/gan-compression
Group Sparsity: The Hinge Between Filter Pruning and Decomposition for Network Compression
Paper: https://arxiv.org/abs/2003.08935
Code: https://github.com/ofsoundof/group_sparsity
Oops! Predicting Unintentional Action in Video
Homepage: https://oops.cs.columbia.edu/
Paper: https://arxiv.org/abs/1911.11206
Code: https://github.com/cvlab-columbia/oops
Dataset: https://oops.cs.columbia.edu/data
PREDICT & CLUSTER: Unsupervised Skeleton Based Action Recognition
Intra- and Inter-Action Understanding via Temporal Action Parsing
3DV: 3D Dynamic Voxel for Action Recognition in Depth Video
FineGym: A Hierarchical Video Dataset for Fine-grained Action Understanding
TEA: Temporal Excitation and Aggregation for Action Recognition
Paper: https://arxiv.org/abs/2004.01398
Code: https://github.com/Phoenix1327/tea-action-recognition
X3D: Expanding Architectures for Efficient Video Recognition
Paper: https://arxiv.org/abs/2004.04730
Code: https://github.com/facebookresearch/SlowFast
Temporal Pyramid Network for Action Recognition
Homepage: https://decisionforce.github.io/TPN
Paper: https://arxiv.org/abs/2004.03548
Code: https://github.com/decisionforce/TPN
Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition
BiFuse: Monocular 360° Depth Estimation via Bi-Projection Fusion
Focus on defocus: bridging the synthetic to real domain gap for depth estimation
Bi3D: Stereo Depth Estimation via Binary Classifications
Paper: https://arxiv.org/abs/2005.07274
Code: https://github.com/NVlabs/Bi3D
AANet: Adaptive Aggregation Network for Efficient Stereo Matching
Towards Better Generalization: Joint Depth-Pose Learning without PoseNet
Paper: https://github.com/B1ueber2y/TrianFlow
Code: https://github.com/B1ueber2y/TrianFlow
On the uncertainty of self-supervised monocular depth estimation
3D Packing for Self-Supervised Monocular Depth Estimation
Domain Decluttering: Simplifying Images to Mitigate Synthetic-Real Domain Shift and Improve Depth Estimation
MoreFusion: Multi-object Reasoning for 6D Pose Estimation from Volumetric Fusion
EPOS: Estimating 6D Pose of Objects with Symmetries
Homepage: http://cmp.felk.cvut.cz/epos
Paper: https://arxiv.org/abs/2004.00605
G2L-Net: Global to Local Network for Real-time 6D Pose Estimation with Embedding Vector Features
Paper: https://arxiv.org/abs/2003.11089
Code: https://github.com/DC1991/G2L_Net
HOPE-Net: A Graph-based Model for Hand-Object Pose Estimation
Paper: https://arxiv.org/abs/2004.00060
Homepage: http://vision.sice.indiana.edu/projects/hopenet
Monocular Real-time Hand Shape and Motion Capture using Multi-modal Data
Paper: https://arxiv.org/abs/2003.09572
Code: https://github.com/CalciferZh/minimal-hand
JL-DCF: Joint Learning and Densely-Cooperative Fusion Framework for RGB-D Salient Object Detection
Paper: https://arxiv.org/abs/2004.08515
Code: https://github.com/kerenfu/JLDCF/
UC-Net: Uncertainty Inspired RGB-D Saliency Detection via Conditional Variational Autoencoders
Homepage: http://dpfan.net/d3netbenchmark/
Paper: https://arxiv.org/abs/2004.05763
Code: https://github.com/JingZhang617/UCNet
A Physics-based Noise Formation Model for Extreme Low-light Raw Denoising
Paper: https://arxiv.org/abs/2003.12751
Code: https://github.com/Vandermode/NoiseModel
CycleISP: Real Image Restoration via Improved Data Synthesis
Paper: https://arxiv.org/abs/2003.07761
Code: https://github.com/swz30/CycleISP
Multi-Scale Progressive Fusion Network for Single Image Deraining
Paper: https://arxiv.org/abs/2003.10985
Code: https://github.com/kuihua/MSPFN
Cascaded Deep Video Deblurring Using Temporal Sharpness Prior
Multi-Scale Boosted Dehazing Network with Dense Feature Fusion
Paper: https://arxiv.org/abs/2004.13388
Code: https://github.com/BookerDeWitt/MSBDN-DFF
ASLFeat: Learning Local Features of Accurate Shape and Localization
Paper: https://arxiv.org/abs/2003.10071
Code: https://github.com/lzx551402/aslfeat
VC R-CNN: Visual Commonsense R-CNN
Hierarchical Conditional Relation Networks for Video Question Answering
Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-training
Learning for Video Compression with Hierarchical Quality and Recurrent Enhancement
FeatureFlow: Robust Video Interpolation via Structure-to-Texture Generation
Paper: http://openaccess.thecvf.com/content_CVPR_2020/html/Gui_FeatureFlow_Robust_Video_Interpolation_via_Structure-to-Texture_Generation_CVPR_2020_paper.html
Code: https://github.com/CM-BF/FeatureFlow
Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video Super-Resolution
Space-Time-Aware Multi-Resolution Video Enhancement
Scene-Adaptive Video Frame Interpolation via Meta-Learning
Softmax Splatting for Video Frame Interpolation
Diversified Arbitrary Style Transfer via Deep Feature Perturbation
Collaborative Distillation for Ultra-Resolution Universal Style Transfer
Paper: https://arxiv.org/abs/2003.08436
Code: https://github.com/mingsun-tse/collaborative-distillation
Inter-Region Affinity Distillation for Road Marking Segmentation
PPDM: Parallel Point Detection and Matching for Real-time Human-Object Interaction Detection
Detailed 2D-3D Joint Representation for Human-Object Interaction
Paper: https://arxiv.org/abs/2004.08154
Code: https://github.com/DirtyHarryLYL/DJ-RN
Cascaded Human-Object Interaction Recognition
Paper: https://arxiv.org/abs/2003.04262
Code: https://github.com/tfzhou/C-HOI
VSGNet: Spatial Attention Network for Detecting Human Object Interactions Using Graph Convolutions
The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction
Social-STGCNN: A Social Spatio-Temporal Graph Convolutional Neural Network for Human Trajectory Prediction
Collaborative Motion Prediction via Neural Motion Message Passing
MotionNet: Joint Perception and Motion Prediction for Autonomous Driving Based on Bird’s Eye View Maps
Paper: https://arxiv.org/abs/2003.06754
Code: https://github.com/pxiangwu/MotionNet
Learning by Analogy: Reliable Supervision from Transformations for Unsupervised Optical Flow Estimation
Evade Deep Image Retrieval by Stashing Private Images in the Hash Space
Towards Photo-Realistic Virtual Try-On by Adaptively Generating↔Preserving Image Content
Single-Image HDR Reconstruction by Learning to Reverse the Camera Pipeline
Homepage: https://www.cmlab.csie.ntu.edu.tw/~yulunliu/SingleHDR
Paper: https://www.cmlab.csie.ntu.edu.tw/~yulunliu/SingleHDR_/00942.pdf
Code: https://github.com/alex04072000/SingleHDR
Towards Large yet Imperceptible Adversarial Image Perturbations with Perceptual Color Distance
Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild
Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization
Homepage: https://shunsukesaito.github.io/PIFuHD/
Paper: https://arxiv.org/abs/2004.00452
Code: https://github.com/facebookresearch/pifuhd
TailorNet: Predicting Clothing in 3D as a Function of Human Pose, Shape and Garment Style
Paper: http://openaccess.thecvf.com/content_CVPR_2020/papers/Patel_TailorNet_Predicting_Clothing_in_3D_as_a_Function_of_Human_CVPR_2020_paper.pdf
Code: https://github.com/chaitanya100100/TailorNet
Dataset: https://github.com/zycliao/TailorNet_dataset
Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion
Paper: http://openaccess.thecvf.com/content_CVPR_2020/papers/Chibane_Implicit_Functions_in_Feature_Space_for_3D_Shape_Reconstruction_and_CVPR_2020_paper.pdf
Code: https://github.com/jchibane/if-net
Learning to Transfer Texture from Clothing Images to 3D Humans
Paper: http://openaccess.thecvf.com/content_CVPR_2020/papers/Mir_Learning_to_Transfer_Texture_From_Clothing_Images_to_3D_Humans_CVPR_2020_paper.pdf
Code: https://github.com/aymenmir1/pix2surf
Uncertainty-Aware CNNs for Depth Completion: Uncertainty from Beginning to End
Paper: https://arxiv.org/abs/2006.03349
Code: https://github.com/abdo-eldesokey/pncnn
3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure Prior
Syntax-Aware Action Targeting for Video Captioning
Holistically-Attracted Wireframe Parser
Paper: http://openaccess.thecvf.com/content_CVPR_2020/html/Xue_Holistically-Attracted_Wireframe_Parsing_CVPR_2020_paper.html
Code: https://github.com/cherubicXN/hawp
3D-ZeF: A 3D Zebrafish Tracking Benchmark Dataset
TailorNet: Predicting Clothing in 3D as a Function of Human Pose, Shape and Garment Style
Oops! Predicting Unintentional Action in Video
Homepage: https://oops.cs.columbia.edu/
Paper: https://arxiv.org/abs/1911.11206
Code: https://github.com/cvlab-columbia/oops
Dataset: https://oops.cs.columbia.edu/data
The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction
Open Compound Domain Adaptation
Intra- and Inter-Action Understanding via Temporal Action Parsing
Dynamic Refinement Network for Oriented and Densely Packed Object Detection
Paper: https://arxiv.org/abs/2005.09973
Code and dataset: https://github.com/Anymake/DRN_CVPR2020
COCAS: A Large-Scale Clothes Changing Person Dataset for Re-identification
Paper: https://arxiv.org/abs/2005.07862
Dataset: not yet available
KeypointNet: A Large-scale 3D Keypoint Dataset Aggregated from Numerous Human Annotations
Paper: https://arxiv.org/abs/2002.12687
Dataset: https://github.com/qq456cvb/KeypointNet
MSeg: A Composite Dataset for Multi-domain Semantic Segmentation
AvatarMe: Realistically Renderable 3D Facial Reconstruction "in-the-wild"
Learning to Autofocus
FaceScape: a Large-scale High Quality 3D Face Dataset and Detailed Riggable 3D Face Prediction
Bodies at Rest: 3D Human Pose and Shape Estimation from a Pressure Image using Synthetic Data
Paper: https://arxiv.org/abs/2004.01166
Code: https://github.com/Healthcare-Robotics/bodies-at-rest
Dataset: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/KOA4ML
FineGym: A Hierarchical Video Dataset for Fine-grained Action Understanding
A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
Homepage: https://anyirao.com/projects/SceneSeg.html
Paper: https://arxiv.org/abs/2004.02678
Code: https://github.com/AnyiRao/SceneSeg
Deep Homography Estimation for Dynamic Scenes
Paper: https://arxiv.org/abs/2004.02132
Dataset: https://github.com/lcmhoang/hmg-dynamics
Assessing Image Quality Issues for Real-World Problems
UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World
PANDA: A Gigapixel-level Human-centric Video Dataset
Paper: https://arxiv.org/abs/2003.04852
Dataset: http://www.panda-dataset.com/
IntrA: 3D Intracranial Aneurysm Dataset for Deep Learning
Cross-View Tracking for Multi-Human 3D Pose Estimation at over 100 FPS
CONSAC: Robust Multi-Model Fitting by Conditional Sample Consensus
Learning to Learn Single Domain Generalization
Open Compound Domain Adaptation
Differentiable Volumetric Rendering: Learning Implicit 3D Representations without 3D Supervision
Paper: http://www.cvlibs.net/publications/Niemeyer2020CVPR.pdf
Code: https://github.com/autonomousvision/differentiable_volumetric_rendering
QEBA: Query-Efficient Boundary-Based Blackbox Attack
Equalization Loss for Long-Tailed Object Recognition
Instance-aware Image Colorization
Contextual Residual Aggregation for Ultra High-Resolution Image Inpainting
Paper: https://arxiv.org/abs/2005.09704
Code: https://github.com/Atlas200dk/sample-imageinpainting-HiFill
Where am I looking at? Joint Location and Orientation Estimation by Cross-View Matching
Epipolar Transformers
Paper: https://arxiv.org/abs/2005.04551
Code: https://github.com/yihui-he/epipolar-transformers
Bringing Old Photos Back to Life
MaskFlownet: Asymmetric Feature Matching with Learnable Occlusion Mask
Paper: https://arxiv.org/abs/2003.10955
Code: https://github.com/microsoft/MaskFlownet
Self-Supervised Viewpoint Learning from Image Collections
Towards Discriminability and Diversity: Batch Nuclear-norm Maximization under Label Insufficient Situations
Oral
Paper: https://arxiv.org/abs/2003.12237
Code: https://github.com/cuishuhao/BNM
Towards Learning Structure via Consensus for Face Segmentation and Parsing
Plug-and-Play Algorithms for Large-scale Snapshot Compressive Imaging
Oral
Paper: https://arxiv.org/abs/2003.13654
Code: https://github.com/liuyang12/PnP-SCI
Lightweight Photometric Stereo for Facial Details Recovery
Footprints and Free Space from a Single Color Image
Paper: https://arxiv.org/abs/2004.06376
Code: https://github.com/nianticlabs/footprints
Self-Supervised Monocular Scene Flow Estimation
Quasi-Newton Solver for Robust Non-Rigid Registration
A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
Homepage: https://anyirao.com/projects/SceneSeg.html
Paper: https://arxiv.org/abs/2004.02678
Code: https://github.com/AnyiRao/SceneSeg
DeepFLASH: An Efficient Network for Learning-based Medical Image Registration
Paper: https://arxiv.org/abs/2004.02097
Code: https://github.com/jw4hv/deepflash
Self-Supervised Scene De-occlusion
Polarized Reflection Removal with Perfect Alignment in the Wild
Background Matting: The World is Your Green Screen
What Deep CNNs Benefit from Global Covariance Pooling: An Optimization Perspective
Paper: https://arxiv.org/abs/2003.11241
Code: https://github.com/ZhangLi-CS/GCP_Optimization
Look-into-Object: Self-supervised Structure Modeling for Object Recognition
Video Object Grounding using Semantic Roles in Language Description
Dynamic Hierarchical Mimicking Towards Consistent Optimization Objectives
SDFDiff: Differentiable Rendering of Signed Distance Fields for 3D Shape Optimization
On Translation Invariance in CNNs: Convolutional Layers can Exploit Absolute Spatial Location
Paper: https://arxiv.org/abs/2003.07064
Code: https://github.com/oskyhn/CNNs-Without-Borders
GhostNet: More Features from Cheap Operations
Paper: https://arxiv.org/abs/1911.11907
Code: https://github.com/iamhankai/ghostnet
AdderNet: Do We Really Need Multiplications in Deep Learning?
Deep Image Harmonization via Domain Verification
Blurry Video Frame Interpolation
Extremely Dense Point Correspondences using a Learned Feature Descriptor
Filter Grafting for Deep Neural Networks
Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation
Detecting Attended Visual Targets in Video
Paper: https://arxiv.org/abs/2003.02501
Code: https://github.com/ejcgt/attention-target-detection
Deep Image Spatial Transformation for Person Image Generation
Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications
https://github.com/charlesCXK/3D-SketchAware-SSC
https://github.com/Anonymous20192020/Anonymous_CVPR5767
https://github.com/avirambh/ScopeFlow
https://github.com/csbhr/CDVD-TSP
https://github.com/ymcidence/TBH
https://github.com/yaoyao-liu/mnemonics
https://github.com/meder411/Tangent-Images
https://github.com/KaihuaTang/Scene-Graph-Benchmark.pytorch
https://github.com/sjmoran/deep_local_parametric_filters
https://github.com/bermanmaxim/AOWS
https://github.com/dc3ea9f/look-into-object
FADNet: A Fast and Accurate Network for Disparity Estimation
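https://github.com/rFID-submit/RandomFID (not sure whether it was accepted)
https://github.com/JackSyu/AE-MSR (not sure whether it was accepted)
https://github.com/fastconvnets/cvpr2020 (not sure whether it was accepted)
https://github.com/aimagelab/meshed-memory-transformer (not sure whether it was accepted)
https://github.com/TWSFar/CRGNet (not sure whether it was accepted)
https://github.com/CVPR-2020/CDARTS (not sure whether it was accepted)
https://github.com/anucvml/ddn-cvprw2020 (not sure whether it was accepted)
https://github.com/dl-model-recommend/model-trust (not sure whether it was accepted)
https://github.com/apratimbhattacharyya18/CVPR-2020-Corr-Prior (not sure whether it was accepted)
https://github.com/onetcvpr/O-Net (not sure whether it was accepted)
https://github.com/502463708/Microcalcification_Detection (not sure whether it was accepted)
https://github.com/anonymous-for-review/cvpr-2020-deep-smoke-machine (not sure whether it was accepted)
https://github.com/anonymous-for-review/cvpr-2020-smoke-recognition-dataset (not sure whether it was accepted)
https://github.com/cvpr-nonrigid/dataset (not sure whether it was accepted)
https://github.com/theFool32/PPBA (not sure whether it was accepted)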
https://github.com/Realtime-Action-Recognition/Realtime-Action-Recognition