gdengden

Object Detection(目标检测神文)

https://blog.csdn.net/hw5226349/article/details/81906882?utm_source=blogxgwz5

目标检测神文，非常全而且持续在更新。转发自：https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html，如有侵权联系删除。
更新时间：
20190109

我会跟进原作者博客持续更新，加入自己对目标检测领域的一些新研究及论文解读。博客根据需求直接进行关键字搜索，例如2018，可找到最新论文。

文章目录

Papers
- - Deep Neural Networks for Object Detection
  - OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks
- R-CNN
  - Rich feature hierarchies for accurate object detection and semantic segmentation
- Fast R-CNN
  - Fast R-CNN
  - A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection
- Faster R-CNN
  - Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
  - R-CNN minus R
  - Faster R-CNN in MXNet with distributed implementation and data parallelization
  - Contextual Priming and Feedback for Faster R-CNN
  - An Implementation of Faster RCNN with Study for Region Sampling
  - Interpretable R-CNN
- Light-Head R-CNN
  - Light-Head R-CNN: In Defense of Two-Stage Object Detector
  - Cascade R-CNN: Delving into High Quality Object Detection
- MultiBox
  - Scalable Object Detection using Deep Neural Networks
  - Scalable, High-Quality Object Detection
- SPP-Net
  - Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
  - DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection
  - Object Detectors Emerge in Deep Scene CNNs
  - segDeepM: Exploiting Segmentation and Context in Deep Neural Networks for Object Detection
  - Object Detection Networks on Convolutional Feature Maps
  - Improving Object Detection with Deep Convolutional Networks via Bayesian Optimization and Structured Prediction
  - DeepBox: Learning Objectness with Convolutional Networks
- MR-CNN
  - Object detection via a multi-region & semantic segmentation-aware CNN model
- YOLO
  - You Only Look Once: Unified, Real-Time Object Detection
  - darkflow - translate darknet to tensorflow. Load trained weights, retrain/fine-tune them using tensorflow, export constant graph def to C++
  - Start Training YOLO with Our Own Data
  - YOLO: Core ML versus MPSNNGraph
  - TensorFlow YOLO object detection on Android
  - Computer Vision in iOS – Object Detection
- YOLOv2
  - YOLO9000: Better, Faster, Stronger
  - darknet_scripts
  - Yolo_mark: GUI for marking bounded boxes of objects in images for training Yolo v2
  - LightNet: Bringing pjreddie’s DarkNet out of the shadows
  - YOLO v2 Bounding Box Tool
- YOLOv3
  - YOLOv3: An Incremental Improvement
  - YOLO-LITE: A Real-Time Object Detection Algorithm Optimized for Non-GPU Computers
  - AttentionNet: Aggregating Weak Directions for Accurate Object Detection
- DenseBox
  - DenseBox: Unifying Landmark Localization with End to End Object Detection
- SSD
  - SSD: Single Shot MultiBox Detector
- DSSD
  - DSSD : Deconvolutional Single Shot Detector
  - Enhancement of SSD by concatenating feature maps for object detection
  - Context-aware Single-Shot Detector
  - Feature-Fused SSD: Fast Detection for Small Objects
- FSSD
  - FSSD: Feature Fusion Single Shot Multibox Detector
  - Weaving Multi-scale Context for Single Shot Detector
- ESSD
  - Extend the shallow part of Single Shot MultiBox Detector via Convolutional Neural Network
  - Tiny SSD: A Tiny Single-shot Detection Deep Convolutional Neural Network for Real-time Embedded Object Detection
  - MDSSD: Multi-scale Deconvolutional Single Shot Detector for small objects
- Inside-Outside Net (ION)
  - Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
  - Adaptive Object Detection Using Adjacency and Zoom Prediction
  - G-CNN: an Iterative Grid Based Object Detector
- Factors in Finetuning Deep Model for object detection
  - Factors in Finetuning Deep Model for Object Detection with Long-tail Distribution
  - We don’t need no bounding-boxes: Training object class detectors using only human verification
  - HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection
  - A MultiPath Network for Object Detection
- CRAFT
  - CRAFT Objects from Images
- OHEM
  - Training Region-based Object Detectors with Online Hard Example Mining
  - S-OHEM: Stratified Online Hard Example Mining for Object Detection
  - Exploit All the Layers: Fast and Accurate CNN Object Detector with Scale Dependent Pooling and Cascaded Rejection Classifiers
- R-FCN
  - R-FCN: Object Detection via Region-based Fully Convolutional Networks
  - R-FCN-3000 at 30fps: Decoupling Detection and Classification
  - Recycle deep features for better object detection
- MS-CNN
  - A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection
  - Multi-stage Object Detection with Group Recursive Learning
  - Subcategory-aware Convolutional Neural Networks for Object Proposals and Detection
- PVANET
  - PVANet: Lightweight Deep Neural Networks for Real-time Object Detection
- GBD-Net
  - Gated Bi-directional CNN for Object Detection
  - Crafting GBD-Net for Object Detection
  - StuffNet: Using ‘Stuff’ to Improve Object Detection
  - Generalized Haar Filter based Deep Networks for Real-Time Object Detection in Traffic Scene
  - Hierarchical Object Detection with Deep Reinforcement Learning
  - Learning to detect and localize many objects from few examples
  - Speed/accuracy trade-offs for modern convolutional object detectors
  - SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving
- Feature Pyramid Network (FPN)
  - Feature Pyramid Networks for Object Detection
  - Action-Driven Object Detection with Top-Down Visual Attentions
  - Beyond Skip Connections: Top-Down Modulation for Object Detection
  - Wide-Residual-Inception Networks for Real-time Object Detection
  - Attentional Network for Visual Object Detection
  - Learning Chained Deep Features and Classifiers for Cascade in Object Detection
  - DeNet: Scalable Real-time Object Detection with Directed Sparse Sampling
  - Discriminative Bimodal Networks for Visual Localization and Detection with Natural Language Queries
  - Spatial Memory for Context Reasoning in Object Detection
  - Accurate Single Stage Detector Using Recurrent Rolling Convolution
  - Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection
  - LCDet: Low-Complexity Fully-Convolutional Neural Networks for Object Detection in Embedded Systems
  - Point Linking Network for Object Detection
  - Perceptual Generative Adversarial Networks for Small Object Detection
  - Few-shot Object Detection
  - Yes-Net: An effective Detector Based on Global Information
  - SMC Faster R-CNN: Toward a scene-specialized multi-object detector
  - Towards lightweight convolutional neural networks for object detection
  - RON: Reverse Connection with Objectness Prior Networks for Object Detection
  - Mimicking Very Efficient Network for Object Detection
  - Residual Features and Unified Prediction Network for Single Stage Detection
  - Deformable Part-based Fully Convolutional Network for Object Detection
  - Adaptive Feeding: Achieving Fast and Accurate Detections by Adaptively Combining Object Detectors
  - Recurrent Scale Approximation for Object Detection in CNN
- DSOD
  - DSOD: Learning Deeply Supervised Object Detectors from Scratch
  - Object Detection from Scratch with Deep Supervision
  - Focal Loss for Dense Object Detection
  - Focal Loss Dense Detector for Vehicle Surveillance
  - CoupleNet: Coupling Global Structure with Local Parts for Object Detection
  - Incremental Learning of Object Detectors without Catastrophic Forgetting
  - Zoom Out-and-In Network with Map Attention Decision for Region Proposal and Object Detection
  - StairNet: Top-Down Semantic Aggregation for Accurate One Shot Detection
  - Dynamic Zoom-in Network for Fast Object Detection in Large Images
  - Zero-Annotation Object Detection with Web Knowledge Transfer
- MegDet
  - MegDet: A Large Mini-Batch Object Detector
  - Single-Shot Refinement Neural Network for Object Detection
  - Receptive Field Block Net for Accurate and Fast Object Detection
  - An Analysis of Scale Invariance in Object Detection - SNIP
  - Feature Selective Networks for Object Detection
  - Learning a Rotation Invariant Detector with Rotatable Bounding Box
  - Scalable Object Detection for Stylized Objects
  - Learning Object Detectors from Scratch with Gated Recurrent Feature Pyramids
  - Deep Regionlets for Object Detection
  - Training and Testing Object Detectors with Virtual Images
  - Large-Scale Object Discovery and Detector Adaptation from Unlabeled Video
  - Spot the Difference by Object Detection
  - Localization-Aware Active Learning for Object Detection
  - Object Detection with Mask-based Feature Encoding
  - LSTD: A Low-Shot Transfer Detector for Object Detection
  - Domain Adaptive Faster R-CNN for Object Detection in the Wild
  - Pseudo Mask Augmented Object Detection
  - Revisiting RCNN: On Awakening the Classification Power of Faster RCNN
  - Decoupled Classification Refinement: Hard False Positive Suppression for Object Detection
  - Learning Region Features for Object Detection
  - Single-Shot Bidirectional Pyramid Networks for High-Quality Object Detection
  - Object Detection for Comics using Manga109 Annotations
  - Task-Driven Super Resolution: Object Detection in Low-resolution Images
  - Transferring Common-Sense Knowledge for Object Detection
  - Multi-scale Location-aware Kernel Representation for Object Detection
  - Loss Rank Mining: A General Hard Example Mining Method for Real-time Detectors
  - DetNet: A Backbone network for Object Detection
  - Robust Physical Adversarial Attack on Faster R-CNN Object Detector
  - AdvDetPatch: Attacking Object Detectors with Adversarial Patches
  - Attacking Object Detectors via Imperceptible Patches on Background
  - Physical Adversarial Examples for Object Detectors
  - Quantization Mimic: Towards Very Tiny CNN for Object Detection
  - Object detection at 200 Frames Per Second
  - Object Detection using Domain Randomization and Generative Adversarial Refinement of Synthetic Images
  - SNIPER: Efficient Multi-Scale Training
  - Soft Sampling for Robust Object Detection
  - MetaAnchor: Learning to Detect Objects with Customized Anchors
  - Localization Recall Precision (LRP): A New Performance Metric for Object Detection
  - Auto-Context R-CNN
  - Pooling Pyramid Network for Object Detection
  - Modeling Visual Context is Key to Augmenting Object Detection Datasets
  - Dual Refinement Network for Single-Shot Object Detection
  - Acquisition of Localization Confidence for Accurate Object Detection
  - CornerNet: Detecting Objects as Paired Keypoints
  - Unsupervised Hard Example Mining from Videos for Improved Object Detection
  - SAN: Learning Relationship between Convolutional Features for Multi-Scale Object Detection
  - A Survey of Modern Object Detection Literature using Deep Learning
  - Tiny-DSOD: Lightweight Object Detection for Resource-Restricted Usages
  - Deep Feature Pyramid Reconfiguration for Object Detection
  - MDCN: Multi-Scale, Deep Inception Convolutional Neural Networks for Efficient Object Detection
  - Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks
  - Deep Learning for Generic Object Detection: A Survey
  - Training Confidence-Calibrated Classifier for Detecting Out-of-Distribution Samples
  - ScratchDet:Exploring to Train Single-Shot Object Detectors from Scratch
  - Fast and accurate object detection in high resolution 4K and 8K video using GPUs
  - Hybrid Knowledge Routed Modules for Large-scale Object Detection
  - Gradient Harmonized Single-stage Detector
  - M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network
  - BAN: Focusing on Boundary Context for Object Detection
  - Multi-layer Pruning Framework for Compressing Single Shot MultiBox Detector
  - R2CNN++: Multi-Dimensional Attention Based Rotation Invariant Detector with Robust Anchor Strategy
  - DeRPN: Taking a further step toward more general object detection
  - Fast Efficient Object Detection Using Selective Attention
  - Sampling Techniques for Large-Scale Object Detection from Sparsely Annotated Objects
  - Efficient Coarse-to-Fine Non-Local Module for the Detection of Small Objects
  - Deep Regionlets: Blended Representation and Deep Learning for Generic Object Detection
  - Grid R-CNN
  - Transferable Adversarial Attacks for Image and Video Object Detection
  - Anchor Box Optimization for Object Detection
  - AutoFocus: Efficient Multi-Scale Inference
  - Practical Adversarial Attack Against Object Detector
  - Learning Efficient Detector with Semi-supervised Adaptive Distillation
- Non-Maximum Suppression (NMS)
  - End-to-End Integration of a Convolutional Network, Deformable Parts Model and Non-Maximum Suppression
  - A convnet for non-maximum suppression
  - Soft-NMS – Improving Object Detection With One Line of Code
  - Learning non-maximum suppression
  - Relation Networks for Object Detection
- Adversarial Examples
  - Adversarial Examples that Fool Detectors
  - Adversarial Examples Are Not Easily Detected: Bypassing Ten Detection Methods
- Weakly Supervised Object Detection
  - Track and Transfer: Watching Videos to Simulate Strong Human Supervision for Weakly-Supervised Object Detection
  - Weakly supervised object detection using pseudo-strong labels
  - Saliency Guided End-to-End Learning for Weakly Supervised Object Detection
  - Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
- Video Object Detection
  - Learning Object Class Detectors from Weakly Annotated Video
  - Analysing domain shift factors between videos and images for object detection
  - Video Object Recognition
  - Deep Learning for Saliency Prediction in Natural Video
  - T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos
  - Object Detection from Video Tubelets with Convolutional Neural Networks
  - Object Detection in Videos with Tubelets and Multi-context Cues
  - Context Matters: Refining Object Detection in Video with Recurrent Neural Networks
  - CNN Based Object Detection in Large Video Images
  - Object Detection in Videos with Tubelet Proposal Networks
  - Flow-Guided Feature Aggregation for Video Object Detection
  - Video Object Detection using Faster R-CNN
  - Improving Context Modeling for Video Object Detection and Tracking
  - Temporal Dynamic Graph LSTM for Action-driven Video Object Detection
  - Mobile Video Object Detection with Temporally-Aware Feature Maps
  - Towards High Performance Video Object Detection
  - Impression Network for Video Object Detection
  - Spatial-Temporal Memory Networks for Video Object Detection
  - 3D-DETNet: a Single Stage Video-Based Vehicle Detector
  - Object Detection in Videos by Short and Long Range Object Linking
  - Object Detection in Video with Spatiotemporal Sampling Networks
  - Towards High Performance Video Object Detection for Mobiles
  - Optimizing Video Object Detection via a Scale-Time Lattice
  - Pack and Detect: Fast Object Detection in Videos Using Region-of-Interest Packing
  - Fast Object Detection in Compressed Video
  - Tube-CNN: Modeling temporal evolution of appearance for object detection in video
- Object Detection on Mobile Devices
  - Pelee: A Real-Time Object Detection System on Mobile Devices
- Object Detection in 3D
  - Vote3Deep: Fast Object Detection in 3D Point Clouds Using Efficient Convolutional Neural Networks
  - Complex-YOLO: Real-time 3D Object Detection on Point Clouds
  - Focal Loss in 3D Object Detection
- Object Detection on RGB-D
  - Learning Rich Features from RGB-D Images for Object Detection and Segmentation
  - Differential Geometry Boosts Convolutional Neural Networks for Object Detection
  - A Self-supervised Learning System for Object Detection using Physics Simulation and Multi-view Pose Estimation
- Zero-Shot Object Detection
  - Zero-Shot Detection
  - Zero-Shot Object Detection
  - Zero-Shot Object Detection: Learning to Simultaneously Recognize and Localize Novel Concepts
  - Zero-Shot Object Detection by Hybrid Region Embedding
- Salient Object Detection
  - Best Deep Saliency Detection Models (CVPR 2016 & 2015)
  - Large-scale optimization of hierarchical features for saliency prediction in natural images
  - Predicting Eye Fixations using Convolutional Neural Networks
  - Saliency Detection by Multi-Context Deep Learning
  - DeepSaliency: Multi-Task Deep Neural Network Model for Salient Object Detection
  - SuperCNN: A Superpixelwise Convolutional Neural Network for Salient Object Detection
  - Shallow and Deep Convolutional Networks for Saliency Prediction
  - Recurrent Attentional Networks for Saliency Detection
  - Two-Stream Convolutional Networks for Dynamic Saliency Prediction
- Unconstrained Salient Object Detection
  - Unconstrained Salient Object Detection via Proposal Subset Optimization
  - DHSNet: Deep Hierarchical Saliency Network for Salient Object Detection
  - Salient Object Subitizing
  - Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection
  - Saliency Detection via Combining Region-Level and Pixel-Level Predictions with CNNs
  - Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection
  - A Deep Multi-Level Network for Saliency Prediction
  - Visual Saliency Detection Based on Multiscale Deep CNN Features
  - A Deep Spatial Contextual Long-term Recurrent Convolutional Network for Saliency Detection
  - Deeply supervised salient object detection with short connections
  - Weakly Supervised Top-down Salient Object Detection
  - SalGAN: Visual Saliency Prediction with Generative Adversarial Networks
  - Visual Saliency Prediction Using a Mixture of Deep Neural Networks
  - A Fast and Compact Salient Score Regression Network Based on Fully Convolutional Network
  - Saliency Detection by Forward and Backward Cues in Deep-CNNs
  - Supervised Adversarial Networks for Image Saliency Detection
  - Group-wise Deep Co-saliency Detection
  - Towards the Success Rate of One: Real-time Unconstrained Salient Object Detection
  - Amulet: Aggregating Multi-level Convolutional Features for Salient Object Detection
  - Learning Uncertain Convolutional Features for Accurate Saliency Detection
  - Deep Edge-Aware Saliency Detection
  - Self-explanatory Deep Salient Object Detection
  - PiCANet: Learning Pixel-wise Contextual Attention in ConvNets and Its Application in Saliency Detection
  - DeepFeat: A Bottom Up and Top Down Saliency Model Based on Deep Features of Convolutional Neural Nets
  - Recurrently Aggregating Deep Features for Salient Object Detection
  - Deep saliency: What is learnt by a deep network about saliency?
  - Contrast-Oriented Deep Neural Networks for Salient Object Detection
  - Salient Object Detection by Lossless Feature Reflection
  - HyperFusion-Net: Densely Reflective Fusion for Salient Object Detection
- Video Saliency Detection
  - Deep Learning For Video Saliency Detection
  - Video Salient Object Detection Using Spatiotemporal Deep Features
  - Predicting Video Saliency with Object-to-Motion CNN and Two-layer Convolutional LSTM
- Visual Relationship Detection
  - Visual Relationship Detection with Language Priors
  - ViP-CNN: A Visual Phrase Reasoning Convolutional Neural Network for Visual Relationship Detection
  - Visual Translation Embedding Network for Visual Relation Detection
  - Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection
  - Detecting Visual Relationships with Deep Relational Networks
  - Identifying Spatial Relations in Images using Convolutional Neural Networks
  - PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN
  - Natural Language Guided Visual Relationship Detection
  - Detecting Visual Relationships Using Box Attention
  - Google AI Open Images - Visual Relationship Track
  - Context-Dependent Diffusion Network for Visual Relationship Detection
  - A Problem Reduction Approach for Visual Relationships Detection
- Face Deteciton
  - Multi-view Face Detection Using Deep Convolutional Neural Networks
  - From Facial Parts Responses to Face Detection: A Deep Learning Approach
  - Compact Convolutional Neural Network Cascade for Face Detection
  - Face Detection with End-to-End Integration of a ConvNet and a 3D Model
  - CMS-RCNN: Contextual Multi-Scale Region-based CNN for Unconstrained Face Detection
  - Towards a Deep Learning Framework for Unconstrained Face Detection
  - Supervised Transformer Network for Efficient Face Detection
  - UnitBox: An Advanced Object Detection Network
  - Bootstrapping Face Detection with Hard Negative Examples
  - Grid Loss: Detecting Occluded Faces
  - A Multi-Scale Cascade Fully Convolutional Network Face Detector
- MTCNN
  - Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Neural Networks
  - Face Detection using Deep Learning: An Improved Faster RCNN Approach
  - Faceness-Net: Face Detection through Deep Facial Part Responses
  - Multi-Path Region-Based Convolutional Neural Network for Accurate Detection of Unconstrained “Hard Faces”
  - End-To-End Face Detection and Recognition
  - Face R-CNN
  - Face Detection through Scale-Friendly Deep Convolutional Networks
  - Scale-Aware Face Detection
  - Detecting Faces Using Inside Cascaded Contextual CNN
  - Multi-Branch Fully Convolutional Network for Face Detection
  - SSH: Single Stage Headless Face Detector
  - Dockerface: an easy to install and use Faster R-CNN face detector in a Docker container
  - FaceBoxes: A CPU Real-time Face Detector with High Accuracy
  - S3FD: Single Shot Scale-invariant Face Detector
  - Detecting Faces Using Region-based Fully Convolutional Networks
  - AffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection
  - Face Attention Network: An effective Face Detector for the Occluded Faces
  - Feature Agglomeration Networks for Single Stage Face Detection
  - Face Detection Using Improved Faster RCNN
  - PyramidBox: A Context-assisted Single Shot Face Detector
  - A Fast Face Detection Method via Convolutional Neural Network
  - Beyond Trade-off: Accelerate FCN-based Face Detector with Higher Accuracy
  - Real-Time Rotation-Invariant Face Detection with Progressive Calibration Networks
  - SFace: An Efficient Network for Face Detection in Large Scale Variations
  - Survey of Face Detection on Low-quality Images
  - Anchor Cascade for Efficient Face Detection
  - Adversarial Attacks on Face Detectors using Neural Net based Constrained Optimization
  - Selective Refinement Network for High Performance Face Detection
  - DSFD: Dual Shot Face Detector
  - Learning Better Features for Face Detection with Feature Fusion and Segmentation Supervision
  - FA-RPN: Floating Region Proposals for Face Detection
- Detect Small Faces
  - Finding Tiny Faces
  - Detecting and counting tiny faces
  - Seeing Small Faces from Robust Anchor’s Perspective
  - Face-MagNet: Magnifying Feature Maps to Detect Small Faces
  - Robust Face Detection via Learning Small Faces on Hard Images
  - SFA: Small Faces Attention Face Detector
- Person Head Detection
  - Context-aware CNNs for person head detection
  - Detecting Heads using Feature Refine Net and Cascaded Multi-scale Architecture
  - A Comparison of CNN-based Face and Head Detectors for Real-Time Video Surveillance Applications
  - FCHD: A fast and accurate head detector
- Pedestrian Detection / People Detection
  - Pedestrian Detection aided by Deep Learning Semantic Tasks
  - Deep Learning Strong Parts for Pedestrian Detection
  - Taking a Deeper Look at Pedestrians
  - Convolutional Channel Features
  - End-to-end people detection in crowded scenes
  - Learning Complexity-Aware Cascades for Deep Pedestrian Detection
  - Deep convolutional neural networks for pedestrian detection
  - Scale-aware Fast R-CNN for Pedestrian Detection
  - New algorithm improves speed and accuracy of pedestrian detection
  - Pushing the Limits of Deep CNNs for Pedestrian Detection
  - A Real-Time Deep Learning Pedestrian Detector for Robot Navigation
  - A Real-Time Pedestrian Detector using Deep Learning for Human-Aware Navigation
  - Is Faster R-CNN Doing Well for Pedestrian Detection?
  - Unsupervised Deep Domain Adaptation for Pedestrian Detection
  - Reduced Memory Region Based Deep Convolutional Neural Network Detection
  - Fused DNN: A deep neural network fusion approach to fast and robust pedestrian detection
  - Detecting People in Artwork with CNNs
  - Multispectral Deep Neural Networks for Pedestrian Detection
  - Deep Multi-camera People Detection
  - Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters
  - What Can Help Pedestrian Detection?
  - Illuminating Pedestrians via Simultaneous Detection & Segmentation
  - Rotational Rectification Network for Robust Pedestrian Detection
  - STD-PD: Generating Synthetic Training Data for Pedestrian Detection in Unannotated Videos
  - Too Far to See? Not Really! — Pedestrian Detection with Scale-aware Localization Policy
  - Repulsion Loss: Detecting Pedestrians in a Crowd
  - Aggregated Channels Network for Real-Time Pedestrian Detection
  - Illumination-aware Faster R-CNN for Robust Multispectral Pedestrian Detection
  - Exploring Multi-Branch and High-Level Semantic Networks for Improving Pedestrian Detection
  - Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and Beyond
  - PCN: Part and Context Information for Pedestrian Detection with CNNs
  - Small-scale Pedestrian Detection Based on Somatic Topology Localization and Temporal Feature Aggregation
  - Occlusion-aware R-CNN: Detecting Pedestrians in a Crowd
  - Multispectral Pedestrian Detection via Simultaneous Detection and Segmentation
  - Pedestrian Detection with Autoregressive Network Phases
- Vehicle Detection
  - DAVE: A Unified Framework for Fast Vehicle Detection and Annotation
  - Evolving Boxes for fast Vehicle Detection
  - Fine-Grained Car Detection for Visual Census Estimation
  - SINet: A Scale-insensitive Convolutional Neural Network for Fast Vehicle Detection
  - Label and Sample: Efficient Training of Vehicle Object Detector from Sparsely Labeled Data
  - Domain Randomization for Scene-Specific Car Detection and Pose Estimation
  - ShuffleDet: Real-Time Vehicle Detection Network in On-board Embedded UAV Imagery
- Traffic-Sign Detection
  - Traffic-Sign Detection and Classification in the Wild
  - Evaluating State-of-the-art Object Detector on Challenging Traffic Light Data
  - Detecting Small Signs from Large Images
  - Localized Traffic Sign Detection with Multi-scale Deconvolution Networks
  - Detecting Traffic Lights by Single Shot Detection
  - A Hierarchical Deep Architecture and Mini-Batch Selection Method For Joint Traffic Sign and Light Detection
- Skeleton Detection
  - Object Skeleton Extraction in Natural Images by Fusing Scale-associated Deep Side Outputs
  - DeepSkeleton: Learning Multi-task Scale-associated Deep Side Outputs for Object Skeleton Extraction in Natural Images
  - SRN: Side-output Residual Network for Object Symmetry Detection in the Wild
  - Hi-Fi: Hierarchical Feature Integration for Skeleton Detection
- Fruit Detection
  - Deep Fruit Detection in Orchards
  - Image Segmentation for Fruit Detection and Yield Estimation in Apple Orchards
- Shadow Detection
  - Fast Shadow Detection from a Single Image Using a Patched Convolutional Neural Network
  - A+D-Net: Shadow Detection with Adversarial Shadow Attenuation
  - Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal
  - Direction-aware Spatial Context Features for Shadow Detection
  - Direction-aware Spatial Context Features for Shadow Detection and Removal
- Others Detection
  - Deep Deformation Network for Object Landmark Localization
  - Fashion Landmark Detection in the Wild
  - Deep Learning for Fast and Accurate Fashion Item Detection
  - OSMDeepOD - OSM and Deep Learning based Object Detection from Aerial Imagery (formerly known as “OSM-Crosswalk-Detection”)
  - Selfie Detection by Synergy-Constraint Based Convolutional Neural Network
  - Associative Embedding:End-to-End Learning for Joint Detection and Grouping
  - Deep Cuboid Detection: Beyond 2D Bounding Boxes
  - Automatic Model Based Dataset Generation for Fast and Accurate Crop and Weeds Detection
  - Deep Learning Logo Detection with Data Expansion by Synthesising Context
  - Scalable Deep Learning Logo Detection
  - Pixel-wise Ear Detection with Convolutional Encoder-Decoder Networks
  - Automatic Handgun Detection Alarm in Videos Using Deep Learning
  - Objects as context for part detection
  - Using Deep Networks for Drone Detection
  - Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection
  - Target Driven Instance Detection
  - DeepVoting: An Explainable Framework for Semantic Part Detection under Partial Occlusion
  - VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition
  - Grab, Pay and Eat: Semantic Food Detection for Smart Restaurants
  - ReMotENet: Efficient Relevant Motion Event Detection for Large-scale Home Surveillance Videos
  - Deep Learning Object Detection Methods for Ecological Camera Trap Data
  - EL-GAN: Embedding Loss Driven Generative Adversarial Networks for Lane Detection
  - Towards End-to-End Lane Detection: an Instance Segmentation Approach
  - iCAN: Instance-Centric Attention Network for Human-Object Interaction Detection
  - Densely Supervised Grasp Detector (DSGD)
- Object Proposal
  - DeepProposal: Hunting Objects by Cascading Deep Convolutional Layers
  - Scale-aware Pixel-wise Object Proposal Networks
  - Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization
  - Learning to Segment Object Proposals via Recursive Neural Networks
  - Learning Detection with Diverse Proposals
  - ScaleNet: Guiding Object Proposal Generation in Supermarkets and Beyond
  - Improving Small Object Proposals for Company Logo Detection
  - Open Logo Detection Challenge
  - AttentionMask: Attentive, Efficient Object Proposal Generation Focusing on Small Objects
- Localization
  - Beyond Bounding Boxes: Precise Localization of Objects in Images
  - Weakly Supervised Object Localization with Multi-fold Multiple Instance Learning
  - Weakly Supervised Object Localization Using Size Estimates
  - Active Object Localization with Deep Reinforcement Learning
  - Localizing objects using referring expressions
  - LocNet: Improving Localization Accuracy for Object Detection
  - Learning Deep Features for Discriminative Localization
  - ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization
  - Ensemble of Part Detectors for Simultaneous Classification and Localization
  - STNet: Selective Tuning of Convolutional Networks for Object Localization
  - Soft Proposal Networks for Weakly Supervised Object Localization
  - Fine-grained Discriminative Localization via Saliency-guided Faster R-CNN
- Tutorials / Talks
  - Convolutional Feature Maps: Elements of efficient (and accurate) CNN-based object detection
  - Towards Good Practices for Recognition & Detection
  - Work in progress: Improving object detection and instance segmentation for small objects
  - Object Detection with Deep Learning: A Review
- Projects
  - Detectron
  - TensorBox: a simple framework for training neural networks to detect objects in images
  - Object detection in torch: Implementation of some object detection frameworks in torch
  - Using DIGITS to train an Object Detection network
  - FCN-MultiBox Detector
  - KittiBox: A car detection model implemented in Tensorflow.
  - Deformable Convolutional Networks + MST + Soft-NMS
  - How to Build a Real-time Hand-Detector using Neural Networks (SSD) on Tensorflow
  - Metrics for object detection
  - MobileNetv2-SSDLite
- Leaderboard
  - Detection Results: VOC2012
- Tools
  - BeaverDam: Video annotation tool for deep learning training labels
- Blogs
  - Convolutional Neural Networks for Object Detection
  - Introducing automatic object detection to visual search (Pinterest)
  - Deep Learning for Object Detection with DIGITS
  - Analyzing The Papers Behind Facebook’s Computer Vision Approach
  - Easily Create High Quality Object Detectors with Deep Learning
  - How to Train a Deep-Learned Object Detection Model in the Microsoft Cognitive Toolkit
  - Object Detection in Satellite Imagery, a Low Overhead Approach
  - You Only Look Twice — Multi-Scale Object Detection in Satellite Imagery With Convolutional Neural Networks
  - Faster R-CNN Pedestrian and Car Detection
  - Small U-Net for vehicle detection
  - Region of interest pooling explained
  - Supercharge your Computer Vision models with the TensorFlow Object Detection API
  - Understanding SSD MultiBox — Real-Time Object Detection In Deep Learning
  - One-shot object detection
  - An overview of object detection: one-stage methods

Method	backbone	test size	VOC2007	VOC2010	VOC2012	ILSVRC 2013	MSCOCO 2015	Speed
OverFeat						24.3%
R-CNN	AlexNet		58.5%	53.7%	53.3%	31.4%
R-CNN	VGG17		66.0%
SPP_net	ZF-5		54.2%			31.84%
DeepID-Net			64.1%			50.3%
NoC			73.3%		68.8%
Fast-RCNN	VGG16		70.0%	68.8%	68.4%		19.7%(@[0.5-0.95]), 35.9%(@0.5)
MR-CNN			78.2%		73.9%
Faster-RCNN	VGG16		78.8%		75.9%		21.9%(@[0.5-0.95]), 42.7%(@0.5)	198ms
Faster-RCNN	ResNet101		85.6%		83.8%		37.4%(@[0.5-0.95]), 59.0%(@0.5)
YOLO			63.4%		57.9%			45 fps
YOLO	VGG-16		66.4%					21 fps
YOLOv2		448x448	78.6%		73.4%		21.6%(@[0.5-0.95]), 44.0%(@0.5)	40 fps
SSD	VGG16	300x300	77.2%		75.8%		25.1%(@[0.5-0.95]), 43.1%(@0.5)	46 fps
SSD	VGG16	512x512	79.8%		78.5%		28.8%(@[0.5-0.95]), 48.5%(@0.5)	19 fps
SSD	ResNet101	300x300					28.0%(@[0.5-0.95])	16 fps
SSD	ResNet101	512x512					31.2%(@[0.5-0.95])	8 fps
DSSD	ResNet101	300x300					28.0%(@[0.5-0.95])	8 fps
DSSD	ResNet101	500x500					33.2%(@[0.5-0.95])	6 fps
ION			79.2%		76.4%
CRAFT			75.7%		71.3%	48.5%
OHEM			78.9%		76.3%		25.5%(@[0.5-0.95]), 45.9%(@0.5)
R-FCN	ResNet50		77.4%					0.12sec(K40), 0.09sec(TitianX)
R-FCN	ResNet101		79.5%					0.17sec(K40), 0.12sec(TitianX)
R-FCN(ms train)	ResNet101		83.6%		82.0%		31.5%(@[0.5-0.95]), 53.2%(@0.5)
PVANet 9.0			84.9%		84.2%			750ms(CPU), 46ms(TitianX)
RetinaNet	ResNet101-FPN
Light-Head R-CNN	Xception*	800/1200					31.5%@[0.5:0.95]	95 fps
Light-Head R-CNN	Xception*	700/1100					30.7%@[0.5:0.95]	102 fps

Papers

Deep Neural Networks for Object Detection

paper: http://papers.nips.cc/paper/5207-deep-neural-networks-for-object-detection.pdf

OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks

arxiv: http://arxiv.org/abs/1312.6229
github: https://github.com/sermanet/OverFeat
code: http://cilvr.nyu.edu/doku.php?id=software:overfeat:start

R-CNN

Rich feature hierarchies for accurate object detection and semantic segmentation

intro: R-CNN
arxiv: http://arxiv.org/abs/1311.2524
supp: http://people.eecs.berkeley.edu/~rbg/papers/r-cnn-cvpr-supp.pdf
slides: http://www.image-net.org/challenges/LSVRC/2013/slides/r-cnn-ilsvrc2013-workshop.pdf
slides: http://www.cs.berkeley.edu/~rbg/slides/rcnn-cvpr14-slides.pdf
github: https://github.com/rbgirshick/rcnn
notes: http://zhangliliang.com/2014/07/23/paper-note-rcnn/
caffe-pr(“Make R-CNN the Caffe detection example”): https://github.com/BVLC/caffe/pull/482

Fast R-CNN

arxiv: http://arxiv.org/abs/1504.08083
slides: http://tutorial.caffe.berkeleyvision.org/caffe-cvpr15-detection.pdf
github: https://github.com/rbgirshick/fast-rcnn
github(COCO-branch): https://github.com/rbgirshick/fast-rcnn/tree/coco
webcam demo: https://github.com/rbgirshick/fast-rcnn/pull/29
notes: http://zhangliliang.com/2015/05/17/paper-note-fast-rcnn/
notes: http://blog.csdn.net/linj_m/article/details/48930179
github(“Fast R-CNN in MXNet”): https://github.com/precedenceguo/mx-rcnn
github: https://github.com/mahyarnajibi/fast-rcnn-torch
github: https://github.com/apple2373/chainer-simple-fast-rnn
github: https://github.com/zplizzi/tensorflow-fast-rcnn

A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection

intro: CVPR 2017
arxiv: https://arxiv.org/abs/1704.03414
paper: http://abhinavsh.info/papers/pdfs/adversarial_object_detection.pdf
github(Caffe): https://github.com/xiaolonw/adversarial-frcnn

Faster R-CNN

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

intro: NIPS 2015
arxiv: http://arxiv.org/abs/1506.01497
gitxiv: http://www.gitxiv.com/posts/8pfpcvefDYn2gSgXk/faster-r-cnn-towards-real-time-object-detection-with-region
slides: http://web.cs.hacettepe.edu.tr/~aykut/classes/spring2016/bil722/slides/w05-FasterR-CNN.pdf
github(official, Matlab): https://github.com/ShaoqingRen/faster_rcnn
github: https://github.com/rbgirshick/py-faster-rcnn
github(MXNet): https://github.com/msracver/Deformable-ConvNets/tree/master/faster_rcnn
github: https://github.com//jwyang/faster-rcnn.pytorch
github: https://github.com/mitmul/chainer-faster-rcnn
github: https://github.com/andreaskoepf/faster-rcnn.torch
github: https://github.com/ruotianluo/Faster-RCNN-Densecap-torch
github: https://github.com/smallcorgi/Faster-RCNN_TF
github: https://github.com/CharlesShang/TFFRCNN
github(C++ demo): https://github.com/YihangLou/FasterRCNN-Encapsulation-Cplusplus
github: https://github.com/yhenon/keras-frcnn
github: https://github.com/Eniac-Xie/faster-rcnn-resnet
github(C++): https://github.com/D-X-Y/caffe-faster-rcnn/tree/dev

R-CNN minus R

intro: BMVC 2015
arxiv: http://arxiv.org/abs/1506.06981

Faster R-CNN in MXNet with distributed implementation and data parallelization

github: https://github.com/dmlc/mxnet/tree/master/example/rcnn

Contextual Priming and Feedback for Faster R-CNN

intro: ECCV 2016. Carnegie Mellon University
paper: http://abhinavsh.info/context_priming_feedback.pdf
poster: http://www.eccv2016.org/files/posters/P-1A-20.pdf

An Implementation of Faster RCNN with Study for Region Sampling

intro: Technical Report, 3 pages. CMU
arxiv: https://arxiv.org/abs/1702.02138
github: https://github.com/endernewton/tf-faster-rcnn

Interpretable R-CNN

intro: North Carolina State University & Alibaba
keywords: AND-OR Graph (AOG)
arxiv: https://arxiv.org/abs/1711.05226

Light-Head R-CNN

Light-Head R-CNN: In Defense of Two-Stage Object Detector

intro: Tsinghua University & Megvii Inc
arxiv: https://arxiv.org/abs/1711.07264
github(official, Tensorflow): https://github.com/zengarden/light_head_rcnn
github: https://github.com/terrychenism/Deformable-ConvNets/blob/master/rfcn/symbols/resnet_v1_101_rfcn_light.py#L784

##Cascade R-CNN

Cascade R-CNN: Delving into High Quality Object Detection

intro: CVPR 2018. UC San Diego
arxiv: https://arxiv.org/abs/1712.00726
github(Caffe, official): https://github.com/zhaoweicai/cascade-rcnn

MultiBox

Scalable Object Detection using Deep Neural Networks

intro: first MultiBox. Train a CNN to predict Region of Interest.
arxiv: http://arxiv.org/abs/1312.2249
github: https://github.com/google/multibox
blog: https://research.googleblog.com/2014/12/high-quality-object-detection-at-scale.html

Scalable, High-Quality Object Detection

intro: second MultiBox
arxiv: http://arxiv.org/abs/1412.1441
github: https://github.com/google/multibox

SPP-Net

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

intro: ECCV 2014 / TPAMI 2015
arxiv: http://arxiv.org/abs/1406.4729
github: https://github.com/ShaoqingRen/SPP_net
notes: http://zhangliliang.com/2014/09/13/paper-note-sppnet/

DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection

intro: PAMI 2016
intro: an extension of R-CNN. box pre-training, cascade on region proposals, deformation layers and context representations
project page: http://www.ee.cuhk.edu.hk/˜wlouyang/projects/imagenetDeepId/index.html
arxiv: http://arxiv.org/abs/1412.5661

Object Detectors Emerge in Deep Scene CNNs

intro: ICLR 2015
arxiv: http://arxiv.org/abs/1412.6856
paper: https://www.robots.ox.ac.uk/~vgg/rg/papers/zhou_iclr15.pdf
paper: https://people.csail.mit.edu/khosla/papers/iclr2015_zhou.pdf
slides: http://places.csail.mit.edu/slide_iclr2015.pdf

segDeepM: Exploiting Segmentation and Context in Deep Neural Networks for Object Detection

intro: CVPR 2015
project(code+data): https://www.cs.toronto.edu/~yukun/segdeepm.html
arxiv: https://arxiv.org/abs/1502.04275
github: https://github.com/YknZhu/segDeepM

Object Detection Networks on Convolutional Feature Maps

intro: TPAMI 2015
keywords: NoC
arxiv: http://arxiv.org/abs/1504.06066

Improving Object Detection with Deep Convolutional Networks via Bayesian Optimization and Structured Prediction

arxiv: http://arxiv.org/abs/1504.03293
slides: http://www.ytzhang.net/files/publications/2015-cvpr-det-slides.pdf
github: https://github.com/YutingZhang/fgs-obj

DeepBox: Learning Objectness with Convolutional Networks

keywords: DeepBox
arxiv: http://arxiv.org/abs/1505.02146
github: https://github.com/weichengkuo/DeepBox

MR-CNN

Object detection via a multi-region & semantic segmentation-aware CNN model

intro: ICCV 2015. MR-CNN
arxiv: http://arxiv.org/abs/1505.01749
github: https://github.com/gidariss/mrcnn-object-detection
notes: http://zhangliliang.com/2015/05/17/paper-note-ms-cnn/
notes: http://blog.cvmarcher.com/posts/2015/05/17/multi-region-semantic-segmentation-aware-cnn/

YOLO

You Only Look Once: Unified, Real-Time Object Detection

arxiv: http://arxiv.org/abs/1506.02640
code: http://pjreddie.com/darknet/yolo/
github: https://github.com/pjreddie/darknet
blog: https://pjreddie.com/publications/yolo/
slides: https://docs.google.com/presentation/d/1aeRvtKG21KHdD5lg6Hgyhx5rPq_ZOsGjG5rJ1HP7BbA/pub?start=false&loop=false&delayms=3000&slide=id.p
reddit: https://www.reddit.com/r/MachineLearning/comments/3a3m0o/realtime_object_detection_with_yolo/
github: https://github.com/gliese581gg/YOLO_tensorflow
github: https://github.com/xingwangsfu/caffe-yolo
github: https://github.com/frankzhangrui/Darknet-Yolo
github: https://github.com/BriSkyHekun/py-darknet-yolo
github: https://github.com/tommy-qichang/yolo.torch
github: https://github.com/frischzenger/yolo-windows
github: https://github.com/AlexeyAB/yolo-windows
github: https://github.com/nilboy/tensorflow-yolo

darkflow - translate darknet to tensorflow. Load trained weights, retrain/fine-tune them using tensorflow, export constant graph def to C++

blog: https://thtrieu.github.io/notes/yolo-tensorflow-graph-buffer-cpp
github: https://github.com/thtrieu/darkflow

Start Training YOLO with Our Own Data

intro: train with customized data and class numbers/labels. Linux / Windows version for darknet.
blog: http://guanghan.info/blog/en/my-works/train-yolo/
github: https://github.com/Guanghan/darknet

YOLO: Core ML versus MPSNNGraph

intro: Tiny YOLO for iOS implemented using CoreML but also using the new MPS graph API.
blog: http://machinethink.net/blog/yolo-coreml-versus-mps-graph/
github: https://github.com/hollance/YOLO-CoreML-MPSNNGraph

TensorFlow YOLO object detection on Android

intro: Real-time object detection on Android using the YOLO network with TensorFlow
github: https://github.com/natanielruiz/android-yolo

Computer Vision in iOS – Object Detection

blog: https://sriraghu.com/2017/07/12/computer-vision-in-ios-object-detection/
github:https://github.com/r4ghu/iOS-CoreML-Yolo

YOLOv2

YOLO9000: Better, Faster, Stronger

arxiv: https://arxiv.org/abs/1612.08242
code: http://pjreddie.com/yolo9000/
github(Chainer): https://github.com/leetenki/YOLOv2
github(Keras): https://github.com/allanzelener/YAD2K
github(PyTorch): https://github.com/longcw/yolo2-pytorch
github(Tensorflow): https://github.com/hizhangp/yolo_tensorflow
github(Windows): https://github.com/AlexeyAB/darknet
github: https://github.com/choasUp/caffe-yolo9000
github: https://github.com/philipperemy/yolo-9000

darknet_scripts

intro: Auxilary scripts to work with (YOLO) darknet deep learning famework. AKA -> How to generate YOLO anchors?
github: https://github.com/Jumabek/darknet_scripts

Yolo_mark: GUI for marking bounded boxes of objects in images for training Yolo v2

github: https://github.com/AlexeyAB/Yolo_mark

LightNet: Bringing pjreddie’s DarkNet out of the shadows

github: https://github.com//explosion/lightnet

YOLO v2 Bounding Box Tool

intro: Bounding box labeler tool to generate the training data in the format YOLO v2 requires.
github: https://github.com/Cartucho/yolo-boundingbox-labeler-GUI

YOLOv3

YOLOv3: An Incremental Improvement

project page: https://pjreddie.com/darknet/yolo/
arxiv: https://arxiv.org/abs/1804.02767
github: https://github.com/DeNA/PyTorch_YOLOv3

YOLO-LITE: A Real-Time Object Detection Algorithm Optimized for Non-GPU Computers

arxiv：https://arxiv.org/abs/1811.05588

AttentionNet: Aggregating Weak Directions for Accurate Object Detection

intro: ICCV 2015
intro: state-of-the-art performance of 65% (AP) on PASCAL VOC 2007/2012 human detection task
arxiv: http://arxiv.org/abs/1506.07704
slides: https://www.robots.ox.ac.uk/~vgg/rg/slides/AttentionNet.pdf
slides: http://image-net.org/challenges/talks/lunit-kaist-slide.pdf

DenseBox

DenseBox: Unifying Landmark Localization with End to End Object Detection

arxiv: http://arxiv.org/abs/1509.04874
demo: http://pan.baidu.com/s/1mgoWWsS
KITTI result: http://www.cvlibs.net/datasets/kitti/eval_object.php

SSD

SSD: Single Shot MultiBox Detector

intro: ECCV 2016 Oral
arxiv: http://arxiv.org/abs/1512.02325
paper: http://www.cs.unc.edu/~wliu/papers/ssd.pdf
slides: http://www.cs.unc.edu/~wliu/papers/ssd_eccv2016_slide.pdf
github(Official): https://github.com/weiliu89/caffe/tree/ssd
video: http://weibo.com/p/2304447a2326da963254c963c97fb05dd3a973
github: https://github.com/zhreshold/mxnet-ssd
github: https://github.com/zhreshold/mxnet-ssd.cpp
github: https://github.com/rykov8/ssd_keras
github: https://github.com/balancap/SSD-Tensorflow
github: https://github.com/amdegroot/ssd.pytorch
github(Caffe): https://github.com/chuanqi305/MobileNet-SSD
What’s the diffience in performance between this new code you pushed and the previous code? #327
https://github.com/weiliu89/caffe/issues/327

DSSD

DSSD : Deconvolutional Single Shot Detector

intro: UNC Chapel Hill & Amazon Inc
arxiv: https://arxiv.org/abs/1701.06659
github: https://github.com/chengyangfu/caffe/tree/dssd
github: https://github.com/MTCloudVision/mxnet-dssd
demo: http://120.52.72.53/www.cs.unc.edu/c3pr90ntc0td/~cyfu/dssd_lalaland.mp4

Enhancement of SSD by concatenating feature maps for object detection

intro: rainbow SSD (R-SSD)
arxiv: https://arxiv.org/abs/1705.09587

Context-aware Single-Shot Detector

keywords: CSSD, DiCSSD, DeCSSD, effective receptive fields (ERFs), theoretical receptive fields (TRFs)
arxiv: https://arxiv.org/abs/1707.08682

Feature-Fused SSD: Fast Detection for Small Objects

https://arxiv.org/abs/1709.05054

FSSD

FSSD: Feature Fusion Single Shot Multibox Detector

https://arxiv.org/abs/1712.00960

Weaving Multi-scale Context for Single Shot Detector

intro: WeaveNet
keywords: fuse multi-scale information
arxiv: https://arxiv.org/abs/1712.03149

ESSD

Extend the shallow part of Single Shot MultiBox Detector via Convolutional Neural Network

arxiv: https://arxiv.org/abs/1801.05918

Tiny SSD: A Tiny Single-shot Detection Deep Convolutional Neural Network for Real-time Embedded Object Detection

arxiv: https://arxiv.org/abs/1802.06488

MDSSD: Multi-scale Deconvolutional Single Shot Detector for small objects

intro: Zhengzhou University
arxiv: https://arxiv.org/abs/1805.07009

Inside-Outside Net (ION)

Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks

intro: “0.8s per image on a Titan X GPU (excluding proposal generation) without two-stage bounding-box regression and 1.15s per image with it”.
arxiv: http://arxiv.org/abs/1512.04143
slides: http://www.seanbell.ca/tmp/ion-coco-talk-bell2015.pdf
coco-leaderboard: http://mscoco.org/dataset/#detections-leaderboard

Adaptive Object Detection Using Adjacency and Zoom Prediction

intro: CVPR 2016. AZ-Net
arxiv: http://arxiv.org/abs/1512.07711
github: https://github.com/luyongxi/az-net
youtube: https://www.youtube.com/watch?v=YmFtuNwxaNM

G-CNN: an Iterative Grid Based Object Detector

arxiv: http://arxiv.org/abs/1512.07729

Factors in Finetuning Deep Model for object detection

Factors in Finetuning Deep Model for Object Detection with Long-tail Distribution

intro: CVPR 2016.rank 3rd for provided data and 2nd for external data on ILSVRC 2015 object detection
project page: http://www.ee.cuhk.edu.hk/~wlouyang/projects/ImageNetFactors/CVPR16.html
arxiv: http://arxiv.org/abs/1601.05150

We don’t need no bounding-boxes: Training object class detectors using only human verification

arxiv: http://arxiv.org/abs/1602.08405

HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection

arxiv: http://arxiv.org/abs/1604.00600

A MultiPath Network for Object Detection

intro: BMVC 2016. Facebook AI Research (FAIR)
arxiv: http://arxiv.org/abs/1604.02135
github: https://github.com/facebookresearch/multipathnet

CRAFT

CRAFT Objects from Images

intro: CVPR 2016. Cascade Region-proposal-network And FasT-rcnn. an extension of Faster R-CNN
project page: http://byangderek.github.io/projects/craft.html
arxiv: https://arxiv.org/abs/1604.03239
paper: http://www.cv-foundation.org/openaccess/content_cvpr_2016/papers/Yang_CRAFT_Objects_From_CVPR_2016_paper.pdf
github: https://github.com/byangderek/CRAFT

OHEM

Training Region-based Object Detectors with Online Hard Example Mining

intro: CVPR 2016 Oral. Online hard example mining (OHEM)
arxiv: http://arxiv.org/abs/1604.03540
paper: http://www.cv-foundation.org/openaccess/content_cvpr_2016/papers/Shrivastava_Training_Region-Based_Object_CVPR_2016_paper.pdf
github(Official): https://github.com/abhi2610/ohem
author page: http://abhinav-shrivastava.info/

S-OHEM: Stratified Online Hard Example Mining for Object Detection

arxiv: https://arxiv.org/abs/1705.02233

Exploit All the Layers: Fast and Accurate CNN Object Detector with Scale Dependent Pooling and Cascaded Rejection Classifiers

intro: CVPR 2016
keywords: scale-dependent pooling (SDP), cascaded rejection classifiers (CRC)
paper: http://www-personal.umich.edu/~wgchoi/SDP-CRC_camready.pdf

R-FCN

R-FCN: Object Detection via Region-based Fully Convolutional Networks

arxiv: http://arxiv.org/abs/1605.06409
github: https://github.com/daijifeng001/R-FCN
github(MXNet): https://github.com/msracver/Deformable-ConvNets/tree/master/rfcn
github: https://github.com/Orpine/py-R-FCN
github: https://github.com/PureDiors/pytorch_RFCN
github: https://github.com/bharatsingh430/py-R-FCN-multiGPU
github: https://github.com/xdever/RFCN-tensorflow

R-FCN-3000 at 30fps: Decoupling Detection and Classification

arxiv: https://arxiv.org/abs/1712.01802

Recycle deep features for better object detection

arxiv: http://arxiv.org/abs/1607.05066

MS-CNN

A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection

intro: ECCV 2016
intro: 640×480: 15 fps, 960×720: 8 fps
arxiv: http://arxiv.org/abs/1607.07155
github: https://github.com/zhaoweicai/mscnn
poster: http://www.eccv2016.org/files/posters/P-2B-38.pdf

Multi-stage Object Detection with Group Recursive Learning

intro: VOC2007: 78.6%, VOC2012: 74.9%
arxiv: http://arxiv.org/abs/1608.05159

Subcategory-aware Convolutional Neural Networks for Object Proposals and Detection

intro: WACV 2017. SubCNN
arxiv: http://arxiv.org/abs/1604.04693
github: https://github.com/tanshen/SubCNN

PVANET

PVANet: Lightweight Deep Neural Networks for Real-time Object Detection

intro: Presented at NIPS 2016 Workshop on Efficient Methods for Deep Neural Networks (EMDNN). Continuation of arXiv:1608.08021
arxiv: https://arxiv.org/abs/1611.08588
github: https://github.com/sanghoon/pva-faster-rcnn
leaderboard(PVANet 9.0): http://host.robots.ox.ac.uk:8080/leaderboard/displaylb.php?challengeid=11&compid=4

GBD-Net

Gated Bi-directional CNN for Object Detection

intro: The Chinese University of Hong Kong & Sensetime Group Limited
paper: http://link.springer.com/chapter/10.1007/978-3-319-46478-7_22
mirror: https://pan.baidu.com/s/1dFohO7v

Crafting GBD-Net for Object Detection

intro: winner of the ImageNet object detection challenge of 2016. CUImage and CUVideo
intro: gated bi-directional CNN (GBD-Net)
arxiv: https://arxiv.org/abs/1610.02579
github: https://github.com/craftGBD/craftGBD

StuffNet: Using ‘Stuff’ to Improve Object Detection

arxiv: https://arxiv.org/abs/1610.05861

Generalized Haar Filter based Deep Networks for Real-Time Object Detection in Traffic Scene

arxiv: https://arxiv.org/abs/1610.09609

Hierarchical Object Detection with Deep Reinforcement Learning

intro: Deep Reinforcement Learning Workshop (NIPS 2016)
project page: https://imatge-upc.github.io/detection-2016-nipsws/
arxiv: https://arxiv.org/abs/1611.03718
slides: http://www.slideshare.net/xavigiro/hierarchical-object-detection-with-deep-reinforcement-learning
github: https://github.com/imatge-upc/detection-2016-nipsws
blog: http://jorditorres.org/nips/

Learning to detect and localize many objects from few examples

arxiv: https://arxiv.org/abs/1611.05664

Speed/accuracy trade-offs for modern convolutional object detectors

intro: CVPR 2017. Google Research
arxiv: https://arxiv.org/abs/1611.10012

SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving

arxiv: https://arxiv.org/abs/1612.01051
github: https://github.com/BichenWuUCB/squeezeDet
github: https://github.com/fregu856/2D_detection

Feature Pyramid Network (FPN)

Feature Pyramid Networks for Object Detection

intro: Facebook AI Research
arxiv: https://arxiv.org/abs/1612.03144

Action-Driven Object Detection with Top-Down Visual Attentions

arxiv: https://arxiv.org/abs/1612.06704

Beyond Skip Connections: Top-Down Modulation for Object Detection

intro: CMU & UC Berkeley & Google Research
arxiv: https://arxiv.org/abs/1612.06851

Wide-Residual-Inception Networks for Real-time Object Detection

intro: Inha University
arxiv: https://arxiv.org/abs/1702.01243

Attentional Network for Visual Object Detection

intro: University of Maryland & Mitsubishi Electric Research Laboratories
arxiv: https://arxiv.org/abs/1702.01478

Learning Chained Deep Features and Classifiers for Cascade in Object Detection

keykwords: CC-Net
intro: chained cascade network (CC-Net). 81.1% mAP on PASCAL VOC 2007
arxiv: https://arxiv.org/abs/1702.07054

DeNet: Scalable Real-time Object Detection with Directed Sparse Sampling

intro: ICCV 2017 (poster)
arxiv: https://arxiv.org/abs/1703.10295

Discriminative Bimodal Networks for Visual Localization and Detection with Natural Language Queries

intro: CVPR 2017
arxiv: https://arxiv.org/abs/1704.03944

Spatial Memory for Context Reasoning in Object Detection

arxiv: https://arxiv.org/abs/1704.04224

Accurate Single Stage Detector Using Recurrent Rolling Convolution

intro: CVPR 2017. SenseTime
keywords: Recurrent Rolling Convolution (RRC)
arxiv: https://arxiv.org/abs/1704.05776
github: https://github.com/xiaohaoChen/rrc_detection

Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection

arxiv: https://arxiv.org/abs/1704.05775

LCDet: Low-Complexity Fully-Convolutional Neural Networks for Object Detection in Embedded Systems

intro: Embedded Vision Workshop in CVPR. UC San Diego & Qualcomm Inc
arxiv: https://arxiv.org/abs/1705.05922

Point Linking Network for Object Detection

intro: Point Linking Network (PLN)
arxiv: https://arxiv.org/abs/1706.03646

Perceptual Generative Adversarial Networks for Small Object Detection

arxiv: https://arxiv.org/abs/1706.05274

Few-shot Object Detection

arxiv: https://arxiv.org/abs/1706.08249

Yes-Net: An effective Detector Based on Global Information

arxiv: https://arxiv.org/abs/1706.09180

SMC Faster R-CNN: Toward a scene-specialized multi-object detector

arxiv: https://arxiv.org/abs/1706.10217

Towards lightweight convolutional neural networks for object detection

arxiv: https://arxiv.org/abs/1707.01395

RON: Reverse Connection with Objectness Prior Networks for Object Detection

intro: CVPR 2017
arxiv: https://arxiv.org/abs/1707.01691
github: https://github.com/taokong/RON

Mimicking Very Efficient Network for Object Detection

intro: CVPR 2017. SenseTime & Beihang University
paper: http://openaccess.thecvf.com/content_cvpr_2017/papers/Li_Mimicking_Very_Efficient_CVPR_2017_paper.pdf

Residual Features and Unified Prediction Network for Single Stage Detection

https://arxiv.org/abs/1707.05031

Deformable Part-based Fully Convolutional Network for Object Detection

intro: BMVC 2017 (oral). Sorbonne Universités & CEDRIC
arxiv: https://arxiv.org/abs/1707.06175

Adaptive Feeding: Achieving Fast and Accurate Detections by Adaptively Combining Object Detectors

intro: ICCV 2017
arxiv: https://arxiv.org/abs/1707.06399

Recurrent Scale Approximation for Object Detection in CNN

intro: ICCV 2017
keywords: Recurrent Scale Approximation (RSA)
arxiv: https://arxiv.org/abs/1707.09531
github: https://github.com/sciencefans/RSA-for-object-detection

DSOD

DSOD: Learning Deeply Supervised Object Detectors from Scratch

intro: ICCV 2017. Fudan University & Tsinghua University & Intel Labs China
arxiv: https://arxiv.org/abs/1708.01241
github: https://github.com/szq0214/DSOD

Object Detection from Scratch with Deep Supervision

arxiv: https://arxiv.org/abs/1809.09294

##RetinaNet

Focal Loss for Dense Object Detection

intro: ICCV 2017 Best student paper award. Facebook AI Research
keywords: RetinaNet
arxiv: https://arxiv.org/abs/1708.02002

Focal Loss Dense Detector for Vehicle Surveillance

arxiv: https://arxiv.org/abs/1803.01114

CoupleNet: Coupling Global Structure with Local Parts for Object Detection

intro: ICCV 2017
arxiv: https://arxiv.org/abs/1708.02863

Incremental Learning of Object Detectors without Catastrophic Forgetting

intro: ICCV 2017. Inria
arxiv: https://arxiv.org/abs/1708.06977

Zoom Out-and-In Network with Map Attention Decision for Region Proposal and Object Detection

arxiv: https://arxiv.org/abs/1709.04347

StairNet: Top-Down Semantic Aggregation for Accurate One Shot Detection

arxiv: https://arxiv.org/abs/1709.05788

Dynamic Zoom-in Network for Fast Object Detection in Large Images

https://arxiv.org/abs/1711.05187

Zero-Annotation Object Detection with Web Knowledge Transfer

intro: NTU, Singapore & Amazon
keywords: multi-instance multi-label domain adaption learning framework
arxiv: https://arxiv.org/abs/1711.05954

MegDet

MegDet: A Large Mini-Batch Object Detector

intro: Peking University & Tsinghua University & Megvii Inc
arxiv: https://arxiv.org/abs/1711.07240

Single-Shot Refinement Neural Network for Object Detection

arxiv: https://arxiv.org/abs/1711.06897
github: https://github.com/sfzhang15/RefineDet
github: https://github.com/MTCloudVision/RefineDet-Mxnet

Receptive Field Block Net for Accurate and Fast Object Detection

intro: RFBNet
arxiv: https://arxiv.org/abs/1711.07767
github: https://github.com//ruinmessi/RFBNet

An Analysis of Scale Invariance in Object Detection - SNIP

intro: CVPR 2018
arxiv: https://arxiv.org/abs/1711.08189
github: https://github.com/bharatsingh430/snip

Feature Selective Networks for Object Detection

arxiv: https://arxiv.org/abs/1711.08879

Learning a Rotation Invariant Detector with Rotatable Bounding Box

arxiv: https://arxiv.org/abs/1711.09405
github(official, Caffe): https://github.com/liulei01/DRBox

Scalable Object Detection for Stylized Objects

intro: Microsoft AI & Research Munich
arxiv: https://arxiv.org/abs/1711.09822

Learning Object Detectors from Scratch with Gated Recurrent Feature Pyramids

arxiv: https://arxiv.org/abs/1712.00886
github: https://github.com/szq0214/GRP-DSOD

Deep Regionlets for Object Detection

keywords: region selection network, gating network
arxiv: https://arxiv.org/abs/1712.02408

Training and Testing Object Detectors with Virtual Images

intro: IEEE/CAA Journal of Automatica Sinica
arxiv: https://arxiv.org/abs/1712.08470

Large-Scale Object Discovery and Detector Adaptation from Unlabeled Video

keywords: object mining, object tracking, unsupervised object discovery by appearance-based clustering, self-supervised detector adaptation
arxiv: https://arxiv.org/abs/1712.08832

Spot the Difference by Object Detection

intro: Tsinghua University & JD Group
arxiv: https://arxiv.org/abs/1801.01051

Localization-Aware Active Learning for Object Detection

arxiv: https://arxiv.org/abs/1801.05124

Object Detection with Mask-based Feature Encoding

arxiv: https://arxiv.org/abs/1802.03934

LSTD: A Low-Shot Transfer Detector for Object Detection

intro: AAAI 2018
arxiv: https://arxiv.org/abs/1803.01529

Domain Adaptive Faster R-CNN for Object Detection in the Wild

intro: CVPR 2018. ETH Zurich & ESAT/PSI
arxiv: https://arxiv.org/abs/1803.03243
github(official. Caffe): https://github.com/yuhuayc/da-faster-rcnn

Pseudo Mask Augmented Object Detection

arxiv: https://arxiv.org/abs/1803.05858

Revisiting RCNN: On Awakening the Classification Power of Faster RCNN

intro: ECCV 2018
keywords: DCR V1
arxiv: https://arxiv.org/abs/1803.06799
github(official, MXNet): https://github.com/bowenc0221/Decoupled-Classification-Refinement

Decoupled Classification Refinement: Hard False Positive Suppression for Object Detection

keywords: DCR V2
arxiv: https://arxiv.org/abs/1810.04002
github(official, MXNet): https://github.com/bowenc0221/Decoupled-Classification-Refinement

Learning Region Features for Object Detection

intro: Peking University & MSRA
arxiv: https://arxiv.org/abs/1803.07066

Single-Shot Bidirectional Pyramid Networks for High-Quality Object Detection

intro: Singapore Management University & Zhejiang University
arxiv: https://arxiv.org/abs/1803.08208

Object Detection for Comics using Manga109 Annotations

intro: University of Tokyo & National Institute of Informatics, Japan
arxiv: https://arxiv.org/abs/1803.08670

Task-Driven Super Resolution: Object Detection in Low-resolution Images

arxiv: https://arxiv.org/abs/1803.11316

Transferring Common-Sense Knowledge for Object Detection

arxiv: https://arxiv.org/abs/1804.01077

Multi-scale Location-aware Kernel Representation for Object Detection

intro: CVPR 2018
arxiv: https://arxiv.org/abs/1804.00428
github: https://github.com/Hwang64/MLKP

Loss Rank Mining: A General Hard Example Mining Method for Real-time Detectors

intro: National University of Defense Technology
arxiv: https://arxiv.org/abs/1804.04606

DetNet: A Backbone network for Object Detection

intro: Tsinghua University & Megvii Inc
arxiv: https://arxiv.org/abs/1804.06215

Robust Physical Adversarial Attack on Faster R-CNN Object Detector

arxiv: https://arxiv.org/abs/1804.05810

AdvDetPatch: Attacking Object Detectors with Adversarial Patches

arxiv: https://arxiv.org/abs/1806.02299

Attacking Object Detectors via Imperceptible Patches on Background

https://arxiv.org/abs/1809.05966

Physical Adversarial Examples for Object Detectors

intro: WOOT 2018
arxiv: https://arxiv.org/abs/1807.07769

Quantization Mimic: Towards Very Tiny CNN for Object Detection

arxiv: https://arxiv.org/abs/1805.02152

Object detection at 200 Frames Per Second

intro: United Technologies Research Center-Ireland
arxiv: https://arxiv.org/abs/1805.06361

Object Detection using Domain Randomization and Generative Adversarial Refinement of Synthetic Images

intro: CVPR 2018 Deep Vision Workshop
arxiv: https://arxiv.org/abs/1805.11778

SNIPER: Efficient Multi-Scale Training

intro: University of Maryland
keywords: SNIPER (Scale Normalization for Image Pyramid with Efficient Resampling)
arxiv: https://arxiv.org/abs/1805.09300
github: https://github.com/mahyarnajibi/SNIPER

Soft Sampling for Robust Object Detection

arxiv: https://arxiv.org/abs/1806.06986

MetaAnchor: Learning to Detect Objects with Customized Anchors

intro: Megvii Inc (Face++) & Fudan University
arxiv: https://arxiv.org/abs/1807.00980

Localization Recall Precision (LRP): A New Performance Metric for Object Detection

intro: ECCV 2018. Middle East Technical University
arxiv: https://arxiv.org/abs/1807.01696
github: https://github.com/cancam/LRP

Auto-Context R-CNN

intro: Rejected by ECCV18
arxiv: https://arxiv.org/abs/1807.02842

Pooling Pyramid Network for Object Detection

intro: Google AI Perception
arxiv: https://arxiv.org/abs/1807.03284

Modeling Visual Context is Key to Augmenting Object Detection Datasets

intro: ECCV 2018
arxiv: https://arxiv.org/abs/1807.07428

Dual Refinement Network for Single-Shot Object Detection

arxiv: https://arxiv.org/abs/1807.08638

Acquisition of Localization Confidence for Accurate Object Detection

intro: ECCV 2018
arxiv: https://arxiv.org/abs/1807.11590
gihtub: https://github.com/vacancy/PreciseRoIPooling

CornerNet: Detecting Objects as Paired Keypoints

intro: ECCV 2018
keywords: IoU-Net, PreciseRoIPooling
arxiv: https://arxiv.org/abs/1808.01244
github: https://github.com/umich-vl/CornerNet

Unsupervised Hard Example Mining from Videos for Improved Object Detection

intro: ECCV 2018
arxiv: https://arxiv.org/abs/1808.04285

SAN: Learning Relationship between Convolutional Features for Multi-Scale Object Detection

arxiv: https://arxiv.org/abs/1808.04974

A Survey of Modern Object Detection Literature using Deep Learning

arxiv: https://arxiv.org/abs/1808.07256

Tiny-DSOD: Lightweight Object Detection for Resource-Restricted Usages

intro: BMVC 2018
arxiv: https://arxiv.org/abs/1807.11013
github: https://github.com/lyxok1/Tiny-DSOD

Deep Feature Pyramid Reconfiguration for Object Detection

intro: ECCV 2018
arxiv: https://arxiv.org/abs/1808.07993

MDCN: Multi-Scale, Deep Inception Convolutional Neural Networks for Efficient Object Detection

intro: ICPR 2018
arxiv: https://arxiv.org/abs/1809.01791

Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks

https://arxiv.org/abs/1809.03193

Deep Learning for Generic Object Detection: A Survey

https://arxiv.org/abs/1809.02165

Training Confidence-Calibrated Classifier for Detecting Out-of-Distribution Samples

intro: ICLR 2018
arxiv: https://github.com/alinlab/Confident_classifier

ScratchDet:Exploring to Train Single-Shot Object Detectors from Scratch

arxiv: https://arxiv.org/abs/1810.08425
github: https://github.com/KimSoybean/ScratchDet

Fast and accurate object detection in high resolution 4K and 8K video using GPUs

intro: Best Paper Finalist at IEEE High Performance Extreme Computing Conference (HPEC) 2018
intro: Carnegie Mellon University
arxiv: https://arxiv.org/abs/1810.10551

Hybrid Knowledge Routed Modules for Large-scale Object Detection

intro: NIPS 2018
arxiv: https://arxiv.org/abs/1810.12681
github(official, PyTorch): https://github.com/chanyn/HKRM

Gradient Harmonized Single-stage Detector

intro: AAAI 2019
arxiv: https://arxiv.org/abs/1811.05181

M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network

intro: AAAI 2019
arxiv: https://arxiv.org/abs/1811.04533
github: https://github.com/qijiezhao/M2Det

BAN: Focusing on Boundary Context for Object Detection

arxiv：https://arxiv.org/abs/1811.05243

Multi-layer Pruning Framework for Compressing Single Shot MultiBox Detector

intro: WACV 2019
arxiv: https://arxiv.org/abs/1811.08342

R2CNN++: Multi-Dimensional Attention Based Rotation Invariant Detector with Robust Anchor Strategy

arxiv: https://arxiv.org/abs/1811.07126
github: https://github.com/DetectionTeamUCAS/R2CNN-Plus-Plus_Tensorflow

DeRPN: Taking a further step toward more general object detection

intro: AAAI 2019
intro: South China University of Technology
arxiv: https://arxiv.org/abs/1811.06700
github: https://github.com/HCIILAB/DeRPN

Fast Efficient Object Detection Using Selective Attention

arxiv：https://arxiv.org/abs/1811.07502

Sampling Techniques for Large-Scale Object Detection from Sparsely Annotated Objects

arxiv：https://arxiv.org/abs/1811.10862

Efficient Coarse-to-Fine Non-Local Module for the Detection of Small Objects

arxiv：https://arxiv.org/abs/1811.12152

Deep Regionlets: Blended Representation and Deep Learning for Generic Object Detection

arxiv：https://arxiv.org/abs/1811.11318

Grid R-CNN

intro: SenseTime
arxiv: https://arxiv.org/abs/1811.12030

Transferable Adversarial Attacks for Image and Video Object Detection

-arxiv：https://arxiv.org/abs/1811.12641

Anchor Box Optimization for Object Detection

intro: University of Illinois at Urbana-Champaign & Microsoft Research
arxiv: https://arxiv.org/abs/1812.00469

AutoFocus: Efficient Multi-Scale Inference

intro: University of Maryland
arxiv: https://arxiv.org/abs/1812.01600

###Few-shot Object Detection via Feature Reweighting

arxiv：https://arxiv.org/abs/1812.01866

Practical Adversarial Attack Against Object Detector

arxiv：https://arxiv.org/abs/1812.10217

Learning Efficient Detector with Semi-supervised Adaptive Distillation

intro: SenseTime Research
arxiv: https://arxiv.org/abs/1901.00366
github: https://github.com/Tangshitao/Semi-supervised-Adaptive-Distillation

Non-Maximum Suppression (NMS)

End-to-End Integration of a Convolutional Network, Deformable Parts Model and Non-Maximum Suppression

intro: CVPR 2015
arxiv: http://arxiv.org/abs/1411.5309
paper: http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Wan_End-to-End_Integration_of_2015_CVPR_paper.pdf

A convnet for non-maximum suppression

arxiv: http://arxiv.org/abs/1511.06437
Improving Object Detection With One Line of Code

Soft-NMS – Improving Object Detection With One Line of Code

intro: ICCV 2017. University of Maryland
keywords: Soft-NMS
arxiv: https://arxiv.org/abs/1704.04503
github: https://github.com/bharatsingh430/soft-nms

Learning non-maximum suppression

intro: CVPR 2017
project page: https://www.mpi-inf.mpg.de/departments/computer-vision-and-multimodal-computing/research/object-recognition-and-scene-understanding/learning-nms/
arxiv: https://arxiv.org/abs/1705.02950
github: https://github.com/hosang/gossipnet

Relation Networks for Object Detection

intro: CVPR 2018 oral
arxiv: https://arxiv.org/abs/1711.11575
github(official, MXNet): https://github.com/msracver/Relation-Networks-for-Object-Detection

Adversarial Examples

Adversarial Examples that Fool Detectors

intro: University of Illinois
arxiv: https://arxiv.org/abs/1712.02494

Adversarial Examples Are Not Easily Detected: Bypassing Ten Detection Methods

project page: http://nicholas.carlini.com/code/nn_breaking_detection/
arxiv: https://arxiv.org/abs/1705.07263
github: https://github.com/carlini/nn_breaking_detection

Weakly Supervised Object Detection

Track and Transfer: Watching Videos to Simulate Strong Human Supervision for Weakly-Supervised Object Detection

intro: CVPR 2016
arxiv: http://arxiv.org/abs/1604.05766

Weakly supervised object detection using pseudo-strong labels

arxiv: http://arxiv.org/abs/1607.04731

Saliency Guided End-to-End Learning for Weakly Supervised Object Detection

intro: IJCAI 2017
arxiv: https://arxiv.org/abs/1706.06768

Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection

intro: TPAMI 2017. National Institutes of Health (NIH) Clinical Center
arxiv: https://arxiv.org/abs/1801.03145

Video Object Detection

Learning Object Class Detectors from Weakly Annotated Video

intro: CVPR 2012
paper: https://www.vision.ee.ethz.ch/publications/papers/proceedings/eth_biwi_00905.pdf

Analysing domain shift factors between videos and images for object detection

arxiv: https://arxiv.org/abs/1501.01186

Video Object Recognition

slides: http://vision.princeton.edu/courses/COS598/2015sp/slides/VideoRecog/Video Object Recognition.pptx

Deep Learning for Saliency Prediction in Natural Video

intro: Submitted on 12 Jan 2016
keywords: Deep learning, saliency map, optical flow, convolution network, contrast features
paper: https://hal.archives-ouvertes.fr/hal-01251614/document

T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos

intro: Winning solution in ILSVRC2015 Object Detection from Video(VID) Task
arxiv: http://arxiv.org/abs/1604.02532
github: https://github.com/myfavouritekk/T-CNN

Object Detection from Video Tubelets with Convolutional Neural Networks

intro: CVPR 2016 Spotlight paper
arxiv: https://arxiv.org/abs/1604.04053
paper: http://www.ee.cuhk.edu.hk/~wlouyang/Papers/KangVideoDet_CVPR16.pdf
gihtub: https://github.com/myfavouritekk/vdetlib

Object Detection in Videos with Tubelets and Multi-context Cues

intro: SenseTime Group
slides: http://www.ee.cuhk.edu.hk/~xgwang/CUvideo.pdf
slides: http://image-net.org/challenges/talks/Object Detection in Videos with Tubelets and Multi-context Cues - Final.pdf

Context Matters: Refining Object Detection in Video with Recurrent Neural Networks

intro: BMVC 2016
keywords: pseudo-labeler
arxiv: http://arxiv.org/abs/1607.04648
paper: http://vision.cornell.edu/se3/wp-content/uploads/2016/07/video_object_detection_BMVC.pdf

CNN Based Object Detection in Large Video Images

intro: WangTao @ 爱奇艺
keywords: object retrieval, object detection, scene classification
slides: http://on-demand.gputechconf.com/gtc/2016/presentation/s6362-wang-tao-cnn-based-object-detection-large-video-images.pdf

Object Detection in Videos with Tubelet Proposal Networks

arxiv: https://arxiv.org/abs/1702.06355

Flow-Guided Feature Aggregation for Video Object Detection

intro: MSRA
arxiv: https://arxiv.org/abs/1703.10025

Video Object Detection using Faster R-CNN

blog: http://andrewliao11.github.io/object_detection/faster_rcnn/
github: https://github.com/andrewliao11/py-faster-rcnn-imagenet

Improving Context Modeling for Video Object Detection and Tracking

http://image-net.org/challenges/talks_2017/ilsvrc2017_short(poster).pdf

Temporal Dynamic Graph LSTM for Action-driven Video Object Detection

intro: ICCV 2017
arxiv: https://arxiv.org/abs/1708.00666

Mobile Video Object Detection with Temporally-Aware Feature Maps

arxiv: https://arxiv.org/abs/1711.06368

Towards High Performance Video Object Detection

arxiv: https://arxiv.org/abs/1711.11577

Impression Network for Video Object Detection

arxiv: https://arxiv.org/abs/1712.05896

Spatial-Temporal Memory Networks for Video Object Detection

arxiv: https://arxiv.org/abs/1712.06317

3D-DETNet: a Single Stage Video-Based Vehicle Detector

arxiv: https://arxiv.org/abs/1801.01769

Object Detection in Videos by Short and Long Range Object Linking

arxiv: https://arxiv.org/abs/1801.09823

Object Detection in Video with Spatiotemporal Sampling Networks

intro: University of Pennsylvania, 2Dartmouth College
arxiv: https://arxiv.org/abs/1803.05549

Towards High Performance Video Object Detection for Mobiles

intro: Microsoft Research Asia
arxiv: https://arxiv.org/abs/1804.05830

Optimizing Video Object Detection via a Scale-Time Lattice

intro: CVPR 2018
project page: http://mmlab.ie.cuhk.edu.hk/projects/ST-Lattice/
arxiv: https://arxiv.org/abs/1804.05472
github: https://github.com/hellock/scale-time-lattice

Pack and Detect: Fast Object Detection in Videos Using Region-of-Interest Packing

https://arxiv.org/abs/1809.01701

Fast Object Detection in Compressed Video

arxiv：https://arxiv.org/abs/1811.11057

Tube-CNN: Modeling temporal evolution of appearance for object detection in video

intro: INRIA/ENS
arxiv: https://arxiv.org/abs/1812.02619

Object Detection on Mobile Devices

Pelee: A Real-Time Object Detection System on Mobile Devices

intro: ICLR 2018 workshop track
intro: based on the SSD
arxiv: https://arxiv.org/abs/1804.06882
github: https://github.com/Robert-JunWang/Pelee

Object Detection in 3D

Vote3Deep: Fast Object Detection in 3D Point Clouds Using Efficient Convolutional Neural Networks

arxiv: https://arxiv.org/abs/1609.06666

Complex-YOLO: Real-time 3D Object Detection on Point Clouds

intro: Valeo Schalter und Sensoren GmbH & Ilmenau University of Technology
arxiv: https://arxiv.org/abs/1803.06199

Focal Loss in 3D Object Detection

arxiv: https://arxiv.org/abs/1809.06065
github: https://github.com/pyun-ram/FL3D

Object Detection on RGB-D

Learning Rich Features from RGB-D Images for Object Detection and Segmentation

arxiv: http://arxiv.org/abs/1407.5736

Differential Geometry Boosts Convolutional Neural Networks for Object Detection

intro: CVPR 2016
paper: http://www.cv-foundation.org/openaccess/content_cvpr_2016_workshops/w23/html/Wang_Differential_Geometry_Boosts_CVPR_2016_paper.html

A Self-supervised Learning System for Object Detection using Physics Simulation and Multi-view Pose Estimation

arxiv: https://arxiv.org/abs/1703.03347

Zero-Shot Object Detection

Zero-Shot Detection

intro: Australian National University
keywords: YOLO
arxiv: https://arxiv.org/abs/1803.07113

Zero-Shot Object Detection

arxiv: https://arxiv.org/abs/1804.04340

Zero-Shot Object Detection: Learning to Simultaneously Recognize and Localize Novel Concepts

intro: Australian National University
arxiv: https://arxiv.org/abs/1803.06049

Zero-Shot Object Detection by Hybrid Region Embedding

intro: Middle East Technical University & Hacettepe University
arxiv: https://arxiv.org/abs/1805.06157

Salient Object Detection

This task involves predicting the salient regions of an image given by human eye fixations.

Best Deep Saliency Detection Models (CVPR 2016 & 2015)

page: http://i.cs.hku.hk/~yzyu/vision.html

Large-scale optimization of hierarchical features for saliency prediction in natural images

paper: http://coxlab.org/pdfs/cvpr2014_vig_saliency.pdf

Predicting Eye Fixations using Convolutional Neural Networks

paper: http://www.escience.cn/system/file?fileId=72648

Saliency Detection by Multi-Context Deep Learning

paper: http://www.cv-foundation.org/openaccess/content_cvpr_2015/papers/Zhao_Saliency_Detection_by_2015_CVPR_paper.pdf

DeepSaliency: Multi-Task Deep Neural Network Model for Salient Object Detection

arxiv: http://arxiv.org/abs/1510.05484

SuperCNN: A Superpixelwise Convolutional Neural Network for Salient Object Detection

paper: www.shengfenghe.com/supercnn-a-superpixelwise-convolutional-neural-network-for-salient-object-detection.html

Shallow and Deep Convolutional Networks for Saliency Prediction

intro: CVPR 2016
arxiv: http://arxiv.org/abs/1603.00845
github: https://github.com/imatge-upc/saliency-2016-cvpr

Recurrent Attentional Networks for Saliency Detection

intro: CVPR 2016. recurrent attentional convolutional-deconvolution network (RACDNN)
arxiv: http://arxiv.org/abs/1604.03227

Two-Stream Convolutional Networks for Dynamic Saliency Prediction

arxiv: http://arxiv.org/abs/1607.04730

Unconstrained Salient Object Detection

Unconstrained Salient Object Detection via Proposal Subset Optimization

intro: CVPR 2016
project page: http://cs-people.bu.edu/jmzhang/sod.html
paper: http://cs-people.bu.edu/jmzhang/SOD/CVPR16SOD_camera_ready.pdf
github: https://github.com/jimmie33/SOD
caffe model zoo: https://github.com/BVLC/caffe/wiki/Model-Zoo#cnn-object-proposal-models-for-salient-object-detection

DHSNet: Deep Hierarchical Saliency Network for Salient Object Detection

paper: http://www.cv-foundation.org/openaccess/content_cvpr_2016/papers/Liu_DHSNet_Deep_Hierarchical_CVPR_2016_paper.pdf

Salient Object Subitizing

intro: CVPR 2015
intro: predicting the existence and the number of salient objects in an image using holistic cues
project page: http://cs-people.bu.edu/jmzhang/sos.html
arxiv: http://arxiv.org/abs/1607.07525
paper: http://cs-people.bu.edu/jmzhang/SOS/SOS_preprint.pdf
caffe model zoo: https://github.com/BVLC/caffe/wiki/Model-Zoo#cnn-models-for-salient-object-subitizing

Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection

intro: ACMMM 2016. deeply-supervised recurrent convolutional neural network (DSRCNN)
arxiv: http://arxiv.org/abs/1608.05177

Saliency Detection via Combining Region-Level and Pixel-Level Predictions with CNNs

intro: ECCV 2016
arxiv: http://arxiv.org/abs/1608.05186

Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection

arxiv: http://arxiv.org/abs/1608.08029

A Deep Multi-Level Network for Saliency Prediction

arxiv: http://arxiv.org/abs/1609.01064

Visual Saliency Detection Based on Multiscale Deep CNN Features

intro: IEEE Transactions on Image Processing
arxiv: http://arxiv.org/abs/1609.02077

A Deep Spatial Contextual Long-term Recurrent Convolutional Network for Saliency Detection

intro: DSCLRCN
arxiv: https://arxiv.org/abs/1610.01708

Deeply supervised salient object detection with short connections

intro: IEEE TPAMI 2018 (IEEE CVPR 2017)
arxiv: https://arxiv.org/abs/1611.04849
github(official, Caffe): https://github.com/Andrew-Qibin/DSS
github(Tensorflow): https://github.com/Joker316701882/Salient-Object-Detection

Weakly Supervised Top-down Salient Object Detection

intro: Nanyang Technological University
arxiv: https://arxiv.org/abs/1611.05345

SalGAN: Visual Saliency Prediction with Generative Adversarial Networks

project page: https://imatge-upc.github.io/saliency-salgan-2017/
arxiv: https://arxiv.org/abs/1701.01081

Visual Saliency Prediction Using a Mixture of Deep Neural Networks

arxiv: https://arxiv.org/abs/1702.00372

A Fast and Compact Salient Score Regression Network Based on Fully Convolutional Network

arxiv: https://arxiv.org/abs/1702.00615

Saliency Detection by Forward and Backward Cues in Deep-CNNs

arxiv: https://arxiv.org/abs/1703.00152

Supervised Adversarial Networks for Image Saliency Detection

arxiv: https://arxiv.org/abs/1704.07242

Group-wise Deep Co-saliency Detection

arxiv: https://arxiv.org/abs/1707.07381

Towards the Success Rate of One: Real-time Unconstrained Salient Object Detection

intro: University of Maryland College Park & eBay Inc
arxiv: https://arxiv.org/abs/1708.00079

Amulet: Aggregating Multi-level Convolutional Features for Salient Object Detection

intro: ICCV 2017
arixv: https://arxiv.org/abs/1708.02001

Learning Uncertain Convolutional Features for Accurate Saliency Detection

intro: Accepted as a poster in ICCV 2017
arxiv: https://arxiv.org/abs/1708.02031

Deep Edge-Aware Saliency Detection

arxiv: https://arxiv.org/abs/1708.04366

Self-explanatory Deep Salient Object Detection

intro: National University of Defense Technology, China & National University of Singapore
arxiv: https://arxiv.org/abs/1708.05595

PiCANet: Learning Pixel-wise Contextual Attention in ConvNets and Its Application in Saliency Detection

arxiv: https://arxiv.org/abs/1708.06433

DeepFeat: A Bottom Up and Top Down Saliency Model Based on Deep Features of Convolutional Neural Nets

arxiv: https://arxiv.org/abs/1709.02495

Recurrently Aggregating Deep Features for Salient Object Detection

intro: AAAI 2018
paper: https://www.aaai.org/ocs/index.php/AAAI/AAAI18/paper/view/16775/16281

Deep saliency: What is learnt by a deep network about saliency?

intro: 2nd Workshop on Visualisation for Deep Learning in the 34th International Conference On Machine Learning
arxiv: https://arxiv.org/abs/1801.04261

Contrast-Oriented Deep Neural Networks for Salient Object Detection

intro: TNNLS
arxiv: https://arxiv.org/abs/1803.11395

Salient Object Detection by Lossless Feature Reflection

intro: IJCAI 2018
arxiv: https://arxiv.org/abs/1802.06527

HyperFusion-Net: Densely Reflective Fusion for Salient Object Detection

arxiv: https://arxiv.org/abs/1804.05142

Video Saliency Detection

Deep Learning For Video Saliency Detection

arxiv: https://arxiv.org/abs/1702.00871

Video Salient Object Detection Using Spatiotemporal Deep Features

arxiv: https://arxiv.org/abs/1708.01447

Predicting Video Saliency with Object-to-Motion CNN and Two-layer Convolutional LSTM

arxiv: https://arxiv.org/abs/1709.06316

Visual Relationship Detection

Visual Relationship Detection with Language Priors

intro: ECCV 2016 oral
paper: https://cs.stanford.edu/people/ranjaykrishna/vrd/vrd.pdf
github: https://github.com/Prof-Lu-Cewu/Visual-Relationship-Detection

ViP-CNN: A Visual Phrase Reasoning Convolutional Neural Network for Visual Relationship Detection

intro: Visual Phrase reasoning Convolutional Neural Network (ViP-CNN), Visual Phrase Reasoning Structure (VPRS)
arxiv: https://arxiv.org/abs/1702.07191

Visual Translation Embedding Network for Visual Relation Detection

arxiv: https://www.arxiv.org/abs/1702.08319

Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection

intro: CVPR 2017 spotlight paper
arxiv: https://arxiv.org/abs/1703.03054

Detecting Visual Relationships with Deep Relational Networks

intro: CVPR 2017 oral. The Chinese University of Hong Kong
arxiv: https://arxiv.org/abs/1704.03114

Identifying Spatial Relations in Images using Convolutional Neural Networks

arxiv: https://arxiv.org/abs/1706.04215

PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN

intro: ICCV
arxiv: https://arxiv.org/abs/1708.01956

Natural Language Guided Visual Relationship Detection

arxiv: https://arxiv.org/abs/1711.06032

Detecting Visual Relationships Using Box Attention

intro: Google AI & IST Austria
arxiv: https://arxiv.org/abs/1807.02136

Google AI Open Images - Visual Relationship Track

intro: Detect pairs of objects in particular relationships
kaggle: https://www.kaggle.com/c/google-ai-open-images-visual-relationship-track

Context-Dependent Diffusion Network for Visual Relationship Detection

intro: 2018 ACM Multimedia Conference
arxiv: https://arxiv.org/abs/1809.06213

A Problem Reduction Approach for Visual Relationships Detection

intro: ECCV 2018 Workshop
arxiv: https://arxiv.org/abs/1809.09828

Face Deteciton

Multi-view Face Detection Using Deep Convolutional Neural Networks

intro: Yahoo
arxiv: http://arxiv.org/abs/1502.02766
github: https://github.com/guoyilin/FaceDetection_CNN

From Facial Parts Responses to Face Detection: A Deep Learning Approach

intro: ICCV 2015. CUHK
project page: http://personal.ie.cuhk.edu.hk/~ys014/projects/Faceness/Faceness.html
arxiv: https://arxiv.org/abs/1509.06451
paper: http://www.cv-foundation.org/openaccess/content_iccv_2015/papers/Yang_From_Facial_Parts_ICCV_2015_paper.pdf

Compact Convolutional Neural Network Cascade for Face Detection

arxiv: http://arxiv.org/abs/1508.01292
github: https://github.com/Bkmz21/FD-Evaluation
github: https://github.com/Bkmz21/CompactCNNCascade

Face Detection with End-to-End Integration of a ConvNet and a 3D Model

intro: ECCV 2016
arxiv: https://arxiv.org/abs/1606.00850
github(MXNet): https://github.com/tfwu/FaceDetection-ConvNet-3D

CMS-RCNN: Contextual Multi-Scale Region-based CNN for Unconstrained Face Detection

intro: CMU
arxiv: https://arxiv.org/abs/1606.05413

Towards a Deep Learning Framework for Unconstrained Face Detection

intro: overlap with CMS-RCNN
arxiv: https://arxiv.org/abs/1612.05322

Supervised Transformer Network for Efficient Face Detection

arxiv: http://arxiv.org/abs/1607.05477

UnitBox: An Advanced Object Detection Network

intro: ACM MM 2016
keywords: IOULoss
arxiv: http://arxiv.org/abs/1608.01471

Bootstrapping Face Detection with Hard Negative Examples

author: 万韶华 @ 小米.
intro: Faster R-CNN, hard negative mining. state-of-the-art on the FDDB dataset
arxiv: http://arxiv.org/abs/1608.02236

Grid Loss: Detecting Occluded Faces

intro: ECCV 2016
arxiv: https://arxiv.org/abs/1609.00129
paper: http://lrs.icg.tugraz.at/pubs/opitz_eccv_16.pdf
poster: http://www.eccv2016.org/files/posters/P-2A-34.pdf

A Multi-Scale Cascade Fully Convolutional Network Face Detector

intro: ICPR 2016
arxiv: http://arxiv.org/abs/1609.03536

MTCNN

Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Neural Networks

project page: https://kpzhang93.github.io/MTCNN_face_detection_alignment/index.html
arxiv: https://arxiv.org/abs/1604.02878
github(official, Matlab): https://github.com/kpzhang93/MTCNN_face_detection_alignment
github: https://github.com/pangyupo/mxnet_mtcnn_face_detection
github: https://github.com/DaFuCoding/MTCNN_Caffe
github(MXNet): https://github.com/Seanlinx/mtcnn
github: https://github.com/Pi-DeepLearning/RaspberryPi-FaceDetection-MTCNN-Caffe-With-Motion
github(Caffe): https://github.com/foreverYoungGitHub/MTCNN
github: https://github.com/CongWeilin/mtcnn-caffe
github(OpenCV+OpenBlas): https://github.com/AlphaQi/MTCNN-light
github(Tensorflow+golang): https://github.com/jdeng/goface

Face Detection using Deep Learning: An Improved Faster RCNN Approach

intro: DeepIR Inc
arxiv: https://arxiv.org/abs/1701.08289

Faceness-Net: Face Detection through Deep Facial Part Responses

intro: An extended version of ICCV 2015 paper
arxiv: https://arxiv.org/abs/1701.08393

Multi-Path Region-Based Convolutional Neural Network for Accurate Detection of Unconstrained “Hard Faces”

intro: CVPR 2017. MP-RCNN, MP-RPN
arxiv: https://arxiv.org/abs/1703.09145

End-To-End Face Detection and Recognition

arxiv: https://arxiv.org/abs/1703.10818

Face R-CNN

arxiv: https://arxiv.org/abs/1706.01061

Face Detection through Scale-Friendly Deep Convolutional Networks

arxiv: https://arxiv.org/abs/1706.02863

Scale-Aware Face Detection

intro: CVPR 2017. SenseTime & Tsinghua University
arxiv: https://arxiv.org/abs/1706.09876

Detecting Faces Using Inside Cascaded Contextual CNN

intro: CVPR 2017. Tencent AI Lab & SenseTime
paper: http://ai.tencent.com/ailab/media/publications/Detecting_Faces_Using_Inside_Cascaded_Contextual_CNN.pdf

Multi-Branch Fully Convolutional Network for Face Detection

arxiv: https://arxiv.org/abs/1707.06330

SSH: Single Stage Headless Face Detector

intro: ICCV 2017. University of Maryland
arxiv: https://arxiv.org/abs/1708.03979
github(official, Caffe): https://github.com/mahyarnajibi/SSH

Dockerface: an easy to install and use Faster R-CNN face detector in a Docker container

arxiv: https://arxiv.org/abs/1708.04370

FaceBoxes: A CPU Real-time Face Detector with High Accuracy

intro: IJCB 2017
keywords: Rapidly Digested Convolutional Layers (RDCL), Multiple Scale Convolutional Layers (MSCL)
intro: the proposed detector runs at 20 FPS on a single CPU core and 125 FPS using a GPU for VGA-resolution images
arxiv: https://arxiv.org/abs/1708.05234
github(Caffe): https://github.com/zeusees/FaceBoxes

S3FD: Single Shot Scale-invariant Face Detector

intro: ICCV 2017. Chinese Academy of Sciences
intro: can run at 36 FPS on a Nvidia Titan X (Pascal) for VGA-resolution images
arxiv: https://arxiv.org/abs/1708.05237
github(Caffe, official): https://github.com/sfzhang15/SFD
github: https://github.com//clcarwin/SFD_pytorch

Detecting Faces Using Region-based Fully Convolutional Networks

arxiv: https://arxiv.org/abs/1709.05256

AffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection

arxiv: https://arxiv.org/abs/1709.07326

Face Attention Network: An effective Face Detector for the Occluded Faces

arxiv: https://arxiv.org/abs/1711.07246

Feature Agglomeration Networks for Single Stage Face Detection

arxiv: https://arxiv.org/abs/1712.00721

Face Detection Using Improved Faster RCNN

intro: Huawei Cloud BU
arxiv: https://arxiv.org/abs/1802.02142

PyramidBox: A Context-assisted Single Shot Face Detector

intro: Baidu, Inc
arxiv: https://arxiv.org/abs/1803.07737

A Fast Face Detection Method via Convolutional Neural Network

intro: Neurocomputing
arxiv: https://arxiv.org/abs/1803.10103

Beyond Trade-off: Accelerate FCN-based Face Detector with Higher Accuracy

intro: CVPR 2018. Beihang University & CUHK & Sensetime
arxiv: https://arxiv.org/abs/1804.05197

Real-Time Rotation-Invariant Face Detection with Progressive Calibration Networks

intro: CVPR 2018
arxiv: https://arxiv.org/abs/1804.06039
github: https://github.com/Jack-CV/PCN

SFace: An Efficient Network for Face Detection in Large Scale Variations

intro: Beihang University & Megvii Inc. (Face++)
arxiv: https://arxiv.org/abs/1804.06559

Survey of Face Detection on Low-quality Images

arxiv: https://arxiv.org/abs/1804.07362

Anchor Cascade for Efficient Face Detection

intro: The University of Sydney
arxiv: https://arxiv.org/abs/1805.03363

Adversarial Attacks on Face Detectors using Neural Net based Constrained Optimization

intro: IEEE MMSP
arxiv: https://arxiv.org/abs/1805.12302

Selective Refinement Network for High Performance Face Detection

https://arxiv.org/abs/1809.02693

DSFD: Dual Shot Face Detector

arxiv：https://arxiv.org/abs/1810.10220

Learning Better Features for Face Detection with Feature Fusion and Segmentation Supervision

arxiv：https://arxiv.org/abs/1811.08557

FA-RPN: Floating Region Proposals for Face Detection

arxiv: https://arxiv.org/abs/1812.05586

Detect Small Faces

Finding Tiny Faces

intro: CVPR 2017. CMU
project page: http://www.cs.cmu.edu/~peiyunh/tiny/index.html
arxiv: https://arxiv.org/abs/1612.04402
github(official, Matlab): https://github.com/peiyunh/tiny
github(inference-only): https://github.com/chinakook/hr101_mxnet
github: https://github.com/cydonia999/Tiny_Faces_in_Tensorflow

Detecting and counting tiny faces

intro: ENS Paris-Saclay. ExtendedTinyFaces
intro: Detecting and counting small objects - Analysis, review and application to counting
arxiv: https://arxiv.org/abs/1801.06504
github: https://github.com/alexattia/ExtendedTinyFaces

Seeing Small Faces from Robust Anchor’s Perspective

intro: CVPR 2018
arxiv: https://arxiv.org/abs/1802.09058

Face-MagNet: Magnifying Feature Maps to Detect Small Faces

intro: WACV 2018
keywords: Face Magnifier Network (Face-MageNet)
arxiv: https://arxiv.org/abs/1803.05258
github: https://github.com/po0ya/face-magnet

Robust Face Detection via Learning Small Faces on Hard Images

intro: Johns Hopkins University & Stanford University
arxiv: https://arxiv.org/abs/1811.11662
github: https://github.com/bairdzhang/smallhardface

SFA: Small Faces Attention Face Detector

intro: Jilin University
arxiv: https://arxiv.org/abs/1812.08402

Person Head Detection

Context-aware CNNs for person head detection

intro: ICCV 2015
project page: http://www.di.ens.fr/willow/research/headdetection/
arxiv: http://arxiv.org/abs/1511.07917
github: https://github.com/aosokin/cnn_head_detection

Detecting Heads using Feature Refine Net and Cascaded Multi-scale Architecture

arxiv: https://arxiv.org/abs/1803.09256

A Comparison of CNN-based Face and Head Detectors for Real-Time Video Surveillance Applications

https://arxiv.org/abs/1809.03336

FCHD: A fast and accurate head detector

arxiv: https://arxiv.org/abs/1809.08766
github(PyTorch, official): https://github.com/aditya-vora/FCHD-Fully-Convolutional-Head-Detector

Pedestrian Detection / People Detection

Pedestrian Detection aided by Deep Learning Semantic Tasks

intro: CVPR 2015
project page: http://mmlab.ie.cuhk.edu.hk/projects/TA-CNN/
arxiv: http://arxiv.org/abs/1412.0069

Deep Learning Strong Parts for Pedestrian Detection

intro: ICCV 2015. CUHK. DeepParts
intro: Achieving 11.89% average miss rate on Caltech Pedestrian Dataset
paper: http://personal.ie.cuhk.edu.hk/~pluo/pdf/tianLWTiccv15.pdf

Taking a Deeper Look at Pedestrians

intro: CVPR 2015
arxiv: https://arxiv.org/abs/1501.05790

Convolutional Channel Features

intro: ICCV 2015
arxiv: https://arxiv.org/abs/1504.07339
github: https://github.com/byangderek/CCF

End-to-end people detection in crowded scenes

arxiv: http://arxiv.org/abs/1506.04878
github: https://github.com/Russell91/reinspect
ipn: http://nbviewer.ipython.org/github/Russell91/ReInspect/blob/master/evaluation_reinspect.ipynb
youtube: https://www.youtube.com/watch?v=QeWl0h3kQ24

Learning Complexity-Aware Cascades for Deep Pedestrian Detection

intro: ICCV 2015
arxiv: https://arxiv.org/abs/1507.05348

Deep convolutional neural networks for pedestrian detection

arxiv: http://arxiv.org/abs/1510.03608
github: https://github.com/DenisTome/DeepPed

Scale-aware Fast R-CNN for Pedestrian Detection

arxiv: https://arxiv.org/abs/1510.08160

New algorithm improves speed and accuracy of pedestrian detection

blog: http://www.eurekalert.org/pub_releases/2016-02/uoc–nai020516.php

Pushing the Limits of Deep CNNs for Pedestrian Detection

intro: “set a new record on the Caltech pedestrian dataset, lowering the log-average miss rate from 11.7% to 8.9%”
arxiv: http://arxiv.org/abs/1603.04525

A Real-Time Deep Learning Pedestrian Detector for Robot Navigation

arxiv: http://arxiv.org/abs/1607.04436

A Real-Time Pedestrian Detector using Deep Learning for Human-Aware Navigation

arxiv: http://arxiv.org/abs/1607.04441

Is Faster R-CNN Doing Well for Pedestrian Detection?

intro: ECCV 2016
arxiv: http://arxiv.org/abs/1607.07032
github: https://github.com/zhangliliang/RPN_BF/tree/RPN-pedestrian

Unsupervised Deep Domain Adaptation for Pedestrian Detection

intro: ECCV Workshop 2016
arxiv: https://arxiv.org/abs/1802.03269

Reduced Memory Region Based Deep Convolutional Neural Network Detection

intro: IEEE 2016 ICCE-Berlin
arxiv: http://arxiv.org/abs/1609.02500

Fused DNN: A deep neural network fusion approach to fast and robust pedestrian detection

arxiv: https://arxiv.org/abs/1610.03466

Detecting People in Artwork with CNNs

intro: ECCV 2016 Workshops
arxiv: https://arxiv.org/abs/1610.08871

Multispectral Deep Neural Networks for Pedestrian Detection

intro: BMVC 2016 oral
arxiv: https://arxiv.org/abs/1611.02644

Deep Multi-camera People Detection

arxiv: https://arxiv.org/abs/1702.04593

Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters

intro: CVPR 2017
project page: http://ml.cs.tsinghua.edu.cn:5000/publications/synunity/
arxiv: https://arxiv.org/abs/1703.06283
github(Tensorflow): https://github.com/huangshiyu13/RPNplus

What Can Help Pedestrian Detection?

intro: CVPR 2017. Tsinghua University & Peking University & Megvii Inc.
keywords: Faster R-CNN, HyperLearner
arxiv: https://arxiv.org/abs/1705.02757
paper: http://openaccess.thecvf.com/content_cvpr_2017/papers/Mao_What_Can_Help_CVPR_2017_paper.pdf

Illuminating Pedestrians via Simultaneous Detection & Segmentation

arxiv: https://arxiv.org/abs/1706.08564

Rotational Rectification Network for Robust Pedestrian Detection

intro: CMU & Volvo Construction
arxiv: https://arxiv.org/abs/1706.08917

STD-PD: Generating Synthetic Training Data for Pedestrian Detection in Unannotated Videos

intro: The University of North Carolina at Chapel Hill
arxiv: https://arxiv.org/abs/1707.09100

Too Far to See? Not Really! — Pedestrian Detection with Scale-aware Localization Policy

arxiv: https://arxiv.org/abs/1709.00235

Repulsion Loss: Detecting Pedestrians in a Crowd

arxiv: https://arxiv.org/abs/1711.07752

Aggregated Channels Network for Real-Time Pedestrian Detection

arxiv: https://arxiv.org/abs/1801.00476

Illumination-aware Faster R-CNN for Robust Multispectral Pedestrian Detection

intro: State Key Lab of CAD&CG, Zhejiang University
arxiv: https://arxiv.org/abs/1803.05347

Exploring Multi-Branch and High-Level Semantic Networks for Improving Pedestrian Detection

arxiv: https://arxiv.org/abs/1804.00872

Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and Beyond

arxiv: https://arxiv.org/abs/1804.02047

PCN: Part and Context Information for Pedestrian Detection with CNNs

intro: British Machine Vision Conference(BMVC) 2017
arxiv: https://arxiv.org/abs/1804.04483

Small-scale Pedestrian Detection Based on Somatic Topology Localization and Temporal Feature Aggregation

intro: ECCV 2018. Hikvision Research Institute
arxiv: https://arxiv.org/abs/1807.01438

Occlusion-aware R-CNN: Detecting Pedestrians in a Crowd

intro: ECCV 2018
arxiv: https://arxiv.org/abs/1807.08407

Multispectral Pedestrian Detection via Simultaneous Detection and Segmentation

intro: BMVC 2018
arxiv: https://arxiv.org/abs/1808.04818

Pedestrian Detection with Autoregressive Network Phases

intro: Michigan State University
arxiv: https://arxiv.org/abs/1812.00440

Vehicle Detection

DAVE: A Unified Framework for Fast Vehicle Detection and Annotation

intro: ECCV 2016
arxiv: http://arxiv.org/abs/1607.04564

Evolving Boxes for fast Vehicle Detection

arxiv: https://arxiv.org/abs/1702.00254

Fine-Grained Car Detection for Visual Census Estimation

intro: AAAI 2016
arxiv: https://arxiv.org/abs/1709.02480

SINet: A Scale-insensitive Convolutional Neural Network for Fast Vehicle Detection

intro: IEEE Transactions on Intelligent Transportation Systems (T-ITS)
arxiv: https://arxiv.org/abs/1804.00433

Label and Sample: Efficient Training of Vehicle Object Detector from Sparsely Labeled Data

intro: UC Berkeley
arxiv: https://arxiv.org/abs/1808.08603

Domain Randomization for Scene-Specific Car Detection and Pose Estimation

arxiv：https://arxiv.org/abs/1811.05939

ShuffleDet: Real-Time Vehicle Detection Network in On-board Embedded UAV Imagery

intro: ECCV 2018, UAVision 2018
arxiv: https://arxiv.org/abs/1811.06318

Traffic-Sign Detection

Traffic-Sign Detection and Classification in the Wild

intro: CVPR 2016
project page(code+dataset): http://cg.cs.tsinghua.edu.cn/traffic-sign/
paper: http://www.cv-foundation.org/openaccess/content_cvpr_2016/papers/Zhu_Traffic-Sign_Detection_and_CVPR_2016_paper.pdf
code & model: http://cg.cs.tsinghua.edu.cn/traffic-sign/data_model_code/newdata0411.zip

Evaluating State-of-the-art Object Detector on Challenging Traffic Light Data

intro: CVPR 2017 workshop
paper: http://openaccess.thecvf.com/content_cvpr_2017_workshops/w9/papers/Jensen_Evaluating_State-Of-The-Art_Object_CVPR_2017_paper.pdf

Detecting Small Signs from Large Images

intro: IEEE Conference on Information Reuse and Integration (IRI) 2017 oral
arxiv: https://arxiv.org/abs/1706.08574

Localized Traffic Sign Detection with Multi-scale Deconvolution Networks

arxiv: https://arxiv.org/abs/1804.10428

Detecting Traffic Lights by Single Shot Detection

intro: ITSC 2018
arxiv: https://arxiv.org/abs/1805.02523

A Hierarchical Deep Architecture and Mini-Batch Selection Method For Joint Traffic Sign and Light Detection

intro: IEEE 15th Conference on Computer and Robot Vision
arxiv: https://arxiv.org/abs/1806.07987
demo: https://www.youtube.com/watch?v=_YmogPzBXOw&feature=youtu.be

Skeleton Detection

Object Skeleton Extraction in Natural Images by Fusing Scale-associated Deep Side Outputs

arxiv: http://arxiv.org/abs/1603.09446
github: https://github.com/zeakey/DeepSkeleton

DeepSkeleton: Learning Multi-task Scale-associated Deep Side Outputs for Object Skeleton Extraction in Natural Images

arxiv: http://arxiv.org/abs/1609.03659

SRN: Side-output Residual Network for Object Symmetry Detection in the Wild

intro: CVPR 2017
arxiv: https://arxiv.org/abs/1703.02243
github: https://github.com/KevinKecc/SRN

Hi-Fi: Hierarchical Feature Integration for Skeleton Detection

arxiv: https://arxiv.org/abs/1801.01849

Fruit Detection

Deep Fruit Detection in Orchards

arxiv: https://arxiv.org/abs/1610.03677

Image Segmentation for Fruit Detection and Yield Estimation in Apple Orchards

intro: The Journal of Field Robotics in May 2016
project page: http://confluence.acfr.usyd.edu.au/display/AGPub/
arxiv: https://arxiv.org/abs/1610.08120

Shadow Detection

Fast Shadow Detection from a Single Image Using a Patched Convolutional Neural Network

arxiv: https://arxiv.org/abs/1709.09283

A+D-Net: Shadow Detection with Adversarial Shadow Attenuation

arxiv: https://arxiv.org/abs/1712.01361

Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal

arxiv: https://arxiv.org/abs/1712.02478

Direction-aware Spatial Context Features for Shadow Detection

intro: CVPR 2018
arxiv: https://arxiv.org/abs/1712.04142

Direction-aware Spatial Context Features for Shadow Detection and Removal

intro: The Chinese University of Hong Kong & The Hong Kong Polytechnic University
arxiv: https://arxiv.org/abs/1805.04635

Others Detection

Deep Deformation Network for Object Landmark Localization

arxiv: http://arxiv.org/abs/1605.01014

Fashion Landmark Detection in the Wild

intro: ECCV 2016
project page: http://personal.ie.cuhk.edu.hk/~lz013/projects/FashionLandmarks.html
arxiv: http://arxiv.org/abs/1608.03049
github(Caffe): https://github.com/liuziwei7/fashion-landmarks

Deep Learning for Fast and Accurate Fashion Item Detection

intro: Kuznech Inc.
intro: MultiBox and Fast R-CNN
paper: https://kddfashion2016.mybluemix.net/kddfashion_finalSubmissions/Deep Learning for Fast and Accurate Fashion Item Detection.pdf

OSMDeepOD - OSM and Deep Learning based Object Detection from Aerial Imagery (formerly known as “OSM-Crosswalk-Detection”)

github: https://github.com/geometalab/OSMDeepOD

Selfie Detection by Synergy-Constraint Based Convolutional Neural Network

intro: IEEE SITIS 2016
arxiv: https://arxiv.org/abs/1611.04357

Associative Embedding:End-to-End Learning for Joint Detection and Grouping

arxiv: https://arxiv.org/abs/1611.05424

Deep Cuboid Detection: Beyond 2D Bounding Boxes

intro: CMU & Magic Leap
arxiv: https://arxiv.org/abs/1611.10010

Automatic Model Based Dataset Generation for Fast and Accurate Crop and Weeds Detection

arxiv: https://arxiv.org/abs/1612.03019

Deep Learning Logo Detection with Data Expansion by Synthesising Context

arxiv: https://arxiv.org/abs/1612.09322

Scalable Deep Learning Logo Detection

arxiv: https://arxiv.org/abs/1803.11417

Pixel-wise Ear Detection with Convolutional Encoder-Decoder Networks

arxiv: https://arxiv.org/abs/1702.00307

Automatic Handgun Detection Alarm in Videos Using Deep Learning

arxiv: https://arxiv.org/abs/1702.05147
results: https://github.com/SihamTabik/Pistol-Detection-in-Videos

Objects as context for part detection

arxiv: https://arxiv.org/abs/1703.09529

Using Deep Networks for Drone Detection

intro: AVSS 2017
arxiv: https://arxiv.org/abs/1706.05726

Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection

intro: ICCV 2017
arxiv: https://arxiv.org/abs/1708.01642

Target Driven Instance Detection

arxiv: https://arxiv.org/abs/1803.04610

DeepVoting: An Explainable Framework for Semantic Part Detection under Partial Occlusion

arxiv: https://arxiv.org/abs/1709.04577

VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition

intro: ICCV 2017
arxiv: https://arxiv.org/abs/1710.06288
github: https://github.com/SeokjuLee/VPGNet

Grab, Pay and Eat: Semantic Food Detection for Smart Restaurants

arxiv: https://arxiv.org/abs/1711.05128

ReMotENet: Efficient Relevant Motion Event Detection for Large-scale Home Surveillance Videos

intro: WACV 2018
arxiv: https://arxiv.org/abs/1801.02031

Deep Learning Object Detection Methods for Ecological Camera Trap Data

intro: Conference of Computer and Robot Vision. University of Guelph
arxiv: https://arxiv.org/abs/1803.10842

EL-GAN: Embedding Loss Driven Generative Adversarial Networks for Lane Detection

arxiv: https://arxiv.org/abs/1806.05525

Towards End-to-End Lane Detection: an Instance Segmentation Approach

arxiv: https://arxiv.org/abs/1802.05591

iCAN: Instance-Centric Attention Network for Human-Object Interaction Detection

intro: BMVC 2018
project page: https://gaochen315.github.io/iCAN/
arxiv: https://arxiv.org/abs/1808.10437
github: https://github.com/vt-vl-lab/iCAN

Densely Supervised Grasp Detector (DSGD)

https://arxiv.org/abs/1810.03962

Object Proposal

DeepProposal: Hunting Objects by Cascading Deep Convolutional Layers

arxiv: http://arxiv.org/abs/1510.04445
github: https://github.com/aghodrati/deepproposal

Scale-aware Pixel-wise Object Proposal Networks

intro: IEEE Transactions on Image Processing
arxiv: http://arxiv.org/abs/1601.04798

Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization

intro: BMVC 2016. AttractioNet
arxiv: https://arxiv.org/abs/1606.04446
github: https://github.com/gidariss/AttractioNet

Learning to Segment Object Proposals via Recursive Neural Networks

arxiv: https://arxiv.org/abs/1612.01057

Learning Detection with Diverse Proposals

intro: CVPR 2017
keywords: differentiable Determinantal Point Process (DPP) layer, Learning Detection with Diverse Proposals (LDDP)
arxiv: https://arxiv.org/abs/1704.03533

ScaleNet: Guiding Object Proposal Generation in Supermarkets and Beyond

keywords: product detection
arxiv: https://arxiv.org/abs/1704.06752

Improving Small Object Proposals for Company Logo Detection

intro: ICMR 2017
arxiv: https://arxiv.org/abs/1704.08881

Open Logo Detection Challenge

intro: BMVC 2018
keywords: QMUL-OpenLogo
project page: https://qmul-openlogo.github.io/
arxiv: https://arxiv.org/abs/1807.01964

AttentionMask: Attentive, Efficient Object Proposal Generation Focusing on Small Objects

intro: ACCV 2018 oral
arxiv: https://arxiv.org/abs/1811.08728
github: https://github.com/chwilms/AttentionMask

Localization

Beyond Bounding Boxes: Precise Localization of Objects in Images

intro: PhD Thesis
homepage: http://www.eecs.berkeley.edu/Pubs/TechRpts/2015/EECS-2015-193.html
phd-thesis: http://www.eecs.berkeley.edu/Pubs/TechRpts/2015/EECS-2015-193.pdf
github(“SDS using hypercolumns”): https://github.com/bharath272/sds

Weakly Supervised Object Localization with Multi-fold Multiple Instance Learning

arxiv: http://arxiv.org/abs/1503.00949

Weakly Supervised Object Localization Using Size Estimates

arxiv: http://arxiv.org/abs/1608.04314

Active Object Localization with Deep Reinforcement Learning

intro: ICCV 2015
keywords: Markov Decision Process
arxiv: https://arxiv.org/abs/1511.06015

Localizing objects using referring expressions

intro: ECCV 2016
keywords: LSTM, multiple instance learning (MIL)
paper: http://www.umiacs.umd.edu/~varun/files/refexp-ECCV16.pdf
github: https://github.com/varun-nagaraja/referring-expressions

LocNet: Improving Localization Accuracy for Object Detection

intro: CVPR 2016 oral
arxiv: http://arxiv.org/abs/1511.07763
github: https://github.com/gidariss/LocNet

Learning Deep Features for Discriminative Localization

homepage: http://cnnlocalization.csail.mit.edu/
arxiv: http://arxiv.org/abs/1512.04150
github(Tensorflow): https://github.com/jazzsaxmafia/Weakly_detector
github: https://github.com/metalbubble/CAM
github: https://github.com/tdeboissiere/VGG16CAM-keras

ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization

intro: ECCV 2016
project page: http://www.di.ens.fr/willow/research/contextlocnet/
arxiv: http://arxiv.org/abs/1609.04331
github: https://github.com/vadimkantorov/contextlocnet

Ensemble of Part Detectors for Simultaneous Classification and Localization

arxiv: https://arxiv.org/abs/1705.10034

STNet: Selective Tuning of Convolutional Networks for Object Localization

arxiv: https://arxiv.org/abs/1708.06418

Soft Proposal Networks for Weakly Supervised Object Localization

intro: ICCV 2017
arxiv: https://arxiv.org/abs/1709.01829

Fine-grained Discriminative Localization via Saliency-guided Faster R-CNN

intro: ACM MM 2017
arxiv: https://arxiv.org/abs/1709.08295

Tutorials / Talks

Convolutional Feature Maps: Elements of efficient (and accurate) CNN-based object detection

slides: http://research.microsoft.com/en-us/um/people/kahe/iccv15tutorial/iccv2015_tutorial_convolutional_feature_maps_kaiminghe.pdf

Towards Good Practices for Recognition & Detection

intro: Hikvision Research Institute. Supervised Data Augmentation (SDA)
slides: http://image-net.org/challenges/talks/2016/Hikvision_at_ImageNet_2016.pdf

Work in progress: Improving object detection and instance segmentation for small objects

https://docs.google.com/presentation/d/1OTfGn6mLe1VWE8D0q6Tu_WwFTSoLGd4OF8WCYnOWcVo/edit#slide=id.g37418adc7a_0_229

Object Detection with Deep Learning: A Review

arxiv: https://arxiv.org/abs/1807.05511

Projects

Detectron

intro: FAIR’s research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
github: https://github.com/facebookresearch/Detectron

TensorBox: a simple framework for training neural networks to detect objects in images

intro: “The basic model implements the simple and robust GoogLeNet-OverFeat algorithm. We additionally provide an implementation of the ReInspect algorithm”
github: https://github.com/Russell91/TensorBox

Object detection in torch: Implementation of some object detection frameworks in torch

github: https://github.com/fmassa/object-detection.torch

Using DIGITS to train an Object Detection network

github: https://github.com/NVIDIA/DIGITS/blob/master/examples/object-detection/README.md

FCN-MultiBox Detector

intro: Full convolution MultiBox Detector (like SSD) implemented in Torch.
github: https://github.com/teaonly/FMD.torch

KittiBox: A car detection model implemented in Tensorflow.

keywords: MultiNet
intro: KittiBox is a collection of scripts to train out model FastBox on the Kitti Object Detection Dataset
github: https://github.com/MarvinTeichmann/KittiBox

Deformable Convolutional Networks + MST + Soft-NMS

github: https://github.com/bharatsingh430/Deformable-ConvNets

How to Build a Real-time Hand-Detector using Neural Networks (SSD) on Tensorflow

blog: https://towardsdatascience.com/how-to-build-a-real-time-hand-detector-using-neural-networks-ssd-on-tensorflow-d6bac0e4b2ce
github: https://github.com//victordibia/handtracking

Metrics for object detection

intro: Most popular metrics used to evaluate object detection algorithms
github: https://github.com/rafaelpadilla/Object-Detection-Metrics

MobileNetv2-SSDLite

intro: Caffe implementation of SSD and SSDLite detection on MobileNetv2, converted from tensorflow.
github: https://github.com/chuanqi305/MobileNetv2-SSDLite

Leaderboard

Detection Results: VOC2012

intro: Competition “comp4” (train on additional data)
homepage: http://host.robots.ox.ac.uk:8080/leaderboard/displaylb.php?challengeid=11&compid=4

Tools

BeaverDam: Video annotation tool for deep learning training labels

https://github.com/antingshen/BeaverDam

Blogs

Convolutional Neural Networks for Object Detection

http://rnd.azoft.com/convolutional-neural-networks-object-detection/

Introducing automatic object detection to visual search (Pinterest)

keywords: Faster R-CNN
blog: https://engineering.pinterest.com/blog/introducing-automatic-object-detection-visual-search
demo: https://engineering.pinterest.com/sites/engineering/files/Visual Search V1 - Video.mp4
review: https://news.developer.nvidia.com/pinterest-introduces-the-future-of-visual-search/?mkt_tok=eyJpIjoiTnpaa01UWXpPRE0xTURFMiIsInQiOiJJRjcybjkwTmtmallORUhLOFFFODBDclFqUlB3SWlRVXJXb1MrQ013TDRIMGxLQWlBczFIeWg0TFRUdnN2UHY2ZWFiXC9QQVwvQzBHM3B0UzBZblpOSmUyU1FcLzNPWXI4cml2VERwTTJsOFwvOEk9In0%3D

Deep Learning for Object Detection with DIGITS

blog: https://devblogs.nvidia.com/parallelforall/deep-learning-object-detection-digits/

Analyzing The Papers Behind Facebook’s Computer Vision Approach

keywords: DeepMask, SharpMask, MultiPathNet
blog: https://adeshpande3.github.io/adeshpande3.github.io/Analyzing-the-Papers-Behind-Facebook’s-Computer-Vision-Approach/

Easily Create High Quality Object Detectors with Deep Learning

intro: dlib v19.2
blog: http://blog.dlib.net/2016/10/easily-create-high-quality-object.html

How to Train a Deep-Learned Object Detection Model in the Microsoft Cognitive Toolkit

blog: https://blogs.technet.microsoft.com/machinelearning/2016/10/25/how-to-train-a-deep-learned-object-detection-model-in-cntk/
github: https://github.com/Microsoft/CNTK/tree/master/Examples/Image/Detection/FastRCNN

Object Detection in Satellite Imagery, a Low Overhead Approach

part 1: https://medium.com/the-downlinq/object-detection-in-satellite-imagery-a-low-overhead-approach-part-i-cbd96154a1b7#.2csh4iwx9
part 2: https://medium.com/the-downlinq/object-detection-in-satellite-imagery-a-low-overhead-approach-part-ii-893f40122f92#.f9b7dgf64

You Only Look Twice — Multi-Scale Object Detection in Satellite Imagery With Convolutional Neural Networks

part 1: https://medium.com/the-downlinq/you-only-look-twice-multi-scale-object-detection-in-satellite-imagery-with-convolutional-neural-38dad1cf7571#.fmmi2o3of
part 2: https://medium.com/the-downlinq/you-only-look-twice-multi-scale-object-detection-in-satellite-imagery-with-convolutional-neural-34f72f659588#.nwzarsz1t

Faster R-CNN Pedestrian and Car Detection

blog: https://bigsnarf.wordpress.com/2016/11/07/faster-r-cnn-pedestrian-and-car-detection/
ipn: https://gist.github.com/bigsnarfdude/2f7b2144065f6056892a98495644d3e0#file-demo_faster_rcnn_notebook-ipynb
github: https://github.com/bigsnarfdude/Faster-RCNN_TF

Small U-Net for vehicle detection

blog: https://medium.com/@vivek.yadav/small-u-net-for-vehicle-detection-9eec216f9fd6#.md4u80kad

Region of interest pooling explained

blog: https://deepsense.io/region-of-interest-pooling-explained/
github: https://github.com/deepsense-io/roi-pooling

Supercharge your Computer Vision models with the TensorFlow Object Detection API

blog: https://research.googleblog.com/2017/06/supercharge-your-computer-vision-models.html
github: https://github.com/tensorflow/models/tree/master/object_detection

Understanding SSD MultiBox — Real-Time Object Detection In Deep Learning

https://towardsdatascience.com/understanding-ssd-multibox-real-time-object-detection-in-deep-learning-495ef744fab

One-shot object detection

http://machinethink.net/blog/object-detection/

An overview of object detection: one-stage methods

https://www.jeremyjordan.me/object-detection-one-stage/

你可能感兴趣的:(计算机视觉,基本)

DCMNet一种用于目标检测的轻量级骨干结构模型详解及代码复现清风AI 深度学习算法详解及代码复现深度学习机器学习计算机视觉人工智能算法目标检测
模型背景在深度学习技术快速发展的背景下，目标检测领域取得了显著进展。早期的手工特征提取方法如Viola-Jones和HOG逐渐被卷积神经网络（CNN）取代，其中AlexNet在2012年的ILSVRC比赛中表现突出，推动了CNN在计算机视觉中的广泛应用。然而，这些早期模型在精度和效率方面仍存在不足，尤其是在处理复杂场景和小目标时表现不佳。这为DCMNet等新型轻量化目标检测模型的出现提供了契机，旨
YashanDB数据操作数据库
本章节将介绍YashanDB数据库中表相关的基本语法和示例。插入数据通过执行INSERT语句往表中插入数据：CREATETABLEinsert_tb(c1INT,c2CHAR(10));INSERTINTOinsert_tbVALUES(4,'hello');INSERTINTOinsert_tbVALUES(1,'world'),(2,'nihao'),(3,'shijie');COMMIT;删
YashanDB事务操作数据库
本章节将介绍YashanDB数据库中事务相关的基本语法和示例。提交事务前，用户在事务过程做的任何修改只有自己能看到，其他用户无法看到，并可以通过回滚操作将数据恢复。提交事务后，其他用户可看到修改后的数据，此时无法通过回滚操作将数据恢复。提交事务执行COMMIT语句提交事务：CREATETABLECOM_TB(c1INT);INSERTINTOCOM_TBVALUES(1),(2),(3);COMM
DeepSeek全栈接入指南：从零到生产环境的深度实践量子纠缠BUG DeepSeek部署 AI DeepSeek 人工智能深度学习机器学习
第一章：DeepSeek技术体系全景解析1.1认知DeepSeek技术生态DeepSeek作为新一代人工智能技术平台，构建了覆盖算法开发、模型训练、服务部署的全链路技术栈。其核心能力体现在：1.1.1多模态智能引擎自然语言处理：支持文本生成（NLG）、语义理解（NLU）、情感分析等计算机视觉：提供图像分类、目标检测、OCR识别等CV能力语音交互：包含语音识别（ASR）、语音合成（TTS）及声纹识别
两天速通力扣HOT100[DAY2] (55~100) WynnLu 算法 leetcode c++
两天速通力扣HOT100[DAY2](55~100)本题解旨在以最简单的语言总结hot100各题思路，为每一题提供一个思考入口，但想要手撕出来，需要自己认真推理细节。目录回溯55~62二分查找63~68栈69~73堆74~76贪心77~80动态规划81~90多维动态规划91~95技巧96~10055、全排列思路回溯基本思想：DFS+状态还原面对前方n种选择的时候，循环选择其中一种，做出对应的改变并
基于Linux终端的Mplayer媒体播放器控制系统磨十三 linux 运维服务器
一、项目概述项目名称：基于Linux终端的Mplayer媒体播放器控制系统核心功能：多级菜单交互（主菜单/播放列表）：系统提供主菜单和播放列表两个层级的菜单，用户可以通过方向键在菜单项之间导航，并使用Enter键选择操作。这种设计使得用户能够方便地浏览和选择不同的功能。播放控制（播放/暂停/停止/上下曲目）：支持基本的播放控制功能，包括播放、暂停、停止以及上下曲目的切换。此外，还提供了倍速播放（如
Linux 下使用tracepath进行网络诊断分析 linux
简介tracepath命令是Linux中的一个网络诊断工具，类似于traceroute，但专门用于跟踪到目标主机的网络路径，同时自动处理路径MTU发现。这是一种简单的方法，可以找出机器和远程目的地之间的跃点，同时还可以识别沿途的任何问题。基本语法tracepath[options]：要跟踪路径的目标目的地的IP地址或主机名常用选项-n：以数字形式显示跳转地址（无需DNS解析）-l：设置数据包的长度
YashanDB数据操作数据库
本章节将介绍YashanDB数据库中表相关的基本语法和示例。插入数据通过执行INSERT语句往表中插入数据：CREATETABLEinsert_tb(c1INT,c2CHAR(10));INSERTINTOinsert_tbVALUES(4,'hello');INSERTINTOinsert_tbVALUES(1,'world'),(2,'nihao'),(3,'shijie');COMMIT;删
软件供应链安全工具链研究系列—RASP自适应威胁免疫平台（下篇） DevSecOps选型指南安全网络软件供应链安全工具 HW
在“软件供应链安全工具链研究系列—RASP自适应威胁免疫平台-上篇”中我们提到了RASP工具的基本能力、原理以及工具的应用场景，了解到了RASP工具在各场景下发挥的价值。那么在当今高强度攻防对抗的大场景下，RASP作为最后一道防线，不论是从高危漏洞修复还是应对高级攻击技术，都有着更高的要求。1.工具应具备的能力建议1.1技术能力方面建议1.1.1虚拟补丁技术RASP在为应用系统赋予威胁免疫能力的同
在数据分析工作中运用因果推断模型的实践指南 theskylife #因果分析数据分析大数据人工智能 AI 因果分析
目录1.写在开头2.因果推断模型的基础2.1因果关系vs.相关关系2.2基本概念和术语3.常见的因果推断方法3.1随机对照试验（RCTs）3.2工具变量法（IV）3.3回归不连续设计（RDD）4.因果推断的实际应用4.1案例研究1：使用RCTs分析营销活动的效果4.1.1背景和问题描述4.1.2实验设计和数据收集4.1.3数据分析和结果解释4.2案例研究2：应用工具变量法解决价格对销量的影响问题4
HTML基本标签详解请叫我飞哥@ HTML5 html 前端
HTML基本标签详解HTML（超文本标记语言）是构建网页的基础，以下是一些常用的HTML基本标签及其详细说明：定义：整个HTML文档的根元素。示例：定义：文档的头部，包含元数据（如标题、字符集、样式等）。示例：文档标题定义：文档的标题，显示在浏览器的标题栏或标签页上。示例：我的网页定义：文档的主体，包含可见的内容。示例：欢迎来到我的网页-定义：定义标题，从最重要的标题（）到最不重要的标题（）。示例
Python学习_很好的学习笔记自用百年渔翁_肯肯测试开发
Onthispage...(hide)1. 基本安装2. Python文档2.1 推荐资源站点2.2 其他参考资料2.3 代码示例3. 常用工具3.1 PythonIDE3.2 内置类库使用参考3.3
【CodeBlocks】搭建OpenCV环境指南万众珩
【CodeBlocks】搭建OpenCV环境指南CodeBlocks搭建OpenCV环境项目地址:https://gitcode.com/Resource-Bundle-Collection/e1e1a本资源提供了详细的教程，帮助您在CodeBlocks集成开发环境中顺利搭建OpenCV环境。OpenCV是一个开源的计算机视觉和机器学习软件库，广泛应用于图像处理和视频分析领域。通过这篇指南，即便是
【Linux入门】正则三剑客：grep、sed和wak Karoku066 linux 运维服务器 bash ssh
文章目录gerp一、基本概述二、基本语法三、常用选项1.搜索选项2.正则表达式选项3.其他选项四、示例sedsed编辑器的介绍sed流编辑器的工作过程解决sed命令处理大文件效率慢的问题解决方案一：使用`split`命令分割文件解决方案二：优化`sed`命令的使用解决方案三：使用更高效的工具解决方案四：并行处理总结sed命令的基本格式与选项基本操作格式执行多条命令的格式常用选项sed命令的操作符s
注意力机制（Attention Mechanism）详细分类与介绍 Jason_Orton 分类数据挖掘人工智能
注意力机制（AttentionMechanism）是近年来在深度学习中非常流行的一种技术，特别是在自然语言处理（NLP）、计算机视觉等任务中，具有显著的效果。它的核心思想是模仿人类在处理信息时的注意力分配方式，根据不同部分的重要性给予不同的关注程度。1.注意力机制的背景与动机在传统的深度学习模型（如RNN、CNN等）中，信息处理通常是按照固定的规则和结构进行的，模型对输入的各个部分给予相同的关注。
Java高频面试之SE-20 牛马baby java 面试开发语言
hello啊，各位观众姥爷们！！！本baby今天又来了！哈哈哈哈哈嗝Java的泛型是什么？Java泛型（Generics）是Java5引入的一项重要特性，用于增强代码的类型安全性和重用性。泛型允许在定义类、接口和方法时使用类型参数，从而使代码更加通用且减少类型转换的需求。1.泛型的基本概念泛型的核心思想是参数化类型，即在定义类、接口或方法时，使用一个或多个类型参数（通常用T、E、K、V等表示），在
LangChain入门：使用Python和通义千问打造免费的Qwen大模型聊天机器人闯江湖50年 langchain python 机器人人工智能
前言LangChain是一个用于开发由大型语言模型（LargeLanguageModels，简称LLMs）驱动的应用程序的框架。它提供了一个灵活的框架，使得开发者可以构建具有上下文感知能力和推理能力的应用程序，这些应用程序可以利用公司的数据和APIs。这个框架由几个部分组成。LangChain库：Python和JavaScript库。包含了各种组件的接口和集成，一个基本的运行时，用于将这些组件组合
python天气数据分析与处理,用python数据分析天气 2401_84504019 人工智能
本篇文章给大家谈谈python天气预报可视化分析报告，以及基于python的天气预测系统研究，希望对各位有所帮助，不要忘了收藏本站喔。基于大数据重庆市气象数据分析摘要信息化社会内需要与之针对性的信息获取途径，但是途径的扩展基本上为人们所努力的方向，由于站在的角度存在偏差，人们经常能够获得不同类型信息，这也是技术最为难以攻克的课题。针对气象数据等问题，对气象信息进行研究分析，然后开发设计出气象数据分
图神经网络：拓扑数据分析的新时代 Jason_Orton 神经网络数据分析人工智能
随着图数据的广泛应用，图神经网络（GraphNeuralNetwork,GNN）作为一种强大的深度学习工具，逐渐成为机器学习领域中的一颗新星。图数据在许多现实世界问题中无处不在，诸如社交网络、交通网络、分子结构、推荐系统等都可以被建模为图结构。图神经网络通过直接处理图结构数据，能够更好地捕捉节点之间的关系信息，从而在众多任务中展现出了优异的性能。本文将深入探讨图神经网络的基本原理、常见的算法、应用
python 基本用法选与握 #python python 人工智能开发语言
1[None]importnumpyasnp#创建一个示例数组img_pre=np.array([[1,2,3],[4,5,6]])#使用...进行索引result=img_pre[...][None]print("原始数组形状:",img_pre.shape)print("操作后数组形状:",result.shape)代码解释...操作符：...（省略号）在NumPy中是一个特殊的索引对象，它表
微信小程序复制功能青青子衿越微信小程序小程序
在微信公众平台隐私协议中加剪贴板设置-基本设置审核通过后app.json中添加"permission":{"scope.writeClipboard":{"desc":"你的剪贴板将用于小程序的复制操作"}},index.ts//复制指定内容handleCopy(){console.log("复制");wx.setClipboardData({data:this.data.verification
react-native入门之核心组件与原生组件 crayon-shin-chan surprise #react-native react native react
文档：核心组件与原生组件·ReactNative中文网1.简介ReactNative是一个使用React和应用平台的原生功能来构建Android和iOS应用的开源框架。可以使用JavaScript来访问移动平台的API，使用React组件来描述UI的外观和行为2.视图在Android和iOS开发中，一个视图是UI的基本组成部分屏幕上的一个小矩形元素、可用于显示文本、图像或响应用户输入。甚至应用程序
【创作话题】Wireshark插件开发实用技巧分享热爱分享的博士僧 wireshark 测试工具网络
开发Wireshark插件能够极大地扩展Wireshark的功能，使其能够解析和分析特定协议的数据包。以下是一些实用技巧，帮助您更高效地进行Wireshark插件开发：1.熟悉Lua脚本语言Wireshark支持使用Lua脚本语言来编写插件。Lua是一种轻量级的脚本语言，易于学习且功能强大，非常适合用于快速原型设计和开发Wireshark插件。掌握Lua的基本语法、数据结构（如表）以及如何在Lua
河南省统计年鉴面板数据2000-2022的指标说明用数据说话用数据决策数据库
变量行政区划代码年份地区经度纬度所属省份长江经济带综合-行政区划-市级个数（个）综合-行政区划-市级个数（个）-省辖市个数（个）综合-行政区划-市级个数（个）-县级市个数（个）综合-行政区划-县级个数（个）综合-行政区划-市辖区个数（个）综合-行政区划-镇个数（个）综合-行政区划-乡个数（个）综合-行政区划-街道办事处（个）综合-行政区划-居民委员会（个）综合-行政区划-村民委员会（个）综合-基本
Java常用开源库: apache HttpClient 4.x, oktttp, jetty HttpClient wzj_whut 后端
文章目录apachehttpclientGETPOSTFormPOSTString上传文件/Multipart设置超时启用cookieokhttp基本用法上传文件POST设置超时websocketjettyhttpclientapachehttpclienthttps://mvnrepository.com/artifact/org.apache.httpcomponents/httpclient
[AI] [ComfyUI]理解ComyUI的基本原理及其图像生成技术技术小甜甜 AI探索者人工智能 AI作画
ComyUI作为一种图像生成框架，其背后的核心技术基于潜在空间的概念，并通过各种深度学习模块实现高效的图像生成与本地部署。本文将详细探讨ComyUI的基本原理，涵盖其在图像生成中的关键概念，包括潜在空间、VAE模块、噪声处理以及CLIP编码器节点的作用。1.潜在空间的存在与生成效率什么是潜在空间？潜在空间（LatentSpace）是指数据压缩后的低维空间。在图像生成中，潜在空间的引入极大地提高了生
作为一名测试工程师如何学习Kubernetes(k8s)技能网络安全小宇哥学习 kubernetes 容器计算机网络 web安全安全 dubbo
前言Kubernetes(K8s)作为云原生时代的关键技术之一，对于运维工程师、开发工程师以及测试工程师来说，都是一门需要掌握的重要技术。作为一名软件测试工程师，学习Kubernetes是一个有助于提升自动化测试、容器化测试以及云原生应用测试能力的重要过程。以下是一个系统性的学习路径和建议：一、了解基础概念1）容器技术：学习Docker等容器技术的基础知识，了解容器的基本概念、镜像、容器运行与管理
自编码器（Autoencoders）路野yue 机器学习人工智能深度学习
自编码器（Autoencoders）:自编码器由编码器和解码器组成，编码器将输入数据压缩为低维表示，解码器将其还原为原始数据。通过训练，自编码器能够学习数据的有效表示，常用于降维和特征提取。相比于独立模型，它的输入输出更灵活，且可以在输入完成后在完成解码。1.基本结构自编码器由两部分组成：编码器（Encoder）：将输入数据压缩为低维表示（编码）。解码器（Decoder）：从编码中重建原始数据。2
数据结构之链表简介：原理、实现与应用陈辰学长数据结构链表网络
数据结构之链表简介：原理、实现与应用一、引言在计算机科学中，数据结构是组织和存储数据的方式，而链表是一种非常基础且重要的数据结构。链表以其动态性、灵活性和高效性，在许多编程场景中被广泛应用。本文将详细介绍链表的基本概念、实现方式以及应用场景，帮助读者深入理解链表的原理和优势。二、链表的基本概念链表是一种线性数据结构，由一系列节点组成，每个节点包含两部分：数据部分和指向下一个节点的指针。链表的头节点
天气API接口在日常生活与商业决策中的应用 FB13713612741 python
天气，作为自然界中最不可控却又对人类活动影响巨大的因素之一，其变化无常的特性使得人们长期以来都在寻找预测和控制它的方法。随着科技的进步，尤其是互联网和大数据技术的发展，天气信息的获取和应用变得更加便捷和高效。天气API接口，作为连接天气数据与各类应用的桥梁，正逐步渗透到我们日常生活的方方面面，并在商业决策中发挥着越来越重要的作用。一、天气API接口的基本概念与技术原理天气API接口是一种提供天气数
Java开发中，spring mvc 的线程怎么调用？小麦麦子 spring mvc
今天逛知乎，看到最近很多人都在问spring mvc 的线程http://www.maiziedu.com/course/java/ 的启动问题，觉得挺有意思的，那哥们儿问的也听仔细，下面的回答也很详尽，分享出来，希望遇对遇到类似问题的Java开发程序猿有所帮助。问题：在用spring mvc架构的网站上，设一线程在虚拟机启动时运行，线程里有一全局
maven依赖范围 bitcarter maven
1.test 测试的时候才会依赖，编译和打包不依赖，如junit不被打包 2.compile 只有编译和打包时才会依赖 3.provided 编译和测试的时候依赖，打包不依赖，如：tomcat的一些公用jar包 4.runtime 运行时依赖，编译不依赖 5.默认compile 依赖范围compile是支持传递的，test不支持传递 1.传递的意思是项目A，引用
Jaxb org.xml.sax.saxparseexception : premature end of file darrenzhu xml premature JAXB
如果在使用JAXB把xml文件unmarshal成vo(XSD自动生成的vo)时碰到如下错误： org.xml.sax.saxparseexception : premature end of file 很有可能时你直接读取文件为inputstream，然后将inputstream作为构建unmarshal需要的source参数。InputSource inputSource = new In
CSS Specificity 周凡杨 html 权重 Specificity css
有时候对于页面元素设置了样式，可为什么页面的显示没有匹配上呢？ because specificity CSS 的选择符是有权重的，当不同的选择符的样式设置有冲突时，浏览器会采用权重高的选择符设置的样式。规则： HTML标签的权重是1 Class 的权重是10 Id 的权重是100
java与servlet g21121 servlet
servlet 搞java web开发的人一定不会陌生，而且大家还会时常用到它。下面是java官方网站上对servlet的介绍： java官网对于servlet的解释写道 Java Servlet Technology Overview Servlets are the Java platform technology of choice for extending and enha
eclipse中安装maven插件 510888780 eclipse maven
1.首先去官网下载 Maven： http://www.apache.org/dyn/closer.cgi/maven/binaries/apache-maven-3.2.3-bin.tar.gz 下载完成之后将其解压，我将解压后的文件夹：apache-maven-3.2.3，并将它放在 D:\tools目录下，即 maven 最终的路径是：D:\tools\apache-mave
jpa@OneToOne关联关系布衣凌宇 jpa
Nruser里的pruserid关联到Pruser的主键id，实现对一个表的增删改，另一个表的数据随之增删改。 Nruser实体类 //***************************************************************** @Entity @Table(name="nruser") @DynamicInsert @Dynam
我的spring学习笔记11-Spring中关于声明式事务的配置 aijuans spring 事务配置
这两天学到事务管理这一块，结合到之前的terasoluna框架，觉得书本上讲的还是简单阿。我就把我从书本上学到的再结合实际的项目以及网上看到的一些内容，对声明式事务管理做个整理吧。我看得Spring in Action第二版中只提到了用TransactionProxyFactoryBean和<tx:advice/>,定义注释驱动这三种，我承认后两种的内容很好，很强大。但是实际的项目当中
java 动态代理简单实现 antlove java handler proxy dynamic service
dynamicproxy.service.HelloService package dynamicproxy.service; public interface HelloService { public void sayHello(); } dynamicproxy.service.impl.HelloServiceImpl package dynamicp
JDBC连接数据库百合不是茶 JDBC编程 JAVA操作oracle数据库
如果我们要想连接oracle公司的数据库，就要首先下载oralce公司的驱动程序，将这个驱动程序的jar包导入到我们工程中; JDBC链接数据库的代码和固定写法; 1,加载oracle数据库的驱动; &nb
单例模式中的多线程分析 bijian1013 java thread 多线程 java多线程
谈到单例模式，我们立马会想到饿汉式和懒汉式加载，所谓饿汉式就是在创建类时就创建好了实例，懒汉式在获取实例时才去创建实例，即延迟加载。饿汉式： package com.bijian.study; public class Singleton { private Singleton() { } // 注意这是private 只供内部调用 private static
javascript读取和修改原型特别需要注意原型的读写不具有对等性 bijian1013 JavaScript prototype
对于从原型对象继承而来的成员，其读和写具有内在的不对等性。比如有一个对象A，假设它的原型对象是B，B的原型对象是null。如果我们需要读取A对象的name属性值，那么JS会优先在A中查找，如果找到了name属性那么就返回；如果A中没有name属性，那么就到原型B中查找name，如果找到了就返回；如果原型B中也没有
【持久化框架MyBatis3六】MyBatis3集成第三方DataSource bit1129 dataSource
MyBatis内置了数据源的支持，如： <environments default="development"> <environment id="development"> <transactionManager type="JDBC" /> <data
我程序中用到的urldecode和base64decode,MD5 bitcarter c MD5 base64decode urldecode
这里是base64decode和urldecode，Md5在附件中。因为我是在后台所以需要解码： string Base64Decode(const char* Data,int DataByte,int& OutByte) { //解码表 const char DecodeTable[] = { 0, 0, 0, 0, 0, 0
腾讯资深运维专家周小军：QQ与微信架构的惊天秘密 ronin47
社交领域一直是互联网创业的大热门，从PC到移动端，从OICQ、MSN到QQ。到了移动互联网时代，社交领域应用开始彻底爆发，直奔黄金期。腾讯在过去几年里，社交平台更是火到爆，QQ和微信坐拥几亿的粉丝，QQ空间和朋友圈各种刷屏，写心得，晒照片，秀视频，那么谁来为企鹅保驾护航呢？支撑QQ和微信海量数据背后的架构又有哪些惊天内幕呢？本期大讲堂的内容来自今年2月份ChinaUnix对腾讯社交网络运营服务中心
java-69-旋转数组的最小元素。把一个数组最开始的若干个元素搬到数组的末尾，我们称之为数组的旋转。输入一个排好序的数组的一个旋转，输出旋转数组的最小元素 bylijinnan java
public class MinOfShiftedArray { /** * Q69 旋转数组的最小元素 * 把一个数组最开始的若干个元素搬到数组的末尾，我们称之为数组的旋转。输入一个排好序的数组的一个旋转，输出旋转数组的最小元素。 * 例如数组{3, 4, 5, 1, 2}为{1, 2, 3, 4, 5}的一个旋转，该数组的最小值为1。 */ publ
看博客，应该是有方向的 Cb123456 反省看博客
看博客，应该是有方向的: 我现在就复习以前的，在补补以前不会的，现在还不会的，同时完善完善项目，也看看别人的博客. 我刚突然想到的: 1.应该看计算机组成原理，数据结构，一些算法，还有关于android,java的。 2.对于我，也快大四了，看一些职业规划的，以及一些学习的经验，看看别人的工作总结的. 为什么要写
[开源与商业]做开源项目的人生活上一定要朴素,尽量减少对官方和商业体系的依赖 comsci 开源项目
为什么这样说呢？因为科学和技术的发展有时候需要一个平缓和长期的积累过程，但是行政和商业体系本身充满各种不稳定性和不确定性，如果你希望长期从事某个科研项目，但是却又必须依赖于某种行政和商业体系，那其中的过程必定充满各种风险。。。所以，为避免这种不确定性风险，我
一个 sql优化（[精华] 一个查询优化的分析调整全过程！很值得一看） cwqcwqmax9 sql
见 http://www.itpub.net/forum.php?mod=viewthread&tid=239011 Web翻页优化实例提交时间: 2004-6-18 15:37:49 回复发消息环境： Linux ve
Hibernat and Ibatis dashuaifu Hibernate ibatis
Hibernate VS iBATIS 简介 Hibernate 是当前最流行的O/R mapping框架，当前版本是3.05。它出身于sf.net，现在已经成为Jboss的一部分了 iBATIS 是另外一种优秀的O/R mapping框架，当前版本是2.0。目前属于apache的一个子项目了。相对Hibernate“O/R”而言，iBATIS 是一种“Sql Mappi
备份MYSQL脚本 dcj3sjt126com mysql
#!/bin/sh # this shell to backup mysql #1413161683@qq.com (QQ:1413161683 DuChengJiu) _dbDir=/var/lib/mysql/ _today=`date +%w` _bakDir=/usr/backup/$_today [ ! -d $_bakDir ] && mkdir -p
iOS第三方开源库的吐槽和备忘 dcj3sjt126com ios
转自 ibireme的博客做iOS开发总会接触到一些第三方库，这里整理一下，做一些吐槽。目前比较活跃的社区仍旧是Github，除此以外也有一些不错的库散落在Google Code、SourceForge等地方。由于Github社区太过主流，这里主要介绍一下Github里面流行的iOS库。首先整理了一份 Github上排名靠
html wlwmanifest.xml eoems html xml
所谓优化wp_head()就是把从wp_head中移除不需要元素，同时也可以加快速度。步骤：加入到function.php remove_action('wp_head', 'wp_generator'); //wp-generator移除wordpress的版本号，本身blog的版本号没什么意义，但是如果让恶意玩家看到，可能会用官网公布的漏洞攻击blog remov
浅谈Java定时器发展 hacksin java 并发 timer 定时器
java在jdk1.3中推出了定时器类Timer,而后在jdk1.5后由Dou Lea从新开发出了支持多线程的ScheduleThreadPoolExecutor，从后者的表现来看，可以考虑完全替代Timer了。 Timer与ScheduleThreadPoolExecutor对比： 1. Timer始于jdk1.3,其原理是利用一个TimerTask数组当作队列
移动端页面侧边导航滑入效果 ini jquery Web html5 css javascirpt
效果体验：http://hovertree.com/texiao/mobile/2.htm可以使用移动设备浏览器查看效果。效果使用到jquery-2.1.4.min.js，该版本的jQuery库是用于支持HTML5的浏览器上，不再兼容IE8以前的浏览器，现在移动端浏览器一般都支持HTML5，所以使用该jQuery没问题。HTML文件代码： <!DOCTYPE html> <h
AspectJ+Javasist记录日志 kane_xie aspectj javasist
在项目中碰到这样一个需求，对一个服务类的每一个方法，在方法开始和结束的时候分别记录一条日志，内容包括方法名，参数名+参数值以及方法执行的时间。 @Override public String get(String key) { // long start = System.currentTimeMillis(); // System.out.println("Be
redis学习笔记 MJC410621 redis NoSQL
1)nosql数据库主要由以下特点：非关系型的、分布式的、开源的、水平可扩展的。 1，处理超大量的数据 2，运行在便宜的PC服务器集群上， 3，击碎了性能瓶颈。 1)对数据高并发读写。 2)对海量数据的高效率存储和访问。 3)对数据的高扩展性和高可用性。 redis支持的类型： Sring 类型 set name lijie get name lijie set na
使用redis实现分布式锁 qifeifei
在多节点的系统中，如何实现分布式锁机制，其中用redis来实现是很好的方法之一，我们先来看一下jedis包中，有个类名BinaryJedis,它有个方法如下： public Long setnx(final byte[] key, final byte[] value) { checkIsInMulti(); client.setnx(key, value); ret
BI并非万能，中层业务管理报表要另辟蹊径张老师的菜大数据 BI 商业智能信息化
BI是商业智能的缩写，是可以帮助企业做出明智的业务经营决策的工具，其数据来源于各个业务系统，如ERP、CRM、SCM、进销存、HER、OA等。 BI系统不同于传统的管理信息系统，他号称是一个整体应用的解决方案，是融入管理思想的强大系统：有着系统整体的设计思想，支持对所有
安装rvm后出现rvm not a function 或者ruby -v后提示没安装ruby的问题 wudixiaotie function
1.在~/.bashrc最后加入 [[ -s "$HOME/.rvm/scripts/rvm" ]] && source "$HOME/.rvm/scripts/rvm" 2.重新启动terminal输入： rvm use ruby-2.2.1 --default 把当前安装的ruby版本设为默