luo_bosir

Object Detection (a must-read object detection post) ---- 1

Posted 2018-08-21 14:25:28 by Mars_WH

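A must-read object detection post: very comprehensive and continuously updated. Reposted from: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html. If this infringes any rights, contact me and it will be removed.
Last updated: 2019-02-26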

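I will follow the original author's blog and keep this post updated, adding some of my own new research in object detection and my readings of papers. Search the post by keyword as needed; for example, searching for 2018 finds the latest papers.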

Table of Contents

Papers
- Loss Functions
  - [CVPR2019] Generalized Intersection over Union: A Metric and A Loss for Bounding Box Regression
  - Deep Neural Networks for Object Detection
  - OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks
- R-CNN
  - Rich feature hierarchies for accurate object detection and semantic segmentation
- Fast R-CNN
  - Fast R-CNN
  - A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection
- Faster R-CNN
  - Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
  - R-CNN minus R
  - Faster R-CNN in MXNet with distributed implementation and data parallelization
  - Contextual Priming and Feedback for Faster R-CNN
  - An Implementation of Faster RCNN with Study for Region Sampling
  - Interpretable R-CNN
  - [AAAI2019] Object Detection based on Region Decomposition and Assembly
- Light-Head R-CNN
  - Light-Head R-CNN: In Defense of Two-Stage Object Detector
  - Cascade R-CNN: Delving into High Quality Object Detection
- MultiBox
  - Scalable Object Detection using Deep Neural Networks
  - Scalable, High-Quality Object Detection
- SPP-Net
  - Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
  - DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection
  - Object Detectors Emerge in Deep Scene CNNs
  - segDeepM: Exploiting Segmentation and Context in Deep Neural Networks for Object Detection
  - Object Detection Networks on Convolutional Feature Maps
  - Improving Object Detection with Deep Convolutional Networks via Bayesian Optimization and Structured Prediction
  - DeepBox: Learning Objectness with Convolutional Networks
- MR-CNN
  - Object detection via a multi-region & semantic segmentation-aware CNN model
- YOLO
  - You Only Look Once: Unified, Real-Time Object Detection
  - darkflow - translate darknet to tensorflow. Load trained weights, retrain/fine-tune them using tensorflow, export constant graph def to C++
  - Start Training YOLO with Our Own Data
  - YOLO: Core ML versus MPSNNGraph
  - TensorFlow YOLO object detection on Android
  - Computer Vision in iOS – Object Detection
- YOLOv2
  - YOLO9000: Better, Faster, Stronger
  - darknet_scripts
  - Yolo_mark: GUI for marking bounded boxes of objects in images for training Yolo v2
  - LightNet: Bringing pjreddie’s DarkNet out of the shadows
  - YOLO v2 Bounding Box Tool
- YOLOv3
  - YOLOv3: An Incremental Improvement
  - YOLO-LITE: A Real-Time Object Detection Algorithm Optimized for Non-GPU Computers
  - AttentionNet: Aggregating Weak Directions for Accurate Object Detection
- DenseBox
  - DenseBox: Unifying Landmark Localization with End to End Object Detection
- SSD
  - SSD: Single Shot MultiBox Detector
- DSSD
  - DSSD : Deconvolutional Single Shot Detector
  - Enhancement of SSD by concatenating feature maps for object detection
  - Context-aware Single-Shot Detector
  - Feature-Fused SSD: Fast Detection for Small Objects
- FSSD
  - FSSD: Feature Fusion Single Shot Multibox Detector
  - Weaving Multi-scale Context for Single Shot Detector
- ESSD
  - Extend the shallow part of Single Shot MultiBox Detector via Convolutional Neural Network
  - Tiny SSD: A Tiny Single-shot Detection Deep Convolutional Neural Network for Real-time Embedded Object Detection
  - MDSSD: Multi-scale Deconvolutional Single Shot Detector for small objects
- Inside-Outside Net (ION)
  - Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
  - Adaptive Object Detection Using Adjacency and Zoom Prediction
  - G-CNN: an Iterative Grid Based Object Detector
- Factors in Finetuning Deep Model for object detection
  - Factors in Finetuning Deep Model for Object Detection with Long-tail Distribution
  - We don’t need no bounding-boxes: Training object class detectors using only human verification
  - HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection
  - A MultiPath Network for Object Detection
- CRAFT
  - CRAFT Objects from Images
- OHEM
  - Training Region-based Object Detectors with Online Hard Example Mining
  - S-OHEM: Stratified Online Hard Example Mining for Object Detection
  - Exploit All the Layers: Fast and Accurate CNN Object Detector with Scale Dependent Pooling and Cascaded Rejection Classifiers
- R-FCN
  - R-FCN: Object Detection via Region-based Fully Convolutional Networks
  - R-FCN-3000 at 30fps: Decoupling Detection and Classification
  - Recycle deep features for better object detection
- MS-CNN
  - A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection
  - Multi-stage Object Detection with Group Recursive Learning
  - Subcategory-aware Convolutional Neural Networks for Object Proposals and Detection
- PVANET
  - PVANet: Lightweight Deep Neural Networks for Real-time Object Detection
- GBD-Net
  - Gated Bi-directional CNN for Object Detection
  - Crafting GBD-Net for Object Detection
  - StuffNet: Using ‘Stuff’ to Improve Object Detection
  - Generalized Haar Filter based Deep Networks for Real-Time Object Detection in Traffic Scene
  - Hierarchical Object Detection with Deep Reinforcement Learning
  - Learning to detect and localize many objects from few examples
  - Speed/accuracy trade-offs for modern convolutional object detectors
  - SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving
- Feature Pyramid Network (FPN)
  - Feature Pyramid Networks for Object Detection
  - Action-Driven Object Detection with Top-Down Visual Attentions
  - Beyond Skip Connections: Top-Down Modulation for Object Detection
  - Wide-Residual-Inception Networks for Real-time Object Detection
  - Attentional Network for Visual Object Detection
  - Learning Chained Deep Features and Classifiers for Cascade in Object Detection
  - DeNet: Scalable Real-time Object Detection with Directed Sparse Sampling
  - Discriminative Bimodal Networks for Visual Localization and Detection with Natural Language Queries
  - Spatial Memory for Context Reasoning in Object Detection
  - Accurate Single Stage Detector Using Recurrent Rolling Convolution
  - Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection
  - LCDet: Low-Complexity Fully-Convolutional Neural Networks for Object Detection in Embedded Systems
  - Point Linking Network for Object Detection
  - Perceptual Generative Adversarial Networks for Small Object Detection
  - Few-shot Object Detection
  - Yes-Net: An effective Detector Based on Global Information
  - SMC Faster R-CNN: Toward a scene-specialized multi-object detector
  - Towards lightweight convolutional neural networks for object detection
  - RON: Reverse Connection with Objectness Prior Networks for Object Detection
  - Mimicking Very Efficient Network for Object Detection
  - Residual Features and Unified Prediction Network for Single Stage Detection
  - Deformable Part-based Fully Convolutional Network for Object Detection
  - Adaptive Feeding: Achieving Fast and Accurate Detections by Adaptively Combining Object Detectors
  - Recurrent Scale Approximation for Object Detection in CNN
- DSOD
  - DSOD: Learning Deeply Supervised Object Detectors from Scratch
  - Object Detection from Scratch with Deep Supervision
  - Focal Loss for Dense Object Detection
  - Focal Loss Dense Detector for Vehicle Surveillance
  - CoupleNet: Coupling Global Structure with Local Parts for Object Detection
  - Incremental Learning of Object Detectors without Catastrophic Forgetting
  - Zoom Out-and-In Network with Map Attention Decision for Region Proposal and Object Detection
  - StairNet: Top-Down Semantic Aggregation for Accurate One Shot Detection
  - Dynamic Zoom-in Network for Fast Object Detection in Large Images
  - Zero-Annotation Object Detection with Web Knowledge Transfer
- MegDet
  - MegDet: A Large Mini-Batch Object Detector
  - Single-Shot Refinement Neural Network for Object Detection
  - Receptive Field Block Net for Accurate and Fast Object Detection
  - An Analysis of Scale Invariance in Object Detection - SNIP
  - Feature Selective Networks for Object Detection
  - Learning a Rotation Invariant Detector with Rotatable Bounding Box
  - Scalable Object Detection for Stylized Objects
  - Learning Object Detectors from Scratch with Gated Recurrent Feature Pyramids
  - Deep Regionlets for Object Detection
  - Training and Testing Object Detectors with Virtual Images
  - Large-Scale Object Discovery and Detector Adaptation from Unlabeled Video
  - Spot the Difference by Object Detection
  - Localization-Aware Active Learning for Object Detection
  - Object Detection with Mask-based Feature Encoding
  - LSTD: A Low-Shot Transfer Detector for Object Detection
  - Domain Adaptive Faster R-CNN for Object Detection in the Wild
  - Pseudo Mask Augmented Object Detection
  - Revisiting RCNN: On Awakening the Classification Power of Faster RCNN
  - Decoupled Classification Refinement: Hard False Positive Suppression for Object Detection
  - Learning Region Features for Object Detection
  - Single-Shot Bidirectional Pyramid Networks for High-Quality Object Detection
  - Object Detection for Comics using Manga109 Annotations
  - Task-Driven Super Resolution: Object Detection in Low-resolution Images
  - Transferring Common-Sense Knowledge for Object Detection
  - Multi-scale Location-aware Kernel Representation for Object Detection
  - Loss Rank Mining: A General Hard Example Mining Method for Real-time Detectors
  - DetNet: A Backbone network for Object Detection
  - Robust Physical Adversarial Attack on Faster R-CNN Object Detector
  - AdvDetPatch: Attacking Object Detectors with Adversarial Patches
  - Attacking Object Detectors via Imperceptible Patches on Background
  - Physical Adversarial Examples for Object Detectors
  - Quantization Mimic: Towards Very Tiny CNN for Object Detection
  - Object detection at 200 Frames Per Second
  - Object Detection using Domain Randomization and Generative Adversarial Refinement of Synthetic Images
  - SNIPER: Efficient Multi-Scale Training
  - Soft Sampling for Robust Object Detection
  - MetaAnchor: Learning to Detect Objects with Customized Anchors
  - Localization Recall Precision (LRP): A New Performance Metric for Object Detection
  - Auto-Context R-CNN
  - Pooling Pyramid Network for Object Detection
  - Modeling Visual Context is Key to Augmenting Object Detection Datasets
  - Dual Refinement Network for Single-Shot Object Detection
  - Acquisition of Localization Confidence for Accurate Object Detection
  - CornerNet: Detecting Objects as Paired Keypoints
  - Unsupervised Hard Example Mining from Videos for Improved Object Detection
  - SAN: Learning Relationship between Convolutional Features for Multi-Scale Object Detection
  - A Survey of Modern Object Detection Literature using Deep Learning
  - Tiny-DSOD: Lightweight Object Detection for Resource-Restricted Usages
  - Deep Feature Pyramid Reconfiguration for Object Detection
  - MDCN: Multi-Scale, Deep Inception Convolutional Neural Networks for Efficient Object Detection
  - Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks
  - Deep Learning for Generic Object Detection: A Survey
  - Training Confidence-Calibrated Classifier for Detecting Out-of-Distribution Samples
  - ScratchDet: Exploring to Train Single-Shot Object Detectors from Scratch
  - Fast and accurate object detection in high resolution 4K and 8K video using GPUs
  - Hybrid Knowledge Routed Modules for Large-scale Object Detection
  - Gradient Harmonized Single-stage Detector
  - M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network
  - BAN: Focusing on Boundary Context for Object Detection
  - Multi-layer Pruning Framework for Compressing Single Shot MultiBox Detector
  - R2CNN++: Multi-Dimensional Attention Based Rotation Invariant Detector with Robust Anchor Strategy
  - DeRPN: Taking a further step toward more general object detection
  - Fast Efficient Object Detection Using Selective Attention
  - Sampling Techniques for Large-Scale Object Detection from Sparsely Annotated Objects
  - Efficient Coarse-to-Fine Non-Local Module for the Detection of Small Objects
  - Deep Regionlets: Blended Representation and Deep Learning for Generic Object Detection
  - Grid R-CNN
  - Transferable Adversarial Attacks for Image and Video Object Detection
  - Anchor Box Optimization for Object Detection
  - AutoFocus: Efficient Multi-Scale Inference
  - Practical Adversarial Attack Against Object Detector
  - Learning Efficient Detector with Semi-supervised Adaptive Distillation
  - Scale-Aware Trident Networks for Object Detection
  - Region Proposal by Guided Anchoring
  - Consistent Optimization for Single-Shot Object Detection
  - Bottom-up Object Detection by Grouping Extreme and Center Points
  - A Single-shot Object Detector with Feature Aggragation and Enhancement
  - Bag of Freebies for Training Object Detection Neural Networks
- Non-Maximum Suppression (NMS)
  - End-to-End Integration of a Convolutional Network, Deformable Parts Model and Non-Maximum Suppression
  - A convnet for non-maximum suppression
  - Soft-NMS – Improving Object Detection With One Line of Code
  - Learning non-maximum suppression
  - Relation Networks for Object Detection
  - Learning Pairwise Relationship for Multi-object Detection in Crowded Scenes
  - Daedalus: Breaking Non-Maximum Suppression in Object Detection via Adversarial Examples
- Adversarial Examples
  - Adversarial Examples that Fool Detectors
  - Adversarial Examples Are Not Easily Detected: Bypassing Ten Detection Methods
- Weakly Supervised Object Detection
  - Track and Transfer: Watching Videos to Simulate Strong Human Supervision for Weakly-Supervised Object Detection
  - Weakly supervised object detection using pseudo-strong labels
  - Saliency Guided End-to-End Learning for Weakly Supervised Object Detection
  - Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
- Video Object Detection
  - Learning Object Class Detectors from Weakly Annotated Video
  - Analysing domain shift factors between videos and images for object detection
  - Video Object Recognition
  - Deep Learning for Saliency Prediction in Natural Video
  - T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos
  - Object Detection from Video Tubelets with Convolutional Neural Networks
  - Object Detection in Videos with Tubelets and Multi-context Cues
  - Context Matters: Refining Object Detection in Video with Recurrent Neural Networks
  - CNN Based Object Detection in Large Video Images
  - Object Detection in Videos with Tubelet Proposal Networks
  - Flow-Guided Feature Aggregation for Video Object Detection
  - Video Object Detection using Faster R-CNN
  - Improving Context Modeling for Video Object Detection and Tracking
  - Temporal Dynamic Graph LSTM for Action-driven Video Object Detection
  - Mobile Video Object Detection with Temporally-Aware Feature Maps
  - Towards High Performance Video Object Detection
  - Impression Network for Video Object Detection
  - Spatial-Temporal Memory Networks for Video Object Detection
  - 3D-DETNet: a Single Stage Video-Based Vehicle Detector
  - Object Detection in Videos by Short and Long Range Object Linking
  - Object Detection in Video with Spatiotemporal Sampling Networks
  - Towards High Performance Video Object Detection for Mobiles
  - Optimizing Video Object Detection via a Scale-Time Lattice
  - Pack and Detect: Fast Object Detection in Videos Using Region-of-Interest Packing
  - Fast Object Detection in Compressed Video
  - Tube-CNN: Modeling temporal evolution of appearance for object detection in video
  - AdaScale: Towards Real-time Video Object Detection Using Adaptive Scaling
- Object Detection on Mobile Devices
  - Pelee: A Real-Time Object Detection System on Mobile Devices
- Object Detection in 3D
  - Vote3Deep: Fast Object Detection in 3D Point Clouds Using Efficient Convolutional Neural Networks
  - Complex-YOLO: Real-time 3D Object Detection on Point Clouds
  - Focal Loss in 3D Object Detection
  - 3D Object Detection Using Scale Invariant and Feature Reweighting Networks
  - 3D Backbone Network for 3D Object Detection
- Object Detection on RGB-D
  - Learning Rich Features from RGB-D Images for Object Detection and Segmentation
  - Differential Geometry Boosts Convolutional Neural Networks for Object Detection
  - A Self-supervised Learning System for Object Detection using Physics Simulation and Multi-view Pose Estimation
- Zero-Shot Object Detection
  - Zero-Shot Detection
  - Zero-Shot Object Detection
  - Zero-Shot Object Detection: Learning to Simultaneously Recognize and Localize Novel Concepts
  - Zero-Shot Object Detection by Hybrid Region Embedding
- Salient Object Detection
  - Best Deep Saliency Detection Models (CVPR 2016 & 2015)
  - Large-scale optimization of hierarchical features for saliency prediction in natural images
  - Predicting Eye Fixations using Convolutional Neural Networks
  - Saliency Detection by Multi-Context Deep Learning
  - DeepSaliency: Multi-Task Deep Neural Network Model for Salient Object Detection
  - SuperCNN: A Superpixelwise Convolutional Neural Network for Salient Object Detection
  - Shallow and Deep Convolutional Networks for Saliency Prediction
  - Recurrent Attentional Networks for Saliency Detection
  - Two-Stream Convolutional Networks for Dynamic Saliency Prediction
- Unconstrained Salient Object Detection
  - Unconstrained Salient Object Detection via Proposal Subset Optimization
  - DHSNet: Deep Hierarchical Saliency Network for Salient Object Detection
  - Salient Object Subitizing
  - Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection
  - Saliency Detection via Combining Region-Level and Pixel-Level Predictions with CNNs
  - Edge Preserving and Multi-Scale Contextual Neural Network for Salient Object Detection
  - A Deep Multi-Level Network for Saliency Prediction
  - Visual Saliency Detection Based on Multiscale Deep CNN Features
  - A Deep Spatial Contextual Long-term Recurrent Convolutional Network for Saliency Detection
  - Deeply supervised salient object detection with short connections
  - Weakly Supervised Top-down Salient Object Detection
  - SalGAN: Visual Saliency Prediction with Generative Adversarial Networks
  - Visual Saliency Prediction Using a Mixture of Deep Neural Networks
  - A Fast and Compact Salient Score Regression Network Based on Fully Convolutional Network
  - Saliency Detection by Forward and Backward Cues in Deep-CNNs
  - Supervised Adversarial Networks for Image Saliency Detection
  - Group-wise Deep Co-saliency Detection
  - Towards the Success Rate of One: Real-time Unconstrained Salient Object Detection
  - Amulet: Aggregating Multi-level Convolutional Features for Salient Object Detection
  - Learning Uncertain Convolutional Features for Accurate Saliency Detection
  - Deep Edge-Aware Saliency Detection
  - Self-explanatory Deep Salient Object Detection
  - PiCANet: Learning Pixel-wise Contextual Attention in ConvNets and Its Application in Saliency Detection
  - DeepFeat: A Bottom Up and Top Down Saliency Model Based on Deep Features of Convolutional Neural Nets
  - Recurrently Aggregating Deep Features for Salient Object Detection
  - Deep saliency: What is learnt by a deep network about saliency?
  - Contrast-Oriented Deep Neural Networks for Salient Object Detection
  - Salient Object Detection by Lossless Feature Reflection
  - HyperFusion-Net: Densely Reflective Fusion for Salient Object Detection
- Video Saliency Detection
  - Deep Learning For Video Saliency Detection
  - Video Salient Object Detection Using Spatiotemporal Deep Features
  - Predicting Video Saliency with Object-to-Motion CNN and Two-layer Convolutional LSTM
- Visual Relationship Detection
  - Visual Relationship Detection with Language Priors
  - ViP-CNN: A Visual Phrase Reasoning Convolutional Neural Network for Visual Relationship Detection
  - Visual Translation Embedding Network for Visual Relation Detection
  - Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection
  - Detecting Visual Relationships with Deep Relational Networks
  - Identifying Spatial Relations in Images using Convolutional Neural Networks
  - PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN
  - Natural Language Guided Visual Relationship Detection
  - Detecting Visual Relationships Using Box Attention
  - Google AI Open Images - Visual Relationship Track
  - Context-Dependent Diffusion Network for Visual Relationship Detection
  - A Problem Reduction Approach for Visual Relationships Detection
- Face Detection
  - Multi-view Face Detection Using Deep Convolutional Neural Networks
  - From Facial Parts Responses to Face Detection: A Deep Learning Approach
  - Compact Convolutional Neural Network Cascade for Face Detection
  - Face Detection with End-to-End Integration of a ConvNet and a 3D Model
  - CMS-RCNN: Contextual Multi-Scale Region-based CNN for Unconstrained Face Detection
  - Towards a Deep Learning Framework for Unconstrained Face Detection
  - Supervised Transformer Network for Efficient Face Detection
  - UnitBox: An Advanced Object Detection Network
  - Bootstrapping Face Detection with Hard Negative Examples
  - Grid Loss: Detecting Occluded Faces
  - A Multi-Scale Cascade Fully Convolutional Network Face Detector
- MTCNN
  - Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Neural Networks
  - Face Detection using Deep Learning: An Improved Faster RCNN Approach
  - Faceness-Net: Face Detection through Deep Facial Part Responses
  - Multi-Path Region-Based Convolutional Neural Network for Accurate Detection of Unconstrained “Hard Faces”
  - End-To-End Face Detection and Recognition
  - Face R-CNN
  - Face Detection through Scale-Friendly Deep Convolutional Networks
  - Scale-Aware Face Detection
  - Detecting Faces Using Inside Cascaded Contextual CNN
  - Multi-Branch Fully Convolutional Network for Face Detection
  - SSH: Single Stage Headless Face Detector
  - Dockerface: an easy to install and use Faster R-CNN face detector in a Docker container
  - FaceBoxes: A CPU Real-time Face Detector with High Accuracy
  - S3FD: Single Shot Scale-invariant Face Detector
  - Detecting Faces Using Region-based Fully Convolutional Networks
  - AffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection
  - Face Attention Network: An effective Face Detector for the Occluded Faces
  - Feature Agglomeration Networks for Single Stage Face Detection
  - Face Detection Using Improved Faster RCNN
  - PyramidBox: A Context-assisted Single Shot Face Detector
  - A Fast Face Detection Method via Convolutional Neural Network
  - Beyond Trade-off: Accelerate FCN-based Face Detector with Higher Accuracy
  - Real-Time Rotation-Invariant Face Detection with Progressive Calibration Networks
  - SFace: An Efficient Network for Face Detection in Large Scale Variations
  - Survey of Face Detection on Low-quality Images
  - Anchor Cascade for Efficient Face Detection
  - Adversarial Attacks on Face Detectors using Neural Net based Constrained Optimization
  - Selective Refinement Network for High Performance Face Detection
  - DSFD: Dual Shot Face Detector
  - Learning Better Features for Face Detection with Feature Fusion and Segmentation Supervision
  - FA-RPN: Floating Region Proposals for Face Detection
  - Robust and High Performance Face Detector
  - DAFE-FD: Density Aware Feature Enrichment for Face Detection
  - Improved Selective Refinement Network for Face Detection
  - Revisiting a single-stage method for face detection
- Detect Small Faces
  - Finding Tiny Faces
  - Detecting and counting tiny faces
  - Seeing Small Faces from Robust Anchor’s Perspective
  - Face-MagNet: Magnifying Feature Maps to Detect Small Faces
  - Robust Face Detection via Learning Small Faces on Hard Images
  - SFA: Small Faces Attention Face Detector
- Person Head Detection
  - Context-aware CNNs for person head detection
  - Detecting Heads using Feature Refine Net and Cascaded Multi-scale Architecture
  - A Comparison of CNN-based Face and Head Detectors for Real-Time Video Surveillance Applications
  - FCHD: A fast and accurate head detector
- Pedestrian Detection / People Detection
  - Pedestrian Detection aided by Deep Learning Semantic Tasks
  - Deep Learning Strong Parts for Pedestrian Detection
  - Taking a Deeper Look at Pedestrians
  - Convolutional Channel Features
  - End-to-end people detection in crowded scenes
  - Learning Complexity-Aware Cascades for Deep Pedestrian Detection
  - Deep convolutional neural networks for pedestrian detection
  - Scale-aware Fast R-CNN for Pedestrian Detection
  - New algorithm improves speed and accuracy of pedestrian detection
  - Pushing the Limits of Deep CNNs for Pedestrian Detection
  - A Real-Time Deep Learning Pedestrian Detector for Robot Navigation
  - A Real-Time Pedestrian Detector using Deep Learning for Human-Aware Navigation
  - Is Faster R-CNN Doing Well for Pedestrian Detection?
  - Unsupervised Deep Domain Adaptation for Pedestrian Detection
  - Reduced Memory Region Based Deep Convolutional Neural Network Detection
  - Fused DNN: A deep neural network fusion approach to fast and robust pedestrian detection
  - Detecting People in Artwork with CNNs
  - Multispectral Deep Neural Networks for Pedestrian Detection
  - Box-level Segmentation Supervised Deep Neural Networks for Accurate and Real-time Multispectral Pedestrian Detection
  - Deep Multi-camera People Detection
  - Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters
  - What Can Help Pedestrian Detection?
  - Illuminating Pedestrians via Simultaneous Detection & Segmentation
  - Rotational Rectification Network for Robust Pedestrian Detection
  - STD-PD: Generating Synthetic Training Data for Pedestrian Detection in Unannotated Videos
  - Too Far to See? Not Really! — Pedestrian Detection with Scale-aware Localization Policy
  - Repulsion Loss: Detecting Pedestrians in a Crowd
  - Aggregated Channels Network for Real-Time Pedestrian Detection
  - Illumination-aware Faster R-CNN for Robust Multispectral Pedestrian Detection
  - Exploring Multi-Branch and High-Level Semantic Networks for Improving Pedestrian Detection
  - Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and Beyond
  - PCN: Part and Context Information for Pedestrian Detection with CNNs
  - Small-scale Pedestrian Detection Based on Somatic Topology Localization and Temporal Feature Aggregation
  - Occlusion-aware R-CNN: Detecting Pedestrians in a Crowd
  - Multispectral Pedestrian Detection via Simultaneous Detection and Segmentation
  - Pedestrian Detection with Autoregressive Network Phases
  - The Cross-Modality Disparity Problem in Multispectral Pedestrian Detection
- Vehicle Detection
  - DAVE: A Unified Framework for Fast Vehicle Detection and Annotation
  - Evolving Boxes for fast Vehicle Detection
  - Fine-Grained Car Detection for Visual Census Estimation
  - SINet: A Scale-insensitive Convolutional Neural Network for Fast Vehicle Detection
  - Label and Sample: Efficient Training of Vehicle Object Detector from Sparsely Labeled Data
  - Domain Randomization for Scene-Specific Car Detection and Pose Estimation
  - ShuffleDet: Real-Time Vehicle Detection Network in On-board Embedded UAV Imagery
- Traffic-Sign Detection
  - Traffic-Sign Detection and Classification in the Wild
  - Evaluating State-of-the-art Object Detector on Challenging Traffic Light Data
  - Detecting Small Signs from Large Images
  - Localized Traffic Sign Detection with Multi-scale Deconvolution Networks
  - Detecting Traffic Lights by Single Shot Detection
  - A Hierarchical Deep Architecture and Mini-Batch Selection Method For Joint Traffic Sign and Light Detection
- Skeleton Detection
  - Object Skeleton Extraction in Natural Images by Fusing Scale-associated Deep Side Outputs
  - DeepSkeleton: Learning Multi-task Scale-associated Deep Side Outputs for Object Skeleton Extraction in Natural Images
  - SRN: Side-output Residual Network for Object Symmetry Detection in the Wild
  - Hi-Fi: Hierarchical Feature Integration for Skeleton Detection
- Fruit Detection
  - Deep Fruit Detection in Orchards
  - Image Segmentation for Fruit Detection and Yield Estimation in Apple Orchards
- Shadow Detection
  - Fast Shadow Detection from a Single Image Using a Patched Convolutional Neural Network
  - A+D-Net: Shadow Detection with Adversarial Shadow Attenuation
  - Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal
  - Direction-aware Spatial Context Features for Shadow Detection
  - Direction-aware Spatial Context Features for Shadow Detection and Removal
- Other Detection
  - Deep Deformation Network for Object Landmark Localization
  - Fashion Landmark Detection in the Wild
  - Deep Learning for Fast and Accurate Fashion Item Detection
  - OSMDeepOD - OSM and Deep Learning based Object Detection from Aerial Imagery (formerly known as “OSM-Crosswalk-Detection”)
  - Selfie Detection by Synergy-Constraint Based Convolutional Neural Network
  - Associative Embedding: End-to-End Learning for Joint Detection and Grouping
  - Deep Cuboid Detection: Beyond 2D Bounding Boxes
  - Automatic Model Based Dataset Generation for Fast and Accurate Crop and Weeds Detection
  - Deep Learning Logo Detection with Data Expansion by Synthesising Context
  - Scalable Deep Learning Logo Detection
  - Pixel-wise Ear Detection with Convolutional Encoder-Decoder Networks
  - Automatic Handgun Detection Alarm in Videos Using Deep Learning
  - Objects as context for part detection
  - Using Deep Networks for Drone Detection
  - Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection
  - Target Driven Instance Detection
  - DeepVoting: An Explainable Framework for Semantic Part Detection under Partial Occlusion
  - VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition
  - Grab, Pay and Eat: Semantic Food Detection for Smart Restaurants
  - ReMotENet: Efficient Relevant Motion Event Detection for Large-scale Home Surveillance Videos
  - Deep Learning Object Detection Methods for Ecological Camera Trap Data
  - EL-GAN: Embedding Loss Driven Generative Adversarial Networks for Lane Detection
  - Towards End-to-End Lane Detection: an Instance Segmentation Approach
  - iCAN: Instance-Centric Attention Network for Human-Object Interaction Detection
  - Densely Supervised Grasp Detector (DSGD)
- Object Proposal
  - DeepProposal: Hunting Objects by Cascading Deep Convolutional Layers
  - Scale-aware Pixel-wise Object Proposal Networks
  - Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization
  - Learning to Segment Object Proposals via Recursive Neural Networks
  - Learning Detection with Diverse Proposals
  - ScaleNet: Guiding Object Proposal Generation in Supermarkets and Beyond
  - Improving Small Object Proposals for Company Logo Detection
  - Open Logo Detection Challenge
  - AttentionMask: Attentive, Efficient Object Proposal Generation Focusing on Small Objects
- Localization
  - Beyond Bounding Boxes: Precise Localization of Objects in Images
  - Weakly Supervised Object Localization with Multi-fold Multiple Instance Learning
  - Weakly Supervised Object Localization Using Size Estimates
  - Active Object Localization with Deep Reinforcement Learning
  - Localizing objects using referring expressions
  - LocNet: Improving Localization Accuracy for Object Detection
  - Learning Deep Features for Discriminative Localization
  - ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization
  - Ensemble of Part Detectors for Simultaneous Classification and Localization
  - STNet: Selective Tuning of Convolutional Networks for Object Localization
  - Soft Proposal Networks for Weakly Supervised Object Localization
  - Fine-grained Discriminative Localization via Saliency-guided Faster R-CNN
- Tutorials / Talks
  - Convolutional Feature Maps: Elements of efficient (and accurate) CNN-based object detection
  - Towards Good Practices for Recognition & Detection
  - Work in progress: Improving object detection and instance segmentation for small objects
  - Object Detection with Deep Learning: A Review
- Projects
  - Detectron
  - TensorBox: a simple framework for training neural networks to detect objects in images
  - Object detection in torch: Implementation of some object detection frameworks in torch
  - Using DIGITS to train an Object Detection network
  - FCN-MultiBox Detector
  - KittiBox: A car detection model implemented in Tensorflow.
  - Deformable Convolutional Networks + MST + Soft-NMS
  - How to Build a Real-time Hand-Detector using Neural Networks (SSD) on Tensorflow
  - Metrics for object detection
  - MobileNetv2-SSDLite
- Leaderboard
  - Detection Results: VOC2012
- Tools
  - BeaverDam: Video annotation tool for deep learning training labels
- Blogs
  - Convolutional Neural Networks for Object Detection
  - Introducing automatic object detection to visual search (Pinterest)
  - Deep Learning for Object Detection with DIGITS
  - Analyzing The Papers Behind Facebook’s Computer Vision Approach
  - Easily Create High Quality Object Detectors with Deep Learning
  - How to Train a Deep-Learned Object Detection Model in the Microsoft Cognitive Toolkit
  - Object Detection in Satellite Imagery, a Low Overhead Approach
  - You Only Look Twice — Multi-Scale Object Detection in Satellite Imagery With Convolutional Neural Networks
  - Faster R-CNN Pedestrian and Car Detection
  - Small U-Net for vehicle detection
  - Region of interest pooling explained
  - Supercharge your Computer Vision models with the TensorFlow Object Detection API
  - Understanding SSD MultiBox — Real-Time Object Detection In Deep Learning
  - One-shot object detection
  - An overview of object detection: one-stage methods
- deep learning object detection

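| Method | Backbone | Test size | VOC2007 | VOC2010 | VOC2012 | ILSVRC 2013 | MSCOCO 2015 | Speed |
|---|---|---|---|---|---|---|---|---|
| OverFeat | | | | | | 24.3% | | |
| R-CNN | AlexNet | | 58.5% | 53.7% | 53.3% | 31.4% | | |
| R-CNN | VGG16 | | 66.0% | | | | | |
| SPP_net | ZF-5 | | 54.2% | | | 31.84% | | |
| DeepID-Net | | | 64.1% | | | 50.3% | | |
| NoC | | | 73.3% | | 68.8% | | | |
| Fast-RCNN | VGG16 | | 70.0% | 68.8% | 68.4% | | 19.7%(@[0.5-0.95]), 35.9%(@0.5) | |
| MR-CNN | | | 78.2% | | 73.9% | | | |
| Faster-RCNN | VGG16 | | 78.8% | | 75.9% | | 21.9%(@[0.5-0.95]), 42.7%(@0.5) | 198ms |
| Faster-RCNN | ResNet101 | | 85.6% | | 83.8% | | 37.4%(@[0.5-0.95]), 59.0%(@0.5) | |
| YOLO | | | 63.4% | | 57.9% | | | 45 fps |
| YOLO | VGG-16 | | 66.4% | | | | | 21 fps |
| YOLOv2 | | 448x448 | 78.6% | | 73.4% | | 21.6%(@[0.5-0.95]), 44.0%(@0.5) | 40 fps |
| SSD | VGG16 | 300x300 | 77.2% | | 75.8% | | 25.1%(@[0.5-0.95]), 43.1%(@0.5) | 46 fps |
| SSD | VGG16 | 512x512 | 79.8% | | 78.5% | | 28.8%(@[0.5-0.95]), 48.5%(@0.5) | 19 fps |
| SSD | ResNet101 | 300x300 | | | | | 28.0%(@[0.5-0.95]) | 16 fps |
| SSD | ResNet101 | 512x512 | | | | | 31.2%(@[0.5-0.95]) | 8 fps |
| DSSD | ResNet101 | 300x300 | | | | | 28.0%(@[0.5-0.95]) | 8 fps |
| DSSD | ResNet101 | 500x500 | | | | | 33.2%(@[0.5-0.95]) | 6 fps |
| ION | | | 79.2% | | 76.4% | | | |
| CRAFT | | | 75.7% | | 71.3% | 48.5% | | |
| OHEM | | | 78.9% | | 76.3% | | 25.5%(@[0.5-0.95]), 45.9%(@0.5) | |
| R-FCN | ResNet50 | | 77.4% | | | | | 0.12sec(K40), 0.09sec(TitanX) |
| R-FCN | ResNet101 | | 79.5% | | | | | 0.17sec(K40), 0.12sec(TitanX) |
| R-FCN(ms train) | ResNet101 | | 83.6% | | 82.0% | | 31.5%(@[0.5-0.95]), 53.2%(@0.5) | |
| PVANet 9.0 | | | 84.9% | | 84.2% | | | 750ms(CPU), 46ms(TitanX) |
| RetinaNet | ResNet101-FPN | | | | | | | |
| Light-Head R-CNN | Xception* | 800/1200 | | | | | 31.5%@[0.5:0.95] | 95 fps |
| Light-Head R-CNN | Xception* | 700/1100 | | | | | 30.7%@[0.5:0.95] | 102 fps |
| STDN | | | 80.9 (07+12) | | | | | |
| RefineDet | | | 83.8 (07+12) | | 83.5 (07++12) | | 41.8 | |
| SNIP | | | | | | | 45.7 | |
| Relation-Network | | | | | | | 32.5 | |
| Cascade R-CNN | | | | | | | 42.8 | |
| MLKP | | | 80.6 (07+12) | | 77.2 (07++12) | | 28.6 | |
| Fitness-NMS | | | | | | | 41.8 | |
| RFBNet | | | 82.2 (07+12) | | | | | |
| CornerNet | | | | | | | 42.1 | |
| PFPNet | | | 84.1 (07+12) | | 83.7 (07++12) | | 39.4 | |
| Pelee | | | 70.9 (07+12) | | | | | |
| HKRM | | | 78.8 (07+12) | | | | 37.8 | |
| M2Det | | | | | | | 44.2 | |
| SIN | | | 76.0 (07+12) | | 73.1 (07++12) | | 23.2 | |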
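
The `@0.5` and `@[0.5-0.95]` suffixes in the MSCOCO column refer to the intersection-over-union (IoU) threshold at which a predicted box counts as a match for a ground-truth box. The sketch below is only an illustration of that notation, not the evaluation code behind any row of the table: the function names and the `(x1, y1, x2, y2)` box layout are assumptions made here, and a full mAP computation additionally involves ranking detections and integrating precision-recall curves.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap region, clamped at zero when disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Thresholds 0.50, 0.55, ..., 0.95, i.e. the "@[0.5-0.95]" range.
IOU_THRESHOLDS = [t / 100 for t in range(50, 100, 5)]


def thresholds_passed(pred_box, gt_box):
    """Return the IoU thresholds at which pred_box would match gt_box."""
    value = iou(pred_box, gt_box)
    return [t for t in IOU_THRESHOLDS if value >= t]


if __name__ == "__main__":
    pred = (0.0, 0.0, 10.0, 10.0)
    gt = (0.0, 0.0, 10.0, 7.2)
    print(iou(pred, gt))
    print(thresholds_passed(pred, gt))
```

A detection with a given IoU is a true positive under `@0.5` whenever that IoU is at least 0.5, while the `@[0.5-0.95]` figure averages the metric over all ten thresholds, which is why it is always the lower of the two numbers in a row.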

Papers
