Joshua_Li_

My Jumble of Computer Vision

I am going to maintain this page to record a few things about computer vision that I have read, am doing, or will have a look at. Previously I wrote short notes on the papers I had read, which is a good way to remember and understand the authors' ideas. But gradually I found that I forget much of what I have learnt, because in addition to papers I also pick up knowledge from others' blogs, online courses and reports, and I was not recording those at all. Besides, I need a place to keep a list of things that I should look at but cannot at the time I discover them. This page will be much like a catalog.

PAPERS AND PROJECTS

OBJECT/SALIENCY DETECTION

Acquisition of Localization Confidence for Accurate Object Detection (PDF, Project/Code)
A Single Shot Text Detector with Scale-adaptive Anchors (PDF)
Small-scale Pedestrian Detection Based on Somatic Topology Localization and Temporal Feature Aggregation (PDF)
Object detection at 200 Frames Per Second (PDF)
DetNet: A Backbone network for Object Detection (PDF, Reading Note)
Zero-Shot Object Detection (PDF)
Unsupervised Discovery of Object Landmarks as Structural Representations (PDF, Project/Code)
Cascade R-CNN: Delving into High Quality Object Detection (PDF, Project/Code)
Path Aggregation Network for Instance Segmentation (PDF)
ClickBAIT-v2: Training an Object Detector in Real-Time (PDF)
Single-Shot Bidirectional Pyramid Networks for High-Quality Object Detection (PDF)
Complex-YOLO: Real-time 3D Object Detection on Point Clouds (PDF)
Zero-Shot Object Detection: Learning to Simultaneously Recognize and Localize Novel Concepts (PDF)
Domain Adaptive Faster R-CNN for Object Detection in the Wild (PDF)
Chinese Text in the Wild (PDF, Project/Code)
TSSD: Temporal Single-Shot Detector Based on Attention and LSTM for Robotic Intelligent Perception (PDF)
Tiny SSD: A Tiny Single-shot Detection Deep Convolutional Neural Network for Real-time Embedded Object Detection (PDF, Reading Note)
Object Detection in Videos by Short and Long Range Object Linking (PDF)
Learning a Rotation Invariant Detector with Rotatable Bounding Box (PDF, Project/Code)
Detecting Curve Text in the Wild: New Dataset and New Solution (PDF, Project/Code)
Single Shot Text Detector with Regional Attention (PDF, Project/Code)
Single-Shot Refinement Neural Network for Object Detection (PDF, Project/Code, Reading Note)
$S^3$FD: Single Shot Scale-invariant Face Detector (PDF, Code/Project, Reading Note)
MegDet: A Large Mini-Batch Object Detector (PDF)
Light-Head R-CNN: In Defense of Two-Stage Object Detector (PDF)
Interpretable R-CNN (PDF)
Cascade Region Proposal and Global Context for Deep Object Detection (PDF)
PVANET: Deep but Lightweight Neural Networks for Real-time Object Detection (PDF, Project/Code, Reading Note)
Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks (PDF, Reading Note)
Object Detection from Video Tubelets with Convolutional Neural Networks (PDF, Reading Note)
R-FCN: Object Detection via Region-based Fully Convolutional Networks (PDF, Project/Code, Reading Note)
SSD: Single Shot MultiBox Detector (PDF, Project/Code, Reading Note)
Pushing the Limits of Deep CNNs for Pedestrian Detection (PDF, Reading Note)
Object Detection by Labeling Superpixels (PDF, Reading Note)
Crafting GBD-Net for Object Detection (PDF, Project/Code)
code for CUImage and CUVideo, the object detection champion of ImageNet 2016.
Fused DNN: A deep neural network fusion approach to fast and robust pedestrian detection (PDF, Reading Note)
Training Region-based Object Detectors with Online Hard Example Mining (PDF, Reading Note)
Detecting People in Artwork with CNNs (PDF, Project/Code)
Deeply supervised salient object detection with short connections (PDF)
Learning to detect and localize many objects from few examples (PDF)
Multi-Scale Saliency Detection using Dictionary Learning (PDF)
Straight to Shapes: Real-time Detection of Encoded Shapes (PDF)
Weakly Supervised Cascaded Convolutional Networks (PDF, Reading Note)
Speed/accuracy trade-offs for modern convolutional object detectors (PDF, Reading Note)
Object Detection via End-to-End Integration of Aspect Ratio and Context Aware Part-based Models and Fully Convolutional Networks (PDF)
Feature Pyramid Networks for Object Detection (PDF, Reading Note)
COCO-Stuff: Thing and Stuff Classes in Context (PDF)
Finding Tiny Faces (PDF)
Beyond Skip Connections: Top-Down Modulation for Object Detection (PDF, Reading Note)
YOLO9000: Better, Faster, Stronger (PDF, Project/Code, Reading Note)
Quantitative Analysis of Automatic Image Cropping Algorithms: A Dataset and Comparative Study (PDF)
To Boost or Not to Boost? On the Limits of Boosted Trees for Object Detection (PDF)
Pixel Objectness (PDF, Project/Code, Reading Note)
DSSD: Deconvolutional Single Shot Detector (PDF, Reading Note)
A Fast and Compact Salient Score Regression Network Based on Fully Convolutional Network (PDF)
Wide-Residual-Inception Networks for Real-time Object Detection (PDF)
Zoom Out-and-In Network with Recursive Training for Object Proposal (PDF, Project/Code)
Improving Object Detection with Region Similarity Learning (PDF)
Tree-Structured Reinforcement Learning for Sequential Object Localization (PDF)
Weakly Supervised Object Localization Using Things and Stuff Transfer (PDF)
Unsupervised learning from video to detect foreground objects in single images (PDF)
A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection (PDF, Project/Code)
Learning non-maximum suppression (PDF)
Real Time Image Saliency for Black Box Classifiers (PDF)
An Efficient Approach for Object Detection and Tracking of Objects in a Video with Variable Background (PDF)
RON: Reverse Connection with Objectness Prior Networks for Object Detection (PDF, Project/Code)
Deformable Part-based Fully Convolutional Network for Object Detection (PDF, Reading Note)
Recurrent Scale Approximation for Object Detection in CNN (PDF)
DSOD: Learning Deeply Supervised Object Detectors from Scratch (PDF, Project/Code, Reading Note)
PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN (PDF)
Focal Loss for Dense Object Detection (PDF)
Learning Uncertain Convolutional Features for Accurate Saliency Detection (PDF)
Optimizing Region Selection for Weakly Supervised Object Detection (PDF)
Kill Two Birds With One Stone: Boosting Both Object Detection Accuracy and Speed With adaptive Patch-of-Interest Composition (PDF)
Flow-Guided Feature Aggregation for Video Object Detection (PDF)
BlitzNet: A Real-Time Deep Network for Scene Understanding (PDF, Project/Code)
Soft Proposal Networks for Weakly Supervised Object Localization (PDF, Project/Code)
Feature-Fused SSD: Fast Detection for Small Objects (PDF)
Light Cascaded Convolutional Neural Networks for Accurate Player Detection (PDF)
Personalized Saliency and its Prediction (PDF)
WeText: Scene Text Detection under Weak Supervision (PDF)
VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition (PDF, Project/Code)

SEGMENTATION/PARSING

Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells (PDF)
Deep Learning for Semantic Segmentation on Minimal Hardware (PDF)
TernausNetV2: Fully Convolutional Network for Instance Segmentation (PDF, Project/Code)
Stacked U-Nets: A No-Frills Approach to Natural Image Segmentation (PDF, Project/Code)
Deep Object Co-Segmentation (PDF)
Fusing Hierarchical Convolutional Features for Human Body Segmentation and Clothing Fashion Classification (PDF)
ShuffleSeg: Real-time Semantic Segmentation Network (PDF)
Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation (PDF, Project/Code)
Learning random-walk label propagation for weakly-supervised semantic segmentation (PDF)
Panoptic Segmentation (PDF, Reading Note)
Learning to Segment Every Thing (PDF, Project/Code)
Deep Extreme Cut: From Extreme Points to Object Segmentation (PDF)
Instance-aware Semantic Segmentation via Multi-task Network Cascades (PDF, Project/Code)
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation (PDF, Reading Note)
Learning Deconvolution Network for Semantic Segmentation (PDF, Reading Note)
Semantic Object Parsing with Graph LSTM (PDF, Reading Note)
Bayesian SegNet: Model Uncertainty in Deep Convolutional Encoder-Decoder Architectures for Scene Understanding (PDF, Reading Note)
Learning to Segment Moving Objects in Videos (PDF, Reading Note)
Deep Structured Features for Semantic Segmentation (PDF)

We propose a highly structured neural network architecture for semantic segmentation of images that combines i) a Haar wavelet-based tree-like convolutional neural network (CNN), ii) a random layer realizing a radial basis function kernel approximation, and iii) a linear classifier. While stages i) and ii) are completely pre-specified, only the linear classifier is learned from data. Thanks to its high degree of structure, our architecture has a very small memory footprint and thus fits onto low-power embedded and mobile platforms. We apply the proposed architecture to outdoor scene and aerial image semantic segmentation and show that the accuracy of our architecture is competitive with conventional pixel classification CNNs. Furthermore, we demonstrate that the proposed architecture is data efficient in the sense of matching the accuracy of pixel classification CNNs when trained on a much smaller data set.
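The abstract above mentions a random layer that approximates a radial basis function kernel, followed by a learned linear classifier. It does not say which construction is used, so the following is only a minimal sketch under the assumption of random Fourier features; the function names, the value of `gamma`, and the feature count are illustrative and not taken from the paper.

```python
import numpy as np

def random_fourier_features(X, n_features, gamma, rng):
    """Map X (n_samples, d) to random features whose dot products
    approximate the RBF kernel exp(-gamma * ||x - y||^2)."""
    d = X.shape[1]
    # Frequencies sampled from the kernel's spectral density N(0, 2*gamma*I).
    W = rng.normal(scale=np.sqrt(2.0 * gamma), size=(d, n_features))
    # Random phases; this layer is fixed and never trained.
    b = rng.uniform(0.0, 2.0 * np.pi, size=n_features)
    return np.sqrt(2.0 / n_features) * np.cos(X @ W + b)

def rbf_kernel(X, Y, gamma):
    """Exact RBF kernel matrix, used here only to check the approximation."""
    sq = np.sum(X**2, axis=1)[:, None] + np.sum(Y**2, axis=1)[None, :] - 2.0 * X @ Y.T
    return np.exp(-gamma * sq)

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 8))   # stand-in for per-pixel feature vectors
gamma = 0.1                    # assumed kernel width
Z = random_fourier_features(X, n_features=4000, gamma=gamma, rng=rng)

# Largest entry-wise gap between the approximate and the exact kernel matrix.
max_err = np.max(np.abs(Z @ Z.T - rbf_kernel(X, X, gamma)))
```

In a pipeline of the kind the abstract describes, only a linear classifier on top of `Z` would be fitted; the random projection itself stays fixed.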
CNN-aware Binary Map for General Semantic Segmentation (PDF)
Learning to Refine Object Segments (PDF)
Clockwork Convnets for Video Semantic Segmentation (PDF, Project/Code)
Convolutional Gated Recurrent Networks for Video Segmentation (PDF)
Efficient Convolutional Neural Network with Binary Quantization Layer (PDF)
Fully Convolutional Instance-aware Semantic Segmentation (PDF, Project/Code, Reading Note)
Semantic Segmentation using Adversarial Networks (PDF)
Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes (PDF)
Deep Watershed Transform for Instance Segmentation (PDF)
InstanceCut: from Edges to Instances with MultiCut (PDF)
The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation (PDF)
Improving Fully Convolution Network for Semantic Segmentation (PDF)
Video Scene Parsing with Predictive Feature Learning (PDF)
Training Bit Fully Convolutional Network for Fast Semantic Segmentation (PDF)
Mining Pixels: Weakly Supervised Semantic Segmentation Using Image Labels (PDF)
FastMask: Segment Object Multi-scale Candidates in One Shot (PDF, Project/Code, Reading Note)
A New Convolutional Network-in-Network Structure and Its Applications in Skin Detection, Semantic Segmentation, and Artifact Reduction (PDF, Reading Note)
FusionSeg: Learning to combine motion and appearance for fully automatic segmentation of generic objects in videos (PDF)
Visual Saliency Prediction Using a Mixture of Deep Neural Networks (PDF)
PixelNet: Representation of the pixels, by the pixels, and for the pixels (PDF, Project/Code)
Super-Trajectory for Video Segmentation (PDF)
Understanding Convolution for Semantic Segmentation (PDF, Reading Note)
Adversarial Examples for Semantic Image Segmentation (PDF)
Large Kernel Matters – Improve Semantic Segmentation by Global Convolutional Network (PDF)
Deep Image Matting (PDF, Reading Note)
Mask R-CNN (PDF, Caffe Implementation, TuSimple Implementation on MXNet, TensorFlow Implementation, Reading Note)
Predicting Deeper into the Future of Semantic Segmentation (PDF)
Convolutional Oriented Boundaries: From Image Segmentation to High-Level Tasks (PDF, Project/Code)
One-Shot Video Object Segmentation (PDF, Project/Code)
Semantic Instance Segmentation via Deep Metric Learning (PDF)
Not All Pixels Are Equal: Difficulty-aware Semantic Segmentation via Deep Layer Cascade (PDF)
Semantically-Guided Video Object Segmentation (PDF)
Recurrent Multimodal Interaction for Referring Image Segmentation (PDF)
Loss Max-Pooling for Semantic Image Segmentation (PDF)
Reformulating Level Sets as Deep Recurrent Neural Network Approach to Semantic Segmentation (PDF)
Learning Video Object Segmentation with Visual Memory (PDF)
A Review on Deep Learning Techniques Applied to Semantic Segmentation (PDF)
BiSeg: Simultaneous Instance Segmentation and Semantic Segmentation with Fully Convolutional Networks (PDF)
Rethinking Atrous Convolution for Semantic Image Segmentation (PDF)
Discriminative Localization in CNNs for Weakly-Supervised Segmentation of Pulmonary Nodules (PDF)
Superpixel-based semantic segmentation trained by statistical process control (PDF)
The Devil is in the Decoder (PDF)
Semantic Segmentation with Reverse Attention (PDF)
Learning Deconvolution Network for Semantic Segmentation (PDF, Project/Code)
Depth Adaptive Deep Neural Network for Semantic Segmentation (PDF)
Semantic Instance Segmentation with a Discriminative Loss Function (PDF)
A Cost-Sensitive Visual Question-Answer Framework for Mining a Deep And-OR Object Semantics from Web Images (PDF)
ICNet for Real-Time Semantic Segmentation on High-Resolution Images (PDF, Project/Code)
Pyramid Scene Parsing Network (PDF, Project/Code, Reading Note)
Learning to Segment Instances in Videos with Spatial Propagation Network (PDF, Project/Code)
Learning Affinity via Spatial Propagation Networks (PDF, Project/Code)

TRACKING

Multiple People Tracking Using Hierarchical Deep Tracklet Re-identification (PDF)
Fully-Convolutional Siamese Networks for Object Tracking (PDF)
Joint Flow: Temporal Flow Fields for Multi Person Tracking (PDF)
Trajectory Factory: Tracklet Cleaving and Re-connection by Deep Siamese Bi-GRU for Multiple Object Tracking (PDF)
Machine Learning Methods for Solving Assignment Problems in Multi-Target Tracking (PDF)
Multi-Target, Multi-Camera Tracking by Hierarchical Clustering: Recent Progress on DukeMTMC Project (PDF)
Detect-and-Track: Efficient Pose Estimation in Videos (PDF)
Track, then Decide: Category-Agnostic Vision-based Multi-Object Tracking (PDF)
Spatially Supervised Recurrent Convolutional Neural Networks for Visual Object Tracking (PDF, Reading Note)
Joint Tracking and Segmentation of Multiple Targets (PDF, Reading Note)
Deep Tracking on the Move: Learning to Track the World from a Moving Vehicle using Recurrent Neural Networks (PDF)
Convolutional Regression for Visual Tracking (PDF)
Kernelized Correlation Filters (Project, Code1, Code2)
Online Visual Multi-Object Tracking via Labeled Random Finite Set Filtering (PDF)
SANet: Structure-Aware Network for Visual Tracking (PDF)
Semantic tracking: Single-target tracking with inter-supervised convolutional networks (PDF)
On The Stability of Video Detection and Tracking (PDF)
Dual Deep Network for Visual Tracking (PDF)
Deep Motion Features for Visual Tracking (PDF)
Robust and Real-time Deep Tracking Via Multi-Scale Domain Adaptation (PDF, Project/Code)
Instance Flow Based Online Multiple Object Tracking (PDF)
PathTrack: Fast Trajectory Annotation with Path Supervision (PDF)
Good Features to Correlate for Visual Tracking (PDF)
Re3 : Real-Time Recurrent Regression Networks for Object Tracking (PDF)
Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning (PDF, Project/Code)
Simple Online and Realtime Tracking with a Deep Association Metric (PDF)
Learning Policies for Adaptive Tracking with Deep Feature Cascades (PDF)
Recurrent Filter Learning for Visual Tracking (PDF)
Tracking Persons-of-Interest via Unsupervised Representation Adaptation (PDF)
Detect to Track and Track to Detect (PDF, Project/Code, Reading Note)

POSE ESTIMATION

Learning to Estimate 3D Human Pose and Shape from a Single Color Image (PDF, Project/Code)
Ordinal Depth Supervision for 3D Human Pose Estimation (PDF, Project/Code)
Simple Baselines for Human Pose Estimation and Tracking (PDF)
End-to-end Recovery of Human Shape and Pose (PDF, Project/Code, Code)
PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-Up, Part-Based, Geometric Embedding Model (PDF)
DensePose: Dense Human Pose Estimation In The Wild (PDF, Project/Code)
Cascaded Pyramid Network for Multi-Person Pose Estimation (PDF)
Chained Predictions Using Convolutional Neural Networks (PDF, Reading Note)
CRF-CNN: Modeling Structured Information in Human Pose Estimation (PDF)
Convolutional Pose Machines (PDF, Project/Code, Reading Note)
Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields (PDF, Project/Code, Reading Note)
Towards Accurate Multi-person Pose Estimation in the Wild (PDF, Reading Note)
Adversarial PoseNet: A Structure-aware Convolutional Network for Human Pose Estimation (PDF)
Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose (PDF, Project/Code)
Learning Feature Pyramids for Human Pose Estimation (PDF, Project/Code)
Joint Multi-Person Pose Estimation and Semantic Part Segmentation (PDF)
DeepPrior++: Improving Fast and Accurate 3D Hand Pose Estimation (PDF)
Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image (PDF)
Human Pose Regression by Combining Indirect Part Detection and Contextual Information (PDF)
Dual Path Networks for Multi-Person Human Pose Estimation (PDF)

ACTION RECOGNITION/EVENT DETECTION/VIDEO

PHD-GIFs: Personalized Highlight Detection for Automatic GIF Creation (PDF, Project/Code)
Superframes, A Temporal Video Segmentation (PDF)
Co-occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation (PDF)
2D/3D Pose Estimation and Action Recognition using Multitask Deep Learning (PDF)
Real-Time End-to-End Action Detection with Two-Stream Networks (PDF)
Learning Video-Story Composition via Recurrent Neural Network (PDF)
Real-world Anomaly Detection in Surveillance Videos (PDF)
Fully-Coupled Two-Stream Spatiotemporal Networks for Extremely Low Resolution Action Recognition (PDF)
Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward (PDF, Project/Code)
Making a long story short: A Multi-Importance Semantic for Fast-Forwarding Egocentric Videos (PDF)
Attentional Pooling for Action Recognition (PDF, Project/Code)
Pooling the Convolutional Layers in Deep ConvNets for Action Recognition (PDF, Reading Note)
Two-Stream Convolutional Networks for Action Recognition in Videos (PDF, Reading Note)
YouTube-8M: A Large-Scale Video Classification Benchmark (PDF, Project/Code)
Spatiotemporal Residual Networks for Video Action Recognition (PDF)
An End-to-End Spatio-Temporal Attention Model for Human Action Recognition from Skeleton Data (PDF)
Fast Video Classification via Adaptive Cascading of Deep Models (PDF)
Video Pixel Networks (PDF)
Plug-and-Play CNN for Crowd Motion Analysis: An Application in Abnormal Event Detection (PDF)
EM-Based Mixture Models Applied to Video Event Detection (PDF)
Video Captioning and Retrieval Models with Semantic Attention (PDF)
Title Generation for User Generated Videos (PDF)
Review of Action Recognition and Detection Methods (PDF)
Recurrent Mixture Density Network for Spatiotemporal Visual Attention (PDF)
Self-Supervised Video Representation Learning With Odd-One-Out Networks (PDF)
Recurrent Memory Addressing for describing videos (PDF)
Online Real time Multiple Spatiotemporal Action Localisation and Prediction on a Single Platform (PDF)
Real-Time Video Highlights for Yahoo Esports (PDF)
Surveillance Video Parsing with Single Frame Supervision (PDF)
Anomaly Detection in Video Using Predictive Convolutional Long Short-Term Memory Networks (PDF)
Action Recognition with Dynamic Image Networks (PDF)
ActionFlowNet: Learning Motion Representation for Action Recognition (PDF)
Video Propagation Networks (PDF)
Detecting events and key actors in multi-person videos (PDF)
A Pursuit of Temporal Accuracy in General Activity Detection (PDF, Reading Note)
Tube Convolutional Neural Network (T-CNN) for Action Detection in Videos (PDF)
Deceiving Google’s Cloud Video Intelligence API Built for Summarizing Videos (PDF)
Incremental Tube Construction for Human Action Detection (PDF)
Unsupervised Action Proposal Ranking through Proposal Recombination (PDF)
CERN: Confidence-Energy Recurrent Network for Group Activity Recognition (PDF)
Forecasting Human Dynamics from Static Images (PDF)
Interpretable 3D Human Action Analysis with Temporal Convolutional Networks (PDF)
Training object class detectors with click supervision (PDF)
Skeleton-based Action Recognition with Convolutional Neural Networks (PDF)
Online growing neural gas for anomaly detection in changing surveillance scenes (PDF)
Learning Person Trajectory Representations for Team Activity Analysis (PDF)
Concurrence-Aware Long Short-Term Sub-Memories for Person-Person Action Recognition (PDF)
Video Imagination from a Single Image with Transformation Generation (PDF, Project/Code)
Optimizing Deep CNN-Based Queries over Video Streams at Scale (PDF, Project/Code, Reading Note)
Extreme Low Resolution Activity Recognition with Multi-Siamese Embedding Learning (PDF)
Predicting Human Activities Using Stochastic Grammar (PDF)
Discriminative convolutional Fisher vector network for action recognition (PDF)
Exploiting Semantic Contextualization for Interpretation of Human Activity in Videos (PDF)
Lattice Long Short-Term Memory for Human Action Recognition (PDF)
Kinship Verification from Videos using Spatio-Temporal Texture Features and Deep Learning (PDF)
Fast-Forward Video Based on Semantic Extraction (PDF)
Emotion Detection on TV Show Transcripts with Sequence-based Convolutional Neural Networks (PDF)
ConvNet Architecture Search for Spatiotemporal Feature Learning (PDF, Project/Code, Github)
Fully Context-Aware Video Prediction (PDF)

FACE

Learning towards Minimum Hyperspherical Energy (PDF, Project/Code)
Consensus-Driven Propagation in Massive Unlabeled Data for Face Recognition (PDF, Code/Project)
Arbitrary Facial Attribute Editing: Only Change What You Want (PDF, Project/Code)
Anchor Cascade for Efficient Face Detection (PDF)
Real-Time Rotation-Invariant Face Detection with Progressive Calibration Networks (PDF, Reading Note)
MobileFaceNets: Efficient CNNs for Accurate Real-time Face Verification on Mobile Devices (PDF)
Survey of Face Detection on Low-quality Images (PDF)
PyramidBox: A Context-assisted Single Shot Face Detector (PDF)
SFace: An Efficient Network for Face Detection in Large Scale Variations (PDF)
Deep Facial Expression Recognition: A Survey (PDF)
Deep Face Recognition: A Survey (PDF)
Deep Semantic Face Deblurring (PDF, Project/Code)
Evaluation of Dense 3D Reconstruction from 2D Face Images in the Wild (PDF)
SSH: Single Stage Headless Face Detector (PDF, Project/Code)
Detecting and counting tiny faces (PDF, Project/Code)
Training Deep Face Recognition Systems with Synthetic Data (PDF)
Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification (PDF, Project/Code)
Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks (PDF, Project/Code, Code Caffe)
Deep Architectures for Face Attributes (PDF)
Face Detection with End-to-End Integration of a ConvNet and a 3D Model (PDF, Reading Note, Project/Code)
A CNN Cascade for Landmark Guided Semantic Part Segmentation (PDF, Project/Code)
Kernel Selection using Multiple Kernel Learning and Domain Adaptation in Reproducing Kernel Hilbert Space, for Face Recognition under Surveillance Scenario (PDF)
An All-In-One Convolutional Neural Network for Face Analysis (PDF)
Fast Face-swap Using Convolutional Neural Networks (PDF)
Cross-Age Reference Coding for Age-Invariant Face Recognition and Retrieval (Project/Code)
CMS-RCNN: Contextual Multi-Scale Region-based CNN for Unconstrained Face Detection (Project/Code)
Face Synthesis from Facial Identity Features (PDF)
DeepFace: Face Generation using Deep Learning (PDF)
Emotion Recognition in the Wild via Convolutional Neural Networks and Mapped Binary Patterns (PDF, Project/Code)
EmotioNet Challenge: Recognition of facial expressions of emotion in the wild (PDF)
Unrestricted Facial Geometry Reconstruction Using Image-to-Image Translation (PDF)
Semi and Weakly Supervised Semantic Segmentation Using Generative Adversarial Network (PDF)
Deep Alignment Network: A convolutional neural network for robust face alignment (PDF, Project/Code)
Scale-Aware Face Detection (PDF)
AffectNet: A Database for Facial Expression, Valence, and Arousal Computing in the Wild (PDF)
SphereFace: Deep Hypersphere Embedding for Face Recognition (PDF, Project/Code)
Age Group and Gender Estimation in the Wild with Deep RoR Architecture (PDF)
Island Loss for Learning Discriminative Features in Facial Expression Recognition (PDF)
Temporal Non-Volume Preserving Approach to Facial Age-Progression and Age-Invariant Face Recognition (PDF)

OPTICAL FLOW

LiteFlowNet: A Lightweight Convolutional Neural Network for Optical Flow Estimation (PDF, Project/Code)
DeepFlow: Large displacement optical flow with deep matching (PDF, Project/Code)
Guided Optical Flow Learning (PDF)

IMAGE PROCESSING

CartoonGAN: Generative Adversarial Networks for Photo Cartoonization (PDF)
Image Inpainting for Irregular Holes Using Partial Convolutions (PDF)
Neural Aesthetic Image Reviewer (PDF, Reading Note)
Automatic Image Cropping for Visual Aesthetic Enhancement Using Deep Neural Networks and Cascaded Regression (PDF)
Learning Intelligent Dialogs for Bounding Box Annotation (PDF)
Real-time video stabilization and mosaicking for monitoring and surveillance (PDF, Project/Code)
Learning Recursive Filter for Low-Level Vision via a Hybrid Neural Network (PDF, Project/Code)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding (PDF, Project/Code)
A Learned Representation For Artistic Style (PDF)
Let there be Color!: Joint End-to-end Learning of Global and Local Image Priors for Automatic Image Colorization with Simultaneous Classification (PDF, Project/Code)
Pixel Recurrent Neural Networks (PDF)
Conditional Image Generation with PixelCNN Decoders (PDF, Project/Code)
RAISR: Rapid and Accurate Image Super Resolution (PDF)
Photo-Quality Evaluation based on Computational Aesthetics: Review of Feature Extraction Techniques (PDF)
Fast color transfer from multiple images (PDF)
Bringing Impressionism to Life with Neural Style Transfer in Come Swim (PDF)
PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modifications (PDF, Project/Code: https://github.com/openai/pixel-cnn)
Deep Photo Style Transfer (PDF)
A Neural Representation of Sketch Drawings (PDF)
Visual Attribute Transfer through Deep Image Analogy (PDF)
Deep Semantics-Aware Photo Adjustment (PDF)
Diversified Texture Synthesis with Feed-forward Networks (PDF, Project/Code)
Real-Time Neural Style Transfer for Videos (PDF)
Creatism: A deep-learning photographer capable of creating professional work (PDF)
Deep Image Harmonization (PDF, Project/Code)
Neural Color Transfer between Images (PDF)
Deeper, Broader and Artier Domain Generalization (PDF)

3D

Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling (PDF, Project/Code)

CNN AND DEEP LEARNING

https://arxiv.org/abs/1805.07883 (PDF)
Rethinking ImageNet Pre-training (PDF)
Learning From Positive and Unlabeled Data: A Survey (PDF)
Gather-Excite: Exploiting Feature Context in Convolutional Neural Networks (PDF, Project/Code)
DropBlock: A regularization method for convolutional networks (PDF)
Differentiable Abstract Interpretation for Provably Robust Neural Networks (PDF, Project/Code)
Adding One Neuron Can Eliminate All Bad Local Minima (PDF)
Step Size Matters in Deep Learning (PDF)
Do Better ImageNet Models Transfer Better? (PDF)
Robust Classification with Convolutional Prototype Learning (PDF, Project/Code)
Fast Feature Extraction with CNNs with Pooling Layers (PDF)
Network Transplanting (PDF)
An Information-Theoretic View for Deep Learning (PDF)
Understanding Individual Neuron Importance Using Information Theory (PDF)
Understanding Convolutional Neural Network Training with Information Theory (PDF)
The unreasonable effectiveness of the forget gate (PDF)
Discovering Hidden Factors of Variation in Deep Networks (PDF)
Regularizing Deep Networks by Modeling and Predicting Label Structure (PDF)
Hierarchical Novelty Detection for Visual Object Recognition (PDF)
Guide Me: Interacting with Deep Networks (PDF)
Studying Invariances of Trained Convolutional Neural Networks (PDF)
Deep Residual Networks and Weight Initialization (PDF)
WNGrad: Learn the Learning Rate in Gradient Descent (PDF)
Understanding the Loss Surface of Neural Networks for Binary Classification (PDF)
Tell Me Where to Look: Guided Attention Inference Network (PDF)
Convolutional Neural Networks with Alternately Updated Clique (PDF, Project/Code)
Visual Interpretability for Deep Learning: a Survey (PDF)
Threat of Adversarial Attacks on Deep Learning in Computer Vision: A Survey (PDF)
CNNs are Globally Optimal Given Multi-Layer Support (PDF)
Take it in your stride: Do we need striding in CNNs? (PDF)
Gradients explode - Deep Networks are shallow - ResNet explained (PDF)
Super-Convergence: Very Fast Training of Residual Networks Using Large Learning Rates (PDF, Project/Code)
Data Distillation: Towards Omni-Supervised Learning (PDF)
Peephole: Predicting Network Performance Before Training (PDF)
AdaBatch: Adaptive Batch Sizes for Training Deep Neural Networks (PDF)
Gradual Tuning: a better way of Fine Tuning the parameters of a Deep Neural Network (PDF)
CondenseNet: An Efficient DenseNet using Learned Group Convolutions (PDF, Project/Code)
Population Based Training of Neural Networks (PDF)
Knowledge Concentration: Learning 100K Object Classifiers in a Single CNN (PDF)
Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions (PDF)
Unleashing the Potential of CNNs for Interpretable Few-Shot Learning (PDF)
Non-local Neural Networks (PDF, Caffe2)
Log-DenseNet: How to Sparsify a DenseNet (PDF)
Don’t Decay the Learning Rate, Increase the Batch Size (PDF)
Guarding Against Adversarial Domain Shifts with Counterfactual Regularization (PDF)
UberNet: Training a ‘Universal’ Convolutional Neural Network for Low-, Mid-, and High-Level Vision using Diverse Datasets and Limited Memory (PDF, Project/Code)
What makes ImageNet good for transfer learning? (PDF, Project/Code, Reading Note)

The tremendous success of features learnt using the ImageNet classification task on a wide range of transfer tasks begs the question: what are the intrinsic properties of the ImageNet dataset that are critical for learning good, general-purpose features? This work provides an empirical investigation of various facets of this question: Is more pre-training data always better? How does feature quality depend on the number of training examples per class? Does adding more object classes improve performance? For the same data budget, how should the data be split into classes? Is fine-grained recognition necessary for learning good features? Given the same number of training classes, is it better to have coarse classes or fine-grained classes? Which is better: more classes or more examples per class?
Understanding and Improving Convolutional Neural Networks via Concatenated Rectified Linear Units (PDF)
Densely Connected Convolutional Networks (PDF, Project/Code, Reading Note)
Decoupled Neural Interfaces using Synthetic Gradients (PDF)

Training directed neural networks typically requires forward-propagating data through a computation graph, followed by backpropagating error signal, to produce weight updates. All layers, or more generally, modules, of the network are therefore locked, in the sense that they must wait for the remainder of the network to execute forwards and propagate error backwards before they can be updated. In this work we break this constraint by decoupling modules by introducing a model of the future computation of the network graph. These models predict what the result of the modeled sub-graph will produce using only local information. In particular we focus on modeling error gradients: by using the modeled synthetic gradient in place of true backpropagated error gradients we decouple subgraphs, and can update them independently and asynchronously.
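To make the decoupling described in the abstract above concrete, here is a single update step written out in NumPy. It is a minimal sketch, not the authors' implementation: the two-module linear network, the linear synthetic-gradient model `g_hat = M h + c`, the squared-error loss, and all sizes and learning rates are assumptions chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
d_in, d_hid = 4, 3
lr = 0.01

W1 = rng.normal(scale=0.1, size=(d_hid, d_in))   # lower module
w2 = rng.normal(scale=0.1, size=d_hid)           # upper module
M = rng.normal(scale=0.1, size=(d_hid, d_hid))   # synthetic-gradient model weights
c = np.zeros(d_hid)                              # synthetic-gradient model bias

def synthetic_grad(h):
    """Predict dL/dh from the local activation h only."""
    return M @ h + c

x = rng.normal(size=d_in)   # one hypothetical training input
y = 1.0                     # and its target

# 1) Lower module: forward pass, then an immediate update from the
#    predicted gradient, without waiting for the upper module.
h = W1 @ x
g_hat = synthetic_grad(h)
W1_new = W1 - lr * np.outer(g_hat, x)

# 2) Upper module (could run later or elsewhere): computes the loss
#    L = 0.5 * err**2 and the true gradient with respect to h.
err = w2 @ h - y
g_true = err * w2
w2_new = w2 - lr * err * h

# 3) The synthetic-gradient model is regressed onto the true gradient.
e = g_hat - g_true
M_new = M - lr * np.outer(e, h)
c_new = c - lr * e

# Prediction error on this sample before and after the regression step.
err_before = np.linalg.norm(e)
err_after = np.linalg.norm(M_new @ h + c_new - g_true)
```

The point of the arrangement is step 1: the lower module's weights change using only `h` and `x`, so it never blocks on the backward pass of the rest of the network.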
Rethinking the Inception Architecture for Computer Vision (PDF, Reading Note)

This paper discusses several network design choices, including factorizing convolutions into smaller and asymmetric kernels, the utility of auxiliary classifiers, and reducing grid size using convolution stride rather than pooling.
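The saving from factorization is easy to check with a little arithmetic. The sketch below counts convolution weights (biases ignored) for a hypothetical layer with 64 input and 64 output channels; the channel count and the helper name `conv_params` are assumptions made only for this example.

```python
def conv_params(kh, kw, c_in, c_out):
    """Number of weights in a kh x kw convolution, ignoring biases."""
    return kh * kw * c_in * c_out

c = 64  # hypothetical channel count, same for input and output

# One 5x5 convolution versus two stacked 3x3 convolutions
# (the stack covers the same 5x5 receptive field).
five_by_five = conv_params(5, 5, c, c)
two_three_by_three = 2 * conv_params(3, 3, c, c)

# One 3x3 convolution versus a 1x3 followed by a 3x1 (asymmetric kernels).
three_by_three = conv_params(3, 3, c, c)
asymmetric = conv_params(1, 3, c, c) + conv_params(3, 1, c, c)

print(five_by_five, two_three_by_three)  # 102400 73728
print(three_by_three, asymmetric)        # 36864 24576
```

With equal channel counts, the two stacked 3x3 layers use 18/25 of the weights of the 5x5 layer (a 28% reduction), and the 1x3 plus 3x1 pair uses 6/9 of the weights of the 3x3 layer (a 33% reduction).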
Factorized Convolutional Neural Networks (PDF, Reading Note)
Do semantic parts emerge in Convolutional Neural Networks? (PDF, Reading Note)
A Critical Review of Recurrent Neural Networks for Sequence Learning (PDF)
Image Compression with Neural Networks (Project/Code)
Graph Convolutional Networks (Project/Code)
Understanding intermediate layers using linear classifier probes (PDF, Reading Note)
Learning What and Where to Draw (PDF, Project/Code)
On the interplay of network structure and gradient convergence in deep learning (PDF)
Deep Learning with Separable Convolutions (PDF)
Grad-CAM: Why did you say that? Visual Explanations from Deep Networks via Gradient-based Localization (PDF, Project/Code)
Optimization of Convolutional Neural Network using Microcanonical Annealing Algorithm (PDF)
Deep Pyramidal Residual Networks (PDF)
Impatient DNNs - Deep Neural Networks with Dynamic Time Budgets (PDF)
Uncertainty in Deep Learning (PDF, Project/Code)
This is the PhD Thesis of Yarin Gal.
Tensorial Mixture Models (PDF, Project/Code)
Multifaceted Feature Visualization: Uncovering the Different Types of Features Learned By Each Neuron in Deep Neural Networks (PDF)
Why Deep Neural Networks? (PDF)
Local Similarity-Aware Deep Feature Embedding (PDF)
A Review of 40 Years of Cognitive Architecture Research: Focus on Perception, Attention, Learning and Applications (PDF)
Professor Forcing: A New Algorithm for Training Recurrent Networks (PDF)
On the expressive power of deep neural networks (PDF)
What Is the Best Practice for CNNs Applied to Visual Instance Retrieval? (PDF)
Deep Convolutional Neural Network Design Patterns (PDF, Project/Code)
Tricks from Deep Learning (PDF)
A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models (PDF)
Multi-Shot Mining Semantic Part Concepts in CNNs (PDF)
Aggregated Residual Transformations for Deep Neural Networks (PDF, Reading Note)
PolyNet: A Pursuit of Structural Diversity in Very Deep Networks (PDF)
On the Exploration of Convolutional Fusion Networks for Visual Recognition (PDF)
ResFeats: Residual Network Based Features for Image Classification (PDF)
Object Recognition with and without Objects (PDF)
LCNN: Lookup-based Convolutional Neural Network (PDF, Reading Note)
Inductive Bias of Deep Convolutional Networks through Pooling Geometry (PDF, Project/Code)
Wider or Deeper: Revisiting the ResNet Model for Visual Recognition (PDF, Reading Note)
Multi-Scale Context Aggregation by Dilated Convolutions (PDF, Project/Code)
Large-Margin Softmax Loss for Convolutional Neural Networks (PDF, mxnet Code, Caffe Code)
Adversarial Examples Detection in Deep Networks with Convolutional Filter Statistics (PDF)
Feedback Networks (PDF)
Visualizing Residual Networks (PDF)
Convolutional Oriented Boundaries: From Image Segmentation to High-Level Tasks (PDF, Project/Code)
Understanding trained CNNs by indexing neuron selectivity (PDF)
Benchmarking State-of-the-Art Deep Learning Software Tools (PDF, Project/Code)
Batch Renormalization: Towards Reducing Minibatch Dependence in Batch-Normalized Models (PDF)
Visualizing Deep Neural Network Decisions: Prediction Difference Analysis (PDF, Project/Code)
ShaResNet: reducing residual network parameter number by sharing weights (PDF)
Deep Forest: Towards An Alternative to Deep Neural Networks (PDF, Project/Code)
All You Need is Beyond a Good Init: Exploring Better Solution for Training Extremely Deep Convolutional Neural Networks with Orthonormality and Modulation (PDF)
Genetic CNN (PDF)
Deformable Convolutional Networks (PDF)
Quality Resilient Deep Neural Networks (PDF)
How ConvNets model Non-linear Transformations (PDF)
Active Convolution: Learning the Shape of Convolution for Image Classification (PDF)
Multi-Scale Dense Convolutional Networks for Efficient Prediction (PDF, Project/Code)
Coordinating Filters for Faster Deep Neural Networks (PDF, Project/Code)
A Genetic Programming Approach to Designing Convolutional Neural Network Architectures (PDF)
On Generalization and Regularization in Deep Learning (PDF)
Interpretable Explanations of Black Boxes by Meaningful Perturbation (PDF)
Energy Propagation in Deep Convolutional Neural Networks (PDF)
Introspection: Accelerating Neural Network Training By Learning Weight Evolution (PDF)
Deeply-Supervised Nets (PDF)
Speeding up Convolutional Neural Networks By Exploiting the Sparsity of Rectifier Units (PDF)
Inception Recurrent Convolutional Neural Network for Object Recognition (PDF)
The Landscape of Deep Learning Algorithms (PDF)
Pixel Deconvolutional Networks (PDF)
Dilated Residual Networks (PDF)
A Kernel Redundancy Removing Policy for Convolutional Neural Network (PDF)
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour (PDF)
Learning Spatial Regularization with Image-level Supervisions for Multi-label Image Classification (PDF, Project/Code, Reading Note)
VisualBackProp: efficient visualization of CNNs (PDF)
Pruning Convolutional Neural Networks for Resource Efficient Inference (PDF, Project/Code)
Zero-Shot Learning - A Comprehensive Evaluation of the Good, the Bad and the Ugly (PDF, Project/Code)
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices (PDF, Caffe Implementation)
Submanifold Sparse Convolutional Networks (PDF, Project/Code)
Dual Path Networks (PDF)
ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression (PDF, Project/Code, Reading Note)
Memory-Efficient Implementation of DenseNets (PDF)
Residual Attention Network for Image Classification (PDF, Project/Code)
An Effective Training Method For Deep Convolutional Neural Network (PDF)
Learning to Transfer (PDF)
Learning Efficient Convolutional Networks through Network Slimming (PDF, Project/Code)
Super-Convergence: Very Fast Training of Residual Networks Using Large Learning Rates (PDF, Project/Code)
Hierarchical loss for classification (PDF)
Convolutional Gaussian Processes (PDF, Project/Code)
Interpretable Convolutional Neural Networks (PDF)
What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision? (PDF)
Porcupine Neural Networks: (Almost) All Local Optima are Global (PDF)
Generalization in Deep Learning (PDF)
A systematic study of the class imbalance problem in convolutional neural networks (PDF)
Interpretable Transformations with Encoder-Decoder Networks (PDF, Project/Code)
One pixel attack for fooling deep neural networks (PDF)

SINGLE-SHOT/UNSUPERVISED LEARNING

Zero-Shot Object Detection by Hybrid Region Embedding (PDF, Project/Code)
Deep Triplet Ranking Networks for One-Shot Recognition (PDF)
Avatar-Net: Multi-scale Zero-shot Style Transfer by Feature Decoration (PDF)

GAN

Outfit Generation and Style Extraction via Bidirectional LSTM and Autoencoder (PDF)
Pioneer Networks: Progressively Growing Generative Autoencoder (PDF)
Transferring GANs: generating images from limited data (PDF, Project/Code)
Painting Generation Using Conditional Generative Adversarial Net (PDF, Project/Code)
MGGAN: Solving Mode Collapse using Manifold Guided Training (PDF)
Multimodal Unsupervised Image-to-Image Translation (PDF, Project/Code)
Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and Beyond (PDF)
Face Aging with Contextual Generative Adversarial Nets (PDF, Project/Code)
Deformable GANs for Pose-based Human Image Generation (PDF, Project/Code)
ComboGAN: Unrestrained Scalability for Image Domain Translation (PDF, Project/Code)
Eye In-Painting with Exemplar Generative Adversarial Networks (PDF)
Disentangled Person Image Generation (PDF)
Fader Networks: Manipulating Images by Sliding Attributes (PDF, Project/Code)
Are GANs Created Equal? A Large-Scale Study (PDF)
StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation (PDF, Project/Code)
Two Birds with One Stone: Iteratively Learn Facial Attributes with GANs (PDF, Project/Code)
Spectral Normalization for Generative Adversarial Networks (PDF)
XGAN: Unsupervised Image-to-Image Translation for many-to-many Mappings (PDF)
How Generative Adversarial Nets and its variants Work: An Overview of GAN (PDF)
DNA-GAN: Learning Disentangled Representations from Multi-Attribute Images (PDF, Project/Code)
Sobolev GAN (PDF)
Data Augmentation Generative Adversarial Networks (PDF)
Conditional Autoencoders with Adversarial Information Factorization (PDF, Project/Code)
Progressive Growing of GANs for Improved Quality, Stability, and Variation (PDF, Project/Code, Torch, PyTorch, Reading Note)
Bayesian GAN (PDF, Project/Code)
Metric Learning-based Generative Adversarial Network (PDF)
Flexible Prior Distributions for Deep Generative Models (PDF)
Data Augmentation in Classification using GAN (PDF)
Semantically Decomposing the Latent Spaces of Generative Adversarial Networks (PDF)
Multi-View Data Generation Without View Supervision (PDF)
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks (PDF)
Generative Adversarial Networks (PDF)
Stacked Generative Adversarial Networks (PDF)
Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks (PDF)
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks (PDF)
Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks (PDF)
NIPS 2016 Tutorial: Generative Adversarial Networks (PDF)
Wasserstein GAN (PDF)
Adversarial Discriminative Domain Adaptation (PDF, Reading Note)
Generative Adversarial Nets with Labeled Data by Activation Maximization (PDF)
Triple Generative Adversarial Nets (PDF)
On the Quantitative Evaluation of Deep Generative Models (PDF)
Adversarial Transformation Networks: Learning to Generate Adversarial Examples (PDF)
Improved Training of Wasserstein GANs (PDF, Project/Code)
Generate To Adapt: Aligning Domains using Generative Adversarial Networks (PDF)
Adversarial Generator-Encoder Networks (PDF, Project/Code)
Training Triplet Networks with GAN (PDF)
Multi-Agent Diverse Generative Adversarial Networks (PDF)
GP-GAN: Towards Realistic High-Resolution Image Blending (PDF, Project/Code)
BEGAN: Boundary Equilibrium Generative Adversarial Networks (PDF)
MAGAN: Margin Adaptation for Generative Adversarial Networks (PDF)
Pose Guided Person Image Generation (PDF)
On the Effects of Batch and Weight Normalization in Generative Adversarial Networks (PDF, Project/Code)
Aesthetic-Driven Image Enhancement by Adversarial Learning (PDF)
VEEGAN: Reducing Mode Collapse in GANs using Implicit Variational Learning (PDF, Project/Code)
MoCoGAN: Decomposing Motion and Content for Video Generation (PDF, Project/Code)
Generative Adversarial Networks: An Overview (PDF: https://arxiv.org/abs/1710.07035)
SalGAN: Visual Saliency Prediction with Generative Adversarial Networks (PDF, Project/Code)

MACHINE LEARNING

Metric Learning with Adaptive Density Discrimination (PDF, PyTorch, TF)
Accelerated Gradient Descent Escapes Saddle Points Faster than Gradient Descent (PDF)
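Computer Vision and Machine Learning [Random Forests]
Computer Vision and Machine Learning [Activation Functions in Deep Learning]
I Love Machine Learning: A Machine Learning Resource Site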
Bayesian Reasoning and Machine Learning
Stochastic Gradient Descent as Approximate Bayesian Inference (PDF)

LIGHT-WEIGHT MODEL/EMBEDDED/MOBILE/MODEL COMPRESSION

ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware (PDF, Project/Code)
FD-MobileNet: Improved MobileNet with a Fast Downsampling Strategy (PDF)
Quantization Mimic: Towards Very Tiny CNN for Object Detection (PDF)
Pelee: A Real-Time Object Detection System on Mobile Devices (PDF, Project/Code, Reading Note)
MobileNetV2: Inverted Residuals and Linear Bottlenecks (PDF, Reading Note)
SBNet: Sparse Blocks Network for Fast Inference (PDF, Project/Code)
IGCV2: Interleaved Structured Sparse Convolutional Neural Networks (PDF)
FitNets: Hints for Thin Deep Nets (PDF)
Building Efficient ConvNets using Redundant Feature Pruning (PDF, Project/Code)
Multi-Scale Dense Networks for Resource Efficient Image Classification (PDF)
Net-Trim: Convex Pruning of Deep Neural Networks with Performance Guarantee (PDF)
NISP: Pruning Networks using Neuron Importance Score Propagation (PDF)
Caffeinated FPGAs: FPGA Framework For Convolutional Neural Networks (PDF)
Comprehensive Evaluation of OpenCL-based Convolutional Neural Network Accelerators in Xilinx and Altera FPGAs (PDF)
FINN: A Framework for Fast, Scalable Binarized Neural Network Inference (PDF)
Two-Bit Networks for Deep Learning on Resource-Constrained Embedded Devices (PDF)
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size (PDF, Project/Code)
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications (PDF, Caffe Implementation, Reading Note)
Binarized Convolutional Neural Networks with Separable Filters for Efficient Hardware Acceleration (PDF)
Channel Pruning for Accelerating Very Deep Neural Networks (PDF, Project/Code)
Quantized Convolutional Neural Networks for Mobile Devices (PDF, Project/Code)
Squeeze-and-Excitation Networks (PDF)
Domain-adaptive deep network compression (PDF)
Embedded Binarized Neural Networks (PDF)
Keynote: Small Neural Nets Are Beautiful: Enabling Embedded Systems with Small Deep-Neural-Network Architectures (PDF)
A Survey of Model Compression and Acceleration for Deep Neural Networks (https://arxiv.org/abs/1710.09282)

ReID

Video-based Person Re-identification via 3D Convolutional Networks and Non-local Attention (PDF)
Attention-Aware Compositional Network for Person Re-identification (PDF)
Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification (PDF, Project/Code)
Features for Multi-Target Multi-Camera Tracking and Re-Identification (PDF)
Video Person Re-identification by Temporal Residual Learning (PDF)
Harmonious Attention Network for Person Re-Identification (PDF)
In Defense of the Triplet Loss for Person Re-Identification (PDF)
Deep Spatial Feature Reconstruction for Partial Person Re-identification: Alignment-Free Approach (PDF)
AlignedReID: Surpassing Human-Level Performance in Person Re-Identification (PDF)
A Discriminatively Learned CNN Embedding for Person Re-identification (PDF, Project/Code)
Learning Deep Neural Networks for Vehicle Re-ID with Visual-spatio-temporal Path Proposals (PDF)
Beyond triplet loss: a deep quadruplet network for person re-identification (PDF)
Person Re-identification by Local Maximal Occurrence Representation and Metric Learning (PDF, Project/Code)
Person Re-identification: Past, Present and Future (PDF)
Unsupervised Person Re-identification: Clustering and Fine-tuning (PDF, Project/Code)
Jointly Attentive Spatial-Temporal Pooling Networks for Video-based Person Re-Identification (PDF)
Divide and Fuse: A Re-ranking Approach for Person Re-identification (PDF)
Learning Deep Context-aware Features over Body and Latent Parts for Person Re-identification (PDF)
HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis (PDF, Project/Code)

FASHION

Billion-scale Commodity Embedding for E-commerce Recommendation in Alibaba (PDF)
Visually-Aware Fashion Recommendation and Design with Generative Image Models (PDF)
Be Your Own Prada: Fashion Synthesis with Structural Coherence (PDF, Project/Code, Reading Note)
Style2Vec: Representation Learning for Fashion Items from Style Sets (PDF)
Dress like a Star: Retrieving Fashion Products from Videos (PDF)
The Conditional Analogy GAN: Swapping Fashion Articles on People Images (PDF)

OTHER

Deep Clustering for Unsupervised Learning of Visual Features (PDF)
Detecting Visual Relationships Using Box Attention (PDF)
Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition (PDF, Project/Code)
Learning to See in the Dark (PDF)
A Variational U-Net for Conditional Appearance and Shape Generation (PDF, Project/Code)
Synthesizing Images of Humans in Unseen Poses (PDF)
End-to-end weakly-supervised semantic alignment (PDF, Project/Code)
Dense Optical Flow based Change Detection Network Robust to Difference of Camera Viewpoints (PDF)
Dual-Path Convolutional Image-Text Embedding (PDF, Project/Code)
The Promise and Peril of Human Evaluation for Model Interpretability (PDF)
Semantic Image Retrieval via Active Grounding of Visual Situations (PDF)
LIFT: Learned Invariant Feature Transform (PDF)
Learning Aligned Cross-Modal Representations from Weakly Aligned Data (PDF, Project/Code)
Multi-Task Curriculum Transfer Deep Learning of Clothing Attributes (PDF)
End-to-end Learning of Deep Visual Representations for Image Retrieval (PDF)
SoundNet: Learning Sound Representations from Unlabeled Video (PDF)
Bags of Local Convolutional Features for Scalable Instance Search (PDF, Project/Code)
Universal Correspondence Network (PDF, Project/Code)
Judging a Book By its Cover (PDF)
Generalisation and Sharing in Triplet Convnets for Sketch based Visual Search (PDF)
Analysis and Optimization of Loss Functions for Multiclass, Top-k, and Multilabel Classification (PDF)
Automatic generation of large-scale handwriting fonts via style learning (PDF)
Image Retrieval with Deep Local Features and Attention-based Keypoints (PDF)
Visual Discovery at Pinterest (PDF)
Learning to Detect Human-Object Interactions (PDF, Project/Code, Reading Note)
Learning Deep Features via Congenerous Cosine Loss for Person Recognition (PDF)
Large-Scale Evolution of Image Classifiers (PDF)
Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection (PDF)
Twitter100k: A Real-world Dataset for Weakly Supervised Cross-Media Retrieval (PDF, Project/Code)
Mixture of Counting CNNs: Adaptive Integration of CNNs Specialized to Specific Appearance for Crowd Counting (PDF)
Computer Vision for Autonomous Vehicles: Problems, Datasets and State-of-the-Art (PDF, Project/Code)
Learning Features by Watching Objects Move (PDF, Project/Code)
GMS: Grid-based Motion Statistics for Fast, Ultra-robust Feature Correspondence (PDF, Project/Code)
ResnetCrowd: A Residual Deep Learning Architecture for Crowd Counting, Violent Behaviour Detection and Crowd Density Level Classification (PDF)
Learning Cross-modal Embeddings for Cooking Recipes and Food Images (PDF, Project/Code)
Convolutional neural network architecture for geometric matching (PDF, Project/Code)
Semantic Compositional Networks for Visual Captioning (PDF, Project/Code)
CNN-based Cascaded Multi-task Learning of High-level Prior and Density Estimation for Crowd Counting (PDF)
Understanding Black-box Predictions via Influence Functions (PDF)
Learning a Repression Network for Precise Vehicle Search (PDF)
Visual Graph Mining (PDF)
A Deep Multimodal Approach for Cold-start Music Recommendation (PDF)
A Multilayer-Based Framework for Online Background Subtraction with Freely Moving Cameras (PDF)
A self-organizing neural network architecture for learning human-object interactions (PDF)

INTERESTING FINDS

RESOURCES/PERSPECTIVES

Build your own x
PyTorch Exercise Codes for Deep Learning Researchers
Python Regular Expressions Cheat Sheet
PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
A Gentle Introduction to Transfer Learning for Image Classification
GAN Timeline
A timeline showing the development of Generative Adversarial Networks (GAN).
arXiv(Computer Vision and Pattern Recognition)
A good place to explore the latest papers.
Awesome Computer Vision
A curated list of awesome computer vision resources.
Awesome Deep Vision
A curated list of deep learning resources for computer vision.
Awesome MXNet
This page contains a curated list of awesome MXNet examples, tutorials and blogs.
Awesome TensorFlow
A curated list of awesome TensorFlow experiments, libraries, and projects.
gans-awesome-applications
Curated list of awesome GAN applications and demonstrations.
Deep Reinforcement Learning survey
This paper list is a bit different from others: the author adds opinions and summaries. However, to understand a paper fully, you still have to read it yourself!
TensorFlow Official Documentation (Chinese Edition)
TensorTalk
A place to find code for the latest work.
OTB Results
Object tracking benchmark
Adversarial Nets Papers
Creating Human-Level AI
cv-tricks.com
Find deep learning models for your mobile platform
ICCV 2017 Open Access Repository

PROJECTS

Data Augmentation for Computer Vision with PyTorch
Neural Network Distiller
Distiller is an open-source Python package for neural network compression research.
Neural Network Tools: Converter, Constructor and Analyser
For Caffe, PyTorch, TensorFlow, Darknet and so on.
TensorFlow Examples
TensorFlow tutorial with implementations of popular machine learning algorithms. This tutorial was designed for easily diving into TensorFlow through examples. It is suitable for beginners who want to find clear and concise examples about TensorFlow. For readability, the tutorial includes both notebooks and code with explanations.
TensorFlow Tutorials
These tutorials are intended for beginners in Deep Learning and TensorFlow. Each tutorial covers a single topic. The source-code is well-documented. There is a YouTube video for each tutorial.
Home Surveillance with Facial Recognition
Deep Learning algorithms with TensorFlow
This repository is a collection of various Deep Learning algorithms implemented using the TensorFlow library. This package is intended as a command line utility you can use to quickly train and evaluate popular Deep Learning models and maybe use them as benchmark/baseline in comparison to your custom models/datasets.
TensorLayer
TensorLayer is designed for use by both researchers and engineers; it is a transparent library built on top of Google TensorFlow. It is designed to provide a higher-level API to TensorFlow in order to speed up experimentation and development. TensorLayer is easy to extend and modify. In addition, we provide many examples and tutorials to help you go through deep learning and reinforcement learning.
Easily Create High Quality Object Detectors with Deep Learning
Using dlib to train a CNN to detect.
Command Line Neural Network
Neuralcli provides a simple command line interface to a Python implementation of a simple classification neural network. Neuralcli offers a quick and easy way to get instant feedback on a hypothesis or to play around with one of the most popular concepts in machine learning today.
LSTM for Human Activity Recognition
Human activity recognition using a smartphone dataset and an LSTM RNN. The project is based on TensorFlow. An MXNet implementation is MXNET-Scala Human Activity Recognition.
YOLO in caffe
This is a Caffe implementation of YOLO: Real-Time Object Detection.
SSD: Single Shot MultiBox Object Detector in mxnet
MTCNN face detection and alignment in MXNet
This is a Python/MXNet implementation of Zhang’s work.
CNTK Examples: Image/Detection/Fast R-CNN
Self Driving (Toy) Ferrari
Finding Lane Lines on the Road
Magenta
Magenta is a project from the Google Brain team that asks: Can we use machine learning to create compelling art and music? If so, how? If not, why not?
Adversarial Nets Papers
The classical Papers about adversarial nets
Mushreco
Make a photo of a mushroom and see which species it is. Determine over 200 different species.
Neural Enhance
The neural network is hallucinating details based on its training from example images. It’s not reconstructing your photo exactly as it would have been if it was HD. That’s only possible in Hollywood — but using deep learning as “Creative AI” works and it is just as cool!
CNN Models by CVGJ
This repository contains convolutional neural network (CNN) models trained on ImageNet by Marcel Simon at the Computer Vision Group Jena (CVGJ) using the Caffe framework. Each model is in a separate subfolder and contains everything needed to reproduce the results. This repository currently contains the batch-normalization variants of AlexNet and VGG19 as well as the training code for Residual Networks (ResNet).
YOLO2

YOLOv2 uses a few tricks to improve training and increase performance. Like Overfeat and SSD we use a fully-convolutional model, but we still train on whole images, not hard negatives. Like Faster R-CNN we adjust priors on bounding boxes instead of predicting the width and height outright. However, we still predict the x and y coordinates directly. The full details are in our paper soon to be released on Arxiv, stay tuned!
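The difference between adjusting a prior and predicting a size outright can be sketched in a few lines. The decoding below follows the parameterization as I understand it from the YOLO9000 paper (a sigmoid offset inside the grid cell for the center, an exponential scale on the prior for the size). The function name, argument layout and units (grid cells) are assumptions made for this example, not code from the project.

```python
import math

def sigmoid(t):
    """Logistic function; keeps the predicted center inside its grid cell."""
    return 1.0 / (1.0 + math.exp(-t))

def decode_box(t, cell, prior):
    """Turn raw network outputs into a box, all in grid-cell units.

    t     -- (tx, ty, tw, th), raw outputs for one anchor
    cell  -- (cx, cy), top-left corner of the grid cell
    prior -- (pw, ph), width and height of the anchor (prior) box
    """
    tx, ty, tw, th = t
    cx, cy = cell
    pw, ph = prior
    bx = cx + sigmoid(tx)    # center x: predicted directly, relative to the cell
    by = cy + sigmoid(ty)    # center y: predicted directly, relative to the cell
    bw = pw * math.exp(tw)   # width: a multiplicative adjustment of the prior
    bh = ph * math.exp(th)   # height: a multiplicative adjustment of the prior
    return bx, by, bw, bh

# All-zero outputs give the cell center and the unmodified prior.
print(decode_box((0.0, 0.0, 0.0, 0.0), cell=(3, 4), prior=(1.5, 2.0)))
# (3.5, 4.5, 1.5, 2.0)
```

With this parameterization the network only has to learn small corrections to a reasonable default shape, while the center is still tied directly to the cell that makes the prediction.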
Lightened CNN for Deep Face Representation
The Deep Face Representation Experiment is based on Convolution Neural Network to learn a robust feature for face verification task.
Recurrent dreams and filling in
MTCNN in MXnet
openai-gemm

Open single and half precision gemm implementations. The main speedups over cublas are with small minibatch and in fp16 data formats.
Neural Style

Style transfer with MXNet.
Can Convolutional Neural Networks Crack Sudoku Puzzles?
cleverhans

This repository contains the source code for cleverhans, a Python library to benchmark machine learning systems’ vulnerability to adversarial examples.
A deep learning traffic light detector using dlib and a few images from Google street view
Paints Chainer
Calculate deep convolution neurAl network on Cell Unit
Deep Video Analytics
Deep Video Analytics provides a platform for indexing and extracting information from videos and images. Deep learning detection and recognition algorithms are used for indexing individual frames / images along with detected objects. The goal of Deep Video Analytics is to become a quickly customizable platform for developing visual & video analytics applications, while benefiting from seamless integration with state-of-the-art models released by the vision research community.
Yolo_mark
Windows GUI for marking bounding boxes of objects in images for training Yolo v2
Yolo-Windows v2 - Windows version of Yolo Convolutional Neural Networks
An Unsupervised Distance Learning Framework for Multimedia Retrieval
awesome-deep-vision-web-demo
Mini Caffe
Minimal runtime core of Caffe, Forward only, GPU support and Memory efficiency.
Picasso: A free open-source visualizer for Convolutional Neural Networks
Picasso is a free open-source (Eclipse Public License) DNN visualization tool that gives you partial occlusion and saliency maps with minimal fuss.
pix2code: Generating Code from a Graphical User Interface Screenshot
MTCNN-light
This repository is an implementation of MTCNN (“Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Neural Networks”) in C++ with no framework; it only needs OpenCV and OpenBLAS.
MobileNet-MXNet
This is an MXNet implementation of Google’s MobileNets.
NoScope: 1000x Faster Deep Learning Queries over Video
Caffe2 C++ Tutorials and Examples
Web Image Downloader Tools

NEWS/BLOGS

A Comprehensive Hands-on Guide to Transfer Learning with Real-World Applications in Deep Learning
Attention? Attention!
Intel’s Neural Compute Stick 2 is 8 times faster than its predecessor
How fast is my model?
Depthwise separable convolutions for machine learning
The Building Blocks of Interpretability
Setting the learning rate of your neural network.
The Root Cause of Slow Neural Net Training
Why is it hard to train deep neural networks? Degeneracy, not vanishing gradients, is the key
ResNet, AlexNet, VGG, Inception: Understanding various architectures of Convolutional Networks
Neural Networks For Recommender Systems
MIT Technology Review
A good place to keep up with the trends.
LAB41
Lab41 is a Silicon Valley challenge lab where experts from the U.S. Intelligence Community (IC), academia, industry, and In-Q-Tel come together to gain a better understanding of how to work with — and ultimately use — big data.
Partnership on AI
Amazon, DeepMind/Google, Facebook, IBM, and Microsoft announced that they will create a non-profit organization that will work to advance public understanding of artificial intelligence technologies (AI) and formulate best practices on the challenges and opportunities within the field. Academics, non-profits, and specialists in policy and ethics will be invited to join the Board of the organization, named the Partnership on Artificial Intelligence to Benefit People and Society (Partnership on AI).
Recommendations from 爱可可-爱生活 are well worth a look
Guide to deploying deep-learning inference networks and realtime object recognition tutorial for NVIDIA Jetson TX1
A Return to Machine Learning
This post is aimed at artists and other creative people who are interested in a survey of recent developments in machine learning research that intersect with art and culture. If you’ve been following ML research recently, you might find some of the experiments interesting but will want to skip most of the explanations.
ResNets, HighwayNets, and DenseNets, Oh My!
This post walks through the logic behind three recent deep learning architectures: ResNet, HighwayNet, and DenseNet. Each makes it more feasible to successfully train deep networks by overcoming the limitations of traditional network design.
How to build a robot that “sees” with $100 and TensorFlow

I wanted to build a robot that could recognize objects. Years of experience building computer programs and doing test-driven development have turned me into a menace working on physical projects. In the real world, testing your buggy device can burn down your house, or at least fry your motor and force you to wait a couple of days for replacement parts to arrive.
Navigating the unsupervised learning landscape
Unsupervised learning is the Holy Grail of Deep Learning. The goal of unsupervised learning is to create general systems that can be trained with little data. Very little data.
Deconvolution and Checkerboard Artifacts
Facial Recognition on a Jetson TX1 in Tensorflow
Here’s a way to hack a facial recognition system together in a relatively short time on NVIDIA’s Jetson TX1.
Deep Learning with Generative and Generative Adversarial Networks – ICLR 2017 Discoveries
This blog post gives an overview of papers related to Deep Learning with Generative and Adversarial Networks submitted to ICLR 2017.
Unsupervised Deep Learning – ICLR 2017 Discoveries
This blog post gives an overview of papers related to Unsupervised Deep Learning submitted to ICLR 2017.
You Only Look Twice — Multi-Scale Object Detection in Satellite Imagery With Convolutional Neural Networks
Deep Learning isn’t the brain
iSee: Using deep learning to remove eyeglasses from faces
Decoding The Thought Vector
Algorithmia will help you make your own AI-powered photo filters
Deep Learning Enables You to Hide Screen when Your Boss is Approaching
Dual Learning: A New Machine Learning Paradigm
How to Train a GAN? Tips and tricks to make GANs work

While research in Generative Adversarial Networks (GANs) continues to improve the fundamental stability of these models, we use a bunch of tricks to train them and make them stable day to day.
Highlights of IEEE Big Data 2016: Nearest Neighbours, Outliers and Deep Learning
Some CNN visualization tools and techniques

The author’s other posts are also worth reading.
Deep Learning 2016: The Year in Review
GANs will change the world
colah’s blog
Analysis of Dropout
NIPS 2016 Review
[Ranking] Top 16 Most Popular Deep Learning Application Projects on GitHub (continuously updated)
Why use SVM?
TensorFlow Image Recognition on a Raspberry Pi
Building Your Own Deep Learning Box
Vehicle tracking using a support vector machine vs. YOLO
Understanding, generalisation, and transfer learning in deep neural networks
NVIDIA Announces The Jetson TX2, Powered By NVIDIA’s “Denver 2” CPU & Pascal Graphics
Can FPGAs Beat GPUs in Accelerating Next-Generation Deep Learning?
Flexible Image Tagging with Fast0Tag
Eye Fidelity: How Deep Learning Will Help Your Smartphone Track Your Gaze
Using Deep Learning to Find Similar Dresses
Rules of Machine Learning: Best Practices for ML Engineering
Neural Network Architectures
A Brief History of CNNs in Image Segmentation: From R-CNN to Mask R-CNN
Xiaolei’s Machine Learning Notes
Image Classification with 5 methods
How do Convolutional Neural Networks work?
What is the principle behind face recognition with deep convolutional neural networks?
10 Deep Learning projects based on Apache MXNet
Off the Convex Path
A Brief History of Image Style Transfer (Neural Style)
ML notes: Why the log-likelihood?
Generative Adversarial Networks (GANs): Engine and Applications
Compressing deep neural nets
Sigmoidal
A Gentle Introduction to the Bag-of-Words Model
Fantastic GANs and where to find them
Fantastic GANs and where to find them II

BENCHMARK/LEADERBOARD/DATASET

STAIR Actions: A Video Dataset of Everyday Home Actions
STAIR Actions is a video dataset consisting of 100 everyday human action categories.
Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking
EPIC-Kitchens
The largest dataset in first-person (egocentric) vision; multi-faceted non-scripted recordings in native environments - i.e. the wearers’ homes, capturing all daily activities in the kitchen over multiple days. Annotations are collected using a novel ‘live’ audio commentary approach.
Large-Scale Landmark Recognition: A Challenge
Low-Power Image Recognition Challenge
Open Images Dataset
Open Images is a dataset of ~9 million URLs to images that have been annotated with image-level labels and bounding boxes spanning thousands of classes.
Visual Tracker Benchmark
This website contains data and code of the benchmark evaluation of online visual tracking algorithms. Join visual-tracking Google groups for further updates, discussions, or QnAs.
Multiple Object Tracking Benchmark
With this benchmark we would like to pave the way for a unified framework towards more meaningful quantification of multi-target tracking.
Leaderboards for the Evaluations on PASCAL VOC Data
Open Images dataset
Open Images is a dataset of ~9 million URLs to images that have been annotated with labels spanning over 6000 categories.
Open Sourcing 223GB of Driving Data
223GB of image frames and log data from 70 minutes of driving in Mountain View on two separate days, with one day being sunny, and the other overcast.
MS COCO
UMDFaces Dataset
UMDFaces is a face dataset which has 367,920 faces of 8,501 subjects. From this page you can download the entire dataset and the trained model for predicting the localization of the 21 keypoints.
VideoNet
VideoNet is a new initiative to bring together the community of researchers that have put effort into creating benchmarks for video tasks.
YouTube-BoundingBoxes: A Large High-Precision Human-Annotated Data Set for Object Detection in Video
KITTI Vision Benchmark Suite
Duke: A New Large-scale Person Re-identification Dataset derived from DukeMTMC
Duke is a subset of the DukeMTMC for image-based re-ID, in the format of the Market-1501 dataset. The original dataset contains 85-minute high-resolution videos from 8 different cameras. Hand-drawn pedestrian bounding boxes are available.
Releasing the World’s Largest Street-level Imagery Dataset for Teaching Machines to See

Today we present the Mapillary Vistas Dataset—the world’s largest and most diverse publicly available, pixel-accurately and instance-specifically annotated street-level imagery dataset for empowering autonomous mobility and transport at the global scale.
WEBVISION DATASET

The WebVision dataset is designed to facilitate research on learning visual representations from noisy web data. Our goal is to free deep learning techniques from the huge human labor of annotating large-scale vision datasets. We release this large-scale web image dataset as a benchmark to advance the research on learning from web data, including weakly supervised visual representation learning, visual transfer learning, text and vision, etc.
DukeMTMC4ReID

The DukeMTMC4ReID dataset is a new large-scale, real-world person re-id dataset based on DukeMTMC.
Person Re-identification Datasets

Person re-identification has drawn intensive attention in the computer vision community in recent decades. As far as we know, this page collects all public datasets that have been tested by person re-identification algorithms.
MIT Saliency Benchmark
How far are we from solving the 2D & 3D Face Alignment problem? (and a dataset of 230,000 3D facial landmarks)
PoseTrack: A Benchmark for Human Pose Estimation and Tracking

TOOLKITS

Netron is a viewer for neural network, deep learning and machine learning models.
Bring Deep Learning to small devices: an open source deep learning platform for low bit computation.
Albumentations: a fast image augmentation library and easy-to-use wrapper around other libraries.
FeatherCNN
FeatherCNN is a high performance inference engine for convolutional neural networks.
Caffe
Caffe is a deep learning framework made with expression, speed, and modularity in mind. It is developed by the Berkeley Vision and Learning Center (BVLC) and by community contributors. Yangqing Jia created the project during his PhD at UC Berkeley. Caffe is released under the BSD 2-Clause license.
Caffe2
Caffe2 is a deep learning framework made with expression, speed, and modularity in mind. It is an experimental refactoring of Caffe, and allows a more flexible way to organize computation.
Caffe on Intel
This fork of BVLC/Caffe is dedicated to improving performance of this deep learning framework when running on CPU, in particular Intel® Xeon processors (HSW+) and Intel® Xeon Phi processors
TensorFlow
TensorFlow is an open source software library for numerical computation using data flow graphs. Nodes in the graph represent mathematical operations, while the graph edges represent the multidimensional data arrays (tensors) that flow between them. This flexible architecture lets you deploy computation to one or more CPUs or GPUs in a desktop, server, or mobile device without rewriting code. TensorFlow also includes TensorBoard, a data visualization toolkit.
MXNet
MXNet is a deep learning framework designed for both efficiency and flexibility. It allows you to mix the flavours of symbolic programming and imperative programming to maximize efficiency and productivity. At its core is a dynamic dependency scheduler that automatically parallelizes both symbolic and imperative operations on the fly. A graph optimization layer on top of that makes symbolic execution fast and memory efficient. The library is portable and lightweight, and it scales to multiple GPUs and multiple machines.
neon
neon is Nervana’s Python based Deep Learning framework and achieves the fastest performance on modern deep neural networks such as AlexNet, VGG and GoogLeNet. Designed for ease-of-use and extensibility.
Piotr’s Computer Vision Matlab Toolbox
This toolbox is meant to facilitate the manipulation of images and video in Matlab. Its purpose is to complement, not replace, Matlab’s Image Processing Toolbox, and in fact it requires that the Matlab Image Toolbox be installed. Emphasis has been placed on code efficiency and code reuse. Thanks to everyone who has given me feedback - you’ve helped make this toolbox more useful and easier to use.
NVIDIA Developer
nvCaffe
A special branch of Caffe with FP16 support, used on the TX1.
dlib
Dlib is a modern C++ toolkit containing machine learning algorithms and tools for creating complex software in C++ to solve real world problems. It is used in both industry and academia in a wide range of domains including robotics, embedded devices, mobile phones, and large high performance computing environments. Dlib’s open source licensing allows you to use it in any application, free of charge.
OpenCV
OpenCV is released under a BSD license and hence it’s free for both academic and commercial use. It has C++, C, Python and Java interfaces and supports Windows, Linux, Mac OS, iOS and Android. OpenCV was designed for computational efficiency and with a strong focus on real-time applications.
CNNdroid
CNNdroid is an open source library for execution of trained convolutional neural networks on Android devices.
tiny dnn
tiny-dnn is a C++11 implementation of deep learning. It is suitable for deep learning on limited computational resource, embedded systems and IoT devices.

For an introduction to this toolkit, see “Deep learning with C++ - an introduction to tiny-dnn” by Taiga Nomi.
CaffeMex
A multi-GPU & memory-reduced MAT-Caffe on LINUX and WINDOWS
ARCore
ARCore is a platform for building augmented reality apps on Android. ARCore uses three key technologies to integrate virtual content with the real world as seen through your phone’s camera.
CNTK
Microsoft Cognitive Toolkit (CNTK) is an open source deep-learning toolkit.
ONNX
ONNX is an open format to represent deep learning models. With ONNX, AI developers can more easily move models between state-of-the-art tools and choose the combination that is best for them. ONNX is developed and supported by a community of partners.
PyToune
PyToune is a Keras-like framework for PyTorch that handles much of the boilerplate code needed to train neural networks.
Deep Learning Studio - Desktop
Deep Learning Studio - Desktop, from DeepCognition.ai, is a single-user solution that runs locally on your hardware. The desktop version allows you to train models on your GPU(s) without uploading data to the cloud. The platform supports transparent multi-GPU training for up to 4 GPUs. Additional GPUs are supported in Deep Learning Studio – Enterprise.

LEARNING/TRICKS/TIPS

Training Tips for the Transformer Model
Deep Learning Courses
Backpropagation Algorithm
A website that explains how the backpropagation algorithm works.
Deep Learning (textbook authored by Ian Goodfellow and Yoshua Bengio and Aaron Courville)
The Deep Learning textbook is a resource intended to help students and practitioners enter the field of machine learning in general and deep learning in particular.
Neural Networks and Deep Learning (online book authored by Michael Nielsen)
Neural Networks and Deep Learning is a free online book. The book will teach you about 1) Neural networks, a beautiful biologically-inspired programming paradigm which enables a computer to learn from observational data and 2) Deep learning, a powerful set of techniques for learning in neural networks. Neural networks and deep learning currently provide the best solutions to many problems in image recognition, speech recognition, and natural language processing. This book will teach you many of the core concepts behind neural networks and deep learning.
Computer Vision: Algorithms and Applications
This book is largely based on the computer vision courses that Richard Szeliski has co-taught at the University of Washington (2008, 2005, 2001) and Stanford (2003) with Steve Seitz and David Fleet.
Must Know Tips/Tricks in Deep Neural Networks
This post collects and summarizes implementation details (tricks and tips) for building and training your own deep convolutional networks.
The zen of gradient descent
Deriving the Gradient for the Backward Pass of Batch Normalization
Reinforcement Learning: An Introduction
An overview of gradient descent optimization algorithms
Regularizing neural networks by penalizing confident predictions
What you need to know about data augmentation for machine learning
Plentiful high-quality data is the key to great machine learning models. But good data doesn’t grow on trees, and that scarcity can impede the development of a model. One way to get around a lack of data is to augment your dataset. Smart approaches to programmatic data augmentation can increase the size of your training set 10-fold or more. Even better, your model will often be more robust (and prevent overfitting) and can even be simpler due to a better training set.
Guide to deploying deep-learning inference networks and realtime object recognition tutorial for NVIDIA Jetson TX1
The Effect of Resolution on Deep Neural Network Image Classification Accuracy
In this post, the author explores the impact of both spatial resolution and training dataset size on the classification performance of deep neural networks.
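Tips for tuning deep learning hyperparameters
How to tune CNN hyperparameters
Which are the better current (2014, 2015, 2016) algorithms for multi-object tracking in video?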
5 algorithms to train a neural network
Towards Good Practices for Recognition & Detection
Hikvision Research Institute: Lessons from the ImageNet 2016 Competition
What are the differences between Random Forest and Gradient Tree Boosting algorithms
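Why are today’s CNN models all adapted from GoogLeNet, VGGNet, or AlexNet?
Neural Networks and Deep Learning
ILSVRC2016 Object Detection Task Review (Part 1): Object Detection from Images (DET)
ILSVRC2016 Object Detection Task Review (Part 2): Object Detection from Video (VID)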
How to Train a GAN? Tips and tricks to make GANs work
The Amazing Wasserstein GAN
Mathematics for Computer Science
Generative Adversarial Networks (GANs): Research Progress and Outlook
A guide to receptive field arithmetic for Convolutional Neural Networks
Seeing the Big Picture in Small Details: Advances in Fine-Grained Image Analysis
Object Tracking Resources
Intro to Neural Networks and Machine Learning
Tips for Training Recurrent Neural Networks
A Gentle Introduction to Mini-Batch Gradient Descent and How to Configure Batch Size
Learning To See (an introduction to machine learning and computer vision; English subtitles)
37 Reasons why your Neural Network is not working
My Neural Network isn’t working! What should I do?
Tutorial on Deep Generative Models
Convolutional Neural Networks: From Basic Techniques to Research Prospects

SKILLS

ABOUT CAFFE

Set Up Caffe on Ubuntu14.04 64bit+NVIDIA GTX970M+CUDA7.0
Configuring the Caffe CNN tool in VS2013 (64-bit Windows 7): creating the project
Configuring the Caffe CNN tool in VS2013 (64-bit Windows 7): preparing the dependency libraries

SETTING UP

Installation of NVIDIA GPU Driver and CUDA Toolkit
Tensorflow v0.10 installed from scratch on Ubuntu 16.04, CUDA 8.0RC+Patch, cuDNN v5.1 with a 1080GTX
Lessons from building a compact deep learning rig, so you can avoid the pitfalls
