马卫飞

caffe网络模型中各层功能的详解

Layer（层）是Caffe中最庞大最繁杂的模块。由于Caffe强调模块化设计，因此只允许每个layer完成一类特定的计算，例如convolution操作、pooling、非线性变换、内积运算，以及数据加载、归一化和损失计算等。layer这个类可以说是里面最终的一个基本类了，深度网络呢就是一层一层的layer，相互之间通过blob传输数据连接起来。

我们先看一张图：

然后我们从头文件看看：

Caffe中与Layer相关的头文件有7个，

layer.hpp: 父类Layer，定义所有layer的基本接口。
data_layers.hpp: 继承自父类Layer，定义与输入数据操作相关的子Layer，例如DataLayer，HDF5DataLayer和ImageDataLayer等。
vision_layers.hpp: 继承自父类Layer，定义与特征表达相关的子Layer，例如ConvolutionLayer，PoolingLayer和LRNLayer等。
neuron_layers.hpp: 继承自父类Layer，定义与非线性变换相关的子Layer，例如ReLULayer，TanHLayer和SigmoidLayer等。
loss_layers.hpp: 继承自父类Layer，定义与输出误差计算相关的子Layer，例如EuclideanLossLayer，SoftmaxWithLossLayer和HingeLossLayer等。
common_layers.hpp: 继承自父类Layer，定义与中间结果数据变形、逐元素操作相关的子Layer，例如ConcatLayer，InnerProductLayer和SoftmaxLayer等。
layer_factory.hpp: Layer工厂模式类，负责维护现有可用layer和相应layer构造方法的映射表。

1.About

layer.hpp

和layer相关的头文件有：

common_layers.hpp
data_layers.hpp
layer.hpp
loss_layers.hpp
neuron_layers.hpp
vision_layers.hpp

其中``layer.hpp是抽象出来的基类，其他都是在其基础上的继承，也即剩下的五个头文件和上图中的五个部分。在layer.hpp`头文件里，包含了这几个头文件：

#include "caffe/blob.hpp"
#include "caffe/common.hpp"
#include "caffe/proto/caffe.pb.h"
#include "caffe/util/device_alternate.hpp"

在device_alternate.hpp中，通过#ifdef CPU_ONLY定义了一些宏来取消GPU的调用：

#define STUB_GPU(classname)
#define STUB_GPU_FORWARD(classname, funcname)
#define STUB_GPU_BACKWARD(classname, funcname)

layer中有这三个主要参数：

LayerParameter layer_param_;      // 这个是protobuf文件中存储的layer参数
vector>> blobs_;        // 这个存储的是layer的参数，在程序中用的
vector param_propagate_down_;        // 这个bool表示是否计算各个blob参数的diff，即传播误差

Layer类的构建函数explicit Layer(const LayerParameter& param) : layer_param_(param)会尝试从protobuf文件读取参数。其三个主要接口：

virtual void SetUp(const vector*>& bottom, vector*>* top)
inline Dtype Forward(const vector*>& bottom, vector*>* top);
inline void Backward(const vector*>& top, const vector& propagate_down, const *>* bottom);

SetUp函数需要根据实际的参数设置进行实现，对各种类型的参数初始化；Forward和Backward对应前向计算和反向更新，输入统一都是bottom，输出为top，其中Backward里面有个propagate_down参数，用来表示该Layer是否反向传播参数。

在Forward和Backward的具体实现里，会根据Caffe::mode()进行对应的操作，即使用cpu或者gpu进行计算，两个都实现了对应的接口Forward_cpu、Forward_gpu和Backward_cpu、Backward_gpu，这些接口都是virtual，具体还是要根据layer的类型进行对应的计算（注意：有些layer并没有GPU计算的实现，所以封装时加入了CPU的计算作为后备）。另外，还实现了ToProto的接口，将Layer的参数写入到protocol buffer文件中。

data_layers.hpp

data_layers.hpp这个头文件包含了这几个头文件：

#include "boost/scoped_ptr.hpp"
#include "hdf5.h"
#include "leveldb/db.h"
#include "lmdb.h"

#include "caffe/blob.hpp"
#include "caffe/common.hpp"
#include "caffe/filler.hpp"
#include "caffe/internal_thread.hpp"
#include "caffe/layer.hpp"
#include "caffe/proto/caffe.pb.h"

看到hdf5、leveldb、lmdb，确实是与具体数据相关了。data_layer作为原始数据的输入层，处于整个网络的最底层，它可以从数据库leveldb、lmdb中读取数据，也可以直接从内存中读取，还可以从hdf5，甚至是原始的图像读入数据。

关于这几个数据库，简介如下：

LevelDB是Google公司搞的一个高性能的key/value存储库，调用简单，数据是被Snappy压缩，据说效率很多，可以减少磁盘I/O，具体例子可以看看维基百科。

而LMDB（Lightning Memory-Mapped Database），是个和levelDB类似的key/value存储库，但效果似乎更好些，其首页上写道“ultra-fast，ultra-compact”，这个有待进一步学习啊～～

HDF（Hierarchical Data Format）是一种为存储和处理大容量科学数据而设计的文件格式及相应的库文件，当前最流行的版本是HDF5,其文件包含两种基本数据对象：

群组（group）：类似文件夹，可以包含多个数据集或下级群组；
数据集（dataset）：数据内容，可以是多维数组，也可以是更复杂的数据类型。

以上内容来自维基百科，关于使用可以参考[HDF5 小试——高大上的多对象文件格式](HDF5 小试——高大上的多对象文件格式)，后续会再详细的研究下怎么用。

caffe/filler.hpp的作用是在网络初始化时，根据layer的定义进行初始参数的填充，下面的代码很直观，根据FillerParameter指定的类型进行对应的参数填充。

// A function to get a specific filler from the specification given in
// FillerParameter. Ideally this would be replaced by a factory pattern,
// but we will leave it this way for now.
template 
Filler* GetFiller(const FillerParameter& param) {
  const std::string& type = param.type();
  if (type == "constant") {
    return new ConstantFiller(param);
  } else if (type == "gaussian") {
    return new GaussianFiller(param);
  } else if (type == "positive_unitball") {
    return new PositiveUnitballFiller(param);
  } else if (type == "uniform") {
    return new UniformFiller(param);
  } else if (type == "xavier") {
    return new XavierFiller(param);
  } else {
    CHECK(false) << "Unknown filler name: " << param.type();
  }
  return (Filler*)(NULL);
}

internal_thread.hpp里面封装了pthread函数，继承的子类可以得到一个单独的线程，主要作用是在计算当前的一批数据时，在后台获取新一批的数据。

关于data_layer，基本要注意的我都在图片上标注了。

neuron_layers.hpp

输入了data后，就要计算了，比如常见的sigmoid、tanh等等，这些都计算操作被抽象成了neuron_layers.hpp里面的类NeuronLayer，这个层只负责具体的计算，因此明确定义了输入ExactNumBottomBlobs()和ExactNumTopBlobs()都是常量1,即输入一个blob，输出一个blob。

common_layers.hpp

NeruonLayer仅仅负责简单的一对一计算，而剩下的那些复杂的计算则通通放在了common_layers.hpp中。像ArgMaxLayer、ConcatLayer、FlattenLayer、SoftmaxLayer、SplitLayer和SliceLayer等各种对blob增减修改的操作。

loss_layers.hpp

前面的data layer和common layer都是中间计算层，虽然会涉及到反向传播，但传播的源头来自于loss_layer，即网络的最终端。这一层因为要计算误差，所以输入都是2个blob，输出1个blob。

vision_layers.hpp

vision_layer主要是图像卷积的操作，像convolusion、pooling、LRN都在里面，按官方文档的说法，是可以输出图像的，这个要看具体实现代码了。里面有个im2col的实现，看caffe作者的解释，主要是为了加速卷积的。

layer_factory.hpp

layer_factory比较重要我就放在下一篇里面了。

2. Detail

在这一Section中，我们深入到上一小节所讲的集中layer的细节中去。对于一些常用的layer，如卷积层，池化层（Pooling），还给出对应的proto代码。

2.1. 数据层（data_layers）

数据通过数据层进入Caffe，数据层在整个网络的底部。数据可以来自高效的数据库（LevelDB 或者 LMDB），直接来自内存。如果不追求高效性，可以以HDF5或者一般图像的格式从硬盘读取数据。

一些基本的操作，如：mean subtraction, scaling, random cropping, and mirroring均可以直接在数据层上进行指定。

1 Database

类型：Data

必须参数：

source: 包含数据的目录名称
batch_size: 一次处理的输入的数量

可选参数：

rand_skip: 在开始的时候从输入中跳过这个数值，这在异步随机梯度下降（SGD）的时候非常有用
backend [default LEVELDB]: 选择使用 LEVELDB 或者 LMDB

2 In-Memory

类型: MemoryData

必需参数：

batch_size, channels, height, width: 指定从内存读取数据的大小

MemoryData层直接从内存中读取数据，而不是拷贝过来。因此，要使用它的话，你必须调用MemoryDataLayer::Reset (from C++)或者Net.set_input_arrays (from Python)以此指定一块连续的数据（通常是一个四维张量）。

3 HDF5 Input

类型: HDF5Data

必要参数：

source: 需要读取的文件名
batch_size：一次处理的输入的数量

4 HDF5 Output

类型: HDF5Output

必要参数：

file_name: 输出的文件名

HDF5的作用和这节中的其他的层不一样，它是把输入的blobs写到硬盘

5 Images

类型: ImageData

必要参数：

source: text文件的名字，每一行给出一张图片的文件名和label
batch_size: 一个batch中图片的数量

可选参数：

rand_skip：在开始的时候从输入中跳过这个数值，这在异步随机梯度下降（SGD）的时候非常有用
shuffle [default false]
new_height, new_width: 把所有的图像resize到这个大小

6 Windows

类型：WindowData

7 Dummy

类型：DummyData

Dummy 层用于development 和debugging。具体参数DummyDataParameter。

2.2. 激励层（neuron_layers）

一般来说，激励层是element-wise的操作，输入和输出的大小相同，一般情况下就是一个非线性函数。

输入：

n×c×h×w

输出：

n×c×h×w

1 ReLU / Rectified-Linear and Leaky-ReLU

类型: ReLU

例子:

layer {
  name: "relu1"
  type: "ReLU"
  bottom: "conv1"
  top: "conv1"
}

可选参数：

negative_slope [default 0]：指定输入值小于零时的输出。

ReLU是目前使用做多的激励函数，主要因为其收敛更快，并且能保持同样效果。标准的ReLU函数为max(x, 0)，而一般为当x > 0时输出x，但x <= 0时输出negative_slope。RELU层支持in-place计算，这意味着bottom的输出和输入相同以避免内存的消耗。

ReLU(x)=max{0,x}

2 Sigmoid

类型：Sigmoid

例子：

layer {
  name: "encode1neuron"
  bottom: "encode1"
  top: "encode1neuron"
  type: "Sigmoid"
}

Sigmoid层通过 sigmoid(x) 计算每一个输入x的输出，函数如下图。

σ(x)=11+exp-x

3 TanH / Hyperbolic Tangent

类型: TanH

例子:

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: "TanH"
}

TanH层通过 tanh(x) 计算每一个输入x的输出，函数如下图。请注意sigmoid函数和TanH函数在纵轴上的区别。sigmoid函数将实数映射到(0,1)。TanH将实数映射到(-1,1)。

tanh(x)=expx-exp-xexpx+exp-x

4 Absolute Value

类型: AbsVal

例子:

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: "AbsVal"
}

ABSVAL层通过 abs(x) 计算每一个输入x的输出。

5 Power

类型： Power

例子：

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: "Power"
  power_param {
    power: 1
    scale: 1
    shift: 0
  }
}

可选参数：

power [default 1]
scale [default 1]
shift [default 0]

POWER层通过 (shift + scale * x) ^ power计算每一个输入x的输出。

6 BNLL

类型: BNLL

例子：

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: BNLL
}

BNLL (binomial normal log likelihood) 层通过 log(1 + exp(x)) 计算每一个输入x的输出。

2.3. 视觉层（vision_layers）

1 卷积层(Convolution)

类型：Convolution

例子：

layers { 
    name: "conv1" 
    type: CONVOLUTION 
    bottom: "data" 
    top: "conv1" 
    blobs_lr: 1               # learning rate multiplier for the filters 
    blobs_lr: 2               # learning rate multiplier for the biases 
    weight_decay: 1           # weight decay multiplier for the filters 
    weight_decay: 0           # weight decay multiplier for the biases 
    convolution_param { 
        num_output: 96        # learn 96 filters 
        kernel_size: 11       # each filter is 11x11 
        stride: 4             # step 4 pixels between each filter application 
        weight_filler { 
            type: "gaussian"  # initialize the filters from a Gaussian 
            std: 0.01         # distribution with stdev 0.01 (default mean: 0) } 
            bias_filler { 
                type: "constant" # initialize the biases to zero (0) 
                value: 0 
            } 
        }
    }
}

blobs_lr: 学习率调整的参数，在上面的例子中设置权重学习率和运行中求解器给出的学习率一样，同时是偏置学习率为权重的两倍。
weight_decay：

卷积层的重要参数

必须参数：

num_output (c_o)：过滤器的个数
kernel_size (or kernel_h and kernel_w)：过滤器的大小（也就是所谓“核”的大小）。

建议参数：

weight_filler [default type: ‘constant’ value: 0]：参数的初始化方法

可选参数：

bias_filler：偏置的初始化方法
bias_term [default true]：指定是否是否开启偏置项
pad (or pad_h and pad_w) [default 0]：指定在输入的每一边加上多少个像素
stride (or stride_h and stride_w) [default 1]：指定过滤器的步长
group (g) [default 1]: 如果g>1，那么将每个滤波器都限定只与某个输入的子集有关联。换句话说，将输入分为g组，同时将输出也分为g组。那么第i组输出只与第i组输入有关。

通过卷积后的大小变化：

输入：

n×ci×hi×wi

输出：

n×co×ho×wo

其中：ho=(hi+2×padh−kernelh)/strideh+1。wo通过同样的方法计算。

2 池化层（Pooling）

类型：Pooling

例子：

layers { 
    name: "pool1" 
    type: POOLING 
    bottom: "conv1" 
    top: "pool1" 
    pooling_param { 
        pool: MAX 
        kernel_size: 3 # pool over a 3x3 region 
        stride: 2 # step two pixels (in the bottom blob) between pooling regions 
    }
}

卷积层的重要参数

必需参数：

kernel_size (or kernel_h and kernel_w)：过滤器的大小

可选参数：

pool [default MAX]：pooling的方法，目前有MAX, AVE, 和STOCHASTIC三种方法
pad (or pad_h and pad_w) [default 0]：指定在输入的每一遍加上多少个像素
stride (or stride_h and stride_w) [default 1]：指定过滤器的步长

通过池化后的大小变化：

输入：

n×ci×hi×wi

输出：

n×co×ho×wo

其中：ho=(hi+2×padh−kernelh)/strideh+1。wo通过同样的方法计算。

3 Local Response Normalization (LRN)

类型：LRN

可选参数：

local_size [default 5]：对于cross channel LRN为需要求和的邻近channel的数量；对于within channel LRN为需要求和的空间区域的边长；
alpha [default 1]：scaling参数；
beta [default 5]：指数；
norm_region [default ACROSS_CHANNELS]: 选择LRN实现的方法：1. ACROSS_CHANNELS ；2. WITHIN_CHANNEL

LRN（Local Response Normalization）是对一个局部的输入区域进行的归一化。有两种不同的形式：1. ACCROSS_CHANNEL；2. WITHIN_CHANNEL。其实很好从字面上进行理解。第一种方法综合了不同的channel，而在一个channel里面只取1*1（所以size是localsize×1×1）。而在第二种方法中，不在channel方向上扩展，只在单一channel上进行空间扩展（所以size是1×localsize×localsize）。

计算公式：对每一个输入除以(1+(α/n)⋅∑ix2i)β

在这里，参数α是scaling参数，参数β是指数。而参数n对应local region的大小。

2.4. 损失层（Loss Layers）

深度学习是通过最小化输出和目标的Loss来驱动学习。

1 Softmax

类型: SoftmaxWithLoss

关于Softmax的内容，可以参考我之前的博客：【机器学习】Softmax Regression简介。Softmax Loss层应用于多标签分类。对于输入，计算了multinomial logistic loss。在概念上近似等于一个Softmax层加上一个multinomial logistic loss层。但在梯度的计算上更加稳定。

2 Sum-of-Squares / Euclidean

类型: EuclideanLoss

Euclidean loss层计算了两个输入差的平方和：

12N\sumi=1N||x1i-x2i||2x

3 Hinge / Margin

类型: HingeLoss

例子：

L1 Normlayers { 
    name: "loss" 
    type: HINGE_LOSS 
    bottom: "pred" 
    bottom: "label"
} 
L2 Normlayers { 
    name: "loss" 
    type: HINGE_LOSS 
    bottom: "pred" 
    bottom: "label" 
    top: "loss" 
    hinge_loss_param { 
        norm: L2 
    }
}

可选参数：

norm [default L1]: 选择L1或者L2范数

输入：

n×c×h×w Predictions
n×1×1×1 Labels

输出

1×1×1×1 Computed Loss

4 Sigmoid Cross-Entropy

类型：SigmoidCrossEntropyLoss

5 Infogain

类型：InfoGainLoss

6 Accuracy and Top-k

类型：Accuracy

用来计算输出和目标的正确率，事实上这不是一个loss，而且没有backward这一步。

2.5. 一般层（Common Layers）

1 全连接层 Inner Product

类型：InnerProduct

例子：

layer {
  name: "fc8"
  type: "InnerProduct"
  # learning rate and decay multipliers for the weights
  param { lr_mult: 1 decay_mult: 1 }
  # learning rate and decay multipliers for the biases
  param { lr_mult: 2 decay_mult: 0 }
  inner_product_param {
    num_output: 1000
    weight_filler {
      type: "gaussian"
      std: 0.01
    }
    bias_filler {
      type: "constant"
      value: 0
    }
  }
  bottom: "fc7"
  top: "fc8"
}

必要参数：

num_output (c_o)：过滤器的个数

可选参数：

weight_filler [default type: ‘constant’ value: 0]：参数的初始化方法
bias_filler：偏置的初始化方法
bias_term [default true]：指定是否是否开启偏置项

通过全连接层后的大小变化：

输入：n×ci×hi×wi
输出：n×co×1×1

2 Splitting

类型：Split

Splitting层可以把一个输入blob分离成多个输出blobs。这个用在当需要把一个blob输入到多个输出层的时候。

3 Flattening

类型：Flatten

Flatten层是把一个输入的大小为n * c * h * w变成一个简单的向量，其大小为 n * (c*h*w) * 1 * 1。

4 Reshape

类型：Reshape

例子：

  layer {
    name: "reshape"
    type: "Reshape"
    bottom: "input"
    top: "output"
    reshape_param {
      shape {
        dim: 0  # copy the dimension from below
        dim: 2
        dim: 3
        dim: -1 # infer it from the other dimensions
      }
    }
  }

输入：单独的一个blob，可以是任意维；

输出：同样的blob，但是它的维度已经被我们人为地改变，维度的数据由reshap_param定义。

可选参数：

shape

Reshape层被用于改变输入的维度，而不改变输入的具体数据。就像Flatten层一样。只是维度被改变而已，这个过程不涉及数据的拷贝。

输出的维度由ReshapeParam proto控制。可以直接使用数字进行指定。设定输入的某一维到输出blob中去。此外，还有两个数字值得说一下：

0 直接从底层复制。例如，如果是底层是一个2在它的第一维，那么顶层在它的第一维也有一个2。
-1 从其他的数据里面推测这一维应该是多少。

5 Concatenation

类型：Concat

例子：

layer {
  name: "concat"
  bottom: "in1"
  bottom: "in2"
  top: "out"
  type: "Concat"
  concat_param {
    axis: 1
  }
}

可选参数：

axis [default 1]：0代表链接num，1代表链接channels

通过全连接层后的大小变化：

输入：从1到K的每一个blob的大小：ni×ci×h×w

输出：

如果axis = 0: (n1+n2+...+nK)×c1×h×w，需要保证所有输入的ci相同。
如果axis = 1: n1×(c1+c2+...+cK)×h×w，需要保证所有输入的n_i 相同。

通过Concatenation层，可以把多个的blobs链接成一个blob。

6 Slicing

类型：Slice

例子：

layer {
  name: "slicer_label"
  type: "Slice"
  bottom: "label"
  ## Example of label with a shape N x 3 x 1 x 1
  top: "label1"
  top: "label2"
  top: "label3"
  slice_param {
    axis: 1
    slice_point: 1
    slice_point: 2
  }
}

Slice层可以将输入层变成多个输出层。这些输出层沿一个给定的维度存在。axis指定了目标的轴，slice_point则指定了选择维度的序号。

7 Elementwise Operations

类型：Eltwise

8 Argmax

类型：ArgMax

9 Softmax

类型：Softmax

10 Mean-Variance Normalization

类型：MVN

==================================================================================================================================

每个layer的输入数据来自一些'bottom' blobs, 输出一些'top' blobs。Caffe中每种类型layer的参数说明定义在caffe.proto文件中，具体的layer参数值则定义在具体应用的protocals buffer网络结构说明文件中。例如，卷积层（ConvolutionLayer）的参数说明在caffe.proto中是如下定义的，

 
    // in caffe.proto  
 // Message that stores parameters used by ConvolutionLayer  
 message ConvolutionParameter {  
   optional uint32 num_output = 1; // The number of outputs for the layer  
   optional bool bias_term = 2 [default = true]; // whether to have bias terms  
   // Pad, kernel size, and stride are all given as a single value for equal  
   // dimensions in height and width or as Y, X pairs.  
   optional uint32 pad = 3 [default = 0]; // The padding size (equal in Y, X)  
   optional uint32 pad_h = 9 [default = 0]; // The padding height  
   optional uint32 pad_w = 10 [default = 0]; // The padding width  
   optional uint32 kernel_size = 4; // The kernel size (square)  
   optional uint32 kernel_h = 11; // The kernel height  
   optional uint32 kernel_w = 12; // The kernel width  
   optional uint32 group = 5 [default = 1]; // The group size for group conv  
   optional uint32 stride = 6 [default = 1]; // The stride (equal in Y, X)  
   optional uint32 stride_h = 13; // The stride height  
   optional uint32 stride_w = 14; // The stride width  
   optional FillerParameter weight_filler = 7; // The filler for the weight  
   optional FillerParameter bias_filler = 8; // The filler for the bias  
   enum Engine {  
     DEFAULT = 0;  
     CAFFE = 1;  
     CUDNN = 2;  
   }  
   optional Engine engine = 15 [default = DEFAULT];  
 }  
 
  

其中的参数说明包括卷积核的个数、大小和步长等。在examples\mnist\lenet_train_test.prototxt网络结构说明文件中，具体一个卷积层（ConvolutionLayer）是这样定义的，

 
    # in examples\mnist\lenet_train_test.prototxt  
 layer {  
   name: "conv1" // 层的名字  
   type: "Convolution" // 层的类型，说明具体执行哪一种计算  
   bottom: "data" // 层的输入数据Blob的名字  
   top: "conv1" // 层的输出数据Blob的名字  
   param { // 层的权值和偏置相关参数  
     lr_mult: 1  
   }  
   param {  
     lr_mult: 2  
   }  
   convolution_param { // 卷积层卷积运算相关的参数  
     num_output: 20  
     kernel_size: 5  
     stride: 1  
     weight_filler {  
       type: "xavier"  
     }  
     bias_filler {  
       type: "constant"  
     }  
   }  
 }  
 
  

层的输入输出结构，图示是这样的，

每种类型的layer需要定义三种关键操作LayerSetUp, Forward, Backward：

LayerSetUp: 网络构建时初始化层和层的连接
Forward: 网络数据前向传递，给定bottom输入数据，计算输出到top
Backward：网络误差反向传递，给定top的梯度，计算bottom的梯度并存储到bottom blob

Layer的设计主要就是SetUp、Forward、Backward函数（层一开始的时候的设置、然后就是前传和反传）

这其中的SetUp的实现又依赖于CheckBlobCounts、LayerSetUp、Reshape等的实现。这其中Reshape又是必须要实现的，因为它是纯虚函数

这其中的Forward中又依赖于Forward_cpu、Forward_gpu，这其中Forward_cpu又是必须要实现的。

这其中的Backward中又依赖于Backward_cpu、Backward_gpu，这其中Backward_cpu 又是必须要实现的。

=================================================================================================================================

首先layer必须要实现一个forward function，前递函数当然功能可以自己定义啦，在forward中呢他会从input也就是Layer的bottom，对了caffe里面网络的前一层是叫bottom的，从bottom中获取blob，并且计算输出的Blob，当然他们也会实现一个反向传播，根据他们的input的blob以及output blob的error gradient 梯度误差计算得到该层的梯度误差。从公式中也可以看到：

δl=((wl+1)Tδl+1)σ′(zl)

想学好caffe建议看源码，layer.hpp:

 
    #ifndef CAFFE_LAYER_H_  
 #define CAFFE_LAYER_H_  
   
 #include   
 #include   
 #include   
   
 #include "caffe/blob.hpp"  
 #include "caffe/common.hpp"  
 #include "caffe/layer_factory.hpp"  
 #include "caffe/proto/caffe.pb.h"  
 #include "caffe/util/device_alternate.hpp"  
   
 namespace caffe {  
   
 /** 
  * @brief An interface for the units of computation which can be composed into a 
  *        Net. 
  * 
  * Layer%s must implement a Forward function, in which they take their input 
  * (bottom) Blob%s (if any) and compute their output Blob%s (if any). 
  * They may also implement a Backward function, in which they compute the error 
  * gradients with respect to their input Blob%s, given the error gradients with 
  * their output Blob%s. 
  */  
 template <typename Dtype>  
 class Layer {  
  public:  
 /* 
 首先获得当前网络的Phase，是train还是test，在初始化列表初始化LayerParameter,之后blobs_这里存放的是一个指向blob类的shared_ptr指针的一个vector，在这里是申请空间，然后将传入的layer_param中的blob拷贝过来。 
 */  
 // 显示的构造函数不需要重写，任何初始工作在SetUp()中完成  
 // 构造方法只复制层参数说明的值，如果层说明参数中提供了权值和偏置参数，也复制  
   explicit Layer(const LayerParameter& param)  
     : layer_param_(param) {  
       // Set phase and copy blobs (if there are any).  
 // 训练还是测试？phase    
       phase_ = param.phase();  
       if (layer_param_.blobs_size() > 0) {  
 // 将blobs_的大小设置为参数中的大小    
         blobs_.resize(layer_param_.blobs_size());  
         for (int i = 0; i < layer_param_.blobs_size(); ++i) {  
 // 新建若干个Blob   
           blobs_[i].reset(new Blob());  
 // 从blob文件中获取数据  
           blobs_[i]->FromProto(layer_param_.blobs(i));  
         }  
       }//用protobuf 传入的参数对blobs_ 做初始化，blobs_ 是一个vector 存放指向Blob类的智能指针。  
   
       #ifdef USE_MPI  
       //If this is a gather layer, all it subsequent layer doesn't need gradient sync.  
       //We will only change itself's property here,  
       //subsequent layers will be inferred in the Net  
     if (is_gathering()){  
         set_need_sync(false);  
       }else{  
         set_need_sync(true);  
       }  
       #endif  
     }  
   virtual ~Layer() {}  
 初始化函数SetUp，每个Layer对象都必须遵循固定的调用模式,  
   /** 
    * @brief Implements common layer setup functionality. 
    * @brief 实现每个layer对象的setup函数 
    * @param bottom the preshaped input blobs 
    * @param bottom 层的输入数据，blob中的存储空间已申请 
    * @param top 
    *     the allocated but unshaped output blobs, to be shaped by Reshape 
    * @param top 层的输出数据，blob对象以构造但是其中的存储空间未申请， 
    *     具体空间大小需根据bottom blob大小和layer_param_共同决定，具体在Reshape函数现实 
    * 
    * Checks that the number of bottom and top blobs is correct. 
    * Calls LayerSetUp to do special layer setup for individual layer types, 
    * followed by Reshape to set up sizes of top blobs and internal buffers. 
    * Sets up the loss weight multiplier blobs for any non-zero loss weights. 
    * This method may not be overridden. 
    * 1. 检查输入输出blob个数是否满足要求，每个层能处理的输入输出数据不一样 
    * 2. 调用LayerSetUp函数初始化特殊的层，每个Layer子类需重写这个函数完成定制的初始化 
    * 3. 调用Reshape函数为top blob分配合适大小的存储空间 
    * 4. 为每个top blob设置损失权重乘子，非LossLayer为的top blob其值为零 
    * 
    * 此方法非虚函数，不用重写，模式固定 
    */  
   void SetUp(const vector*>& bottom,  
       const vector*>& top) {  
     CheckBlobCounts(bottom, top);  
     LayerSetUp(bottom, top);  
     Reshape(bottom, top);  
     SetLossWeights(top);  
   }  
 /每个子类Layer必须重写的初始化函数LayerSetUp，  
   /** 
    * @brief Does layer-specific setup: your layer should implement this function 
    *        as well as Reshape. 
    * @brief 定制初始化，每个子类layer必须实现此虚函数 
    * 
    * @param bottom 
    *     the preshaped input blobs, whose data fields store the input data for 
    *     this layer 
    * @param bottom 
    *     输入blob, 数据成员data_和diff_存储了相关数据 
    * @param top 
    *     the allocated but unshaped output blobs 
    * @param top 
    *     输出blob, blob对象已构造但数据成员的空间尚未申请 
    * 
    * This method should do one-time layer specific setup. This includes reading 
    * and processing relevent parameters from the layer_param_. 
    * Setting up the shapes of top blobs and internal buffers should be done in 
    * Reshape, which will be called before the forward pass to 
    * adjust the top blob sizes. 
    * 此方法执行一次定制化的层初始化，包括从layer_param_读入并处理相关的层权值和偏置参数， 
    * 调用Reshape函数申请top blob的存储空间 
    */  
   virtual void LayerSetUp(const vector*>& bottom,  
       const vector*>& top) {}  
 /每个子类Layer必须重写的Reshape函数，完成top blob形状的设置并为其分配存储空间，  
    /** 
    * @brief Adjust the shapes of top blobs and internal buffers to accomodate 
    *        the shapes of the bottom blobs. 
    * @brief 根据bottom blob的形状和layer_param_计算top blob的形状并为其分配存储空间 
    * 
    * @param bottom the input blobs, with the requested input shapes 
    * @param top the top blobs, which should be reshaped as needed 
    * 
    * This method should reshape top blobs as needed according to the shapes 
    * of the bottom (input) blobs, as well as reshaping any internal buffers 
    * and making any other necessary adjustments so that the layer can 
    * accomodate the bottom blobs. 
    */  
   virtual void Reshape(const vector*>& bottom,  
       const vector*>& top) = 0;  
   
   /** 
    * @brief Given the bottom blobs, compute the top blobs and the loss. 
    * 
    * @param bottom 
    *     the input blobs, whose data fields store the input data for this layer 
    * @param top 
    *     the preshaped output blobs, whose data fields will store this layers' 
    *     outputs 
    * \return The total loss from the layer. 
    * 
    * The Forward wrapper calls the relevant device wrapper function 
    * (Forward_cpu or Forward_gpu) to compute the top blob values given the 
    * bottom blobs.  If the layer has any non-zero loss_weights, the wrapper 
    * then computes and returns the loss. 
    * 
    * Your layer should implement Forward_cpu and (optionally) Forward_gpu. 
    */  
 //前向传播函数Forward和反向传播函数Backward  
 /* 
 首先是Forward.这其实是一个装饰器，继承之后在调用的调用其相应的forward_cpu或者forward_gpu，根据输入的input data blob计算相应的output data blob，同时会反应这一层layer的total loss. 
 */  
   inline Dtype Forward(const vector*>& bottom,  
       const vector*>& top);  
   
   /** 
    * @brief Given the top blob error gradients, compute the bottom blob error 
    *        gradients. 
    * 
    * @param top 
    *     the output blobs, whose diff fields store the gradient of the error 
    *     with respect to themselves 
    * @param propagate_down 
    *     a vector with equal length to bottom, with each index indicating 
    *     whether to propagate the error gradients down to the bottom blob at 
    *     the corresponding index 
    * @param bottom 
    *     the input blobs, whose diff fields will store the gradient of the error 
    *     with respect to themselves after Backward is run 
    * 
    * The Backward wrapper calls the relevant device wrapper function 
    * (Backward_cpu or Backward_gpu) to compute the bottom blob diffs given the 
    * top blob diffs. 
    * 
    * Your layer should implement Forward_cpu and (optionally) Forward_gpu. 
    */  
 /* 
 BackWard，实现的是反向传播，也就是给定top blob额error gradient 计算得到bottom的error gradient。其输入时 output blobs ，在Ouput blobs里面的diff存储的就是其相应的error gradients。其中propagate_down这个参数跟Bottom的长度是一样的，每一个Index用来指定是否需要反向传播error gradients 到对应的bottom blob。而bottom 这里面的diff 区域存放的就是BackWard计算出来相应的gradient error. 
 */  
   inline void Backward(const vector*>& top,  
       const vector<bool>& propagate_down,  
       const vector*>& bottom);  
   
   /** 
    * @brief Returns the vector of learnable parameter blobs. 
    */  
   vector > >& blobs() {  
     return blobs_;//返回vector  blobs_  
   }  
   
   /** 
    * @brief Returns the layer parameter. 
    */  
 //返回layer parameter  
   const LayerParameter& layer_param() const { return layer_param_; }  
   
   /** 
    * @brief Writes the layer parameter to a protocol buffer 
    */  
 //将layer plarameter 写入protobuf  
   virtual void ToProto(LayerParameter* param, bool write_diff = false);  
   
 //返回 ,设置一个blob top 在给定 index 的 loss  
   /** 
    * @brief Returns the scalar loss associated with a top blob at a given index. 
    */  
   inline Dtype loss(const int top_index) const {  
     return (loss_.size() > top_index) ? loss_[top_index] : Dtype(0);  
   }  
   
   /** 
    * @brief Sets the loss associated with a top blob at a given index. 
    */  
   inline void set_loss(const int top_index, const Dtype value) {  
     if (loss_.size() <= top_index) {  
       loss_.resize(top_index + 1, Dtype(0));  
     }  
     loss_[top_index] = value;  
   }  
 //一些返回特定参数的函数：  
   /** 
    * 获得bottom或者top blob的数量状态，比较简单，看名字即可 
    */  
     // 虚函数，而且还是内联的，返回层类型    
   virtual inline const char* type() const { return ""; }    
     
    // 虚函数，获得bottom blob的精确个数    
   virtual inline int ExactNumBottomBlobs() const { return -1; }    
     
    // 虚函数，获得bottom blob的最小个数    
   virtual inline int MinBottomBlobs() const { return -1; }    
     
    // 虚函数，获得bottom blob的最大个数    
   virtual inline int MaxBottomBlobs() const { return -1; }    
     
    // 虚函数，获得top blob的精确个数    
   virtual inline int ExactNumTopBlobs() const { return -1; }    
     
    // 虚函数，获得top blob的最小个数    
   virtual inline int MinTopBlobs() const { return -1; }    
     
    // 虚函数，获得top blob的最大个数    
   virtual inline int MaxTopBlobs() const { return -1; }    
     
    // 虚函数，bottom blob和top blob的个数是否一致    
   virtual inline bool EqualNumBottomTopBlobs() const { return false; }    
     
    // 返回当前层是否自动创建匿名top blobs    
    // 如果返回true，表明网络初始化的时候创建了了足够多的匿名top blobs    
    // 来满足ExactNumTopBlobs或者MinTopBlobs所要求的top blobs的个数    
   virtual inline bool AutoTopBlobs() const { return false; }    
 /* 
 AllowforceBackward用来设置是否强制梯度返回，因为有些层其实不需要梯度信息 ，后面两个函数分别查看以及设置是是否需要计算梯度。 
 */    
   
    // 对于一个给定的bottom blob，返回是否允许强制反传    
   virtual inline bool AllowForceBackward(const int bottom_index) const {    
     return true;    
   }    
   
 //set_param_propagate_down，param_propagate_down 函数：设置对于那些bottom 需要反向传播。  
   /** 
    * @brief Specifies whether the layer should compute gradients w.r.t. a 
    *        parameter at a particular index given by param_id. 
    * 
    * You can safely ignore false values and always compute gradients 
    * for all parameters, but possibly with wasteful computation. 
    */  
   inline bool param_propagate_down(const int param_id) {  
     return (param_propagate_down_.size() > param_id) ?  
         param_propagate_down_[param_id] : false;  
   }  
   /** 
    * @brief Sets whether the layer should compute gradients w.r.t. a 
    *        parameter at a particular index given by param_id. 
    */  
   inline void set_param_propagate_down(const int param_id, const bool value) {  
     if (param_propagate_down_.size() <= param_id) {  
       param_propagate_down_.resize(param_id + 1, true);  
     }  
     param_propagate_down_[param_id] = value;  
   }  
   
   #ifdef USE_MPI  
   /** 
    * @brief Checks whether the layer accepts specifed parallel type 
    * 
    * If not supported, will halt the program with hints 
    */  
   inline virtual bool is_gathering() {return false;}  
   inline virtual bool is_scattering() {return false;}  
   inline bool need_sync(){return need_sync_;}  
   inline void set_need_sync(bool val){need_sync_ = val;}  
   #endif  
   
   
 protected:  
   /** The protobuf that stores the layer parameters */  
   // 层说明参数，从protocal buffers格式的网络结构说明文件中读取  
   LayerParameter layer_param_;  
   /** The phase: TRAIN or TEST */  
   // 层状态，参与网络的训练还是测试  
   Phase phase_;  
   /** The vector that stores the learnable parameters as a set of blobs. */  
   // 层权值和偏置参数，使用向量是因为权值参数和偏置是分开保存在两个blob中的  
   vector > > blobs_;  
   /** Vector indicating whether to compute the diff of each param blob. */  
   // 标志每个top blob是否需要计算反向传递的梯度值  
   vector<bool> param_propagate_down_;  
   
   /** The vector that indicates whether each top blob has a non-zero weight in 
    *  the objective function. */  
   // 非LossLayer为零，LossLayer中表示每个top blob计算的loss的权重  
   vector loss_;  
   
   #ifdef USE_MPI  
   /** 
    * For parallel use 
    */  
   bool need_sync_;  
   #endif  
 /这两个函数非虚函数，它们内部会调用如下虚函数完成数据前向传递和  
 /误差反向传播，根据执行环境的不同每个子类Layer必须重写CPU和GPU版本，  
   /** @brief Using the CPU device, compute the layer output. */  
   virtual void Forward_cpu(const vector*>& bottom,  
       const vector*>& top) = 0;  
   /** 
    * @brief Using the GPU device, compute the layer output. 
    *        Fall back to Forward_cpu() if unavailable. 
    */  
   virtual void Forward_gpu(const vector*>& bottom,  
       const vector*>& top) {  
     // LOG(WARNING) << "Using CPU code as backup.";  
     return Forward_cpu(bottom, top);  
   }  
   
   /** 
    * @brief Using the CPU device, compute the gradients for any parameters and 
    *        for the bottom blobs if propagate_down is true. 
    */  
   virtual void Backward_cpu(const vector*>& top,  
       const vector<bool>& propagate_down,  
       const vector*>& bottom) = 0;  
   /** 
    * @brief Using the GPU device, compute the gradients for any parameters and 
    *        for the bottom blobs if propagate_down is true. 
    *        Fall back to Backward_cpu() if unavailable. 
    */  
   virtual void Backward_gpu(const vector*>& top,  
       const vector<bool>& propagate_down,  
       const vector*>& bottom) {  
     // LOG(WARNING) << "Using CPU code as backup.";  
     Backward_cpu(top, propagate_down, bottom);  
   }  
   
   /** 
    * Called by the parent Layer's SetUp to check that the number of bottom 
    * and top Blobs provided as input match the expected numbers specified by 
    * the {ExactNum,Min,Max}{Bottom,Top}Blobs() functions. 
    */  
   virtual void CheckBlobCounts(const vector*>& bottom,  
                                const vector*>& top) {  
     if (ExactNumBottomBlobs() >= 0) {  
       CHECK_EQ(ExactNumBottomBlobs(), bottom.size())  
           << type() << " Layer takes " << ExactNumBottomBlobs()  
           << " bottom blob(s) as input.";  
     }// 保证输入bottom 数量和要求的相同  
     if (MinBottomBlobs() >= 0) {  
       CHECK_LE(MinBottomBlobs(), bottom.size())  
           << type() << " Layer takes at least " << MinBottomBlobs()  
           << " bottom blob(s) as input.";  
     }//保证输入的bottom数量大于或等于要求的最小数量  
     if (MaxBottomBlobs() >= 0) {  
       CHECK_GE(MaxBottomBlobs(), bottom.size())  
           << type() << " Layer takes at most " << MaxBottomBlobs()  
           << " bottom blob(s) as input.";  
     }//保证输入的bottom数量小于或等于要求的最大数量  
     if (ExactNumTopBlobs() >= 0) {  
       CHECK_EQ(ExactNumTopBlobs(), top.size())  
           << type() << " Layer produces " << ExactNumTopBlobs()  
           << " top blob(s) as output.";  
     }// 保证输入top数量和要求的相同  
     if (MinTopBlobs() >= 0) {  
       CHECK_LE(MinTopBlobs(), top.size())  
           << type() << " Layer produces at least " << MinTopBlobs()  
           << " top blob(s) as output.";  
     }//保证输入的top数量大于或等于要求的最小数量  
     if (MaxTopBlobs() >= 0) {  
       CHECK_GE(MaxTopBlobs(), top.size())  
           << type() << " Layer produces at most " << MaxTopBlobs()  
           << " top blob(s) as output.";  
     }//保证输入的top数量小于或等于要求的最大数量  
     if (EqualNumBottomTopBlobs()) {  
       CHECK_EQ(bottom.size(), top.size())  
           << type() << " Layer produces one top blob as output for each "  
           << "bottom blob input.";  
     }//保证输入的bottom数量和输出的top数量相同  
   }  
   
   /** 
    * Called by SetUp to initialize the weights associated with any top blobs in 
    * the loss function. Store non-zero loss weights in the diff blob. 
    */  
 /* 
 SetLoss是非常重要的一个步骤，是被SetUp调用来初始化top bottom的weights，并且存储非零的loss weights 在diff blob里面 
 */  
   inline void SetLossWeights(const vector*>& top) {  
     const int num_loss_weights = layer_param_.loss_weight_size();  
     if (num_loss_weights) {  
       CHECK_EQ(top.size(), num_loss_weights) << "loss_weight must be "  
           "unspecified or specified once per top blob.";  
       for (int top_id = 0; top_id < top.size(); ++top_id) {  
         const Dtype loss_weight = layer_param_.loss_weight(top_id);  
         if (loss_weight == Dtype(0)) { continue; }//如果为0不对loss进行操作  
         this->set_loss(top_id, loss_weight);  
         const int count = top[top_id]->count();  
         Dtype* loss_multiplier = top[top_id]->mutable_cpu_diff();  
         caffe_set(count, loss_weight, loss_multiplier);//将loss_multiplier设为loss_weight  
       }   
     }  
   }  
   
   DISABLE_COPY_AND_ASSIGN(Layer);  
 };  // class Layer  
   
 /* 
 前传调用对应的Forward_cpu或者Forward_gpu而我们知道Forward_cpu是纯虚函数，必须要实而Forward_gpu是虚函数，如果不实现就调用 Forward_cpu函数了。前传（你必须实现自己的Forward_cpu，实现Forward_gpu是可选的） 
 */  
 // Forward and backward wrappers. You should implement the cpu and  
 // gpu specific implementations instead, and should not change these  
 // functions.  
 template <typename Dtype>  
 inline Dtype Layer::Forward(const vector*>& bottom,  
     const vector*>& top) {  
   Dtype loss = 0;    
   // 根据bottom设置top的形状    
   Reshape(bottom, top);    
   // 设置运行模式CPU or GPU    
   switch (Caffe::mode()) {    
   case Caffe::CPU:    
     // 调用CPU的前传    
     Forward_cpu(bottom, top);    
     // 前传计算完之后计算损失（只有最后一层才进行计算，其余层都不用）    
     for (int top_id = 0; top_id < top.size(); ++top_id) {    
       if (!this->loss(top_id)) { continue; }    
       const int count = top[top_id]->count();    
       // 获取前传的数据    
       const Dtype* data = top[top_id]->cpu_data();    
       // 获取梯度（\frac{\partial Loss}{\partial net}）    
       const Dtype* loss_weights = top[top_id]->cpu_diff();    
       // data与loss_weight的点积，即得损失函数关于当前层权重的偏导了    
     // \frac{\partial Loss}{\partial net} * \frac{\partial net}{\frac{W}}    
     // = \frac{\partial Loss}{\partial W}    
       loss += caffe_cpu_dot(count, data, loss_weights);    
     }    
     break;    
   case Caffe::GPU:    
     // GPU前传    
     Forward_gpu(bottom, top);    
 #ifndef CPU_ONLY    
     // 同上，只不过这里用GPU来计算点积了    
     for (int top_id = 0; top_id < top.size(); ++top_id) {    
       if (!this->loss(top_id)) { continue; }    
       const int count = top[top_id]->count();    
       // 获取GPU上的数据    
       const Dtype* data = top[top_id]->gpu_data();    
       const Dtype* loss_weights = top[top_id]->gpu_diff();    
       Dtype blob_loss = 0;    
       caffe_gpu_dot(count, data, loss_weights, &blob_loss);    
       loss += blob_loss;    
     }    
 #endif    
     break;  
   default:  
     LOG(FATAL) << "Unknown caffe mode.";  
   }  
   return loss;  
 }  
   
 template <typename Dtype>  
 inline void Layer::Backward(const vector*>& top,  
     const vector<bool>& propagate_down,  
     const vector*>& bottom) {  
   switch (Caffe::mode()) {  
   case Caffe::CPU:  
     Backward_cpu(top, propagate_down, bottom);  
 //根据blob top 的error 梯度（diff）计算bottom 的 error 梯度。 propagate_down 是长度   
 //和bottom 相同的vector ，用于控制是否需要对对应的bottom 元素传播梯度。具体layer具体定义。  
     break;  
   case Caffe::GPU:  
     Backward_gpu(top, propagate_down, bottom);  
     break;  
   default:  
     LOG(FATAL) << "Unknown caffe mode.";  
   }  
 }  
 Layer的序列化函数,将layer的层说明参数layer_param_，层权值和偏置  
 参数blobs_复制到LayerParameter对象，便于写到磁盘，  
 // Serialize LayerParameter to protocol buffer  
 template <typename Dtype>  
 void Layer::ToProto(LayerParameter* param, bool write_diff) {  
   param->Clear();  
   param->CopyFrom(layer_param_); // 复制层说明参数layer_param_  
   param->clear_blobs();  
   // 复制层权值和偏置参数blobs_  
   for (int i = 0; i < blobs_.size(); ++i) {  
     blobs_[i]->ToProto(param->add_blobs(), write_diff);  
   }  
 }  
   
 }  // namespace caffe  
   
 #endif  // CAFFE_LAYER_H_  
 
  

里面还有一个比较重要的文件但是比较枯燥，我也对GPU不是很懂。一起看看device_alternate.hpp：

 
    
      #ifndef CAFFE_UTIL_DEVICE_ALTERNATE_H_  
 #define CAFFE_UTIL_DEVICE_ALTERNATE_H_  
   
 #ifdef CPU_ONLY  // CPU-only Caffe.  
   
 #include   
   
 // Stub out GPU calls as unavailable.  
 //打印出GPU不可以使用  
 #define NO_GPU LOG(FATAL) << "Cannot use GPU in CPU-only Caffe: check mode."  
 // 定义给定类的前向和反向（GPU和CPU）传播的函数定义  
 #define STUB_GPU(classname) \  
 template <typename Dtype> \  
 void classname::Forward_gpu(const vector*>& bottom, \  
     const vector*>& top) { NO_GPU; } \  
 template <typename Dtype> \  
 void classname::Backward_gpu(const vector*>& top, \  
     const vector<bool>& propagate_down, \  
     const vector*>& bottom) { NO_GPU; } \  
   
 #define STUB_GPU_FORWARD(classname, funcname) \  
 template <typename Dtype> \  
 void classname::funcname##_##gpu(const vector*>& bottom, \  
     const vector*>& top) { NO_GPU; } \  
   
 #define STUB_GPU_BACKWARD(classname, funcname) \  
 template <typename Dtype> \  
 void classname::funcname##_##gpu(const vector*>& top, \  
     const vector<bool>& propagate_down, \  
     const vector*>& bottom) { NO_GPU; } \  
   
 #else  // Normal GPU + CPU Caffe.  
   
 #include   
 #include   
 #include   
 #include   
 #include   // cuda driver types  
 #ifdef USE_CUDNN  // cuDNN acceleration library.  
 #include "caffe/util/cudnn.hpp"  
 #endif  
   
 //  
 // CUDA macros  
 //  
   
 // CUDA: various checks for different function calls.  
 #define CUDA_CHECK(condition) \  
   /* Code block avoids redefinition of cudaError_t error */ \  
   do { \  
     cudaError_t error = condition; \  
     CHECK_EQ(error, cudaSuccess) << " " << cudaGetErrorString(error); \  
   } while (0)  
   
 #define CUBLAS_CHECK(condition) \  
   do { \  
     cublasStatus_t status = condition; \  
     CHECK_EQ(status, CUBLAS_STATUS_SUCCESS) << " " \  
       << caffe::cublasGetErrorString(status); \  
   } while (0)  
   
 #define CURAND_CHECK(condition) \  
   do { \  
     curandStatus_t status = condition; \  
     CHECK_EQ(status, CURAND_STATUS_SUCCESS) << " " \  
       << caffe::curandGetErrorString(status); \  
   } while (0)  
 //caffe采取的线程格和线程块的维数设计  
 // blockDim.x* gridDim.x表示的是该线程格所有线程的数量  
 //n表示核函数总共要处理的元素个数  
 #define CUDA_KERNEL_LOOP(i, n) \  
   for (int i = blockIdx.x * blockDim.x + threadIdx.x; \  
        i < (n); \  
        i += blockDim.x * gridDim.x)  
   
 // CUDA: check for error after kernel execution and exit loudly if there is one.  
 #define CUDA_POST_KERNEL_CHECK CUDA_CHECK(cudaPeekAtLastError())  
   
 namespace caffe {  
   
   
 //CUDA的lib错误报告  
 const char* cublasGetErrorString(cublasStatus_t error);  
 const char* curandGetErrorString(curandStatus_t error);  
   
 // CUDA: thread number configuration.  
 // Use 1024 threads per block, which requires cuda sm_2x or above,  
 // or fall back to attempt compatibility (best of luck to you).  
 #if __CUDA_ARCH__ >= 200  
     const int CAFFE_CUDA_NUM_THREADS = 1024;  
 #else  
     const int CAFFE_CUDA_NUM_THREADS = 512;  
 #endif  
   
   
 //CUDA线程的块的数量  
 inline int CAFFE_GET_BLOCKS(const int N) {  
   return (N + CAFFE_CUDA_NUM_THREADS - 1) / CAFFE_CUDA_NUM_THREADS;  
 }  
   
 }  // namespace caffe  
   
 #endif  // CPU_ONLY  
   
 #endif  // CAFFE_UTIL_DEVICE_ALTERNATE_H_  
 
 
  

你可能感兴趣的:(DL_ML_CNN原理,CNN--ANN--Deep,Learning)

【AI热点】OpenAI新发布API技术深度洞察碣石潇湘无限路人工智能
以下内容基于对OpenAI最新发布的AgentAPI及相关工具的官方信息、技术演示和已有报道进行综合解读与深度分析，供您参考。本报告将围绕最新发布的ResponsesAPI（智能体核心新接口）、内置工具（websearch、filesearch、computeruse）、全新的AgentsSDK以及核心安全与可观测性机制，帮助您深入理解其原理、特性及应用价值。一、背景：为什么要推出新的AgentA
【云原生】深入浅出 K8s 设备插件技术（Device Plugin）碣石潇湘无限路 kubernetes 容器云原生
摘要：Kubernetes提供了DevicePlugin机制，用于向kubelet上报硬件信息并配置容器资源。本文以NVIDIAGPUPlugin为例，通俗易懂并深入浅出地剖析注册、ListAndWatch、Allocate及kubelet管理流程，介绍常见问题和配置要点。先用一张原理概览图把DevicePlugin和kubelet之间的交互勾勒出来，让大家感受下插件技术的整体架构（示例以NVID
使用 Python 编写网络爬虫：从入门到实战 Manaaaaaaa python 爬虫开发语言
网络爬虫是一种自动化获取网页信息的程序，通常用于数据采集、信息监控等领域。Python是一种广泛应用于网络爬虫开发的编程语言，具有丰富的库和框架来简化爬虫的编写和执行过程。本文将介绍如何使用Python编写网络爬虫，包括基本原理、常用库和实战案例。一、原理介绍网络爬虫是一种自动化程序，通过模拟浏览器的行为向网络服务器发送HTTP请求，获取网页内容并进一步提取所需信息的过程。网络爬虫主要用于数据采集
论单调队列优化DP VU-zFaith870 c++动态规划推荐算法
前情提要，参考资料：单调队列优化DP（超详细！！！）-endl\n-博客园【动态规划】选择数字（单调队列优化dp）_哔哩哔哩_bilibili背景：最近作者快被DP逼疯了，写篇博客做记录。以下是对各DP的原理阐释：单调队列通过队列元素的吸入与弹出，形成单调性的结构，使算法能够进行线性处理，大大优化了时间复杂度。接下来讲解单调队列在区间DP、背包DP、树形DP还有数位DP中的应用：1.单调队列优化区
DeepSeek：技术创作者的内容革命，从代码到爆文的AI全栈攻略不想加班的码小牛人工智能 ai chatgpt
一、为什么技术创作者需要关注DeepSeek？作为CSDN的资深用户，你是否经历过这些痛点？选题焦虑：技术热点日新月异，如何抓住「大模型优化」或「量子计算落地」等前沿方向？写作卡顿：明明代码跑通了，却在技术原理描述环节反复修改效率瓶颈：既要写技术文档又要运营专栏，时间永远不够用DeepSeek的多模态理解能力（支持代码+自然语言混合输入）和领域自适应特性（自动识别技术文档/教程/测评等文体），让它
SDN技术解码：架构革新与数字化转型实践指南 ——从控制平面到AI融合的网络进化论不想加班的码小牛架构平面人工智能网络协议
一、引言：SDN如何重塑网络价值体系？在数字化浪潮下，传统网络架构的僵化性已成为制约业务创新的瓶颈。SDN（软件定义网络）通过解耦控制与转发平面，将网络从“黑盒设备”转变为“可编程服务”，为云计算、物联网等领域提供动态、智能的网络底座。例如，某金融企业通过SDN实现跨地域数据中心流量智能调度，业务故障恢复时间缩短至分钟级。二、SDN核心架构与技术原理1.三层架构：控制-转发-应用的协同生态•控制层
NPU的工作原理：神经网络计算的流水线绿算技术 NPU架构介绍神经网络人工智能深度学习
NPU的工作原理可以概括为以下几个步骤：1.模型加载·将训练好的神经网络模型加载到NPU的内存中。2.数据输入·输入数据（如图像、语音）通过接口传输到NPU。3.计算执行·NPU根据模型结构，依次执行卷积、池化、全连接等计算任务。·矩阵乘法单元和卷积加速器并行工作，高效完成计算。4.结果输出·计算完成后，输出结果（如分类标签、检测框）返回给主机或其他处理器。5.任务调度·在多任务场景下，NPU的任
数据处理的革命性引擎绿算技术 DPU架构介绍硬件工程科技缓存
随着数据量的爆炸式增长和计算需求的多样化，传统的CPU和GPU已经无法完全满足现代数据中心和高性能计算的需求。在这样的背景下，DPU（DataProcessingUnit，数据处理单元）应运而生。DPU是一种专为数据处理和网络加速设计的处理器，正在成为数据中心和云计算架构中的重要组成部分。接下来，由绿算技术与大家一起学习DPU有哪些功能、技术、原理等等内容。DPU的功能：数据处理的“全能选手”DP
TPAMI 2024 | 学习人类教育智慧：以学生为中心的知识蒸馏方法小白学视觉论文解读 IEEE TPAMI 知识蒸馏 TPAMI 论文解读深度学习
题目：LearningFromHumanEducationalWisdom:AStudent-CenteredKnowledgeDistillationMethod学习人类教育智慧：以学生为中心的知识蒸馏方法作者：S.Yang;J.Yang;M.Zhou;Z.Huang;W.-S.Zheng;X.Yang;J.Ren摘要现有的知识蒸馏研究通常侧重于以教师为中心的方法，其中教师网络根据自身标准进行训
必看！一文读懂知识蒸馏技术小天才学习机打游戏人工智能知识图谱神经网络 langchain windows
导读最近，DeepSeek的爆火让大家对人工智能领域的技术发展又有了新的关注。而知识蒸馏作为深度学习中一项重要的技术，也在背后默默地发挥着作用，今天就来给大家详细介绍一下知识蒸馏及其相关原理。1.知识蒸馏是什么在深度学习领域，大型模型（如DeepSeek）通常具有强大的性能，但它们的计算量和参数量都非常庞大，这使得它们难以在资源受限的设备（如移动设备或嵌入式设备）上部署。例如，GPT-3在570G
分布式系统中分布式ID生成方案的技术详解心存の思念分布式
分布式系统中分布式ID生成方案的技术详解在复杂的分布式系统中，数据被分散存储在不同的节点上，每个节点都有自己独立的数据库。为了保证数据的唯一性和一致性，我们需要为每个数据项生成一个全局唯一的主键ID。本文将详细解析几种常用的分布式ID生成方案，包括它们的工作原理、优缺点以及适用场景。一、分布式系统唯一ID的特点全局唯一性：不能出现重复的ID号，这是最基本的要求。趋势递增：在MySQLInnoDB引
模拟退火算法详解琛哥的程序算法模拟退火算法机器学习
一、引言模拟退火算法（SimulatedAnnealing，简称SA）是一种通用概率型优化算法，用来在一个大的搜寻空间内找寻问题的最优解。其出发点是基于物理中固体物质的退火过程与一般组合优化问题之间的相似性。模拟退火算法从某一较高初温出发，伴随温度参数的不断下降,结合概率突跳特性在解空间中随机寻找目标函数的全局最优解，即在局部最优解能概率性地跳出并最终趋于全局最优。二、算法原理物理退火过程加温过程
LangChain深度解析以及主要应用场景小Rr langchain python django numpy
文章目录LangChain是什么？LangChain的核心组件（1）PromptTemplates（提示模板）原理代码实例应用场景提示词优化策略（2）LLMs（大语言模型）原理代码实例应用场景调优策略（3）Chains（多步任务链）原理代码实例应用场景优化策略（4）Memory（记忆）原理代码实例应用场景优化策略（5）Agents（智能代理）原理代码实例应用场景优化策略案例分享案例1：电商智能客服
【蓝桥杯】24省赛：数字串个数遥感小萌新蓝桥杯蓝桥杯职场和发展
思路本质是组合数学问题：9个数字组成10000位数字有9**10000可能不包括3的可能8**10000不包括7的可能8**10000既不包括3也不包括77**10000根据容斥原理：结果为9∗∗10000−8∗∗10000−8∗∗10000+7∗∗100009**10000-8**10000-8**10000+7**100009∗∗10000−8∗∗10000−8∗∗10000+7∗∗10000
人脸识别生物特征脱敏：不可逆编码技术与隐私保护实战燃灯工作室 Ai 自动化 pytorch tensorflow 人工智能
一、技术原理与数学基础1.1特征脱敏核心思想脱敏函数f:Rd→Rk(k
模型可解释性：基于博弈论的SHAP值计算与特征贡献度分析（附PyTorch/TensorFlow实现）燃灯工作室 Ai pytorch tensorflow 人工智能
一、技术原理与数学推导（含典型案例）1.1Shapley值基础公式SHAP值基于合作博弈论中的Shapley值，计算公式为：ϕi=∑S⊆F∖{i}∣S∣!(∣F∣−∣S∣−1)!∣F∣![f(S∪{i})−f(S)]\phi_i=\sum_{S\subseteqF\setminus\{i\}}\frac{|S|!(|F|-|S|-1)!}{|F|!}[f(S\cup\{i\})-f(S)]ϕi=S
模型可解释性：基于因果推理的反事实生成与决策可视化燃灯工作室 Ai 人工智能数学建模学习机器学习
1.技术原理与数学公式1.1因果推理基础结构方程模型（SEM）：X=fX(PaX,UX)X=f_X(Pa_X,U_X)X=fX(PaX,UX)其中PaXPa_XPaX为父节点集合，UXU_XUX为外生变量反事实定义：YX=x(u)=Ydo(X=x)(u)Y_{X=x}(u)=Y_{do(X=x)}(u)YX=x(u)=Ydo(X=x)(u)表示在相同背景条件uuu下，强制变量XXX取xxx时的结果
自动化特征选择：基于模型重要性的递归消除原理与实战指南燃灯工作室 Ai 自动化运维
一、技术原理与数学公式1.1递归特征消除（RFE）核心思想J(S)=∑i=1n∣wi∣(特征重要性评分)J(S)=\sum_{i=1}^n|w_i|\quad(特征重要性评分)J(S)=i=1∑n∣wi∣(特征重要性评分)Sk+1=Sk−arg⁡min⁡fJ(Sk∖{f})(迭代消除策略)S_{k+1}=S_k-\arg\min_{f}J(S_k\setminus\{f\})\quad(迭代消除策
推理流水线DAG调度：多模型组合执行优化方案燃灯工作室 Ai 人工智能数学建模学习机器学习计算机视觉
一、技术原理与数学模型1.1DAG调度核心公式设推理流水线由n个模型节点组成，定义：V={v1,v2,...,vn}V=\{v_1,v_2,...,v_n\}V={v1,v2,...,vn}为节点集合E={(vi,vj)∣vi→vj}E=\{(v_i,v_j)|v_i\rightarrowv_j\}E={(vi,vj)∣vi→vj}为边集合C(vi)C(v_i)C(vi)为节点viv_ivi的计算
边缘设备模型量化部署：TFLite INT8校准实现细节深度解析燃灯工作室 Ai 人工智能机器学习
一、技术原理与数学公式INT8量化的核心是通过线性映射将浮点数值范围（[-max,max]）映射到8位整数范围（[-128,127]）。校准过程通过分析真实数据分布确定最优缩放因子（scale）和零点（zeropoint）：量化公式：Q=round(float_valuescale)+zero_pointQ=round(\frac{float\_value}{scale})+zero\_point
基于时间序列预测的推理服务弹性扩缩容实战指南：（行业案例+数学推导+源码解析）燃灯工作室 Ai 计算机视觉语音识别目标检测机器学习人工智能
技术原理（数学公式）整体架构请求量预测→扩缩容决策→资源配置动态调整三阶段闭环，周期为5-30分钟核心预测模型（时间序列预测）LSTM预测公式（CSDN兼容格式）：$$h_t=\text{LSTM}(x_t,h_{t-1})\\\hat{y}_{t+1}=W_h\cdoth_t+b_h$$其中Wh∈Rd×1W_h\in\mathbb{R}^{d\times1}Wh∈Rd×1为权重矩阵，ddd为隐藏
深度解析A/B测试中的哈希分桶策略：从原理到实战的流量分层方案燃灯工作室 Python 哈希算法算法
一、技术原理与数学基础1.1哈希分桶的核心机制核心公式：桶编号=Hash(用户ID+实验层种子)modN基于**双重哈希（DoubleHashing）**实现流量的完全正交切割：{∀u∈U,Layerij(u)=H(H(u∣∣seedj)∣∣seedi)mod N∀i≠k,H(⋅)满足P(Layeri(u)=m∩Layerk(u)=n)=1/(N2)\begin{cases}\forallu\i
AUTOSAR从入门到精通-汽车电子电气架构（EEA）格图素书汽车
目录前言算法原理EEA发展历程->分布式架构（distributed）：->基于域的集中式架构(DCUbasedcentralized)：->基于域融合的带状架构(DCUfusionbasedzonal)：什么是电子电气架构？EEA的特点EEA发展的三大阶段特征第一阶段：分布式架构第二阶段：基于域的集中式架构（转型中）第三阶段：基于域融合的带状架构（未来趋势）车载电子电气架构作用EEA开发工作内容
【图像处理】ISP(Image Signal Processor) 图像处理器的用途和工作原理？ AndrewHZ 图像处理基石图像处理智能手机影像系统算法深度学习人工智能 ISP
ISP（图像信号处理器）是数字影像设备的“视觉大脑”，负责将传感器捕获的原始电信号转化为我们看到的高清图像。以下从用途和工作原理两方面通俗解析：一、ISP的核心用途：让照片“更像眼睛看到的”提升画质：降噪：去除暗光下的噪点（如手机夜景模式，通过多帧合成+算法抑制噪点）。色彩还原：校正传感器偏色（例如索尼传感器常偏黄，ISP通过白平衡算法还原真实色彩）。动态范围优化：保留高光和暗部细节（类似HDR，
《侯捷 C++ 系列精品课学习之旅：知识盛宴与成长感悟》一朵忧伤的蔷薇 c++学习 jvm
一、初遇C++：基础与语法的探索课程伊始，侯捷老师以深入浅出的方式，为我们讲解了C++的基础语法。从变量、数据类型到控制结构，每一个知识点都被剖析得细致入微。我印象尤为深刻的是老师对指针的讲解。指针作为C++的核心概念之一，向来以其抽象和复杂而让初学者望而却步。然而，侯老师通过生动形象的比喻和丰富的示例，将指针的原理和应用讲解得通俗易懂。他将指针比作地址，就像现实生活中的门牌号，通过它我们可以准确
NAT 和 IP 直接通信的区别曹天骄 tcp/ip 服务器网络协议
1.NAT的工作原理NAT（NetworkAddressTranslation，网络地址转换）是一种网络技术，用于将私有网络中的IP地址映射到公共网络中的IP地址，或者在不同的网络之间转换IP地址。NAT的主要目的是解决IPv4地址不足的问题，同时提供一定程度的安全性和灵活性。NAT设备（如路由器或防火墙）会在数据包经过时修改其源IP地址或目标IP地址。常见的NAT类型包括：静态NAT：将私有IP
Linux中断机制详解：从原理到实践 AllenBright #Linux linux 运维服务器
想象一下医院的急诊科：当有危重病人到达时，护士会立即按下紧急呼叫按钮，打断医生当前的常规工作，优先处理最紧急的情况。这种中断响应机制正是计算机系统中中断（Interrupt）的核心思想。在Linux内核中，中断是硬件与软件交互的核心机制，直接关系到系统的响应速度、吞吐量和稳定性。本文将深入剖析Linux中断的工作原理，并演示如何在实际操作中管理和优化中断。1.中断的本质与分类1.1什么是中断？中断
2024最新版头歌实践教学平台数据库原理与应用实训答案泠波数据库
实训一:数据定义和操纵(4课时)初识MySQL数据库第1关：创建数据库mysql-uroot-p123123-h127.0.0.1createdatabaseMyDb;showdatabases;第2关：创建表mysql-uroot-p123123-h127.0.0.1createdatabaseTestDb;createtablet_emp(idint,namevarchar(32),deptI
路由器配置DHCP服务器执笔忆长安网络网络协议负载均衡
文章目录前言1.DHCP的优点2.DHCP的使用3.DHCP的工作原理4.dhcp服务器实验总结前言DHCP（动态主机配置协议）是一个局域网的网络协议。指的是由服务器控制一段IP地址范围，客户机登录服务器时就可以自动获得服务器分配的IP地址和子网掩码。默认情况下，DHCP作为WindowsServer的一个服务组件不会被系统自动安装，还需要管理员手动安装并进行必要的配置。1.DHCP的优点1.简化
Python 爬虫：一文掌握 SVG 映射反爬虫数据知道 2025年爬虫和逆向教程 python 爬虫 microsoft 爬虫逆向数据采集
更多内容请见：爬虫和逆向教程-专栏介绍和目录文章目录1.SVG概述1.1SVG的优点1.1映射反爬虫的原理2.SVG映射反爬虫的示例3.应对SVG映射反爬虫的方法3.1解析SVG图像3.2处理自定义字体3.3使用OCR技术3.4动态生成SVG的处理4.实战案例4.1使用SVG映射显示价格4.2解析SVG文件并提取其中的内容和属性4.3模拟交互行为4.4使用无头浏览器4.5某网站使用SVG实现动态验
Java实现的简单双向Map，支持重复Value superlxw1234 java 双向map
关键字：Java双向Map、DualHashBidiMap 有个需求，需要根据即时修改Map结构中的Value值，比如，将Map中所有value=V1的记录改成value=V2，key保持不变。数据量比较大，遍历Map性能太差，这就需要根据Value先找到Key，然后去修改。即：既要根据Key找Value，又要根据Value
PL/SQL触发器基础及例子百合不是茶 oracle数据库触发器 PL/SQL编程
触发器的简介; 触发器的定义就是说某个条件成立的时候，触发器里面所定义的语句就会被自动的执行。因此触发器不需要人为的去调用，也不能调用。触发器和过程函数类似过程函数必须要调用, 一个表中最多只能有12个触发器类型的,触发器和过程函数相似触发器不需要调用直接执行, 触发时间：指明触发器何时执行，该值可取： before：表示在数据库动作之前触发
[时空与探索]穿越时空的一些问题 comsci 问题
我们还没有进行过任何数学形式上的证明,仅仅是一个猜想..... 这个猜想就是; 任何有质量的物体(哪怕只有一微克)都不可能穿越时空,该物体强行穿越时空的时候,物体的质量会与时空粒子产生反应,物体会变成暗物质,也就是说,任何物体穿越时空会变成暗物质..(暗物质就我的理
easy ui datagrid上移下移一行商人shang js 上移下移 easyui datagrid
/** * 向上移动一行 * * @param dg * @param row */ function moveupRow(dg, row) { var datagrid = $(dg); var index = datagrid.datagrid("getRowIndex", row); if (isFirstRow(dg, row)) {
Java反射 oloz 反射
本人菜鸟，今天恰好有时间，写写博客，总结复习一下java反射方面的知识，欢迎大家探讨交流学习指教首先看看java中的Class package demo; public class ClassTest { /*先了解java中的Class*/ public static void main(String[] args) { //任何一个类都
springMVC 使用JSR-303 Validation验证杨白白 spring mvc
JSR-303是一个数据验证的规范，但是spring并没有对其进行实现，Hibernate Validator是实现了这一规范的，通过此这个实现来讲SpringMVC对JSR-303的支持。 JSR-303的校验是基于注解的，首先要把这些注解标记在需要验证的实体类的属性上或是其对应的get方法上。登录需要验证类 public class Login { @NotEmpty
log4j 香水浓 log4j
log4j.rootCategory=DEBUG, STDOUT, DAILYFILE, HTML, DATABASE #log4j.rootCategory=DEBUG, STDOUT, DAILYFILE, ROLLINGFILE, HTML #console log4j.appender.STDOUT=org.apache.log4j.ConsoleAppender log4
使用ajax和history.pushState无刷新改变页面URL agevs jquery 框架 Ajax html5 chrome
表现如果你使用chrome或者firefox等浏览器访问本博客、github.com、plus.google.com等网站时，细心的你会发现页面之间的点击是通过ajax异步请求的，同时页面的URL发生了了改变。并且能够很好的支持浏览器前进和后退。是什么有这么强大的功能呢？ HTML5里引用了新的API，history.pushState和history.replaceState，就是通过
centos中文乱码 AILIKES centos OS ssh
一、CentOS系统访问 g.cn ，发现中文乱码。于是用以前的方式：yum -y install fonts-chinese CentOS系统安装后，还是不能显示中文字体。我使用 gedit 编辑源码，其中文注释也为乱码。后来，终于找到以下方法可以解决，需要两个中文支持的包： fonts-chinese-3.02-12.
触发器 baalwolf 触发器
触发器(trigger)：监视某种情况，并触发某种操作。触发器创建语法四要素：1.监视地点(table) 2.监视事件(insert/update/delete) 3.触发时间(after/before) 4.触发事件(insert/update/delete) 语法： create trigger triggerName after/before
JS正则表达式的i m g bijian1013 JavaScript 正则表达式
g:表示全局（global)模式，即模式将被应用于所有字符串，而非在发现第一个匹配项时立即停止。 i:表示不区分大小写（case-insensitive）模式，即在确定匹配项时忽略模式与字符串的大小写。 m:表示
HTML5模式和Hashbang模式 bijian1013 JavaScript AngularJS Hashbang模式 HTML5模式
我们可以用$locationProvider来配置$location服务（可以采用注入的方式，就像AngularJS中其他所有东西一样）。这里provider的两个参数很有意思，介绍如下。 html5Mode 一个布尔值，标识$location服务是否运行在HTML5模式下。 ha
[Maven学习笔记六]Maven生命周期 bit1129 maven
从mvn test的输出开始说起当我们在user-core中执行mvn test时，执行的输出如下： /software/devsoftware/jdk1.7.0_55/bin/java -Dmaven.home=/software/devsoftware/apache-maven-3.2.1 -Dclassworlds.conf=/software/devs
【Hadoop七】基于Yarn的Hadoop Map Reduce容错 bit1129 hadoop
运行于Yarn的Map Reduce作业，可能发生失败的点包括 Task Failure Application Master Failure Node Manager Failure Resource Manager Failure 1. Task Failure 任务执行过程中产生的异常和JVM的意外终止会汇报给Application Master。僵死的任务也会被A
记一次数据推送的异常解决端口解决 ronin47 记一次数据推送的异常解决
　　需求：从db获取数据然后推送到B 程序开发完成，上jboss,刚开始报了很多错，逐一解决，可最后显示连接不到数据库。机房的同事说可以ping 通。　　自已画了个图，逐一排除，把linux 防火墙　和　setenforce　设置最低。　　　service iptables stop
巧用视错觉-UI更有趣 brotherlamp UI ui视频 ui教程 ui自学 ui资料
我们每个人在生活中都曾感受过视错觉（optical illusion）的魅力。视错觉现象是双眼跟我们开的一个玩笑，而我们往往还心甘情愿地接受我们看到的假象。其实不止如此，视觉错现象的背后还有一个重要的科学原理——格式塔原理。格式塔原理解释了人们如何以视觉方式感觉物体，以及图像的结构，视角，大小等要素是如何影响我们的视觉的。在下面这篇文章中，我们首先会简单介绍一下格式塔原理中的基本概念，
线段树-poj1177-N个矩形求边长（离散化+扫描线） bylijinnan 数据结构算法线段树
package com.ljn.base; import java.util.Arrays; import java.util.Comparator; import java.util.Set; import java.util.TreeSet; /** * POJ 1177 (线段树+离散化+扫描线)，题目链接为http://poj.org/problem?id=1177
HTTP协议详解 chicony http协议
引言
Scala设计模式 chenchao051 设计模式 scala
Scala设计模式我的话：在国外网站上看到一篇文章，里面详细描述了很多设计模式，并且用Java及Scala两种语言描述，清晰的让我们看到各种常规的设计模式，在Scala中是如何在语言特性层面直接支持的。基于文章很nice，我利用今天的空闲时间将其翻译，希望大家能一起学习，讨论。翻译
安装mysql daizj mysql 安装
安装mysql (1)删除linux上已经安装的mysql相关库信息。rpm -e xxxxxxx --nodeps (强制删除) 执行命令rpm -qa |grep mysql 检查是否删除干净 (2)执行命令 rpm -i MySQL-server-5.5.31-2.el
HTTP状态码大全 dcj3sjt126com http状态码
完整的 HTTP 1.1规范说明书来自于RFC 2616，你可以在http://www.talentdigger.cn/home/link.php?url=d3d3LnJmYy1lZGl0b3Iub3JnLw%3D%3D在线查阅。HTTP 1.1的状态码被标记为新特性，因为许多浏览器只支持 HTTP 1.0。你应只把状态码发送给支持 HTTP 1.1的客户端，支持协议版本可以通过调用request
asihttprequest上传图片 dcj3sjt126com ASIHTTPRequest
NSURL *url =@"yourURL"; ASIFormDataRequest*currentRequest =[ASIFormDataRequest requestWithURL:url]; [currentRequest setPostFormat:ASIMultipartFormDataPostFormat];[currentRequest se
C语言中，关键字static的作用 e200702084 C++c C#
在C语言中，关键字static有三个明显的作用： 1)在函数体，局部的static变量。生存期为程序的整个生命周期，（它存活多长时间）；作用域却在函数体内（它在什么地方能被访问（空间））。一个被声明为静态的变量在这一函数被调用过程中维持其值不变。因为它分配在静态存储区，函数调用结束后并不释放单元，但是在其它的作用域的无法访问。当再次调用这个函数时，这个局部的静态变量还存活，而且用在它的访
win7/8使用curl geeksun win7
1. WIN7/8下要使用curl，需要下载curl-7.20.0-win64-ssl-sspi.zip和Win64OpenSSL_Light-1_0_2d.exe。下载地址： http://curl.haxx.se/download.html 请选择不带SSL的版本，否则还需要安装SSL的支持包 2. 可以给Windows增加c
Creating a Shared Repository; Users Sharing The Repository hongtoushizi git
转载自： http://www.gitguys.com/topics/creating-a-shared-repository-users-sharing-the-repository/ Commands discussed in this section: git init –bare git clone git remote git pull git p
Java实现字符串反转的8种或9种方法 Josh_Persistence 异或反转递归反转二分交换反转 java字符串反转栈反转
注：对于第7种使用异或的方式来实现字符串的反转，如果不太看得明白的，可以参照另一篇博客： http://josh-persistence.iteye.com/blog/2205768 /** * */ package com.wsheng.aggregator.algorithm.string; import java.util.Stack; /**
代码实现任意容量倒水问题 home198979 PHP 算法倒水
形象化设计模式实战 HELLO!架构 redis命令源码解析倒水问题：有两个杯子，一个A升，一个B升，水有无限多，现要求利用这两杯子装C
Druid datasource zhb8015 druid
推荐大家使用数据库连接池 DruidDataSource. http://code.alibabatech.com/wiki/display/Druid/DruidDataSource DruidDataSource经过阿里巴巴数百个应用一年多生产环境运行验证，稳定可靠。它最重要的特点是：监控、扩展和性能。下载和Maven配置看这里： http
两种启动监听器ApplicationListener和ServletContextListener spjich java spring 框架
引言:有时候需要在项目初始化的时候进行一系列工作，比如初始化一个线程池，初始化配置文件，初始化缓存等等，这时候就需要用到启动监听器，下面分别介绍一下两种常用的项目启动监听器 ServletContextListener 特点: 依赖于sevlet容器，需要配置web.xml 使用方法: public class StartListener implements
JavaScript Rounding Methods of the Math object 何不笑 JavaScript Math
The next group of methods has to do with rounding decimal values into integers. Three methods — Math.ceil(), Math.floor(), and Math.round() — handle rounding in differen