JH0lmes

caffe中的layer

layer是神经网络搭建的脚手架，理解了layer，才能盖好神经网络这座摩天大楼。

下图是一张关于layer的思维导图，在功力到达一定程度的时候才可练此功，到时一定会有不一样的收获。

1. Outline

此部分主要概述了与layer有关的方方面面。

1.1 layer.hpp

与layer相关的头文件：

layer.hpp

common_layers.hpp
data_layers.hpp
loss_layers.hpp
neuron_layers.hpp
vision_layers.hp

其中layer.hpp是抽象出来的基类，下面的的五个头文件都是在其基础上的继承，如上图中的五个部分。在layer.hpp头文件里，包含了这几个头文件：

#include "caffe/blob.hpp"
#include "caffe/common.hpp"
#include "caffe/proto/caffe.pb.h"
#include "caffe/util/device_alternate.hpp"

在device_alternate.hpp中，通过 #ifdef CPU_ONLY定义了一些宏来取消GPU的调用：

#define STUB_GPU(classname)
#define STUB_GPU_FORWARD(classname, funcname)
#define STUB_GPU_BACKWARD(classname, funcname)

layer中有这三个主要参数：

LayerParameter layer_param_;           // 这个是protobuf文件中存储的layer参数
vector>> blobs_; // 这个存储的是layer的参数，在程序中用的
vector param_propagate_down_;    // 这个bool表示是否计算各个blob参数的diff，即传播误差

Layer类的构建函数 explicit Layer(const LayerParameter& param) : layer_param_(param) 会尝试从protobuf文件读取参数。其三个主要接口：

virtual void SetUp(const vector*>& bottom, vector*>* top)
inline Dtype Forward(const vector*>& bottom, vector*>* top);
inline void Backward(const vector*>& top, const vector& propagate_down, const *>* bottom);

SetUp 函数需要根据实际的参数设置进行实现，对各种类型的参数初始化；Forward 和 Backward 对应前向计算和反向更新，输入统一都是 bottom，输出为 top，其中 Backward 里面有个 propagate_down 参数，用来表示该Layer是否反向传播参数。

在 Forward 和 Backward 的具体实现里，会根据 Caffe::mode() 进行对应的操作，即使用cpu或者gpu进行计算，两个都实现了对应的接口 Forward_cpu、Forward_gpu 和 Backward_cpu、Backward_gpu，这些接口都是virtual，具体还是要根据layer的类型进行对应的计算（注意：有些layer并没有GPU计算的实现，所以封装时加入了CPU的计算作为后备）。另外，还实现了ToProto的接口，将Layer的参数写入到protocol buffer文件中。

1.2 data_layers.hpp

data_layers.hpp这个头文件包含了这几个头文件：

#include "boost/scoped_ptr.hpp"
#include "hdf5.h"
#include "leveldb/db.h"
#include "lmdb.h"
#include "caffe/blob.hpp"
#include "caffe/common.hpp"
#include "caffe/filler.hpp"
#include "caffe/internal_thread.hpp"
#include "caffe/layer.hpp"
#include "caffe/proto/caffe.pb.h"

caffe使用的几个数据库：

LMDB（Lightning Memory-Mapped Database）是一个和levelDB类似的key/value存储库，其常用于单标签数据，像分类等问题。

HDF5（Hierarchical Data Format）是一种为存储和处理大容量科学数据而设计的文件格式及相应的库文件，一般应用于多标签的任务上，比如给定一张猫的图片，它的标签不仅仅只包括它是猫这个标签，还包括它很胖，它很大这种标签，就使用hdf5比较合适。

鉴于目标检测的使用环境，使用最多的是lmdb格式，data_layer作为原始数据的输入层，处于整个网络的最底层，它从数据库lmdb中读取数据，当然也可以直接从内存中读取，还可以从hdf5，甚至是原始的图像读入数据。

caffe/filler.hpp的作用是在网络初始化时，根据layer的定义进行初始参数的填充，根据FillerParameter指定的类型进行对应的参数填充。

// A function to get a specific filler from the specification given in
// FillerParameter. Ideally this would be replaced by a factory pattern,
// but we will leave it this way for now.

template 
Filler* GetFiller(const FillerParameter& param) {
  const std::string& type = param.type();
  if (type == "constant") {
    return new ConstantFiller(param);
  } else if (type == "gaussian") {
    return new GaussianFiller(param);
  } else if (type == "positive_unitball") {
    return new PositiveUnitballFiller(param);
  } else if (type == "uniform") {
    return new UniformFiller(param);
  } else if (type == "xavier") {
    return new XavierFiller(param);
  } else {
    CHECK(false) << "Unknown filler name: " << param.type();
  }
  return (Filler*)(NULL);
}

internal_thread.hpp里面封装了pthread函数，继承的子类可以得到一个单独的线程，主要作用是在计算当前的一批数据时，在后台获取新一批的数据。

1.3 neuron_layers.hpp

输入data，就需进行计算，如常见的sigmoid、tanh，这些计算被抽象成neuron_layers.hpp里的类NeuronLayer，这个层只负责具体的计算，因此明确定义了输入ExactNumBottomBlobs()和ExactNumTopBlobs()都是常量1,即输入一个blob，输出一个blob。

1.4 vision_layers.hpp

主要完成的是图像的卷积操作，像conv、pooling、LRN都在里面，也可输出图像，要看具体实现代码。然后里面有个im2col的实现，按caffe作者的解释，主要是为了加速卷积的。

1.5 loss_layers.hpp

前面的data layer和common layer都是中间计算层，虽然会涉及到反向传播，但传播的源头来自于loss_layer，即网络的最终端。这一层因为要计算误差，所以输入都是2个blob，输出1个blob。

1.6 common_layers.hpp

NeruonLayer仅负责简单的一对一计算，而剩下的那些复杂的计算则通通放在了common_layers.hpp中。像ArgMaxLayer、ConcatLayer、FlattenLayer、SoftmaxLayer、SplitLayer和SliceLayer等各种对blob增减修改的操作。

2.Detail

深入layer的细节中去，对于一些常用的layer，如卷积层，池化层，还给出对应的proto代码。

2.1 数据层（data_layers）

数据通过数据层LMDB进入Caffe，数据层在整个网络的底部。

一些基本的操作，如：mean subtraction, scaling, random cropping, and mirroring均可直接在数据层上进行指定。

2.1.1 Database

类型：Data。

必须参数：

source: 包含数据的目录名称
batch_size: 一次处理的输入的数量

可选参数：

rand_skip: 在开始的时候从输入中跳过这个数值，在异步随机梯度下降（SGD）的时候非常有用
backend [default LEVELDB]: 选择使用 LEVELDB 或者 LMDB

2.1.2 In-Memory

类型: MemoryData

必需参数：

batch_size, channels, height, width: 指定从内存读取数据的大小

MemoryData层直接从内存中读取数据，而不是拷贝过来。因此，要使用它的话，你必须调用MemoryDataLayer::Reset (from C++)或者Net.set_input_arrays (from Python)以此指定一块连续的数据（通常是一个四维张量）。

2.1.3 Images

类型: ImageData

必要参数：

source: text文件的名字，每一行给出一张图片的文件名和label
batch_size: 一个batch中图片的数量

可选参数：

rand_skip：在开始的时候从输入中跳过这个数值，在异步随机梯度下降（SGD）的时候非常有用
shuffle [default false]
new_height, new_width: 把所有的图像resize到这个大小

2.1.4 Windows

类型：WindowData

2.1.5 Dummy

类型：DummyData

Dummy 层用于development 和debugging。具体参数DummyDataParameter。

2.2 激励层（neuron_layers）

激励层是element-wise的操作，输入和输出的大小相同，一般情况下是一个非线性函数。

2.2.1 ReLU / Rectified-Linear and Leaky-ReLU

类型: ReLU

例子：

layer {
  name: "relu1"
  type: "ReLU"
  bottom: "conv1"
  top: "conv1"
}

可选参数：

negative_slope [default 0]：指定输入值小于零时的输出。

ReLU是目前使用做多的激励函数，主要因为其收敛快，并且能保持同样效果。标准的ReLU函数为max(x, 0)，而一般为当x > 0时输出x，但x <= 0时输出negative_slope。RELU层支持in-place计算，这意味着bottom的输出和输入相同以避免内存的消耗。

2.2.2 Sigmoid

类型：Sigmoid

例子：

layer {
  name: "encode1neuron"
  bottom: "encode1"
  top: "encode1neuron"
  type: "Sigmoid"
}

Sigmoid层通过 sigmoid(x) 计算每一个输入x的输出，函数如下图。

2.2.3 TanH / Hyperbolic Tangent

类型: TanH

例子：

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: "TanH"
}

TanH层通过 tanh(x) 计算每一个输入x的输出，函数如下图。请注意sigmoid函数和TanH函数在纵轴上的区别。sigmoid函数将实数映射到(0,1)。TanH将实数映射到(-1,1)。

2.2.4 Absolute Value

类型: AbsVal

例子：

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: "AbsVal"
}

ABSVAL层通过 abs(x) 计算每一个输入x的输出。

2.2.5 Power

类型： Power

例子：

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: "Power"
  power_param {
    power: 1
    scale: 1
    shift: 0
  }
}

可选参数：

power [default 1]
scale [default 1]
shift [default 0]

POWER层通过 (shift + scale * x) ^ power计算每一个输入x的输出。

2.2.6 BNLL

类型: BNLL

例子：

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: BNLL
}

BNLL (binomial normal log likelihood) 层通过 log(1 + exp(x)) 计算每一个输入x的输出。

2.3视觉层（vision_layers）

2.3.1 卷积层(Conv)

类型：Convolution

例子：

layers { 
    name: "conv1" 
    type: CONVOLUTION 
    bottom: "data" 
    top: "conv1" 
    blobs_lr: 1               # learning rate multiplier for the filters 
    blobs_lr: 2               # learning rate multiplier for the biases 
    weight_decay: 1           # weight decay multiplier for the filters 
    weight_decay: 0           # weight decay multiplier for the biases 
    convolution_param { 
        num_output: 96        # learn 96 filters 
        kernel_size: 11       # each filter is 11x11 
        stride: 4             # step 4 pixels between each filter application 
        weight_filler { 
            type: "gaussian"  # initialize the filters from a Gaussian 
            std: 0.01         # distribution with stdev 0.01 (default mean: 0) } 
            bias_filler { 
                type: "constant" # initialize the biases to zero (0) 
                value: 0 
            } 
        }
    }
}

blobs_lr: 学习率调整的参数，在上面的例子中设置权重学习率和运行中求解器给出的学习率一样，同时是偏置学习率为权重的两倍。
weight_decay：

卷积层的重要参数

必须参数：

num_output (c_o)：过滤器的个数
kernel_size (or kernel_h and kernel_w)：过滤器的大小（也就是所谓“核”的大小）。

建议参数：

weight_filler [default type: ‘constant’ value: 0]：参数的初始化方法

可选参数：

bias_filler：偏置的初始化方法
bias_term [default true]：指定是否是否开启偏置项
pad (or pad_h and pad_w) [default 0]：指定在输入的每一边加上多少个像素
stride (or stride_h and stride_w) [default 1]：指定过滤器的步长
group (g) [default 1]: 如果g>1，那么将每个滤波器都限定只与某个输入的子集有关联。换句话说，将输入分为g组，同时将输出也分为g组。那么第i组输出只与第i组输入有关。

2.3.2 池化层（Pooling）

类型：Pooling

例子：

layers { 
    name: "pool1" 
    type: POOLING 
    bottom: "conv1" 
    top: "pool1" 
    pooling_param { 
        pool: MAX 
        kernel_size: 3 # pool over a 3x3 region 
        stride: 2 # step two pixels (in the bottom blob) between pooling regions 
    }
}

卷积层的重要参数

必需参数：

kernel_size (or kernel_h and kernel_w)：过滤器的大小

可选参数：

pool [default MAX]：pooling的方法，目前有MAX, AVE, 和STOCHASTIC三种方法
pad (or pad_h and pad_w) [default 0]：指定在输入的每一遍加上多少个像素
stride (or stride_h and stride_w) [default 1]：指定过滤器的步长

2.3.3 Local Response Normalization (LRN)

类型：LRN

可选参数：

local_size [default 5]：对于cross channel LRN为需要求和的邻近channel的数量；对于within channel LRN为需要求和的空间区域的边长；
alpha [default 1]：scaling参数；
beta [default 5]：指数；
norm_region [default ACROSS_CHANNELS]: 选择LRN实现的方法：1. ACROSS_CHANNELS ；2. WITHIN_CHANNEL

LRN（Local Response Normalization）对一个局部的输入区域进行的归一化。有两种不同的形式：1. ACCROSS_CHANNEL；2. WITHIN_CHANNEL。其实很好从字面上进行理解。第一种方法综合了不同的channel，而在一个channel里面只取1*1（所以size是localsize×1×1）。而在第二种方法中，不在channel方向上扩展，只在单一channel上进行空间扩展（所以size是1×localsize×localsize）。

计算公式：对每一个输入除以

，其中参数α是scaling参数，参数β是指数。而参数n对应local region的大小。

2.4 损失层（Loss Layers）

深度学习是通过最小化输出和目标的Loss来驱动学习。

2.4.1 Softmax

类型: SoftmaxWithLoss

Softmax Loss层应用于多标签分类。对于输入，计算了multinomial logistic loss。在概念上近似等于一个Softmax层加上一个multinomial logistic loss层，但在梯度的计算上更加稳定。

2.4.2 Sum-of-Squares / Euclidean

类型: EuclideanLoss

Euclidean loss层计算了两个输入差的平方和：

2.4.3 Hinge / Margin

类型: HingeLoss

例子：

L1 Normlayers { 
    name: "loss" 
    type: HINGE_LOSS 
    bottom: "pred" 
    bottom: "label"
} 
L2 Normlayers { 
    name: "loss" 
    type: HINGE_LOSS 
    bottom: "pred" 
    bottom: "label" 
    top: "loss" 
    hinge_loss_param { 
        norm: L2 
    }
}

可选参数：

norm [default L1]: 选择L1或者L2范数

2.4.4 Sigmoid Cross-Entropy

类型：SigmoidCrossEntropyLoss

2.4.5 Infogain

类型：InfoGainLoss

2.4.6 Accuracy and Top-k

类型：Accuracy

用来计算输出和目标的正确率，事实上这不是一个loss，而且没有backward这一步。

2.5. 一般层（Common Layers）

2.5.1 全连接层 Inner Product

类型：InnerProduct

例子：

layer {
  name: "fc8"
  type: "InnerProduct"
  # learning rate and decay multipliers for the weights
  param { lr_mult: 1 decay_mult: 1 }
  # learning rate and decay multipliers for the biases
  param { lr_mult: 2 decay_mult: 0 }
  inner_product_param {
    num_output: 1000
    weight_filler {
      type: "gaussian"
      std: 0.01
    }
    bias_filler {
      type: "constant"
      value: 0
    }
  }
  bottom: "fc7"
  top: "fc8"
}

必要参数：

num_output (c_o)：过滤器的个数

可选参数：

weight_filler [default type: ‘constant’ value: 0]：参数的初始化方法
bias_filler：偏置的初始化方法
bias_term [default true]：指定是否是否开启偏置项

2.5.2 Splitting

类型：Split

Splitting层可以把一个输入blob分离成多个输出blobs。这个用在当需要把一个blob输入到多个输出层的时候。

2.5.3 Flattening

类型：Flatten

Flatten层是把一个输入的大小为n * c * h * w变成一个简单的向量，其大小为 n * (c*h*w) * 1 * 1。

2.5.4 Reshape

类型：Reshape

例子：

layer {
    name: "reshape"
    type: "Reshape"
    bottom: "input"
    top: "output"
    reshape_param {
      shape {
        dim: 0  # copy the dimension from below
        dim: 2
        dim: 3
        dim: -1 # infer it from the other dimensions
      }
    }
  }

输入：单独的一个blob，可以是任意维；

输出：同样的blob，但是它的维度已经被我们人为地改变，维度的数据由reshap_param定义。

可选参数：

shape

Reshape层被用于改变输入的维度，而不改变输入的具体数据。就像Flatten层一样。只是维度被改变而已，这个过程不涉及数据的拷贝。

输出的维度由ReshapeParam proto控制。可以直接使用数字进行指定。设定输入的某一维到输出blob中去。此外，还有两个数字值得说一下：

0 直接从底层复制。例如，如果是底层是一个2在它的第一维，那么顶层在它的第一维也有一个2。
-1 从其他的数据里面推测这一维应该是多少。

2.5.5 Concatenation

类型：Concat

例子：

layer {
  name: "concat"
  bottom: "in1"
  bottom: "in2"
  top: "out"
  type: "Concat"
  concat_param {
    axis: 1
  }
}

可选参数：

axis [default 1]：0代表链接num，1代表链接channels

通过Concatenation层，可以把多个的blobs链接成一个blob。

2.5.6 Slicing

类型：Slice

例子：

layer {
  name: "slicer_label"
  type: "Slice"
  bottom: "label"
  ## Example of label with a shape N x 3 x 1 x 1
  top: "label1"
  top: "label2"
  top: "label3"
  slice_param {
    axis: 1
    slice_point: 1
    slice_point: 2
  }
}

Slice层可以将输入层变成多个输出层。这些输出层沿一个给定的维度存在。axis指定了目标的轴，slice_point则指定了选择维度的序号。

2.5.7 Elementwise Operations

类型：Eltwise

2.5.8 Argmax

类型：ArgMax

2.5.9 Softmax

类型：Softmax

2.5.10 Mean-Variance Normalization

类型：MVN

3.综述

每个layer的输入数据来自一些 'bottom' blobs, 输出一些 'top' blobs。Caffe中每种类型layer的参数说明定义在caffe.proto文件中，具体的layer参数值则定义在具体应用的protocals buffer网络结构说明文件中。例如，卷积层（ConvolutionLayer）的参数说明在caffe.proto中是如下定义的，

// in caffe.proto  
// Message that stores parameters used by ConvolutionLayer  
message ConvolutionParameter {  
  optional uint32 num_output = 1; // The number of outputs for the layer  
  optional bool bias_term = 2 [default = true]; // whether to have bias terms  
  // Pad, kernel size, and stride are all given as a single value for equal  
  // dimensions in height and width or as Y, X pairs.  
  optional uint32 pad = 3 [default = 0]; // The padding size (equal in Y, X)  
  optional uint32 pad_h = 9 [default = 0]; // The padding height  
  optional uint32 pad_w = 10 [default = 0]; // The padding width  
  optional uint32 kernel_size = 4; // The kernel size (square)  
  optional uint32 kernel_h = 11; // The kernel height  
  optional uint32 kernel_w = 12; // The kernel width  
  optional uint32 group = 5 [default = 1]; // The group size for group conv  
  optional uint32 stride = 6 [default = 1]; // The stride (equal in Y, X)  
  optional uint32 stride_h = 13; // The stride height  
  optional uint32 stride_w = 14; // The stride width  
  optional FillerParameter weight_filler = 7; // The filler for the weight  
  optional FillerParameter bias_filler = 8; // The filler for the bias  
  enum Engine {  
    DEFAULT = 0;  
    CAFFE = 1;  
    CUDNN = 2;  
  }  
  optional Engine engine = 15 [default = DEFAULT];  
}

其中的参数说明包括卷积核的个数、大小和步长等。在examples\mnist\lenet_train_test.prototxt网络结构说明文件中，具体一个卷积层（ConvolutionLayer）是这样定义的

# in examples\mnist\lenet_train_test.prototxt  
layer {  
  name: "conv1" // 层的名字  
  type: "Convolution" // 层的类型，说明具体执行哪一种计算  
  bottom: "data" // 层的输入数据Blob的名字  
  top: "conv1" // 层的输出数据Blob的名字  
  param { // 层的权值和偏置相关参数  
    lr_mult: 1  
  }  
  param {  
    lr_mult: 2  
  }  
  convolution_param { // 卷积层卷积运算相关的参数  
    num_output: 20  
    kernel_size: 5  
    stride: 1  
    weight_filler {  
      type: "xavier"  
    }  
    bias_filler {  
      type: "constant"  
    }  
  }  
}

每种类型的layer需要定义三种关键操作LayerSetUp, Forward, Backward：

LayerSetUp: 网络构建时初始化层和层的连接
Forward: 网络数据前向传递，给定bottom输入数据，计算输出到top
Backward：网络误差反向传递，给定top的梯度，计算bottom的梯度并存储到bottom blob

Layer的设计主要就是SetUp、Forward、Backward函数（层一开始的时候的设置、然后就是前传和反传）。

caffe的源码，layer.hpp：

#ifndef CAFFE_LAYER_H_    
#define CAFFE_LAYER_H_    
    
#include     
#include     
#include     
    
#include "caffe/blob.hpp"    
#include "caffe/common.hpp"    
#include "caffe/layer_factory.hpp"    
#include "caffe/proto/caffe.pb.h"    
#include "caffe/util/device_alternate.hpp"    
    
namespace caffe {    
    
/**  
 * @brief An interface for the units of computation which can be composed into a  
 *        Net.  
 *  
 * Layer%s must implement a Forward function, in which they take their input  
 * (bottom) Blob%s (if any) and compute their output Blob%s (if any).  
 * They may also implement a Backward function, in which they compute the error  
 * gradients with respect to their input Blob%s, given the error gradients with  
 * their output Blob%s.  
 */    
template     
class Layer {    
 public:    
/*  
首先获得当前网络的Phase，是train还是test，在初始化列表初始化LayerParameter,之后blobs_这里存放的是一个指向blob类的shared_ptr指针的一个vector，在这里是申请空间，然后将传入的layer_param中的blob拷贝过来。  
*/    
// 显示的构造函数不需要重写，任何初始工作在SetUp()中完成    
// 构造方法只复制层参数说明的值，如果层说明参数中提供了权值和偏置参数，也复制    
  explicit Layer(const LayerParameter& param)    
    : layer_param_(param) {    
      // Set phase and copy blobs (if there are any).    
// 训练还是测试？phase      
      phase_ = param.phase();    
      if (layer_param_.blobs_size() > 0) {    
// 将blobs_的大小设置为参数中的大小      
        blobs_.resize(layer_param_.blobs_size());    
        for (int i = 0; i < layer_param_.blobs_size(); ++i) {    
// 新建若干个Blob     
          blobs_[i].reset(new Blob());    
// 从blob文件中获取数据    
          blobs_[i]->FromProto(layer_param_.blobs(i));    
        }    
      }//用protobuf 传入的参数对blobs_ 做初始化，blobs_ 是一个vector 存放指向Blob类的智能指针。    
    
      #ifdef USE_MPI    
      //If this is a gather layer, all it subsequent layer doesn't need gradient sync.    
      //We will only change itself's property here,    
      //subsequent layers will be inferred in the Net    
    if (is_gathering()){    
        set_need_sync(false);    
      }else{    
        set_need_sync(true);    
      }    
      #endif    
    }    
  virtual ~Layer() {}    
////////////////初始化函数SetUp，每个Layer对象都必须遵循固定的调用模式,    
  /**  
   * @brief Implements common layer setup functionality.  
   * @brief 实现每个layer对象的setup函数  
   * @param bottom the preshaped input blobs  
   * @param bottom 层的输入数据，blob中的存储空间已申请  
   * @param top  
   *     the allocated but unshaped output blobs, to be shaped by Reshape  
   * @param top 层的输出数据，blob对象以构造但是其中的存储空间未申请，  
   *     具体空间大小需根据bottom blob大小和layer_param_共同决定，具体在Reshape函数现实  
   *  
   * Checks that the number of bottom and top blobs is correct.  
   * Calls LayerSetUp to do special layer setup for individual layer types,  
   * followed by Reshape to set up sizes of top blobs and internal buffers.  
   * Sets up the loss weight multiplier blobs for any non-zero loss weights.  
   * This method may not be overridden.  
   * 1. 检查输入输出blob个数是否满足要求，每个层能处理的输入输出数据不一样  
   * 2. 调用LayerSetUp函数初始化特殊的层，每个Layer子类需重写这个函数完成定制的初始化  
   * 3. 调用Reshape函数为top blob分配合适大小的存储空间  
   * 4. 为每个top blob设置损失权重乘子，非LossLayer为的top blob其值为零  
   *  
   * 此方法非虚函数，不用重写，模式固定  
   */    
  void SetUp(const vector*>& bottom,    
      const vector*>& top) {    
    CheckBlobCounts(bottom, top);    
    LayerSetUp(bottom, top);    
    Reshape(bottom, top);    
    SetLossWeights(top);    
  }    
/////////////////每个子类Layer必须重写的初始化函数LayerSetUp，    
  /**  
   * @brief Does layer-specific setup: your layer should implement this function  
   *        as well as Reshape.  
   * @brief 定制初始化，每个子类layer必须实现此虚函数  
   *  
   * @param bottom  
   *     the preshaped input blobs, whose data fields store the input data for  
   *     this layer  
   * @param bottom  
   *     输入blob, 数据成员data_和diff_存储了相关数据  
   * @param top  
   *     the allocated but unshaped output blobs  
   * @param top  
   *     输出blob, blob对象已构造但数据成员的空间尚未申请  
   *  
   * This method should do one-time layer specific setup. This includes reading  
   * and processing relevent parameters from the layer_param_.  
   * Setting up the shapes of top blobs and internal buffers should be done in  
   * Reshape, which will be called before the forward pass to  
   * adjust the top blob sizes.  
   * 此方法执行一次定制化的层初始化，包括从layer_param_读入并处理相关的层权值和偏置参数，  
   * 调用Reshape函数申请top blob的存储空间  
   */    
  virtual void LayerSetUp(const vector*>& bottom,    
      const vector*>& top) {}    
/////////////////////每个子类Layer必须重写的Reshape函数，完成top blob形状的设置并为其分配存储空间，    
   /**  
   * @brief Adjust the shapes of top blobs and internal buffers to accomodate  
   *        the shapes of the bottom blobs.  
   * @brief 根据bottom blob的形状和layer_param_计算top blob的形状并为其分配存储空间  
   *  
   * @param bottom the input blobs, with the requested input shapes  
   * @param top the top blobs, which should be reshaped as needed  
   *  
   * This method should reshape top blobs as needed according to the shapes  
   * of the bottom (input) blobs, as well as reshaping any internal buffers  
   * and making any other necessary adjustments so that the layer can  
   * accomodate the bottom blobs.  
   */    
  virtual void Reshape(const vector*>& bottom,    
      const vector*>& top) = 0;    
    
  /**  
   * @brief Given the bottom blobs, compute the top blobs and the loss.  
   *  
   * @param bottom  
   *     the input blobs, whose data fields store the input data for this layer  
   * @param top  
   *     the preshaped output blobs, whose data fields will store this layers'  
   *     outputs  
   * \return The total loss from the layer.  
   *  
   * The Forward wrapper calls the relevant device wrapper function  
   * (Forward_cpu or Forward_gpu) to compute the top blob values given the  
   * bottom blobs.  If the layer has any non-zero loss_weights, the wrapper  
   * then computes and returns the loss.  
   *  
   * Your layer should implement Forward_cpu and (optionally) Forward_gpu.  
   */    
//////////////前向传播函数Forward和反向传播函数Backward    
/*  
首先是Forward.这其实是一个装饰器，继承之后在调用的调用其相应的forward_cpu或者forward_gpu，根据输入的input data blob计算相应的output data blob，同时会反应这一层layer的total loss.  
*/    
  inline Dtype Forward(const vector*>& bottom,    
      const vector*>& top);    
    
  /**  
   * @brief Given the top blob error gradients, compute the bottom blob error  
   *        gradients.  
   *  
   * @param top  
   *     the output blobs, whose diff fields store the gradient of the error  
   *     with respect to themselves  
   * @param propagate_down  
   *     a vector with equal length to bottom, with each index indicating  
   *     whether to propagate the error gradients down to the bottom blob at  
   *     the corresponding index  
   * @param bottom  
   *     the input blobs, whose diff fields will store the gradient of the error  
   *     with respect to themselves after Backward is run  
   *  
   * The Backward wrapper calls the relevant device wrapper function  
   * (Backward_cpu or Backward_gpu) to compute the bottom blob diffs given the  
   * top blob diffs.  
   *  
   * Your layer should implement Forward_cpu and (optionally) Forward_gpu.  
   */    
/*  
BackWard，实现的是反向传播，也就是给定top blob额error gradient 计算得到bottom的error gradient。其输入时 output blobs ，在Ouput blobs里面的diff存储的就是其相应的error gradients。其中propagate_down这个参数跟Bottom的长度是一样的，每一个Index用来指定是否需要反向传播error gradients 到对应的bottom blob。而bottom 这里面的diff 区域存放的就是BackWard计算出来相应的gradient error.  
*/    
  inline void Backward(const vector*>& top,    
      const vector& propagate_down,    
      const vector*>& bottom);    
    
  /**  
   * @brief Returns the vector of learnable parameter blobs.  
   */    
  vector > >& blobs() {    
    return blobs_;//返回vector  blobs_    
  }    
    
  /**  
   * @brief Returns the layer parameter.  
   */    
//返回layer parameter    
  const LayerParameter& layer_param() const { return layer_param_; }    
    
  /**  
   * @brief Writes the layer parameter to a protocol buffer  
   */    
//将layer plarameter 写入protobuf    
  virtual void ToProto(LayerParameter* param, bool write_diff = false);    
    
//返回 ,设置一个blob top 在给定 index 的 loss    
  /**  
   * @brief Returns the scalar loss associated with a top blob at a given index.  
   */    
  inline Dtype loss(const int top_index) const {    
    return (loss_.size() > top_index) ? loss_[top_index] : Dtype(0);    
  }    
    
  /**  
   * @brief Sets the loss associated with a top blob at a given index.  
   */    
  inline void set_loss(const int top_index, const Dtype value) {    
    if (loss_.size() <= top_index) {    
      loss_.resize(top_index + 1, Dtype(0));    
    }    
    loss_[top_index] = value;    
  }    
//一些返回特定参数的函数：    
  /**  
   * 获得bottom或者top blob的数量状态，比较简单，看名字即可  
   */    
    // 虚函数，而且还是内联的，返回层类型      
  virtual inline const char* type() const { return ""; }      
      
   // 虚函数，获得bottom blob的精确个数      
  virtual inline int ExactNumBottomBlobs() const { return -1; }      
      
   // 虚函数，获得bottom blob的最小个数      
  virtual inline int MinBottomBlobs() const { return -1; }      
      
   // 虚函数，获得bottom blob的最大个数      
  virtual inline int MaxBottomBlobs() const { return -1; }      
      
   // 虚函数，获得top blob的精确个数      
  virtual inline int ExactNumTopBlobs() const { return -1; }      
      
   // 虚函数，获得top blob的最小个数      
  virtual inline int MinTopBlobs() const { return -1; }      
      
   // 虚函数，获得top blob的最大个数      
  virtual inline int MaxTopBlobs() const { return -1; }      
      
   // 虚函数，bottom blob和top blob的个数是否一致      
  virtual inline bool EqualNumBottomTopBlobs() const { return false; }      
      
   // 返回当前层是否自动创建匿名top blobs      
   // 如果返回true，表明网络初始化的时候创建了了足够多的匿名top blobs      
   // 来满足ExactNumTopBlobs或者MinTopBlobs所要求的top blobs的个数      
  virtual inline bool AutoTopBlobs() const { return false; }      
/*  
AllowforceBackward用来设置是否强制梯度返回，因为有些层其实不需要梯度信息 ，后面两个函数分别查看以及设置是是否需要计算梯度。  
*/      
    
   // 对于一个给定的bottom blob，返回是否允许强制反传      
  virtual inline bool AllowForceBackward(const int bottom_index) const {      
    return true;      
  }      
    
//set_param_propagate_down，param_propagate_down 函数：设置对于那些bottom 需要反向传播。    
  /**  
   * @brief Specifies whether the layer should compute gradients w.r.t. a  
   *        parameter at a particular index given by param_id.  
   *  
   * You can safely ignore false values and always compute gradients  
   * for all parameters, but possibly with wasteful computation.  
   */    
  inline bool param_propagate_down(const int param_id) {    
    return (param_propagate_down_.size() > param_id) ?    
        param_propagate_down_[param_id] : false;    
  }    
  /**  
   * @brief Sets whether the layer should compute gradients w.r.t. a  
   *        parameter at a particular index given by param_id.  
   */    
  inline void set_param_propagate_down(const int param_id, const bool value) {    
    if (param_propagate_down_.size() <= param_id) {    
      param_propagate_down_.resize(param_id + 1, true);    
    }    
    param_propagate_down_[param_id] = value;    
  }    
    
  #ifdef USE_MPI    
  /**  
   * @brief Checks whether the layer accepts specifed parallel type  
   *  
   * If not supported, will halt the program with hints  
   */    
  inline virtual bool is_gathering() {return false;}    
  inline virtual bool is_scattering() {return false;}    
  inline bool need_sync(){return need_sync_;}    
  inline void set_need_sync(bool val){need_sync_ = val;}    
  #endif    
    
    
protected:    
  /** The protobuf that stores the layer parameters */    
  // 层说明参数，从protocal buffers格式的网络结构说明文件中读取    
  LayerParameter layer_param_;    
  /** The phase: TRAIN or TEST */    
  // 层状态，参与网络的训练还是测试    
  Phase phase_;    
  /** The vector that stores the learnable parameters as a set of blobs. */    
  // 层权值和偏置参数，使用向量是因为权值参数和偏置是分开保存在两个blob中的    
  vector > > blobs_;    
  /** Vector indicating whether to compute the diff of each param blob. */    
  // 标志每个top blob是否需要计算反向传递的梯度值    
  vector param_propagate_down_;    
    
  /** The vector that indicates whether each top blob has a non-zero weight in  
   *  the objective function. */    
  // 非LossLayer为零，LossLayer中表示每个top blob计算的loss的权重    
  vector loss_;    
    
  #ifdef USE_MPI    
  /**  
   * For parallel use  
   */    
  bool need_sync_;    
  #endif    
/////////////////////////////这两个函数非虚函数，它们内部会调用如下虚函数完成数据前向传递和    
/////////////////////////////误差反向传播，根据执行环境的不同每个子类Layer必须重写CPU和GPU版本，    
  /** @brief Using the CPU device, compute the layer output. */    
  virtual void Forward_cpu(const vector*>& bottom,    
      const vector*>& top) = 0;    
  /**  
   * @brief Using the GPU device, compute the layer output.  
   *        Fall back to Forward_cpu() if unavailable.  
   */    
  virtual void Forward_gpu(const vector*>& bottom,    
      const vector*>& top) {    
    // LOG(WARNING) << "Using CPU code as backup.";    
    return Forward_cpu(bottom, top);    
  }    
    
  /**  
   * @brief Using the CPU device, compute the gradients for any parameters and  
   *        for the bottom blobs if propagate_down is true.  
   */    
  virtual void Backward_cpu(const vector*>& top,    
      const vector& propagate_down,    
      const vector*>& bottom) = 0;    
  /**  
   * @brief Using the GPU device, compute the gradients for any parameters and  
   *        for the bottom blobs if propagate_down is true.  
   *        Fall back to Backward_cpu() if unavailable.  
   */    
  virtual void Backward_gpu(const vector*>& top,    
      const vector& propagate_down,    
      const vector*>& bottom) {    
    // LOG(WARNING) << "Using CPU code as backup.";    
    Backward_cpu(top, propagate_down, bottom);    
  }    
    
  /**  
   * Called by the parent Layer's SetUp to check that the number of bottom  
   * and top Blobs provided as input match the expected numbers specified by  
   * the {ExactNum,Min,Max}{Bottom,Top}Blobs() functions.  
   */    
  virtual void CheckBlobCounts(const vector*>& bottom,    
                               const vector*>& top) {    
    if (ExactNumBottomBlobs() >= 0) {    
      CHECK_EQ(ExactNumBottomBlobs(), bottom.size())    
          << type() << " Layer takes " << ExactNumBottomBlobs()    
          << " bottom blob(s) as input.";    
    }// 保证输入bottom 数量和要求的相同    
    if (MinBottomBlobs() >= 0) {    
      CHECK_LE(MinBottomBlobs(), bottom.size())    
          << type() << " Layer takes at least " << MinBottomBlobs()    
          << " bottom blob(s) as input.";    
    }//保证输入的bottom数量大于或等于要求的最小数量    
    if (MaxBottomBlobs() >= 0) {    
      CHECK_GE(MaxBottomBlobs(), bottom.size())    
          << type() << " Layer takes at most " << MaxBottomBlobs()    
          << " bottom blob(s) as input.";    
    }//保证输入的bottom数量小于或等于要求的最大数量    
    if (ExactNumTopBlobs() >= 0) {    
      CHECK_EQ(ExactNumTopBlobs(), top.size())    
          << type() << " Layer produces " << ExactNumTopBlobs()    
          << " top blob(s) as output.";    
    }// 保证输入top数量和要求的相同    
    if (MinTopBlobs() >= 0) {    
      CHECK_LE(MinTopBlobs(), top.size())    
          << type() << " Layer produces at least " << MinTopBlobs()    
          << " top blob(s) as output.";    
    }//保证输入的top数量大于或等于要求的最小数量    
    if (MaxTopBlobs() >= 0) {    
      CHECK_GE(MaxTopBlobs(), top.size())    
          << type() << " Layer produces at most " << MaxTopBlobs()    
          << " top blob(s) as output.";    
    }//保证输入的top数量小于或等于要求的最大数量    
    if (EqualNumBottomTopBlobs()) {    
      CHECK_EQ(bottom.size(), top.size())    
          << type() << " Layer produces one top blob as output for each "    
          << "bottom blob input.";    
    }//保证输入的bottom数量和输出的top数量相同    
  }    
    
  /**  
   * Called by SetUp to initialize the weights associated with any top blobs in  
   * the loss function. Store non-zero loss weights in the diff blob.  
   */    
/*  
SetLoss是非常重要的一个步骤，是被SetUp调用来初始化top bottom的weights，并且存储非零的loss weights 在diff blob里面  
*/    
  inline void SetLossWeights(const vector*>& top) {    
    const int num_loss_weights = layer_param_.loss_weight_size();    
    if (num_loss_weights) {    
      CHECK_EQ(top.size(), num_loss_weights) << "loss_weight must be "    
          "unspecified or specified once per top blob.";    
      for (int top_id = 0; top_id < top.size(); ++top_id) {    
        const Dtype loss_weight = layer_param_.loss_weight(top_id);    
        if (loss_weight == Dtype(0)) { continue; }//如果为0不对loss进行操作    
        this->set_loss(top_id, loss_weight);    
        const int count = top[top_id]->count();    
        Dtype* loss_multiplier = top[top_id]->mutable_cpu_diff();    
        caffe_set(count, loss_weight, loss_multiplier);//将loss_multiplier设为loss_weight    
      }     
    }    
  }    
    
  DISABLE_COPY_AND_ASSIGN(Layer);    
};  // class Layer    
    
/*  
前传调用对应的Forward_cpu或者Forward_gpu而我们知道Forward_cpu是纯虚函数，必须要实而Forward_gpu是虚函数，如果不实现就调用 Forward_cpu函数了。前传（你必须实现自己的Forward_cpu，实现Forward_gpu是可选的）  
*/    
// Forward and backward wrappers. You should implement the cpu and    
// gpu specific implementations instead, and should not change these    
// functions.    
template     
inline Dtype Layer::Forward(const vector*>& bottom,    
    const vector*>& top) {    
  Dtype loss = 0;      
  // 根据bottom设置top的形状      
  Reshape(bottom, top);      
  // 设置运行模式CPU or GPU      
  switch (Caffe::mode()) {      
  case Caffe::CPU:      
    // 调用CPU的前传      
    Forward_cpu(bottom, top);      
    // 前传计算完之后计算损失（只有最后一层才进行计算，其余层都不用）      
    for (int top_id = 0; top_id < top.size(); ++top_id) {      
      if (!this->loss(top_id)) { continue; }      
      const int count = top[top_id]->count();      
      // 获取前传的数据      
      const Dtype* data = top[top_id]->cpu_data();      
      // 获取梯度（\frac{\partial Loss}{\partial net}）      
      const Dtype* loss_weights = top[top_id]->cpu_diff();      
      // data与loss_weight的点积，即得损失函数关于当前层权重的偏导了      
    // \frac{\partial Loss}{\partial net} * \frac{\partial net}{\frac{W}}      
    // = \frac{\partial Loss}{\partial W}      
      loss += caffe_cpu_dot(count, data, loss_weights);      
    }      
    break;      
  case Caffe::GPU:      
    // GPU前传      
    Forward_gpu(bottom, top);      
#ifndef CPU_ONLY      
    // 同上，只不过这里用GPU来计算点积了      
    for (int top_id = 0; top_id < top.size(); ++top_id) {      
      if (!this->loss(top_id)) { continue; }      
      const int count = top[top_id]->count();      
      // 获取GPU上的数据      
      const Dtype* data = top[top_id]->gpu_data();      
      const Dtype* loss_weights = top[top_id]->gpu_diff();      
      Dtype blob_loss = 0;      
      caffe_gpu_dot(count, data, loss_weights, &blob_loss);      
      loss += blob_loss;      
    }      
#endif      
    break;    
  default:    
    LOG(FATAL) << "Unknown caffe mode.";    
  }    
  return loss;    
}    
    
template     
inline void Layer::Backward(const vector*>& top,    
    const vector& propagate_down,    
    const vector*>& bottom) {    
  switch (Caffe::mode()) {    
  case Caffe::CPU:    
    Backward_cpu(top, propagate_down, bottom);    
//根据blob top 的error 梯度（diff）计算bottom 的 error 梯度。 propagate_down 是长度     
//和bottom 相同的vector ，用于控制是否需要对对应的bottom 元素传播梯度。具体layer具体定义。    
    break;    
  case Caffe::GPU:    
    Backward_gpu(top, propagate_down, bottom);    
    break;    
  default:    
    LOG(FATAL) << "Unknown caffe mode.";    
  }    
}    
////////////////Layer的序列化函数,将layer的层说明参数layer_param_，层权值和偏置    
////////////////参数blobs_复制到LayerParameter对象，便于写到磁盘，    
// Serialize LayerParameter to protocol buffer    
template     
void Layer::ToProto(LayerParameter* param, bool write_diff) {    
  param->Clear();    
  param->CopyFrom(layer_param_); // 复制层说明参数layer_param_    
  param->clear_blobs();    
  // 复制层权值和偏置参数blobs_    
  for (int i = 0; i < blobs_.size(); ++i) {    
    blobs_[i]->ToProto(param->add_blobs(), write_diff);    
  }    
}    
    
}  // namespace caffe    
    
#endif  // CAFFE_LAYER_H_

device_alternate.hpp：

#ifndef CAFFE_UTIL_DEVICE_ALTERNATE_H_  
#define CAFFE_UTIL_DEVICE_ALTERNATE_H_  
  
#ifdef CPU_ONLY  // CPU-only Caffe.  
  
#include   
  
// Stub out GPU calls as unavailable.  
//打印出GPU不可以使用  
#define NO_GPU LOG(FATAL) << "Cannot use GPU in CPU-only Caffe: check mode."  
// 定义给定类的前向和反向（GPU和CPU）传播的函数定义  
#define STUB_GPU(classname) \  
template  \  
void classname::Forward_gpu(const vector*>& bottom, \  
    const vector*>& top) { NO_GPU; } \  
template  \  
void classname::Backward_gpu(const vector*>& top, \  
    const vector& propagate_down, \  
    const vector*>& bottom) { NO_GPU; } \  
  
#define STUB_GPU_FORWARD(classname, funcname) \  
template  \  
void classname::funcname##_##gpu(const vector*>& bottom, \  
    const vector*>& top) { NO_GPU; } \  
  
#define STUB_GPU_BACKWARD(classname, funcname) \  
template  \  
void classname::funcname##_##gpu(const vector*>& top, \  
    const vector& propagate_down, \  
    const vector*>& bottom) { NO_GPU; } \  
  
#else  // Normal GPU + CPU Caffe.  
  
#include   
#include   
#include   
#include   
#include   // cuda driver types  
#ifdef USE_CUDNN  // cuDNN acceleration library.  
#include "caffe/util/cudnn.hpp"  
#endif  
  
//  
// CUDA macros  
//  
  
// CUDA: various checks for different function calls.  
#define CUDA_CHECK(condition) \  
  /* Code block avoids redefinition of cudaError_t error */ \  
  do { \  
    cudaError_t error = condition; \  
    CHECK_EQ(error, cudaSuccess) << " " << cudaGetErrorString(error); \  
  } while (0)  
  
#define CUBLAS_CHECK(condition) \  
  do { \  
    cublasStatus_t status = condition; \  
    CHECK_EQ(status, CUBLAS_STATUS_SUCCESS) << " " \  
      << caffe::cublasGetErrorString(status); \  
  } while (0)  
  
#define CURAND_CHECK(condition) \  
  do { \  
    curandStatus_t status = condition; \  
    CHECK_EQ(status, CURAND_STATUS_SUCCESS) << " " \  
      << caffe::curandGetErrorString(status); \  
  } while (0)  
//caffe采取的线程格和线程块的维数设计  
// blockDim.x* gridDim.x表示的是该线程格所有线程的数量  
//n表示核函数总共要处理的元素个数  
#define CUDA_KERNEL_LOOP(i, n) \  
  for (int i = blockIdx.x * blockDim.x + threadIdx.x; \  
       i < (n); \  
       i += blockDim.x * gridDim.x)  
  
// CUDA: check for error after kernel execution and exit loudly if there is one.  
#define CUDA_POST_KERNEL_CHECK CUDA_CHECK(cudaPeekAtLastError())  
  
namespace caffe {  
  
  
//CUDA的lib错误报告  
const char* cublasGetErrorString(cublasStatus_t error);  
const char* curandGetErrorString(curandStatus_t error);  
  
// CUDA: thread number configuration.  
// Use 1024 threads per block, which requires cuda sm_2x or above,  
// or fall back to attempt compatibility (best of luck to you).  
#if __CUDA_ARCH__ >= 200  
    const int CAFFE_CUDA_NUM_THREADS = 1024;  
#else  
    const int CAFFE_CUDA_NUM_THREADS = 512;  
#endif  
  
  
//CUDA线程的块的数量  
inline int CAFFE_GET_BLOCKS(const int N) {  
  return (N + CAFFE_CUDA_NUM_THREADS - 1) / CAFFE_CUDA_NUM_THREADS;  
}  
  
}  // namespace caffe  
  
#endif  // CPU_ONLY  
  
#endif  // CAFFE_UTIL_DEVICE_ALTERNATE_H_

4. 总结

结合官方文档，再加画图和看代码，终于对整个layer层有了个基本认识：data负责输入，vision负责卷积相关的计算，neuron和common负责中间部分的数据计算，而loss是最后一部分，负责计算反向传播的误差。具体的实现都在src/caffe/layers里面，慢慢再研究研究。

在这些抽象的基类头文件里，看起来挺累，好在各种搜索，也能学到一些技巧，如，巧用宏定义来简写C，C++代码，使用模板方法，将有大量重复接口和参数的类抽象为一个宏定义，达到简化代码的目的。

你可能感兴趣的:(caffe中的layer)

关于沟通这件事，项目经理不需要每次都面对面进行流程大师兄
很多项目经理都会遇到这样的问题，项目中由于事情太多，根本没有足够的时间去召开会议，那在这种情况下如何去有效地管理项目中的利益相关者？当然，不建议电子邮件也不需要开会的话，建议可以采取下面几种方式来形成有效的沟通，这几种方式可以帮助你努力的通过各种办法来保持和各方面的联系。项目经理首先要问自己几个问题，项目中哪些利益相关者是必须要进行沟通的？可以列出项目中所有的利益相关者清单，同时也整理出项目中哪些
机器学习与深度学习间关系与区别 ℒℴѵℯ心·动ꦿ໊ོ꫞ 人工智能学习深度学习 python
一、机器学习概述定义机器学习（MachineLearning,ML）是一种通过数据驱动的方法，利用统计学和计算算法来训练模型，使计算机能够从数据中学习并自动进行预测或决策。机器学习通过分析大量数据样本，识别其中的模式和规律，从而对新的数据进行判断。其核心在于通过训练过程，让模型不断优化和提升其预测准确性。主要类型1.监督学习（SupervisedLearning）监督学习是指在训练数据集中包含输入
element实现动态路由+面包屑软件技术NINI vue案例 vue.js 前端
el-breadcrumb是ElementUI组件库中的一个面包屑导航组件，它用于显示当前页面的路径，帮助用户快速理解和导航到应用的各个部分。在Vue.js项目中，如果你已经安装了ElementUI，就可以很方便地使用el-breadcrumb组件。以下是一个基本的使用示例：安装ElementUI（如果你还没有安装的话）:你可以通过npm或yarn来安装ElementUI。bash复制代码npmi
10月|愿你的青春不负梦想-读书笔记-01 Tracy的小书斋
本书的作者是俞敏洪，大家都很熟悉他了吧。俞敏洪老师是我行业的领头羊吧，也是我事业上的偶像。本日摘录他书中第一章中的金句：『一个人如果什么目标都没有，就会浑浑噩噩，感觉生命中缺少能量。能给我们能量的，是对未来的期待。第一件事，我始终为了进步而努力。与其追寻全世界的骏马，不如种植丰美的草原，到时骏马自然会来。第二件事，我始终有阶段性的目标。什么东西能给我能量？答案是对未来的期待。』读到这里的时候，我便
C语言如何定义宏函数？小九格物 c语言
在C语言中，宏函数是通过预处理器定义的，它在编译之前替换代码中的宏调用。宏函数可以模拟函数的行为，但它们不是真正的函数，因为它们在编译时不会进行类型检查，也不会分配存储空间。宏函数的定义通常使用#define指令，后面跟着宏的名称和参数列表，以及宏展开后的代码。宏函数的定义方式：1.基本宏函数：这是最简单的宏函数形式，它直接定义一个表达式。#defineSQUARE(x)((x)*(x))2.带参
c++ 的iostream 和 c++的stdio的区别和联系黄卷青灯77 c++算法开发语言 iostream stdio
在C++中，iostream和C语言的stdio.h都是用于处理输入输出的库，但它们在设计、用法和功能上有许多不同。以下是两者的区别和联系：区别1.编程风格iostream（C++风格）：C++标准库中的输入输出流类库，支持面向对象的输入输出操作。典型用法是cin（输入）和cout（输出），使用>操作符来处理数据。更加类型安全，支持用户自定义类型的输入输出。#includeintmain(){in
Long类型前后端数据不一致 igotyback 前端
响应给前端的数据浏览器控制台中response中看到的Long类型的数据是正常的到前端数据不一致前后端数据类型不匹配是一个常见问题，尤其是当后端使用Java的Long类型（64位）与前端JavaScript的Number类型（最大安全整数为2^53-1，即16位）进行数据交互时，很容易出现精度丢失的问题。这是因为JavaScript中的Number类型无法安全地表示超过16位的整数。为了解决这个问
mysql禁用远程登录 igotyback mysql
去mysql库中的user表里，将host都改成localhost之后刷新权限FLUSHPRIVILEGES;
店群合一模式下的社区团购新发展——结合链动 2+1 模式、AI 智能名片与 S2B2C 商城小程序源码说私域人工智能小程序
摘要：本文探讨了店群合一的社区团购平台在当今商业环境中的重要性和优势。通过分析店群合一模式如何将互联网社群与线下终端紧密结合，阐述了链动2+1模式、AI智能名片和S2B2C商城小程序源码在这一模式中的应用价值。这些创新元素的结合为社区团购带来了新的机遇，提升了用户信任感、拓展了营销渠道，并实现了线上线下的完美融合。一、引言随着互联网技术的不断发展，社区团购作为一种新兴的商业模式，在满足消费者日常需
向内而求陈陈_19b4
10月27日，阴。阅读书目:《次第花开》。作者:希阿荣博堪布，是当今藏传佛家宁玛派最伟大的上师法王，如意宝晋美彭措仁波切颇具影响力的弟子之一。多年以来，赴海内外各地弘扬佛法，以正式授课、现场开示、发表文章等多种方法指导佛学弟子修行佛法。代表作《寂静之道》、《生命这出戏》、《透过佛法看世界》自出版以来一直是佛教类书籍中的畅销书。图片发自App金句:1.佛陀说，一切痛苦的根源在于我们长期以来对自身及外
消息中间件有哪些常见类型 xmh-sxh-1314 java
消息中间件根据其设计理念和用途，可以大致分为以下几种常见类型：点对点消息队列（Point-to-PointMessagingQueues）：在这种模型中，消息被发送到特定的队列中，消费者从队列中取出并处理消息。队列中的消息只能被一个消费者消费，消费后即被删除。常见的实现包括IBM的MQSeries、RabbitMQ的部分使用场景等。适用于任务分发、负载均衡等场景。发布/订阅消息模型（Pub/Sub
水平垂直居中的几种方法（总结） LJ小番茄 CSS_玄学语言 html javascript 前端 css css3
1.使用flexbox的justify-content和align-items.parent{display:flex;justify-content:center;/*水平居中*/align-items:center;/*垂直居中*/height:100vh;/*需要指定高度*/}2.使用grid的place-items:center.parent{display:grid;place-item
每日一题——第八十二题互联网打工人no1 C语言程序设计每日一练 c语言
题目：将一个控制台输入的字符串中的所有元音字母复制到另一字符串中#include#include#include#include#defineMAX_INPUT1024boolisVowel(charp);intmain(){charinput[MAX_INPUT];charoutput[MAX_INPUT];printf("请输入一串字符串：\n");fgets(input,sizeof(inp
WPF中的ComboBox控件几种数据绑定的方式互联网打工人no1 wpf c#
一、用字典给ItemsSource赋值（此绑定用的地方很多，建议熟练掌握）在XMAL中：在CS文件中privatevoidBindData(){DictionarydicItem=newDictionary();dicItem.add(1,"北京");dicItem.add(2,"上海");dicItem.add(3,"广州");cmb_list.ItemsSource=dicItem;cmb_l
直抒《紫罗兰永恒花园外传》雷姆的黑色童话
没看过《紫罗兰永恒花园》的我莫名的看完了《紫罗兰永恒花园外传》，又莫名的被故事中的姐妹之情狠狠地感动了的一把。感动何在：困苦中相依为命的姐妹二人被迫分离，用一个人的自由换取另一个人的幸福。之后，虽相隔不知几许依旧心心念念彼此牵挂。这种深深的姐妹情谊就是令我为之动容的所在。贝拉和泰勒分别影片开始，海天之间一个孩童凭栏眺望，手中拿着折旧的信纸。镜头一转，挑灯伏案的薇尔莉特正在打字机前奋笔疾书。这些片段
直返最高等级与直返APP：无需邀请码的返利新体验古楼
随着互联网的普及和电商的兴起，直返模式逐渐成为一种流行的商业模式。在这种模式下，消费者通过购买产品或服务，获得一定的返利，并可以分享给更多的人。其中，直返最高等级和直返APP是直返模式中的重要概念和工具。本文将详细介绍直返最高等级的概念、直返APP的使用以及与邀请码的关系。【高省】APP（高佣金领导者）是一个自用省钱佣金高，分享推广赚钱多的平台，百度有几百万篇报道，运行三年，稳定可靠。高省APP，
读《人世间》有感一0一
这个寒假，就如同朋友圈中的一段话：一闭眼，一睁眼假期还有5天，在一闭眼一睁眼假期还有12天；再一闭眼一睁眼假期还有20天；不敢睡，不敢睡啊……受疫情影响，这个假期变得漫长又煎熬，我也无时无刻不关注着疫情的变化。当然这样的一个假期，我还真得要感谢周翔，因为他有个爱看书的习惯，所以家里有不少他看过的书，可以让我随意挑选，因此也让我的假期不至于那么无所事事。这次我选了一本梁晓声的《人世间》，作为一名语文
SQL Server_查询某一数据库中的所有表的内容 qq_42772833 SQL Server 数据库 sqlserver
1.查看所有表的表名要列出CrabFarmDB数据库中的所有表（名），可以使用以下SQL语句：USECrabFarmDB;--切换到目标数据库GOSELECTTABLE_NAMEFROMINFORMATION_SCHEMA.TABLESWHERETABLE_TYPE='BASETABLE';对这段SQL脚本的解释：SELECTTABLE_NAME：这个语句的作用是从查询结果中选择TABLE_NAM
四章-32-点要素的聚合彩云飘过
本文基于腾讯课堂老胡的课《跟我学Openlayers--基础实例详解》做的学习笔记，使用的openlayers5.3.xapi。源码见1032.html，对应的官网示例https://openlayers.org/en/latest/examples/cluster.htmlhttps://openlayers.org/en/latest/examples/earthquake-clusters.
【加密社】Solidity 中的事件机制及其应用加密社闲侃区块链智能合约区块链
加密社引言在Solidity合约开发过程中，事件（Events）是一种非常重要的机制。它们不仅能够让开发者记录智能合约的重要状态变更，还能够让外部系统（如前端应用）监听这些状态的变化。本文将详细介绍Solidity中的事件机制以及如何利用不同的手段来触发、监听和获取这些事件。事件存储的地方当我们在Solidity合约中使用emit关键字触发事件时，该事件会被记录在区块链的交易收据中。具体而言，事件
使用LLaVa和Ollama实现多模态RAG示例 llzwxh888 python 人工智能开发语言
本文将详细介绍如何使用LLaVa和Ollama实现多模态RAG（检索增强生成），通过提取图像中的结构化数据、生成图像字幕等功能来展示这一技术的强大之处。安装环境首先，您需要安装以下依赖包：!pipinstallllama-index-multi-modal-llms-ollama!pipinstallllama-index-readers-file!pipinstallunstructured!p
如何部分格式化提示模板:LangChain中的高级技巧 nseejrukjhad langchain java 服务器 python
标题:如何部分格式化提示模板:LangChain中的高级技巧内容:如何部分格式化提示模板:LangChain中的高级技巧引言在使用大型语言模型(LLM)时,提示工程是一个关键环节。LangChain提供了强大的提示模板功能,让我们能更灵活地构建和管理提示。本文将介绍LangChain中一个高级特性-部分格式化提示模板,这个技巧可以让你的提示管理更加高效和灵活。什么是部分格式化提示模板?部分格式化提
GitHub上克隆项目 bigbig猩猩 github
从GitHub上克隆项目是一个简单且直接的过程，它允许你将远程仓库中的项目复制到你的本地计算机上，以便进行进一步的开发、测试或学习。以下是一个详细的步骤指南，帮助你从GitHub上克隆项目。一、准备工作1.安装Git在克隆GitHub项目之前，你需要在你的计算机上安装Git工具。Git是一个开源的分布式版本控制系统，用于跟踪和管理代码变更。你可以从Git的官方网站（https://git-scm.
读书||陶新华《教育中的积极心理学》1—28 流水淙淙2022
读一本好书，尤如和一位高尚者对话，亦能对人的精神进行洗礼。但是若不能和实践结合起来，也只能落到空读书的状态。读书摘要与感想1、塞利格曼在《持续的幸福》一书中提出了幸福2.0理论，提出幸福由5个元素决定——积极情绪、投入的工作和生活、目标和意义、和谐的人际关系、成就感。2、人的大脑皮层在进行智力活动时，都伴有皮下中枢活动，对这些活动进行体验请假，并由此产生了情感解读。人的情绪情感体验总是优先于大脑的
走向以教育叙事为载体的教育叙事研究 666小飞鱼
今天我读了吴松超老师的《给教师的68条建写作建议》中的第23条《如何通过教育叙事走向研究》，吴老师在文中与我们分享了一个德育案例，这是一个反面的案例，意在告知我们在处理问题时，不能就考虑的点太窄，思考要全面。走向教育叙事研究，教师要有敏锐的“感知力”，这个感知力来自于背后专业知识的支撑，思维能力以及广阔的视野和见识等。所以对于同一件事处理方法不同，这个就是教师背后“敏锐力”的不同造成的，也就是说是
ARM中断处理过程落汤老狗嵌入式linux
一、前言本文主要以ARM体系结构下的中断处理为例，讲述整个中断处理过程中的硬件行为和软件动作。具体整个处理过程分成三个步骤来描述：1、第二章描述了中断处理的准备过程2、第三章描述了当发生中的时候，ARM硬件的行为3、第四章描述了ARM的中断进入过程4、第五章描述了ARM的中断退出过程本文涉及的代码来自3.14内核。另外，本文注意描述ARM指令集的内容，有些sourcecode为了简短一些，删除了T
如何成为段子手欣雅阅读
我是一个尬聊大师，与朋友聊天经常把话题聊死，留我一个人在群里，望着自己打下的最后一句话无语凝噎。看到风趣幽默的朋友与人聊天，很是艳羡，觉得自己何时才能成为这样的段子手呢？一、段子是什么？“段子”一词在百度百科上的解释：本是相声中的一个艺术术语，指的是相声作品中一节或一段艺术内容。我的理解：段子就是一些搞笑的故事或者笑话。二、为什么要会说段子？不知道大家有没有这样的朋友，本来很无趣的聚会，只要有他参
Python 实现图片裁剪（附代码） | Python工具剑客阿良_ALiang
前言本文提供将图片按照自定义尺寸进行裁剪的工具方法，一如既往的实用主义。环境依赖ffmpeg环境安装，可以参考我的另一篇文章：windowsffmpeg安装部署_阿良的博客-CSDN博客本文主要使用到的不是ffmpeg，而是ffprobe也在上面这篇文章中的zip包中。ffmpy安装：pipinstallffmpy-ihttps://pypi.douban.com/simple代码不废话了，上代码
【华为OD技术面试真题 - 技术面】- python八股文真题题库（4) 算法大师华为od 面试 python
华为OD面试真题精选专栏：华为OD面试真题精选目录:2024华为OD面试手撕代码真题目录以及八股文真题目录文章目录华为OD面试真题精选**1.Python中的`with`**用途和功能自动资源管理示例：文件操作上下文管理协议示例代码工作流程解析优点2.\_\_new\_\_和**\_\_init\_\_**区别__new____init__区别总结3.**切片（Slicing）操作**基本切片语法
Python爬虫解析工具之xpath使用详解 eqa11 python 爬虫开发语言
文章目录Python爬虫解析工具之xpath使用详解一、引言二、环境准备1、插件安装2、依赖库安装三、xpath语法详解1、路径表达式2、通配符3、谓语4、常用函数四、xpath在Python代码中的使用1、文档树的创建2、使用xpath表达式3、获取元素内容和属性五、总结Python爬虫解析工具之xpath使用详解一、引言在Python爬虫开发中，数据提取是一个至关重要的环节。xpath作为一门
eclipse maven IXHONG eclipse
eclipse中使用maven插件的时候，运行run as maven build的时候报错 -Dmaven.multiModuleProjectDirectory system propery is not set. Check $M2_HOME environment variable and mvn script match. 可以设一个环境变量M2_HOME指
timer cancel方法的一个小实例 alleni123 多线程 timer
package com.lj.timer; import java.util.Date; import java.util.Timer; import java.util.TimerTask; public class MyTimer extends TimerTask { private int a; private Timer timer; pub
MySQL数据库在Linux下的安装 ducklsl mysql
1.建好一个专门放置MySQL的目录 /mysql/db数据库目录 /mysql/data数据库数据文件目录 2.配置用户，添加专门的MySQL管理用户 >groupadd mysql ----添加用户组 >useradd -g mysql mysql ----在mysql用户组中添加一个mysql用户 3.配置，生成并安装MySQL >cmake -D
spring------>>cvc-elt.1: Cannot find the declaration of element Array_06 spring bean
将-------- <?xml version="1.0" encoding="UTF-8"?> <beans xmlns="http://www.springframework.org/schema/beans" xmlns:xsi="http://www.w3
maven发布第三方jar的一些问题 cugfy maven
maven中发布第三方jar到nexus仓库使用的是 deploy:deploy-file命令有许多参数，具体可查看 http://maven.apache.org/plugins/maven-deploy-plugin/deploy-file-mojo.html 以下是一个例子： mvn deploy:deploy-file -DgroupId=xpp3
MYSQL下载及安装 357029540 mysql
好久没有去安装过MYSQL，今天自己在安装完MYSQL过后用navicat for mysql去厕测试链接的时候出现了10061的问题，因为的的MYSQL是最新版本为5.6.24，所以下载的文件夹里没有my.ini文件，所以在网上找了很多方法还是没有找到怎么解决问题，最后看到了一篇百度经验里有这个的介绍，按照其步骤也完成了安装，在这里给大家分享下这个链接的地址
ios TableView cell的布局张亚雄 tableview
cell.imageView.image = [UIImage imageNamed:[imageArray objectAtIndex:[indexPath row]]]; CGSize itemSize = CGSizeMake(60, 50); &nbs
Java编码转义 adminjun java 编码转义
import java.io.UnsupportedEncodingException; /** * 转换字符串的编码 */ public class ChangeCharset { /** 7位ASCII字符，也叫作ISO646-US、Unicode字符集的基本拉丁块 */ public static final Strin
Tomcat 配置和spring aijuans spring
简介 Tomcat启动时，先找系统变量CATALINA_BASE，如果没有，则找CATALINA_HOME。然后找这个变量所指的目录下的conf文件夹，从中读取配置文件。最重要的配置文件：server.xml 。要配置tomcat，基本上了解server.xml，context.xml和web.xml。 Server.xml -- tomcat主
Java打印当前目录下的所有子目录和文件 ayaoxinchao 递归 File
其实这个没啥技术含量，大湿们不要操笑哦，只是做一个简单的记录，简单用了一下递归算法。 import java.io.File; /** * @author Perlin * @date 2014-6-30 */ public class PrintDirectory { public static void printDirectory(File f
linux安装mysql出现libs报冲突解决 BigBird2012 linux
linux安装mysql出现libs报冲突解决安装mysql出现 file /usr/share/mysql/ukrainian/errmsg.sys from install of MySQL-server-5.5.33-1.linux2.6.i386 conflicts with file from package mysql-libs-5.1.61-4.el6.i686
jedis连接池使用实例 bijian1013 redis jedis连接池 jedis
实例代码： package com.bijian.study; import java.util.ArrayList; import java.util.List; import redis.clients.jedis.Jedis; import redis.clients.jedis.JedisPool; import redis.clients.jedis.JedisPoo
关于朋友 bingyingao 朋友兴趣爱好维持
成为朋友的必要条件：志相同，道不合，可以成为朋友。譬如马云、周星驰一个是商人，一个是影星，可谓道不同，但都很有梦想，都要在各自领域里做到最好，当他们遇到一起，互相欣赏，可以畅谈两个小时。志不同，道相合，也可以成为朋友。譬如有时候看到两个一个成绩很好每次考试争做第一，一个成绩很差的同学是好朋友。他们志向不相同，但他
【Spark七十九】Spark RDD API一 bit1129 spark
aggregate package spark.examples.rddapi import org.apache.spark.{SparkConf, SparkContext} //测试RDD的aggregate方法 object AggregateTest { def main(args: Array[String]) { val conf = new Spar
ktap 0.1 released bookjovi kernel tracing
Dear, I'm pleased to announce that ktap release v0.1, this is the first official release of ktap project, it is expected that this release is not fully functional or very stable and we welcome bu
能保存Properties文件注释的Properties工具类 BrokenDreams properties
今天遇到一个小需求：由于java.util.Properties读取属性文件时会忽略注释，当写回去的时候，注释都没了。恰好一个项目中的配置文件会在部署后被某个Java程序修改一下，但修改了之后注释全没了，可能会给以后的参数调整带来困难。所以要解决这个问题。 &nb
读《研磨设计模式》-代码笔记-外观模式-Facade bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ /* * 百度百科的定义： * Facade（外观）模式为子系统中的各类（或结构与方法）提供一个简明一致的界面， * 隐藏子系统的复杂性，使子系统更加容易使用。他是为子系统中的一组接口所提供的一个一致的界面 * * 可简单地
After Effects教程收集 cherishLC After Effects
1、中文入门 http://study.163.com/course/courseMain.htm?courseId=730009 2、videocopilot英文入门教程（中文字幕） http://www.youku.com/playlist_show/id_17893193.html 英文原址： http://www.videocopilot.net/basic/ 素
Linux Apache 安装过程 crabdave apache
Linux Apache 安装过程下载新版本： apr-1.4.2.tar.gz（下载网站：http://apr.apache.org/download.cgi） apr-util-1.3.9.tar.gz（下载网站：http://apr.apache.org/download.cgi） httpd-2.2.15.tar.gz（下载网站：http://httpd.apac
Shell学习之变量赋值和引用 daizj shell 变量引用赋值
本文转自：http://www.cnblogs.com/papam/articles/1548679.html Shell编程中，使用变量无需事先声明，同时变量名的命名须遵循如下规则：首个字符必须为字母（a-z，A-Z）中间不能有空格，可以使用下划线（_）不能使用标点符号不能使用bash里的关键字（可用help命令查看保留关键字）需要给变量赋值时，可以这么写：
Java SE 第一讲（Java SE入门、JDK的下载与安装、第一个Java程序、Java程序的编译与执行） dcj3sjt126com java jdk
Java SE 第一讲： Java SE：Java Standard Edition Java ME: Java Mobile Edition Java EE：Java Enterprise Edition Java是由Sun公司推出的（今年初被Oracle公司收购）。收购价格：74亿美金 J2SE、J2ME、J2EE JDK：Java Development
YII给用户登录加上验证码 dcj3sjt126com yii
1、在SiteController中添加如下代码： /** * Declares class-based actions. */ public function actions() { return array( // captcha action renders the CAPTCHA image displ
Lucene使用说明 dyy_gusi Lucene search 分词器
Lucene使用说明 1、lucene简介 1.1、什么是lucene Lucene是一个全文搜索框架，而不是应用产品。因此它并不像baidu或者googleDesktop那种拿来就能用，它只是提供了一种工具让你能实现这些产品和功能。 1.2、lucene能做什么要回答这个问题，先要了解lucene的本质。实际
学习编程并不难,做到以下几点即可! gcq511120594 数据结构编程算法
不论你是想自己设计游戏，还是开发iPhone或安卓手机上的应用，还是仅仅为了娱乐，学习编程语言都是一条必经之路。编程语言种类繁多，用途各异，然而一旦掌握其中之一，其他的也就迎刃而解。作为初学者，你可能要先从Java或HTML开始学，一旦掌握了一门编程语言，你就发挥无穷的想象，开发各种神奇的软件啦。 1、确定目标学习编程语言既充满乐趣，又充满挑战。有些花费多年时间学习一门编程语言的大学生到
Java面试十问之三：Java与C++内存回收机制的差别 HNUlanwei java C++finalize()堆栈内存回收
大家知道， Java 除了那 8 种基本类型以外，其他都是对象类型（又称为引用类型）的数据。 JVM 会把程序创建的对象存放在堆空间中，那什么又是堆空间呢？其实，堆（ Heap）是一个运行时的数据存储区，从它可以分配大小各异的空间。一般，运行时的数据存储区有堆（ Heap）和堆栈（ Stack），所以要先看它们里面可以分配哪些类型的对象实体，然后才知道如何均衡使用这两种存储区。一般来说，栈中存放的
第二章 Nginx+Lua开发入门 jinnianshilongnian nginx lua
Nginx入门本文目的是学习Nginx+Lua开发，对于Nginx基本知识可以参考如下文章： nginx启动、关闭、重启 http://www.cnblogs.com/derekchen/archive/2011/02/17/1957209.html agentzh 的 Nginx 教程 http://openresty.org/download/agentzh-nginx-tutor
MongoDB windows安装基本命令 liyonghui160com
windows安装安装目录： D:\MongoDB\ 新建目录 D:\MongoDB\data\db 4.启动进城： cd D:\MongoDB\bin mongod -dbpath D:\MongoDB\data\db &n
Linux下通过源码编译安装程序 pda158 linux
一、程序的组成部分　　Linux下程序大都是由以下几部分组成：　　二进制文件：也就是可以运行的程序文件　　库文件：就是通常我们见到的lib目录下的文件　　配置文件：这个不必多说，都知道　　帮助文档：通常是我们在linux下用man命令查看的命令的文档　　二、linux下程序的存放目录　　linux程序的存放目录大致有三个地方：　　/etc, /b
WEB开发编程的职业生涯４个阶段 shw3588 编程 Web 工作生活
觉得自己什么都会 2007年从学校毕业，凭借自己原创的ASP毕业设计，以为自己很厉害似的，信心满满去东莞找工作，找面试成功率确实很高，只是工资不高，但依旧无法磨灭那过分的自信，那时候什么考勤系统、什么OA系统、什么ERP，什么都觉得有信心，这样的生涯大概持续了约一年。根本不是自己想的那样 2008年开始接触很多工作相关的东西，发现太多东西自己根本不会，都需要去学，不管是asp还是js，
遭遇jsonp同域下变作post请求的坑 vb2005xu jsonp 同域post
今天迁移一个站点时遇到一个坑爹问题,同一个jsonp接口在跨域时都能调用成功,但是在同域下调用虽然成功,但是数据却有问题. 此处贴出我的后端代码片段 $mi_id = htmlspecialchars(trim($_GET['mi_id '])); $mi_cv = htmlspecialchars(trim($_GET['mi_cv '])); 贴出我前端代码片段: $.aj