xqp_dream

Caffe学习：Layers

感谢：http://blog.csdn.net/u011762313/article/details/47361571

目录：

Vision Layers
- Convolution
- Pooling
- Local Response Normalization LRN
- im2col
Loss Layers
- Softmax
- Sum-of-Squares Euclidean
- Hinge Margin
- Sigmoid Cross-Entropy
- Infogain
- Accuracy and Top-k
Activation Neuron Layers
- ReLU Rectified-Linear and Leaky-ReLU
- Sigmoid
- TanH Hyperbolic Tangent
- Absolute Value
- Power
- BNLL
Data Layers
- Database
- In-Memory
- HDF5 Input
- HDF5 Output
- Images
- Windows
- Dummy
Common Layers
- Inner Product
- Splitting
- Flattening
- Reshape
- Concatenation
- Slicing
- Elementwise Operations
- Argmax
- Softmax
- Mean-Variance Normalization

原文

要想创建一个Caffe模型，需要在prototxt中定义一个model architecture（模型架构）。
Caffe自带的Layer及其参数被定义在caffe.proto中。

Vision Layers

头文件： ./include/caffe/vision_layers.hpp

Vision layers 通常以图片images作为输入，运算后产生输出的也是图片images。对于图片而言，可能是单通道的(c=1)，例如灰度图，或者三通道的 (c=3)，例如RGB图。但是，对于Vision layers而言，最重要的特性是输入的spatial structure（空间结构）。2D的几何形状有助于输入处理，大部分的Vision layers工作是对于输入图片中的某一个区域做一个特定的处理，产生一个相应的输出。与此相反，其他大部分的layers会忽略输入的空间结构，而只是将输入视为一个很大的向量，维度为： c*h*w。

Convolution

类型（type）：Convolution（卷积层）
CPU 实现： ./src/caffe/layers/convolution_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/convolution_layer.cu
参数（convolution_param）：
必要：
- num_output (c_o): the number of filters（滤波器数目）
- kernel_size (or kernel_h and kernel_w): specifies height and width of each filter（每一个滤波器的大小）
强烈推荐：
- weight_filler [default type: ‘constant’ value: 0]（滤波器权重，默认为0）
可选：
- bias_term [default true]: specifies whether to learn and apply a set of additive biases to the filter outputs（是否添加bias-偏置项，默认为True）
- pad (or pad_h and pad_w) [default 0]: specifies the number of pixels to (implicitly) add to each side of the input（为输入添加边界的像素大小，默认为0）
- stride (or stride_h and stride_w) [default 1]: specifies the intervals at which to apply the filters to the input（每一次使用滤波器处理输入图片时，前后两次处理区域的间隔，即“步进”，默认为1）
- group (g) [default 1]: If g > 1, we restrict the connectivity of each filter to a subset of the input. Specifically, the input and output channels are separated into g groups, and the ith output group channels will be only connected to the ith input group channels.（默认为1，如果大于1：将限制每一个滤波器只与输入的一部分连接。输入、输出通道会被分隔为不同的g个groups，并且第i个输出group只会与第i个输出group相关）
输入（Input）
n * c_i * h_i * w_i
输出（Output）
n * c_o * h_o * w_o，其中h_o = (h_i + 2 * pad_h - kernel_h) / stride_h + 1；w_o类似
例子(详见 ./examples/imagenet/imagenet_train_val.prototxt)

layer {
  name: "conv1"                  # 名称：conv1
  type: "Convolution"            # 类型：卷积层
  bottom: "data"                 # 输入层：数据层
  top: "conv1"                   # 输出层：卷积层1
  # 滤波器（filters）的学习速率因子和衰减因子
  param { lr_mult: 1 decay_mult: 1 }
  # 偏置项（biases）的学习速率因子和衰减因子
  param { lr_mult: 2 decay_mult: 0 }
  convolution_param {
    num_output: 96               # 96个滤波器（filters）
    kernel_size: 11              # 每个滤波器（filters）大小为11*11
    stride: 4                    # 每次滤波间隔为4个像素
    weight_filler {
      type: "gaussian"           # 初始化高斯滤波器（Gaussian）
      std: 0.01                  # 标准差为0.01， 均值默认为0
    }
    bias_filler {
      type: "constant"           # 初始化偏置项（bias）为零
      value: 0
    }
  }
}

卷积层（The Convolution layer）利用一系列具有学习功能的滤波器（learnable filters）对输入的图像进行卷积操作，每一个滤波器（filter）对于一个特征（feature ）会产生一个输出图像（output image）。

Pooling

类型（type）：Pooling（池化层）
CPU 实现： ./src/caffe/layers/pooling_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/pooling_layer.cu
参数（pooling_param）：
- 必要：
  - kernel_size (or kernel_h and kernel_w): specifies height and width of each filter（每一个滤波器的大小）
- 可选：
  - pool [default MAX]: the pooling method. Currently MAX, AVE, or STOCHASTIC（pooling方法，目前有MAX、AVE,和STOCHASTIC三种，默认为MAX）
  - pad (or pad_h and pad_w) [default 0]: specifies the number of pixels to (implicitly) add to each side of the input（为输入添加边界的像素大小，默认为0）
  - stride (or stride_h and stride_w) [default 1]: specifies the intervals at which to apply the filters to the input（每一次使用滤波器处理输入图片时，前后两次处理区域的间隔，即“步进”，默认为1）
输入（Input）
- n * c_i * h_i * w_i
输出（Output）
- n * c_o * h_o * w_o，其中h_o = (h_i + 2 * pad_h - kernel_h) / stride_h + 1；w_o类似
例子(详见 ./examples/imagenet/imagenet_train_val.prototxt)

layer {
  name: "pool1"                 # 名称：pool1
  type: "Pooling"               # 类型：池化层
  bottom: "conv1"               # 输入层：卷积层conv1
  top: "pool1"                  # 输出层：池化层pool1
  pooling_param {
    pool: MAX                   # pool方法：MAX
    kernel_size: 3              # 每次pool区域为3*3像素大小
    stride: 2                   # pool步进为2
  }
}

Local Response Normalization (LRN)

类型（type）：LRN（局部响应归一化层）
CPU 实现： ./src/caffe/layers/lrn_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/lrn_layer.cu
参数（lrn_param）：
- 可选：
  - local_size [default 5]: the number of channels to sum over (for cross channel LRN) or the side length of the square region to sum over (for within channel LRN)（对于cross channel LRN，表示需要求和的channel的数量；对于within channel LRN表示需要求和的空间区域的边长；默认为5）
  - alpha [default 1]: the scaling parameter（缩放参数，默认为1）
  - beta [default 5]: the exponent（指数，默认为5）
  - norm_region [default ACROSS_CHANNELS]: whether to sum over adjacent channels (ACROSS_CHANNELS) or nearby spatial locaitons (WITHIN_CHANNEL)（选择基准区域，是ACROSS_CHANNELS => 相邻channels，还是WITHIN_CHANNEL => 同一 channel下的相邻空间区域；默认为ACROSS_CHANNELS）

LRN Layer对一个局部的输入区域进行归一化，有两种模式。ACROSS_CHANNELS模式，局部区域在相邻的channels之间拓展，不进行空间拓展，所以维度是local_size x 1 x 1。WITHIN_CHANNEL模式，局部区域进行空间拓展，但是是在不同的channels中，所以维度是1 x local_size x local_size。对于每一个输入，都要除以：，其中n是局部区域的大小，求和部分是对该输入值为中心的区域进行求和（必要时候可以补零）。

im2col

Im2col 是一个helper方法，用于将图片文件image转化为列矩阵，详细的细节不需要过多的了解。在Caffe中进行卷积操作，做矩阵乘法时，会用到Im2col方法。

Loss Layers

Caffe是通过最小化输出output与目标target之间的cost（loss）来驱动学习的。loss是由forward pass计算得出的，loss的gradient 是由backward pass计算得出的。

Softmax

类型（type）：SoftmaxWithLoss（广义线性回归分析损失层）

Softmax Loss Layer计算的是输入的多项式回归损失（multinomial logistic loss of the softmax of its inputs）。可以当作是将一个softmax layer和一个multinomial logistic loss layer连接起来，但是计算出的gradient更可靠。

Sum-of-Squares / Euclidean

类型（type）：EuclideanLoss（欧式损失层）

Euclidean loss layer计算两个不同输入之间的平方差之和，

Hinge / Margin

类型（type）：HingeLoss
CPU 实现： ./src/caffe/layers/hinge_loss_layer.cpp
CUDA、GPU实现：尚无
参数（hinge_loss_param）：
- 可选：
  - norm [default L1]: the norm used. Currently L1, L2（可以选择使用L1范数或者L2范数；默认为L1）
输入（Input）
- n * c * h * w Predictions（预测值）
- n * 1 * 1 * 1 Labels（标签值）
输出（Output）
- 1 * 1 * 1 * 1 Computed Loss（计算得出的loss值）
例子

# 使用L1范数
layer {
  name: "loss"                  # 名称：loss
  type: "HingeLoss"             # 类型：HingeLoss
  bottom: "pred"                # 输入：预测值
  bottom: "label"               # 输入：标签值
}

# 使用L2范数
layer {
  name: "loss"                  # 名称：loss
  type: "HingeLoss"             # 类型：HingeLoss
  bottom: "pred"                # 输入：预测值
  bottom: "label"               # 输入：标签值
  top: "loss"                   # 输出：loss值
  hinge_loss_param {
    norm: L2                    # 使用L2范数
  }
}

关于范数：

Sigmoid Cross-Entropy

类型（type）：SigmoidCrossEntropyLoss
（没有详解）

Infogain

类型（type）：InfogainLoss
（没有详解）

Accuracy and Top-k

类型（type）：Accuracy
计算输出的准确率（相对于target），事实上这不是一个loss layer，并且也没有backward pass。

Activation / Neuron Layers

激励层的操作都是element-wise的操作（针对每一个输入blob产生一个相同大小的输出）：

输入（Input）
- n * c * h * w
输出（Output）
- n * c * h * w

ReLU / Rectified-Linear and Leaky-ReLU

类型（type）：ReLU
CPU 实现： ./src/caffe/layers/relu_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/relu_layer.cu
参数（relu_param）：
- 可选：
  - negative_slope [default 0]: specifies whether to leak the negative part by multiplying it with the slope value rather than setting it to 0.（但当输入x小于0时，指定输出为negative_slope * x；默认值为0）
例子(详见 ./examples/imagenet/imagenet_train_val.prototxt)

layer {
  name: "relu1"
  type: "ReLU"
  bottom: "conv1"
  top: "conv1"
}

给定一个输入值x，ReLU layer的输出为：x > 0 ? x : negative_slope * x，如未给定参数negative_slope 的值，则为标准ReLU方法：max(x, 0)。ReLU layer支持in-place计算，输出会覆盖输入，以节省内存空间。

Sigmoid

类型（type）：Sigmoid
CPU 实现： ./src/caffe/layers/sigmoid_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/sigmoid_layer.cu
例子(详见 ./examples/mnist/mnist_autoencoder.prototxt)

layer {
  name: "encode1neuron"
  bottom: "encode1"
  top: "encode1neuron"
  type: "Sigmoid"
}

对于每一个输入值x，Sigmoid layer的输出为sigmoid(x)。

TanH / Hyperbolic Tangent

类型（type）：TanH
CPU 实现： ./src/caffe/layers/tanh_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/tanh_layer.cu
例子

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: "TanH"
}

对于每一个输入值x，TanH layer的输出为tanh(x)。

Absolute Value

类型（type）：AbsVal
CPU 实现： ./src/caffe/layers/absval_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/absval_layer.cu
例子

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: "AbsVal"
}

对于每一个输入值x，AbsVal layer的输出为abs(x)。

Power

类型（type）：Power
CPU 实现： ./src/caffe/layers/power_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/power_layer.cu
参数（power_param）：
- 可选：
  - power [default 1]（指数，默认为1）
  - scale [default 1]（比例，默认为1）
  - shift [default 0]（偏移，默认为0）
例子

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: "Power"
  power_param {
    power: 1
    scale: 1
    shift: 0
  }
}

对于每一个输入值x，Power layer的输出为(shift + scale * x) ^ power。

BNLL

类型（type）：BNLL（二项正态对数似然，binomial normal log likelihood）
CPU 实现： ./src/caffe/layers/bnll_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/bnll_layer.cu
例子

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: BNLL
}

对于每一个输入值x，BNLL layer的输出为log(1 + exp(x))。

Data Layers

Data 通过Data Layers进入Caffe，Data Layers位于Net的底部。
Data 可以来自：1、高效的数据库（LevelDB 或 LMDB）；2、内存；3、HDF5或image文件（效率低）。
基本的输入预处理（例如：减去均值，缩放，随机裁剪，镜像处理）可以通过指定TransformationParameter达到。

Database

类型（type）：Data（数据库）
参数：
- 必要：
  - source: the name of the directory containing the database（数据库名称）
  - batch_size: the number of inputs to process at one time（每次处理的输入的数据量）
- 可选：
  - rand_skip: skip up to this number of inputs at the beginning; useful for asynchronous sgd（在开始的时候跳过这个数值量的输入；这对于异步随机梯度下降是非常有用的）
  - backend [default LEVELDB]: choose whether to use a LEVELDB or LMDB（选择使用LEVELDB 数据库还是LMDB数据库，默认为LEVELDB）

In-Memory

类型（type）：MemoryData
参数：
- 必要：
  - batch_size, channels, height, width: specify the size of input chunks to read from memory（4个值，确定每次读取输入数据量的大小）

Memory Data Layer从内存直接读取数据（而不是复制数据）。使用Memory Data Layer之前，必须先调用，MemoryDataLayer::Reset（C++方法）或Net.set_input_arrays（Python方法）以指定一个source来读取一个连续的数据块（4D，按行排列），每次读取大小由batch_size决定。

HDF5 Input

类型（type）：HDF5Data
参数：
- 必要：
  - source: the name of the file to read from（读取的文件的名称）
  - batch_size（每次处理的输入的数据量）

HDF5 Output

类型（type）：HDF5Output
参数：
- 必要：
  - file_name: name of file to write to（写入的文件的名称）
HDF5 output layer与这部分的其他layer的功能正好相反，不是读取而是写入。

Images

类型（type）：ImageData
参数：
- 必要：
  - source: name of a text file, with each line giving an image filename and label（一个text文件的名称，每一行指定一个image文件名和label）
  - batch_size: number of images to batch together（每次处理的image的数据）
- 可选：
  - rand_skip: （在开始的时候跳过这个数值量的输入）
  - shuffle [default false]（是否随机乱序，默认为否）
    -new_height, new_width: if provided, resize all images to this size（缩放所有的image到新的大小）

Windows

类型（type）：WindowData
（没有详解）

Dummy

类型（type）：DummyData

DummyData 用于开发和测试，详见DummyDataParameter（没有给出链接）。

Common Layers

Inner Product

类型（type）：Inner Product（全连接层）
CPU 实现： ./src/caffe/layers/inner_product_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/inner_product_layer.cu
参数（inner_product_param）：
- 必要：
  - num_output (c_o): the number of filters（滤波器数目）
- 强烈推荐：
  - weight_filler [default type: ‘constant’ value: 0]（滤波器权重；默认类型为constant，默认值为0）
- 可选：
  - bias_filler [default type: ‘constant’ value: 0]（bias-偏置项的值，默认类型为constant，默认值为0）
  - bias_term [default true]: specifies whether to learn and apply a set of additive biases to the filter outputs（是否添加bias-偏置项，默认为True）
输入（Input）
- n * c_i * h_i * w_i
输出（Output）
- n * c_o * 1 * 1
例子

layer {
  name: "fc8"                              # 名称：fc8
  type: "InnerProduct"                     # 类型：全连接层
  # 权重（weights）的学习速率因子和衰减因子
  param { lr_mult: 1 decay_mult: 1 }
  # 偏置项（biases）的学习速率因子和衰减因子
  param { lr_mult: 2 decay_mult: 0 }
  inner_product_param {
    num_output: 1000                       # 1000个滤波器（filters）
    weight_filler {
      type: "gaussian"                     # 初始化高斯滤波器（Gaussian）
      std: 0.01                            # 标准差为0.01， 均值默认为0
    }
    bias_filler {
      type: "constant"                     # 初始化偏置项（bias）为零
      value: 0
    }
  }
  bottom: "fc7"                            # 输入层：fc7
  top: "fc8"                               # 输出层：fc8
}

InnerProduct layer（常被称为全连接层）将输入视为一个vector，输出也是一个vector（height和width被设为1）

Splitting

类型（type）：Split

Split layer用于将一个输入的blob分离成多个输出的blob。这用于当需要将一个blob输入至多个输出layer时。

Flattening

类型（type）：Flatten

Flatten layer用于把一个维度为n * c * h * w的输入转化为一个维度为 n * (c*h*w)的向量输出。

Reshape

类型（type）：Reshape
CPU 实现： ./src/caffe/layers/reshape_layer.cpp
CUDA、GPU实现：尚无
参数（reshape_param）：
- 可选：
  - shape（改变后的维度，详见下面解释）
输入（Input）
- a single blob with arbitrary dimensions（一个任意维度的blob）
输出（Output）
- the same blob, with modified dimensions, as specified by reshape_param（相同内容的blob，但维度根据reshape_param改变）
例子

 layer {
    name: "reshape"                       # 名称：reshape
    type: "Reshape"                       # 类型：Reshape
    bottom: "input"                       # 输入层名称：input
    top: "output"                         # 输出层名称：output
    reshape_param {
      shape {
        dim: 0  # 这个维度与输入相同
        dim: 2
        dim: 3
        dim: -1 # 根据其他维度自动推测
      }
    }
  }

Reshape layer只改变输入数据的维度，但内容不变，也没有数据复制的过程，与Flatten layer类似。

输出维度由reshape_param 指定，正整数直接指定维度大小，下面两个特殊的值：

0 => 表示copy the respective dimension of the bottom layer，复制输入相应维度的值。
-1 => 表示infer this from the other dimensions，根据其他维度自动推测维度大小。reshape_param中至多只能有一个-1。

再举一个例子：如果指定reshape_param参数为：{ shape { dim: 0 dim: -1 } } ，那么输出和Flattening layer的输出是完全一样的。

Concatenation

类型（type）：Concat（连结层）
CPU 实现： ./src/caffe/layers/concat_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/concat_layer.cu
参数（concat_param）：
- 可选：
  - axis [default 1]: 0 for concatenation along num and 1 for channels.（0代表连结num，1代表连结channel）
输入（Input）
-n_i * c_i * h * w for each input blob i from 1 to K.（第i个blob的维度是n_i * c_i * h * w，共K个）
输出（Output）
- if axis = 0: (n_1 + n_2 + … + n_K) * c_1 * h * w, and all input c_i should be the same.（axis = 0时，输出 blob的维度为(n_1 + n_2 + … + n_K) * c_1 * h * w，要求所有的input的channel相同）
- if axis = 1: n_1 * (c_1 + c_2 + … + c_K) * h * w, and all input n_i should be the same.（axis = 0时，输出 blob的维度为n_1 * (c_1 + c_2 + … + c_K) * h * w，要求所有的input的num相同）
例子

layer {
  name: "concat"
  bottom: "in1"
  bottom: "in2"
  top: "out"
  type: "Concat"
  concat_param {
    axis: 1
  }
}

Concat layer用于把多个输入blob连结成一个输出blob。

Slicing

Slice layer用于将一个input layer分割成多个output layers，根据给定的维度（目前只能指定num或者channel）。

类型（type）：Slice
例子

layer {
  name: "slicer_label"
  type: "Slice"
  bottom: "label"
  ## 假设label的维度是：N x 3 x 1 x 1
  top: "label1"
  top: "label2"
  top: "label3"
  slice_param {
    axis: 1                        # 指定维度为channel
    slice_point: 1                 # 将label[~][1][~][~]赋给label1
    slice_point: 2                 # 将label[~][2][~][~]赋给label2
                                   # 将label[~][3][~][~]赋给label3
  }
}

axis表明是哪一个维度，slice_point是该维度的索引，slice_point的数量必须是top blobs的数量减1.

Elementwise Operations

类型（type）： Eltwise
（没有详解）

Argmax

类型（type）：ArgMax
（没有详解）

Softmax

类型（type）：Softmax
（没有详解）

Mean-Variance Normalization

类型（type）：MVN
（没有详解）

Vision Layers
- Convolution
- Pooling
- Local Response Normalization LRN
- im2col
Loss Layers
- Softmax
- Sum-of-Squares Euclidean
- Hinge Margin
- Sigmoid Cross-Entropy
- Infogain
- Accuracy and Top-k
Activation Neuron Layers
- ReLU Rectified-Linear and Leaky-ReLU
- Sigmoid
- TanH Hyperbolic Tangent
- Absolute Value
- Power
- BNLL
Data Layers
- Database
- In-Memory
- HDF5 Input
- HDF5 Output
- Images
- Windows
- Dummy
Common Layers
- Inner Product
- Splitting
- Flattening
- Reshape
- Concatenation
- Slicing
- Elementwise Operations
- Argmax
- Softmax
- Mean-Variance Normalization

原文

要想创建一个Caffe模型，需要在prototxt中定义一个model architecture（模型架构）。
Caffe自带的Layer及其参数被定义在caffe.proto中。

Vision Layers

头文件： ./include/caffe/vision_layers.hpp

Convolution

类型（type）：Convolution（卷积层）
CPU 实现： ./src/caffe/layers/convolution_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/convolution_layer.cu
参数（convolution_param）：
必要：
- num_output (c_o): the number of filters（滤波器数目）
- kernel_size (or kernel_h and kernel_w): specifies height and width of each filter（每一个滤波器的大小）
强烈推荐：
- weight_filler [default type: ‘constant’ value: 0]（滤波器权重，默认为0）
可选：
- bias_term [default true]: specifies whether to learn and apply a set of additive biases to the filter outputs（是否添加bias-偏置项，默认为True）
- pad (or pad_h and pad_w) [default 0]: specifies the number of pixels to (implicitly) add to each side of the input（为输入添加边界的像素大小，默认为0）
- stride (or stride_h and stride_w) [default 1]: specifies the intervals at which to apply the filters to the input（每一次使用滤波器处理输入图片时，前后两次处理区域的间隔，即“步进”，默认为1）
- group (g) [default 1]: If g > 1, we restrict the connectivity of each filter to a subset of the input. Specifically, the input and output channels are separated into g groups, and the ith output group channels will be only connected to the ith input group channels.（默认为1，如果大于1：将限制每一个滤波器只与输入的一部分连接。输入、输出通道会被分隔为不同的g个groups，并且第i个输出group只会与第i个输出group相关）
输入（Input）
n * c_i * h_i * w_i
输出（Output）
n * c_o * h_o * w_o，其中h_o = (h_i + 2 * pad_h - kernel_h) / stride_h + 1；w_o类似
例子(详见 ./examples/imagenet/imagenet_train_val.prototxt)

layer {
  name: "conv1"                  # 名称：conv1
  type: "Convolution"            # 类型：卷积层
  bottom: "data"                 # 输入层：数据层
  top: "conv1"                   # 输出层：卷积层1
  # 滤波器（filters）的学习速率因子和衰减因子
  param { lr_mult: 1 decay_mult: 1 }
  # 偏置项（biases）的学习速率因子和衰减因子
  param { lr_mult: 2 decay_mult: 0 }
  convolution_param {
    num_output: 96               # 96个滤波器（filters）
    kernel_size: 11              # 每个滤波器（filters）大小为11*11
    stride: 4                    # 每次滤波间隔为4个像素
    weight_filler {
      type: "gaussian"           # 初始化高斯滤波器（Gaussian）
      std: 0.01                  # 标准差为0.01， 均值默认为0
    }
    bias_filler {
      type: "constant"           # 初始化偏置项（bias）为零
      value: 0
    }
  }
}1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23

Pooling

类型（type）：Pooling（池化层）
CPU 实现： ./src/caffe/layers/pooling_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/pooling_layer.cu
参数（pooling_param）：
- 必要：
  - kernel_size (or kernel_h and kernel_w): specifies height and width of each filter（每一个滤波器的大小）
- 可选：
  - pool [default MAX]: the pooling method. Currently MAX, AVE, or STOCHASTIC（pooling方法，目前有MAX、AVE,和STOCHASTIC三种，默认为MAX）
  - pad (or pad_h and pad_w) [default 0]: specifies the number of pixels to (implicitly) add to each side of the input（为输入添加边界的像素大小，默认为0）
  - stride (or stride_h and stride_w) [default 1]: specifies the intervals at which to apply the filters to the input（每一次使用滤波器处理输入图片时，前后两次处理区域的间隔，即“步进”，默认为1）
输入（Input）
- n * c_i * h_i * w_i
输出（Output）
- n * c_o * h_o * w_o，其中h_o = (h_i + 2 * pad_h - kernel_h) / stride_h + 1；w_o类似
例子(详见 ./examples/imagenet/imagenet_train_val.prototxt)

layer {
  name: "pool1"                 # 名称：pool1
  type: "Pooling"               # 类型：池化层
  bottom: "conv1"               # 输入层：卷积层conv1
  top: "pool1"                  # 输出层：池化层pool1
  pooling_param {
    pool: MAX                   # pool方法：MAX
    kernel_size: 3              # 每次pool区域为3*3像素大小
    stride: 2                   # pool步进为2
  }
}1
2
3
4
5
6
7
8
9
10
11

Local Response Normalization (LRN)

类型（type）：LRN（局部响应归一化层）
CPU 实现： ./src/caffe/layers/lrn_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/lrn_layer.cu
参数（lrn_param）：
- 可选：
  - local_size [default 5]: the number of channels to sum over (for cross channel LRN) or the side length of the square region to sum over (for within channel LRN)（对于cross channel LRN，表示需要求和的channel的数量；对于within channel LRN表示需要求和的空间区域的边长；默认为5）
  - alpha [default 1]: the scaling parameter（缩放参数，默认为1）
  - beta [default 5]: the exponent（指数，默认为5）
  - norm_region [default ACROSS_CHANNELS]: whether to sum over adjacent channels (ACROSS_CHANNELS) or nearby spatial locaitons (WITHIN_CHANNEL)（选择基准区域，是ACROSS_CHANNELS => 相邻channels，还是WITHIN_CHANNEL => 同一 channel下的相邻空间区域；默认为ACROSS_CHANNELS）

LRN Layer对一个局部的输入区域进行归一化，有两种模式。ACROSS_CHANNELS模式，局部区域在相邻的channels之间拓展，不进行空间拓展，所以维度是local_size x 1 x 1。WITHIN_CHANNEL模式，局部区域进行空间拓展，但是是在不同的channels中，所以维度是1 x local_size x local_size。对于每一个输入，都要除以：，其中n是局部区域的大小，求和部分是对该输入值为中心的区域进行求和（必要时候可以补零）。

im2col

Im2col 是一个helper方法，用于将图片文件image转化为列矩阵，详细的细节不需要过多的了解。在Caffe中进行卷积操作，做矩阵乘法时，会用到Im2col方法。

Loss Layers

Caffe是通过最小化输出output与目标target之间的cost（loss）来驱动学习的。loss是由forward pass计算得出的，loss的gradient 是由backward pass计算得出的。

Softmax

类型（type）：SoftmaxWithLoss（广义线性回归分析损失层）

Sum-of-Squares / Euclidean

类型（type）：EuclideanLoss（欧式损失层）

Euclidean loss layer计算两个不同输入之间的平方差之和，

Hinge / Margin

类型（type）：HingeLoss
CPU 实现： ./src/caffe/layers/hinge_loss_layer.cpp
CUDA、GPU实现：尚无
参数（hinge_loss_param）：
- 可选：
  - norm [default L1]: the norm used. Currently L1, L2（可以选择使用L1范数或者L2范数；默认为L1）
输入（Input）
- n * c * h * w Predictions（预测值）
- n * 1 * 1 * 1 Labels（标签值）
输出（Output）
- 1 * 1 * 1 * 1 Computed Loss（计算得出的loss值）
例子

# 使用L1范数
layer {
  name: "loss"                  # 名称：loss
  type: "HingeLoss"             # 类型：HingeLoss
  bottom: "pred"                # 输入：预测值
  bottom: "label"               # 输入：标签值
}

# 使用L2范数
layer {
  name: "loss"                  # 名称：loss
  type: "HingeLoss"             # 类型：HingeLoss
  bottom: "pred"                # 输入：预测值
  bottom: "label"               # 输入：标签值
  top: "loss"                   # 输出：loss值
  hinge_loss_param {
    norm: L2                    # 使用L2范数
  }
}1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19

关于范数：

Sigmoid Cross-Entropy

类型（type）：SigmoidCrossEntropyLoss
（没有详解）

Infogain

类型（type）：InfogainLoss
（没有详解）

Accuracy and Top-k

类型（type）：Accuracy
计算输出的准确率（相对于target），事实上这不是一个loss layer，并且也没有backward pass。

Activation / Neuron Layers

激励层的操作都是element-wise的操作（针对每一个输入blob产生一个相同大小的输出）：

输入（Input）
- n * c * h * w
输出（Output）
- n * c * h * w

ReLU / Rectified-Linear and Leaky-ReLU

类型（type）：ReLU
CPU 实现： ./src/caffe/layers/relu_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/relu_layer.cu
参数（relu_param）：
- 可选：
  - negative_slope [default 0]: specifies whether to leak the negative part by multiplying it with the slope value rather than setting it to 0.（但当输入x小于0时，指定输出为negative_slope * x；默认值为0）
例子(详见 ./examples/imagenet/imagenet_train_val.prototxt)

layer {
  name: "relu1"
  type: "ReLU"
  bottom: "conv1"
  top: "conv1"
}1
2
3
4
5
6

Sigmoid

类型（type）：Sigmoid
CPU 实现： ./src/caffe/layers/sigmoid_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/sigmoid_layer.cu
例子(详见 ./examples/mnist/mnist_autoencoder.prototxt)

layer {
  name: "encode1neuron"
  bottom: "encode1"
  top: "encode1neuron"
  type: "Sigmoid"
}1
2
3
4
5
6

对于每一个输入值x，Sigmoid layer的输出为sigmoid(x)。

TanH / Hyperbolic Tangent

类型（type）：TanH
CPU 实现： ./src/caffe/layers/tanh_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/tanh_layer.cu
例子

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: "TanH"
}1
2
3
4
5
6

对于每一个输入值x，TanH layer的输出为tanh(x)。

Absolute Value

类型（type）：AbsVal
CPU 实现： ./src/caffe/layers/absval_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/absval_layer.cu
例子

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: "AbsVal"
}1
2
3
4
5
6

对于每一个输入值x，AbsVal layer的输出为abs(x)。

Power

类型（type）：Power
CPU 实现： ./src/caffe/layers/power_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/power_layer.cu
参数（power_param）：
- 可选：
  - power [default 1]（指数，默认为1）
  - scale [default 1]（比例，默认为1）
  - shift [default 0]（偏移，默认为0）
例子

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: "Power"
  power_param {
    power: 1
    scale: 1
    shift: 0
  }
}1
2
3
4
5
6
7
8
9
10
11

对于每一个输入值x，Power layer的输出为(shift + scale * x) ^ power。

BNLL

类型（type）：BNLL（二项正态对数似然，binomial normal log likelihood）
CPU 实现： ./src/caffe/layers/bnll_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/bnll_layer.cu
例子

layer {
  name: "layer"
  bottom: "in"
  top: "out"
  type: BNLL
}1
2
3
4
5
6

对于每一个输入值x，BNLL layer的输出为log(1 + exp(x))。

Data Layers

Database

类型（type）：Data（数据库）
参数：
- 必要：
  - source: the name of the directory containing the database（数据库名称）
  - batch_size: the number of inputs to process at one time（每次处理的输入的数据量）
- 可选：
  - rand_skip: skip up to this number of inputs at the beginning; useful for asynchronous sgd（在开始的时候跳过这个数值量的输入；这对于异步随机梯度下降是非常有用的）
  - backend [default LEVELDB]: choose whether to use a LEVELDB or LMDB（选择使用LEVELDB 数据库还是LMDB数据库，默认为LEVELDB）

In-Memory

类型（type）：MemoryData
参数：
- 必要：
  - batch_size, channels, height, width: specify the size of input chunks to read from memory（4个值，确定每次读取输入数据量的大小）

HDF5 Input

类型（type）：HDF5Data
参数：
- 必要：
  - source: the name of the file to read from（读取的文件的名称）
  - batch_size（每次处理的输入的数据量）

HDF5 Output

类型（type）：HDF5Output
参数：
- 必要：
  - file_name: name of file to write to（写入的文件的名称）
HDF5 output layer与这部分的其他layer的功能正好相反，不是读取而是写入。

Images

类型（type）：ImageData
参数：
- 必要：
  - source: name of a text file, with each line giving an image filename and label（一个text文件的名称，每一行指定一个image文件名和label）
  - batch_size: number of images to batch together（每次处理的image的数据）
- 可选：
  - rand_skip: （在开始的时候跳过这个数值量的输入）
  - shuffle [default false]（是否随机乱序，默认为否）
    -new_height, new_width: if provided, resize all images to this size（缩放所有的image到新的大小）

Windows

类型（type）：WindowData
（没有详解）

Dummy

类型（type）：DummyData

DummyData 用于开发和测试，详见DummyDataParameter（没有给出链接）。

Common Layers

Inner Product

类型（type）：Inner Product（全连接层）
CPU 实现： ./src/caffe/layers/inner_product_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/inner_product_layer.cu
参数（inner_product_param）：
- 必要：
  - num_output (c_o): the number of filters（滤波器数目）
- 强烈推荐：
  - weight_filler [default type: ‘constant’ value: 0]（滤波器权重；默认类型为constant，默认值为0）
- 可选：
  - bias_filler [default type: ‘constant’ value: 0]（bias-偏置项的值，默认类型为constant，默认值为0）
  - bias_term [default true]: specifies whether to learn and apply a set of additive biases to the filter outputs（是否添加bias-偏置项，默认为True）
输入（Input）
- n * c_i * h_i * w_i
输出（Output）
- n * c_o * 1 * 1
例子

layer {
  name: "fc8"                              # 名称：fc8
  type: "InnerProduct"                     # 类型：全连接层
  # 权重（weights）的学习速率因子和衰减因子
  param { lr_mult: 1 decay_mult: 1 }
  # 偏置项（biases）的学习速率因子和衰减因子
  param { lr_mult: 2 decay_mult: 0 }
  inner_product_param {
    num_output: 1000                       # 1000个滤波器（filters）
    weight_filler {
      type: "gaussian"                     # 初始化高斯滤波器（Gaussian）
      std: 0.01                            # 标准差为0.01， 均值默认为0
    }
    bias_filler {
      type: "constant"                     # 初始化偏置项（bias）为零
      value: 0
    }
  }
  bottom: "fc7"                            # 输入层：fc7
  top: "fc8"                               # 输出层：fc8
}1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21

InnerProduct layer（常被称为全连接层）将输入视为一个vector，输出也是一个vector（height和width被设为1）

Splitting

类型（type）：Split

Split layer用于将一个输入的blob分离成多个输出的blob。这用于当需要将一个blob输入至多个输出layer时。

Flattening

类型（type）：Flatten

Flatten layer用于把一个维度为n * c * h * w的输入转化为一个维度为 n * (c*h*w)的向量输出。

Reshape

类型（type）：Reshape
CPU 实现： ./src/caffe/layers/reshape_layer.cpp
CUDA、GPU实现：尚无
参数（reshape_param）：
- 可选：
  - shape（改变后的维度，详见下面解释）
输入（Input）
- a single blob with arbitrary dimensions（一个任意维度的blob）
输出（Output）
- the same blob, with modified dimensions, as specified by reshape_param（相同内容的blob，但维度根据reshape_param改变）
例子

 layer {
    name: "reshape"                       # 名称：reshape
    type: "Reshape"                       # 类型：Reshape
    bottom: "input"                       # 输入层名称：input
    top: "output"                         # 输出层名称：output
    reshape_param {
      shape {
        dim: 0  # 这个维度与输入相同
        dim: 2
        dim: 3
        dim: -1 # 根据其他维度自动推测
      }
    }
  }1
2
3
4
5
6
7
8
9
10
11
12
13
14

Reshape layer只改变输入数据的维度，但内容不变，也没有数据复制的过程，与Flatten layer类似。

输出维度由reshape_param 指定，正整数直接指定维度大小，下面两个特殊的值：

0 => 表示copy the respective dimension of the bottom layer，复制输入相应维度的值。
-1 => 表示infer this from the other dimensions，根据其他维度自动推测维度大小。reshape_param中至多只能有一个-1。

再举一个例子：如果指定reshape_param参数为：{ shape { dim: 0 dim: -1 } } ，那么输出和Flattening layer的输出是完全一样的。

Concatenation

类型（type）：Concat（连结层）
CPU 实现： ./src/caffe/layers/concat_layer.cpp
CUDA、GPU实现： ./src/caffe/layers/concat_layer.cu
参数（concat_param）：
- 可选：
  - axis [default 1]: 0 for concatenation along num and 1 for channels.（0代表连结num，1代表连结channel）
输入（Input）
-n_i * c_i * h * w for each input blob i from 1 to K.（第i个blob的维度是n_i * c_i * h * w，共K个）
输出（Output）
- if axis = 0: (n_1 + n_2 + … + n_K) * c_1 * h * w, and all input c_i should be the same.（axis = 0时，输出 blob的维度为(n_1 + n_2 + … + n_K) * c_1 * h * w，要求所有的input的channel相同）
- if axis = 1: n_1 * (c_1 + c_2 + … + c_K) * h * w, and all input n_i should be the same.（axis = 0时，输出 blob的维度为n_1 * (c_1 + c_2 + … + c_K) * h * w，要求所有的input的num相同）
例子

layer {
  name: "concat"
  bottom: "in1"
  bottom: "in2"
  top: "out"
  type: "Concat"
  concat_param {
    axis: 1
  }
}1
2
3
4
5
6
7
8
9
10

Concat layer用于把多个输入blob连结成一个输出blob。

Slicing

Slice layer用于将一个input layer分割成多个output layers，根据给定的维度（目前只能指定num或者channel）。

类型（type）：Slice
例子

layer {
  name: "slicer_label"
  type: "Slice"
  bottom: "label"
  ## 假设label的维度是：N x 3 x 1 x 1
  top: "label1"
  top: "label2"
  top: "label3"
  slice_param {
    axis: 1                        # 指定维度为channel
    slice_point: 1                 # 将label[~][1][~][~]赋给label1
    slice_point: 2                 # 将label[~][2][~][~]赋给label2
                                   # 将label[~][3][~][~]赋给label3
  }
}1
2
3
4
5
6
7
8
9
10
11
12
13
14
15

axis表明是哪一个维度，slice_point是该维度的索引，slice_point的数量必须是top blobs的数量减1.

Elementwise Operations

类型（type）： Eltwise
（没有详解）

Argmax

类型（type）：ArgMax
（没有详解）

Softmax

类型（type）：Softmax
（没有详解）

Mean-Variance Normalization

类型（type）：MVN
（没有详解）

你可能感兴趣的:(deep,learning)

deepin 系统网络信息查看指南 deepin
deepin系统网络信息查看指南在Linux操作系统，如deepin和Ubuntu中，我们可以通过多种shell命令来查看网络信息和网络状态。本文将介绍这些命令，帮助您更好地理解和监控您的网络环境。1.ifconfig命令ifconfig是查看所有网卡信息的命令，但已被弃用，推荐使用ip命令。ifconfig2.ip命令ip命令用于查看所有网卡的信息。#查看所有接口信息：ipaddrshow#查看
什么是多模态机器学习：跨感知融合的智能前沿非凡暖阳人工智能神经网络
在人工智能的广阔天地里，多模态机器学习（MultimodalMachineLearning）作为一项前沿技术，正逐步解锁人机交互和信息理解的新境界。它超越了单一感官输入的限制，通过整合视觉、听觉、文本等多种数据类型，构建了一个更加丰富、立体的认知模型，为机器赋予了接近人类的综合感知与理解能力。本文将深入探讨多模态机器学习的定义、核心原理、关键技术、面临的挑战以及未来的应用前景，旨在为读者勾勒出这一
Python|基于DeepSeek大模型，实现文本内容仿写（8）写python的鑫哥 AI大模型实战应用人工智能 python 大模型 DeepSeek Kimi 文本仿写
前言本文是该专栏的第8篇，后面会持续分享AI大模型干货知识，记得关注。我们在处理文本数据项目的时候，有时可能会遇到这样的需求。比如说，指定某些文本模板样例，需要仿写或者生成该“模板”样例数据。再或者说，通过给予某些指定类型的关键词，生成关键词相关领域的文本素材或内容。如果单单投入人力去完成，这肯定是没问题，但耗费的更多是人力成本。而现阶段，对于这种需求，大大可以选择大模型去完成。而本文，笔者将基于
像素空间文生图之Imagen原理详解 funNLPer AI算法 Imagen stable diffusion AIGC
论文：PhotorealisticText-to-ImageDiffusionModelswithDeepLanguageUnderstanding项目地址：https://imagen.research.google/代码（非官方）：https://github.com/deep-floyd/IF模型权重：https://huggingface.co/DeepFloyd/IF-I-XL-v1.0
::v-deep的理解记得早睡~ vue.js 前端 javascript
vue样式穿透在刚开始使用element-ui组件库时，想要修改其内部的样式，但总是不生效，通过查询资料，了解到了深度作用选择器。如果希望scoped样式中的一个选择器能够作用得“更深”，例如影响子组件，可以使用>>>操作符：.a>>>.b{width:100%;height:100%;background:red;}但是像scss等预处理器却无法解析>>>，所以我们使用下面的方式：.a{/dee
蓝桥杯真题 - 公因数匹配 - 题解 ExRoc 蓝桥杯算法 c++
题目链接：https://www.lanqiao.cn/problems/3525/learning/个人评价：难度2星（满星：5）前置知识：调和级数整体思路题目描述不严谨，没说在无解的情况下要输出什么（比如nnn个111），所以我们先假设数据保证有解；从222到10610^6106枚举xxx作为约数，对于约数xxx去扫所有xxx的倍数，总共需要扫n2+n3+n4+⋯+nn≈nln⁡n\frac{
蓝桥杯真题 - 子树的大小 - 题解 ExRoc 蓝桥杯算法 c++
题目链接：https://www.lanqiao.cn/problems/3526/learning/个人评价：难度2星（满星：5）前置知识：无整体思路整体将节点编号−1-1−1，通过找规律可以发现，节点iii下一层最左边的节点编号是im+1im+1im+1，最右边的节点编号是im+mim+mim+m；用l,rl,rl,r分别标记当前层子树的最小节点编号与最大节点编号，每次让最左边的节点往下一层的
C#遇见TensorFlow.NET：开启机器学习的全新时代墨夶 C#学习资料1 机器学习 c#tensorflow
在当今快速发展的科技世界里，机器学习（MachineLearning,ML）已经成为推动创新的重要力量。从个性化推荐系统到自动驾驶汽车，ML的应用无处不在。对于那些习惯于使用C#进行开发的程序员来说，将机器学习集成到他们的项目中似乎是一项具有挑战性的任务。但随着TensorFlow.NET的出现，这一切变得不再困难。今天，我们将一起探索如何利用这一强大的工具，在熟悉的.NET环境中轻松构建、训练和
【JVM】—G1 GC日志详解一棵___大树 JVM jvm
G1GC日志详解⭐⭐⭐⭐⭐⭐Github主页https://github.com/A-BigTree笔记链接https://github.com/A-BigTree/Code_Learning⭐⭐⭐⭐⭐⭐如果可以，麻烦各位看官顺手点个star~文章目录G1GC日志详解1G1GC周期2G1日志开启与设置3YoungGC日志4MixedGC5FullGC关于G1回收器的前置知识点：【JVM】—深入理解
NLP 中文拼写检测纠正论文-04-Learning from the Dictionary 后端java
拼写纠正系列NLP中文拼写检测实现思路NLP中文拼写检测纠正算法整理NLP英文拼写算法，如果提升100W倍的性能？NLP中文拼写检测纠正Paperjava实现中英文拼写检查和错误纠正？可我只会写CRUD啊！一个提升英文单词拼写检测性能1000倍的算法？单词拼写纠正-03-leetcodeedit-distance72.力扣编辑距离NLP开源项目nlp-hanzi-similar汉字相似度word-
【已解决】ImportError: libnvinfer.so.8: cannot open shared object file: No such file or directory 小小小小祥 python
问题描述：按照tensorrt官方安装文档：https://docs.nvidia.com/deeplearning/tensorrt/install-guide/index.html#installing-tar安装完成后，使用python测试导入tensorrtimporttensorrt上述代码报错：Traceback(mostrecentcalllast):File“main.py”,li
ASPICE 4.0引领自动驾驶未来：机器学习模型的特点与实践亚远景aspice 机器学习自动驾驶人工智能
ASPICE4.0-ML机器学习模型是针对汽车行业，特别是在汽车软件开发中，针对机器学习（MachineLearning,ML）应用的特定标准和过程。ASPICE（AutomotiveSPICE）是一种基于软件控制的系统开发过程的国际标准，旨在提升软件开发过程的质量、效率和可靠性。ASPICE4.0中的ML模型部分则进一步细化了机器学习在汽车软件开发中的具体要求和流程。以下是对ASPICE4.0-
利用Python运行Ansys Apdl ssssasda ansys apdl 流处理批处理 python
Ansys流处理1.学习资源2.版本要求3.pymapdl安装流程4.初始设置和本地启动mapdl5.PyMAPDL语法6.工具库7.与window的交互接口1.学习资源Ansys官网：https://www.ansys.com/zh-cnAnsysAcademic（Ansys学术）:https://www.ansys.com/zh-cn/academicAnsysLearningForum（An
DeepSeek V3：新一代开源 AI 模型，多语言编程能力卓越 that's boy 人工智能 chatgpt openai claude midjourney deepseek-v3
DeepSeekV3横空出世，以其强大的多语言编程能力和先进的技术架构，引发了业界的广泛关注。这款最新的AI模型不仅在性能上实现了质的飞跃，还采用了开源策略，为广大开发者提供了更广阔的探索空间。本文将深入解析DeepSeekV3的技术原理、主要功能、性能表现及应用场景，带您全面了解这款新一代AI模型。DeepSeekV3的核心亮点DeepSeekV3是一款基于混合专家（MoE）架构的大型语言模型，
深度剖析 DeepSeek V3 技术报告：架构创新与卓越性能表现微凉的衣柜科技头条人工智能大模型语言模型
随着人工智能（AI）技术的不断发展，各种大规模语言模型（LLM）层出不穷，DeepSeekV3作为其中的一员，凭借其出色的性能表现和创新的架构设计，吸引了广泛关注。本文将通过对官方发布的DeepSeekV3技术报告的深入解析，从多个维度剖析DeepSeekV3如何通过先进的技术手段，在保持性能卓越的同时优化计算和内存开销。一、性能卓越，超越同行DeepSeekV3在多个权威基准测试中展现了强大的性
【机器学习：三十二、强化学习：理论与应用】 KeyPan 机器学习机器学习机器人人工智能深度学习数据挖掘
1.强化学习概述**强化学习（ReinforcementLearning,RL）**是一种机器学习方法，旨在通过试验与反馈的交互，使智能体（Agent）在动态环境中学习决策策略，以最大化累积奖励（CumulativeReward）。相比监督学习和无监督学习，强化学习更关注长期目标，而非简单地从标签中学习。核心概念智能体（Agent）：进行学习和决策的主体。环境（Environment）：智能体所在
第三讲隐语架构 huang8666 人工智能
第三讲隐语架构产品层白屏黑屏两大模块通过可视化产品，降低终端用户的体验和演示成本通过模块化API降低技术集成商的研发成本隐语产品SecretPad：轻量化安装快速验证POC可定制集成SecretNote：Notebook形式交互式建模多节点一站式管理和交互运行状态跟踪算法层PSI/PIR、DataAnalysis、FederatedLearningPSI（PrivateSetIntesection
DeepSeek V3 ChatGPT 国产AI他来啦 Ag大雨人工智能 ai
国产开源之光app：DeepSeekV3强势出圈！各位技术爱好者们，今天必须给大家安利DeepSeekV3，它堪称开源AI领域横空出世的超级新星！研发团队以卓越智慧，用极低的成本打造出这一世界级AI，惊艳全球，让无数业内大佬都为之侧目，妥妥的“国产骄傲”。它的功能堪称全能，日常写作、翻译、问答轻松拿捏，独特的“深度思考”模式加上联网搜索，在编程、解题、文献解读等复杂任务里也游刃有余，推理思考能力一
Python机器学习之XGBoost从入门到实战(基本理论说明) 雪域枫蓝 Python Atificial Intelligence 机器学习 python 分布式
Xgboost从基础到实战XGBoost:eXtremeGradientBoosting*应用机器学习领域的一个强有力的工具*GradientBootingMachines(GBM)的优化表现，快速有效—深盟分布式机器学习开源平台(DistributedmachinelearningCommunity，DMLC)的分支—DMLC也开源流行的深度学习库mxnet*GBM：Machine：机器学习模型
YOLOv10-1.1部分代码阅读笔记-base.py 红色的山茶花 YOLO 笔记深度学习
base.pyultralytics\data\base.py目录base.py1.所需的库和模块2.classBaseDataset(Dataset):1.所需的库和模块#UltralyticsYOLO,AGPL-3.0licenseimportglobimportmathimportosimportrandomfromcopyimportdeepcopyfrommultiprocessing.
AUTOSAR汽车电子嵌入式编程精讲300篇-智能网联汽车CAN总线-基于电压信号的CAN总线入侵检测系统设计与实现格图素书汽车网络
目录前言入侵检测系统研究现状入侵检测系统建模CAN总线入侵检测威胁模型DeepSVDD模型入侵检测系统方案设计挑战和解决方案差分信号的采集与处理差分信号的特征提取入侵检测模型的设计入侵检测系统性能评估实验环境设置不同的车辆状态不同数量的攻击目标不同发送频率的攻击消息DeepSVDD模型与SVDD模型的比较本文篇幅较长，分为多篇，文章索引详见智能网联汽车CAN总线-发展现状智能网联汽车CAN总线-智
Windows 11安装DeepSpeed报错（Unable to pre-compile async_io）问题解决 happy coding windows gpt
Windows11安装DeepSpeed报错（Unabletopre-compileasync_io）问题解决报错如下Preparingmetadata(setup.py)...errorerror:subprocess-exited-with-error×pythonsetup.pyegg_infodidnotrunsuccessfully.│exitcode:1╰─>[17linesofout
机器学习和深度学习的概念你好呀我是裤裤深度学习笔记机器学习深度学习人工智能
MachineLearning机器学习，可以看作是找一个函数。这个函数是人类找不到的，所以交给机器来找。DifferenttypesofFunctions**Regression：**函数的输出是一个数值forexample：**Classification：**给出选项，让机器去选择。forexample：检测一个邮件是不是垃圾文件，就可以通过这个来做。选项是两个：垃圾文件or非垃圾文件。下面，
PLUTO：突破基于模仿学习的自动驾驶规划极限硅谷秋水机器学习自动驾驶人工智能自动驾驶人工智能机器学习计算机视觉
24年4月来自香港科技大学的论文“PLUTO:PushingtheLimitofImitationLearning-basedPlanningforAutonomousDriving”。PLUTO，突破基于模仿学习的自动驾驶规划极限。改进来自三个关键方面：一种纵向横向感知模型架构，可实现灵活多样的驾驶行为；一种创新的辅助损失计算方法，可广泛应用且可高效地进行批量计算；一种利用对比学习的训练框架，采
拿下美赛M奖之必备软件和网站！东方建模. 数学建模
目录前言：一.题目翻译与理解：DeepL+知云文献翻译二.查找文献：国内外平台结合使用三.论文撰写：Word或LaTeX+Overleaf四.公式输入与思维导图：MathType+XMind五.阅读文献与文献管理：AdobeReader+Zotero六.模型求解与编程：MATLAB+Python+Lingo七.图形绘制与结果可视化：MATLAB+Python+Origin八.流程图与示意图：亿图图
官宣开源阿里云与清华大学共建AI大模型推理项目Mooncake 阿里云大模型
2024年6月，国内优质大模型应用月之暗面Kimi与清华大学MADSys实验室（MachineLearning,AI,BigDataSystemsLab）联合发布了以KVCache为中心的大模型推理架构Mooncake。通过使用以KVCache为中心的PD分离和以存换算架构，大幅提升大模型应用Kimi智能助手推理吞吐的同时有效降低了推理成本，自发布以来受到业界广泛关注。近日，清华大学和研究组织9#
【机器学习】主动学习-增加标签的操作方法-样本池采样（Pool-Based Sampling） IT古董机器学习机器学习学习人工智能
Pool-BasedSamplingPool-basedsampling是一种主动学习（ActiveLearning）方法，与流式选择性采样不同，它假设有一个预先定义的未标注样本池，算法从中选择最有价值的样本进行标注，以提升模型的性能。这种方法广泛应用于需要人工标注的场景，例如文本分类、图像识别等。核心思想预先准备一个未标注数据池（UnlabeledDataPool）。使用初始标注数据训练一个模型
deepin 中 apt 与 dpkg 安装包管理工具的区别慵懒的猫mi linux deepin 运维
在Linux系统中，尤其是基于Debian的发行版如Ubuntu和Deepin，apt和dpkg是两种常用的包管理工具。它们在功能和使用场景上有一些显著的区别。本文将详细介绍这两种工具的主要区别以及它们的常用命令。1.主要区别1.1dpkg功能：dpkg侧重于本地软件包的管理。它主要用于安装、删除和查询本地的.deb文件。依赖管理：dpkg不会自动处理依赖关系。如果安装的包有依赖，需要手动安装这些
DeepSeek Artifacts：前端开发的新利器人工智能
DeepSeekArtifacts：前端开发的新利器人工智能领域创新不断，DeepSeekV3便是其中备受瞩目的工具之一。这款轻量级模型凭借在大语言模型（LLM）排行榜上的优异表现，以及亲民的价格和卓越的性能，在人工智能社区中广受关注。然而，它的姊妹工具DeepSeekArtifacts却因截然不同的缘由引发了热议。在本文中，我们将深入探究DeepSeekArtifacts。这是HuggingFa
6850亿参数混合专家(MoE)架构开源大模型！Deepseek V3全方位客观评测文档处理、逻辑推理、算法编程等多维度的真实能力水平！是卓越还是拉胯？真能超越Claude还是言过其实？ AI超元域 ai AI编程
本篇笔记所对应的视频：6850亿参数混合专家(MoE)架构开源大模型！DeepseekV3全方位客观评测文档处理、逻辑推理、算法编程等多维度的真实能力水平！是卓越还是拉胯？_哔哩哔哩_bilibiliDeepseek发布了最新Deepseekv3大模型，现在在huggingface上可以下载模型的权重文件了。而且我们还可以在Deepseek的官方直接使用v3模型。由于官方还没有发布详细的参数介绍，
ztree设置禁用节点 3213213333332132 JavaScript ztree json setDisabledNode Ajax
ztree设置禁用节点的时候注意，当使用ajax后台请求数据,必须要设置为同步获取数据，否者会获取不到节点对象，导致设置禁用没有效果。 $(function(){ showTree(); setDisabledNode(); });
JVM patch by Taobao bookjovi java HotSpot
在网上无意中看到淘宝提交的hotspot patch，共四个，有意思，记录一下。 7050685：jsdbproc64.sh has a typo in the package name 7058036：FieldsAllocationStyle=2 does not work in 32-bit VM 7060619：C1 should respect inline and
将session存储到数据库中 dcj3sjt126com sql PHP session
CREATE TABLE sessions ( id CHAR(32) NOT NULL, data TEXT, last_accessed TIMESTAMP NOT NULL, PRIMARY KEY (id) ); <?php /** * Created by PhpStorm. * User: michaeldu * Date
Vector 171815164 vector
public Vector<CartProduct> delCart(Vector<CartProduct> cart, String id) { for (int i = 0; i < cart.size(); i++) { if (cart.get(i).getId().equals(id)) { cart.remove(i);
各连接池配置参数比较 g21121 连接池
排版真心费劲，大家凑合看下吧，见谅~ Druid DBCP C3P0 Proxool 数据库用户名称 Username Username User 数据库密码 Password Password Password 驱动名
[简单]mybatis insert语句添加动态字段 53873039oycg mybatis
mysql数据库,id自增,配置如下： <insert id="saveTestTb" useGeneratedKeys="true" keyProperty="id" parameterType=&
struts2拦截器配置云端月影 struts2拦截器
struts2拦截器interceptor的三种配置方法方法1. 普通配置法 <struts> <package name="struts2" extends="struts-default"> &
IE中页面不居中，火狐谷歌等正常 aijuans IE中页面不居中
问题是首页在火狐、谷歌、所有IE中正常显示，列表页的页面在火狐谷歌中正常，在IE6、7、8中都不中，觉得可能那个地方设置的让IE系列都不认识，仔细查看后发现，列表页中没写HTML模板部分没有添加DTD定义，就是<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3
String,int,Integer,char 几个类型常见转换 antonyup_2006 html sql .net
如何将字串 String 转换成整数 int? int i = Integer.valueOf(my_str).intValue(); int i=Integer.parseInt(str); 如何将字串 String 转换成Integer ? Integer integer=Integer.valueOf(str); 如何将整数 int 转换成字串 String ? 1.
PL/SQL的游标类型百合不是茶显示游标(静态游标)隐式游标游标的更新和删除 %rowtype ref游标(动态游标)
游标是oracle中的一个结果集,用于存放查询的结果; PL/SQL中游标的声明; 1,声明游标 2,打开游标(默认是关闭的); 3,提取数据 4,关闭游标注意的要点:游标必须声明在declare中,使用open打开游标,fetch取游标中的数据,close关闭游标隐式游标:主要是对DML数据的操作隐
JUnit4中@AfterClass @BeforeClass @after @before的区别对比 bijian1013 JUnit4 单元测试
一.基础知识 JUnit4使用Java5中的注解（annotation），以下是JUnit4常用的几个annotation： @Before：初始化方法对于每一个测试方法都要执行一次（注意与BeforeClass区别，后者是对于所有方法执行一次）@After：释放资源对于每一个测试方法都要执行一次（注意与AfterClass区别，后者是对于所有方法执行一次
精通Oracle10编程SQL(12)开发包 bijian1013 oracle 数据库 plsql
/* *开发包 *包用于逻辑组合相关的PL/SQL类型（例如TABLE类型和RECORD类型）、PL/SQL项（例如游标和游标变量）和PL/SQL子程序（例如过程和函数） */ --包用于逻辑组合相关的PL/SQL类型、项和子程序，它由包规范和包体两部分组成 --建立包规范：包规范实际是包与应用程序之间的接口，它用于定义包的公用组件，包括常量、变量、游标、过程和函数等 --在包规
【EhCache二】ehcache.xml配置详解 bit1129 ehcache.xml
在ehcache官网上找了多次，终于找到ehcache.xml配置元素和属性的含义说明文档了，这个文档包含在ehcache.xml的注释中！ ehcache.xml ： http://ehcache.org/ehcache.xml ehcache.xsd ： http://ehcache.org/ehcache.xsd ehcache配置文件的根元素是ehcahe ehcac
java.lang.ClassNotFoundException: org.springframework.web.context.ContextLoaderL 白糖_ java eclipse spring tomcat Web
今天学习spring+cxf的时候遇到一个问题：在web.xml中配置了spring的上下文监听器： <listener> <listener-class>org.springframework.web.context.ContextLoaderListener</listener-class> </listener> 随后启动
angular.element boyitech AngularJS AngularJS API angular.element
angular.element 描述: 包裹着一部分DOM element或者是HTML字符串，把它作为一个jQuery元素来处理。（类似于jQuery的选择器啦）如果jQuery被引入了，则angular.element就可以看作是jQuery选择器，选择的对象可以使用jQuery的函数；如果jQuery不可用，angular.e
java-给定两个已排序序列，找出共同的元素。 bylijinnan java
import java.util.ArrayList; import java.util.Arrays; import java.util.List; public class CommonItemInTwoSortedArray { /** * 题目：给定两个已排序序列，找出共同的元素。 * 1.定义两个指针分别指向序列的开始。 * 如果指向的两个元素
sftp 异常，有遇到的吗？求解 Chen.H java jcraft auth jsch jschexception
com.jcraft.jsch.JSchException: Auth cancel at com.jcraft.jsch.Session.connect(Session.java:460) at com.jcraft.jsch.Session.connect(Session.java:154) at cn.vivame.util.ftp.SftpServerAccess.connec
[生物智能与人工智能]神经元中的电化学结构代表什么? comsci 人工智能
我这里做一个大胆的猜想,生物神经网络中的神经元中包含着一些化学和类似电路的结构,这些结构通常用来扮演类似我们在拓扑分析系统中的节点嵌入方程一样,使得我们的神经网络产生智能判断的能力,而这些嵌入到节点中的方程同时也扮演着"经验"的角色.... 我们可以尝试一下...在某些神经
通过LAC和CID获取经纬度信息 dai_lm lac cid
方法1：用浏览器打开http://www.minigps.net/cellsearch.html，然后输入lac和cid信息(mcc和mnc可以填0)，如果数据正确就可以获得相应的经纬度方法2：发送HTTP请求到http://www.open-electronics.org/celltrack/cell.php?hex=0&lac=<lac>&cid=&
JAVA的困难分析 datamachine java
前段时间转了一篇SQL的文章（http://datamachine.iteye.com/blog/1971896），文章不复杂，但思想深刻，就顺便思考了一下java的不足，当砖头丢出来，希望引点和田玉。 -----------------------------------------------------------------------------------------
小学5年级英语单词背诵第二课 dcj3sjt126com english word
money 钱 paper 纸 speak 讲，说 tell 告诉 remember 记得，想起 knock 敲，击，打 question 问题 number 数字，号码 learn 学会，学习 street 街道 carry 搬运，携带 send 发送，邮寄，发射 must 必须 light 灯，光线，轻的 front
linux下面没有tree命令 dcj3sjt126com linux
centos p安装 yum -y install tree mac os安装 brew install tree 首先来看tree的用法 tree 中文解释：tree 功能说明：以树状图列出目录的内容。语　　法：tree [-aACdDfFgilnNpqstux][-I <范本样式>][-P <范本样式
Map迭代方式，Map迭代，Map循环蕃薯耀 Map循环 Map迭代 Map迭代方式
Map迭代方式，Map迭代，Map循环 >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> 蕃薯耀 2015年
Spring Cache注解+Redis hanqunfeng spring
Spring3.1 Cache注解依赖jar包：  <dependency> <groupId>org.springframework.data</groupId> <artifactId>spring-data-redis</artifactId>
Guava中针对集合的 filter和过滤功能 jackyrong filter
在guava库中，自带了过滤器(filter)的功能，可以用来对collection 进行过滤，先看例子： @Test public void whenFilterWithIterables_thenFiltered() { List<String> names = Lists.newArrayList("John"
学习编程那点事 lampcy 编程 android PHP html5
一年前的夏天，我还在纠结要不要改行，要不要去学php？能学到真本事吗？改行能成功吗？太多的问题，我终于不顾一切，下定决心，辞去了工作，来到传说中的帝都。老师给的乘车方式还算有效，很顺利的就到了学校，赶巧了，正好学校搬到了新校区。先安顿了下来，过了个轻松的周末，第一次到帝都，逛逛吧！接下来的周一，是我噩梦的开始，学习内容对我这个零基础的人来说，除了勉强完成老师布置的作业外，我已经没有时间和精力去
架构师之流处理---------bytebuffer的mark,limit和flip nannan408 ByteBuffer
1.前言。如题，limit其实就是可以读取的字节长度的意思，flip是清空的意思，mark是标记的意思。 2.例子. 例子代码: String str = "helloWorld"; ByteBuffer buff = ByteBuffer.wrap(str.getBytes()); Sy
org.apache.el.parser.ParseException: Encountered " ":" ": "" at line 1, column 1 Everyday都不同 $转义 el表达式
最近在做Highcharts的过程中，在写js时，出现了以下异常：严重: Servlet.service() for servlet jsp threw exception org.apache.el.parser.ParseException: Encountered " ":" ": "" at line 1,
用Java实现发送邮件到163 tntxia java实现
/* 在java版经常看到有人问如何用javamail发送邮件？如何接收邮件？如何访问多个文件夹等。问题零散，而历史的回复早已经淹没在问题的海洋之中。本人之前所做过一个java项目，其中包含有WebMail功能，当初为用java实现而对javamail摸索了一段时间，总算有点收获。看到论坛中的经常有此方面的问题，因此把我的一些经验帖出来，希望对大家有些帮助。此篇仅介绍用
探索实体类存在的真正意义 java小叶檀 POJO
一. 实体类简述实体类其实就是俗称的POJO,这种类一般不实现特殊框架下的接口，在程序中仅作为数据容器用来持久化存储数据用的 POJO（Plain Old Java Objects）简单的Java对象它的一般格式就是 public class A{ private String id; public Str