The softmaxWithLoss layer in Caffe is effectively:
softmaxWithLoss = Multinomial Logistic Loss Layer + Softmax Layer
Its core formula is:
$$ L = -\log \hat{y}_k = -\log \frac{e^{z_k - m}}{\sum_{j} e^{z_j - m}}, \qquad m = \max_{j} z_j $$
where $\hat{y}_k$ is the softmax probability of the labelled class, $k$ is the index of the neuron corresponding to the ground-truth label of the input image, and $m$ is the maximum of the outputs, subtracted mainly for numerical stability.
During backpropagation, differentiating with respect to the input $z_j$ gives:
$$ \frac{\partial L}{\partial z_j} = \hat{y}_j - \mathbb{1}\{j = k\} $$
that is, the softmax probability itself, minus one at the ground-truth class.
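As a quick sanity check with made-up numbers: for logits $z = (2.0,\ 1.0,\ 0.1)$ and label $k = 0$, we have $m = 2.0$, softmax probabilities $\hat{y} \approx (0.659,\ 0.242,\ 0.099)$, a loss of $-\log 0.659 \approx 0.417$, and a gradient of $\hat{y} - (1, 0, 0) \approx (-0.341,\ 0.242,\ 0.099)$.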
Its usage in a Caffe prototxt looks like this:
layer {
  name: "loss"
  type: "SoftmaxWithLoss"
  bottom: "fc8"
  bottom: "label"
  top: "loss"
}
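As the header documentation quoted below points out, at test time the loss layer can simply be replaced by a plain Softmax layer that exposes the class probabilities. A deploy-time definition might look like this (assuming the same "fc8" bottom as above):
layer {
  name: "prob"
  type: "Softmax"
  bottom: "fc8"
  top: "prob"
}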
The loss parameters of the SoftmaxWithLoss layer in Caffe (from caffe.proto) are as follows:
// Message that stores parameters shared by loss layers
message LossParameter {
// If specified, ignore instances with the given label.
// ignore instances carrying this label
optional int32 ignore_label = 1;
// How to normalize the loss for loss layers that aggregate across batches,
// spatial dimensions, or other dimensions. Currently only implemented in
// SoftmaxWithLoss and SigmoidCrossEntropyLoss layers.
enum NormalizationMode {
// Divide by the number of examples in the batch times spatial dimensions.
// Outputs that receive the ignore label will NOT be ignored in computing
// the normalization factor.
// the loss of one forward pass is divided by the total number of labels
FULL = 0;
// Divide by the total number of output locations that do not take the
// ignore_label. If ignore_label is not set, this behaves like FULL.
// the loss of one forward pass is divided by the number of valid (non-ignored) labels
VALID = 1;
// Divide by the batch size.
// divide by the batch size
BATCH_SIZE = 2;
// Do not normalize the loss.
NONE = 3;
}
// For historical reasons, the default normalization for
// SigmoidCrossEntropyLoss is BATCH_SIZE and *not* VALID.
optional NormalizationMode normalization = 3 [default = VALID];
// Deprecated. Ignored if normalization is specified. If normalization
// is not specified, then setting this to false will be equivalent to
// normalization = BATCH_SIZE to be consistent with previous behavior.
// if normalize == false, this is equivalent to normalization = BATCH_SIZE;
// if normalize == true, it is equivalent to normalization = VALID
optional bool normalize = 2;
}
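For example, a segmentation-style setup that ignores a padding label and normalizes over the valid labels could set loss_param like this (the blob names and the ignore value 255 are only illustrative):
layer {
  name: "loss"
  type: "SoftmaxWithLoss"
  bottom: "score"
  bottom: "label"
  top: "loss"
  loss_param {
    ignore_label: 255
    normalization: VALID
  }
}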
Now let's look at the SoftmaxWithLoss header file:
#ifndef CAFFE_SOFTMAX_WITH_LOSS_LAYER_HPP_
#define CAFFE_SOFTMAX_WITH_LOSS_LAYER_HPP_
#include <vector>
#include "caffe/blob.hpp"
#include "caffe/layer.hpp"
#include "caffe/proto/caffe.pb.h"
#include "caffe/layers/loss_layer.hpp"
#include "caffe/layers/softmax_layer.hpp"
namespace caffe {
/**
* @brief Computes the multinomial logistic loss for a one-of-many
* classification task, passing real-valued predictions through a
* softmax to get a probability distribution over classes.
*
* This layer should be preferred over separate
* SoftmaxLayer + MultinomialLogisticLossLayer
* as its gradient computation is more numerically stable.
* At test time, this layer can be replaced simply by a SoftmaxLayer.
*
* @param bottom input Blob vector (length 2)
* -# @f$ (N \times C \times H \times W) @f$
* the predictions @f$ x @f$, a Blob with values in
* @f$ [-\infty, +\infty] @f$ indicating the predicted score for each of
* the @f$ K = CHW @f$ classes. This layer maps these scores to a
* probability distribution over classes using the softmax function
* @f$ \hat{p}_{nk} = \exp(x_{nk}) /
* \left[\sum_{k'} \exp(x_{nk'})\right] @f$ (see SoftmaxLayer).
* -# @f$ (N \times 1 \times 1 \times 1) @f$
* the labels @f$ l @f$, an integer-valued Blob with values
* @f$ l_n \in [0, 1, 2, ..., K - 1] @f$
* indicating the correct class label among the @f$ K @f$ classes
* @param top output Blob vector (length 1)
* -# @f$ (1 \times 1 \times 1 \times 1) @f$
* the computed cross-entropy classification loss: @f$ E =
* \frac{-1}{N} \sum\limits_{n=1}^N \log(\hat{p}_{n,l_n})
* @f$, for softmax output class probabilites @f$ \hat{p} @f$
*/
template <typename Dtype>
class SoftmaxWithLossLayer : public LossLayer<Dtype> {
public:
/**
* @param param provides LossParameter loss_param, with options:
* - ignore_label (optional)
* Specify a label value that should be ignored when computing the loss.
* - normalize (optional, default true)
* If true, the loss is normalized by the number of (nonignored) labels
* present; otherwise the loss is simply summed over spatial locations.
*/
explicit SoftmaxWithLossLayer(const LayerParameter& param)
: LossLayer<Dtype>(param) {}
virtual void LayerSetUp(const vector<Blob<Dtype>*>& bottom,
const vector<Blob<Dtype>*>& top);
virtual void Reshape(const vector<Blob<Dtype>*>& bottom,
const vector<Blob<Dtype>*>& top);
virtual inline const char* type() const { return "SoftmaxWithLoss"; }
virtual inline int ExactNumBottomBlobs() const { return -1; }
virtual inline int MinBottomBlobs() const { return 2; }
virtual inline int MaxBottomBlobs() const { return 3; }
virtual inline int ExactNumTopBlobs() const { return -1; }
virtual inline int MinTopBlobs() const { return 1; }
virtual inline int MaxTopBlobs() const { return 2; }
protected:
virtual void Forward_cpu(const vector<Blob<Dtype>*>& bottom,
const vector<Blob<Dtype>*>& top);
virtual void Forward_gpu(const vector<Blob<Dtype>*>& bottom,
const vector<Blob<Dtype>*>& top);
/**
* @brief Computes the softmax loss error gradient w.r.t. the predictions.
*
* Gradients cannot be computed with respect to the label inputs (bottom[1]),
* so this method ignores bottom[1] and requires !propagate_down[1], crashing
* if propagate_down[1] is set.
*
* @param top output Blob vector (length 1), providing the error gradient with
* respect to the outputs
* -# @f$ (1 \times 1 \times 1 \times 1) @f$
* This Blob's diff will simply contain the loss_weight* @f$ \lambda @f$,
* as @f$ \lambda @f$ is the coefficient of this layer's output
* @f$\ell_i@f$ in the overall Net loss
* @f$ E = \lambda_i \ell_i + \mbox{other loss terms}@f$; hence
* @f$ \frac{\partial E}{\partial \ell_i} = \lambda_i @f$.
* (*Assuming that this top Blob is not used as a bottom (input) by any
* other layer of the Net.)
* @param propagate_down see Layer::Backward.
* propagate_down[1] must be false as we can't compute gradients with
* respect to the labels.
* @param bottom input Blob vector (length 2)
* -# @f$ (N \times C \times H \times W) @f$
* the predictions @f$ x @f$; Backward computes diff
* @f$ \frac{\partial E}{\partial x} @f$
* -# @f$ (N \times 1 \times 1 \times 1) @f$
* the labels -- ignored as we can't compute their error gradients
*/
virtual void Backward_cpu(const vector<Blob<Dtype>*>& top,
const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom);
virtual void Backward_gpu(const vector<Blob<Dtype>*>& top,
const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom);
/// Read the normalization mode parameter and compute the normalizer based
/// on the blob size. If normalization_mode is VALID, the count of valid
/// outputs will be read from valid_count, unless it is -1 in which case
/// all outputs are assumed to be valid.
virtual Dtype get_normalizer(
LossParameter_NormalizationMode normalization_mode, Dtype valid_count);
/// The internal SoftmaxLayer used to map predictions to a distribution.
// declare the internal softmax layer
shared_ptr<Layer<Dtype> > softmax_layer_;
/// prob stores the output probability predictions from the SoftmaxLayer.
// stores the class probabilities output by the softmax layer
Blob<Dtype> prob_;
/// bottom vector holder used in call to the underlying SoftmaxLayer::Forward
// bottom blobs fed to the softmax layer's forward pass
vector<Blob<Dtype>*> softmax_bottom_vec_;
/// top vector holder used in call to the underlying SoftmaxLayer::Forward
// top blobs of the softmax layer's forward pass
vector<Blob<Dtype>*> softmax_top_vec_;
// Whether to ignore instances with a certain label.
// whether instances with a certain label should be ignored
bool has_ignore_label_;
/// The label indicating that an instance should be ignored.
int ignore_label_;
bool has_hard_ratio_;
float hard_ratio_;
bool has_hard_mining_label_;
int hard_mining_label_;
bool has_class_weight_;
Blob<Dtype> class_weight_;
Blob<Dtype> counts_;
Blob<Dtype> loss_;
/// How to normalize the output loss.
// how the output loss is normalized
LossParameter_NormalizationMode normalization_;
bool has_cutting_point_;
Dtype cutting_point_;
std::string normalize_type_;
int softmax_axis_, outer_num_, inner_num_;
};
} // namespace caffe
#endif // CAFFE_SOFTMAX_WITH_LOSS_LAYER_HPP_
Now the concrete implementation of these functions:
#include <algorithm>
#include <cfloat>
#include <vector>
#include "caffe/layers/softmax_loss_layer.hpp"
#include "caffe/util/math_functions.hpp"
namespace caffe {
template <typename Dtype>
void SoftmaxWithLossLayer<Dtype>::LayerSetUp(
const vector<Blob<Dtype>*>& bottom, const vector<Blob<Dtype>*>& top) {
LossLayer<Dtype>::LayerSetUp(bottom, top);
normalize_type_ =
this->layer_param_.softmax_param().normalize_type();
// normalize_type is "Softmax": use a standard softmax layer internally
if (normalize_type_ == "Softmax") {
LayerParameter softmax_param(this->layer_param_);
softmax_param.set_type("Softmax");
softmax_layer_ = LayerRegistry<Dtype>::CreateLayer(softmax_param);
softmax_bottom_vec_.clear();
softmax_bottom_vec_.push_back(bottom[0]);
softmax_top_vec_.clear();
softmax_top_vec_.push_back(&prob_);
softmax_layer_->SetUp(softmax_bottom_vec_, softmax_top_vec_);
}
else if(normalize_type_ == "L2" || normalize_type_ == "L1") {
LayerParameter normalize_param(this->layer_param_);
normalize_param.set_type("Normalize");
softmax_layer_ = LayerRegistry<Dtype>::CreateLayer(normalize_param);
softmax_bottom_vec_.clear();
softmax_bottom_vec_.push_back(bottom[0]);
softmax_top_vec_.clear();
softmax_top_vec_.push_back(&prob_);
softmax_layer_->SetUp(softmax_bottom_vec_, softmax_top_vec_);
}
else {
NOT_IMPLEMENTED;
}
has_ignore_label_ =
this->layer_param_.loss_param().has_ignore_label();
if (has_ignore_label_) {
ignore_label_ = this->layer_param_.loss_param().ignore_label();
}
has_hard_ratio_ =
this->layer_param_.softmax_param().has_hard_ratio();
if (has_hard_ratio_) {
hard_ratio_ = this->layer_param_.softmax_param().hard_ratio();
CHECK_GE(hard_ratio_, 0);
CHECK_LE(hard_ratio_, 1);
}
has_cutting_point_ =
this->layer_param_.softmax_param().has_cutting_point();
if (has_cutting_point_) {
cutting_point_ = this->layer_param_.softmax_param().cutting_point();
CHECK_GE(cutting_point_, 0);
CHECK_LE(cutting_point_, 1);
}
has_hard_mining_label_ = this->layer_param_.softmax_param().has_hard_mining_label();
if (has_hard_mining_label_) {
hard_mining_label_ = this->layer_param_.softmax_param().hard_mining_label();
}
has_class_weight_ = (this->layer_param_.softmax_param().class_weight_size() != 0);
softmax_axis_ =
bottom[0]->CanonicalAxisIndex(this->layer_param_.softmax_param().axis());
if (has_class_weight_) {
class_weight_.Reshape({ bottom[0]->shape(softmax_axis_) });
CHECK_EQ(this->layer_param_.softmax_param().class_weight().size(), bottom[0]->shape(softmax_axis_));
for (int i = 0; i < bottom[0]->shape(softmax_axis_); i++) {
class_weight_.mutable_cpu_data()[i] = (Dtype)this->layer_param_.softmax_param().class_weight(i);
}
}
else {
if (bottom.size() == 3) {
class_weight_.Reshape({ bottom[0]->shape(softmax_axis_) });
for (int i = 0; i < bottom[0]->shape(softmax_axis_); i++) {
class_weight_.mutable_cpu_data()[i] = (Dtype)1.0;
}
}
}
if (!this->layer_param_.loss_param().has_normalization() &&
this->layer_param_.loss_param().has_normalize()) {
normalization_ = this->layer_param_.loss_param().normalize() ?
LossParameter_NormalizationMode_VALID :
LossParameter_NormalizationMode_BATCH_SIZE;
} else {
normalization_ = this->layer_param_.loss_param().normalization();
}
}
template <typename Dtype>
void SoftmaxWithLossLayer<Dtype>::Reshape(
const vector<Blob<Dtype>*>& bottom, const vector<Blob<Dtype>*>& top) {
LossLayer<Dtype>::Reshape(bottom, top);
softmax_layer_->Reshape(softmax_bottom_vec_, softmax_top_vec_);
softmax_axis_ =
bottom[0]->CanonicalAxisIndex(this->layer_param_.softmax_param().axis());
outer_num_ = bottom[0]->count(0, softmax_axis_);
inner_num_ = bottom[0]->count(softmax_axis_ + 1);
counts_.Reshape({ outer_num_, inner_num_ });
loss_.Reshape({ outer_num_, inner_num_ });
CHECK_EQ(outer_num_ * inner_num_, bottom[1]->count())
<< "Number of labels must match number of predictions; "
<< "e.g., if softmax axis == 1 and prediction shape is (N, C, H, W), "
<< "label count (number of labels) must be N*H*W, "
<< "with integer values in {0, 1, ..., C-1}.";
if (bottom.size() == 3) {
CHECK_EQ(outer_num_ * inner_num_, bottom[2]->count())
<< "Number of loss weights must match number of label.";
}
if (top.size() >= 2) {
// softmax output
top[1]->ReshapeLike(*bottom[0]);
}
if (has_class_weight_) {
CHECK_EQ(class_weight_.count(), bottom[0]->shape(1));
}
}
template <typename Dtype>
Dtype SoftmaxWithLossLayer<Dtype>::get_normalizer(
LossParameter_NormalizationMode normalization_mode, Dtype valid_count) {
Dtype normalizer;
switch (normalization_mode) {
case LossParameter_NormalizationMode_FULL:
normalizer = Dtype(outer_num_ * inner_num_);
break;
case LossParameter_NormalizationMode_VALID:
if (valid_count == -1) {
normalizer = Dtype(outer_num_ * inner_num_);
} else {
normalizer = valid_count;
}
break;
case LossParameter_NormalizationMode_BATCH_SIZE:
normalizer = Dtype(outer_num_);
break;
case LossParameter_NormalizationMode_NONE:
normalizer = Dtype(1);
break;
default:
LOG(FATAL) << "Unknown normalization mode: "
<< LossParameter_NormalizationMode_Name(normalization_mode);
}
// Some users will have no labels for some examples in order to 'turn off' a
// particular loss in a multi-task setup. The max prevents NaNs in that case.
return std::max(Dtype(1.0), normalizer);
}
// The forward pass uses the internal softmax layer to compute, for each sample, the probability of
// every class, e.g. for an input image of a dog: [0.8, 0.1, 0.1] for dog, cat and monkey.
template <typename Dtype>
void SoftmaxWithLossLayer<Dtype>::Forward_cpu(
const vector<Blob<Dtype>*>& bottom, const vector<Blob<Dtype>*>& top) {
// The forward pass computes the softmax prob values.
softmax_layer_->Forward(softmax_bottom_vec_, softmax_top_vec_);
const Dtype* prob_data = prob_.cpu_data();
const Dtype* label = bottom[1]->cpu_data();
int dim = prob_.count() / outer_num_;
Dtype count = 0;
Dtype loss = 0;
if (bottom.size() == 2) {
for (int i = 0; i < outer_num_; ++i) {
for (int j = 0; j < inner_num_; j++) {
const int label_value = static_cast<int>(label[i * inner_num_ + j]);
if (has_ignore_label_ && label_value == ignore_label_) {
continue;
}
DCHECK_GE(label_value, 0);
DCHECK_LT(label_value, prob_.shape(softmax_axis_));
loss -= log(std::max(prob_data[i * dim + label_value * inner_num_ + j],
Dtype(FLT_MIN)));
count += 1;
}
}
}
else if(bottom.size() == 3) {
const Dtype* weights = bottom[2]->cpu_data();
for (int i = 0; i < outer_num_; ++i) {
for (int j = 0; j < inner_num_; j++) {
const int label_value = static_cast<int>(label[i * inner_num_ + j]);
const Dtype weight_value = weights[i * inner_num_ + j] * (has_class_weight_? class_weight_.cpu_data()[label_value] : 1.0);
if (weight_value == 0) continue;
if (has_ignore_label_ && label_value == ignore_label_) {
continue;
}
DCHECK_GE(label_value, 0);
DCHECK_LT(label_value, prob_.shape(softmax_axis_));
loss -= weight_value * log(std::max(prob_data[i * dim + label_value * inner_num_ + j],
Dtype(FLT_MIN)));
count += weight_value;
}
}
}
top[0]->mutable_cpu_data()[0] = loss / get_normalizer(normalization_, count);
if (top.size() == 2) {
top[1]->ShareData(prob_);
}
}
template <typename Dtype>
void SoftmaxWithLossLayer<Dtype>::Backward_cpu(const vector<Blob<Dtype>*>& top,
const vector<bool>& propagate_down, const vector<Blob<Dtype>*>& bottom) {
if (propagate_down[1]) {
LOG(FATAL) << this->type()
<< " Layer cannot backpropagate to label inputs.";
}
if (propagate_down[0]) {
Dtype* bottom_diff = bottom[0]->mutable_cpu_diff();
const Dtype* prob_data = prob_.cpu_data();
caffe_copy(prob_.count(), prob_data, bottom_diff);
const Dtype* label = bottom[1]->cpu_data();
int dim = prob_.count() / outer_num_;
Dtype count = 0;
if (bottom.size() == 2) {
for (int i = 0; i < outer_num_; ++i) {
for (int j = 0; j < inner_num_; ++j) {
const int label_value = static_cast<int>(label[i * inner_num_ + j]);
if (has_ignore_label_ && label_value == ignore_label_) {
for (int c = 0; c < bottom[0]->shape(softmax_axis_); ++c) {
bottom_diff[i * dim + c * inner_num_ + j] = 0;
}
}
else {
// implementation of the backward formula: the gradient is prob - 1 at the ground-truth class
bottom_diff[i * dim + label_value * inner_num_ + j] -= 1;
count += 1;
}
}
}
}
else if (bottom.size() == 3) {
const Dtype* weights = bottom[2]->cpu_data();
for (int i = 0; i < outer_num_; ++i) {
for (int j = 0; j < inner_num_; ++j) {
const int label_value = static_cast<int>(label[i * inner_num_ + j]);
const Dtype weight_value = weights[i * inner_num_ + j];
if (has_ignore_label_ && label_value == ignore_label_) {
for (int c = 0; c < bottom[0]->shape(softmax_axis_); ++c) {
bottom_diff[i * dim + c * inner_num_ + j] = 0;
}
}
else {
bottom_diff[i * dim + label_value * inner_num_ + j] -= 1;
for (int c = 0; c < bottom[0]->shape(softmax_axis_); ++c) {
bottom_diff[i * dim + c * inner_num_ + j] *= weight_value * (has_class_weight_ ? class_weight_.cpu_data()[label_value] : 1.0);
}
if(weight_value != 0) count += weight_value;
}
}
}
}
// Scale gradient
// the normalization mode determines how the gradient is scaled
Dtype loss_weight = top[0]->cpu_diff()[0] /
get_normalizer(normalization_, count);
caffe_scal(prob_.count(), loss_weight, bottom_diff);
}
}
#ifdef CPU_ONLY
STUB_GPU(SoftmaxWithLossLayer);
#endif
INSTANTIATE_CLASS(SoftmaxWithLossLayer);
REGISTER_LAYER_CLASS(SoftmaxWithLoss);
} // namespace caffe
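To make the backward formula above concrete, here is a small self-contained sketch (plain C++, not Caffe code; the logits are the same made-up values as in the worked example at the top) that compares the analytic gradient $\hat{y}_j - \mathbb{1}\{j = k\}$ with a central finite difference of the numerically stable loss:

#include <algorithm>
#include <cmath>
#include <cstdio>
#include <vector>

// Numerically stable softmax cross-entropy loss for a single sample:
// L = -log( exp(z[k] - m) / sum_j exp(z[j] - m) ), with m = max_j z[j].
double SoftmaxLoss(const std::vector<double>& z, int k) {
  double m = *std::max_element(z.begin(), z.end());
  double denom = 0.0;
  for (double zj : z) denom += std::exp(zj - m);
  return -(z[k] - m) + std::log(denom);
}

int main() {
  std::vector<double> z = {2.0, 1.0, 0.1};  // made-up logits
  int k = 0;                                // ground-truth class index

  double m = *std::max_element(z.begin(), z.end());
  double denom = 0.0;
  for (double zj : z) denom += std::exp(zj - m);

  for (size_t j = 0; j < z.size(); ++j) {
    // Analytic gradient: dL/dz_j = p_j - 1{j == k}.
    double p_j = std::exp(z[j] - m) / denom;
    double analytic = p_j - (static_cast<int>(j) == k ? 1.0 : 0.0);

    // Central finite difference of the loss for comparison.
    const double eps = 1e-6;
    std::vector<double> zp = z, zm = z;
    zp[j] += eps;
    zm[j] -= eps;
    double numeric = (SoftmaxLoss(zp, k) - SoftmaxLoss(zm, k)) / (2 * eps);

    std::printf("j=%zu analytic=%.6f numeric=%.6f\n", j, analytic, numeric);
  }
  return 0;
}

The two columns should agree to several decimal places, which is exactly what the line bottom_diff[...] -= 1 in Backward_cpu implements on top of the copied probabilities.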