pingye9

A Simple Implementation of a Convolutional Neural Network (CNN) on MNIST

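For a basic introduction to convolutional neural networks (CNNs), see http://blog.csdn.net/fengbingchun/article/details/50529500. This post focuses on the code implementation.

A CNN is a multi-layer neural network. Each layer consists of several two-dimensional planes (feature maps), and each plane consists of independent neurons.

Using MNIST as the dataset and following LeNet-5 and tiny-cnn ( http://blog.csdn.net/fengbingchun/article/details/50573841 ), a simple 7-layer CNN is designed as follows: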

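Input layer: 32*32=1024 neurons;

C1 layer: convolution window 5*5, 6 output feature maps, 6 distinct kernels, output feature map size 28*28, trainable parameters (weights + biases, also called thresholds) 5*5*6+6=150+6, neurons 28*28*6=4704;

S2 layer: pooling window 2*2, 6 output subsampled maps, 6 distinct windows, output subsampled map size 14*14, trainable parameters 1*6+6=6+6, neurons 14*14*6=1176;

C3 layer: convolution window 5*5, 16 output feature maps, 6*16=96 distinct kernels, output feature map size 10*10, trainable parameters 5*5*(6*16)+16=2400+16, neurons 10*10*16=1600;

S4 layer: pooling window 2*2, 16 output subsampled maps, 16 distinct windows, output subsampled map size 5*5, trainable parameters 1*16+16=16+16, neurons 5*5*16=400;

C5 layer: convolution window 5*5, 120 output feature maps, 16*120=1920 distinct kernels, output feature map size 1*1, trainable parameters 5*5*(16*120)+120=48000+120, neurons 1*1*120=120;

Output layer: window 1*1, 10 output feature maps, 120*10=1200 distinct kernels, output feature map size 1*1, trainable parameters 1*(120*10)+10=1200+10, neurons 1*1*10=10.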
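
The sizes listed above follow from two rules: a stride-1 convolution without padding shrinks each side by (kernel - 1), and a non-overlapping 2*2 window halves each side. The sketch below re-derives the numbers under those assumptions. `LayerShape`, `conv_shape`, `pool_shape`, `neuron_count` and `conv_param_count` are hypothetical names used only for illustration; they are not part of CNN.hpp.

```cpp
#include <cassert>

// Hypothetical helper types and functions for illustration only;
// none of these names appear in CNN.hpp.
struct LayerShape {
    int width;
    int height;
    int maps;
};

// Convolution without padding, stride 1: each side shrinks by (kernel - 1).
inline LayerShape conv_shape(const LayerShape& in, int kernel, int out_maps) {
    assert(kernel >= 1 && kernel <= in.width && kernel <= in.height);
    return LayerShape{in.width - kernel + 1, in.height - kernel + 1, out_maps};
}

// Non-overlapping pooling window: each side is divided by the window size.
inline LayerShape pool_shape(const LayerShape& in, int window) {
    assert(window >= 1 && in.width % window == 0 && in.height % window == 0);
    return LayerShape{in.width / window, in.height / window, in.maps};
}

inline int neuron_count(const LayerShape& s) {
    return s.width * s.height * s.maps;
}

// One kernel per (input map, output map) pair, plus one bias per output map.
inline int conv_param_count(int kernel, int in_maps, int out_maps) {
    return kernel * kernel * in_maps * out_maps + out_maps;
}
```

Chaining `conv_shape` and `pool_shape` from a `LayerShape{32, 32, 1}` input with the kernel sizes and map counts above should reproduce the 28, 14, 10, 5 and 1 side lengths of C1 through C5.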

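The execution flow of the implementation is described below:

1. Load the training and test samples from the MNIST database:

(1) The original MNIST images are 28*28. Here they are padded to 32*32 (2 pixels on each side), with pixel values mapped to [-1,1] and all padding pixels set to -1. In total there are 60000 32*32 training samples and 10000 32*32 test samples;

(2) The output layer has 10 output nodes. During training, the node at the position of the correct digit is set to 0.9 and all other nodes are set to -0.9 (scale_max and scale_min in readMnistLabels).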
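
As a minimal sketch of the preprocessing in step 1, the two functions below pad one image and encode one label. `pad_image` and `encode_label` are hypothetical names, and the target values are passed in as parameters rather than fixed, so the sketch does not depend on a particular choice.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: copies a 28x28 8-bit image into the centre of a
// 32x32 float buffer. Pixels are mapped from [0, 255] to [-1, 1] and the
// 2-pixel border keeps the fill value -1.
std::vector<float> pad_image(const std::vector<unsigned char>& src) {
    const int src_size = 28;
    const int dst_size = 32;
    const int pad = 2;
    assert(src.size() == static_cast<std::size_t>(src_size * src_size));

    std::vector<float> dst(dst_size * dst_size, -1.0f);
    for (int r = 0; r < src_size; ++r) {
        for (int c = 0; c < src_size; ++c) {
            const float pixel = src[r * src_size + c] / 255.0f;  // [0, 1]
            dst[(r + pad) * dst_size + (c + pad)] = pixel * 2.0f - 1.0f;
        }
    }
    return dst;
}

// Hypothetical helper: builds the 10-element target vector for one digit.
// on_value / off_value are the targets for the correct and incorrect nodes.
std::vector<float> encode_label(int digit, float on_value, float off_value) {
    assert(digit >= 0 && digit < 10);
    std::vector<float> target(10, off_value);
    target[digit] = on_value;
    return target;
}
```

Calling `encode_label(3, 0.9f, -0.9f)` yields a vector whose fourth element is 0.9 and whose other nine elements are -0.9.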

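2. Initialize the weights and biases (thresholds): the weights are the convolution kernels. All neurons in one feature map share the same weights and bias, so the number of biases equals the number of feature maps.

(1) Weights are initialized with uniform_rand, drawn uniformly from [-sqrt(6/(fan_in+fan_out)), sqrt(6/(fan_in+fan_out))];

(2) Biases: initWeightThreshold in the code below draws them uniformly from [-1, 1] with rand(); initializing them all to 0 is kept there as a commented-out alternative.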
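
The weight range used in step 2 is commonly known as Xavier (Glorot) uniform initialization. The sketch below shows the idea in isolation; `xavier_bound` and `init_weights` are hypothetical names, and the explicit seed parameter is an assumption made so that results are reproducible.

```cpp
#include <cassert>
#include <cmath>
#include <random>
#include <vector>

// Hypothetical helper: half-width of the uniform range,
// sqrt(6 / (fan_in + fan_out)).
float xavier_bound(int fan_in, int fan_out) {
    assert(fan_in + fan_out > 0);
    return std::sqrt(6.0f / static_cast<float>(fan_in + fan_out));
}

// Hypothetical helper: draws `count` weights uniformly from [-bound, bound).
std::vector<float> init_weights(int count, int fan_in, int fan_out, unsigned seed) {
    std::mt19937 gen(seed);
    const float bound = xavier_bound(fan_in, fan_out);
    std::uniform_real_distribution<float> dist(-bound, bound);

    std::vector<float> weights(count);
    for (float& w : weights) {
        w = dist(gen);
    }
    return weights;
}
```

For C1, fan_in is 5*5*1=25 and fan_out is 5*5*6=150, so `xavier_bound(25, 150)` is sqrt(6/175), roughly 0.185.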

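3. Forward propagation: compute the value of every neuron in each layer from the weights and biases.

(1) Input layer: one 32*32 sample is fed in at a time.

(2) C1 layer: each 5*5 kernel is slid over the 32*32 image with a stride of 1. At every position the kernel and the image patch under it are multiplied element by element and the products are summed, which yields a 28*28 map. There are 6 such 5*5 kernels. A bias is then added to every neuron, and finally the tanh activation function is applied to each neuron to obtain its final value.

(3) S2 layer: the 6 28*28 feature maps of C1 are reduced to 6 14*14 subsampled maps. Each group of four neighbouring neurons (a 2*2 window) is summed, multiplied by a weight, averaged (divided by 4), and a bias is added; finally the tanh activation function is applied to each neuron to obtain its final value.

(4) C3 layer: 16 10*10 feature maps are generated from the 6 14*14 subsampled maps of S2. Each 10*10 feature map is obtained by convolving the 6 14*14 subsampled maps with 6 5*5 kernels and summing the results at corresponding positions. A bias is then added to every neuron, and finally the tanh activation function is applied to each neuron to obtain its final value.

(5) S4 layer: the 16 10*10 feature maps of C3 are reduced to 16 5*5 subsampled maps. Each group of four neighbouring neurons is summed, multiplied by a weight, averaged (divided by 4), and a bias is added; finally the tanh activation function is applied to each neuron to obtain its final value.

(6) C5 layer: 120 1*1 feature maps are generated from the 16 5*5 subsampled maps of S4. Each 1*1 feature map is obtained by multiplying 16 5*5 kernels with the 16 5*5 subsampled maps and summing all products. A bias is then added to every neuron, and finally the tanh activation function is applied to each neuron to obtain its final value.

(7) Output layer: a fully connected layer. Each output neuron is the sum of the 120 C5 neurons multiplied by their corresponding weights; a bias is then added to every neuron, and finally the tanh activation function is applied to each neuron to obtain its final value.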
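
The two building blocks of the forward pass can be sketched for a single input map and a single output map as follows. `conv_tanh` and `pool_tanh` are hypothetical names that simplify the multi-map loops of the real code, and a row-major image layout is assumed.

```cpp
#include <cassert>
#include <cmath>
#include <vector>

// Hypothetical single-map convolution layer: stride 1, no padding,
// out = tanh(sum(kernel * patch) + bias).
std::vector<float> conv_tanh(const std::vector<float>& in, int in_w, int in_h,
                             const std::vector<float>& kernel, int k, float bias) {
    assert(static_cast<int>(in.size()) == in_w * in_h);
    assert(static_cast<int>(kernel.size()) == k * k);
    const int out_w = in_w - k + 1;
    const int out_h = in_h - k + 1;

    std::vector<float> out(out_w * out_h);
    for (int y = 0; y < out_h; ++y) {
        for (int x = 0; x < out_w; ++x) {
            float sum = bias;
            for (int ky = 0; ky < k; ++ky) {
                for (int kx = 0; kx < k; ++kx) {
                    sum += kernel[ky * k + kx] * in[(y + ky) * in_w + (x + kx)];
                }
            }
            out[y * out_w + x] = std::tanh(sum);
        }
    }
    return out;
}

// Hypothetical single-map subsampling layer with a 2x2 window:
// out = tanh(weight * mean(window) + bias).
std::vector<float> pool_tanh(const std::vector<float>& in, int in_w, int in_h,
                             float weight, float bias) {
    assert(static_cast<int>(in.size()) == in_w * in_h);
    assert(in_w % 2 == 0 && in_h % 2 == 0);
    const int out_w = in_w / 2;
    const int out_h = in_h / 2;

    std::vector<float> out(out_w * out_h);
    for (int y = 0; y < out_h; ++y) {
        for (int x = 0; x < out_w; ++x) {
            const float* p = &in[(2 * y) * in_w + 2 * x];
            const float sum = p[0] + p[1] + p[in_w] + p[in_w + 1];
            out[y * out_w + x] = std::tanh(weight * sum * 0.25f + bias);
        }
    }
    return out;
}
```

For a layer with several input maps, such as C3, the sums from all input maps would be accumulated before the bias and tanh are applied once per output neuron.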

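4. Backpropagation: compute the errors (gradients) of the neurons, weights and biases of each layer, which are then used to update the weights and biases.

(1) Output layer: compute the output-layer neuron errors from the derivative of the MSE loss function and the derivative of the tanh activation function.

(2) C5 layer: compute the C5 neuron errors, the output-layer weight errors and the output-layer bias errors. The output-layer neuron errors are multiplied by the output-layer weights and summed, and the result is multiplied by the tanh derivative of the C5 neuron to obtain the error of each C5 neuron. The output-layer neuron errors multiplied by the C5 neurons give the output-layer weight errors. The output-layer neuron errors are themselves the output-layer bias errors.

(3) S4 layer: compute the S4 neuron errors, the C5 weight errors and the C5 bias errors. The C5 weights are multiplied by the C5 neuron errors and summed, and the result is multiplied by the tanh derivative of the S4 neuron to obtain the error of each S4 neuron. The S4 neurons multiplied by the C5 neuron errors, summed, give the C5 weight errors. The C5 neuron errors are themselves the C5 bias errors.

(4) C3 layer: compute the C3 neuron errors, the S4 weight errors and the S4 bias errors;

(5) S2 layer: compute the S2 neuron errors, the C3 weight errors and the C3 bias errors;

(6) C1 layer: compute the C1 neuron errors, the S2 weight errors and the S2 bias errors;

(7) Input layer: compute the C1 weight errors and the C1 bias errors.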
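
Steps (1) and (2) can be sketched for one fully connected tanh layer with the MSE loss (y - t)^2 / 2. `FcGradients` and `fc_backward` are hypothetical names, and the weight layout `weight[i * n_out + o]` is an assumption made for this illustration, not necessarily the layout used in CNN.cpp.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical result type: errors for the previous layer's neurons and
// for this layer's weights and biases.
struct FcGradients {
    std::vector<float> delta_in;
    std::vector<float> delta_weight;
    std::vector<float> delta_bias;
};

// Hypothetical helper. `in` and `out` hold tanh outputs, so the tanh
// derivative is written as 1 - y * y. Assumed layout: weight[i * n_out + o].
FcGradients fc_backward(const std::vector<float>& in,
                        const std::vector<float>& out,
                        const std::vector<float>& target,
                        const std::vector<float>& weight) {
    const std::size_t n_in = in.size();
    const std::size_t n_out = out.size();
    assert(target.size() == n_out);
    assert(weight.size() == n_in * n_out);

    // Output neuron error: dE/dy * dy/da = (y - t) * (1 - y^2).
    std::vector<float> delta_out(n_out);
    for (std::size_t o = 0; o < n_out; ++o) {
        delta_out[o] = (out[o] - target[o]) * (1.0f - out[o] * out[o]);
    }

    FcGradients g;
    g.delta_in.assign(n_in, 0.0f);
    g.delta_weight.assign(n_in * n_out, 0.0f);
    g.delta_bias = delta_out;  // the bias error equals the neuron error

    for (std::size_t i = 0; i < n_in; ++i) {
        float sum = 0.0f;
        for (std::size_t o = 0; o < n_out; ++o) {
            sum += delta_out[o] * weight[i * n_out + o];
            g.delta_weight[i * n_out + o] = delta_out[o] * in[i];
        }
        // Propagate through the previous layer's tanh.
        g.delta_in[i] = sum * (1.0f - in[i] * in[i]);
    }
    return g;
}
```

With one input of 0.5, one output of 0.0, a target of 1.0 and a weight of 2.0, the output error is -1, the weight error is -0.5 and the propagated input error is -1.5.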

Code files:

CNN.hpp:

#ifndef _CNN_HPP_
#define _CNN_HPP_

#include <utility>
#include <vector>

namespace ANN {  
  
#define width_image_input_CNN       32 // normalized image width
#define height_image_input_CNN      32 // normalized image height
#define width_image_C1_CNN      28  
#define height_image_C1_CNN     28  
#define width_image_S2_CNN      14  
#define height_image_S2_CNN     14  
#define width_image_C3_CNN      10  
#define height_image_C3_CNN     10  
#define width_image_S4_CNN      5  
#define height_image_S4_CNN     5  
#define width_image_C5_CNN      1  
#define height_image_C5_CNN     1  
#define width_image_output_CNN      1  
#define height_image_output_CNN     1  
  
#define width_kernel_conv_CNN       5 // convolution kernel size
#define height_kernel_conv_CNN      5  
#define width_kernel_pooling_CNN    2  
#define height_kernel_pooling_CNN   2  
#define size_pooling_CNN        2  
  
#define num_map_input_CNN       1 // number of maps in the input layer
#define num_map_C1_CNN          6 // number of maps in C1
#define num_map_S2_CNN          6 // number of maps in S2
#define num_map_C3_CNN          16 // number of maps in C3
#define num_map_S4_CNN          16 // number of maps in S4
#define num_map_C5_CNN          120 // number of maps in C5
#define num_map_output_CNN      10 // number of maps in the output layer
  
#define num_patterns_train_CNN      60000 // number of training patterns (total)
#define num_patterns_test_CNN       10000 // number of test patterns (total)
#define num_epochs_CNN          100 // maximum number of epochs
#define accuracy_rate_CNN       0.97 // required accuracy rate
#define learning_rate_CNN       0.01 // learning rate
#define eps_CNN             1e-8  
  
#define len_weight_C1_CNN       150 // number of C1 weights, 5*5*6=150
#define len_bias_C1_CNN         6 // number of C1 biases, 6
#define len_weight_S2_CNN       6 // number of S2 weights, 1*6=6
#define len_bias_S2_CNN         6 // number of S2 biases, 6
#define len_weight_C3_CNN       2400 // number of C3 weights, 5*5*6*16=2400
#define len_bias_C3_CNN         16 // number of C3 biases, 16
#define len_weight_S4_CNN       16 // number of S4 weights, 1*16=16
#define len_bias_S4_CNN         16 // number of S4 biases, 16
#define len_weight_C5_CNN       48000 // number of C5 weights, 5*5*16*120=48000
#define len_bias_C5_CNN         120 // number of C5 biases, 120
#define len_weight_output_CNN       1200 // number of output-layer weights, 120*10=1200
#define len_bias_output_CNN     10 // number of output-layer biases, 10
  
#define num_neuron_input_CNN        1024 // number of input-layer neurons, 32*32=1024
#define num_neuron_C1_CNN       4704 // number of C1 neurons, 28*28*6=4704
#define num_neuron_S2_CNN       1176 // number of S2 neurons, 14*14*6=1176
#define num_neuron_C3_CNN       1600 // number of C3 neurons, 10*10*16=1600
#define num_neuron_S4_CNN       400 // number of S4 neurons, 5*5*16=400
#define num_neuron_C5_CNN       120 // number of C5 neurons, 1*120=120
#define num_neuron_output_CNN       10 // number of output-layer neurons, 1*10=10
  
class CNN {  
public:  
    CNN();  
    ~CNN();  
  
    void init(); // initialize and allocate memory
    bool train(); // train the network
    int predict(const unsigned char* data, int width, int height); // predict the digit in an image
    bool readModelFile(const char* name); // load a previously trained model
  
protected:  
    typedef std::vector<std::pair<int, int> > wi_connections;
    typedef std::vector<std::pair<int, int> > wo_connections;
    typedef std::vector<std::pair<int, int> > io_connections;
  
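    void release(); // free the allocated memory
    bool saveModelFile(const char* name); // save the trained model, including each layer's node count, weights and biases
    bool initWeightThreshold(); // initialize the weights and biases with random values
    bool getSrcData(); // read the MNIST data
    float test(); // compute the accuracy once per training epoch
    float activation_function_tanh(float x); // activation function: tanh
    float activation_function_tanh_derivative(float x); // derivative of the tanh activation function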
    float activation_function_identity(float x);  
    float activation_function_identity_derivative(float x);  
    float loss_function_mse(float y, float t); // loss function: mean squared error
    float loss_function_mse_derivative(float y, float t);  
    void loss_function_gradient(const float* y, const float* t, float* dst, int len);  
    float dot_product(const float* s1, const float* s2, int len); // dot product
    bool muladd(const float* src, float c, int len, float* dst); //dst[i] += c * src[i]  
    void init_variable(float* val, float c, int len);  
    bool uniform_rand(float* src, int len, float min, float max);  
    float uniform_rand(float min, float max);  
    int get_index(int x, int y, int channel, int width, int height, int depth);  
    void calc_out2wi(int width_in, int height_in, int width_out, int height_out, int depth_out, std::vector<wi_connections>& out2wi);
    void calc_out2bias(int width, int height, int depth, std::vector<int>& out2bias);
    void calc_in2wo(int width_in, int height_in, int width_out, int height_out, int depth_in, int depth_out, std::vector<wo_connections>& in2wo);
    void calc_weight2io(int width_in, int height_in, int width_out, int height_out, int depth_in, int depth_out, std::vector<io_connections>& weight2io);
    void calc_bias2out(int width_in, int height_in, int width_out, int height_out, int depth_in, int depth_out, std::vector<std::vector<int> >& bias2out);
  
    bool Forward_C1(); // forward propagation
    bool Forward_S2();
    bool Forward_C3();
    bool Forward_S4();
    bool Forward_C5();
    bool Forward_output();
    bool Backward_output(); // backpropagation
    bool Backward_C5();
    bool Backward_S4();  
    bool Backward_C3();  
    bool Backward_S2();  
    bool Backward_C1();  
    bool Backward_input();  
    bool UpdateWeights(); // update the weights and biases
    void update_weights_bias(const float* delta, float* weight, int len);  
  
private:  
    float* data_input_train; // input data for training, range [-1, 1]
    float* data_output_train; // expected output for training, range [-0.9, 0.9]
    float* data_input_test; // input data for testing, range [-1, 1]
    float* data_output_test; // expected output for testing, range [-0.9, 0.9]
    float* data_single_image;  
    float* data_single_label;  
  
    float weight_C1[len_weight_C1_CNN];  
    float bias_C1[len_bias_C1_CNN];  
    float weight_S2[len_weight_S2_CNN];  
    float bias_S2[len_bias_S2_CNN];  
    float weight_C3[len_weight_C3_CNN];  
    float bias_C3[len_bias_C3_CNN];  
    float weight_S4[len_weight_S4_CNN];  
    float bias_S4[len_bias_S4_CNN];  
    float weight_C5[len_weight_C5_CNN];  
    float bias_C5[len_bias_C5_CNN];  
    float weight_output[len_weight_output_CNN];  
    float bias_output[len_bias_output_CNN];  
  
    float neuron_input[num_neuron_input_CNN]; //data_single_image  
    float neuron_C1[num_neuron_C1_CNN];  
    float neuron_S2[num_neuron_S2_CNN];  
    float neuron_C3[num_neuron_C3_CNN];  
    float neuron_S4[num_neuron_S4_CNN];  
    float neuron_C5[num_neuron_C5_CNN];  
    float neuron_output[num_neuron_output_CNN];  
  
    float delta_neuron_output[num_neuron_output_CNN]; // neuron errors
    float delta_neuron_C5[num_neuron_C5_CNN];  
    float delta_neuron_S4[num_neuron_S4_CNN];  
    float delta_neuron_C3[num_neuron_C3_CNN];  
    float delta_neuron_S2[num_neuron_S2_CNN];  
    float delta_neuron_C1[num_neuron_C1_CNN];  
    float delta_neuron_input[num_neuron_input_CNN];  
  
    float delta_weight_C1[len_weight_C1_CNN]; // weight and bias errors
    float delta_bias_C1[len_bias_C1_CNN];  
    float delta_weight_S2[len_weight_S2_CNN];  
    float delta_bias_S2[len_bias_S2_CNN];  
    float delta_weight_C3[len_weight_C3_CNN];  
    float delta_bias_C3[len_bias_C3_CNN];  
    float delta_weight_S4[len_weight_S4_CNN];  
    float delta_bias_S4[len_bias_S4_CNN];  
    float delta_weight_C5[len_weight_C5_CNN];  
    float delta_bias_C5[len_bias_C5_CNN];  
    float delta_weight_output[len_weight_output_CNN];  
    float delta_bias_output[len_bias_output_CNN];  
  
    std::vector<wi_connections> out2wi_S2; // out_id -> [(weight_id, in_id)]
    std::vector<int> out2bias_S2;
    std::vector<wi_connections> out2wi_S4;
    std::vector<int> out2bias_S4;
    std::vector<wo_connections> in2wo_C3; // in_id -> [(weight_id, out_id)]
    std::vector<io_connections> weight2io_C3; // weight_id -> [(in_id, out_id)]
    std::vector<std::vector<int> > bias2out_C3;
    std::vector<wo_connections> in2wo_C1;
    std::vector<io_connections> weight2io_C1;
    std::vector<std::vector<int> > bias2out_C1;
};  
  
}  
  
#endif //_CNN_HPP_

CNN.cpp:

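#include <assert.h>
#include <stdlib.h>
#include <time.h>
#include <algorithm>
#include <cmath>
#include <fstream>
#include <iostream>
#include <random>
#include <string>

#include "CNN.hpp"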
  
namespace ANN {  
  
CNN::CNN()  
{  
    data_input_train = NULL;  
    data_output_train = NULL;  
    data_input_test = NULL;  
    data_output_test = NULL;  
    data_single_image = NULL;  
    data_single_label = NULL;  
}  
  
CNN::~CNN()  
{  
    release();  
}  
  
void CNN::release()  
{  
    if (data_input_train) {  
        delete[] data_input_train;  
        data_input_train = NULL;  
    }  
  
    if (data_output_train) {  
        delete[] data_output_train;  
        data_output_train = NULL;  
    }  
  
    if (data_input_test) {  
        delete[] data_input_test;  
        data_input_test = NULL;  
    }  
  
    if (data_output_test) {  
        delete[] data_output_test;  
        data_output_test = NULL;  
    }  
}  
  
void CNN::init_variable(float* val, float c, int len)  
{  
    for (int i = 0; i < len; i++) {  
        val[i] = c;  
    }  
}  
  
void CNN::init()  
{  
    int len1 = width_image_input_CNN * height_image_input_CNN * num_patterns_train_CNN;  
    data_input_train = new float[len1];  
    init_variable(data_input_train, -1.0, len1);  
  
    int len2 = num_map_output_CNN * num_patterns_train_CNN;  
    data_output_train = new float[len2];  
    init_variable(data_output_train, -0.9, len2);  
  
    int len3 = width_image_input_CNN * height_image_input_CNN * num_patterns_test_CNN;  
    data_input_test = new float[len3];  
    init_variable(data_input_test, -1.0, len3);  
  
    int len4 = num_map_output_CNN * num_patterns_test_CNN;  
    data_output_test = new float[len4];  
    init_variable(data_output_test, -0.9, len4);  
  
    initWeightThreshold();  
    getSrcData();  
}  
  
float CNN::uniform_rand(float min, float max)  
{  
    static std::mt19937 gen(1);  
    std::uniform_real_distribution<float> dst(min, max);  
    return dst(gen);  
}  
  
bool CNN::uniform_rand(float* src, int len, float min, float max)  
{  
    for (int i = 0; i < len; i++) {  
        src[i] = uniform_rand(min, max);  
    }  
  
    return true;  
}  
  
bool CNN::initWeightThreshold()  
{  
    srand(time(0) + rand());  
    const float scale = 6.0;  
  
    //const float_t weight_base = std::sqrt(scale_ / (fan_in + fan_out));  
    //fan_in = width_kernel_conv_CNN * height_kernel_conv_CNN * num_map_input_CNN = 5 * 5 * 1  
    //fan_out = width_kernel_conv_CNN * height_kernel_conv_CNN * num_map_C1_CNN = 5 * 5 * 6  
    float min_ = -std::sqrt(scale / (25.0 + 150.0));  
    float max_ = std::sqrt(scale / (25.0 + 150.0));  
    uniform_rand(weight_C1, len_weight_C1_CNN, min_, max_);  
    //for (int i = 0; i < len_weight_C1_CNN; i++) {  
    //  weight_C1[i] = -1 + 2 * ((float)rand()) / RAND_MAX; //[-1, 1]  
    //}  
    for (int i = 0; i < len_bias_C1_CNN; i++) {  
        bias_C1[i] = -1 + 2 * ((float)rand()) / RAND_MAX;//0.0;//  
    }  
  
    min_ = -std::sqrt(scale / (4.0 + 1.0));  
    max_ = std::sqrt(scale / (4.0 + 1.0));  
    uniform_rand(weight_S2, len_weight_S2_CNN, min_, max_);  
    //for (int i = 0; i < len_weight_S2_CNN; i++) {  
    //  weight_S2[i] = -1 + 2 * ((float)rand()) / RAND_MAX;  
    //}  
    for (int i = 0; i < len_bias_S2_CNN; i++) {  
        bias_S2[i] = -1 + 2 * ((float)rand()) / RAND_MAX;//0.0;//   
    }  
  
    min_ = -std::sqrt(scale / (150.0 + 400.0));  
    max_ = std::sqrt(scale / (150.0 + 400.0));  
    uniform_rand(weight_C3, len_weight_C3_CNN, min_, max_);  
    //for (int i = 0; i < len_weight_C3_CNN; i++) {  
    //  weight_C3[i] = -1 + 2 * ((float)rand()) / RAND_MAX;  
    //}  
    for (int i = 0; i < len_bias_C3_CNN; i++) {  
        bias_C3[i] = -1 + 2 * ((float)rand()) / RAND_MAX;//0.0;//   
    }  
  
    min_ = -std::sqrt(scale / (4.0 + 1.0));  
    max_ = std::sqrt(scale / (4.0 + 1.0));  
    uniform_rand(weight_S4, len_weight_S4_CNN, min_, max_);  
    //for (int i = 0; i < len_weight_S4_CNN; i++) {  
    //  weight_S4[i] = -1 + 2 * ((float)rand()) / RAND_MAX;  
    //}  
    for (int i = 0; i < len_bias_S4_CNN; i++) {  
        bias_S4[i] = -1 + 2 * ((float)rand()) / RAND_MAX; //0.0;//  
    }  
  
    min_ = -std::sqrt(scale / (400.0 + 3000.0));  
    max_ = std::sqrt(scale / (400.0 + 3000.0));  
    uniform_rand(weight_C5, len_weight_C5_CNN, min_, max_);  
    //for (int i = 0; i < len_weight_C5_CNN; i++) {  
    //  weight_C5[i] = -1 + 2 * ((float)rand()) / RAND_MAX;  
    //}  
    for (int i = 0; i < len_bias_C5_CNN; i++) {  
        bias_C5[i] =-1 + 2 * ((float)rand()) / RAND_MAX; //0.0;//   
    }  
  
    min_ = -std::sqrt(scale / (120.0 + 10.0));  
    max_ = std::sqrt(scale / (120.0 + 10.0));  
    uniform_rand(weight_output, len_weight_output_CNN, min_, max_);  
    //for (int i = 0; i < len_weight_output_CNN; i++) {  
    //  weight_output[i] = -1 + 2 * ((float)rand()) / RAND_MAX;  
    //}  
    for (int i = 0; i < len_bias_output_CNN; i++) {  
        bias_output[i] = -1 + 2 * ((float)rand()) / RAND_MAX;//0.0;//   
    }  
  
    return true;  
}  
  
static int reverseInt(int i)  
{  
    unsigned char ch1, ch2, ch3, ch4;  
    ch1 = i & 255;  
    ch2 = (i >> 8) & 255;  
    ch3 = (i >> 16) & 255;  
    ch4 = (i >> 24) & 255;  
    return((int)ch1 << 24) + ((int)ch2 << 16) + ((int)ch3 << 8) + ch4;  
}  
  
static void readMnistImages(std::string filename, float* data_dst, int num_image)  
{  
    const int width_src_image = 28;  
    const int height_src_image = 28;  
    const int x_padding = 2;  
    const int y_padding = 2;  
    const float scale_min = -1;  
    const float scale_max = 1;  
  
    std::ifstream file(filename, std::ios::binary);  
    assert(file.is_open());  
  
    int magic_number = 0;  
    int number_of_images = 0;  
    int n_rows = 0;  
    int n_cols = 0;  
    file.read((char*)&magic_number, sizeof(magic_number));  
    magic_number = reverseInt(magic_number);  
    file.read((char*)&number_of_images, sizeof(number_of_images));  
    number_of_images = reverseInt(number_of_images);  
    assert(number_of_images == num_image);  
    file.read((char*)&n_rows, sizeof(n_rows));  
    n_rows = reverseInt(n_rows);  
    file.read((char*)&n_cols, sizeof(n_cols));  
    n_cols = reverseInt(n_cols);  
    assert(n_rows == height_src_image && n_cols == width_src_image);  
  
    int size_single_image = width_image_input_CNN * height_image_input_CNN;  
  
    for (int i = 0; i < number_of_images; ++i) {  
        int addr = size_single_image * i;  
  
        for (int r = 0; r < n_rows; ++r) {  
            for (int c = 0; c < n_cols; ++c) {  
                unsigned char temp = 0;  
                file.read((char*)&temp, sizeof(temp));  
                data_dst[addr + width_image_input_CNN * (r + y_padding) + c + x_padding] = (temp / 255.0) * (scale_max - scale_min) + scale_min;  
            }  
        }  
    }  
}  
  
static void readMnistLabels(std::string filename, float* data_dst, int num_image)  
{  
    const float scale_min = -0.9;  
    const float scale_max = 0.9;  
  
    std::ifstream file(filename, std::ios::binary);  
    assert(file.is_open());  
  
    int magic_number = 0;  
    int number_of_images = 0;  
    file.read((char*)&magic_number, sizeof(magic_number));  
    magic_number = reverseInt(magic_number);  
    file.read((char*)&number_of_images, sizeof(number_of_images));  
    number_of_images = reverseInt(number_of_images);  
    assert(number_of_images == num_image);  
  
    for (int i = 0; i < number_of_images; ++i) {  
        unsigned char temp = 0;  
        file.read((char*)&temp, sizeof(temp));  
        data_dst[i * num_map_output_CNN + temp] = scale_max;  
    }  
}  
  
bool CNN::getSrcData()  
{  
    assert(data_input_train && data_output_train && data_input_test && data_output_test);  
  
    std::string filename_train_images = "D:/Download/MNIST/train-images.idx3-ubyte";  
    std::string filename_train_labels = "D:/Download/MNIST/train-labels.idx1-ubyte";  
    readMnistImages(filename_train_images, data_input_train, num_patterns_train_CNN);  
    /*unsigned char* p = new unsigned char[num_neuron_input_CNN]; 
    memset(p, 0, sizeof(unsigned char) * num_neuron_input_CNN); 
    for (int j = 0, i = 59998 * num_neuron_input_CNN; j< num_neuron_input_CNN; j++, i++) { 
        p[j] = (unsigned char)((data_input_train[i] + 1.0) / 2.0 * 255.0); 
    } 
    delete[] p;*/  
    readMnistLabels(filename_train_labels, data_output_train, num_patterns_train_CNN);  
    /*float* q = new float[num_neuron_output_CNN]; 
    memset(q, 0, sizeof(float) * num_neuron_output_CNN); 
    for (int j = 0, i = 59998 * num_neuron_output_CNN; j < num_neuron_output_CNN; j++, i++) { 
        q[j] = data_output_train[i]; 
    } 
    delete[] q;*/  
  
    std::string filename_test_images = "D:/Download/MNIST/t10k-images.idx3-ubyte";  
    std::string filename_test_labels = "D:/Download/MNIST/t10k-labels.idx1-ubyte";  
    readMnistImages(filename_test_images, data_input_test, num_patterns_test_CNN);  
    readMnistLabels(filename_test_labels, data_output_test, num_patterns_test_CNN);  
  
    return true;  
}  
  
bool CNN::train()  
{  
    out2wi_S2.clear();  
    out2bias_S2.clear();  
    out2wi_S4.clear();  
    out2bias_S4.clear();  
    in2wo_C3.clear();  
    weight2io_C3.clear();  
    bias2out_C3.clear();  
    in2wo_C1.clear();  
    weight2io_C1.clear();  
    bias2out_C1.clear();  
  
    calc_out2wi(width_image_C1_CNN, height_image_C1_CNN, width_image_S2_CNN, height_image_S2_CNN, num_map_S2_CNN, out2wi_S2);  
    calc_out2bias(width_image_S2_CNN, height_image_S2_CNN, num_map_S2_CNN, out2bias_S2);  
    calc_out2wi(width_image_C3_CNN, height_image_C3_CNN, width_image_S4_CNN, height_image_S4_CNN, num_map_S4_CNN, out2wi_S4);  
    calc_out2bias(width_image_S4_CNN, height_image_S4_CNN, num_map_S4_CNN, out2bias_S4);  
    calc_in2wo(width_image_C3_CNN, height_image_C3_CNN, width_image_S4_CNN, height_image_S4_CNN, num_map_C3_CNN, num_map_S4_CNN, in2wo_C3);  
    calc_weight2io(width_image_C3_CNN, height_image_C3_CNN, width_image_S4_CNN, height_image_S4_CNN, num_map_C3_CNN, num_map_S4_CNN, weight2io_C3);  
    calc_bias2out(width_image_C3_CNN, height_image_C3_CNN, width_image_S4_CNN, height_image_S4_CNN, num_map_C3_CNN, num_map_S4_CNN, bias2out_C3);  
    calc_in2wo(width_image_C1_CNN, height_image_C1_CNN, width_image_S2_CNN, height_image_S2_CNN, num_map_C1_CNN, num_map_C3_CNN, in2wo_C1);  
    calc_weight2io(width_image_C1_CNN, height_image_C1_CNN, width_image_S2_CNN, height_image_S2_CNN, num_map_C1_CNN, num_map_C3_CNN, weight2io_C1);  
    calc_bias2out(width_image_C1_CNN, height_image_C1_CNN, width_image_S2_CNN, height_image_S2_CNN, num_map_C1_CNN, num_map_C3_CNN, bias2out_C1);  
  
    int iter = 0;  
    for (iter = 0; iter < num_epochs_CNN; iter++) {  
        std::cout << "epoch: " << iter;  
  
        float accuracyRate = test();//0;  
        std::cout << ",    accuracy rate: " << accuracyRate << std::endl;
        if (accuracyRate > accuracy_rate_CNN) {  
            saveModelFile("cnn.model");  
            std::cout << "generate cnn model" << std::endl;  
            break;  
        }  
  
        for (int i = 0; i < num_patterns_train_CNN; i++) {  
            data_single_image = data_input_train + i * num_neuron_input_CNN;  
            data_single_label = data_output_train + i * num_neuron_output_CNN;  
  
            Forward_C1();  
            Forward_S2();  
            Forward_C3();  
            Forward_S4();  
            Forward_C5();  
            Forward_output();  
  
            Backward_output();  
            Backward_C5();  
            Backward_S4();  
            Backward_C3();  
            Backward_S2();  
            Backward_C1();  
            Backward_input();  
  
            UpdateWeights();  
        }  
    }  
  
    if (iter == num_epochs_CNN) {  
        saveModelFile("cnn.model");  
        std::cout << "generate cnn model" << std::endl;  
    }  
  
    return true;  
}  
  
float CNN::activation_function_tanh(float x)
{
    // std::tanh stays finite for large |x|, where exp(x) would overflow
    return std::tanh(x);
}
  
float CNN::activation_function_tanh_derivative(float x)
{
    // x is expected to be the neuron output y = tanh(a); then dy/da = 1 - y * y
    return (1.0 - x * x);
}
  
float CNN::activation_function_identity(float x)  
{  
    return x;  
}  
  
float CNN::activation_function_identity_derivative(float x)  
{  
    return 1;  
}  
  
float CNN::loss_function_mse(float y, float t)  
{  
    return (y - t) * (y - t) / 2;  
}  
  
float CNN::loss_function_mse_derivative(float y, float t)  
{  
    return (y - t);  
}  
  
void CNN::loss_function_gradient(const float* y, const float* t, float* dst, int len)  
{  
    for (int i = 0; i < len; i++) {  
        dst[i] = loss_function_mse_derivative(y[i], t[i]);  
    }  
}  
  
float CNN::dot_product(const float* s1, const float* s2, int len)  
{  
    float result = 0.0;  
  
    for (int i = 0; i < len; i++) {  
        result += s1[i] * s2[i];  
    }  
  
    return result;  
}  
  
bool CNN::muladd(const float* src, float c, int len, float* dst)  
{  
    for (int i = 0; i < len; i++) {  
        dst[i] += (src[i] * c);  
    }  
  
    return true;  
}  
  
int CNN::get_index(int x, int y, int channel, int width, int height, int depth)  
{  
    assert(x >= 0 && x < width);  
    assert(y >= 0 && y < height);  
    assert(channel >= 0 && channel < depth);  
    return (height * channel + y) * width + x;  
}  
  
bool CNN::Forward_C1()  
{  
    init_variable(neuron_C1, 0.0, num_neuron_C1_CNN);  
  
    /*for (int i = 0; i < num_map_C1_CNN; i++) { 
        int addr1 = i * width_image_C1_CNN * height_image_C1_CNN; 
        int addr2 = i * width_kernel_conv_CNN * height_kernel_conv_CNN; 
        float* image = &neuron_C1[0] + addr1; 
        const float* weight = &weight_C1[0] + addr2; 
 
        for (int y = 0; y < height_image_C1_CNN; y++) { 
            for (int x = 0; x < width_image_C1_CNN; x++) { 
                float sum = 0.0; 
                const float* image_input = data_single_image + y * width_image_input_CNN + x; 
 
                for (int m = 0; m < height_kernel_conv_CNN; m++) { 
                    for (int n = 0; n < width_kernel_conv_CNN; n++) { 
                        sum += weight[m * width_kernel_conv_CNN + n] * image_input[m * width_image_input_CNN + n]; 
                    } 
                } 
 
                image[y * width_image_C1_CNN + x] = activation_function_tanh(sum + bias_C1[i]); //tanh((w*x + b)) 
            } 
        } 
    }*/  
  
    for (int o = 0; o < num_map_C1_CNN; o++) {  
        for (int inc = 0; inc < num_map_input_CNN; inc++) {  
            int addr1 = get_index(0, 0, num_map_input_CNN * o + inc, width_kernel_conv_CNN, height_kernel_conv_CNN, num_map_C1_CNN);  
            int addr2 = get_index(0, 0, inc, width_image_input_CNN, height_image_input_CNN, num_map_input_CNN);  
            int addr3 = get_index(0, 0, o, width_image_C1_CNN, height_image_C1_CNN, num_map_C1_CNN);  
  
            const float* pw = &weight_C1[0] + addr1;  
            const float* pi = data_single_image + addr2;  
            float* pa = &neuron_C1[0] + addr3;  
  
            for (int y = 0; y < height_image_C1_CNN; y++) {  
                for (int x = 0; x < width_image_C1_CNN; x++) {  
                    const float* ppw = pw;  
                    const float* ppi = pi + y * width_image_input_CNN + x;  
                    float sum = 0.0;  
  
                    for (int wy = 0; wy < height_kernel_conv_CNN; wy++) {  
                        for (int wx = 0; wx < width_kernel_conv_CNN; wx++) {  
                            sum += *ppw++ * ppi[wy * width_image_input_CNN + wx];  
                        }  
                    }  
  
                    pa[y * width_image_C1_CNN + x] += sum;  
                }  
            }  
        }  
  
        int addr3 = get_index(0, 0, o, width_image_C1_CNN, height_image_C1_CNN, num_map_C1_CNN);  
        float* pa = &neuron_C1[0] + addr3;  
        float b = bias_C1[o];  
        for (int y = 0; y < height_image_C1_CNN; y++) {  
            for (int x = 0; x < width_image_C1_CNN; x++) {  
                pa[y * width_image_C1_CNN + x] += b;  
            }  
        }  
    }  
  
    for (int i = 0; i < num_neuron_C1_CNN; i++) {  
        neuron_C1[i] = activation_function_tanh(neuron_C1[i]);  
    }  
  
    return true;  
}  
  
void CNN::calc_out2wi(int width_in, int height_in, int width_out, int height_out, int depth_out, std::vector<wi_connections>& out2wi)
{  
    for (int i = 0; i < depth_out; i++) {  
        int block = width_in * height_in * i;  
  
        for (int y = 0; y < height_out; y++) {  
            for (int x = 0; x < width_out; x++) {  
                int rows = y * height_kernel_pooling_CNN;
                int cols = x * width_kernel_pooling_CNN;

                wi_connections wi_connections_;
                std::pair<int, int> pair_;

                for (int m = 0; m < height_kernel_pooling_CNN; m++) {
                    for (int n = 0; n < width_kernel_pooling_CNN; n++) {
                        pair_.first = i;  
                        pair_.second = (rows + m) * width_in + cols + n + block;  
                        wi_connections_.push_back(pair_);  
                    }  
                }  
                out2wi.push_back(wi_connections_);  
            }  
        }  
    }  
}  
  
void CNN::calc_out2bias(int width, int height, int depth, std::vector<int>& out2bias)  
{  
    for (int i = 0; i < depth; i++) {  
        for (int y = 0; y < height; y++) {  
            for (int x = 0; x < width; x++) {  
                out2bias.push_back(i);  
            }  
        }  
    }  
}  
  
void CNN::calc_in2wo(int width_in, int height_in, int width_out, int height_out, int depth_in, int depth_out, std::vector<wo_connections>& in2wo)
{  
    int len = width_in * height_in * depth_in;  
    in2wo.resize(len);  
  
    for (int c = 0; c < depth_in; c++) {  
        for (int y = 0; y < height_in; y += height_kernel_pooling_CNN) {  
            for (int x = 0; x < width_in; x += width_kernel_pooling_CNN) {  
                int dymax = std::min(size_pooling_CNN, height_in - y);
                int dxmax = std::min(size_pooling_CNN, width_in - x);
                int dstx = x / width_kernel_pooling_CNN;  
                int dsty = y / height_kernel_pooling_CNN;  
  
                for (int dy = 0; dy < dymax; dy++) {  
                    for (int dx = 0; dx < dxmax; dx++) {  
                        int index_in = get_index(x + dx, y + dy, c, width_in, height_in, depth_in);  
                        int index_out = get_index(dstx, dsty, c, width_out, height_out, depth_out);  
  
                        wo_connections wo_connections_;  
                        std::pair<int, int> pair_;  
                        pair_.first = c;  
                        pair_.second = index_out;  
                        wo_connections_.push_back(pair_);  
  
                        in2wo[index_in] = wo_connections_;  
                    }  
                }  
            }  
        }  
    }  
}  
  
void CNN::calc_weight2io(int width_in, int height_in, int width_out, int height_out, int depth_in, int depth_out, std::vector<io_connections>& weight2io)
{  
    int len = depth_in;  
    weight2io.resize(len);  
  
    for (int c = 0; c < depth_in; c++) {  
        for (int y = 0; y < height_in; y += height_kernel_pooling_CNN) {  
            for (int x = 0; x < width_in; x += width_kernel_pooling_CNN) {  
                int dymax = std::min(size_pooling_CNN, height_in - y);
                int dxmax = std::min(size_pooling_CNN, width_in - x);
                int dstx = x / width_kernel_pooling_CNN;  
                int dsty = y / height_kernel_pooling_CNN;  
  
                for (int dy = 0; dy < dymax; dy++) {  
                    for (int dx = 0; dx < dxmax; dx++) {  
                        int index_in = get_index(x + dx, y + dy, c, width_in, height_in, depth_in);  
                        int index_out = get_index(dstx, dsty, c, width_out, height_out, depth_out);  
  
                        std::pair<int, int> pair_;  
                        pair_.first = index_in;  
                        pair_.second = index_out;  
  
                        weight2io[c].push_back(pair_);  
                    }  
                }  
            }  
        }  
    }  
}  
  
void CNN::calc_bias2out(int width_in, int height_in, int width_out, int height_out, int depth_in, int depth_out, std::vector<std::vector<int> >& bias2out)
{  
    int len = depth_in;  
    bias2out.resize(len);  
  
    for (int c = 0; c < depth_in; c++) {  
        for (int y = 0; y < height_out; y++) {  
            for (int x = 0; x < width_out; x++) {  
                int index_out = get_index(x, y, c, width_out, height_out, depth_out);  
                bias2out[c].push_back(index_out);  
            }  
        }  
    }  
}  
  
bool CNN::Forward_S2()  
{  
    init_variable(neuron_S2, 0.0, num_neuron_S2_CNN);  
    float scale_factor = 1.0 / (width_kernel_pooling_CNN * height_kernel_pooling_CNN);  
  
    /*for (int i = 0; i < num_map_S2_CNN; i++) { 
        int addr1 = i * width_image_S2_CNN * height_image_S2_CNN; 
        int addr2 = i * width_image_C1_CNN * height_image_C1_CNN; 
 
        float* image = &neuron_S2[0] + addr1; 
        const float* image_input = &neuron_C1[0] + addr2; 
 
        for (int y = 0; y < height_image_S2_CNN; y++) { 
            for (int x = 0; x < width_image_S2_CNN; x++) { 
                float sum = 0.0; 
                int rows = y * height_kernel_pooling_CNN; 
                int cols = x * width_kernel_pooling_CNN; 
 
                for (int m = 0; m < height_kernel_pooling_CNN; m++) { 
                    for (int n = 0; n < width_kernel_pooling_CNN; n++) { 
                        sum += image_input[(rows + m) * width_image_C1_CNN + cols + n]; 
                    } 
                } 
 
                image[y * width_image_S2_CNN + x] = activation_function_tanh(sum * weight_S2[i] * scale_factor + bias_S2[i]); 
            } 
        } 
    }*/  
  
    assert(out2wi_S2.size() == num_neuron_S2_CNN);  
    assert(out2bias_S2.size() == num_neuron_S2_CNN);  
  
    for (int i = 0; i < num_neuron_S2_CNN; i++) {  
        const wi_connections& connections = out2wi_S2[i];  
        neuron_S2[i] = 0;  
  
        for (size_t index = 0; index < connections.size(); index++) {
            neuron_S2[i] += weight_S2[connections[index].first] * neuron_C1[connections[index].second];  
        }  
  
        neuron_S2[i] *= scale_factor;  
        neuron_S2[i] += bias_S2[out2bias_S2[i]];  
    }  
  
    for (int i = 0; i < num_neuron_S2_CNN; i++) {  
        neuron_S2[i] = activation_function_tanh(neuron_S2[i]);  
    }  
  
    return true;  
}  
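
// Illustrative sketch (not part of the original class): what Forward_S2 computes for one
// output neuron before tanh, assuming a 2x2 window, stride 2, and a single-map input stored
// row by row. The name sketch_avg_pool_2x2 and its parameters are hypothetical.
static inline float sketch_avg_pool_2x2(const float* in, int width_in, int x_out, int y_out, float weight, float bias)
{
    const float* p = in + (2 * y_out) * width_in + 2 * x_out; // top-left corner of the window
    float sum = p[0] + p[1] + p[width_in] + p[width_in + 1];
    return weight * sum * 0.25f + bias; // one shared weight, average over 4 inputs, then bias
}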
  
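// C3 forward pass: each output map sums the 5x5 convolutions of all S2 maps, adds its bias, then applies tanh.
bool CNN::Forward_C3()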
{  
    init_variable(neuron_C3, 0.0, num_neuron_C3_CNN);  
  
    /*for (int i = 0; i < num_map_C3_CNN; i++) { 
        int addr1 = i * width_image_C3_CNN * height_image_C3_CNN; 
        int addr2 = i * width_kernel_conv_CNN * height_kernel_conv_CNN * num_map_S2_CNN; 
        float* image = &neuron_C3[0] + addr1; 
        const float* weight = &weight_C3[0] + addr2; 
 
        for (int j = 0; j < num_map_S2_CNN; j++) { 
            int addr3 = j * width_image_S2_CNN * height_image_S2_CNN; 
            int addr4 = j * width_kernel_conv_CNN * height_kernel_conv_CNN; 
            const float* image_input = &neuron_S2[0] + addr3; 
            const float* weight_ = weight + addr4; 
 
            for (int y = 0; y < height_image_C3_CNN; y++) { 
                for (int x = 0; x < width_image_C3_CNN; x++) { 
                    float sum = 0.0; 
                    const float* image_input_ = image_input + y * width_image_S2_CNN + x; 
 
                    for (int m = 0; m < height_kernel_conv_CNN; m++) { 
                        for (int n = 0; n < width_kernel_conv_CNN; n++) { 
                            sum += weight_[m * width_kernel_conv_CNN + n] * image_input_[m * width_image_S2_CNN + n]; 
                        } 
                    } 
 
                    image[y * width_image_C3_CNN + x] += sum; 
                } 
            } 
        } 
 
        for (int y = 0; y < height_image_C3_CNN; y++) { 
            for (int x = 0; x < width_image_C3_CNN; x++) { 
                image[y * width_image_C3_CNN + x] = activation_function_tanh(image[y * width_image_C3_CNN + x] + bias_C3[i]); 
            } 
        } 
    }*/  
  
    for (int o = 0; o < num_map_C3_CNN; o++) {  
        for (int inc = 0; inc < num_map_S2_CNN; inc++) {  
            int addr1 = get_index(0, 0, num_map_S2_CNN * o + inc, width_kernel_conv_CNN, height_kernel_conv_CNN, num_map_C3_CNN * num_map_S2_CNN);  
            int addr2 = get_index(0, 0, inc, width_image_S2_CNN, height_image_S2_CNN, num_map_S2_CNN);  
            int addr3 = get_index(0, 0, o, width_image_C3_CNN, height_image_C3_CNN, num_map_C3_CNN);  
  
            const float* pw = &weight_C3[0] + addr1;  
            const float* pi = &neuron_S2[0] + addr2;  
            float* pa = &neuron_C3[0] + addr3;  
  
            for (int y = 0; y < height_image_C3_CNN; y++) {  
                for (int x = 0; x < width_image_C3_CNN; x++) {  
                    const float* ppw = pw;  
                    const float* ppi = pi + y * width_image_S2_CNN + x;  
                    float sum = 0.0;  
  
                    for (int wy = 0; wy < height_kernel_conv_CNN; wy++) {  
                        for (int wx = 0; wx < width_kernel_conv_CNN; wx++) {  
                            sum += *ppw++ * ppi[wy * width_image_S2_CNN + wx];  
                        }  
                    }  
  
                    pa[y * width_image_C3_CNN + x] += sum;  
                }  
            }  
        }  
  
        int addr3 = get_index(0, 0, o, width_image_C3_CNN, height_image_C3_CNN, num_map_C3_CNN);  
        float* pa = &neuron_C3[0] + addr3;  
        float b = bias_C3[o];  
        for (int y = 0; y < height_image_C3_CNN; y++) {  
            for (int x = 0; x < width_image_C3_CNN; x++) {  
                pa[y * width_image_C3_CNN + x] += b;  
            }  
        }  
    }  
  
    for (int i = 0; i < num_neuron_C3_CNN; i++) {  
        neuron_C3[i] = activation_function_tanh(neuron_C3[i]);  
    }  
  
    return true;  
}  
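
// Illustrative sketch (not part of the original class): the inner two loops of Forward_C3
// for a single input map and a single output position (x, y), assuming stride 1, no
// padding, and row-major storage. An input of width W therefore yields W - kw + 1 outputs
// per row (14 - 5 + 1 = 10 for C3). The name sketch_conv_at is hypothetical.
static inline float sketch_conv_at(const float* in, int width_in, const float* kernel, int kw, int kh, int x, int y)
{
    float sum = 0.0f;
    for (int wy = 0; wy < kh; wy++) {
        for (int wx = 0; wx < kw; wx++) {
            sum += kernel[wy * kw + wx] * in[(y + wy) * width_in + (x + wx)];
        }
    }
    return sum; // the caller adds this to the output map, then applies bias and tanh
}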
  
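// S4 forward pass: average-pool each C3 map, scale by the map's shared weight, add its bias, then apply tanh.
bool CNN::Forward_S4()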
{  
    float scale_factor = 1.0 / (width_kernel_pooling_CNN * height_kernel_pooling_CNN);  
    init_variable(neuron_S4, 0.0, num_neuron_S4_CNN);  
  
    /*for (int i = 0; i < num_map_S4_CNN; i++) { 
        int addr1 = i * width_image_S4_CNN * height_image_S4_CNN; 
        int addr2 = i * width_image_C3_CNN * height_image_C3_CNN; 
 
        float* image = &neuron_S4[0] + addr1; 
        const float* image_input = &neuron_C3[0] + addr2; 
 
        for (int y = 0; y < height_image_S4_CNN; y++) { 
            for (int x = 0; x < width_image_S4_CNN; x++) { 
                float sum = 0.0; 
                int rows = y * height_kernel_pooling_CNN; 
                int cols = x * width_kernel_pooling_CNN; 
 
                for (int m = 0; m < height_kernel_pooling_CNN; m++) { 
                    for (int n = 0; n < width_kernel_pooling_CNN; n++) { 
                        sum += image_input[(rows + m) * width_image_C3_CNN + cols + n]; 
                    } 
                } 
 
                image[y * width_image_S4_CNN + x] = activation_function_tanh(sum * weight_S4[i] * scale_factor + bias_S4[i]); 
            } 
        } 
    }*/  
  
    assert(out2wi_S4.size() == num_neuron_S4_CNN);  
    assert(out2bias_S4.size() == num_neuron_S4_CNN);  
  
    for (int i = 0; i < num_neuron_S4_CNN; i++) {  
        const wi_connections& connections = out2wi_S4[i];  
        neuron_S4[i] = 0.0;  
  
        for (size_t index = 0; index < connections.size(); index++) {
            neuron_S4[i] += weight_S4[connections[index].first] * neuron_C3[connections[index].second];  
        }  
  
        neuron_S4[i] *= scale_factor;  
        neuron_S4[i] += bias_S4[out2bias_S4[i]];  
    }  
  
    for (int i = 0; i < num_neuron_S4_CNN; i++) {  
        neuron_S4[i] = activation_function_tanh(neuron_S4[i]);  
    }  
  
    //int count_num = 0;  
    //for (int i = 0; i < num_neuron_S4_CNN; i++) {  
    //  if (fabs(neuron_S4[i] - Tmp_neuron_S4[i]) > 0.0000001/*0.0000000001*/) {  
    //      count_num++;  
    //      std::cout << "i = " << i << " , old: " << neuron_S4[i] << " , new: " << Tmp_neuron_S4[i] << std::endl;  
    //  }  
    //}  
    //std::cout << "count_num: " << count_num << std::endl;  
  
    return true;  
}  
  
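// C5 forward pass: each 1x1 output map sums the 5x5 convolutions of all S4 maps, adds its bias, then applies tanh.
bool CNN::Forward_C5()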
{  
    init_variable(neuron_C5, 0.0, num_neuron_C5_CNN);  
  
    /*for (int i = 0; i < num_map_C5_CNN; i++) { 
        int addr1 = i * width_image_C5_CNN * height_image_C5_CNN; 
        int addr2 = i * width_kernel_conv_CNN * height_kernel_conv_CNN * num_map_S4_CNN; 
        float* image = &neuron_C5[0] + addr1; 
        const float* weight = &weight_C5[0] + addr2; 
 
        for (int j = 0; j < num_map_S4_CNN; j++) { 
            int addr3 = j * width_kernel_conv_CNN * height_kernel_conv_CNN; 
            int addr4 = j * width_image_S4_CNN * height_image_S4_CNN; 
            const float* weight_ = weight + addr3; 
            const float* image_input = &neuron_S4[0] + addr4; 
 
            for (int y = 0; y < height_image_C5_CNN; y++) { 
                for (int x = 0; x < width_image_C5_CNN; x++) { 
                    float sum = 0.0; 
                    const float* image_input_ = image_input + y * width_image_S4_CNN + x; 
 
                    for (int m = 0; m < height_kernel_conv_CNN; m++) { 
                        for (int n = 0; n < width_kernel_conv_CNN; n++) { 
                            sum += weight_[m * width_kernel_conv_CNN + n] * image_input_[m * width_image_S4_CNN + n]; 
                        } 
                    } 
 
                    image[y * width_image_C5_CNN + x] += sum; 
                } 
            } 
        } 
 
        for (int y = 0; y < height_image_C5_CNN; y++) { 
            for (int x = 0; x < width_image_C5_CNN; x++) { 
                image[y * width_image_C5_CNN + x] = activation_function_tanh(image[y * width_image_C5_CNN + x] + bias_C5[i]); 
            } 
        } 
    }*/  
  
    for (int o = 0; o < num_map_C5_CNN; o++) {  
        for (int inc = 0; inc < num_map_S4_CNN; inc++) {  
            int addr1 = get_index(0, 0, num_map_S4_CNN * o + inc, width_kernel_conv_CNN, height_kernel_conv_CNN, num_map_C5_CNN * num_map_S4_CNN);  
            int addr2 = get_index(0, 0, inc, width_image_S4_CNN, height_image_S4_CNN, num_map_S4_CNN);  
            int addr3 = get_index(0, 0, o, width_image_C5_CNN, height_image_C5_CNN, num_map_C5_CNN);  
  
            const float *pw = &weight_C5[0] + addr1;  
            const float *pi = &neuron_S4[0] + addr2;  
            float *pa = &neuron_C5[0] + addr3;  
  
            for (int y = 0; y < height_image_C5_CNN; y++) {  
                for (int x = 0; x < width_image_C5_CNN; x++) {  
                    const float *ppw = pw;  
                    const float *ppi = pi + y * width_image_S4_CNN + x;  
                    float sum = 0.0;  
  
                    for (int wy = 0; wy < height_kernel_conv_CNN; wy++) {  
                        for (int wx = 0; wx < width_kernel_conv_CNN; wx++) {  
                            sum += *ppw++ * ppi[wy * width_image_S4_CNN + wx];  
                        }  
                    }  
  
                    pa[y * width_image_C5_CNN + x] += sum;  
                }  
            }  
        }  
  
        int addr3 = get_index(0, 0, o, width_image_C5_CNN, height_image_C5_CNN, num_map_C5_CNN);  
        float *pa = &neuron_C5[0] + addr3;  
        float b = bias_C5[o];  
        for (int y = 0; y < height_image_C5_CNN; y++) {  
            for (int x = 0; x < width_image_C5_CNN; x++) {  
                pa[y * width_image_C5_CNN + x] += b;  
            }  
        }  
    }  
  
    for (int i = 0; i < num_neuron_C5_CNN; i++) {  
        neuron_C5[i] = activation_function_tanh(neuron_C5[i]);  
    }  
  
    return true;  
}  
  
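// Output layer forward pass (fully connected): weighted sum of all C5 neurons plus bias, then tanh.
bool CNN::Forward_output()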
{  
    init_variable(neuron_output, 0.0, num_neuron_output_CNN);  
    /*float* image = &neuron_output[0]; 
    const float* weight = &weight_output[0]; 
 
    for (int i = 0; i < num_neuron_output_CNN; i++) { 
        for (int j = 0; j < num_neuron_C5_CNN; j++) { 
            image[i] += (weight[j * num_neuron_output_CNN + i] * neuron_C5[j]); 
        } 
 
        image[i] = activation_function_tanh(image[i] + bias_output[i]); 
    }*/  
  
    for (int i = 0; i < num_neuron_output_CNN; i++) {  
        neuron_output[i] = 0.0;  
  
        for (int c = 0; c < num_neuron_C5_CNN; c++) {  
            neuron_output[i] += weight_output[c * num_neuron_output_CNN + i] * neuron_C5[c];  
        }  
  
        neuron_output[i] += bias_output[i];  
    }  
  
    for (int i = 0; i < num_neuron_output_CNN; i++) {  
        neuron_output[i] = activation_function_tanh(neuron_output[i]);  
    }  
  
    return true;  
}  
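
// Illustrative sketch (not part of the original class): the pre-activation value of output
// neuron i in a fully connected layer, using the same weight layout as Forward_output
// (weight[c * out_size + i] connects input c to output i). The name sketch_fc_at is hypothetical.
static inline float sketch_fc_at(const float* in, int in_size, const float* weight, const float* bias, int out_size, int i)
{
    float sum = 0.0f;
    for (int c = 0; c < in_size; c++) {
        sum += weight[c * out_size + i] * in[c];
    }
    return sum + bias[i];
}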
  
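// Output layer backward pass: computes the delta of each output neuron from the loss gradient and the tanh derivative.
bool CNN::Backward_output()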
{  
    init_variable(delta_neuron_output, 0.0, num_neuron_output_CNN);  
    /*float gradient[num_neuron_output_CNN]; 
    const float* t = &data_single_label[0]; 
    float tmp[num_neuron_output_CNN]; 
 
    for (int i = 0; i < num_neuron_output_CNN; i++) { 
        gradient[i] = loss_function_mse_derivative(neuron_output[i], t[i]); 
    } 
 
    for (int i = 0; i < num_neuron_output_CNN; i++) { 
        init_variable(tmp, 0.0, num_neuron_output_CNN); 
        tmp[i] = activation_function_tanh_derivative(neuron_output[i]); 
 
        delta_neuron_output[i] = dot_product(gradient, tmp, num_neuron_output_CNN); 
    }*/  
  
    float dE_dy[num_neuron_output_CNN];  
    init_variable(dE_dy, 0.0, num_neuron_output_CNN);  
    loss_function_gradient(neuron_output, data_single_label, dE_dy, num_neuron_output_CNN);  
      
    // delta = dE/da = (dE/dy) * (dy/da)  
    // dy/da is nonzero only at index i, so the dot product reduces to a single multiplication
    for (int i = 0; i < num_neuron_output_CNN; i++) {
        delta_neuron_output[i] = dE_dy[i] * activation_function_tanh_derivative(neuron_output[i]);
    }
  
    return true;  
}  
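
// Illustrative sketch (not part of the original class): the output delta for one neuron,
// assuming loss_function_gradient is the MSE derivative (y - t) and that
// activation_function_tanh_derivative takes the activated output y and returns 1 - y * y.
// Both are assumptions; the actual definitions live elsewhere. The name
// sketch_output_delta is hypothetical.
static inline float sketch_output_delta(float y, float t)
{
    float dE_dy = y - t;        // derivative of 0.5 * (y - t)^2 with respect to y
    float dy_da = 1.0f - y * y; // derivative of tanh expressed through its output
    return dE_dy * dy_da;
}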
  
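// Back-propagates the output-layer deltas to C5 and accumulates the gradients of the output layer's weights and biases.
bool CNN::Backward_C5()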
{  
    init_variable(delta_neuron_C5, 0.0, num_neuron_C5_CNN);  
    init_variable(delta_weight_output, 0.0, len_weight_output_CNN);  
    init_variable(delta_bias_output, 0.0, len_bias_output_CNN);  
  
    /*for (int i = 0; i < num_neuron_C5_CNN; i++) { 
        delta_neuron_C5[i] = dot_product(&delta_neuron_output[0], &weight_output[0] + i * num_neuron_output_CNN, num_neuron_output_CNN); 
        delta_neuron_C5[i] *= activation_function_tanh_derivative(neuron_C5[i]); 
    } 
 
    for (int j = 0; j < num_neuron_C5_CNN; j++) { 
        muladd(&delta_neuron_output[0], neuron_C5[j], num_neuron_output_CNN, &delta_weight_output[0] + j * num_neuron_output_CNN); 
    } 
 
    for (int i = 0; i < num_neuron_output_CNN; i++) { 
        delta_bias_output[i] += delta_neuron_output[i]; 
    }*/  
  
    for (int c = 0; c < num_neuron_C5_CNN; c++) {  
        // propagate delta to previous layer  
        // prev_delta[c] += current_delta[r] * W_[c * out_size_ + r]  
        delta_neuron_C5[c] = dot_product(&delta_neuron_output[0], &weight_output[c * num_neuron_output_CNN], num_neuron_output_CNN);  
        delta_neuron_C5[c] *= activation_function_tanh_derivative(neuron_C5[c]);  
    }  
  
    // accumulate weight-step using delta  
    // dW[c * out_size + i] += current_delta[i] * prev_out[c]  
    for (int c = 0; c < num_neuron_C5_CNN; c++) {  
        muladd(&delta_neuron_output[0], neuron_C5[c], num_neuron_output_CNN, &delta_weight_output[0] + c * num_neuron_output_CNN);  
    }  
  
    for (int i = 0; i < len_bias_output_CNN; i++) {  
        delta_bias_output[i] += delta_neuron_output[i];  
    }  
  
    //int count_num = 0;  
    //for (int i = 0; i < num_neuron_C5_CNN; i++) {  
    //  if (fabs(delta_neuron_C5[i] - Tmp_delta_neuron_C5[i]) > 0.0000001/*0.0000000001*/) {  
    //      count_num++;  
    //  }  
    //}  
    //std::cout << "delta_neuron count_num: " << count_num << std::endl;  
    //count_num = 0;  
    //for (int i = 0; i < len_weight_output_CNN; i++) {  
    //  if (fabs(delta_weight_output[i] - Tmp_delta_weight_output[i]) > 0.0000001/*0.0000000001*/) {  
    //      count_num++;  
    //  }  
    //}  
    //std::cout << "delta_weight count_num: " << count_num << std::endl;  
    //count_num = 0;  
    //for (int i = 0; i < len_bias_output_CNN; i++) {  
    //  if (fabs(delta_bias_output[i] - Tmp_delta_bias_output[i]) > 0.0000001/*0.0000000001*/) {  
    //      count_num++;  
    //  }  
    //}  
    //std::cout << "delta_bias count_num: " << count_num << std::endl;  
  
    return true;  
}  
  
// Back-propagates the C5 deltas to S4 and accumulates the gradients of the C5 weights and biases.
bool CNN::Backward_S4()
{  
    init_variable(delta_neuron_S4, 0.0, num_neuron_S4_CNN);  
    init_variable(delta_weight_C5, 0.0, len_weight_C5_CNN);  
    init_variable(delta_bias_C5, 0.0, len_bias_C5_CNN);  
  
    /*for (int i = 0; i < num_map_S4_CNN; i++) { 
        for (int j = 0; j < num_map_C5_CNN; j++) { 
            int addr1 = width_kernel_conv_CNN * height_kernel_conv_CNN * (num_map_S4_CNN * j + i); 
            int addr2 = width_image_S4_CNN * height_image_S4_CNN * i; 
 
            const float* weight_c5 = &weight_C5[0] + addr1; 
            const float* delta_c5 = &delta_neuron_C5[0] + width_image_C5_CNN * height_image_C5_CNN * j; 
            float* delta_s4 = &delta_neuron_S4[0] + addr2; 
 
            for (int y = 0; y < height_image_C5_CNN; y++) { 
                for (int x = 0; x < width_image_C5_CNN; x++) { 
                    const float* weight_c5_ = weight_c5; 
                    const float delta_c5_ = delta_c5[y * width_image_C5_CNN + x]; 
                    float* delta_s4_ = delta_s4 + y * width_image_S4_CNN + x; 
 
                    for (int m = 0; m < height_kernel_conv_CNN; m++) { 
                        for (int n = 0; n < width_kernel_conv_CNN; n++) { 
                            delta_s4_[m * width_image_S4_CNN + n] += weight_c5_[m * width_kernel_conv_CNN + n] * delta_c5_; 
                        } 
                    } 
                } 
            } 
        } 
    } 
 
    for (int i = 0; i < num_neuron_S4_CNN; i++) { 
        delta_neuron_S4[i] *= activation_function_tanh_derivative(neuron_S4[i]); 
    } 
 
    for (int i = 0; i < num_map_S4_CNN; i++) {//////// 
        for (int j = 0; j < num_map_C5_CNN; j++) { 
            for (int y = 0; y < height_kernel_conv_CNN; y++) { 
                for (int x = 0; x < width_kernel_conv_CNN; x++) { 
                    int addr1 = (height_image_S4_CNN * i + y) * width_image_S4_CNN + x; 
                    int addr2 = (height_kernel_conv_CNN * (num_map_S4_CNN * j + i) + y) * width_kernel_conv_CNN + x; 
                    int addr3 = height_image_C5_CNN * j * width_image_C5_CNN; 
 
                    float dst = 0; 
                    const float* neuron_s4 = &neuron_S4[0] + addr1; 
                    const float* delta_c5 = &delta_neuron_C5[0] + addr3; 
 
                    for (int m = 0; m < height_image_C5_CNN; m++) { 
                        dst += dot_product(neuron_s4 + m * width_image_S4_CNN, delta_c5 + y * width_image_C5_CNN, width_image_C5_CNN); 
                    } 
 
                    delta_weight_C5[addr2] += dst; 
                } 
            } 
        } 
    } 
 
    for (int i = 0; i < num_map_C5_CNN; i++) { 
        delta_bias_C5[i] += delta_neuron_C5[i]; 
    }*/  
  
    // propagate delta to previous layer  
    for (int inc = 0; inc < num_map_S4_CNN; inc++) {  
        for (int outc = 0; outc < num_map_C5_CNN; outc++) {  
            int addr1 = get_index(0, 0, num_map_S4_CNN * outc + inc, width_kernel_conv_CNN, height_kernel_conv_CNN, num_map_S4_CNN * num_map_C5_CNN);  
            int addr2 = get_index(0, 0, outc, width_image_C5_CNN, height_image_C5_CNN, num_map_C5_CNN);  
            int addr3 = get_index(0, 0, inc, width_image_S4_CNN, height_image_S4_CNN, num_map_S4_CNN);  
  
            const float* pw = &weight_C5[0] + addr1;  
            const float* pdelta_src = &delta_neuron_C5[0] + addr2;  
            float* pdelta_dst = &delta_neuron_S4[0] + addr3;  
  
            for (int y = 0; y < height_image_C5_CNN; y++) {  
                for (int x = 0; x < width_image_C5_CNN; x++) {  
                    const float* ppw = pw;  
                    const float ppdelta_src = pdelta_src[y * width_image_C5_CNN + x];  
                    float* ppdelta_dst = pdelta_dst + y * width_image_S4_CNN + x;  
  
                    for (int wy = 0; wy < height_kernel_conv_CNN; wy++) {  
                        for (int wx = 0; wx < width_kernel_conv_CNN; wx++) {  
                            ppdelta_dst[wy * width_image_S4_CNN + wx] += *ppw++ * ppdelta_src;  
                        }  
                    }  
                }  
            }  
        }  
    }  
  
    for (int i = 0; i < num_neuron_S4_CNN; i++) {  
        delta_neuron_S4[i] *= activation_function_tanh_derivative(neuron_S4[i]);  
    }  
  
    // accumulate dw  
    for (int inc = 0; inc < num_map_S4_CNN; inc++) {  
        for (int outc = 0; outc < num_map_C5_CNN; outc++) {  
            for (int wy = 0; wy < height_kernel_conv_CNN; wy++) {  
                for (int wx = 0; wx < width_kernel_conv_CNN; wx++) {  
                    int addr1 = get_index(wx, wy, inc, width_image_S4_CNN, height_image_S4_CNN, num_map_S4_CNN);  
                    int addr2 = get_index(0, 0, outc, width_image_C5_CNN, height_image_C5_CNN, num_map_C5_CNN);  
                    int addr3 = get_index(wx, wy, num_map_S4_CNN * outc + inc, width_kernel_conv_CNN, height_kernel_conv_CNN, num_map_S4_CNN * num_map_C5_CNN);  
  
                    float dst = 0.0;  
                    const float* prevo = &neuron_S4[0] + addr1;  
                    const float* delta = &delta_neuron_C5[0] + addr2;  
  
                    for (int y = 0; y < height_image_C5_CNN; y++) {  
                        dst += dot_product(prevo + y * width_image_S4_CNN, delta + y * width_image_C5_CNN, width_image_C5_CNN);  
                    }  
  
                    delta_weight_C5[addr3] += dst;  
                }  
            }  
        }  
    }  
  
    // accumulate db  
    for (int outc = 0; outc < num_map_C5_CNN; outc++) {  
        int addr2 = get_index(0, 0, outc, width_image_C5_CNN, height_image_C5_CNN, num_map_C5_CNN);  
        const float* delta = &delta_neuron_C5[0] + addr2;  
  
        for (int y = 0; y < height_image_C5_CNN; y++) {  
            for (int x = 0; x < width_image_C5_CNN; x++) {  
                delta_bias_C5[outc] += delta[y * width_image_C5_CNN + x];  
            }  
        }  
    }  
  
    return true;  
}  
  
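// Back-propagates the S4 deltas through the pooling layer to C3 and accumulates the gradients of the S4 weights and biases.
bool CNN::Backward_C3()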
{  
    init_variable(delta_neuron_C3, 0.0, num_neuron_C3_CNN);  
    init_variable(delta_weight_S4, 0.0, len_weight_S4_CNN);  
    init_variable(delta_bias_S4, 0.0, len_bias_S4_CNN);  
  
    float scale_factor = 1.0 / (width_kernel_pooling_CNN * height_kernel_pooling_CNN);  
  
    /*for (int i = 0; i < num_map_C3_CNN; i++) { 
        int addr1 = width_image_S4_CNN * height_image_S4_CNN * i; 
        int addr2 = width_image_C3_CNN * height_image_C3_CNN * i; 
 
        const float* delta_s4 = &delta_neuron_S4[0] + addr1; 
        float* delta_c3 = &delta_neuron_C3[0] + addr2; 
        const float* neuron_c3 = &neuron_C3[0] + addr2; 
 
        for (int y = 0; y < height_image_C3_CNN; y++) { 
            for (int x = 0; x < width_image_C3_CNN; x++) { 
                float delta = 0.0; 
                int index = width_image_S4_CNN * (y / height_kernel_pooling_CNN) + x / width_kernel_pooling_CNN; 
                delta = weight_S4[i] * delta_s4[index]; 
 
                delta_c3[y * width_image_C3_CNN + x] = delta * scale_factor * activation_function_tanh_derivative(neuron_c3[y * width_image_C3_CNN + x]); 
            } 
        } 
    } 
 
    for (int i = 0; i < len_weight_S4_CNN; i++) { 
        int addr1 = width_image_C3_CNN * height_image_C3_CNN * i; 
        int addr2 = width_image_S4_CNN * height_image_S4_CNN * i; 
 
        const float* neuron_c3 = &neuron_C3[0] + addr1; 
        const float* delta_s4 = &delta_neuron_S4[0] + addr2; 
 
        float diff = 0.0; 
 
        for (int y = 0; y < height_image_C3_CNN; y++) { 
            for (int x = 0; x < width_image_C3_CNN; x++) { 
                int index = y / height_kernel_pooling_CNN * height_image_S4_CNN + x / width_kernel_pooling_CNN; 
 
                diff += neuron_c3[y * width_image_C3_CNN + x] * delta_s4[index]; 
            } 
        } 
 
        delta_weight_S4[i] += diff * scale_factor; 
    } 
 
    for (int i = 0; i < len_bias_S4_CNN; i++) { 
        int addr1 = width_image_S4_CNN * height_image_S4_CNN * i; 
        const float* delta_s4 = &delta_neuron_S4[0] + addr1; 
        float diff = 0; 
 
        for (int y = 0; y < height_image_S4_CNN; y++) { 
            for (int x = 0; x < width_image_S4_CNN; x++) { 
                diff += delta_s4[y * width_image_S4_CNN + x]; 
            } 
        } 
 
        delta_bias_S4[i] += diff; 
    }*/  
  
    assert(in2wo_C3.size() == num_neuron_C3_CNN);  
    assert(weight2io_C3.size() == len_weight_S4_CNN);  
    assert(bias2out_C3.size() == len_bias_S4_CNN);  
  
    for (int i = 0; i < num_neuron_C3_CNN; i++) {  
        const wo_connections& connections = in2wo_C3[i];  
        float delta = 0.0;  
  
        for (size_t j = 0; j < connections.size(); j++) {
            delta += weight_S4[connections[j].first] * delta_neuron_S4[connections[j].second];  
        }  
  
        delta_neuron_C3[i] = delta * scale_factor * activation_function_tanh_derivative(neuron_C3[i]);  
    }  
  
    for (int i = 0; i < len_weight_S4_CNN; i++) {  
        const io_connections& connections = weight2io_C3[i];  
        float diff = 0;  
  
        for (size_t j = 0; j < connections.size(); j++) {
            diff += neuron_C3[connections[j].first] * delta_neuron_S4[connections[j].second];  
        }  
  
        delta_weight_S4[i] += diff * scale_factor;  
    }  
  
    for (int i = 0; i < len_bias_S4_CNN; i++) {  
        const std::vector<int>& outs = bias2out_C3[i];  
        float diff = 0;  
  
        for (size_t o = 0; o < outs.size(); o++) {
            diff += delta_neuron_S4[outs[o]];  
        }  
  
        delta_bias_S4[i] += diff;  
    }  
  
    return true;  
}  
  
// Back-propagates the C3 deltas to S2 and accumulates the gradients of the C3 weights and biases.
bool CNN::Backward_S2()
{  
    init_variable(delta_neuron_S2, 0.0, num_neuron_S2_CNN);  
    init_variable(delta_weight_C3, 0.0, len_weight_C3_CNN);  
    init_variable(delta_bias_C3, 0.0, len_bias_C3_CNN);  
  
    /*for (int i = 0; i < num_map_S2_CNN; i++) {//////////////// 
        int addr1 = width_kernel_conv_CNN * height_kernel_conv_CNN * num_map_C3_CNN * i; 
        int addr2 = width_kernel_conv_CNN * height_kernel_conv_CNN * i; 
        for (int j = 0; j < num_map_C3_CNN; j++) { 
            const float* weight_c3 = &weight_C3[0] + addr1 + j * width_kernel_conv_CNN * height_kernel_conv_CNN; 
            const float* delta_c3 = &delta_neuron_C3[0] + width_image_C3_CNN * height_image_C3_CNN * j; 
            float* delta_s2 = &delta_neuron_S2[0] + addr2; 
 
            for (int y = 0; y < height_image_C3_CNN; y++) { 
                for (int x = 0; x < width_image_C3_CNN; x++) { 
                    const float* weight_c3_ = weight_c3; 
                    const float delta_c3_ = delta_c3[y * width_image_C3_CNN + x]; 
                    float* delta_s2_ = delta_s2 + y * width_kernel_conv_CNN + x; 
 
                    for (int m = 0; m < height_kernel_conv_CNN; m++) { 
                        for (int n = 0; n < width_kernel_conv_CNN; n++) { 
                            delta_s2_[m * width_kernel_conv_CNN + n] += weight_c3_[m * width_kernel_conv_CNN + n] * delta_c3_; 
                        } 
                    } 
                } 
            } 
        } 
    } 
 
    for (int i = 0; i < num_neuron_S2_CNN; i++) { 
        delta_neuron_S2[i] *= activation_function_tanh_derivative(neuron_S2[i]); 
    } 
 
    for (int i = 0; i < num_map_S2_CNN; i++) {////////////////// 
        int addr1 = width_kernel_conv_CNN * height_kernel_conv_CNN * i; 
 
        for (int j = 0; j < num_map_C3_CNN; j++) { 
            int addr2 = width_kernel_conv_CNN * height_kernel_conv_CNN * i * j; 
            float* delta_weight_c3 = &delta_weight_C3[0] + addr2; 
 
            for (int y = 0; y < height_kernel_conv_CNN; y++) { 
                for (int x = 0; x < width_kernel_conv_CNN; x++) { 
                    float dst = 0; 
                    const float* neuron_s2 = &neuron_S2[0] + addr1 + y * width_kernel_conv_CNN + x; 
                    const float* delta_c3 = &delta_neuron_C3[0] + width_image_C3_CNN * height_image_C3_CNN * j; 
 
                    for (int m = 0; m < height_image_C3_CNN; m++) { 
                        dst += dot_product(neuron_s2 + m * width_kernel_conv_CNN, delta_c3 + y * width_image_C3_CNN, width_image_C3_CNN); 
                    } 
 
                    delta_weight_c3[y * width_kernel_conv_CNN + x] += dst; 
                } 
            } 
        } 
    } 
 
    for (int i = 0; i < num_map_C3_CNN; i++) { 
        const float* delta = &delta_neuron_C3[0] + width_image_C3_CNN * height_image_C3_CNN * i; 
 
        //delta_bias_C3[i] += std::accumulate(delta, delta + width_image_C3_CNN * height_image_C3_CNN, (float)0.0); 
        for (int y = 0; y < height_image_C3_CNN; y++) { 
            for (int x = 0; x < width_image_C3_CNN; x++) { 
                delta_bias_C3[i] += delta[y * width_image_C3_CNN + x]; 
            } 
        } 
    }*/  
  
    // propagate delta to previous layer  
    for (int inc = 0; inc < num_map_S2_CNN; inc++) {  
        for (int outc = 0; outc < num_map_C3_CNN; outc++) {  
            int addr1 = get_index(0, 0, num_map_S2_CNN * outc + inc, width_kernel_conv_CNN, height_kernel_conv_CNN, num_map_S2_CNN * num_map_C3_CNN);  
            int addr2 = get_index(0, 0, outc, width_image_C3_CNN, height_image_C3_CNN, num_map_C3_CNN);  
            int addr3 = get_index(0, 0, inc, width_image_S2_CNN, height_image_S2_CNN, num_map_S2_CNN);  
  
            const float *pw = &weight_C3[0] + addr1;  
            const float *pdelta_src = &delta_neuron_C3[0] + addr2;
            float* pdelta_dst = &delta_neuron_S2[0] + addr3;  
  
            for (int y = 0; y < height_image_C3_CNN; y++) {  
                for (int x = 0; x < width_image_C3_CNN; x++) {  
                    const float* ppw = pw;  
                    const float ppdelta_src = pdelta_src[y * width_image_C3_CNN + x];  
                    float* ppdelta_dst = pdelta_dst + y * width_image_S2_CNN + x;  
  
                    for (int wy = 0; wy < height_kernel_conv_CNN; wy++) {  
                        for (int wx = 0; wx < width_kernel_conv_CNN; wx++) {  
                            ppdelta_dst[wy * width_image_S2_CNN + wx] += *ppw++ * ppdelta_src;  
                        }  
                    }  
                }  
            }  
        }  
    }  
  
    for (int i = 0; i < num_neuron_S2_CNN; i++) {  
        delta_neuron_S2[i] *= activation_function_tanh_derivative(neuron_S2[i]);  
    }  
  
    // accumulate dw  
    for (int inc = 0; inc < num_map_S2_CNN; inc++) {  
        for (int outc = 0; outc < num_map_C3_CNN; outc++) {  
            for (int wy = 0; wy < height_kernel_conv_CNN; wy++) {  
                for (int wx = 0; wx < width_kernel_conv_CNN; wx++) {  
                    int addr1 = get_index(wx, wy, inc, width_image_S2_CNN, height_image_S2_CNN, num_map_S2_CNN);  
                    int addr2 = get_index(0, 0, outc, width_image_C3_CNN, height_image_C3_CNN, num_map_C3_CNN);  
                    int addr3 = get_index(wx, wy, num_map_S2_CNN * outc + inc, width_kernel_conv_CNN, height_kernel_conv_CNN, num_map_S2_CNN * num_map_C3_CNN);  
                      
                    float dst = 0.0;  
                    const float* prevo = &neuron_S2[0] + addr1;  
                    const float* delta = &delta_neuron_C3[0] + addr2;  
  
                    for (int y = 0; y < height_image_C3_CNN; y++) {  
                        dst += dot_product(prevo + y * width_image_S2_CNN, delta + y * width_image_C3_CNN, width_image_C3_CNN);  
                    }  
  
                    delta_weight_C3[addr3] += dst;  
                }  
            }  
        }  
    }  
  
    // accumulate db  
    for (int outc = 0; outc < len_bias_C3_CNN; outc++) {  
        int addr1 = get_index(0, 0, outc, width_image_C3_CNN, height_image_C3_CNN, num_map_C3_CNN);  
        const float* delta = &delta_neuron_C3[0] + addr1;  
  
        for (int y = 0; y < height_image_C3_CNN; y++) {  
            for (int x = 0; x < width_image_C3_CNN; x++) {  
                delta_bias_C3[outc] += delta[y * width_image_C3_CNN + x];  
            }  
        }  
    }  
  
    return true;  
}  
  
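// Back-propagates the S2 deltas through the pooling layer to C1 and accumulates the gradients of the S2 weights and biases.
bool CNN::Backward_C1()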
{  
    init_variable(delta_neuron_C1, 0.0, num_neuron_C1_CNN);  
    init_variable(delta_weight_S2, 0.0, len_weight_S2_CNN);  
    init_variable(delta_bias_S2, 0.0, len_bias_S2_CNN);  
  
    float scale_factor = 1.0 / (width_kernel_pooling_CNN * height_kernel_pooling_CNN);  
  
    /*for (int i = 0; i < num_map_C1_CNN; i++) { 
        int addr1 = width_image_S2_CNN * height_image_S2_CNN * i; 
        int addr2 = width_image_C1_CNN * height_image_C1_CNN * i; 
 
        const float* delta_s2 = &delta_neuron_S2[0] + addr1; 
        float* delta_c1 = &delta_neuron_C1[0] + addr2; 
        const float* neuron_c1 = &neuron_C1[0] + addr2; 
 
        for (int y = 0; y < height_image_C1_CNN; y++) { 
            for (int x = 0; x < width_image_C1_CNN; x++) { 
                float delta = 0.0; 
                int index = width_image_S2_CNN * (y / height_kernel_pooling_CNN) + x / width_kernel_pooling_CNN; 
                delta = weight_S2[i] * delta_s2[index]; 
 
                delta_c1[y * width_image_C1_CNN + x] = delta * scale_factor * activation_function_tanh_derivative(neuron_c1[y * width_image_C1_CNN + x]); 
            } 
        } 
    } 
 
    for (int i = 0; i < len_weight_S2_CNN; i++) { 
        int addr1 = width_image_C1_CNN * height_image_C1_CNN * i; 
        int addr2 = width_image_S2_CNN * height_image_S2_CNN * i; 
 
        const float* neuron_c1 = &neuron_C1[0] + addr1; 
        const float* delta_s2 = &delta_neuron_S2[0] + addr2; 
 
        float diff = 0.0; 
 
        for (int y = 0; y < height_image_C1_CNN; y++) { 
            for (int x = 0; x < width_image_C1_CNN; x++) { 
                int index = y / height_kernel_pooling_CNN * height_image_S2_CNN + x / width_kernel_pooling_CNN; 
 
                diff += neuron_c1[y * width_image_C1_CNN + x] * delta_s2[index]; 
            } 
        } 
 
        delta_weight_S2[i] += diff * scale_factor; 
    } 
 
    for (int i = 0; i < len_bias_S2_CNN; i++) { 
        int addr1 = width_image_S2_CNN * height_image_S2_CNN * i; 
        const float* delta_s2 = &delta_neuron_S2[0] + addr1; 
        float diff = 0; 
 
        for (int y = 0; y < height_image_S2_CNN; y++) { 
            for (int x = 0; x < width_image_S2_CNN; x++) { 
                diff += delta_s2[y * width_image_S2_CNN + x]; 
            } 
        } 
 
        delta_bias_S2[i] += diff; 
    }*/  
  
    assert(in2wo_C1.size() == num_neuron_C1_CNN);  
    assert(weight2io_C1.size() == len_weight_S2_CNN);  
    assert(bias2out_C1.size() == len_bias_S2_CNN);  
  
    // propagate delta to previous layer (C1), one neuron at a time
    for (int i = 0; i < num_neuron_C1_CNN; i++) {
        const wo_connections& connections = in2wo_C1[i];  
        float delta = 0.0;  
  
        for (int j = 0; j < connections.size(); j++) {  
            delta += weight_S2[connections[j].first] * delta_neuron_S2[connections[j].second];  
        }  
  
        delta_neuron_C1[i] = delta * scale_factor * activation_function_tanh_derivative(neuron_C1[i]);  
    }  
  
    // accumulate dw for the S2 pooling weights
    for (int i = 0; i < len_weight_S2_CNN; i++) {
        const io_connections& connections = weight2io_C1[i];  
        float diff = 0.0;  
  
        for (int j = 0; j < connections.size(); j++) {  
            diff += neuron_C1[connections[j].first] * delta_neuron_S2[connections[j].second];  
        }  
  
        delta_weight_S2[i] += diff * scale_factor;  
    }  
  
    // accumulate db for the S2 biases
    for (int i = 0; i < len_bias_S2_CNN; i++) {
        const std::vector<int>& outs = bias2out_C1[i];  
        float diff = 0;  
  
        for (int o = 0; o < outs.size(); o++) {  
            diff += delta_neuron_S2[outs[o]];  
        }  
  
        delta_bias_S2[i] += diff;  
    }  
  
    return true;  
}  
  
bool CNN::Backward_input()  
{  
    init_variable(delta_neuron_input, 0.0, num_neuron_input_CNN);  
    init_variable(delta_weight_C1, 0.0, len_weight_C1_CNN);  
    init_variable(delta_bias_C1, 0.0, len_bias_C1_CNN);  
  
    /*for (int i = 0; i < num_map_input_CNN; i++) {/////////////////// 
        int addr1 = width_kernel_conv_CNN * height_kernel_conv_CNN * num_map_C1_CNN * i; 
        int addr2 = width_image_input_CNN * height_image_input_CNN * i; 
        for (int j = 0; j < num_map_C1_CNN; j++) { 
            const float* weight_c1 = &weight_C1[0] + addr1 + j * width_kernel_conv_CNN * height_kernel_conv_CNN; 
            const float* delta_c1 = &delta_neuron_C1[0] + width_image_C1_CNN * height_image_C1_CNN * j; 
            float* delta_input_ = &delta_neuron_input[0] + addr2; 
 
            for (int y = 0; y < height_image_C1_CNN; y++) { 
                for (int x = 0; x < width_image_C1_CNN; x++) { 
                    const float* weight_c1_ = weight_c1; 
                    const float delta_c1_ = delta_c1[y * width_image_C1_CNN + x]; 
                    float* delta_input_0 = delta_input_ + y * width_image_C1_CNN + x; 
 
                    for (int m = 0; m < height_kernel_conv_CNN; m++) { 
                        for (int n = 0; n < width_kernel_conv_CNN; n++) { 
                            delta_input_0[m * width_image_input_CNN + n] += weight_c1_[m * width_kernel_conv_CNN + n] * delta_c1_; 
                        } 
                    } 
                } 
            } 
        } 
    } 
 
    for (int i = 0; i < num_neuron_input_CNN; i++) { 
        delta_neuron_input[i] *= activation_function_identity_derivative(data_single_image[i]); 
    } 
 
    for (int i = 0; i < num_map_input_CNN; i++) {///////////// 
        int addr1 = width_image_input_CNN * height_image_input_CNN * i; 
 
        for (int j = 0; j < num_map_C1_CNN; j++) { 
            int addr2 = width_kernel_conv_CNN * height_kernel_conv_CNN * i * j; 
            float* delta_weight_c1 = &delta_weight_C1[0] + addr2; 
 
            for (int y = 0; y < height_kernel_conv_CNN; y++) { 
                for (int x = 0; x < width_kernel_conv_CNN; x++) { 
                    float dst = 0; 
                    const float* neuron_input_ = data_single_image + addr1 + y * width_image_input_CNN + x; 
                    const float* delta_c1 = &delta_neuron_C1[0] + width_image_C1_CNN * height_image_C1_CNN * j; 
 
                    for (int m = 0; m < height_image_C1_CNN; m++) { 
                        dst += dot_product(neuron_input_ + m * width_kernel_conv_CNN, delta_c1 + y * width_image_C1_CNN, width_image_C1_CNN); 
                    } 
 
                    delta_weight_c1[y * width_kernel_conv_CNN + x] += dst; 
                } 
            } 
        } 
    } 
 
    for (int i = 0; i < num_map_C1_CNN; i++) { 
        const float* delta = &delta_neuron_C1[0] + width_image_C1_CNN * height_image_C1_CNN * i; 
 
        //delta_bias_C1[i] += std::accumulate(delta, delta + width_image_C1_CNN * height_image_C1_CNN, (float)0.0); 
        for (int y = 0; y < height_image_C1_CNN; y++) { 
            for (int x = 0; x < width_image_C1_CNN; x++) { 
                delta_bias_C1[i] += delta[y * width_image_C1_CNN + x]; 
            } 
        } 
    }*/  
  
    // propagate delta to previous layer  
    for (int inc = 0; inc < num_map_input_CNN; inc++) {  
        for (int outc = 0; outc < num_map_C1_CNN; outc++) {  
            int addr1 = get_index(0, 0, num_map_input_CNN * outc + inc, width_kernel_conv_CNN, height_kernel_conv_CNN, num_map_C1_CNN);  
            int addr2 = get_index(0, 0, outc, width_image_C1_CNN, height_image_C1_CNN, num_map_C1_CNN);  
            int addr3 = get_index(0, 0, inc, width_image_input_CNN, height_image_input_CNN, num_map_input_CNN);  
  
            const float* pw = &weight_C1[0] + addr1;  
            const float* pdelta_src = &delta_neuron_C1[0] + addr2;  
            float* pdelta_dst = &delta_neuron_input[0] + addr3;  
  
            for (int y = 0; y < height_image_C1_CNN; y++) {  
                for (int x = 0; x < width_image_C1_CNN; x++) {  
                    const float* ppw = pw;  
                    const float ppdelta_src = pdelta_src[y * width_image_C1_CNN + x];  
                    float* ppdelta_dst = pdelta_dst + y * width_image_input_CNN + x;  
  
                    for (int wy = 0; wy < height_kernel_conv_CNN; wy++) {  
                        for (int wx = 0; wx < width_kernel_conv_CNN; wx++) {  
                            ppdelta_dst[wy * width_image_input_CNN + wx] += *ppw++ * ppdelta_src;  
                        }  
                    }  
                }  
            }  
        }  
    }  
  
    for (int i = 0; i < num_neuron_input_CNN; i++) {  
        delta_neuron_input[i] *= activation_function_identity_derivative(data_single_image[i]/*neuron_input[i]*/);  
    }  
  
    // accumulate dw  
    for (int inc = 0; inc < num_map_input_CNN; inc++) {  
        for (int outc = 0; outc < num_map_C1_CNN; outc++) {  
            for (int wy = 0; wy < height_kernel_conv_CNN; wy++) {  
                for (int wx = 0; wx < width_kernel_conv_CNN; wx++) {  
                    int addr1 = get_index(wx, wy, inc, width_image_input_CNN, height_image_input_CNN, num_map_input_CNN);  
                    int addr2 = get_index(0, 0, outc, width_image_C1_CNN, height_image_C1_CNN, num_map_C1_CNN);  
                    int addr3 = get_index(wx, wy, num_map_input_CNN * outc + inc, width_kernel_conv_CNN, height_kernel_conv_CNN, num_map_C1_CNN);  
  
                    float dst = 0.0;  
                    const float* prevo = data_single_image + addr1;//&neuron_input[0]  
                    const float* delta = &delta_neuron_C1[0] + addr2;  
  
                    for (int y = 0; y < height_image_C1_CNN; y++) {  
                        dst += dot_product(prevo + y * width_image_input_CNN, delta + y * width_image_C1_CNN, width_image_C1_CNN);  
                    }  
  
                    delta_weight_C1[addr3] += dst;  
                }  
            }  
        }  
    }  
  
    // accumulate db  
    for (int outc = 0; outc < len_bias_C1_CNN; outc++) {  
        int addr1 = get_index(0, 0, outc, width_image_C1_CNN, height_image_C1_CNN, num_map_C1_CNN);  
        const float* delta = &delta_neuron_C1[0] + addr1;  
  
        for (int y = 0; y < height_image_C1_CNN; y++) {  
            for (int x = 0; x < width_image_C1_CNN; x++) {  
                delta_bias_C1[outc] += delta[y * width_image_C1_CNN + x];  
            }  
        }  
    }  
  
    return true;  
}  
  
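// Applies one update step to a weight or bias array.
// tmp holds the squared gradient of the current sample only (nothing is accumulated across steps),
// so delta[i] / (sqrt(tmp) + eps_CNN) is approximately sign(delta[i]) and every non-zero
// gradient moves its weight by about learning_rate_CNN.
void CNN::update_weights_bias(const float* delta, float* weight, int len)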
{  
    for (int i = 0; i < len; i++) {  
        float tmp = delta[i] * delta[i];  
        weight[i] -= learning_rate_CNN * delta[i] / (std::sqrt(tmp) + eps_CNN);  
    }  
}  
  
bool CNN::UpdateWeights()  
{  
    update_weights_bias(delta_weight_C1, weight_C1, len_weight_C1_CNN);  
    update_weights_bias(delta_bias_C1, bias_C1, len_bias_C1_CNN);  
  
    update_weights_bias(delta_weight_S2, weight_S2, len_weight_S2_CNN);  
    update_weights_bias(delta_bias_S2, bias_S2, len_bias_S2_CNN);  
  
    update_weights_bias(delta_weight_C3, weight_C3, len_weight_C3_CNN);  
    update_weights_bias(delta_bias_C3, bias_C3, len_bias_C3_CNN);  
  
    update_weights_bias(delta_weight_S4, weight_S4, len_weight_S4_CNN);  
    update_weights_bias(delta_bias_S4, bias_S4, len_bias_S4_CNN);  
  
    update_weights_bias(delta_weight_C5, weight_C5, len_weight_C5_CNN);  
    update_weights_bias(delta_bias_C5, bias_C5, len_bias_C5_CNN);  
  
    update_weights_bias(delta_weight_output, weight_output, len_weight_output_CNN);  
    update_weights_bias(delta_bias_output, bias_output, len_bias_output_CNN);  
  
    return true;  
}  
  
int CNN::predict(const unsigned char* data, int width, int height)  
{  
    assert(data && width == width_image_input_CNN && height == height_image_input_CNN);  
  
    const float scale_min = -1;  
    const float scale_max = 1;  
  
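    // Rescale the 8-bit pixels from [0, 255] to [scale_min, scale_max], the range used for the training data.
    float tmp[width_image_input_CNN * height_image_input_CNN];
    for (int y = 0; y < height; y++) {
        for (int x = 0; x < width; x++) {
            tmp[y * width + x] = (data[y * width + x] / 255.0) * (scale_max - scale_min) + scale_min;
        }
    }

    // data_single_image points at the local buffer, so it is only valid for the duration of this call.
    data_single_image = &tmp[0];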
  
    Forward_C1();  
    Forward_S2();  
    Forward_C3();  
    Forward_S4();  
    Forward_C5();  
    Forward_output();  
  
    // The predicted digit is the index of the largest output neuron.
    int pos = -1;
    float max_value = -9999.0;
  
    for (int i = 0; i < num_neuron_output_CNN; i++) {  
        if (neuron_output[i] > max_value) {  
            max_value = neuron_output[i];  
            pos = i;  
        }  
    }  
  
    return pos;  
}  
  
bool CNN::readModelFile(const char* name)  
{  
    FILE* fp = fopen(name, "rb");  
    if (fp == NULL) {  
        return false;  
    }  
  
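    // Header fields stored in the model file. They are read below to advance the file position,
    // but this function does not compare them against the compiled-in *_CNN constants.
    int width_image_input = 0;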
    int height_image_input = 0;  
    int width_image_C1 = 0;  
    int height_image_C1 = 0;  
    int width_image_S2 = 0;  
    int height_image_S2 = 0;  
    int width_image_C3 = 0;  
    int height_image_C3 = 0;  
    int width_image_S4 = 0;  
    int height_image_S4 = 0;  
    int width_image_C5 = 0;  
    int height_image_C5 = 0;  
    int width_image_output = 0;  
    int height_image_output = 0;  
  
    int width_kernel_conv = 0;  
    int height_kernel_conv = 0;  
    int width_kernel_pooling = 0;  
    int height_kernel_pooling = 0;  
  
    int num_map_input = 0;  
    int num_map_C1 = 0;  
    int num_map_S2 = 0;  
    int num_map_C3 = 0;  
    int num_map_S4 = 0;  
    int num_map_C5 = 0;  
    int num_map_output = 0;  
  
    int len_weight_C1 = 0;  
    int len_bias_C1 = 0;  
    int len_weight_S2 = 0;  
    int len_bias_S2 = 0;  
    int len_weight_C3 = 0;  
    int len_bias_C3 = 0;  
    int len_weight_S4 = 0;  
    int len_bias_S4 = 0;  
    int len_weight_C5 = 0;  
    int len_bias_C5 = 0;  
    int len_weight_output = 0;  
    int len_bias_output = 0;  
  
    int num_neuron_input = 0;  
    int num_neuron_C1 = 0;  
    int num_neuron_S2 = 0;  
    int num_neuron_C3 = 0;  
    int num_neuron_S4 = 0;  
    int num_neuron_C5 = 0;  
    int num_neuron_output = 0;  
  
    fread(&width_image_input, sizeof(int), 1, fp);  
    fread(&height_image_input, sizeof(int), 1, fp);  
    fread(&width_image_C1, sizeof(int), 1, fp);  
    fread(&height_image_C1, sizeof(int), 1, fp);  
    fread(&width_image_S2, sizeof(int), 1, fp);  
    fread(&height_image_S2, sizeof(int), 1, fp);  
    fread(&width_image_C3, sizeof(int), 1, fp);  
    fread(&height_image_C3, sizeof(int), 1, fp);  
    fread(&width_image_S4, sizeof(int), 1, fp);  
    fread(&height_image_S4, sizeof(int), 1, fp);  
    fread(&width_image_C5, sizeof(int), 1, fp);  
    fread(&height_image_C5, sizeof(int), 1, fp);  
    fread(&width_image_output, sizeof(int), 1, fp);  
    fread(&height_image_output, sizeof(int), 1, fp);  
  
    fread(&width_kernel_conv, sizeof(int), 1, fp);  
    fread(&height_kernel_conv, sizeof(int), 1, fp);  
    fread(&width_kernel_pooling, sizeof(int), 1, fp);  
    fread(&height_kernel_pooling, sizeof(int), 1, fp);  
  
    fread(&num_map_input, sizeof(int), 1, fp);  
    fread(&num_map_C1, sizeof(int), 1, fp);  
    fread(&num_map_S2, sizeof(int), 1, fp);  
    fread(&num_map_C3, sizeof(int), 1, fp);  
    fread(&num_map_S4, sizeof(int), 1, fp);  
    fread(&num_map_C5, sizeof(int), 1, fp);  
    fread(&num_map_output, sizeof(int), 1, fp);  
  
    fread(&len_weight_C1, sizeof(int), 1, fp);  
    fread(&len_bias_C1, sizeof(int), 1, fp);  
    fread(&len_weight_S2, sizeof(int), 1, fp);  
    fread(&len_bias_S2, sizeof(int), 1, fp);  
    fread(&len_weight_C3, sizeof(int), 1, fp);  
    fread(&len_bias_C3, sizeof(int), 1, fp);  
    fread(&len_weight_S4, sizeof(int), 1, fp);  
    fread(&len_bias_S4, sizeof(int), 1, fp);  
    fread(&len_weight_C5, sizeof(int), 1, fp);  
    fread(&len_bias_C5, sizeof(int), 1, fp);  
    fread(&len_weight_output, sizeof(int), 1, fp);  
    fread(&len_bias_output, sizeof(int), 1, fp);  
  
    fread(&num_neuron_input, sizeof(int), 1, fp);  
    fread(&num_neuron_C1, sizeof(int), 1, fp);  
    fread(&num_neuron_S2, sizeof(int), 1, fp);  
    fread(&num_neuron_C3, sizeof(int), 1, fp);  
    fread(&num_neuron_S4, sizeof(int), 1, fp);  
    fread(&num_neuron_C5, sizeof(int), 1, fp);  
    fread(&num_neuron_output, sizeof(int), 1, fp);  
  
    fread(weight_C1, sizeof(weight_C1), 1, fp);  
    fread(bias_C1, sizeof(bias_C1), 1, fp);  
    fread(weight_S2, sizeof(weight_S2), 1, fp);  
    fread(bias_S2, sizeof(bias_S2), 1, fp);  
    fread(weight_C3, sizeof(weight_C3), 1, fp);  
    fread(bias_C3, sizeof(bias_C3), 1, fp);  
    fread(weight_S4, sizeof(weight_S4), 1, fp);  
    fread(bias_S4, sizeof(bias_S4), 1, fp);  
    fread(weight_C5, sizeof(weight_C5), 1, fp);  
    fread(bias_C5, sizeof(bias_C5), 1, fp);  
    fread(weight_output, sizeof(weight_output), 1, fp);  
    fread(bias_output, sizeof(bias_output), 1, fp);  
  
    fclose(fp);
  
    out2wi_S2.clear();  
    out2bias_S2.clear();  
    out2wi_S4.clear();  
    out2bias_S4.clear();  
  
    calc_out2wi(width_image_C1_CNN, height_image_C1_CNN, width_image_S2_CNN, height_image_S2_CNN, num_map_S2_CNN, out2wi_S2);  
    calc_out2bias(width_image_S2_CNN, height_image_S2_CNN, num_map_S2_CNN, out2bias_S2);  
    calc_out2wi(width_image_C3_CNN, height_image_C3_CNN, width_image_S4_CNN, height_image_S4_CNN, num_map_S4_CNN, out2wi_S4);  
    calc_out2bias(width_image_S4_CNN, height_image_S4_CNN, num_map_S4_CNN, out2bias_S4);  
  
    return true;  
}  
  
bool CNN::saveModelFile(const char* name)  
{  
    FILE* fp = fopen(name, "wb");  
    if (fp == NULL) {  
        return false;  
    }  
  
    int width_image_input = width_image_input_CNN;  
    int height_image_input = height_image_input_CNN;  
    int width_image_C1 = width_image_C1_CNN;  
    int height_image_C1 = height_image_C1_CNN;  
    int width_image_S2 = width_image_S2_CNN;  
    int height_image_S2 = height_image_S2_CNN;  
    int width_image_C3 = width_image_C3_CNN;  
    int height_image_C3 = height_image_C3_CNN;  
    int width_image_S4 = width_image_S4_CNN;  
    int height_image_S4 = height_image_S4_CNN;  
    int width_image_C5 = width_image_C5_CNN;  
    int height_image_C5 = height_image_C5_CNN;  
    int width_image_output = width_image_output_CNN;  
    int height_image_output = height_image_output_CNN;  
  
    int width_kernel_conv = width_kernel_conv_CNN;  
    int height_kernel_conv = height_kernel_conv_CNN;  
    int width_kernel_pooling = width_kernel_pooling_CNN;  
    int height_kernel_pooling = height_kernel_pooling_CNN;  
  
    int num_map_input = num_map_input_CNN;  
    int num_map_C1 = num_map_C1_CNN;  
    int num_map_S2 = num_map_S2_CNN;  
    int num_map_C3 = num_map_C3_CNN;  
    int num_map_S4 = num_map_S4_CNN;  
    int num_map_C5 = num_map_C5_CNN;  
    int num_map_output = num_map_output_CNN;  
  
    int len_weight_C1 = len_weight_C1_CNN;  
    int len_bias_C1 = len_bias_C1_CNN;  
    int len_weight_S2 = len_weight_S2_CNN;  
    int len_bias_S2 = len_bias_S2_CNN;  
    int len_weight_C3 = len_weight_C3_CNN;  
    int len_bias_C3 = len_bias_C3_CNN;  
    int len_weight_S4 = len_weight_S4_CNN;  
    int len_bias_S4 = len_bias_S4_CNN;  
    int len_weight_C5 = len_weight_C5_CNN;  
    int len_bias_C5 = len_bias_C5_CNN;  
    int len_weight_output = len_weight_output_CNN;  
    int len_bias_output = len_bias_output_CNN;  
  
    int num_neuron_input = num_neuron_input_CNN;  
    int num_neuron_C1 = num_neuron_C1_CNN;  
    int num_neuron_S2 = num_neuron_S2_CNN;  
    int num_neuron_C3 = num_neuron_C3_CNN;  
    int num_neuron_S4 = num_neuron_S4_CNN;  
    int num_neuron_C5 = num_neuron_C5_CNN;  
    int num_neuron_output = num_neuron_output_CNN;  
  
    fwrite(&width_image_input, sizeof(int), 1, fp);  
    fwrite(&height_image_input, sizeof(int), 1, fp);  
    fwrite(&width_image_C1, sizeof(int), 1, fp);  
    fwrite(&height_image_C1, sizeof(int), 1, fp);  
    fwrite(&width_image_S2, sizeof(int), 1, fp);  
    fwrite(&height_image_S2, sizeof(int), 1, fp);  
    fwrite(&width_image_C3, sizeof(int), 1, fp);  
    fwrite(&height_image_C3, sizeof(int), 1, fp);  
    fwrite(&width_image_S4, sizeof(int), 1, fp);  
    fwrite(&height_image_S4, sizeof(int), 1, fp);  
    fwrite(&width_image_C5, sizeof(int), 1, fp);  
    fwrite(&height_image_C5, sizeof(int), 1, fp);  
    fwrite(&width_image_output, sizeof(int), 1, fp);  
    fwrite(&height_image_output, sizeof(int), 1, fp);  
  
    fwrite(&width_kernel_conv, sizeof(int), 1, fp);  
    fwrite(&height_kernel_conv, sizeof(int), 1, fp);  
    fwrite(&width_kernel_pooling, sizeof(int), 1, fp);  
    fwrite(&height_kernel_pooling, sizeof(int), 1, fp);  
  
    fwrite(&num_map_input, sizeof(int), 1, fp);  
    fwrite(&num_map_C1, sizeof(int), 1, fp);  
    fwrite(&num_map_S2, sizeof(int), 1, fp);  
    fwrite(&num_map_C3, sizeof(int), 1, fp);  
    fwrite(&num_map_S4, sizeof(int), 1, fp);  
    fwrite(&num_map_C5, sizeof(int), 1, fp);  
    fwrite(&num_map_output, sizeof(int), 1, fp);  
  
    fwrite(&len_weight_C1, sizeof(int), 1, fp);  
    fwrite(&len_bias_C1, sizeof(int), 1, fp);  
    fwrite(&len_weight_S2, sizeof(int), 1, fp);  
    fwrite(&len_bias_S2, sizeof(int), 1, fp);  
    fwrite(&len_weight_C3, sizeof(int), 1, fp);  
    fwrite(&len_bias_C3, sizeof(int), 1, fp);  
    fwrite(&len_weight_S4, sizeof(int), 1, fp);  
    fwrite(&len_bias_S4, sizeof(int), 1, fp);  
    fwrite(&len_weight_C5, sizeof(int), 1, fp);  
    fwrite(&len_bias_C5, sizeof(int), 1, fp);  
    fwrite(&len_weight_output, sizeof(int), 1, fp);  
    fwrite(&len_bias_output, sizeof(int), 1, fp);  
  
    fwrite(&num_neuron_input, sizeof(int), 1, fp);  
    fwrite(&num_neuron_C1, sizeof(int), 1, fp);  
    fwrite(&num_neuron_S2, sizeof(int), 1, fp);  
    fwrite(&num_neuron_C3, sizeof(int), 1, fp);  
    fwrite(&num_neuron_S4, sizeof(int), 1, fp);  
    fwrite(&num_neuron_C5, sizeof(int), 1, fp);  
    fwrite(&num_neuron_output, sizeof(int), 1, fp);  
  
    fwrite(weight_C1, sizeof(weight_C1), 1, fp);  
    fwrite(bias_C1, sizeof(bias_C1), 1, fp);  
    fwrite(weight_S2, sizeof(weight_S2), 1, fp);  
    fwrite(bias_S2, sizeof(bias_S2), 1, fp);  
    fwrite(weight_C3, sizeof(weight_C3), 1, fp);  
    fwrite(bias_C3, sizeof(bias_C3), 1, fp);  
    fwrite(weight_S4, sizeof(weight_S4), 1, fp);  
    fwrite(bias_S4, sizeof(bias_S4), 1, fp);  
    fwrite(weight_C5, sizeof(weight_C5), 1, fp);  
    fwrite(bias_C5, sizeof(bias_C5), 1, fp);  
    fwrite(weight_output, sizeof(weight_output), 1, fp);  
    fwrite(bias_output, sizeof(bias_output), 1, fp);  
  
    fflush(fp);  
    fclose(fp);  
  
    return true;  
}  
  
float CNN::test()  
{  
    int count_accuracy = 0;  
  
    for (int num = 0; num < num_patterns_test_CNN; num++) {  
        data_single_image = data_input_test + num * num_neuron_input_CNN;  
        data_single_label = data_output_test + num * num_neuron_output_CNN;  
  
        Forward_C1();  
        Forward_S2();  
        Forward_C3();  
        Forward_S4();  
        Forward_C5();  
        Forward_output();  
  
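        // pos_y: index of the largest network output (the prediction);
        // pos_t: index of the largest label value (the ground truth).
        int pos_t = -1;
        int pos_y = -2;
        float max_value_t = -9999.0;
        float max_value_y = -9999.0;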
  
        for (int i = 0; i < num_neuron_output_CNN; i++) {  
            if (neuron_output[i] > max_value_y) {  
                max_value_y = neuron_output[i];  
                pos_y = i;  
            }  
  
            if (data_single_label[i] > max_value_t) {  
                max_value_t = data_single_label[i];  
                pos_t = i;  
            }  
        }  
  
        if (pos_y == pos_t) {  
            ++count_accuracy;  
        }  
  
        Sleep(1);  // Windows-only API (<windows.h>): pauses for about 1 ms or more per test sample
    }  
  
    //std::cout << "count_accuracy: " << count_accuracy << std::endl;  
    return (count_accuracy * 1.0 / num_patterns_test_CNN);  
}  
  
}
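
The "accumulate dw" loops in Backward_C3 and Backward_input rely on two helpers, get_index and dot_product, whose definitions are not part of this listing. The sketch below is one plausible reading of them, not the author's code: the bodies of both helpers and the function name conv_weight_gradient are assumptions made for illustration. It computes the weight gradient of a stride-1 convolution for a single input map and a single output map, following the same loop structure as the listing.

```cpp
#include <cassert>
#include <vector>

// Assumed helper: flat offset of element (x, y) in map `channel`
// of a buffer laid out as [depth][height][width].
inline int get_index(int x, int y, int channel, int width, int height, int depth)
{
    assert(x >= 0 && x < width);
    assert(y >= 0 && y < height);
    assert(channel >= 0 && channel < depth);
    return (height * channel + y) * width + x;
}

// Assumed helper: sum of element-wise products of two float runs of length `len`.
inline float dot_product(const float* a, const float* b, int len)
{
    float sum = 0.0f;
    for (int i = 0; i < len; i++) {
        sum += a[i] * b[i];
    }
    return sum;
}

// Hypothetical function: weight gradient for one input map and one output map.
// dw[wy][wx] = sum over (y, x) of in[y + wy][x + wx] * delta[y][x]
inline std::vector<float> conv_weight_gradient(const std::vector<float>& in, int in_w, int in_h,
                                               const std::vector<float>& delta, int k_w, int k_h)
{
    const int out_w = in_w - k_w + 1;  // output size of a stride-1 convolution without padding
    const int out_h = in_h - k_h + 1;
    assert(static_cast<int>(in.size()) == in_w * in_h);
    assert(static_cast<int>(delta.size()) == out_w * out_h);

    std::vector<float> dw(k_w * k_h, 0.0f);

    for (int wy = 0; wy < k_h; wy++) {
        for (int wx = 0; wx < k_w; wx++) {
            // Top-left corner of the input window that this kernel element sees.
            const float* prevo = &in[0] + get_index(wx, wy, 0, in_w, in_h, 1);
            float dst = 0.0f;

            // One dot product per output row: an input row against the matching delta row.
            for (int y = 0; y < out_h; y++) {
                dst += dot_product(prevo + y * in_w, &delta[0] + y * out_w, out_w);
            }

            dw[get_index(wx, wy, 0, k_w, k_h, 1)] += dst;
        }
    }

    return dw;
}
```

For a 3*3 input holding 1 to 9, a 2*2 kernel and a 2*2 delta map of ones, each dw entry is simply the sum of the four input values under that kernel position: 12, 16, 24 and 28.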

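
The update rule in update_weights_bias looks like an AdaGrad step, but as written it divides each gradient by the square root of its own square rather than by an accumulated history. The sketch below reproduces only that arithmetic to show the consequence: the step size is about the learning rate whatever the gradient's magnitude, and only the sign matters. The constants kLearningRate and kEps and the name update_normalized are placeholders chosen for illustration; the real learning_rate_CNN and eps_CNN are defined elsewhere in the source.

```cpp
#include <cassert>
#include <cmath>

// Placeholder values, not the ones used by the original code.
const float kLearningRate = 0.01f;
const float kEps = 1e-8f;

// Same arithmetic as update_weights_bias in the listing above.
inline void update_normalized(const float* delta, float* weight, int len)
{
    for (int i = 0; i < len; i++) {
        float magnitude = std::sqrt(delta[i] * delta[i]);  // equals |delta[i]|
        // delta / (|delta| + eps) is close to +1 or -1, or exactly 0 for a zero gradient.
        weight[i] -= kLearningRate * delta[i] / (magnitude + kEps);
    }
}
```

With gradients of 5.0, -0.001 and 0.0 and all weights starting at 1.0, the weights end up near 0.99, 1.01 and exactly 1.0: the large and the tiny gradient produce steps of the same size. If per-weight scaling by gradient history is wanted, the squared gradients would have to be summed across iterations in a separate buffer, which this listing does not do.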
[信号与系统]人体最关键的两个信号节点 comsci 系统
如果把人体看做是一个带生物磁场的导体,那么这个导体有两个很重要的节点,第一个在头部,中医的名称叫做百汇穴, 另外一个节点在腰部,中医的名称叫做命门如果要保护自己的脑部磁场不受到外界有害信号的攻击,最简单的
oracle 存储过程执行权限 daizj oracle 存储过程权限执行者调用者
在数据库系统中存储过程是必不可少的利器，存储过程是预先编译好的为实现一个复杂功能的一段Sql语句集合。它的优点我就不多说了，说一下我碰到的问题吧。我在项目开发的过程中需要用存储过程来实现一个功能，其中涉及到判断一张表是否已经建立，没有建立就由存储过程来建立这张表。 CREATE OR REPLACE PROCEDURE TestProc IS fla
为mysql数据库建立索引 dengkane mysql 性能索引
前些时候，一位颇高级的程序员居然问我什么叫做索引，令我感到十分的惊奇，我想这绝不会是沧海一粟，因为有成千上万的开发者（可能大部分是使用MySQL的）都没有受过有关数据库的正规培训，尽管他们都为客户做过一些开发，但却对如何为数据库建立适当的索引所知较少，因此我起了写一篇相关文章的念头。最普通的情况，是为出现在where子句的字段建一个索引。为方便讲述，我们先建立一个如下的表。
学习C语言常见误区如何看懂一个程序如何掌握一个程序以及几个小题目示例 dcj3sjt126com c 算法
如果看懂一个程序，分三步 1、流程 2、每个语句的功能 3、试数如何学习一些小算法的程序尝试自己去编程解决它，大部分人都自己无法解决如果解决不了就看答案关键是把答案看懂，这个是要花很大的精力，也是我们学习的重点看懂之后尝试自己去修改程序，并且知道修改之后程序的不同输出结果的含义照着答案去敲调试错误
centos6.3安装php5.4报错 dcj3sjt126com centos6
报错内容如下: Resolving Dependencies --> Running transaction check ---> Package php54w.x86_64 0:5.4.38-1.w6 will be installed --> Processing Dependency: php54w-common(x86-64) = 5.4.38-1.w6 for
JSONP请求 flyer0126 jsonp
使用jsonp不能发起POST请求。 It is not possible to make a JSONP POST request. JSONP works by creating a <script> tag that executes Javascript from a different domain; it is not pos
Spring Security（03）——核心类简介 234390216 Authentication
核心类简介目录 1.1 Authentication 1.2 SecurityContextHolder 1.3 AuthenticationManager和AuthenticationProvider 1.3.1 &nb
在CentOS上部署JAVA服务 java--hhf java jdk centos Java服务
本文将介绍如何在CentOS上运行Java Web服务，其中将包括如何搭建JAVA运行环境、如何开启端口号、如何使得服务在命令执行窗口关闭后依旧运行第一步：卸载旧Linux自带的JDK ①查看本机JDK版本 java -version 结果如下 java version "1.6.0"
oracle、sqlserver、mysql常用函数对比[to_char、to_number、to_date] ldzyz007 oracle mysql SQL Server
oracle &n
记Protocol Oriented Programming in Swift of WWDC 2015 ningandjin protocol WWDC 2015 Swift2.0
其实最先朋友让我就这个题目写篇文章的时候，我是拒绝的，因为觉得苹果就是在炒冷饭，把已经流行了数十年的OOP中的“面向接口编程”还拿来讲，看完整个Session之后呢，虽然还是觉得在炒冷饭，但是毕竟还是加了蛋的，有些东西还是值得说说的。通常谈到面向接口编程，其主要作用是把系统设计和具体实现分离开，让系统的每个部分都可以在不影响别的部分的情况下，改变自身的具体实现。接口的设计就反映了系统
搭建 CentOS 6 服务器(15) - Keepalived、HAProxy、LVS rensanning keepalived
（一）Keepalived （1）安装 # cd /usr/local/src # wget http://www.keepalived.org/software/keepalived-1.2.15.tar.gz # tar zxvf keepalived-1.2.15.tar.gz # cd keepalived-1.2.15 # ./configure # make &a
ORACLE数据库SCN和时间的互相转换 tomcat_oracle oracle sql
SCN（System Change Number 简称 SCN）是当Oracle数据库更新后，由DBMS自动维护去累积递增的一个数字，可以理解成ORACLE数据库的时间戳，从ORACLE 10G开始，提供了函数可以实现SCN和时间进行相互转换；　　用途：在进行数据库的还原和利用数据库的闪回功能时，进行SCN和时间的转换就变的非常必要了；　　操作方法：　　1、通过dbms_f
Spring MVC 方法注解拦截器 xp9802 spring mvc
应用场景，在方法级别对本次调用进行鉴权，如api接口中有个用户唯一标示accessToken,对于有accessToken的每次请求可以在方法加一个拦截器，获得本次请求的用户，存放到request或者session域。 python中，之前在python flask中可以使用装饰器来对方法进行预处理，进行权限处理先看一个实例,使用@access_required拦截： ?