Leo_812

Face Alignment by Explicit Shape Regression（ESR）源码解析

最近在研究人脸对齐，Joint Cascade Face Detection and Alignment（ECCV14）这篇文章感觉不错，将对齐和人脸检测同时做了，而且速度非常快，精度也很高。不过菜鸟一下子看不懂，所以就翻了一下之前的文章，发现这些算法都是一点一点进化过来的。之前作者发表过Face Alignment at 3000 FPS via Regressing Local Binary Features,再之前CVPR2012的时候他们在Face Alignment by Explicit Shape Regression中提出了shape index feature，这个特征在3000fps中也有使用。
先简单的介绍一下论文，之后会给出注释过的源码，只注释了训练部分，因为这个代码有一段时间了，看一下思路学习一下就好。新的方法实在太多，而且性能更优越，这里只是为后续工作打基础。

论文简介

Face Alignment by Shape Regression

作者使用了多级回归的方式得到特征点，在3000fps中也用到了多级回归。一共训练了10个强分类器，每个强分类器输出一个shape的更新参数，shape更新之后再重新生成新的特征，训练并进行一下次迭代。这10个强分类器每一个都是由500个蕨分类器组成，有点像随机森林。

Shape-indexed (image) features

在介绍蕨分类器之前要先介绍一下Shape-indexed features，蕨分类器和回归树不同的地方在于，回归树是根据最小均方差来产生特征。而蕨分类器是自己生成特征。这里用到的特征就是Shape-indexed features ，论文中提到了计算方法。具体解释会在源码中给出说明。
1. Project the regression target(vectorial delta shape) to a random direction to produce a scalar.
2. Among P 2 features, select a feature with highest cor- relation to the scalar.
3. Repeat steps 1. and 2. F times to obtain F features.
4. Construct a fern by F features with random thresholds.

Random Ferns

随机蕨算法比较早的应用是在TLD算法中，然后被使用在ESR这篇论文中。下面我找了几篇比较好的随机厥参考文献，感觉第一篇说的最简洁清晰，不过他说道的是分类问题，对于ESR中的回归问题其实也一样，统计落入一个分桶中的所有差值做平均，就可以得到回归值。linux下没有截图工具，之后再加一些图说明一下。
单个厥分离器的准确率较低，所以通过类似随机森林的方法多次取特征，多次采样，对结果做voting或者average就可以得到较好的结果。

参考文献：
http://blog.csdn.net/huangynn/article/details/51730076
http://blog.sciencenet.cn/blog-465130-964430.html
http://blog.csdn.net/stayfoolish_fan/article/details/50506906
http://blog.csdn.net/stayfoolish_fan/article/details/50455359

工程以及训练样本我之后会放在github上，对原论文做了些修改去掉了论文里提到的similarity transform，可能精度会有一定下降，不过肉眼看不出，代码会清晰一些。

TrainDemo.cpp

函数入口，设置了一些基本的参数，加载数据，然后开始训练。

#include "FaceAlignment.h"
using namespace std;
using namespace cv;

int main(){

    char s[10];
    sprintf(s,"%.4d",1);
    string ss(s);
    cout<int img_num = 130;     //1340
    int candidate_pixel_num = 400;
    int fern_pixel_num = 5;
    int first_level_num = 10;
    int second_level_num = 500; 
    int landmark_num = 29;
    int initial_number = 20;
    vector > images;
    vector bbox;

    //加载训练图片
    cout<<"Read images..."<for(int i = 0;i < img_num;i++){
        string image_name = "/home/f/FaceAlignment-master/FaceAlignment/COFW_Dataset/trainingImages/";
        image_name = image_name + to_string(i+1) + ".jpg";
        Mat_ temp = imread(image_name,0);
        images.push_back(temp);
    }

    // 读取bounding_box.  x,y,width,height,center_x,center_y
    vector bounding_box;
    ifstream fin;
    fin.open("/home/f/FaceAlignment-master/FaceAlignment/COFW_Dataset/boundingbox.txt");
    for(int i = 0;i < img_num;i++){
        BoundingBox temp;
        fin>>temp.start_x>>temp.start_y>>temp.width>>temp.height;
        temp.centroid_x = temp.start_x + temp.width/2.0;
        temp.centroid_y = temp.start_y + temp.height/2.0;
        bounding_box.push_back(temp);
    }
    fin.close();

    // 读取特征点坐标
    vectordouble> > ground_truth_shapes;
    fin.open("/home/f/FaceAlignment-master/FaceAlignment/COFW_Dataset/keypoints.txt");
    for(int i = 0;i < img_num;i++){
        Mat_<double> temp(landmark_num,2);
        for(int j = 0;j < landmark_num;j++){
            fin>>temp(j,0); 
        }
        for(int j = 0;j < landmark_num;j++){
            fin>>temp(j,1); 
        }
        ground_truth_shapes.push_back(temp);
    }
    fin.close(); 
    //训练模型
    ShapeRegressor regressor;
    regressor.Train(images,ground_truth_shapes,bounding_box,first_level_num,second_level_num,
                    candidate_pixel_num,fern_pixel_num,initial_number);
    regressor.Save("/home/f/FaceAlignment-master/FaceAlignment/model.txt");
    return 0;
}

ShapeRegressor.cpp

这里也没有到真正的训练阶段，主要是预处理。

void ShapeRegressor::Train(const vector >& images,          //gray scale images
                   const vectordouble> >& ground_truth_shapes,    // a vector of N*2 matrix, where N is the number of landmarks
                   const vector& bounding_box,             // BoundingBox of faces
                   int first_level_num, int second_level_num,           // 10  500
                   int candidate_pixel_num, int fern_pixel_num,         // 400 5
                   int initial_num){                                    // 20 number of initial shapes for each input image
    cout<<"Start training..."<0].rows; 
    // data augmentation and multiple initialization 
    vector > augmented_images;
    vector augmented_bounding_box;
    vectordouble> > augmented_ground_truth_shapes;
    vectordouble> > current_shapes;       // 扩大之后的初始化shape，绝对坐标

    // 扩大训练数据
    RNG random_generator(getTickCount());
    for(int i = 0;i < images.size();i++){
        for(int j = 0;j < initial_num;j++){ //为每一幅图片产生initial_num个初始化shape
            int index = 0;
            do{
                index = random_generator.uniform(0, images.size());
            }while(index == i);
            augmented_images.push_back(images[i]);
            augmented_ground_truth_shapes.push_back(ground_truth_shapes[i]);
            augmented_bounding_box.push_back(bounding_box[i]); 
            // 1. Select ground truth shapes of other images as initial shapes
            // 2. Project current shape to bounding box of ground truth shapes
            // 绝对坐标点，包含了人脸位置信息，这里先转换为相对坐标，再通过bounding_box还原，去除绝对坐标的偏差
            Mat_<double> temp = ground_truth_shapes[index];
            temp = ProjectShape(temp, bounding_box[index]);
            temp = ReProjectShape(temp, bounding_box[i]);
            current_shapes.push_back(temp); 
        } 
    }

    // 求平均shape模型，结果保存为相对坐标
    // get mean shape from training shapes
    mean_shape_ = GetMeanShape(ground_truth_shapes,bounding_box); 

    // train fern cascades
    fern_cascades_.resize(first_level_num);
    vectordouble> > prediction;
    for(int i = 0;i < first_level_num;i++){
        cout<<"Training fern cascades: "<1<<" out of "<1, first_level_num);

        // update current shapes
        // 对每一副图片的形状进行更新,prediction[x] 中保存了n个特征点的位移更新量
        for(int j = 0;j < prediction.size();j++){
            current_shapes[j] = prediction[j] + ProjectShape(current_shapes[j], augmented_bounding_box[j]);
            current_shapes[j] = ReProjectShape(current_shapes[j],augmented_bounding_box[j]);
        }
    } 

}

FernCascade.cpp

这里开始进入正题，为每一幅图片生成400个特征点，然后放入到fern分类器中训练。生成特征点的部分稍微解释一下，从代码可以看出，就是先随机的生成400个随机点，这些点都在bounding_box里，然后计算每一个点距离shape特征点（29个point）的距离，找出距离最近的shape特征点的索引，那么这个特征点就是针对某一个shape特征点的特征。（特征有点乱，人脸特征点，我都在前面加了shape，特指人脸上的点）

vectordouble> > FernCascade::Train(const vector >& images,
                                    const vectordouble> >& current_shapes,
                                    const vectordouble> >& ground_truth_shapes,
                                    const vector & bounding_box,
                                    const Mat_<double>& mean_shape,
                                    int second_level_num,               //500
                                    int candidate_pixel_num,            //400
                                    int fern_pixel_num,                 //5
                                    int curr_level_num,
                                    int first_level_num){               //10
    Mat_<double> candidate_pixel_locations(candidate_pixel_num,2);      //特征点位置坐标（相对于mean_shape，75行）
    Mat_<int> nearest_landmark_index(candidate_pixel_num,1);            //特征点最近shape点的索引
    vectordouble> > regression_targets;                           //存放残差
    RNG random_generator(getTickCount());
    second_level_num_ = second_level_num;

    // calculate regression targets: the difference between ground truth shapes and current shapes
    // candidate_pixel_locations: the locations of candidate pixels, indexed relative to its nearest landmark on mean shape
    // 计算残差
    regression_targets.resize(current_shapes.size()); 
    for(int i = 0;i < current_shapes.size();i++){
        regression_targets[i] = ProjectShape(ground_truth_shapes[i],bounding_box[i]) 
                                - ProjectShape(current_shapes[i],bounding_box[i]);
    }

    // 生成 shape-indexed features 特征点
    // 在整张脸中生成400个随机点,并且找到和这400个随机点最近的shape点的索引
    for(int i = 0;i < candidate_pixel_num;i++){
        double x = random_generator.uniform(-1.0,1.0);
        double y = random_generator.uniform(-1.0,1.0);
        if(x*x + y*y > 1.0){    //x,y的值代表的相对坐标,这取值范围涵盖了整个boundingbox上的所有点
            i--;
            continue;
        }
        // find nearest landmark index
        double min_dist = 1e10;
        int min_index = 0;
        for(int j = 0;j < mean_shape.rows;j++){
            double temp = pow(mean_shape(j,0)-x,2.0) + pow(mean_shape(j,1)-y,2.0);
            if(temp < min_dist){
                min_dist = temp;
                min_index = j;
            }
        }
        candidate_pixel_locations(i,0) = x - mean_shape(min_index,0);       //
        candidate_pixel_locations(i,1) = y - mean_shape(min_index,1);
        nearest_landmark_index(i) = min_index;   
    }

    // for densities: each row is the pixel densities at each candidate pixels for an image
    // 求每幅图的400个特征点的特征值
    vector<vector<double> > densities;
    densities.resize(candidate_pixel_num);
    for(int i = 0;i < images.size();i++){
        Mat_<double> temp = ProjectShape(current_shapes[i],bounding_box[i]);
        for(int j = 0;j < candidate_pixel_num;j++){
            //这里不确定,应该是特征点相对于shape点的绝对坐标，但是计算方法有点奇怪，修改之后会偏移较大
            double project_x = (candidate_pixel_locations(j,0) + candidate_pixel_locations(j,1))*bounding_box[i].width/2.0;
            double project_y = (candidate_pixel_locations(j,0) + candidate_pixel_locations(j,1))*bounding_box[i].height/2.0;
            int index = nearest_landmark_index(j);
            int real_x = project_x + current_shapes[i](index,0);
            int real_y = project_y + current_shapes[i](index,1);
            //不能越界
            real_x = std::max(0.0,std::min((double)real_x,images[i].cols-1.0));
            real_y = std::max(0.0,std::min((double)real_y,images[i].rows-1.0));
            densities[j].push_back((int)images[i](real_y,real_x));  //j索引的是400个特征点的值,i索引的是所有图片
        }
    }

    //求 densities 的协方差   densities里面存储了所有训练图片的特征点
    // calculate the covariance between densities at each candidate pixels 
    Mat_<double> covariance(candidate_pixel_num,candidate_pixel_num);
    Mat_<double> mean;
    for(int i = 0;i < candidate_pixel_num;i++){
        for(int j = i;j< candidate_pixel_num;j++){
            double correlation_result = calculate_covariance(densities[i],densities[j]);
            covariance(i,j) = correlation_result;
            covariance(j,i) = correlation_result;
        }
    } 

    // train ferns
    // 训练蕨分类器，每个蕨分类器的输出对n个shape点的坐标做修正
    vectordouble> > prediction;
    prediction.resize(regression_targets.size());
    for(int i = 0;i < regression_targets.size();i++){
        prediction[i] = Mat::zeros(mean_shape.rows,2,CV_64FC1); 
    } 
    ferns_.resize(second_level_num);
    clock_t t = clock();
    for(int i = 0;i < second_level_num;i++){
        vectordouble> > temp = ferns_[i].Train(densities,covariance,candidate_pixel_locations,nearest_landmark_index,regression_targets,fern_pixel_num);     
        // update regression targets
        for(int j = 0;j < temp.size();j++){
            prediction[j] = prediction[j] + temp[j];
            //boost？ 每次都根据残差修改训练参数，但是没有像adaboost修改样本权重
            regression_targets[j] = regression_targets[j] - temp[j];
        }  
        //打印训练时间
        if((i+1) % 50 == 0){
            cout<<"Fern cascades: "<< curr_level_num << " out of "<< first_level_num<<"; "; 
            cout<<"Ferns: "<1<<" out of "<double remaining_level_num= (first_level_num - curr_level_num) * 500 + second_level_num - i; 
            double time_remaining = 0.02 * double(clock() - t)  / CLOCKS_PER_SEC * remaining_level_num;
            cout<<"Expected remaining time: "
                << (int)time_remaining / 60<<"min "<<(int)time_remaining % 60 <<"s"<return prediction;    
}

Fern.cpp

通过计算协方差求出5对特征点，根据特征点的差值产生一个2^5的分桶，统计每个分桶中落入的shape和ground_truth shape差值，求平均之后就是某个分桶中的shape的更新值。

vectordouble> > Fern::Train(const vector<vector<double> >& candidate_pixel_intensity,     //特征点的特征值
                                  const Mat_<double>& covariance,                               //特征点之间的协方差（找出最有辨别力的特征点）
                                  const Mat_<double>& candidate_pixel_locations,                //特征点的坐标（相对坐标）
                                  const Mat_<int>& nearest_landmark_index,                      //特征点索引
                                  const vectordouble> >& regression_targets,              //残差
                                  int fern_pixel_num){                                          //有效特征对的数量5

    fern_pixel_num_ = fern_pixel_num;
    landmark_num_ = regression_targets[0].rows;
    selected_pixel_index_.create(fern_pixel_num,2);     //the index of selected pixels pairs in fern
    selected_pixel_locations_.create(fern_pixel_num,4); //the locations of selected pixel pairs stored in the format (x_1,y_1,x_2,y_2) for each row
    selected_nearest_landmark_index_.create(fern_pixel_num,2);
    int candidate_pixel_num = candidate_pixel_locations.rows;

    // select pixel pairs from candidate pixels, this selection is based on the correlation between pixel 
    // densities and regression targets
    // for details, please refer to "Face Alignment by Explicit Shape Regression" 
    // threshold_: thresholds for each pair of pixels in fern 

    threshold_.create(fern_pixel_num,1);
    // get a random direction
    RNG random_generator(getTickCount());
    for(int i = 0;i < fern_pixel_num;i++){
        Mat_<double> random_direction(landmark_num_ ,2);
        random_generator.fill(random_direction,RNG::UNIFORM,-1.1,1.1);
        normalize(random_direction,random_direction);
        vector<double> projection_result(regression_targets.size(), 0); 
        // project regression targets along the random direction
        // 将regression targets 向随机方向投影
        for(int j = 0;j < regression_targets.size();j++){
            double temp = 0;
            temp = sum(regression_targets[j].mul(random_direction))[0]; 
            projection_result[j] = temp;    //随机方向的投影
        } 
        Mat_<double> covariance_projection_density(candidate_pixel_num,1);
        // 求随机方向投影和特征点的协方差
        for(int j = 0;j < candidate_pixel_num;j++){
            covariance_projection_density(j) = calculate_covariance(projection_result,candidate_pixel_intensity[j]);
        }
        // find max correlation
        // 找到方差最大的特征点
        double max_correlation = -1;
        int max_pixel_index_1 = 0;
        int max_pixel_index_2 = 0;
        for(int j = 0;j < candidate_pixel_num;j++){
            for(int k = 0;k < candidate_pixel_num;k++){
                double temp1 = covariance(j,j) + covariance(k,k) - 2*covariance(j,k);
                if(abs(temp1) < 1e-10){
                    continue;
                }
                bool flag = false;
                //???
                for(int p = 0;p < i;p++){
                    if(j == selected_pixel_index_(p,0) && k == selected_pixel_index_(p,1)){
                        flag = true;
                        break; 
                    }else if(j == selected_pixel_index_(p,1) && k == selected_pixel_index_(p,0)){
                        flag = true;
                        break;
                    } 
                }
                if(flag){
                    continue;
                } 
                double temp = (covariance_projection_density(j) - covariance_projection_density(k))
                    / sqrt(temp1);
                if(abs(temp) > max_correlation){
                    max_correlation = temp;
                    max_pixel_index_1 = j;
                    max_pixel_index_2 = k;
                }
            }
        }

        selected_pixel_index_(i,0) = max_pixel_index_1; 
        selected_pixel_index_(i,1) = max_pixel_index_2; 
        selected_pixel_locations_(i,0) = candidate_pixel_locations(max_pixel_index_1,0);
        selected_pixel_locations_(i,1) = candidate_pixel_locations(max_pixel_index_1,1);
        selected_pixel_locations_(i,2) = candidate_pixel_locations(max_pixel_index_2,0);
        selected_pixel_locations_(i,3) = candidate_pixel_locations(max_pixel_index_2,1);
        selected_nearest_landmark_index_(i,0) = nearest_landmark_index(max_pixel_index_1); 
        selected_nearest_landmark_index_(i,1) = nearest_landmark_index(max_pixel_index_2); 

        // get threshold for this pair
        double max_diff = -1;
        for(int j = 0;j < candidate_pixel_intensity[max_pixel_index_1].size();j++){
            double temp = candidate_pixel_intensity[max_pixel_index_1][j] - candidate_pixel_intensity[max_pixel_index_2][j];
            if(abs(temp) > max_diff){
                max_diff = abs(temp);
            }
        }
        threshold_(i) = random_generator.uniform(-0.2*max_diff,0.2*max_diff); 
    } 

    // determine the bins of each shape
    // 5个bit的分桶，统计落入每一个分桶的索引
    vector<vector<int> > shapes_in_bin;
    int bin_num = pow(2.0,fern_pixel_num);
    shapes_in_bin.resize(bin_num);
    for(int i = 0;i < regression_targets.size();i++){
        int index = 0;
        for(int j = 0;j < fern_pixel_num;j++){
            double density_1 = candidate_pixel_intensity[selected_pixel_index_(j,0)][i];
            double density_2 = candidate_pixel_intensity[selected_pixel_index_(j,1)][i];
            if(density_1 - density_2 >= threshold_(j)){
                index = index + pow(2.0,j);
            } 
        }
        shapes_in_bin[index].push_back(i);
    }

    // get bin output
    vectordouble> > prediction;
    prediction.resize(regression_targets.size());
    bin_output_.resize(bin_num);
    for(int i = 0;i < bin_num;i++){ //针对每一个分桶计算prediction[i]
        Mat_<double> temp = Mat::zeros(landmark_num_,2, CV_64FC1);
        int bin_size = shapes_in_bin[i].size();
        //求总的差值
        for(int j = 0;j < bin_size;j++){
            int index = shapes_in_bin[i][j];
            temp = temp + regression_targets[index]; 
        }
        if(bin_size == 0){
            bin_output_[i] = temp;
            continue; 
        }
        // 正则化，防止过拟合
        temp = (1.0/((1.0+1000.0/bin_size) * bin_size)) * temp;
        bin_output_[i] = temp;
        // 对每一个落入分桶中的shape的位移量进行更新
        for(int j = 0;j < bin_size;j++){
            int index = shapes_in_bin[i][j];
            prediction[index] = temp;
        }
    }
    return prediction;
}

通俗理解线性回归(Linear Regression) 小夏refresh 机器学习数据挖掘机器学习算法人工智能数据挖掘
线性回归,最简单的机器学习算法,当你看完这篇文章,你就会发现,线性回归是多么的简单.首先,什么是线性回归.简单的说,就是在坐标系中有很多点,线性回归的目的就是找到一条线使得这些点都在这条直线上或者直线的周围,这就是线性回归(LinearRegression).是不是有画面感了?那么我们上图片:![1.png][1]那么接下来,就让我们来看看具体的线性回归吧首先,我们以二维数据为例:我们有一组数据x
从0开始深度学习（4）——线性回归概念青石横刀策马从头学机器学习深度学习神经网络人工智能
1线性回归回归（regression）指能为一个或多个自变量与因变量之间的关系进行建模。1.1线性模型线性假设是指目标可以表示为特征的加权和，以房价和面积、房龄为例，可以有下面的式子：w称为权重（weight）b称为偏置（bias）、偏移量（offset）或截距（intercept）给定一个数据集，我们的目标是寻找模型的权重和偏置，使得根据模型做出的预测大体符合数据里的真实价格。1.2损失函数在我
python logistic regression_机器学习算法与Python实践之逻辑回归（Logistic Regression） weixin_39702649 python logistic regression
机器学习算法与Python实践这个系列主要是参考下载地址：https://bbs.pinggu.org/thread-2256090-1-1.html一、逻辑回归(LogisticRegression)Logisticregression(逻辑回归)是当前业界比较常用的机器学习方法，用于估计某种事物的可能性。之前在经典之作《数学之美》中也看到了它用于广告预测，也就是根据某广告被用户点击的可能性，把
python logistic模型_Python实践之逻辑回归（Logistic Regression） weixin_39922394 python logistic模型
机器学习算法与Python实践这个系列主要是参考《机器学习实战》这本书。因为自己想学习Python，然后也想对一些机器学习算法加深下了解，所以就想通过Python来实现几个比较常用的机器学习算法。恰好遇见这本同样定位的书籍，所以就参考这本书的过程来学习了。这节学习的是逻辑回归(LogisticRegression)，也算进入了比较正统的机器学习算法。啥叫正统呢？我概念里面机器学习算法一般是这样一个
Spark MLlib模型训练—回归算法 Random forest regression 不二人生 Spark ML 实战 spark-ml 回归随机森林
SparkMLlib模型训练—回归算法Randomforestregression随机森林回归(RandomForestRegression)是一种集成学习方法，通过结合多个决策树的预测结果来提升模型的准确性和稳健性。相较于单一的决策树模型，随机森林通过随机采样和多棵树的集成，减少了模型的方差，从而在处理复杂数据集时展现出更好的性能。本文将详细介绍随机森林回归的原理、实现方法、应用场景，并通过Sc
Spark MLlib模型训练—回归算法 GLR( Generalized Linear Regression) 猫猫姐 Spark 实战回归 spark-ml 线性回归 spark
SparkMLlib模型训练—回归算法GLR(GeneralizedLinearRegression)在大数据分析中，线性回归虽然常用，但在许多实际场景中，目标变量和特征之间的关系并非线性，这时广义线性回归（GeneralizedLinearRegression,GLR）便应运而生。GLR是线性回归的扩展，能够处理非正态分布的目标变量，广泛用于分类、回归以及其他统计建模任务。本文将深入探讨Spar
基于Python的机器学习系列（17）：梯度提升回归（Gradient Boosting Regression）会飞的Anthony 人工智能信息系统机器学习机器学习 python 回归
简介梯度提升（GradientBoosting）是一种强大的集成学习方法，类似于AdaBoost，但与其不同的是，梯度提升通过在每一步添加新的预测器来减少前一步预测器的残差。这种方法通过逐步改进模型，能够有效提高预测准确性。梯度提升回归的工作原理在梯度提升回归中，我们逐步添加预测器来修正模型的残差。以下是梯度提升的基本步骤：初始化模型：选择一个初始预测器h0(x)，计算该预测器的预测值。计算残差：
Datawhale X 李宏毅苹果书 AI夏令营｜机器学习基础之案例学习 Monyan 人工智能机器学习学习李宏毅深度学习
机器学习（MachineLearning,ML）：机器具有学习的能力，即让机器具备找一个函数的能力函数不同，机器学习的类别不同：回归（regression）：找到的函数的输出是一个数值或标量（scalar）。例如：机器学习预测某一个时间段内的PM2.5，机器要找到一个函数f，输入是跟PM2.5有关的的指数，输出是明天中午的PM2.5的值。分类（classification）：让机器做选择题，先准备
四十一、【人工智能】【机器学习】- Bayesian Logistic Regression算法模型暴躁的大熊人工智能人工智能机器学习算法
系列文章目录第一章【机器学习】初识机器学习第二章【机器学习】【监督学习】-逻辑回归算法(LogisticRegression)第三章【机器学习】【监督学习】-支持向量机(SVM)第四章【机器学习】【监督学习】-K-近邻算法(K-NN)第五章【机器学习】【监督学习】-决策树(DecisionTrees)第六章【机器学习】【监督学习】-梯度提升机(GradientBoostingMachine,GBM
regression机器学习回归预测模型参考学习后自我总结饮啦冰美式机器学习回归学习
简单来说，就是将样本的特征矩阵映射到样本标签空间。回归分析帮助我们理解在改变一个或多个自变量时，因变量的数值会如何变化。线性模型线性回归用于建立因变量和一个或多个自变量之间的线性关系模型。在线性回归中，假设因变量（被预测变量）与自变量（预测变量）之间存在着线性关系，也就是说，因变量的数值可以通过自变量的线性组合来预测。普通最小二乘线性回归。通过最小化实际观测值与模型预测值之间的误差平方和，可以找到
（Ridge， Lasso） Regression 王金松
岭回归岭回归的损失函数MSE+L2岭回归还是多元线性回归y=wTx只不过损失函数MSE添加了损失项w越小越好？因为为了提高模型的泛化能力（容错能力），w越小越好因为如果x1有错，w越小，对y的影响越小但是w为0没意义，所以w要适当保证准确率的情况下提高泛化能力和容错能力多元线性回归通过MSE（最小二乘leastsquares）保证正确率但是我们还需要模型提高泛化能力提高泛化能力min((y-y_h
GEE：CART（Classification and Regression Trees）回归教程（样本点、特征添加、训练、精度、参数优化） _养乐多_ GEE遥感图像处理教程回归 GEE javascript 云计算遥感图像处理
作者：CSDN@_养乐多_对于分类问题，这个输出通常是一个类别标签，而对于回归问题，输出通常是一个连续的数值。回归可以应用于多种场景，包括预测土壤PH值、土壤有机碳、土壤水分、碳密度、生物量、气温、海冰厚度、不透水面积百分比、植被覆盖度等。本文将介绍在GoogleEarthEngine（GEE）平台上进行CART（ClassificationandRegressionTrees）回归的方法和代码，
[论文精读]Intelligence Quotient Scores Prediction in rs-fMRI via Graph Convolutional Regression Network 夏莉莉iy 论文精读人工智能机器学习深度学习计算机视觉学习笔记图论
论文网址：IntelligenceQuotientScoresPredictioninrs-fMRIviaGraphConvolutionalRegressionNetwork|SpringerLink英文是纯手打的！论文原文的summarizingandparaphrasing。可能会出现难以避免的拼写错误和语法错误，若有发现欢迎评论指正！文章偏向于笔记，谨慎食用！目录1.省流版1.1.心得1.
最小二乘法的计算复杂度Computational complexity of least square regression operation 知识在于积累数学大类专栏最小二乘法算法
https://math.stackexchange.com/questions/84495/computational-complexity-of-least-square-regression-operationhttps://courses.grainger.illinois.edu/cs357/fa2021/notes/ref-17-least-squares.html
使用Logistic Regression进行文本分类 bitcarmanlee text classifier Logistic Regression 文本分类
1.文本格式sentence,label游戏太坑，暴率太低，太克金，平民不能玩,negative让人失望,negative能解决一下服务器问题？网络正常老掉线，换手机也一样。。。,negative期待,positive一星也不想给，这特么简直龟速，炫舞老年版？,negative衣服不好看游戏内容无特色，界面乱糟糟的,negative喜欢喜欢,positive从有了这个手游就一直玩，很喜欢呀，希望更
python基础学习-多元回归（Multiple Regression） Jiang_Immortals python 学习开发语言
多元回归就像线性回归一样，但是具有多个独立值，这意味着我们试图基于两个或多个变量来预测一个值。请看下面的数据集，其中包含了一些有关汽车的信息。CarModelVolumeWeightCO2ToyotaAygo100079099MitsubishiSpaceStar1200116095SkodaCitigo100092995Fiat50090086590MiniCooper15001140105VW
【机器学习笔记】回归算法住在天上的云机器学习笔记回归线性回归人工智能
回归算法文章目录回归算法1线性回归2损失函数3多元线性回归4线性回归的相关系数1线性回归回归分析(Regression)回归分析是描述变量间关系的一种统计分析方法例：在线教育场景因变量Y：在线学习课程满意度自变量X：平台交互性、教学资源、课程设计预测性的建模技术，通常用于预测分析，预测的结果多为连续值（也可为离散值，二值）线性回归(Linearregression)因变量和自变量之间是线性关系，就
机器学习算法之逻辑回归算法（Logistic Regression）迎风斯黄数学建模美赛机器学习算法回归
逻辑回归算法是一种用于分类问题的经典机器学习算法。虽然它的名字中带有“回归”，但实际上逻辑回归用于解决分类问题，特别是二分类问题。本篇博文将详细介绍逻辑回归算法的工作原理、应用领域以及Python示例。算法背景逻辑回归起源于20世纪初，用于分析生存率数据。随后，它被广泛应用于医学、社会科学、经济学和工程学等领域。在机器学习中，逻辑回归通常用于解决以下问题：信用评分垃圾邮件分类疾病诊断用户流失预测金
sklearn之模型评估指标总结归纳 lzw2016 机器学习 Python学习 sklearn 模型评估指标归纳总结
文章目录机器学习模型评估分类模型回归模型聚类模型交叉验证中指定scoring参数网格搜索中应用机器学习模型评估以下方法，sklearn中都在sklearn.metrics类下，务必记住哪些指标适合分类，那些适合回归，不能混着用分类的模型大多是Classifier结尾，回归是Regression分类模型accuracy_score（准确率得分）是模型分类正确的数据除以样本总数【模型的score方法算
【PSA】《Polarized Self-Attention: Towards High-quality Pixel-wise Regression》 bryant_meng CNN /Transformer 人工智能深度学习 PSA polarized attention
arXiv-2020文章目录1BackgroundandMotivation2RelatedWork3Advantages/Contributions4Method5Experiments5.1DatasetsandMetrics5.2PSAvs.Baselines5.3SemanticSegmentation5.4AblationStudy6Conclusion（own）1Backgrounda
深度学习入门笔记（6）—— Logistic Regression cnhwl 深度学习入门笔记深度学习机器学习逻辑回归人工智能 python
对比第三节中的Adaline和LogisticRegression，可以发现它们只有两点不同：1、激活函数，Adaline中的激活函数是恒等函数（线性），而LogisticRegression中的激活函数是Sigmoid函数（非线性）；2、损失函数，Adaline中的损失函数是均方误差，而LogisticRegression中的损失函数则是交叉熵。Sigmoid函数如图所示，其值域为0到1，输入为
linear_regression_2.ipynb fallinmix
{"cells":[{"cell_type":"code","execution_count":5,"metadata":{},"outputs":[],"source":["%matplotlibinline\n","%reload_extautoreload\n","%autoreload1\n","%aimportd2lzh_pytorch\n","importtorch\n","impor
XGBoost和LightGBM的参数以及调参噶噶~ 机器学习
一、XGBoost参数解释XGBoost的参数一共分为三类：通用参数：宏观函数控制。Booster参数：控制每一步的booster(tree/regression)。booster参数一般可以调控模型的效果和计算代价。我们所说的调参，很这是大程度上都是在调整booster参数。学习目标参数：控制训练目标的表现。我们对于问题的划分主要体现在学习目标参数上。比如我们要做分类还是回归，做二分类还是多分类
机器学习：Softmax回归（Python）捕捉一只Diu 机器学习回归 python 笔记
Softmax回归（多分类）logistic_regression_mulclass.pyimportnumpyasnpimportmatplotlib.pyplotaspltclassLogisticRegression_MulClass:"""逻辑回归，采用梯度下降算法+正则化，交叉熵损失函数，实现多分类，Softmax函数"""def__init__(self,fit_intercept=T
2020李宏毅学习笔记——1.概论是汤圆啊
即将学习内容分布：一、机器学习的本质就是自动寻找函式，如语音识别，就是让机器找一个函数，输入是声音信号，输出是对应的文字。如下棋，就是让机器找一个函数，输入是当前棋盘上黑子白子的位置，输出是下一步应该落子何处。例如二、寻找什么样子的函数式1.regression（回归）：输出是数值。如房价、PM2.5预测。Theoutputofthefunctionisascalar.:函数的输出是一个数值例如：
PyTorch RNN Regression Jancd
循环神经网络RNN及时预测时间序列.更多可以查看官网:*PyTorch官网载入数据假设想要用sin的曲线预测出cos的曲线.imageimporttorchfromtorchimportnnfromtorch.autogradimportVariableimportnumpyasnpimportmatplotlib.pyplotasplttorch.manual_seed(1)#reproduc
机器学习：Logistic回归（Python）捕捉一只Diu 机器学习 python 人工智能笔记逻辑回归
Logistic回归（二分类）logistic_regression_class2.pyimportnumpyasnpimportmatplotlib.pyplotaspltclassLogisticRegression:"""逻辑回归，采用梯度下降算法+正则化，交叉熵损失函数，实现二分类"""def__init__(self,fit_intercept=True,normalize=True,a
机器学习：正则化（Python）捕捉一只Diu 机器学习 python 笔记线性回归
regularization_linear_regression.pyimportnumpyasnpimportmatplotlib.pyplotaspltclassRegularizationLinearRegression:"""线性回归+正则化，梯度下降法+闭式解求解模型系数1、数据的预处理：是否训练偏置项fit_intercept（默认True），是否标准化normalized（默认Tru
深度学习 Day 4.2 Logistic Regression——Discriminative Model 闻.铃深度学习 python 深度学习人工智能
目录1.FunctionSet设定公式2.GoodnessofaFunction损失函数3.Findthebestfunction梯度下降4.为何判断logisticregression模型的好坏，用交叉熵而不是SquareError:5.Multi-classClassification5.1用softmax来计算一个元素去到各个class的概率5.2把f(x)和y的分布用交叉熵来对比6.Fea
【PyTorch】深度学习实践之逻辑斯蒂回归 Logistic Regression zoetu #PyTorch深度学习实践深度学习 pytorch 回归
本文目录回归vs分类sigmoid函数损失函数例子课堂练习模型实现计算损失实现代码测试模型学习资料系列文章索引回归vs分类回归是预测数值分类是预测类别概率sigmoid函数LogisticFunction是最典型的sigmoid函数，因此有些书会直接说成sigmoid函数。实际上满足如下条件即可称为sigmoid函数：饱和函数单调递增存在极限损失函数使用二分类交叉熵公式：y=1，预测值接近1，lo
矩阵求逆（JAVA）初等行变换 qiuwanchi 矩阵求逆（JAVA）
package gaodai.matrix; import gaodai.determinant.DeterminantCalculation; import java.util.ArrayList; import java.util.List; import java.util.Scanner; /** * 矩阵求逆(初等行变换) * @author 邱万迟 *
JDK timer antlove java jdk schedule code timer
1.java.util.Timer.schedule(TimerTask task, long delay)：多长时间（毫秒）后执行任务 2.java.util.Timer.schedule(TimerTask task, Date time)：设定某个时间执行任务 3.java.util.Timer.schedule(TimerTask task, long delay,longperiod
JVM调优总结 -Xms -Xmx -Xmn -Xss coder_xpf jvm 应用服务器
堆大小设置JVM 中最大堆大小有三方面限制：相关操作系统的数据模型（32-bt还是64-bit）限制；系统的可用虚拟内存限制；系统的可用物理内存限制。32位系统下，一般限制在1.5G~2G；64为操作系统对内存无限制。我在Windows Server 2003 系统，3.5G物理内存，JDK5.0下测试，最大可设置为1478m。典型设置： java -Xmx
JDBC连接数据库 Array_06 jdbc
package Util; import java.sql.Connection; import java.sql.DriverManager; import java.sql.ResultSet; import java.sql.SQLException; import java.sql.Statement; public class JDBCUtil { //完
Unsupported major.minor version 51.0（jdk版本错误） oloz java
java.lang.UnsupportedClassVersionError: cn/support/cache/CacheType : Unsupported major.minor version 51.0 (unable to load class cn.support.cache.CacheType) at org.apache.catalina.loader.WebappClassL
用多个线程处理1个List集合 362217990 多线程 thread list 集合
昨天发了一个提问，启动5个线程将一个List中的内容，然后将5个线程的内容拼接起来，由于时间比较急迫，自己就写了一个Demo，希望对菜鸟有参考意义。。 import java.util.ArrayList; import java.util.List; import java.util.concurrent.CountDownLatch; public c
JSP简单访问数据库香水浓 sql mysql jsp
学习使用javaBean，代码很烂，仅为留个脚印 public class DBHelper { private String driverName; private String url; private String user; private String password; private Connection connection; privat
Flex4中使用组件添加柱状图、饼状图等图表 AdyZhang Flex
1.添加一个最简单的柱状图 ? 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 <?xml version= "1.0"&n
Android 5.0 - ProgressBar 进度条无法展示到按钮的前面 aijuans android
在低于SDK < 21 的版本中，ProgressBar 可以展示到按钮前面，并且为之在按钮的中间，但是切换到android 5.0后进度条ProgressBar 展示顺序变化了，按钮再前面，ProgressBar 在后面了我的xml配置文件如下： [html] view plain copy <RelativeLa
查询汇总的sql baalwolf sql
select list.listname, list.createtime,listcount from dream_list as list , (select listid,count(listid) as listcount from dream_list_user group by listid order by count(
Linux du命令和df命令区别 BigBird2012 linux
1，两者区别 du，disk usage,是通过搜索文件来计算每个文件的大小然后累加，du能看到的文件只是一些当前存在的，没有被删除的。他计算的大小就是当前他认为存在的所有文件大小的累加和。
AngularJS中的$apply，用还是不用？ bijian1013 JavaScript AngularJS $apply
在AngularJS开发中，何时应该调用$scope.$apply()，何时不应该调用。下面我们透彻地解释这个问题。但是首先，让我们把$apply转换成一种简化的形式。 scope.$apply就像一个懒惰的工人。它需要按照命
[Zookeeper学习笔记十]Zookeeper源代码分析之ClientCnxn数据序列化和反序列化 bit1129 zookeeper
ClientCnxn是Zookeeper客户端和Zookeeper服务器端进行通信和事件通知处理的主要类，它内部包含两个类，1. SendThread 2. EventThread， SendThread负责客户端和服务器端的数据通信，也包括事件信息的传输，EventThread主要在客户端回调注册的Watchers进行通知处理 ClientCnxn构造方法 &
【Java命令一】jmap bit1129 Java命令
jmap命令的用法： [hadoop@hadoop sbin]$ jmap Usage: jmap [option] <pid> (to connect to running process) jmap [option] <executable <core> (to connect to a
Apache 服务器安全防护及实战 ronin47
此文转自IBM. Apache 服务简介 Web 服务器也称为 WWW 服务器或 HTTP 服务器 (HTTP Server)，它是 Internet 上最常见也是使用最频繁的服务器之一，Web 服务器能够为用户提供网页浏览、论坛访问等等服务。由于用户在通过 Web 浏览器访问信息资源的过程中，无须再关心一些技术性的细节，而且界面非常友好，因而 Web 在 Internet 上一推出就得到
unity 3d实例化位置出现布置？ brotherlamp unity教程 unity unity资料 unity视频 unity自学
问：unity 3d实例化位置出现布置？答：实例化的同时就可以指定被实例化的物体的位置,即 position Instantiate (original : Object, position : Vector3, rotation : Quaternion) : Object 这样你不需要再用Transform.Position了, 如果你省略了第二个参数(
《重构，改善现有代码的设计》第八章 Duplicate Observed Data bylijinnan java 重构
import java.awt.Color; import java.awt.Container; import java.awt.FlowLayout; import java.awt.Label; import java.awt.TextField; import java.awt.event.FocusAdapter; import java.awt.event.FocusE
struts2更改struts.xml配置目录 chiangfai struts.xml
struts2默认是读取classes目录下的配置文件，要更改配置文件目录，比如放在WEB-INF下，路径应该写成../struts.xml(非/WEB-INF/struts.xml) web.xml文件修改如下： <filter> <filter-name>struts2</filter-name> <filter-class&g
redis做缓存时的一点优化 chenchao051 redis hadoop pipeline
最近集群上有个job，其中需要短时间内频繁访问缓存，大概7亿多次。我这边的缓存是使用redis来做的，问题就来了。首先，redis中存的是普通kv，没有考虑使用hash等解结构，那么以为着这个job需要访问7亿多次redis，导致效率低，且出现很多redi
mysql导出数据不输出标题行 daizj mysql 数据导出去掉第一行去掉标题
当想使用数据库中的某些数据，想将其导入到文件中，而想去掉第一行的标题是可以加上-N参数如通过下面命令导出数据： mysql -uuserName -ppasswd -hhost -Pport -Ddatabase -e " select * from tableName" > exportResult.txt 结果为： studentid
phpexcel导出excel表简单入门示例 dcj3sjt126com PHP Excel phpexcel
先下载PHPEXCEL类文件，放在class目录下面，然后新建一个index.php文件，内容如下 <?php error_reporting(E_ALL); ini_set('display_errors', TRUE); ini_set('display_startup_errors', TRUE); if (PHP_SAPI == 'cli') die('
爱情格言 dcj3sjt126com 格言
1) I love you not because of who you are, but because of who I am when I am with you. 　　我爱你，不是因为你是一个怎样的人，而是因为我喜欢与你在一起时的感觉。 　　2) No man or woman is worth your tears, and the one who is, won‘t
转 Activity 详解——Activity文档翻译 e200702084 android UI sqlite 配置管理网络应用
activity 展现在用户面前的经常是全屏窗口，你也可以将 activity 作为浮动窗口来使用（使用设置了 windowIsFloating 的主题），或者嵌入到其他的 activity （使用 ActivityGroup ）中。当用户离开 activity 时你可以在 onPause() 进行相应的操作。更重要的是，用户做的任何改变都应该在该点上提交 ( 经常提交到 ContentPro
win7安装MongoDB服务 geeksun mongodb
1. 下载MongoDB的windows版本：mongodb-win32-x86_64-2008plus-ssl-3.0.4.zip，Linux版本也在这里下载，下载地址： http://www.mongodb.org/downloads 2. 解压MongoDB在D:\server\mongodb, 在D:\server\mongodb下创建d
Javascript魔法方法:__defineGetter__,__defineSetter__ hongtoushizi js
转载自： http://www.blackglory.me/javascript-magic-method-definegetter-definesetter/ 在javascript的类中,可以用defineGetter和defineSetter_控制成员变量的Get和Set行为例如,在一个图书类中,我们自动为Book加上书名符号: function Book(name){
错误的日期格式可能导致走nginx proxy cache时不能进行304响应 jinnianshilongnian cache
昨天在整合某些系统的nginx配置时，出现了当使用nginx cache时无法返回304响应的情况，出问题的响应头： Content-Type:text/html; charset=gb2312 Date:Mon, 05 Jan 2015 01:58:05 GMT Expires:Mon , 05 Jan 15 02:03:00 GMT Last-Modified:Mon, 05
数据源架构模式之行数据入口 home198979 PHP 架构行数据入口
注：看不懂的请勿踩，此文章非针对java，java爱好者可直接略过。一、概念行数据入口（Row Data Gateway）：充当数据源中单条记录入口的对象，每行一个实例。二、简单实现行数据入口为了方便理解，还是先简单实现： <?php /** * 行数据入口类 */ class OrderGateway { /*定义元数
Linux各个目录的作用及内容 pda158 linux 脚本
1）根目录“/” 　　根目录位于目录结构的最顶层，用斜线（/）表示，类似于 Windows 操作系统的“C:\“，包含Fedora操作系统中所有的目录和文件。　　2）/bin 　　/bin 　　目录又称为二进制目录，包含了那些供系统管理员和普通用户使用的重要 linux命令的二进制映像。该目录存放的内容包括各种可执行文件，还有某些可执行文件的符号连接。常用的命令有：cp、d
ubuntu12.04上编译openjdk7 ol_beta HotSpot jvm jdk OpenJDK
获取源码从openjdk代码仓库获取(比较慢) 安装mercurial Mercurial是一个版本管理工具。 sudo apt-get install mercurial 将以下内容添加到$HOME/.hgrc文件中，如果没有则自己创建一个： [extensions] forest=/home/lichengwu/hgforest-crew/forest.py fe
将数据库字段转换成设计文档所需的字段 vipbooks 设计模式工作正则表达式
哈哈，出差这么久终于回来了，回家的感觉真好！ PowerDesigner的物理数据库一出来，设计文档中要改的字段就多得不计其数，如果要把PowerDesigner中的字段一个个Copy到设计文档中，那将会是一件非常痛苦的事情。