Liming07

【AlgorithmStar机器学习】AS机器学习库特征工程使用说明文档

Algorithm Star介绍

概述

AS库的一般处理流程

数据采集与清洗

向量生成与特征提取选择

机器学习

后续处理

Algorithm Star使用

数据类型-操作数

浮点类型操作数

整数类型操作数

复数

特征提取

字典特征提取

词频特征提取

特征选择

基于冗余排名比例去除

基于相关系数去除

机器学习

聚合计算

分类计算

差异计算

路径计算

数据预处理(标准化/归一化)

概率计算

决策计算

模型预测

Algorithm Star开源协议

Algorithm Star介绍

概述

Algorithm Star，简称“AS”，中文名为算法之星，由Ling Yu Zhao开发，是针对机器学习过程中的Java库，其具有良好的Java与Scala兼容性，于1.14版本开始重大改动，其中包含样本洗牌随机分布，特征选择，特征提取，度量计算，差异计算，路径推断，分类等计算组件，同时也具有运算符风格的复数，坐标，向量，矩阵等操作数，开发者Ling Yu Zhao于2022年发布的一款开源库，能够将复杂的机器学习简单化。

Algorithm Star采用apache2.0版本开源协议。支持通过maven坐标获取到框架本身，目前在GitHub中进行托管（GitHub - BeardedManZhao/algorithmStar: Toolkits for various algorithms, support vector computing and other functions, machine learning and mathematics, medicine, artificial intelligence and other fields of high practicality. | 用于各种算法、支持向量计算等功能，机器学习和数学、医学、人工智能等领域具有很高的实用性。）。

AS库的一般处理流程

数据采集与清洗

指的是数据的获取操作，在这一步获取到的数据是各种类型的原始数据，往往需要使用到各种大数据技术来采集到我们需要的数据，是整个工程中的第一项任务，在这一项任务中获取到的数据往往是具有噪音的数据，其中包含许多的冗余，因此要进行第二步，针对采集到的数据进行简单清洗。

数据清洗的方式有多种，例如去除空值与不合法的行等，使得其能够被按照预期转换，第一层过滤之后开始进行特征提取，将数据本身转换成为向量或矩阵等。

向量生成与特征提取选择

针对数据样本以及我们的目的和需求，对文本使用合适的特征提取操作，将获取到的数据转换成为AS中的向量/矩阵对象，使得其具有运算功能。

值得注意的是，多行数据的特征提取往往都是提取成为一个矩阵，那么矩阵中如果有一些没有特征的行向量，这将会影响最终的模型误差，因此对于这些行向量应进行特征选择操作，去除掉。

机器学习

作为整个特征工程中的核心操作，是需要频繁与AS库进行交互的一项任务，此任务中，需要涉及到AS库中的各种计算组件，通过这些计算组件达到最终的特征处理需求，目前库中有诸多函数，其中支持8种以上距离算法，3种以上的聚合算法，两种分类算法，5种差异算法等诸多机器学习算法，每一种算法在AS库中都是一个计算组件对象，且可以支持诸多操作组件，例如计算坐标之间的距离，计算矩阵中多个向量之间的类别等。

在计算过程中，AS库将遵循不拷贝原则，能不拷贝出一个新的对象就不拷贝，尽量在不修改原数据的基础上针对结果进行计算，当然，其中作为被计算矩阵与向量对象，允许用户直接获取到正在维护中的只读数组，也允许用户直接将正在维护中的数组拷贝出来一份新数据对象。

后续处理

在经过复杂的机器学习之后，我们应该会得到一种数据模型与数据结果，在这里的结果可以进行校验与后续的数据使用操作等，通过AS库，会使得Java中的机器学习不再困难与痛苦，多个重载函数的配合，使得Java中原生的基本数据类型也可以传递给计算组件进行机器学习的任务。

Algorithm Star使用

数据类型-操作数

操作数是AS库中的被计算对象，例如向量等，这类对象在AS库中能够实现加减乘除等基本运算，在AS库中每一个对象都在维护一个数组，这些数组在向量对象中的表现形式是一个基元的基本数据类型数组，因此在计算的时候具有良好的性能与原生支持。

操作数接口中的常用通用函数

Vector

`限定符和类型`	方法和说明
Object	clone`()`
`protected abstract` ArrayType	copyToNewArray`()`
`abstract` ImplementationType	expand`()`
`abstract int`	getNumberOfDimensions`()`
`abstract` ElementType	innerProduct`(`ImplementationType `vector)` 计算两个向量的内积，也称之为数量积，具体实现请参阅api说明 Calculate the inner product of two vectors, also known as the quantity product, please refer to the api node for the specific implementation
`abstract` ElementType	moduleLength`()` 计算该向量的模长，具体实现请参阅api说明 Calculate the modulo length of the vector, please refer to the api node for the specific implementation
`abstract` ImplementationType	multiply`(`ImplementationType `vector)` 两个向量相乘，同时也是两个向量的外积，具体实现请参阅api说明 The multiplication of two vectors is also the outer product of the two vectors.
`abstract` ImplementationType	shuffle`(long seed)` 将本对象中的所有数据进行洗牌打乱，随机分布数据行的排列。
String	toString`()`

ASVector

限定符和类型	方法和说明
`abstract` ImplementationType	add`(`ImplementationType `value, boolean ModifyCaller)` 在两个向量对象之间进行计算的函数，自从1.13版本开始支持该函数的调用，该函数中的计算并不会产生一个新的向量，而是将计算操作作用于原操作数中 The function that calculates between two vector objects supports the call of this function since version 1.13.
`abstract` ImplementationType	diff`(`ImplementationType `value, boolean ModifyCaller)` 在两个向量对象之间进行计算的函数，自从1.13版本开始支持该函数的调用，该函数中的计算并不会产生一个新的向量，而是将计算操作作用于原操作数中 The function that calculates between two vector objects supports the call of this function since version 1.13.
`abstract` ImplementationType	multiply`(`ImplementationType `value, boolean ModifyCaller)` 在两个向量对象之间进行计算的函数，自从1.13版本开始支持该函数的调用，该函数中的计算并不会产生一个新的向量，而是将计算操作作用于原操作数中 The function that calculates between two vector objects supports the call of this function since version 1.13.
`protected abstract void`	reFresh`()` 刷新操作数对象的所有字段
`abstract` ArrayType	toArray`()` 获取到本向量对象正在维护中的数组对象，注意，这里不会进行拷贝操作。

RangeVector

限定符和类型	方法和说明
`protected abstract` ArrayType	copyToNewArray`()`
`abstract` ImplementationType	expand`()`
`abstract void`	forEach`(`java.util.function.Consumer`<`ElementType`> action)` 区间内元素迭代器 Element iterator in interval
`abstract` ElementType	getRangeEnd`()`
`abstract` ElementType	getRangeStart`()`
`abstract` ElementType	getRangeSum`()`
`abstract double`	moduleLength`()` 计算该向量的模长，具体实现请参阅api说明 Calculate the modulo length of the vector, please refer to the api node for the specific implementation
`abstract` VectorType	shuffle`(long seed)` 将本对象中的所有数据进行洗牌打乱，随机分布数据行的排列。
`int`	size`()`
`abstract` VectorType	toVector`()` 将本区间的向量转换成具体向量。

Matrix

限定符和类型	方法和说明
ImplementationType	add`(`ImplementationType `value, boolean lock)` 在两个向量对象之间进行计算的函数，自从1.13版本开始支持该函数的调用，该函数中的计算并不会产生一个新的向量，而是将计算操作作用于原操作数中 The function that calculates between two vector objects supports the call of this function since version 1.13.
`abstract` ArrayType	copyToNewArray`()`
`abstract` ArraysType	copyToNewArrays`()` 该方法将会获取到矩阵中的二维数组，值得注意的是，在该函数中获取到的数组是一个新的数组，不会有任何的关系。
ImplementationType	diff`(`ImplementationType `value, boolean lock)` 在两个向量对象之间进行计算的函数，自从1.13版本开始支持该函数的调用，该函数中的计算并不会产生一个新的向量，而是将计算操作作用于原操作数中 The function that calculates between two vector objects supports the call of this function since version 1.13.
`abstract` ElementType	get`(int row, int col)` 获取到矩阵中指定坐标点的数值
`int`	getColCount`()`
`int`	getRowCount`()`
ElementType	innerProduct`(`ImplementationType `value, boolean lock)`
`boolean`	isUnlock`()`
ImplementationType	multiply`(`ImplementationType `value, boolean lock)` 在两个向量对象之间进行计算的函数，自从1.13版本开始支持该函数的调用，该函数中的计算并不会产生一个新的向量，而是将计算操作作用于原操作数中 The function that calculates between two vector objects supports the call of this function since version 1.13.
`abstract` ArrayType	toArray`()`
ArraysType	toArrays`()` 该方法将会获取到矩阵中的二维数组，注意，与toArray一样返回的是正在被维护的数组对象，建议作为只读变量使用。
String	toString`()`
`abstract` ImplementationType	transpose`()` 将现有矩阵的转置矩阵获取到 Get the transpose of an existing matrix into

NumberMatrix

限定符和类型

方法和说明

abstract ImplementationType

deleteRelatedDimensions(int index, double thresholdLeft, double thresholdRight)

删除与目标索引维度相关的所有行维度，并返回新矩阵对象。

abstract ImplementationType

featureSelection(double threshold)

去除冗余特征维度，将当前矩阵中的每一个维度都进行方差或无向差计算，并将过于稳定的冗余特征去除。

Coordinate

限定符和类型

方法和说明

ImplementationType

extend()

显式拓展到子类的函数

int

getNumberOfDimensions()

一般是获取到坐标对象中的维度

IntegerCoordinates

限定符和类型	方法和说明
ImplementationType	extend`()` 与父类函数的作用一样
`int`	getNumberOfDimensions`()` 与父类函数的作用一样
`int[]`	toArray`()` 获取到坐标的每一个维度的值组成的数组

FloatingPointCoordinates

限定符和类型	方法和说明
ImplementationType	extend`()` 与父类函数的作用一样
`int`	getNumberOfDimensions`()` 与父类函数的作用一样
`int[]`	toArray`()` 获取到坐标的每一个维度的值组成的数组

浮点类型操作数

能够通过一个浮点数组创建出来其对应的操作数对象，操作数对象中具有强大的计算功能。

package zhao.algorithmMagic;



import zhao.algorithmMagic.operands.matrix.DoubleMatrix;
import zhao.algorithmMagic.operands.vector.DoubleVector;

import java.util.Arrays;

public class MAIN1 {
    public static void main(String[] args) {
        // 构建出Java数组
        double[] ints1 = new double[]{1, 2, 3, 4, 5, 6};
        double[] ints2 = new double[]{10, 20, 30, 40, 50, 60};
        // 构建出整形向量
        DoubleVector parse1 = DoubleVector.parse(ints1);
        DoubleVector parse2 = DoubleVector.parse(ints2);
        // 对向量进行加减基本运算，并打印结果
        System.out.println(">>> 1: =========");
        System.out.println(parse2.add(parse1));
        System.out.println(parse2.diff(parse1));
        // 进行连减
        System.out.println(parse2.diff(parse1).diff(parse1).diff(parse1));

        // 进行内积与外积计算
        System.out.println(">>> 2: =========");
        System.out.println(parse2.innerProduct(parse1));
        System.out.println(parse2.multiply(parse1));

        // 将两个向量对象组合成为矩阵
        DoubleMatrix matrix = DoubleMatrix.parse(parse1, parse2);
        // 进行特征选择，在这里我们选择清理掉特征突出性较小排名中，前70% 的维度
        System.out.println(">>> 3: =========");
        DoubleMatrix integerMatrix = matrix.featureSelection(0.7);
        // 打印去除结果
        System.out.println(integerMatrix);

        // 获取到向量中的数组对象
        System.out.println(">>> 4: =========");
        double[] ints3 = parse1.copyToNewArray();
        double[] ints4 = parse2.copyToNewArray();
        double[] ints5 = parse1.toArray();
        double[] ints6 = parse2.toArray();
        // 修改 ints5 ints6 两个数组的数值
        // 会导致ints1 ints2 以及其所有对象发生变化，这是因为AS允许用户直接从对象中获取到数组
        ints5[1] = 1024;
        ints6[1] = 1024;
        System.out.println(Arrays.toString(ints1));
        System.out.println(Arrays.toString(ints2));
        // 而修改 ints3 ints4 则不会发生这种情况
        // 因此在需要对数组进行修改的时候，建议使用copyToNewArray
        ints3[1] = 2048;
        ints4[1] = 2048;
        System.out.println(Arrays.toString(ints1));
        System.out.println(Arrays.toString(ints2));
    }
}

整数类型操作数

package zhao.algorithmMagic;

import zhao.algorithmMagic.operands.matrix.IntegerMatrix;
import zhao.algorithmMagic.operands.vector.IntegerVector;

import java.util.Arrays;

public class MAIN1 {
    public static void main(String[] args) {
        // 构建出Java数组
        int[] ints1 = new int[]{1, 2, 3, 4, 5, 6};
        int[] ints2 = new int[]{10, 20, 30, 40, 50, 60};
        // 构建出整形向量
        IntegerVector parse1 = IntegerVector.parse(ints1);
        IntegerVector parse2 = IntegerVector.parse(ints2);
        // 对向量进行加减基本运算，并打印结果
        System.out.println(">>> 1: =========");
        System.out.println(parse2.add(parse1));
        System.out.println(parse2.diff(parse1));
        // 进行连减
        System.out.println(parse2.diff(parse1).diff(parse1).diff(parse1));

        // 进行内积与外积计算
        System.out.println(">>> 2: =========");
        System.out.println(parse2.innerProduct(parse1));
        System.out.println(parse2.multiply(parse1));

        // 将两个向量对象组合成为矩阵
        IntegerMatrix matrix = IntegerMatrix.parse(parse1, parse2);
        // 进行特征选择，在这里我们选择清理掉特征突出性较小排名中，前70% 的维度
        System.out.println(">>> 3: =========");
        IntegerMatrix integerMatrix = matrix.featureSelection(0.7);
        // 打印去除结果
        System.out.println(integerMatrix);
        // 获取到向量中的数组对象
        System.out.println(">>> 4: =========");
        int[] ints3 = parse1.copyToNewArray();
        int[] ints4 = parse2.copyToNewArray();
        int[] ints5 = parse1.toArray();
        int[] ints6 = parse2.toArray();
        // 修改 ints5 ints6 两个数组的数值
        // 会导致ints1 ints2 以及其所有对象发生变化
        // 这是因为AS允许用户直接从对象中获取到数组
        ints5[1] = 1024;
        ints6[1] = 1024;
        System.out.println(Arrays.toString(ints1));
        System.out.println(Arrays.toString(ints2));
        // 而修改 ints3 ints4 则不会发生这种情况
        // 因此在需要对数组进行修改的时候，建议使用copyToNewArray
        ints3[1] = 2048;
        ints4[1] = 2048;
        System.out.println(Arrays.toString(ints1));
        System.out.println(Arrays.toString(ints2));
    }
}

复数

package zhao.algorithmMagic;

import zhao.algorithmMagic.operands.ComplexNumber;

public class MAIN1 {
    public static void main(String[] args) {
        // 创建2个复数对象
        ComplexNumber complexNumber1 = ComplexNumber.parse("1 + 2i");
        ComplexNumber complexNumber2 = ComplexNumber.parse(2, 1);
        // 打印两个复数对象
        System.out.println(">>> 1: =========");
        System.out.println(complexNumber1);
        System.out.println(complexNumber2);
        System.out.println(
                "complexNumber1 的实部 = " + complexNumber1.getReal() +
                        "\tcomplexNumber1 的虚部 = " + complexNumber1.getImaginary()
        );
        // 对两个复数对象进行基本运算
        System.out.println(">>> 2: =========");
        System.out.println(complexNumber1.add(complexNumber2));
        System.out.println(complexNumber1.diff(complexNumber2));
        System.out.println(complexNumber1.multiply(complexNumber2));
        System.out.println(complexNumber1.divide(complexNumber2));
        // 获取到两个复数的共轨
        System.out.println(">>> 3: =========");
        System.out.println(complexNumber1.conjugate());
        System.out.println(complexNumber2.conjugate());
    }
}

特征提取

特征提取的本质就是将一份计算机中并不认识的数据，转换成为向量或矩阵这种计算机可以用来计算的对象，使得后续的数据处理流程不会因此受挫，AS库中的特征提取主要针对字符串类的数据，接下来就进行一下演示！

字典特征提取

字典特征提取是将每一个数据作为矩阵中的一个行向量，AS库中采用one-hot编码的形式将数据进行转换，接下来看一个实际的例子。

代码与运行结果

package zhao.algorithmMagic;

import zhao.algorithmMagic.algorithm.featureExtraction.DictFeatureExtraction;
import zhao.algorithmMagic.operands.matrix.ColumnIntegerMatrix;

public class MAIN1 {
    public static void main(String[] args) {
        // 获取到字典特征提取组件
        DictFeatureExtraction dict = DictFeatureExtraction.getInstance("dict");
        // 构造一个需要被提取的数组，其中每一个元素都会作为一个行向量，每一个行内数据会作为一个列字段
        String[] strings = {
                "cat", "dog", "turtle", "fish", "cat"
        };
        // 开始提取特征矩阵
        ColumnIntegerMatrix extract = dict.extract(strings);
        // 打印矩阵
        System.out.println(extract);
        // 打印矩阵的hashMap形式
        extract.toHashMap().forEach((key, value) -> System.out.println(value.toString() + '\t' + key));
    }
}

接下来是运行结果，在运行结果中可以看到，针对所有的行数据都构建成为了一个数组，每一个数组都是一个向量对象，可以看到其中数据对应的值在每一列都是对应的，其中1代表所属标记，0代表不属于。

简单来说就是在进行字典特征提取之前将每一个行数据作为了一种类别，在构造的时候，为对应类别打上属于标记！

通过toHashMap函数可以获取到不同行数据对应的向量值。

词频特征提取

词频特征提取，顾名思义就是词频统计，一句话中的词频往往可以体现出这句话要表达的意义，本次就进行AS库中的词频特征向量提取实现。

代码与运行结果

package zhao.algorithmMagic;

import zhao.algorithmMagic.algorithm.featureExtraction.WordFrequency;
import zhao.algorithmMagic.operands.matrix.ColumnIntegerMatrix;

public class MAIN1 {
    public static void main(String[] args) {
        // 获取到词频特征提取组件
        WordFrequency word = WordFrequency.getInstance("word");
        // 构建一些被统计的文本
        String[] data = {
                "I love you, Because you are beautiful.",
                "I need you. Because I'm trapped"
        };
        // 开始统计
        ColumnIntegerMatrix extract1 = word.extract(data);
        // 打印结果
        System.out.println(extract1);
    }
}

下面是运行结果，可以看到它返回的是一个具有行列的整形矩阵，在矩阵中，列字段代表每一个被提取的文本，在矩阵中的行字段代表每一种词，其中矩阵的数值就是代表的对应词的出现频率。

特征选择

基于冗余排名比例去除

代码与运行结果

特征选择是所有矩阵中都可以使用的一个函数，其于1.14版本后开始支持，特征选择能够将诸多的冗余特征去除掉，AS库中的矩阵进行的特征选择都是基于行向量的操作，接下来是矩阵冗余特征去除实现！

package zhao.algorithmMagic;

import zhao.algorithmMagic.operands.matrix.DoubleMatrix;

public class MAIN1 {
    public static void main(String[] args) {
        // 创建一个矩阵对象，其中包含一些不具有特征突出行的向量
        DoubleMatrix doubleMatrix = DoubleMatrix.parse(
                new double[]{1, 2, 3, 4, 5, 6},
                new double[]{1, 2, 1, 1, 2, 1}, // 过于稳定，缺少特征突出性
                new double[]{10, 20, 30, 40, 50, 60}
        );
        System.out.println(doubleMatrix);
        // 开始调用特征去除函数，去除其中百分之40的行，并返回新矩阵
        DoubleMatrix doubleMatrix1 = doubleMatrix.featureSelection(0.4);
        System.out.println(doubleMatrix1);
    }
}

基于相关系数去除

代码与运行结果

package zhao.algorithmMagic;

import zhao.algorithmMagic.operands.matrix.ColumnDoubleMatrix;

public class MAIN1 {
    public static void main(String[] args) {
        // 创建一个矩阵对象，其中包含一些相关联的数据，本次要求将与年龄相关联的数据全部删掉
        ColumnDoubleMatrix columnDoubleMatrix = ColumnDoubleMatrix.parse(
                new String[]{"人员编号", "人员年龄", "人员工资(k)", "幸福指数"},
                new String[]{"N1", "N2", "N3", "N4", "N5"},
                new double[]{1, 25, 14, 16},
                new double[]{2, 45, 12, 10},
                new double[]{3, 33, 13, 12},
                new double[]{4, 42, 16, 17},
                new double[]{5, 25, 12, 10}
        );
        System.out.println(columnDoubleMatrix);
        // 转置矩阵
        columnDoubleMatrix = columnDoubleMatrix.transpose();
        System.out.println(columnDoubleMatrix);
        // 开始去除与第3行正相关的所有维度数据 TODO 需要保证相关维度的值接近！
        ColumnDoubleMatrix columnDoubleMatrix1 = columnDoubleMatrix.deleteRelatedDimensions(2, 0.5, 1);
        // 打印新矩阵
        System.out.println(columnDoubleMatrix1);
    }
}

机器学习

是特征工程中及其重要的一部分，A库中有诸多的算法计算组件，通过不同的计算组件实现不同的计算需求与学习目的，每一个计算组件采用惰性加载，不会将所有的计算组件全都实例化到内存中，减少冗余内存占用。度量计算

能够将两个坐标或其它操作数之间的距离计算出来，并将计算出来的结果作为函数的返回值，接下来看一些与之相关的所有函数。

度量计算函数说明

限定符和类型	方法和说明
`double`	getTrueDistance`(double[] doubles1, double[] doubles2)` 获取两个序列之间的距离 Get the Canberra distance between two sequences (note that there is no length check function here, if you need to use this method, please configure the array length check outside)
`double`	getTrueDistance`(`DoubleConsanguinityRoute `doubleConsanguinityRoute)` 计算一个路线的起始点与终止点的真实距离。
`double`	getTrueDistance`(`DoubleConsanguinityRoute2D `doubleConsanguinityRoute2D)` 计算一个路线的起始点与终止点的真实距离。
`double`	getTrueDistance`(int[] ints1, int[] ints2)` 获取两个序列之间的距离 Get the Canberra distance between two sequences (note that there is no length check function here, if you need to use this method, please configure the array length check outside)
`double`	getTrueDistance`(`IntegerConsanguinityRoute `integerConsanguinityRoute)` 计算一个路线的起始点与终止点的真实距离。
`double`	getTrueDistance`(`IntegerConsanguinityRoute2D `integerConsanguinityRoute2D)` 计算一个路线的起始点与终止点的真实距离。

度量计算组件列表

计算组件类型	支持版本	功能
zhao.algorithmMagic.algorithm.distanceAlgorithm.EuclideanMetric	v1.0	计算欧几里得距离
zhao.algorithmMagic.algorithm.distanceAlgorithm.CanberraDistance	v1.0	计算堪培拉距离
zhao.algorithmMagic.algorithm.distanceAlgorithm.ChebyshevDistance	v1.0	计算切比雪夫距离
zhao.algorithmMagic.algorithm.distanceAlgorithm.CosineDistance	v1.0	计算向量余弦度量
zhao.algorithmMagic.algorithm.distanceAlgorithm.HausdorffDistance	v1.0	计算豪斯多夫距离
zhao.algorithmMagic.algorithm.distanceAlgorithm.ManhattanDistance	v1.0	计算曼哈顿距离
zhao.algorithmMagic.algorithm.distanceAlgorithm.MinkowskiDistance	v1.0	计算闵可夫斯基距离
zhao.algorithmMagic.algorithm.distanceAlgorithm.StandardizedEuclideanDistance	v1.0	计算标准化欧几里得度量

度量计算API实现

本次我们使用欧几里得进行度量计算的API相关示例调用。

import zhao.algorithmMagic.algorithm.distanceAlgorithm.EuclideanMetric;
import zhao.algorithmMagic.operands.coordinate.FloatingPointCoordinates;
import zhao.algorithmMagic.operands.coordinate.IntegerCoordinateMany;

public class Test {

    public static void main(String[] args) {
        //  获取到德氏距离计算组件对象
        EuclideanMetric> euclideanMetric = EuclideanMetric.getInstance("zhao");
        // 创建需要计算的向量数组（也可以是坐标）
        double[] v1 = new double[]{1, 2, 3, 4, 5};
        double[] v2 = new double[]{1, 2, 3, 1, 5};
        double[] v3 = new double[]{1, 2, 3, 4, 5};
        // 开始进行计算
        System.out.println("v1 与 v2 之间的德式距离：" + euclideanMetric.getTrueDistance(v1, v2));
        System.out.println("v1 与 v3 之间的德式距离：" + euclideanMetric.getTrueDistance(v1, v3));
        System.out.println("v2 与 v3 之间的德式距离：" + euclideanMetric.getTrueDistance(v2, v3));
    }
}

聚合计算

在AS库中，聚合计算组件是专用于向量这一类多数值转换成为少量甚至1个数值任务的计算组件，能够实现多种需求的计算与操作。接下来展示与之相关的API介绍。

聚合计算函数说明

限定符和类型

方法和说明

double

calculation(double... doubles)

计算函数，将某个数组中的所有元素按照某个规则进行聚合 Compute function to aggregate all elements in an array according to a certain rule

int

calculation(int... doubles)

计算函数，将某个数组中的所有元素按照某个规则进行聚合 Compute function to aggregate all elements in an array according to a certain rule

聚合计算组件列表

计算组件类型	支持版本	功能
zhao.algorithmMagic.algorithm.aggregationAlgorithm.ExtremumAggregation	v1.14	计算一些数值的极值
zhao.algorithmMagic.algorithm.aggregationAlgorithm.WeightedAverage	v1.14	计算一些数值的加权平均数
zhao.algorithmMagic.algorithm.aggregationAlgorithm.ModularOperation	v1.14	计算一个序列或多个序列聚合之后的模长

聚合计算API实现

接下来使用AS库中的聚合计算组件，计算一个向量中的极值，AS库中的极值计算组件是一个聚合计算组件的实现类，其包含强大的数据过滤与极值计算功能，接下来就进行该组件的一个演示。

import zhao.algorithmMagic.algorithm.aggregationAlgorithm.ExtremumAggregation;

public class Test {

    public static void main(String[] args) {
        //  获取到德氏距离计算组件对象
        // 创建需要计算的向量数组（也可以是坐标）
        double[] v1 = new double[]{1, 2, 3, 4, 5,10, 1024, -1};
        // 获取到极值计算组件对象
        ExtremumAggregation ex = ExtremumAggregation.getInstance("ex");
        // 设置计算模式 - 计算向量中的最大值
        ex.setMode(ExtremumAggregation.MAX);
        System.out.println("最大值 = " + ex.calculation(v1));
        // 设置计算模式 - 计算向量中的最小值
        ex.setMode(ExtremumAggregation.MIN);
        System.out.println("最小值 = " + ex.calculation(v1));
        // 设置计算模式 - 计算向量中所有偶数的最大值（如果存在偶数的话就会返回预期结果）
        ex.setMode(ExtremumAggregation.EVEN_MAX);
        System.out.println("偶数中的最大值 = " + ex.calculation(v1));
        // 设置计算模式 - 计算向量中所有偶数的最小值（如果存在偶数的话就会返回预期结果）
        ex.setMode(ExtremumAggregation.EVEN_MIN);
        System.out.println("偶数中的最小值 = " + ex.calculation(v1));
        // 设置计算模式 - 计算向量中所有奇数的最大值（如果存在奇数的话就会返回预期结果）
        ex.setMode(ExtremumAggregation.ODD_MAX);
        System.out.println("奇数中的最大值 = " + ex.calculation(v1));
        // 设置计算模式 - 计算向量中所有奇数的最小值（如果存在奇数的话就会返回预期结果）
        ex.setMode(ExtremumAggregation.ODD_MIN);
        System.out.println("奇数中的最小值 = " + ex.calculation(v1));
    }
}

分类计算

专注于样本中某些未知类别的数据推断工作的计算组件，在AS库中常用的就是自定义距离计算组件的分类计算函数，能够通过用户所设设置的距离计算组件来进行样本之间的相似度分析等操作，在诸多支持自定义距离计算组件的分类组件中，默认的距离计算组件往往都是欧几里得计算，接下来就展示与之相关的一些信息！

分类计算函数说明

限定符和类型	方法和说明
HashMapArrayList`<DoubleVector>>`	classification`(double[][] data,` Map`<`String`,double[]> categorySample)` 计算一个矩阵中所有行或列的数据类别，并将计算之后的数据类别样本返回出去。
HashMapArrayList`<DoubleVector>>`	classification`(`DoubleMatrix `data,` Map`<`String`,double[]> categorySample)` 计算一个矩阵中所有行或列的数据类别，并将计算之后的数据类别样本返回出去。
HashMapArrayList`<IntegerVector>>`	classification`(int[][] data,` Map `categorySample)` 计算一个矩阵中所有行或列的数据类别，并将计算之后的数据类别样本返回出去。
HashMapArrayList`<IntegerVector>>`	classification`(`IntegerMatrix `data,` Map`<`String`,int[]> categorySample)` 计算一个矩阵中所有行或列的数据类别，并将计算之后的数据类别样本返回出去。
`static UDFDistanceClassification`	getInstance`(`String `Name)` 获取到该算法的类对象。

支持自定义距离计算组件的分类函数

限定符和类型	方法和说明
HashMap`<`String`,`ArrayList`<`DoubleVector`>>`	classification`(double[][] data,` Map`<`String`,double[]> categorySample)` 计算一个矩阵中所有行或列的数据类别，并将计算之后的数据类别样本返回出去。
HashMap`<`String`,`ArrayList`<`DoubleVector`>>`	classification`(`DoubleMatrix `data,` Map`<`String`,double[]> categorySample)` 计算一个矩阵中所有行或列的数据类别，并将计算之后的数据类别样本返回出去。
HashMap`<`String`,`ArrayList`<`IntegerVector`>>`	classification`(int[][] data,` Map`<`String`,int[]> categorySample)` 计算一个矩阵中所有行或列的数据类别，并将计算之后的数据类别样本返回出去。
HashMap`<`String`,`ArrayList`<`IntegerVector`>>`	classification`(`IntegerMatrix `data,` Map`<`String`,int[]> categorySample)` 计算一个矩阵中所有行或列的数据类别，并将计算之后的数据类别样本返回出去。
`static` UDFDistanceClassification	getInstance`(`String `Name)` 获取到该算法的类对象。

分类计算组件列表

计算组件类型

支持版本

功能

zhao.algorithmMagic.algorithm.

classificationAlgorithm.UDFDistanceClassification

v1.14

利用手动传入类别样本的方式，进行距离计算并分类

zhao.algorithmMagic.algorithm.

classificationAlgorithm.KnnClassification

v1.14

利用K 近邻算法将最近的K个特征进行距离

分类计算API实现

package zhao.algorithmMagic;

import zhao.algorithmMagic.algorithm.classificationAlgorithm.KnnClassification;
import zhao.algorithmMagic.algorithm.distanceAlgorithm.EuclideanMetric;
import zhao.algorithmMagic.operands.matrix.ColumnDoubleMatrix;
import zhao.algorithmMagic.operands.vector.DoubleVector;
import zhao.algorithmMagic.utils.DependentAlgorithmNameLibrary;

import java.util.ArrayList;
import java.util.HashMap;

public class MAIN1 {
    public static void main(String[] args) {
        // 创建一个矩阵对象其中包含一些数据
        ColumnDoubleMatrix columnDoubleMatrix = new ColumnDoubleMatrix(
                new String[]{"会说话", "会工具", "会觅食", "会编程"},
                new String[]{"人类", "人类", "?", "小鸟", "小鸟", "?", "人类", "人类"},
                new double[]{1, 1, 1, 0}, // 人类
                new double[]{1, 1, 1, 1}, // 人类
                new double[]{0, 0, 1, 0}, // 小鸟 未知量
                new double[]{1, 0, 1, 0}, // 小鸟
                new double[]{0, 0, 1, 0}, // 小鸟
                new double[]{0, 1, 1, 0}, // 人类 未知量
                new double[]{1, 1, 1, 1}, // 人类
                new double[]{0, 1, 1, 1}  // 人类
        );
        // 打乱矩阵中的数据 使用 221 作为随机种子
        columnDoubleMatrix = columnDoubleMatrix.shuffle(221);
        System.out.println(columnDoubleMatrix);
        // 开始进行矩阵数据的分类 先获取到knn近邻计算组件
        KnnClassification knn = KnnClassification.getInstance("knn");
        // 设置分类时需要使用的距离计算组件，这里使用的是欧几里得（如果不设置也是一样的）
        knn.setDistanceAlgorithm(

                EuclideanMetric.getInstance(DependentAlgorithmNameLibrary.EUCLIDEAN_METRIC_NAME)
        );        // 设置K近邻计算时候的 近邻阈值 K的具体数值
        knn.setK(10);
        // 开始进行计算与分类
        HashMap> classification = knn.classification(
                columnDoubleMatrix.getRowFieldNames(), columnDoubleMatrix.toArrays()
        );        // 打印分类结果 这里只会将需要分类的数据获取到
        classification.forEach((key, value) -> {
            System.out.print("\n种类：");
            System.out.println(key);
            System.out.println(value);
        });
    }
}

差异计算

差异计算用于计算两个样本之间的差异数值，其本身与度量计算组件是有关系的，一般来说差异计算组件的结果代表的就是差异系数，系数与样本之间的差异程线性关系，在AS库中，向量之间的距离由度量计算组件实现，而诸多其它类型的差异计算也可以被支持，其专门设立了一个差异计算组件模型，这类计算组件能支持的计算类型是泛型的，不同组件的实现能进行不同类型的数据对象的计算。

差异计算函数说明

限定符和类型

方法和说明

double

getDifferenceRatio(value value1, value value2)

计算两个事物之间的差异系数百分比 Calculate the percentage difference from the coefficient of difference between two things

差异计算组件列表

计算组件类型	支持版本	功能
zhao.algorithmMagic.algorithm. differenceAlgorithm.BrayCurtisDistance	v1.0	计算两个数据样本之间的布雷柯蒂斯差异系数
zhao.algorithmMagic.algorithm. differenceAlgorithm.DiceCoefficient	v1.0	计算两个数据样本之间的Dice差异系数
zhao.algorithmMagic.algorithm. differenceAlgorithm.EditDistance	v1.0	计算两个数据样本之间的最小编辑次数
zhao.algorithmMagic.algorithm. differenceAlgorithm.HammingDistance	v1.0	计算两个数据样本之间的汉明差异系数
zhao.algorithmMagic.algorithm. differenceAlgorithm.JaccardSimilarityCoefficient	v1.0	计算两个数据样本之间的杰卡德相似系数

差异计算API实现

package zhao.algorithmMagic;

import zhao.algorithmMagic.algorithm.differenceAlgorithm.BrayCurtisDistance;
import zhao.algorithmMagic.algorithm.differenceAlgorithm.HammingDistance;
import zhao.algorithmMagic.operands.coordinate.DoubleCoordinateThree;
import zhao.algorithmMagic.operands.coordinate.IntegerCoordinateThree;

public class MAIN1 {
    public static void main(String[] args) {
        // 获取到两个差异计算组件，分别用于计算坐标之间的距离与字符串之间的差异
        // 该组件能够接收字符串对象
        HammingDistance hammingDistance = HammingDistance.getInstance("HammingDistance");
        // 这里指定组件能够计算的坐标数据类型
        BrayCurtisDistance brayCurtisDistance = BrayCurtisDistance.getInstance("BrayCurtisDistance");
        // 开始进行字符串之间的距离计算
        System.out.println(hammingDistance.getDifferenceRatio("Hello Zhao!", "Hello Yang!"));
        // 开始进行坐标之间的差异计算
        System.out.println(brayCurtisDistance.getDifferenceRatio(
                new DoubleCoordinateThree(1, 0, 1), new DoubleCoordinateThree(2, 1, 1)
        ));
    }
}

路径计算

路径计算专用于路径网络中的计算，能够在一个网络中快速的计算出我们需要的目标，目前AS库中能够计算出一个路线网络中的最短路径，与生成潜在的路径联系等功能，接下来就是相关信息多少介绍。

路径计算函数说明

限定符和类型	方法和说明
`void`	addRoute`(`DoubleConsanguinityRoute `doubleConsanguinityRoute)` 添加一个需要被算法处理的线路。
`void`	addRoute`(`IntegerConsanguinityRoute `integerConsanguinityRoute)` 添加一个需要被算法处理的线路。
`void`	clear`()` 一般情况下。该函数用于清理所有被添加的线路

路径计算组件列表

计算组件类型	支持版本	功能
zhao.algorithmMagic.algorithm.generatingAlgorithm.Dijkstra	v1.0	计算一个路线网络中的最小距离
zhao.algorithmMagic.algorithm.generatingAlgorithm.Dijkstra2D	v1.0	计算一个路线网络中的最小距离
zhao.algorithmMagic.algorithm.generatingAlgorithm.DirectionalDijkstra2D	v1.0	计算一个路线网络中的最小距离
zhao.algorithmMagic.algorithm.generatingAlgorithm.ZhaoCoordinateNet	v1.0	计算一个路线网络潜在联系，并生成对应的路线对象到路线网中
zhao.algorithmMagic.algorithm.generatingAlgorithm.ZhaoCoordinateNet2D	v1.0	计算一个路线网络潜在联系，并生成对应的路线对象到路线网中

路径计算组件API实现

路径计算组件中最常用的同时也是现有计算算法中比较熟悉的dijkstra 计算组件，在AS的实现中，其可以计算出一个复杂路线网络中的最短线路对象，并将其在网络中进行标记，接下来就是与之相关的API实现。

package zhao.algorithmMagic;

import zhao.algorithmMagic.algorithm.distanceAlgorithm.EuclideanMetric;
import zhao.algorithmMagic.algorithm.generatingAlgorithm.Dijkstra;
import zhao.algorithmMagic.operands.coordinate.IntegerCoordinateMany;
import zhao.algorithmMagic.operands.coordinateNet.DoubleRouteNet;
import zhao.algorithmMagic.operands.route.DoubleConsanguinityRoute;
import zhao.algorithmMagic.operands.route.IntegerConsanguinityRoute;

public class MAIN1 {
    public static void main(String[] args) {
        // 获取到 Dijkstra 算法
        Dijkstra dijkstra = Dijkstra.getInstance("Dijkstra");
        // 向算法中添加一些线路
        IntegerCoordinateMany integerCoordinateMany_B = new IntegerCoordinateMany(1, 2, 8);
        dijkstra.addRoute(
                IntegerConsanguinityRoute.parse(
                        "A -> B", new IntegerCoordinateMany(1, 2, 3), integerCoordinateMany_B
                )
        );
        dijkstra.addRoute(
                IntegerConsanguinityRoute.parse(
                        "C -> B", new IntegerCoordinateMany(0, 2, 3), integerCoordinateMany_B
                )
        );
        dijkstra.addRoute(
                IntegerConsanguinityRoute.parse(
                        "D -> B", new IntegerCoordinateMany(-1, 2, 3), integerCoordinateMany_B
                )
        );
        // 设置计算时需要的度量计算组件
        dijkstra.setDistanceAlgorithm(EuclideanMetric.getInstance("E"));
        // 开始计算出以B为中心的最短线路网
        DoubleRouteNet doubleRouteNet = dijkstra.getShortestPath("B");
        // 打印出网络中的最短线路 最短线路将会被添加到网络中的主标记集合，因此这里获取到主标记集合，并打印最短坐标的名称
        doubleRouteNet
                // 获取到所有被标记的线路对象 dijkstra 会将最短线路标记出来
                .getDoubleConsanguinityRouteHashMap_MasterTag()
                // 将所有的线路对象转换成线路路径名字
                .values().stream().map(DoubleConsanguinityRoute::getRouteName)
                // 开始打印所有的路径名称
                .forEach(System.out::println);
    }
}

数据预处理(标准化/归一化)

特征工程中的数据预处理主要包括数据降维与数据维度标准和归一化操作，针对数据降维等相关函数在矩阵中有直接的调用，针对数据的标准化与归一化则需要使用到数据预处理算法，接下来就进行一个相关的介绍与演示。

数据预处理函数说明

注意：在这里的函数统一使用标准化作为函数名，但不影响归一化组件是实现出来的序列归一操作，后期会更改此函数名称。

限定符和类型	方法和说明
String	getAlgorithmName`()`
`boolean`	init`()` 算法模块的初始化方法。 The initialization method of the algorithm module.
`abstract FloatingPointCoordinates<DoubleCoordinateMany>`	NormalizedSequence`(`DoubleCoordinateMany `v)` 将一个序列进行标准化，具体的标准化有不同的实现
`abstract DoubleVector`	NormalizedSequence`(`DoubleVector `doubleVector)` 将一个序列进行标准化，具体的标准化有不同的实现
`abstract IntegerCoordinates<IntegerCoordinateMany>`	NormalizedSequence`(`IntegerCoordinateMany `v)` 将一个序列进行标准化，具体的标准化有不同的实现
`abstract IntegerVector`	NormalizedSequence`(`IntegerVector `integerVector)` 将一个序列进行标准化，具体的标准化有不同的实现

数据预处理组件列表

计算组件类型	支持版本	功能
zhao.algorithmMagic.algorithm.normalization.LinearNormalization	v1.0	将一个向量数据样本进行线性归一化
zhao.algorithmMagic.algorithm.normalization.Z_ScoreNormalization	v1.0	将一个向量数据样本进行正负均匀分配的标准化

数据预处理API实现

import zhao.algorithmMagic.algorithm.normalization.LinearNormalization;
import zhao.algorithmMagic.operands.vector.DoubleVector;

public class Test {

    public static void main(String[] args) {
        //  获取到一个向量对象
        DoubleVector doubleVector = DoubleVector.parse(1, 2, 3, 4, 5, 6, 5, 4, 3, 2, 1);
        // 获取到数据预处理归一化组件
        LinearNormalization line = LinearNormalization.getInstance("line");
        // 设置归一化区间
        line.setMax(3);
        line.setMin(-3);
        // 开始进行向量归一化
        DoubleVector doubleVector1 = line.NormalizedSequence(doubleVector);
        // 打印归一化之后的向量数据
        System.out.println(doubleVector1);
    }
}

概率计算

概率计算是一种以标准系数衡量事件发生可能性的数据计算组件，其具有强大的概率计算体系，能够针对事务期望做出类别预分析等操作。

概率计算函数说明

限定符和类型	方法和说明
`double`	estimate`(`DoubleMatrix `doubleMatrix,` ArrayDoubleFiltering `StatisticCondition1,` ArrayDoubleFiltering `StatisticCondition2)` 计算一个矩阵中的某些条件限制下的联合概率结果 P(A\|B) 其中的分子与分母值！
`double`	estimate`(`IntegerMatrix `integerMatrix,` ArrayIntegerFiltering `StatisticCondition1,` ArrayIntegerFiltering `StatisticCondition2)` 计算一个矩阵中的某些条件限制下的联合概率结果 P(A\|B) 其中的分子与分母值！
`abstract double[]`	estimateGetFraction`(`DoubleMatrix `doubleMatrix,` ArrayDoubleFiltering `StatisticCondition1,` ArrayDoubleFiltering `StatisticCondition2)` 计算一个矩阵中的某些条件限制下的联合概率结果 P(A\|B) 其中的分子与分母值！
`abstract double[]`	estimateGetFraction`(`IntegerMatrix `integerMatrix,` ArrayIntegerFiltering `StatisticCondition1,` ArrayIntegerFiltering `StatisticCondition2)` 计算一个矩阵中的某些条件限制下的联合概率结果 P(A\|B) 其中的分子与分母值！
String	getAlgorithmName`()`
`boolean`	init`()` 算法模块的初始化方法，在这里您可以进行组件的初始化方法，当初始化成功之后，该算法就可以处于就绪的状态，一般这里就是将自己添加到算法管理类中 The initialization method of the algorithm module, here you can perform the initialization method of the component, when the initialization is successful, the algorithm can be in a ready state, generally here is to add yourself to the algorithm management class

概率计算组件列表

计算组件类型	支持版本	功能
zhao.algorithmMagic.algorithm.probabilisticAlgorithm.NaiveBayes	v1.14	通过较小的计算量计算出形如”P(A\B)“事件发生的概率数值

概率计算API实现

import zhao.algorithmMagic.algorithm.probabilisticAlgorithm.NaiveBayes;
import zhao.algorithmMagic.operands.matrix.ColumnIntegerMatrix;
import zhao.algorithmMagic.utils.filter.ArrayIntegerFiltering;


public class Test {
    public static void main(String[] args) {
        String[] strings1 = {"职业", "体型", "喜欢"};
        // 准备一个数据矩阵
        // 职业：1-程序员  2-产品  3-美工
        ColumnIntegerMatrix parse = ColumnIntegerMatrix.parse(
                strings1,
                new String[]{"N1", "N2", "N3", "N4", "N5", "N6", "N7", "N8", "N9", "N10"},
                new int[]{1, 1, 0},
                new int[]{2, 0, 1},
                new int[]{1, 0, 1},
                new int[]{1, 1, 1},
                new int[]{3, 0, 0},
                new int[]{3, 1, 0},
                new int[]{2, 0, 1},
                new int[]{2, 1, 1},
                new int[]{2, 1, 0},
                new int[]{2, 1, 0}
        );
        System.out.println(parse);
        // 打乱样本 删除原先的矩阵，并打印新矩阵
        parse = parse.shuffle(22);
        System.out.println(parse);
        // 开始获取朴素贝叶斯算法 计算目标：在自己是产品同时超重的情况下，被喜欢的概率 P(被喜欢|产品,超重)
        NaiveBayes bayes = NaiveBayes.getInstance("bayes");
        // 构造事件A 自己被喜欢
        ArrayIntegerFiltering arrayIntegerFilteringA = v -> v[2] == 1;
        // 构造事件B 自己是产品，同时超重
        ArrayIntegerFiltering arrayIntegerFilteringB = v -> v[0] == 2 && v[1] == 1;
        // 开始计算结果 这个结果是一个条件概率值 P(A|B) 在B事件的前提下，A事件发生的概率
        System.out.println(bayes.estimate(parse, arrayIntegerFilteringA, arrayIntegerFilteringB));
    }
}

决策计算

决策计算函数说明

限定符和类型	方法和说明
`double`	estimate`(`DoubleMatrix `doubleMatrix,` ArrayDoubleFiltering `StatisticCondition1,` ArrayDoubleFiltering `StatisticCondition2)` 计算一个矩阵中的某些条件限制下的联合概率结果 P(A\|B) 其中的分子与分母值！
`double`	estimate`(`IntegerMatrix `integerMatrix,` ArrayIntegerFiltering `StatisticCondition1,` ArrayIntegerFiltering `StatisticCondition2)` 计算一个矩阵中的某些条件限制下的联合概率结果 P(A\|B) 其中的分子与分母值！
`abstract double[]`	estimateGetFraction`(`DoubleMatrix `doubleMatrix,` ArrayDoubleFiltering `StatisticCondition1,` ArrayDoubleFiltering `StatisticCondition2)` 计算一个矩阵中的某些条件限制下的联合概率结果 P(A\|B) 其中的分子与分母值！
`abstract double[]`	estimateGetFraction`(`IntegerMatrix `integerMatrix,` ArrayIntegerFiltering `StatisticCondition1,` ArrayIntegerFiltering `StatisticCondition2)` 计算一个矩阵中的某些条件限制下的联合概率结果 P(A\|B) 其中的分子与分母值！
String	getAlgorithmName`()`
`boolean`	init`()` 算法模块的初始化方法，在这里您可以进行组件的初始化方法，当初始化成功之后，该算法就可以处于就绪的状态，一般这里就是将自己添加到算法管理类中 The initialization method of the algorithm module, here you can perform the initialization method of the component, when the initialization is successful, the algorithm can be in a ready state, generally here is to add yourself to the algorithm management class

决策计算组件列表

计算组件类型	支持版本	功能
zhao.algorithmMagic.algorithm.schemeAlgorithm.DecisionTree	v1.14	决策树计算组件，计算出最有效率的筛选路径，并按照路径将传递进来的事件处理函数进行排列
zhao.algorithmMagic.algorithm.schemeAlgorithm.RandomForest	v1.15	随机森林计算组件，随机分布样本自动选择最优秀解

决策计算API实现

import zhao.algorithmMagic.algorithm.schemeAlgorithm.DecisionTree;
import zhao.algorithmMagic.operands.matrix.ColumnIntegerMatrix;
import zhao.algorithmMagic.utils.filter.ArrayIntegerFiltering;

import java.util.ArrayList;

public class Test {
    public static void main(String[] args) {
        // 获取到决策树计算组件
        DecisionTree decisionTree = DecisionTree.getInstance("DecisionTree");
        String[] strings1 = {"职业", "体型", "喜欢"};
        // 准备一个数据矩阵
        // 职业：1-程序员  2-产品  3-美工
        ColumnIntegerMatrix parse = ColumnIntegerMatrix.parse(
                strings1,
                new String[]{"N1", "N2", "N3", "N4", "N5", "N6", "N7", "N8", "N9", "N10"},
                new int[]{1, 1, 0},
                new int[]{2, 0, 1},
                new int[]{1, 0, 1},
                new int[]{1, 1, 1},
                new int[]{3, 0, 0},
                new int[]{3, 1, 0},
                new int[]{2, 0, 1},
                new int[]{2, 1, 1},
                new int[]{2, 1, 0},
                new int[]{2, 1, 0}
        );
        System.out.println(parse);
        // 使用22作为随机种子，打乱样本 删除原先的矩阵，并打印新矩阵
        parse = parse.shuffle(22);
        System.out.println(parse);
        // 开始进行筛选，要去获取到 职业=3 体型=1 喜欢=0 的行数据，并将其处理过程展示出来
        // 先将事件对象准备出来
        ArrayList arrayList = new ArrayList<>();
        // 添加职业=3的事件
        arrayList.add(v -> v[0] == 3);
        // 添加体型=1的事件
        arrayList.add(v -> v[1] == 1);
        // 添加喜欢=0的是啊金
        arrayList.add(v -> v[2] == 0);
        String s = decisionTree.executeGetString(parse.toArrays(), arrayList);
        // 打印结果
        System.out.println(s);
    }
}

模型预测

在机器学习中的预测部分经常是使用的模型对数据的趋势进行的数据预测，在我们的已知的这些计算组件中，常用的就是线性回归计算组件，在该组件这种，您可以使用一个预先设置好的线性模型，来对数据模型中的未知回归参数进行推断与计算。

模型预测

模型预测函数说明

限定符和类型	方法和说明
`java.lang.String`	getAlgorithmName`()`
`boolean`	init`()` 算法模块的初始化方法，在这里您可以进行组件的初始化方法，当初始化成功之后，该算法就可以处于就绪的状态，一般这里就是将自己添加到算法管理类中 The initialization method of the algorithm module, here you can perform the initialization method of the component, when the initialization is successful, the algorithm can be in a ready state, generally here is to add yourself to the algorithm management class
`abstract double[]`	modelInference`(int targetIndex, DoubleMatrix doubleMatrix)` 通过给定的一个模型，不断修正模型中的参数或其它方式，最终返回在最接近样本本身时所有参数组成的数组 Through a given model, continuously modify the parameters in the model or other ways, and finally return the array of all parameters when they are closest to the sample itself.
`abstract double[]`	modelInference`(int targetIndex, IntegerMatrix integerMatrix)` 通过给定的一个模型，不断修正模型中的参数或其它方式，最终返回在最接近样本本身时所有参数组成的数组 Through a given model, continuously modify the parameters in the model or other ways, and finally return the array of all parameters when they are closest to the sample itself.

概率预测组件列表

计算组件类型

支持版本

功能

zhao.algorithmMagic.algorithm.

modelAlgorithm.LinearRegression

v1.15

该计算组件能够实现快速的一元线性回归计算

概率预测API实现

package zhao.algorithmMagic;

import zhao.algorithmMagic.algorithm.modelAlgorithm.LinearRegression;
import zhao.algorithmMagic.operands.matrix.ColumnDoubleMatrix;

public class MAIN1 {
    public static void main(String[] args) throws CloneNotSupportedException {
        // 创建一个矩阵对象，其中包含一些数据，现在需要找到最块的筛选路线，并使用此路线将数据进行一次获取
        ColumnDoubleMatrix columnDoubleMatrix = ColumnDoubleMatrix.parse(
                new String[]{"x", "y"},
                null,
                new double[]{1, 50},
                new double[]{2, 100},
                new double[]{3, 150},
                new double[]{4, 200}
        );
        // 获取到线性回归
        LinearRegression line = LinearRegression.getInstance("line");
        // 开始计算线性回归 计算x 与 y 之间的关系 其中 x 为自变量  y 为因变量
        // 设置自变量的列编号
        line.setFeatureIndex(0);
        // 设置因变量的列编号
        line.setTargetIndex(1);
        // 计算出回归系数与结果值
        double[] doubles = line.modelInference(columnDoubleMatrix);
        // 获取到线性回归计算之后的权重数组，并将权重数组插入到公式打印出来
        System.out.println("数据特征：");
        System.out.println("y = x * " + doubles[0] + " + " + doubles[1]);
    }
}

Algorithm Star开源协议

Apache License

Version 2.0, January 2004

http://www.apache.org/licenses/

TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION

1. Definitions.

"License" shall mean the terms and conditions for use, reproduction,

and distribution as defined by Sections 1 through 9 of this document.

"Licensor" shall mean the copyright owner or entity authorized by

the copyright owner that is granting the License.

"Legal Entity" shall mean the union of the acting entity and all

other entities that control, are controlled by, or are under common

control with that entity. For the purposes of this definition,

"control" means (i) the power, direct or indirect, to cause the

direction or management of such entity, whether by contract or

otherwise, or (ii) ownership of fifty percent (50%) or more of the

outstanding shares, or (iii) beneficial ownership of such entity.

"You" (or "Your") shall mean an individual or Legal Entity

exercising permissions granted by this License.

"Source" form shall mean the preferred form for making modifications,

including but not limited to software source code, documentation

source, and configuration files.

"Object" form shall mean any form resulting from mechanical

transformation or translation of a Source form, including but

not limited to compiled object code, generated documentation,

and conversions to other media types.

"Work" shall mean the work of authorship, whether in Source or

Object form, made available under the License, as indicated by a

(an example is provided in the Appendix below).

"Derivative Works" shall mean any work, whether in Source or Object

form, that is based on (or derived from) the Work and for which the

editorial revisions, annotations, elaborations, or other modifications

represent, as a whole, an original work of authorship. For the purposes

of this License, Derivative Works shall not include works that remain

separable from, or merely link (or bind by name) to the interfaces of,

the Work and Derivative Works thereof.

"Contribution" shall mean any work of authorship, including

the original version of the Work and any modifications or additions

to that Work or Derivative Works thereof, that is intentionally

submitted to Licensor for inclusion in the Work by the copyright owner

or by an individual or Legal Entity authorized to submit on behalf of

the copyright owner. For the purposes of this definition, "submitted"

means any form of electronic, verbal, or written communication sent

to the Licensor or its representatives, including but not limited to

communication on electronic mailing lists, source code control systems,

and issue tracking systems that are managed by, or on behalf of, the

Licensor for the purpose of discussing and improving the Work, but

excluding communication that is conspicuously marked or otherwise

designated in writing by the copyright owner as "Not a Contribution."

"Contributor" shall mean Licensor and any individual or Legal Entity

on behalf of whom a Contribution has been received by Licensor and

subsequently incorporated within the Work.

2. Grant of Copyright License. Subject to the terms and conditions of

this License, each Contributor hereby grants to You a perpetual,

worldwide, non-exclusive, no-charge, royalty-free, irrevocable

publicly display, publicly perform, sublicense, and distribute the

Work and such Derivative Works in Source or Object form.

3. Grant of Patent License. Subject to the terms and conditions of

this License, each Contributor hereby grants to You a perpetual,

worldwide, non-exclusive, no-charge, royalty-free, irrevocable

(except as stated in this section) patent license to make, have made,

use, offer to sell, sell, import, and otherwise transfer the Work,

where such license applies only to those patent claims licensable

by such Contributor that are necessarily infringed by their

Contribution(s) alone or by combination of their Contribution(s)

with the Work to which such Contribution(s) was submitted. If You

institute patent litigation against any entity (including a

cross-claim or counterclaim in a lawsuit) alleging that the Work

or a Contribution incorporated within the Work constitutes direct

or contributory patent infringement, then any patent licenses

granted to You under this License for that Work shall terminate

as of the date such litigation is filed.

4. Redistribution. You may reproduce and distribute copies of the

Work or Derivative Works thereof in any medium, with or without

modifications, and in Source or Object form, provided that You

meet the following conditions:

(a) You must give any other recipients of the Work or

Derivative Works a copy of this License; and

(b) You must cause any modified files to carry prominent notices

stating that You changed the files; and

that You distribute, all copyright, patent, trademark, and

attribution notices from the Source form of the Work,

excluding those notices that do not pertain to any part of

the Derivative Works; and

(d) If the Work includes a "NOTICE" text file as part of its

distribution, then any Derivative Works that You distribute must

include a readable copy of the attribution notices contained

within such NOTICE file, excluding those notices that do not

pertain to any part of the Derivative Works, in at least one

of the following places: within a NOTICE text file distributed

as part of the Derivative Works; within the Source form or

documentation, if provided along with the Derivative Works; or,

within a display generated by the Derivative Works, if and

wherever such third-party notices normally appear. The contents

of the NOTICE file are for informational purposes only and

do not modify the License. You may add Your own attribution

notices within Derivative Works that You distribute, alongside

or as an addendum to the NOTICE text from the Work, provided

that such additional attribution notices cannot be construed

as modifying the License.

You may add Your own copyright statement to Your modifications and

may provide additional or different license terms and conditions

for use, reproduction, or distribution of Your modifications, or

for any such Derivative Works as a whole, provided Your use,

reproduction, and distribution of the Work otherwise complies with

the conditions stated in this License.

5. Submission of Contributions. Unless You explicitly state otherwise,

any Contribution intentionally submitted for inclusion in the Work

by You to the Licensor shall be under the terms and conditions of

this License, without any additional terms or conditions.

Notwithstanding the above, nothing herein shall supersede or modify

the terms of any separate license agreement you may have executed

with Licensor regarding such Contributions.

6. Trademarks. This License does not grant permission to use the trade

names, trademarks, service marks, or product names of the Licensor,

except as required for reasonable and customary use in describing the

origin of the Work and reproducing the content of the NOTICE file.

7. Disclaimer of Warranty. Unless required by applicable law or

agreed to in writing, Licensor provides the Work (and each

Contributor provides its Contributions) on an "AS IS" BASIS,

WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or

implied, including, without limitation, any warranties or conditions

of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A

PARTICULAR PURPOSE. You are solely responsible for determining the

appropriateness of using or redistributing the Work and assume any

risks associated with Your exercise of permissions under this License.

8. Limitation of Liability. In no event and under no legal theory,

whether in tort (including negligence), contract, or otherwise,

unless required by applicable law (such as deliberate and grossly

negligent acts) or agreed to in writing, shall any Contributor be

liable to You for damages, including any direct, indirect, special,

incidental, or consequential damages of any character arising as a

result of this License or out of the use or inability to use the

Work (including but not limited to damages for loss of goodwill,

work stoppage, computer failure or malfunction, or any and all

other commercial damages or losses), even if such Contributor

has been advised of the possibility of such damages.

9. Accepting Warranty or Additional Liability. While redistributing

the Work or Derivative Works thereof, You may choose to offer,

and charge a fee for, acceptance of support, warranty, indemnity,

or other liability obligations and/or rights consistent with this

License. However, in accepting such obligations, You may act only

on Your own behalf and on Your sole responsibility, not on behalf

of any other Contributor, and only if You agree to indemnify,

defend, and hold each Contributor harmless for any liability

incurred by, or claims asserted against, such Contributor by reason

of your accepting any such warranty or additional liability.

END OF TERMS AND CONDITIONS

APPENDIX: How to apply the Apache License to your work.

To apply the Apache License to your work, attach the following

boilerplate notice, with the fields enclosed by brackets "[]"

replaced with your own identifying information. (Don't include

the brackets!) The text should be enclosed in the appropriate

comment syntax for the file format. We also recommend that a

file or class name and description of purpose be included on the

same "printed page" as the copyright notice for easier

identification within third-party archives.

Licensed under the Apache License, Version 2.0 (the "License");

you may not use this file except in compliance with the License.

You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software

distributed under the License is distributed on an "AS IS" BASIS,

WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.

See the License for the specific language governing permissions and

limitations under the License.

使用Wolfram Alpha API在LangChain中的应用 shuoac langchain python
在AI技术应用中，WolframAlpha以其强大的计算能力和信息检索功能，被广泛应用于各类智能系统中。本文将为您介绍如何结合LangChain使用WolframAlphaAPI，以实现功能强大的计算和信息查询服务。技术背景介绍WolframAlpha是由WolframResearch开发的问答引擎，它通过计算从外部数据源中获取答案，实现对事实性问题的解答。在开发智能应用时，我们可以利用Wolfr
C++多线程苜柠 C++c++
线程：async和thread锁：C++11中的std::atomic和std::mutex推荐文章：C++11多线程（std::thread）详解_c++11线程使用-CSDN博客c++标准库多线程-云山漫卷-博客园std::lock_guard是一个RAII风格的简单的锁管理器，它在构造时自动加锁，在析构时自动解锁。#include#include#include#includestd::mu
java面向对象基础 miehamiha java 开发语言
引入三大特征封装核心思想就是“隐藏细节”、“数据安全”，将对象不需要让外界访问的成员变量和方法私有化，只提供符合开发者意愿的公有方法来访问这些数据和逻辑，保证了数据的安全和程序的稳定。所有的内容对外部不可见。继承子类可以继承父类的属性和方法，并对其进行拓展。将其他的功能继承下来继续发展。多态同一种类型的对象执行同一个方法时可以表现出不同的行为特征。通过继承的上下转型、接口的回调以及方法的重写和重载
如何使用百度云Qianfan进行AI应用开发 dgay_hua 百度云人工智能云计算 python
技术背景介绍百度云Qianfan是由百度公司提供的云服务，包含了云存储、文件管理、资源共享、以及第三方集成等功能。作为开发者，Qianfan支持多种AI应用开发组件，包括大语言模型（LLMs）、对话模型、嵌入模型和向量存储等。本文将重点介绍如何利用这些组件进行实际的AI应用开发。核心原理解析百度云Qianfan通过其丰富的API接口和云计算能力，为开发者提供了易于集成的AI开发环境。核心组件如Qi
如何用PHP开发一个api数据接口幽蓝计划 php
对于一个iOS开发者来说，我一直觉得会写接口是一件很酷的事情，因为它可以实时修改前台数据，而不像App一样需要更新版本和接受审核。更重要的是，它意味着你的技术完成了一个闭环，可以独自完成一整个项目的开发。PHP是我接触的第一个脚本语言，使用之后更是感觉PHP功能强大，开发过程非常友好方便，虽然之后也学习过Python、JavaScript等语言，但现在还是习惯使用PHP，下面就来介绍一下如何用PH
Gone v2 使用 Gone Viper 组件进行本地配置 dapeng-大鹏 Gone框架介绍 Gone框架配置管理 Viper配置组件多格式配置文件配置自动加载机制环境变量配置覆盖层级化配置结构 Go应用配置注入
发现gone-io/gone：一个优雅的Go依赖注入框架！它让您的代码更简洁、更易测试。框架轻量却功能强大，完美平衡了灵活性与易用性。⭐如果您喜欢这个项目，请给我们点个星！您的支持是我们前进的动力！欢迎贡献代码或提出建议，一起让gone变得更好！‍#golang#依赖注入#开源github.com/gone-io/gone本文原地址：https://github.com/gone-io/goner
DataGridView使用方法汇总 weixin_33933118 操作系统数据库 ui
DataGridView控件DataGridView是用于WindowsFroms2.0的新网格控件。它能够代替先前版本号中DataGrid控件，它易于使用并高度可定制，支持许多我们的用户须要的特性。关于本文档：本文档不准备面面俱到地介绍DataGridView，而是着眼于深入地介绍一些技术点的高级特性。本文档按逻辑分为5个章节，首先是结构和特性的概览，其次是内置的列/单元格类型的介绍，再次是数据
HarmonyOS Next 企业级分布式办公应用实战：构建高效协同的办公新生态 lyc233333 harmonyos 分布式华为
在数字化办公浪潮汹涌的当下，企业对于高效、便捷且协同性强的办公应用需求愈发迫切。华为鸿蒙HarmonyOSNext系统凭借其先进的分布式技术，为打造创新型企业级分布式办公应用提供了坚实的基础。本文将基于实际开发经验，深入剖析如何利用HarmonyOSNext构建企业级分布式办公应用，涵盖从需求分析到系统架构搭建，再到核心功能实现以及性能优化等关键环节。一、办公应用需求与系统架构搭建（一）企业级分布
探索Astra DB与LangChain的集成：从向量存储到对话历史 eahba 数据库 langchain python
技术背景介绍AstraDB是DataStax推出的一款无服务器的向量数据库，基于ApacheCassandra®构建，并通过易于使用的JSONAPI提供服务。AstraDB的独特之处在于其强大的向量存储能力，这在处理自然语言处理任务时尤为突出。LangChain与AstraDB的集成为开发者提供了强大的工具链，从数据存储到语义缓存，再到自查询检索，帮助简化复杂的数据操作。核心原理解析LangCha
使用LangSmith追踪LLM令牌使用情况的指南 dgay_hua java 服务器前端 python
在将应用程序投入生产时，追踪令牌使用情况以计算成本是一个重要的步骤。本文将深入探讨如何从LangChain模型调用中获取这些信息。技术背景介绍在大语言模型（LLM）的应用中，令牌使用计数是估算模型调用成本的基础。LangSmith提供了一种有效的方式来帮助跟踪应用程序中的令牌使用。此外，使用回调机制可以在不同的API调用中进行监控，这对于复杂的应用程序尤其重要。核心原理解析通过在API调用中使用回
如何评估一个RAG系统（RAGas评测框架）-下篇写程序的小火箭大语言模型人工智能语言模型 chatgpt langchain gpt
RAGas是一个用于评测RAG系统的评测框架，它支持与不同大语言模型的集成，并与langchain生态打通，能够很方便的构建评测系统。下面是RAGas的一些链接论文：https://arxiv.org/pdf/2309.15217官方文档：Ragashttps://github.com/explodinggradients/ragas官方文档及github对框架的使用介绍的比较详细，本文不会就该方
【AI大模型应用开发】【RAG评估】0. 综述：一文了解RAG评估方法、工具与指标同学小张大模型人工智能笔记经验分享 gpt agi AIGC
大家好，我是同学小张，日常分享AI知识和实战案例欢迎点赞+关注，持续学习，持续干货输出。+v:jasper_8017一起交流，一起进步。微信公众号也可搜【同学小张】本站文章一览：前面我们学习了RAG的基本框架并进行了实践，我们也知道使用它的目的是为了改善大模型在一些方面的不足：如训练数据不全、无垂直领域数据、容易出现幻觉等。那么如何评估RAG的效果呢？本文我们来了解一下。文章目录推荐前置阅读0.R
Java 环境配置与 JAR 文件问题解决全攻略不羁。。杂记丨每天亿点小知识 java jar 开发语言
目录一、Java环境配置指南1.Windows系统配置步骤1.1下载安装JDK1.2配置环境变量2.Linux/macOS系统配置2.1终端命令配置二、JAR文件问题诊断与修复1.检查JAR文件完整性1.1命令行验证1.2哈希值校验2.依赖库管理方案2.1Maven依赖配置示例2.2命令行指定依赖三、常见问题解决方案1.环境变量不生效处理1.1清除系统缓存1.2路径优先级调整2.旧版本残留处理2.
HarmonyOS Next--实现炫酷下拉刷新与上拉加载 harmonyos-next
摘要：本文通过HarmonyOS的PullToRefresh组件，结合Canvas绘图技术，实现具有动态小球特效的下拉刷新与上拉加载功能。文章将详细解析动画绘制原理、手势交互逻辑以及性能优化要点。一、效果预览实现功能包含：弹性下拉刷新：带有透明度渐变的圆形聚合动画波浪加载动画：三个小球按序弹跳的加载效果数据动态加载：模拟异步数据请求与列表更新流畅交互体验：支持列表惯性滑动与边缘回弹二、核心实现原理
COMP 315: Cloud Computing for E-Commerce 后端
Assignment1:JavascriptCOMP315:CloudComputingforE-CommerceFebruary20251IntroductionAcommontaskwhenbackendprogrammingisdatacleaning,whichistheprocessoftakinganinitialdatasetthatmaycontainerroneousorinco
清晰架构之typescript实践：构建可扩展服务的利器吕曦耘George
清晰架构之typescript实践：构建可扩展服务的利器react-with-clean-architectureCleanarchitecturebasedreactprojectsamplecode.项目地址:https://gitcode.com/gh_mirrors/re/react-with-clean-architecture在软件开发的浩瀚宇宙中，找到一个既能维持代码的清晰度又能确保
一步到位！7大模型部署框架深度测评：从理论到DeepSeek R1:7B落地实战人肉推土机人工智能 python
本文在掘金同步发布：文章地址更多优质文章，请关注本人掘金账号：人肉推土机的掘金账号随着大语言模型（LLM）的广泛应用，如何高效部署和推理模型成为开发者关注的核心问题。本文深入解析主流模型部署框架（Transformers、ModelScope、vLLM、LMDeploy、Ollama、SGLang、DeepSpeed），结合其技术原理、优缺点及适用场景，并提供DeepSeekR1:7B的详细部署实
MDC-Mapped Diagnostic Context（映射诊断上下文） NEUMaple 微服务 spring boot java MDC
MDC，全称为MappedDiagnosticContext（映射诊断上下文），是SLF4J（SimpleLoggingFacadeforJava）提供的一种机制，用于在多线程应用中存储和管理与特定线程相关的上下文信息。这种机制特别适用于需要跨多个方法调用或服务边界传递诊断信息的场景，例如跟踪分布式系统中的请求流。MDC的主要用途日志关联：在分布式系统或多线程应用中，MDC可以用来携带一些上下文信
DeepSeek-R1核心技术深度解密：动态专家网络与多维注意力融合的智能架构实现全解析 Coderabo DeepSeek R1模型企业级应用架构 DeepSeek-R1
DeepSeek-R1智能架构核心技术揭秘：从动态路由到分布式训练的完整实现指南一、DeepSeek-R1架构设计原理1.1动态专家混合系统DeepSeek-R1采用改进型MoE（MixtureofExperts）架构，核心公式表达为：y=∑i=1nG(x
计算机视觉技术探索：美颜SDK如何利用深度学习优化美颜、滤镜功能？美狐美颜sdk 美颜SDK 美颜API 直播美颜SDK 计算机视觉深度学习直播美颜SDK 美颜sdk 第三方美颜sdk 美颜api
时下，计算机视觉+深度学习正在重塑美颜技术，通过智能人脸检测、AI滤镜、深度美肤、实时优化等方式，让美颜效果更加自然、精准、个性化。那么，美颜SDK如何结合深度学习来优化美颜和滤镜功能？本文将深入解析AI在美颜技术中的应用，并探讨其未来发展趋势。一、深度学习如何赋能美颜SDK？1.AI人脸检测与关键点识别：精准捕捉五官在美颜过程中，首先需要精准检测人脸位置和五官特征点，确保美颜效果不会失真。深度学
RPA（Robotic Process Automation）技术介绍及其应用乐Code Other rpa
一、RPA技术概述RPA，即机器人流程自动化，是一种利用软件机器人（或称为“机器人工作者”）来模拟和自动执行人类在计算机上执行的各种重复性、规则性业务流程的技术。RPA技术旨在通过自动化这些业务流程，提高工作效率、减少人为错误，并让员工能够专注于更高价值的工作。二、RPA技术的核心特点无侵入性：RPA软件能够在现有的IT架构上运行，无需对现有系统进行大幅修改或替换。易于实现和扩展：相对于传统的IT
浅谈RPA 烽火联营人工智能
RPA(RoboticProcessAutomation)机器人自动化近期已在各行业受到广泛关注，在金融、消费品、物流、制造等行业有了大量的成功应用案例。RPA主要通过计算机自动处理一系列重复性任务，可以帮助企业创造显著的增长和效率率提升。I.RPA发展现状A.RPA定义RPA是一种支持软件解决方案，它使用机器人技术自动完成人类日常的重复性任务，从而提高企业工作效率和减少员工的劳动强度，同时还可以
Web端驱动的综合打印方案与场景 #六脉神剑 Web打印 myBuilder 产品运营
随着Web技术的快速发展，基于Web端的打印方案逐渐成为主流，它能够满足多样化的打印需求，并提供更便捷、高效的打印体验。以下是一些常见的Web端驱动综合打印方案与应用场景：一、方案概述浏览器直接打印原理:利用浏览器自带的打印功能，调用操作系统打印接口，直接打印网页内容。优点:简单易用，无需额外开发。缺点:打印样式控制有限，兼容性差，无法满足复杂打印需求。适用场景:打印简单的网页内容，例如文章、表格
B端安全网关的简单实现 #六脉神剑 java java 网络安全 spring boot
安全网关中的DMZ内网穿透是一种结合网络安全隔离与穿透技术的解决方案，主要用于实现外部网络对内网资源的安全访问。其核心逻辑如下：一、DMZ区的安全隔离作用网络分区机制‌：DMZ（非军事区）是安全网关设置的中间隔离区域，用于部署对外提供服务的设备（如Web服务器、邮件服务器），与内网核心数据区域物理隔离‌。访问控制‌：外网用户仅能访问DMZ区资源，无法直接触及内网敏感数据，即使DMZ区设备被攻破，内
SOFAStack-00-sofa 技术栈概览老马啸西风 sofa 架构监控阿里云系统架构
SOFAStack前言大家好，我是老马。sofastack其实出来很久了，第一次应该是在2022年左右开始关注，但是一直没有深入研究。最近想学习一下SOFA对于生态的设计和思考。核心项目⚙️SOFABootGitHub:sofastack/sofa-boot|★3.8k功能：企业级SpringBoot增强框架，支持模块化开发、类隔离、日志隔离，提供健康检查、异步初始化等特性。SOFARPCGitH
Java：Apache HttpClient中HttpRoute用法的介绍 netyeaxi Java java apache 开发语言
当使用ApacheHttpClient组件时，经常会用到它的连接池组件。典型的代码如下：PoolingHttpClientConnectionManagerconnectionManager=newPoolingHttpClientConnectionManager();connectionManager.setMaxTotal(httpConfig.getMaxPoolTotal());conn
**探索微博世界的新视角：twiyou——您的推特好友监测神器** 许煦津
探索微博世界的新视角：twiyou——您的推特好友监测神器twiyouTwitterfriendmonitoringtool项目地址:https://gitcode.com/gh_mirrors/tw/twiyou项目介绍在这个信息爆炸的时代，推特（Twitter）作为全球最具影响力的社交媒体之一，汇聚了无数声音与故事。twiyou，一款专为推特设计的友好监视工具，犹如你的个人情报员，帮助你轻松掌
【从漏洞到防护：浅谈Docker不容忽视的安全问题】 OpsEye docker 网络安全安全运维
从漏洞到防护：浅谈Docker不容忽视的安全问题文章目录前言一、Docker存在的漏洞二、场景案例三、安全基线标准总结前言在网络时代，几乎所有编写的软件和应用都存在潜在的漏洞，想要完全没有漏洞的应用是几乎不可能实现的，当然Docker也不例外。Docker容器技术在提供高效、可移植的软件部署环境的同时，也带来了一些安全挑战。针对Docker自身的漏洞，黑客的攻击手段层出不穷，给企业带来了多方面的挑
挑战20天学完JavaSE第四天——方法的定义、调用和方法重载呆呆why care 挑战20天学完javaSE java 笔记改行学it 程序人生
Java方法是语句的集合，它们在一起执行一个功能。方法是解决一类问题的步骤的有序组合。方法包含于类或对象中。方法在程序中被创建，在其他地方被引用。设计方法的原则:方法的本意是功能块，就是实现某个功能的语句块的集合。我们设计方法的时候，最好保持方法的原子性，就是一个方法只完成1个功能，这样利于我们后期的扩展。方法的命名规则：首字母小写驼峰命名方法的定义Java的方法类似于其它语言的函数，是一段用来完
java struts jxl 导入导出Excel（无模板） weixin_30437847 java 数据库 javascript ViewUI
jar包：importjavax.servlet.http.HttpServletResponse;importjava.io.OutputStream;importjava.io.File;importjxl.DateCell;importjxl.Sheet;importjxl.Workbook;importjxl.format.Alignment;importjxl.format.Border
Java 并发包之线程池和原子计数 lijingyao8206 Java计数 ThreadPool 并发包 java线程池
对于大数据量关联的业务处理逻辑，比较直接的想法就是用JDK提供的并发包去解决多线程情况下的业务数据处理。线程池可以提供很好的管理线程的方式，并且可以提高线程利用率，并发包中的原子计数在多线程的情况下可以让我们避免去写一些同步代码。这里就先把jdk并发包中的线程池处理器ThreadPoolExecutor 以原子计数类AomicInteger 和倒数计时锁C
java编程思想抽象类和接口百合不是茶 java 抽象类接口
接口c++对接口和内部类只有简介的支持,但在java中有队这些类的直接支持 1 ,抽象类 : 如果一个类包含一个或多个抽象方法,该类必须限定为抽象类(否者编译器报错) 抽象方法 : 在方法中仅有声明而没有方法体 package com.wj.Interface;
[房地产与大数据]房地产数据挖掘系统 comsci 数据挖掘
随着一个关键核心技术的突破,我们已经是独立自主的开发某些先进模块,但是要完全实现,还需要一定的时间... 所以,除了代码工作以外,我们还需要关心一下非技术领域的事件..比如说房地产 &nb
数组队列总结沐刃青蛟数组队列
数组队列是一种大小可以改变，类型没有定死的类似数组的工具。不过与数组相比，它更具有灵活性。因为它不但不用担心越界问题，而且因为泛型（类似c++中模板的东西）的存在而支持各种类型。以下是数组队列的功能实现代码： import List.Student; public class
Oracle存储过程无法编译的解决方法 IT独行者 oracle 存储过程　
今天同事修改Oracle存储过程又导致2个过程无法被编译，流程规范上的东西，Dave 这里不多说，看看怎么解决问题。 1. 查看无效对象 XEZF@xezf(qs-xezf-db1)> select object_name,object_type,status from all_objects where status='IN
重装系统之后oracle恢复文强chu oracle
前几天正在使用电脑，没有暂停oracle的各种服务。突然win8.1系统奔溃，无法修复，开机时系统提示正在搜集错误信息，然后再开机，再提示的无限循环中。无耐我拿出系统u盘准备重装系统，没想到竟然无法从u盘引导成功。晚上到外面早了一家修电脑店，让人家给装了个系统，并且那哥们在我没反应过来的时候，直接把我的c盘给格式化了并且清理了注册表，再装系统。然后的结果就是我的oracl
python学习二（一些基础语法）小桔子 pthon 基础语法
紧接着把！昨天没看继续看django 官方教程，学了下python的基本语法与c类语言还是有些小差别： 1.ptyhon的源文件以UTF-8编码格式 2. / 除结果浮点型 // 除结果整形 % 除取余数 * 乘 ** 乘方 eg 5**2 结果是5的2次方25 _&
svn 常用命令 aichenglong SVN 版本回退
1 svn回退版本 1)在window中选择log,根据想要回退的内容,选择revert this version或revert chanages from this version 两者的区别: revert this version:表示回退到当前版本(该版本后的版本全部作废) revert chanages from this versio
某小公司面试归来 alafqq 面试
先填单子，还要写笔试题，我以时间为急，拒绝了它。。时间宝贵。老拿这些对付毕业生的东东来吓唬我。。面试官很刁难，问了几个问题，记录下； 1，包的范围。。。public,private,protect. --悲剧了 2，hashcode方法和equals方法的区别。谁覆盖谁.结果，他说我说反了。 3，最恶心的一道题，抽象类继承抽象类吗？（察，一般它都是被继承的啊） 4，stru
动态数组的存储速度比较集合框架百合不是茶集合框架
集合框架：自定义数据结构(增删改查等) package 数组; /** * 创建动态数组 * @author 百合 * */ public class ArrayDemo{ //定义一个数组来存放数据 String[] src = new String[0]; /** * 增加元素加入容器 * @param s要加入容器
用JS实现一个JS对象，对象里有两个属性一个方法 bijian1013 js对象
<html> <head> </head> <body> 用js代码实现一个js对象，对象里有两个属性，一个方法 </body> <script> var obj={a:'1234567',b:'bbbbbbbbbb',c:function(x){
探索JUnit4扩展：使用Rule bijian1013 java 单元测试 JUnit Rule
在上一篇文章中，讨论了使用Runner扩展JUnit4的方式，即直接修改Test Runner的实现(BlockJUnit4ClassRunner)。但这种方法显然不便于灵活地添加或删除扩展功能。下面将使用JUnit4.7才开始引入的扩展方式——Rule来实现相同的扩展功能。 1. Rule &n
[Gson一]非泛型POJO对象的反序列化 bit1129 POJO
当要将JSON数据串反序列化自身为非泛型的POJO时，使用Gson.fromJson(String, Class)方法。自身为非泛型的POJO的包括两种： 1. POJO对象不包含任何泛型的字段 2. POJO对象包含泛型字段，例如泛型集合或者泛型类 Data类 a.不是泛型类， b.Data中的集合List和Map都是泛型的 c.Data中不包含其它的POJO
【Kakfa五】Kafka Producer和Consumer基本使用 bit1129 kafka
0.Kafka服务器的配置一个Broker，一个Topic Topic中只有一个Partition（） 1. Producer： package kafka.examples.producers; import kafka.producer.KeyedMessage; import kafka.javaapi.producer.Producer; impor
lsyncd实时同步搭建指南——取代rsync+inotify ronin47
1. 几大实时同步工具比较 1.1 inotify + rsync 最近一直在寻求生产服务服务器上的同步替代方案，原先使用的是 inotify + rsync，但随着文件数量的增大到100W+，目录下的文件列表就达20M，在网络状况不佳或者限速的情况下，变更的文件可能10来个才几M，却因此要发送的文件列表就达20M，严重减低的带宽的使用效率以及同步效率；更为要紧的是，加入inotify
java-9. 判断整数序列是不是二元查找树的后序遍历结果 bylijinnan java
public class IsBinTreePostTraverse{ static boolean isBSTPostOrder(int[] a){ if(a==null){ return false; } /*1.只有一个结点时，肯定是查找树 *2.只有两个结点时，肯定是查找树。例如{5,6}对应的BST是 6 {6,5}对应的BST是
MySQL的sum函数返回的类型 bylijinnan java spring sql mysql jdbc
今天项目切换数据库时，出错访问数据库的代码大概是这样： String sql = "select sum(number) as sumNumberOfOneDay from tableName"; List<Map> rows = getJdbcTemplate().queryForList(sql); for (Map row : rows
java设计模式之单例模式 chicony java设计模式
在阎宏博士的《JAVA与模式》一书中开头是这样描述单例模式的：　　作为对象的创建模式，单例模式确保某一个类只有一个实例，而且自行实例化并向整个系统提供这个实例。这个类称为单例类。单例模式的结构　　单例模式的特点：单例类只能有一个实例。单例类必须自己创建自己的唯一实例。单例类必须给所有其他对象提供这一实例。　　饿汉式单例类 publ
javascript取当月最后一天 ctrain JavaScript
 <script language=javascript> var current = new Date(); var year = current.getYear(); var month = current.getMonth(); showMonthLastDay(year, mont
linux tune2fs命令详解 daizj linux tune2fs 查看系统文件块信息
一.简介： tune2fs是调整和查看ext2/ext3文件系统的文件系统参数，Windows下面如果出现意外断电死机情况，下次开机一般都会出现系统自检。Linux系统下面也有文件系统自检，而且是可以通过tune2fs命令，自行定义自检周期及方式。二.用法： Usage: tune2fs [-c max_mounts_count] [-e errors_behavior] [-g grou
做有中国特色的程序员 dcj3sjt126com 程序员
从出版业说起网络作品排到靠前的，都不会太难看，一般人不爱看某部作品也是因为不喜欢这个类型，而此人也不会全不喜欢这些网络作品。究其原因，是因为网络作品都是让人先白看的，看的好了才出了头。而纸质作品就不一定了，排行榜靠前的，有好作品，也有垃圾。许多大牛都是写了博客，后来出了书。这些书也都不次，可能有人让为不好，是因为技术书不像小说，小说在读故事，技术书是在学知识或温习知识，有
Android：TextView属性大全 dcj3sjt126com textview
android:autoLink 设置是否当文本为URL链接/email/电话号码/map时，文本显示为可点击的链接。可选值(none/web/email/phone/map/all) android:autoText 如果设置，将自动执行输入值的拼写纠正。此处无效果，在显示输入法并输
tomcat虚拟目录安装及其配置 eksliang tomcat配置说明 tomca部署web应用 tomcat虚拟目录安装
转载请出自出处：http://eksliang.iteye.com/blog/2097184 1.-------------------------------------------tomcat 目录结构 config：存放tomcat的配置文件 temp ：存放tomcat跑起来后存放临时文件用的 work ：当第一次访问应用中的jsp
浅谈：APP有哪些常被黑客利用的安全漏洞 gg163 APP
首先，说到APP的安全漏洞，身为程序猿的大家应该不陌生；如果抛开安卓自身开源的问题的话，其主要产生的原因就是开发过程中疏忽或者代码不严谨引起的。但这些责任也不能怪在程序猿头上，有时会因为BOSS时间催得紧等很多可观原因。由国内移动应用安全检测团队爱内测（ineice.com）的CTO给我们浅谈关于Android 系统的开源设计以及生态环境。 1. 应用反编译漏洞：APK 包非常容易被反编译成可读
C#根据网址生成静态页面 hvt Web .net C#asp.net hovertree
HoverTree开源项目中HoverTreeWeb.HVTPanel的Index.aspx文件是后台管理的首页。包含生成留言板首页，以及显示用户名，退出等功能。根据网址生成页面的方法： bool CreateHtmlFile(string url, string path) { //http://keleyi.com/a/bjae/3d10wfax.htm stri
SVG 教程（一）天梯梦 svg
SVG 简介 SVG 是使用 XML 来描述二维图形和绘图程序的语言。学习之前应具备的基础知识：继续学习之前，你应该对以下内容有基本的了解： HTML XML 基础如果希望首先学习这些内容，请在本站的首页选择相应的教程。什么是SVG？ SVG 指可伸缩矢量图形 (Scalable Vector Graphics) SVG 用来定义用于网络的基于矢量
一个简单的java栈 luyulong java 数据结构栈
public class MyStack { private long[] arr; private int top; public MyStack() { arr = new long[10]; top = -1; } public MyStack(int maxsize) { arr = new long[maxsize]; top
基础数据结构和算法八：Binary search sunwinner Algorithm Binary search
Binary search needs an ordered array so that it can use array indexing to dramatically reduce the number of compares required for each search, using the classic and venerable binary search algori
12个C语言面试题，涉及指针、进程、运算、结构体、函数、内存，看看你能做出几个！刘星宇 c 面试
12个C语言面试题，涉及指针、进程、运算、结构体、函数、内存，看看你能做出几个！ 1.gets()函数问：请找出下面代码里的问题： #include<stdio.h> int main(void) { char buff[10]; memset(buff,0,sizeof(buff));
ITeye 7月技术图书有奖试读获奖名单公布 ITeye管理员活动 ITeye 试读
ITeye携手人民邮电出版社图灵教育共同举办的7月技术图书有奖试读活动已圆满结束，非常感谢广大用户对本次活动的关注与参与。 7月试读活动回顾： http://webmaster.iteye.com/blog/2092746 本次技术图书试读活动的优秀奖获奖名单及相应作品如下（优秀文章有很多，但名额有限，没获奖并不代表不优秀）：《Java性能优化权威指南》

【AlgorithmStar机器学习】AS机器学习库特征工程使用说明文档

Algorithm Star介绍

概述

AS库的一般处理流程

数据采集与清洗

向量生成与特征提取选择

机器学习

后续处理

Algorithm Star使用

数据类型-操作数

浮点类型操作数

整数类型操作数

复数

特征提取

字典特征提取

词频特征提取

特征选择

基于冗余排名比例去除

基于相关系数去除

机器学习

聚合计算

分类计算

差异计算

路径计算

数据预处理(标准化/归一化)

概率计算

决策计算

模型预测

模型预测

Algorithm Star开源协议

你可能感兴趣的:(文档资料,技术推荐,技术分享,java,scala,人工智能,git)