cullen2012

c# simd 指令_.NET / C＃中的SIMD概述

c# simd 指令

Here’s a quick look at algorithm vectorization capabilities in .NET Framework and .NET Core. This article is for those who know nothing about these techniques. I will also show that .NET doesn’t actually lag behind "real compiled" languages for native development.

快速浏览一下.NET Framework和.NET Core中的算法矢量化功能。本文适用于对这些技术一无所知的人。我还将说明，.NET实际上在本机开发方面并不落后于“真正的编译”语言。

I’m just starting to learn vectorization techniques. So, I will appreciate if community members find clear errors or suggest improvements to the described algorithms.

我才刚刚开始学习矢量化技术。因此，如果社区成员发现明显的错误或对所描述的算法提出改进建议，我将不胜感激。

一些历史 (Some history)

SIMD appeared in .NET Framework 4.6 in 2015. That's when Matrix3x2, Matrix4x4, Plane, Quaternion, Vector2, Vector3 and Vector4 types were added. They allowed vectorized computations. Next was the Vector type that gave more opportunities to vectorize algorithms. However, many programmers were still dissatisfied as these types restricted coders’ idea streams and didn’t let use the full capacity of SIMD instructions in modern processors. Now in .NET Core 3.0 Preview, we have System.Runtime.Intrinsics namespace that gives much freedom in the choice of instructions. To get the most in speed you need to use RyuJit and resort either to x64 assembly or switch off Prefer 32-bit and choose AnyCPU assembly. I ran all the benchmarks on Intel Core i7-6700 3.40 GHz (Skylake) CPU computer.

SIMD出现在2015年的.NET Framework 4.6中。那时，添加了Matrix3x2，Matrix4x4，Plane，Quaternion，Vector2，Vector3和Vector4类型。他们允许向量化计算。接下来是Vector 类型，它提供了更多的机会来对算法进行矢量化。但是，许多程序员仍然不满意，因为这些类型限制了编码人员的想法流，并且不允许在现代处理器中使用SIMD指令的全部功能。现在，在.NET Core 3.0 Preview中，我们有了System.Runtime.Intrinsics命名空间，该命名空间为指令选择提供了很大的自由度。要获得最快的速度，您需要使用RyuJit并采用x64汇编或关闭Prefer 32-bit并选择AnyCPU汇编。我在Intel Core i7-6700 3.40 GHz(Skylake)CPU计算机上运行了所有基准测试。

汇总数组元素 (Summing array elements)

I decided to start with a classic task which usually comes first if there’s vectorization involved. It deals with finding the sum of array elements. Let’s write four implementations of this task to sum the elements of Array.

我决定从经典任务开始，如果涉及矢量化，通常会首先执行。它涉及查找数组元素的总和。让我们编写此任务的四个实现以汇总Array的元素。

The most obvious implementation:

最明显的实现：

public int Naive() {
    int result = 0;
    foreach (int i in Array) {
        result += i;
    }
    return result;
}

LINQ-based implementation:

基于LINQ的实现：

public long LINQ() => Array.Aggregate(0, (current, i) => current + i);

The implementation based on vectors from System.Numerics:

基于System.Numerics中的向量的实现：

public int Vectors() {
    int vectorSize = Vector.Count;
    var accVector = Vector.Zero;
    int i;
    var array = Array;
    for (i = 0; i <= array.Length - vectorSize; i += vectorSize) {
        var v = new Vector(array, i);
        accVector = Vector.Add(accVector, v);
    }
    int result = Vector.Dot(accVector, Vector.One);
    for (; i < array.Length; i++) {
        result += array[i];
    }
    return result;
}

The implementation based on code from System.Runtime.Intrinsics namespace:

基于System.Runtime.Intrinsics命名空间中的代码的实现：

public unsafe int Intrinsics() {
    int vectorSize = 256 / 8 / 4;
    var accVector = Vector256.Zero;
    int i;
    var array = Array;
    fixed (int* ptr = array) {
        for (i = 0; i <= array.Length - vectorSize; i += vectorSize) {
            var v = Avx2.LoadVector256(ptr + i);
            accVector = Avx2.Add(accVector, v);
        }
    }
    int result = 0;
    var temp = stackalloc int[vectorSize];
    Avx2.Store(temp, accVector);
    for (int j = 0; j < vectorSize; j++) {
        result += temp[j];
    }   
    for (; i < array.Length; i++) {
        result += array[i];
    }
    return result;
}

I benchmarked those 4 methods on my computer and got the following results:

我在计算机上对这4种方法进行了基准测试，并得到以下结果：

Method	ItemsCount	Mean	Error	StdDev	Ratio
Naive	10	3.531 ns	0.0336 ns	0.0314 ns	1.00
LINQ	10	76.925 ns	0.4166 ns	0.3897 ns	21.79
Vectors	10	2.750 ns	0.0210 ns	0.0196 ns	0.78
Intrinsics	10	6.513 ns	0.0623 ns	0.0582 ns	1.84

Naive	100	47.982 ns	0.3975 ns	0.3524 ns	1.00
LINQ	100	590.414 ns	3.8808 ns	3.4402 ns	12.31
Vectors	100	10.122 ns	0.0747 ns	0.0699 ns	0.21
Intrinsics	100	14.277 ns	0.0566 ns	0.0529 ns	0.30

Naive	1000	569.910 ns	2.8297 ns	2.6469 ns	1.00
LINQ	1000	5,658.570 ns	31.7465 ns	29.6957 ns	9.93
Vectors	1000	79.598 ns	0.3498 ns	0.3272 ns	0.14
Intrinsics	1000	66.970 ns	0.3937 ns	0.3682 ns	0.12

Naive	10000	5,637.571 ns	37.5050 ns	29.2814 ns	1.00
LINQ	10000	56,498.987 ns	294.8776 ns	275.8287 ns	10.02
Vectors	10000	772.900 ns	2.6802 ns	2.5070 ns	0.14
Intrinsics	10000	579.152 ns	2.8371 ns	2.6538 ns	0.10

Naive	100000	56,352.865 ns	230.7916 ns	215.8826 ns	1.00
LINQ	100000	562,610.571 ns	3,775.7631 ns	3,152.9332 ns	9.98
Vectors	100000	8,389.647 ns	165.9590 ns	227.1666 ns	0.15
Intrinsics	100000	7,261.334 ns	89.6468 ns	69.9903 ns	0.13

方法	ItemsCount	意思	错误	标准差	比
幼稚	10	3.531 ns	0.0336纳秒	0.0314纳秒	1.00
LINQ	10	76.925 ns	0.4166纳秒	0.3897 ns	21.79
向量	10	2.750纳秒	0.0210纳秒	0.0196 ns	0.78
本征	10	6.513 ns	0.0623 ns	0.0582 ns	1.84

幼稚	100	47.982 ns	0.3975纳秒	0.3524 ns	1.00
LINQ	100	590.414 ns	3.8808 ns	3.4402纳秒	12.31
向量	100	10.122 ns	0.0747 ns	0.0699 ns	0.21
本征	100	14.277 ns	0.0566 ns	0.0529 ns	0.30

幼稚	1000	569.910 ns	2.8297 ns	2.6469 ns	1.00
LINQ	1000	5,658.570 ns	31.7465 ns	29.6957 ns	9.93
向量	1000	79.598 ns	0.3498 ns	0.3272纳秒	0.14
本征	1000	66.970 ns	0.3937纳秒	0.3682纳秒	0.12

幼稚	10000	5,637.571 ns	37.5050 ns	29.2814 ns	1.00
LINQ	10000	56,498.987 ns	294.8776 ns	275.8287 ns	10.02
向量	10000	772.900 ns	2.6802纳秒	2.5070 ns	0.14
本征	10000	579.152 ns	2.8371 ns	2.6538 ns	0.10

幼稚	100000	56,352.865 ns	230.7916 ns	215.8826 ns	1.00
LINQ	100000	562,610.571 ns	3,775.7631 ns	3,152.9332 ns	9.98
向量	100000	8,389.647 ns	165.9590 ns	227.1666 ns	0.15
本征	100000	7,261.334 ns	89.6468 ns	69.9903 ns	0.13

It’s clear that solutions with Vectors and Intrinsics are much faster than the obvious and LINQ-based solutions. Now we need to figure out what goes on in these two methods.

显然，使用向量和内在函数的解决方案比明显的基于LINQ的解决方案要快得多。现在，我们需要弄清楚这两种方法的情况。

Let’s consider Vectors method more closely:

让我们更仔细地考虑Vectors方法：

向量 (Vectors)

public int Vectors() {
    int vectorSize = Vector.Count;
    var accVector = Vector.Zero;
    int i;
    var array = Array;
    for (i = 0; i <= array.Length - vectorSize; i += vectorSize) {
        var v = new Vector(array, i);
        accVector = Vector.Add(accVector, v);
    }
    int result = Vector.Dot(accVector, Vector.One);
    for (; i < array.Length; i++) {
        result += array[i];
    }
    return result;
}

int vectorSize = Vector.Count; — the amount of 4-byte numbers we can place in a vector. If hardware acceleration is used, this value shows how many 4-byte numbers we can put in one SIMD register. In fact, it shows how many elements of this type can be handled concurrently;
int vectorSize = Vector .Count; —我们可以在向量中放置的4字节数字的数量。如果使用硬件加速，则该值显示我们可以在一个SIMD寄存器中放入多少个4字节数字。实际上，它显示了可以同时处理此类型的元素的数量。
accVector is a vector that accumulates the result of the function;
accVector是累积函数结果的向量；
var v = new Vector(array, i); — the data from array is loaded into a new v vector, starting from i index. The vectorSize of data will be loaded exactly;
var v = new Vector (array，i); —从i索引开始，将数组中的数据加载到新的v向量中。数据的vectorSize将被完全加载；
accVector = Vector.Add(accVector, v); — two vectors are summed.
accVector = Vector.Add(accVector，v); —两个向量相加。

For example, there are 8 numbers in Array: {0, 1, 2, 3, 4, 5, 6, 7} and vectorSize == 4.
例如，数组中有8个数字：{0、1、2、3、4、5、6、7}和vectorSize == 4。

Then during the first cycle iteration accVector = {0, 0, 0, 0}, v = {0, 1, 2, 3} and after addition accVector will hold: {0, 0, 0, 0} + {0, 1, 2, 3} = {0, 1, 2, 3}.
然后在第一个循环迭代期间，accVector = {0，0，0，0}，v = {0，1，2，3}，加法后，accVector将保持：{0，0，0，0} + {0，1 ，2，3} = {0，1，2，3}。

During the second iteration v = {4, 5, 6, 7} and after addition accVector = {0, 1, 2, 3} + {4, 5, 6, 7} = {4, 6, 8, 10}.
在第二次迭代中，v = {4，5，6，7}，加法后accVector = {0，1，2，3} + {4，5，6，7} = {4，6，8，10}。
Now we just need to get the sum of all vector elements. To do this we can use scalar multiplication by a vector filled with ones: int result = Vector.Dot(accVector, Vector.One);
现在我们只需要获取所有向量元素的总和即可。为此，我们可以使用标量乘以一个填充有一个的向量：int result = Vector.Dot(accVector，Vector .One);

Then we get: {4, 6, 8, 10} * {1, 1, 1, 1} = 4 * 1 + 6 * 1 + 8 * 1 + 10 * 1 = 28.
然后得到：{4，6，8，10} * {1，1，1，1} = 4 * 1 + 6 * 1 + 8 * 1 + 10 * 1 = 28。
If necessary, those numbers that don’t fit the last vector will be summed at the end.
如有必要，那些不适合最后一个向量的数字将在末尾求和。

Let's look into Intrinsics code:

让我们看一下内部代码：

本征 (Intrinsics)

public unsafe int Intrinsics() {
    int vectorSize = 256 / 8 / 4;
    var accVector = Vector256.Zero;
    int i;
    var array = Array;
    fixed (int* ptr = array) {
        for (i = 0; i <= array.Length - vectorSize; i += vectorSize) {
            var v = Avx2.LoadVector256(ptr + i);
            accVector = Avx2.Add(accVector, v);
        }
    }
    int result = 0;
    var temp = stackalloc int[vectorSize];
    Avx2.Store(temp, accVector);
    for (int j = 0; j < vectorSize; j++) {
        result += temp[j];
    }   
    for (; i < array.Length; i++) {
        result += array[i];
    }
    return result;
}

We can see that it’s like Vectors with one exception:

我们可以看到它类似于Vectors，但有一个例外：

vectorSize is specified by a constant. This is because this method explicitly uses Avx2 instructions that operate with 256-bit registers. A real application should include a check of whether a current processor supports Avx2. If not, another code should be called. It looks like this:
vectorSize由常量指定。这是因为此方法明确使用了与256位寄存器一起操作的Avx2指令。实际的应用程序应包括检查当前处理器是否支持Avx2。如果不是，则应调用另一个代码。看起来像这样：
```
if (Avx2.IsSupported) {
DoThingsForAvx2();
}
else if (Avx.IsSupported) {
DoThingsForAvx();
}
...
else if (Sse2.IsSupported) {
DoThingsForSse2();
}
...
```
var accVector = Vector256.Zero; accVector is declared as 256-bit vector filled with zeros.
var accVector = Vector256 .Zero; accVector被声明为填充零的256位向量。
fixed (int* ptr = Array) — the pointer to the array is placed in ptr.
固定的(int * ptr = Array)—数组的指针放在ptr中。
Next are the same operations as in Vectors: loading data into a vector and addition of two vectors.
接下来是与向量中相同的操作：将数据加载到向量中并添加两个向量。
The summing of vector elements uses the following method:
向量元素的求和使用以下方法：
- create an array on stack: var temp = stackalloc int[vectorSize];
  在堆栈上创建一个数组：var temp = stackalloc int [vectorSize];
- load a vector into this array: Avx2.Store(temp, accVector);
  将向量加载到此数组中：Avx2.Store(temp，accVector);
- sum array elements during the cycle.
  在循环中对数组元素求和。
Next, the elements which don’t fit the last vector are summed up.
接下来，汇总不适合最后一个向量的元素。

比较两个数组 (Comparing two arrays)

We need to compare two arrays of bytes. It is exactly this task that made me study SIMD in .NET. Again, let’s write several methods for benchmarking and compare two arrays: ArrayA and ArrayB.

我们需要比较两个字节数组。正是这一任务使我在.NET中学习SIMD。再次，让我们编写几种基准测试方法并比较两个数组：ArrayA和ArrayB。

The most obvious solution:

最明显的解决方案：

public bool Naive() {
    for (int i = 0; i < ArrayA.Length; i++) {
        if (ArrayA[i] != ArrayB[i]) return false;
    }
    return true;
}

LINQ-based solution:

基于LINQ的解决方案：

public bool LINQ() => ArrayA.SequenceEqual(ArrayB);

The solution based on MemCmp function:

基于MemCmp功能的解决方案：

[DllImport("msvcrt.dll", CallingConvention = CallingConvention.Cdecl)]
static extern int memcmp(byte[] b1, byte[] b2, long count);

public bool MemCmp() => memcmp(ArrayA, ArrayB, ArrayA.Length) == 0;

The solution based on vectors from System.Numerics:

基于System.Numerics中的向量的解决方案：

public bool Vectors() {
    int vectorSize = Vector.Count;
    int i = 0;
    for (; i <= ArrayA.Length - vectorSize; i += vectorSize) {
        var va = new Vector(ArrayA, i);
        var vb = new Vector(ArrayB, i);
        if (!Vector.EqualsAll(va, vb)) {
            return false;
        }
    }
    for (; i < ArrayA.Length; i++) {
        if (ArrayA[i] != ArrayB[i])
            return false;
    }
    return true;
}

Intrinsics-based solution:

基于内在的解决方案：

public unsafe bool Intrinsics() {
    int vectorSize = 256 / 8;
    int i = 0;
    const int equalsMask = unchecked((int) (0b1111_1111_1111_1111_1111_1111_1111_1111));
    fixed (byte* ptrA = ArrayA)
    fixed (byte* ptrB = ArrayB) {
        for (; i <= ArrayA.Length - vectorSize; i += vectorSize) {
            var va = Avx2.LoadVector256(ptrA + i);
            var vb = Avx2.LoadVector256(ptrB + i);
            var areEqual = Avx2.CompareEqual(va, vb);
            if (Avx2.MoveMask(areEqual) != equalsMask) {
                return false;
            }
        }
        for (; i < ArrayA.Length; i++) {
            if (ArrayA[i] != ArrayB[i])
                return false;
        }
        return true;
    }
}

The results of running the benchmark on my computer:

在计算机上运行基准测试的结果：

Method	ItemsCount	Mean	Error	StdDev	Ratio
Naive	10000	7,033.8 ns	50.636 ns	47.365 ns	1.00
LINQ	10000	64,841.4 ns	289.157 ns	270.478 ns	9.22
Vectors	10000	504.0 ns	2.406 ns	2.251 ns	0.07
MemCmp	10000	368.1 ns	2.637 ns	2.466 ns	0.05
Intrinsics	10000	283.6 ns	1.135 ns	1.061 ns	0.04

Naive	100000	85,214.4 ns	903.868 ns	845.478 ns	1.00
LINQ	100000	702,279.4 ns	2,846.609 ns	2,662.720 ns	8.24
Vectors	100000	5,179.2 ns	45.337 ns	42.409 ns	0.06
MemCmp	100000	4,510.5 ns	24.292 ns	22.723 ns	0.05
Intrinsics	100000	2,957.0 ns	11.452 ns	10.712 ns	0.03

Naive	1000000	844,006.1 ns	3,552.478 ns	3,322.990 ns	1.00
LINQ	1000000	6,483,079.3 ns	42,641.040 ns	39,886.455 ns	7.68
Vectors	1000000	54,180.1 ns	357.258 ns	334.180 ns	0.06
MemCmp	1000000	49,480.1 ns	515.675 ns	457.133 ns	0.06
Intrinsics	1000000	36,633.9 ns	680.525 ns	636.564 ns	0.04

方法	ItemsCount	意思	错误	标准差	比
幼稚	10000	7,033.8 ns	50.636 ns	47.365 ns	1.00
LINQ	10000	64,841.4 ns	289.157 ns	270.478 ns	9.22
向量	10000	504.0 ns	2.406纳秒	2.251纳秒	0.07
记忆卡	10000	368.1 ns	2.637 ns	2.466纳秒	0.05
本征	10000	283.6 ns	1.135纳秒	1.061纳秒	0.04

幼稚	100000	85,214.4 ns	903.868 ns	845.478 ns	1.00
LINQ	100000	702,279.4 ns	2,846.609 ns	2,662.720 ns	8.24
向量	100000	5,179.2 ns	45.337 ns	42.409 ns	0.06
记忆卡	100000	4,510.5 ns	24.292 ns	22.723 ns	0.05
本征	100000	2,957.0 ns	11.452 ns	10.712 ns	0.03

幼稚	1000000	844,006.1 ns	3,552.478 ns	3,322.990 ns	1.00
LINQ	1000000	6,483,079.3 ns	42,641.040 ns	39,886.455 ns	7.68
向量	1000000	54,180.1 ns	357.258 ns	334.180 ns	0.06
记忆卡	1000000	49,480.1 ns	515.675 ns	457.133 ns	0.06
本征	1000000	36,633.9 ns	680.525 ns	636.564 ns	0.04

I guess the code of these methods is clear, except for two lines in Intrinsics:

我猜这些方法的代码很清楚，除了Intrinsics中的两行：

var areEqual = Avx2.CompareEqual(va, vb);
if (Avx2.MoveMask(areEqual) != equalsMask) {
    return false;
}

In the first line two vectors are compared for equality and the result is saved in areEqual vector in which all bits in the element at a particular position are set to 1 if the corresponding elements in va and vb are equal. So, it turns out that if byte vectors va and vb are equal, all the elements in areEquals should equal to 255 (11111111b). As Avx2.CompareEqual is a wrapper over _mm256_cmpeq_epi8, we can go to Intel website and see the pseudocode of this operation: MoveMask method makes a 32-bit number from a vector. The top bits of each 32 onebyte elements in a vector are the values of bits in the MoveMask result. The pseudocode is available here.

在第一行中，比较两个向量的相等性，并将结果保存在areEqual向量中，如果va和vb中的对应元素相等，则将特定位置元素中的所有位都设置为1。因此，事实证明，如果字节向量va和vb相等，则areEquals中的所有元素都应等于255(11111111b)。由于Avx2.CompareEqual是_mm256_cmpeq_epi8的包装器，因此我们可以访问Intel网站并查看此操作的伪代码：MoveMask方法从向量中得出32位数字。向量中每32个单字节元素的高位是MoveMask结果中的位的值。伪代码在此处可用。

Thus, if some bytes in va and vb don’t match, the corresponding bytes in areEqual will be 0. Therefore, the top bits of these bytes will be 0 too. This means the corresponding bits in Avx2.MoveMask response will also be 0 and areEqual will not equal to equalsMask.

因此，如果va和vb中的某些字节不匹配，则areEqual中的相应字节将为0。因此，这些字节的高位也将为0。这意味着Avx2.MoveMask响应中的相应位也将为0，而areEqual将不等于equalsMask。

Let’s look at one example assuming that vector length is 8 bytes (to write less):

让我们看一个示例，假设向量长度为8字节(以减少写入)：

Let va = {100, 10, 20, 30, 100, 40, 50, 100} and vb = {100, 20, 10, 30, 100, 40, 80, 90}.
设va = {100，10，20，30，100，40，50，100}，vb = {100，20，10，30，100，40，80，90}。
Then areEquals will be {255, 0, 0, 255, 255, 255, 0, 0}.
然后areEquals将为{255，0，0，255，255，255，0，0}。
The MoveMask method will return 10011100b that should be compared with 11111111b mask. As these masks are not equal, va and vb vectors are not equal too.
MoveMask方法将返回10011100b，应将其与11111111b掩码进行比较。由于这些掩码不相等，因此va和vb向量也不相等。

计算元素在集合中出现的次数。 (Counting the times an element occurs in a collection.)

Sometimes you need to count the occurrences of a particular element, e.g. integers, in a collection. We can speed up this algorithm too. For comparison let’s write several methods to search Item element in Array.

有时您需要计算集合中特定元素(例如整数)的出现次数。我们也可以加快该算法的速度。为了进行比较，让我们编写几种方法来搜索Array中的Item元素。

The most obvious one:

最明显的一个：

public int Naive() {
    int result = 0;
    foreach (int i in Array) {
        if (i == Item) {
            result++;
        }
    }
    return result;
}

Using LINQ:

使用LINQ：

public int LINQ() => Array.Count(i => i == Item);

Using vectors from System.Numerics.Vectors:

使用System.Numerics.Vectors中的向量：

public int Vectors() {
    var mask = new Vector(Item);
    int vectorSize = Vector.Count;
    var accResult = new Vector();
    int i;
    var array = Array;
    for (i = 0; i <= array.Length - vectorSize; i += vectorSize) {
        var v = new Vector(array, i);
        var areEqual = Vector.Equals(v, mask);
        accResult = Vector.Subtract(accResult, areEqual);
    }
    int result = 0;
    for (; i < array.Length; i++) {
        if (array[i] == Item) {
            result++;
        }
    }
    result += Vector.Dot(accResult, Vector.One);
    return result;
}

Using Intrinsics:

使用本征：

public unsafe int Intrinsics() {
    int vectorSize = 256 / 8 / 4;
    var temp = stackalloc int[vectorSize];
    for (int j = 0; j < vectorSize; j++) {
        temp[j] = Item;
    }
    var mask = Avx2.LoadVector256(temp);
    var accVector = Vector256.Zero;
    int i;
    var array = Array;
    fixed (int* ptr = array) {
        for (i = 0; i <= array.Length - vectorSize; i += vectorSize) {
            var v = Avx2.LoadVector256(ptr + i);
            var areEqual = Avx2.CompareEqual(v, mask);
            accVector = Avx2.Subtract(accVector, areEqual);
        }
    }
    int result = 0;
    Avx2.Store(temp, accVector);
    for(int j = 0; j < vectorSize; j++) {
        result += temp[j];
    }
    for(; i < array.Length; i++) {
        if (array[i] == Item) {
            result++;
        }
    }
    return result;
}

The results of running the benchmark on my computer:

在计算机上运行基准测试的结果：

Method	ItemsCount	Mean	Error	StdDev	Ratio
Naive	10	8.844 ns	0.0772 ns	0.0603 ns	1.00
LINQ	10	87.456 ns	0.9496 ns	0.8883 ns	9.89
Vectors	10	3.140 ns	0.0406 ns	0.0380 ns	0.36
Intrinsics	10	13.813 ns	0.0825 ns	0.0772 ns	1.56

Naive	100	107.310 ns	0.6975 ns	0.6183 ns	1.00
LINQ	100	626.285 ns	5.7677 ns	5.3951 ns	5.83
Vectors	100	11.844 ns	0.2113 ns	0.1873 ns	0.11
Intrinsics	100	19.616 ns	0.1018 ns	0.0903 ns	0.18

Naive	1000	1,032.466 ns	6.3799 ns	5.6556 ns	1.00
LINQ	1000	6,266.605 ns	42.6585 ns	39.9028 ns	6.07
Vectors	1000	83.417 ns	0.5393 ns	0.4780 ns	0.08
Intrinsics	1000	88.358 ns	0.4921 ns	0.4603 ns	0.09

Naive	10000	9,942.503 ns	47.9732 ns	40.0598 ns	1.00
LINQ	10000	62,305.598 ns	643.8775 ns	502.6972 ns	6.27
Vectors	10000	914.967 ns	7.2959 ns	6.8246 ns	0.09
Intrinsics	10000	931.698 ns	6.3444 ns	5.9346 ns	0.09

Naive	100000	94,834.804 ns	793.8585 ns	703.7349 ns	1.00
LINQ	100000	626,620.968 ns	4,696.9221 ns	4,393.5038 ns	6.61
Vectors	100000	9,000.827 ns	179.5351 ns	192.1005 ns	0.09
Intrinsics	100000	8,690.771 ns	101.7078 ns	95.1376 ns	0.09

Naive	1000000	959,302.249 ns	4,268.2488 ns	3,783.6914 ns	1.00
LINQ	1000000	6,218,681.888 ns	31,321.9277 ns	29,298.5506 ns	6.48
Vectors	1000000	99,778.488 ns	1,975.6001 ns	4,252.6877 ns	0.10
Intrinsics	1000000	96,449.350 ns	1,171.8067 ns	978.5116 ns	0.10

方法	ItemsCount	意思	错误	标准差	比
幼稚	10	8.844纳秒	0.0772 ns	0.0603 ns	1.00
LINQ	10	87.456 ns	0.9496纳秒	0.8883纳秒	9.89
向量	10	3.140纳秒	0.0406纳秒	0.0380纳秒	0.36
本征	10	13.813 ns	0.0825 ns	0.0772 ns	1.56

幼稚	100	107.310 ns	0.6975纳秒	0.6183 ns	1.00
LINQ	100	626.285 ns	5.7677 ns	5.3951 ns	5.83
向量	100	11.844 ns	0.2113 ns	0.1873 ns	0.11
本征	100	19.616纳秒	0.1018纳秒	0.0903 ns	0.18

幼稚	1000	1,032.466 ns	6.3799 ns	5.6556 ns	1.00
LINQ	1000	6,266.605 ns	42.6585 ns	39.9028 ns	6.07
向量	1000	83.417 ns	0.5393纳秒	0.4780纳秒	0.08
本征	1000	88.358 ns	0.4921纳秒	0.4603纳秒	0.09

幼稚	10000	9,942.503 ns	47.9732 ns	40.0598 ns	1.00
LINQ	10000	62,305.598 ns	643.8775 ns	502.6972 ns	6.27
向量	10000	914.967 ns	7.2959 ns	6.8246 ns	0.09
本征	10000	931.698 ns	6.3444 ns	5.9346 ns	0.09

幼稚	100000	94,834.804 ns	793.8585 ns	703.7349 ns	1.00
LINQ	100000	626,620.968 ns	4,696.9221 ns	4,393.5038 ns	6.61
向量	100000	9,000.827 ns	179.5351 ns	192.1005 ns	0.09
本征	100000	8,690.771 ns	101.7078 ns	95.1376 ns	0.09

幼稚	1000000	959,302.249 ns	4,268.2488 ns	3,783.6914 ns	1.00
LINQ	1000000	6,218,681.888 ns	31,321.9277 ns	29,298.5506 ns	6.48
向量	1000000	99,778.488 ns	1,975.6001 ns	4,252.6877 ns	0.10
本征	1000000	96,449.350 ns	1,171.8067 ns	978.5116 ns	0.10

Vectors and Intrinsics methods completely coincide in logic but differ in the implementation of particular operations. The idea is the following:

向量和本征方法在逻辑上完全重合，但在特定操作的实现上有所不同。这个想法如下：

create mask vector in which a required number is stored in each element;
创建掩码向量，其中每个元素中都存储有所需的编号；
load the part of an array in v vector and compare this part with a mask. As a result, all bits will set in equal elements of areEqual. As areEqual is an array of integers, then if we set all the bits of one element, we will get -1 in this element ((int)(1111_1111_1111_1111_1111_1111_1111_1111b) == -1);
在v向量中加载数组的一部分，并将其与掩码进行比较。结果，所有位将设置在areEqual的相等元素中。因为areEqual是一个整数数组，所以如果我们设置一个元素的所有位，我们将在该元素中获得-1((int)(1111_1111_1111_1111_1111_1111_1111_1111b)== -1);
subtract areEqual vector from accVector. Then, accVector will hold the count of how many times the item element occurred in all v vectors for each position (minus by minus is a plus).
从accVector中减去areEqual向量。然后，accVector将保存每个位置的所有v向量中item元素出现的次数(减乘以加号)。

The whole code from the article is on GitHub.

本文的全部代码在GitHub上。

结论 (Conclusion)

I described only a small part of .NET capabilities for computation vectorization. To see the full updated list of all intrinsics available in .NET Core under x86, turn to the source code. It’s convenient that the summary of each intrinsic in C# files contains its name in C world. This helps either to understand the purpose of this intrinsic and the transfer of existing C++/C algorithms to .NET. System.Numerics.Vector documentation is available on msdn.

我仅描述了用于计算矢量化的.NET功能的一小部分。若要查看x86下.NET Core中可用的所有内部函数的完整更新列表，请转到源代码。 C＃文件中每个内在函数的摘要都包含其在C world中的名称，这很方便。这有助于了解此内在功能的目的，以及帮助将现有C ++ / C算法转移到.NET。 msdn上提供了System.Numerics.Vector文档。

I think .NET has a great advantage over C++. As JIT compilation already occurs on a client machine, a compiler can optimize code for a particular client processor, giving maximum performance. At the same time, a programmer can stay within one language and the same technologies to write fast code.

我认为.NET比C ++具有很大的优势。由于JIT编译已在客户端计算机上进行，因此编译器可以为特定客户端处理器优化代码，从而提供最佳性能。同时，程序员可以使用一种语言和相同的技术来编写快速代码。

翻译自: https://habr.com/en/post/467689/

c# simd 指令

你可能感兴趣的:(算法,python,java,数据结构,编程语言)

系统学习Python——并发模型和异步编程：进程、线程和GIL
分类目录：《系统学习Python》总目录在文章《并发模型和异步编程：基础知识》我们简单介绍了Python中的进程、线程和协程。本文就着重介绍Python中的进程、线程和GIL的关系。Python解释器的每个实例都是一个进程。使用multiprocessing或concurrent.futures库可以启动额外的Python进程。Python的subprocess库用于启动运行外部程序（不管使用何种
C++11堆操作深度解析：std::is_heap与std::is_heap_until原理解析与实践
文章目录堆结构基础与函数接口堆的核心性质函数签名与核心接口std::is_heapstd::is_heap_until实现原理深度剖析std::is_heap的验证逻辑std::is_heap_until的定位策略算法优化细节代码实践与案例分析基础用法演示自定义比较器实现最小堆检查边缘情况处理性能分析与实际应用时间复杂度对比典型应用场景与手动实现的对比注意事项与最佳实践迭代器要求比较器设计C++标
Flask框架入门：快速搭建轻量级Python网页应用「已注销」 python-AI python基础网站网络 python flask 后端
转载：Flask框架入门：快速搭建轻量级Python网页应用1.Flask基础Flask是一个使用Python编写的轻量级Web应用框架。它的设计目标是让Web开发变得快速简单，同时保持应用的灵活性。Flask依赖于两个外部库：Werkzeug和Jinja2，Werkzeug作为WSGI工具包处理Web服务的底层细节，Jinja2作为模板引擎渲染模板。安装Flask非常简单，可以使用pip安装命令
JSON 与 AJAX Auscy json ajax 前端
一、JSON（JavaScriptObjectNotation）1.数据类型与语法细节支持的数据类型：基本类型：字符串（需用双引号）、数字、布尔值（true/false）、null。复杂类型：数组（[]）、对象（{}）。严格语法规范：键名必须用双引号包裹（如"name":"张三"）。数组元素用逗号分隔，最后一个元素后不能有多余逗号。数字不能以0开头（如012会被解析为12），不支持八进制/十六进制
Python Flask 框架入门：快速搭建 Web 应用的秘诀 Python编程之道 Python人工智能与大数据 Python编程之道 python flask 前端 ai
PythonFlask框架入门：快速搭建Web应用的秘诀关键词Flask、微框架、路由系统、Jinja2模板、请求处理、WSGI、Web开发摘要想快速用Python搭建一个灵活的Web应用？Flask作为“微框架”代表，凭借轻量、可扩展的特性，成为初学者和小型项目的首选。本文将从Flask的核心概念出发，结合生活化比喻、代码示例和实战案例，带你一步步掌握：如何用Flask搭建第一个Web应用？路由
JavaScript 树形菜单总结 Auscy microsoft
树形菜单是前端开发中常见的交互组件，用于展示具有层级关系的数据（如文件目录、分类列表、组织架构等）。以下从核心概念、实现方式、常见功能及优化方向等方面进行总结。一、核心概念层级结构：数据以父子嵌套形式存在，如{id:1,children:[{id:2}]}。节点：树形结构的基本单元，包含自身信息及子节点（若有）。展开/折叠：子节点的显示与隐藏切换，是树形菜单的核心交互。递归渲染：因数据层级不固定，
冒泡、选择、插入排序：三大基础排序算法深度解析（C语言实现） xienda 算法排序算法数据结构
在算法学习道路上，排序算法是每位程序员必须掌握的基石。本文将深入解析冒泡排序、选择排序和插入排序这三种基础排序算法，通过C语言代码实现和对比分析，帮助读者彻底理解它们的差异与应用场景。算法原理与代码实现1.冒泡排序（BubbleSort）工作原理：通过重复比较相邻元素，将较大元素逐步"冒泡"到数组末尾。voidbubbleSort(intarr[],intn){ for(inti=0;iarr[
Leetcode 148. 排序链表
文章目录前引题目代码（首刷看题解）代码（8.9二刷部分看解析）代码（9.15三刷部分看解析）前引综合性比较强的一道题，要求时间复杂度必须O(logn)才能通过，最适合链表的排序算法就是归并。这里采用自顶向下的方法步骤：找到链表中点（双指针）对两个子链表排序(递归，直到只有一个结点，记得将子链表最后指向nullptr）归并（引入dummy结点）题目Leetcode148.排序链表代码（首刷看题解）c
python_虚拟环境阿_焦 python
第一、配置虚拟环境：virtualenv（1）pipvirtualenv>安装虚拟环境包（2）pipinstallvirtualenvwrapper-win>安装虚拟环境依赖包（3）c盘创建虚拟目录>C:\virtualenv>配置环境变量【了解一下】：（1）如何使用virtualenv创建虚拟环境a、cd到C:\virtualenv目录下：b、mkvirtualenvname>创建虚拟环境nam
全面触摸屏输入法设计与实现长野君
本文还有配套的精品资源，点击获取简介：触摸屏输入法是针对触摸设备优化的文字输入方案，包括虚拟键盘、手写、语音识别和手势等多种输入方式。本方案通过提供主程序文件、用户手册、界面截图、示例图、说明文本和音效文件，旨在为用户提供一个完整的、多样的文字输入体验。开发者通过持续优化算法和用户界面，使用户在无物理键盘环境下也能高效准确地进行文字输入。1.触摸屏输入法概述简介在现代信息技术飞速发展的今天，触摸屏
精通Canvas：15款时钟特效代码实现指南烟幕缭绕
本文还有配套的精品资源，点击获取简介：HTML5的Canvas是一个用于绘制矢量图形的API，通过JavaScript实现动态效果。本项目集合了15种不同的时钟特效代码，帮助开发者通过学习绘制圆形、线条、时间更新、旋转、颜色样式设置及动画效果等概念，深化对Canvas的理解和应用。项目中的CSS文件负责时钟的样式设定，而JS文件则包含实现各种特效的逻辑，通过不同的函数或类处理时间更新和动画绘制，提
深入剖析OpenJDK 18 GA源码：Java平台最新发展想法臃肿
本文还有配套的精品资源，点击获取简介：OpenJDK18GA作为Java开发的关键里程碑，提供了诸多新特性和改进。本文章深入探讨了OpenJDK18GA源码，揭示其内部机制，帮助开发者更好地理解和利用这个版本。文章还涵盖了PatternMatching、SealedClasses、Records、JEP395、JEP406和JEP407等特性，以及HotSpot虚拟机、编译器、垃圾收集器、内存模型
FPGA小白到项目实战：Verilog+Vivado全流程通关指南（附光学类岗位技能映射）阿牛的药铺算法移植部署 fpga开发 verilog
FPGA小白到项目实战：Verilog+Vivado全流程通关指南（附光学类岗位技能映射）引言：为什么这个FPGA入门路线能帮你快速上岗？本文设计了一条**"Verilog语法→工具链操作→光学项目实战→岗位技能对标"的阶梯式学习路径。不同于泛泛而谈的FPGA教程，我们聚焦光学类产品开发**核心能力（时序接口设计、图像处理算法移植、高速接口应用），通过3个递进式项目（从LED闪烁到图像边缘检测），
PyTorch & TensorFlow速成复习：从基础语法到模型部署实战（附FPGA移植衔接）阿牛的药铺算法移植部署 pytorch tensorflow fpga开发
PyTorch&TensorFlow速成复习：从基础语法到模型部署实战（附FPGA移植衔接）引言：为什么算法移植工程师必须掌握框架基础？针对光学类产品算法FPGA移植岗位需求（如可见光/红外图像处理），深度学习框架是算法落地的"桥梁"——既要用PyTorch/TensorFlow验证算法可行性，又要将训练好的模型（如CNN、目标检测）转换为FPGA可部署的格式（ONNX、TFLite）。本文采用"
Python爱心光波
系列文章序号直达链接Tkinter1Python李峋同款可写字版跳动的爱心2Python跳动的双爱心3Python蓝色跳动的爱心4Python动漫烟花5Python粒子烟花Turtle1Python满屏飘字2Python蓝色流星雨3Python金色流星雨4Python漂浮爱心5Python爱心光波①6Python爱心光波②7Python满天繁星8Python五彩气球9Python白色飘雪10Pyt
Python流星雨 Want595 python 开发语言
文章目录系列文章写在前面技术需求完整代码代码分析1.模块导入2.画布设置3.画笔设置4.颜色列表5.流星类(Star)6.流星对象创建7.主循环8.流星运动逻辑9.视觉效果10.总结写在后面系列文章序号直达链接表白系列1Python制作一个无法拒绝的表白界面2Python满屏飘字表白代码3Python无限弹窗满屏表白代码4Python李峋同款可写字版跳动的爱心5Python流星雨代码6Python
Java大厂面试实录：谢飞机的电商场景技术问答（Spring Cloud、MyBatis、Redis、Kafka、AI等）
Java大厂面试实录：谢飞机的电商场景技术问答（SpringCloud、MyBatis、Redis、Kafka、AI等）本文模拟知名互联网大厂Java后端岗位面试流程，以电商业务为主线，由严肃面试官与“水货”程序员谢飞机展开有趣的对话，涵盖SpringCloud、MyBatis、Redis、Kafka、SpringSecurity、AI等热门技术栈，并附详细解析，助力求职者备战大厂面试。故事设定谢
【超硬核】JVM源码解读：Java方法main在虚拟机上解释执行 HeapDump性能社区 java 开发语言后端 jvm
本文由HeapDump性能社区首席讲师鸠摩（马智）授权整理发布第1篇-关于Java虚拟机HotSpot，开篇说的简单点开讲Java运行时，这一篇讲一些简单的内容。我们写的主类中的main()方法是如何被Java虚拟机调用到的？在Java类中的一些方法会被由C/C++编写的HotSpot虚拟机的C/C++函数调用，不过由于Java方法与C/C++函数的调用约定不同，所以并不能直接调用，需要JavaC
算法学习笔记：17.蒙特卡洛算法 ——从原理到实战，涵盖 LeetCode 与考研 408 例题
在计算机科学和数学领域，蒙特卡洛算法（MonteCarloAlgorithm）以其独特的随机抽样思想，成为解决复杂问题的有力工具。从圆周率的计算到金融风险评估，从物理模拟到人工智能，蒙特卡洛算法都发挥着不可替代的作用。本文将深入剖析蒙特卡洛算法的思想、解题思路，结合实际应用场景与Java代码实现，并融入考研408的相关考点，穿插图片辅助理解，帮助你全面掌握这一重要算法。蒙特卡洛算法的基本概念蒙特卡
Python之七彩花朵代码实现 PlutoZuo Python python 开发语言
Python之七彩花朵代码实现文章目录Python之七彩花朵代码实现下面是一个简单的使用Python的七彩花朵。这个示例只是一个简单的版本，没有很多高级功能，但它可以作为一个起点，你可以在此基础上添加更多功能。importturtleastuimportrandomasraimportmathtu.setup(1.0,1.0)t=tu.Pen()t.ht()colors=['red','skybl
算法学习笔记：15.二分查找 ——从原理到实战，涵盖 LeetCode 与考研 408 例题呆呆企鹅仔算法学习算法学习笔记考研二分查找
在计算机科学的查找算法中，二分查找以其高效性占据着重要地位。它利用数据的有序性，通过不断缩小查找范围，将原本需要线性时间的查找过程优化为对数时间，成为处理大规模有序数据查找问题的首选算法。二分查找的基本概念二分查找（BinarySearch），又称折半查找，是一种在有序数据集合中查找特定元素的高效算法。其核心原理是：通过不断将查找范围减半，快速定位目标元素。与线性查找逐个遍历元素不同，二分查找依赖
Python 脚本最佳实践2025版
前文可以直接把这篇文章喂给AI,可以放到AI角色设定里,也可以直接作为提示词.这样,你只管提需求,写脚本就让AI来.概述追求简洁和清晰：脚本应简单明了。使用函数(functions)、常量(constants)和适当的导入(import)实践来有逻辑地组织你的Python脚本。使用枚举(enumerations)和数据类(dataclasses)等数据结构高效管理脚本状态。通过命令行参数增强交互性
（Python基础篇）了解和使用分支结构 EternityArt 基础篇 python
目录一、引言二、Python分支结构的类型与语法（一）if语句（单分支）（二）if-else语句（双分支）（三）if-elif-else语句（多分支）三、分支结构的应用场景（一）提示用户输入用户名，然后再提示输入密码，如果用户名是“admin”并且密码是“88888”则提示正确，否则，如果用户名不是admin还提示用户用户名不存在,（二）提示用户输入用户名，然后再提示输入密码，如果用户名是“adm
（Python基础篇）循环结构 EternityArt 基础篇 python
一、什么是Python循环结构？循环结构是编程中重复执行代码块的机制。在Python中，循环允许你：1.迭代处理数据：遍历列表、字典、文件内容等。2.自动化重复任务：如批量处理数据、生成序列等。3.控制执行流程：根据条件决定是否继续或终止循环。二、为什么需要循环结构？假设你需要打印1到100的所有偶数：没有循环：需手动编写100行print()语句。print(0)print(2)print(4)
（Python基础篇）字典的操作 EternityArt 基础篇 python 开发语言
一、引言在Python编程中，字典（Dictionary）是一种极具灵活性的数据结构，它通过“键-值对”（key-valuepair）的形式存储数据，如同现实生活中的字典——通过“词语（键）”快速查找“释义（值）”。相较于列表和元组的有序索引访问，字典的优势在于基于键的快速查找，这使得它在处理需要频繁通过唯一标识获取数据的场景中极为高效。掌握字典的操作，能让我们更高效地组织和管理复杂数据，是Pyt
LeetCode算法题：电话号码的字母组合吱屋猪_ 算法 leetcode java
题目描述：给定一个仅包含数字2-9的字符串，返回所有它能表示的字母组合。答案可以按任意顺序返回。给出数字到字母的映射如下（与电话按键相同）。注意1不对应任何字母。2->"abc"3->"def"4->"ghi"5->"jkl"6->"mno"7->"pqrs"8->"tuv"9->"wxyz"例如，给定digits="23"，返回["ad","ae","af","bd","be","bf","cd
Python七彩花朵 Want595 python 开发语言
系列文章序号直达链接Tkinter1Python李峋同款可写字版跳动的爱心2Python跳动的双爱心3Python蓝色跳动的爱心4Python动漫烟花5Python粒子烟花Turtle1Python满屏飘字2Python蓝色流星雨3Python金色流星雨4Python漂浮爱心5Python爱心光波①6Python爱心光波②7Python满天繁星8Python五彩气球9Python白色飘雪10Pyt
Java大厂面试故事：谢飞机的互联网音视频场景技术面试全纪录（Spring Boot、MyBatis、Kafka、Redis、AI等）来旺 Java场景面试宝典 Java Spring Boot MyBatis Kafka Redis 微服务 AI
Java大厂面试故事：谢飞机的互联网音视频场景技术面试全纪录（SpringBoot、MyBatis、Kafka、Redis、AI等）互联网大厂技术面试不仅考察技术深度，更注重业务场景与系统设计能力。本篇以严肃面试官与“水货”程序员谢飞机的对话，带你体验音视频业务场景下的Java面试全过程，涵盖主流技术栈，并附详细答案解析，助你面试无忧。故事场景设定谢飞机是一名有趣但技术基础略显薄弱的程序员，这次应
【前端】jQuery数组合并去重方法总结
在jQuery中合并多个数组并去重，推荐使用原生JavaScript的Set对象（高效简单）或$.unique()（仅适用于DOM元素，不适用于普通数组）。以下是完整解决方案：方法1：使用ES6Set（推荐）//定义多个数组constarr1=[1,2,3];constarr2=[2,3,4];constarr3=[3,4,5];//合并数组并用Set去重constmergedArray=[...
霍夫变换（Hough Transform）算法原来详解和纯C++代码实现以及OpenCV中的使用示例点云SLAM 算法图形图像处理算法 opencv 图像处理与计算机视觉算法直线提取检测目标检测霍夫变换算法
霍夫变换（HoughTransform）是一种经典的图像处理与计算机视觉算法，广泛用于检测图像中的几何形状，例如直线、圆、椭圆等。其核心思想是将图像空间中的“点”映射到参数空间中的“曲线”，从而将形状检测问题转化为参数空间中的峰值检测问题。一、霍夫变换基本思想输入：边缘图像（如经过Canny边缘检测）输出：一组满足几何模型的形状（如直线、圆）关键思想：图像空间中的一个点→参数空间中的一个曲线参数空
web报表工具FineReport常见的数据集报错错误代码和解释老A不折腾 web报表 finereport 代码可视化工具
在使用finereport制作报表，若预览发生错误，很多朋友便手忙脚乱不知所措了，其实没什么，只要看懂报错代码和含义，可以很快的排除错误，这里我就分享一下finereport的数据集报错错误代码和解释，如果有说的不准确的地方，也请各位小伙伴纠正一下。 NS-war-remote=错误代码\:1117 压缩部署不支持远程设计 NS_LayerReport_MultiDs=错误代码
Java的WeakReference与WeakHashMap bylijinnan java 弱引用
首先看看 WeakReference wiki 上 Weak reference 的一个例子： public class ReferenceTest { public static void main(String[] args) throws InterruptedException { WeakReference r = new Wea
Linux——（hostname）主机名与ip的映射 eksliang linux hostname
一、什么是主机名无论在局域网还是INTERNET上，每台主机都有一个IP地址，是为了区分此台主机和彼台主机，也就是说IP地址就是主机的门牌号。但IP地址不方便记忆，所以又有了域名。域名只是在公网（INtERNET)中存在，每个域名都对应一个IP地址，但一个IP地址可有对应多个域名。域名类型 linuxsir.org 这样的；主机名是用于什么的呢？答：在一个局域网中，每台机器都有一个主
oracle 常用技巧 18289753290
oracle常用技巧 ①复制表结构和数据 create table temp_clientloginUser as select distinct userid from tbusrtloginlog ②仅复制数据如果表结构一样 insert into mytable select * &nb
使用c3p0数据库连接池时出现com.mchange.v2.resourcepool.TimeoutException 酷的飞上天空 exception
有一个线上环境使用的是c3p0数据库，为外部提供接口服务。最近访问压力增大后台tomcat的日志里面频繁出现 com.mchange.v2.resourcepool.TimeoutException: A client timed out while waiting to acquire a resource from com.mchange.v2.resourcepool.BasicResou
IT系统分析师如何学习大数据蓝儿唯美大数据
我是一名从事大数据项目的IT系统分析师。在深入这个项目前需要了解些什么呢？学习大数据的最佳方法就是先从了解信息系统是如何工作着手，尤其是数据库和基础设施。同样在开始前还需要了解大数据工具，如Cloudera、Hadoop、Spark、Hive、Pig、Flume、Sqoop与Mesos。系统分析师需要明白如何组织、管理和保护数据。在市面上有几十款数据管理产品可以用于管理数据。你的大数据数据库可能
spring学习——简介 a-john spring
Spring是一个开源框架，是为了解决企业应用开发的复杂性而创建的。Spring使用基本的JavaBean来完成以前只能由EJB完成的事情。然而Spring的用途不仅限于服务器端的开发，从简单性，可测试性和松耦合的角度而言，任何Java应用都可以从Spring中受益。其主要特征是依赖注入、AOP、持久化、事务、SpringMVC以及Acegi Security 为了降低Java开发的复杂性，
自定义颜色的xml文件 aijuans xml
<?xml version="1.0" encoding="utf-8"?> <resources> <color name="white">#FFFFFF</color> <color name="black">#000000</color> &
运营到底是做什么的？ aoyouzi 运营到底是做什么的？
文章来源：夏叔叔（微信号：woshixiashushu），欢迎大家关注！很久没有动笔写点东西，近些日子，由于爱狗团产品上线，不断面试，经常会被问道一个问题。问：爱狗团的运营主要做什么？答：带着用户一起嗨。为什么是带着用户玩起来呢？究竟什么是运营？运营到底是做什么的？那么，我们先来回答一个更简单的问题——互联网公司对运营考核什么？以爱狗团为例，绝大部分的移动互联网公司，对运营部门的考核分为三块——用
js面向对象类和对象百合不是茶 js 面向对象函数创建类和对象
接触js已经有几个月了,但是对js的面向对象的一些概念根本就是模糊的,js是一种面向对象的语言但又不像java一样有class,js不是严格的面向对象语言 ,js在java web开发的地位和java不相上下 ,其中web的数据的反馈现在主流的使用json,json的语法和js的类和属性的创建相似下面介绍一些js的类和对象的创建的技术一:类和对
web.xml之资源管理对象配置 resource-env-ref bijian1013 java web.xml servlet
resource-env-ref元素来指定对管理对象的servlet引用的声明，该对象与servlet环境中的资源相关联 <resource-env-ref> <resource-env-ref-name>资源名</resource-env-ref-name> <resource-env-ref-type>查找资源时返回的资源类
Create a composite component with a custom namespace sunjing
https://weblogs.java.net/blog/mriem/archive/2013/11/22/jsf-tip-45-create-composite-component-custom-namespace When you developed a composite component the namespace you would be seeing would
【MongoDB学习笔记十二】Mongo副本集服务器角色之Arbiter bit1129 mongodb
一、复本集为什么要加入Arbiter这个角色回答这个问题，要从复本集的存活条件和Aribter服务器的特性两方面来说。什么是Artiber？ An arbiter does not have a copy of data set and cannot become a primary. Replica sets may have arbiters to add a
Javascript开发笔记白糖_ JavaScript
获取iframe内的元素通常我们使用window.frames["frameId"].document.getElementById("divId").innerHTML这样的形式来获取iframe内的元素，这种写法在IE、safari、chrome下都是通过的，唯独在fireforx下不通过。其实jquery的contents方法提供了对if
Web浏览器Chrome打开一段时间后，运行alert无效 bozch Web chorme alert 无效
今天在开发的时候，突然间发现alert在chrome浏览器就没法弹出了，很是怪异。试了试其他浏览器，发现都是没有问题的。开始想以为是chorme浏览器有啥机制导致的，就开始尝试各种代码让alert出来。尝试结果是仍然没有显示出来。这样开发的结果，如果客户在使用的时候没有提示，那会带来致命的体验。哎，没啥办法了就关闭浏览器重启。结果就好了，这也太怪异了。难道是cho
编程之美-高效地安排会议图着色问题贪心算法 bylijinnan 编程之美
import java.util.ArrayList; import java.util.Collections; import java.util.List; import java.util.Random; public class GraphColoringProblem { /**编程之美高效地安排会议图着色问题贪心算法 * 假设要用很多个教室对一组
机器学习相关概念和开发工具 chenbowen00 算法 matlab 机器学习
基本概念：机器学习(Machine Learning, ML)是一门多领域交叉学科，涉及概率论、统计学、逼近论、凸分析、算法复杂度理论等多门学科。专门研究计算机怎样模拟或实现人类的学习行为，以获取新的知识或技能，重新组织已有的知识结构使之不断改善自身的性能。它是人工智能的核心，是使计算机具有智能的根本途径，其应用遍及人工智能的各个领域，它主要使用归纳、综合而不是演绎。开发工具 M
[宇宙经济学]关于在太空建立永久定居点的可能性 comsci 经济
大家都知道,地球上的房地产都比较昂贵,而且土地证经常会因为新的政府的意志而变幻文本格式........ 所以,在地球议会尚不具有在太空行使法律和权力的力量之前,我们外太阳系统的友好联盟可以考虑在地月系的某些引力平衡点上面,修建规模较大的定居点
oracle 11g database control 证书错误 daizj oracle 证书错误 oracle 11G 安装
oracle 11g database control 证书错误 win7 安装完oracle11后打开 Database control 后，会打开em管理页面，提示证书错误，点“继续浏览此网站”，还是会继续停留在证书错误页面解决办法：是 KB2661254 这个更新补丁引起的，它限制了 RSA 密钥位长度少于 1024 位的证书的使用。具体可以看微软官方公告：
Java I/O之用FilenameFilter实现根据文件扩展名删除文件游其是你 FilenameFilter
在Java中，你可以通过实现FilenameFilter类并重写accept(File dir, String name) 方法实现文件过滤功能。在这个例子中，我们向你展示在“c:\\folder”路径下列出所有“.txt”格式的文件并删除。 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
C语言数组的简单以及一维数组的简单排序算法示例，二维数组简单示例 dcj3sjt126com c array
# include <stdio.h> int main(void) { int a[5] = {1, 2, 3, 4, 5}; //a 是数组的名字 5是表示数组元素的个数，并且这五个元素分别用a[0], a[1]...a[4] int i; for (i=0; i<5; ++i) printf("%d\n",
PRIMARY, INDEX, UNIQUE 这3种是一类 PRIMARY 主键。就是唯一且不能为空。 INDEX 索引，普通的 UNIQUE 唯一索引 dcj3sjt126com primary
PRIMARY, INDEX, UNIQUE 这3种是一类PRIMARY 主键。就是唯一且不能为空。INDEX 索引，普通的UNIQUE 唯一索引。不允许有重复。FULLTEXT 是全文索引，用于在一篇文章中，检索文本信息的。举个例子来说，比如你在为某商场做一个会员卡的系统。这个系统有一个会员表有下列字段：会员编号 INT会员姓名
java集合辅助类 Collections、Arrays shuizhaosi888 Collections Arrays HashCode
Arrays、Collections 1 ）数组集合之间转换 public static <T> List<T> asList(T... a) { return new ArrayList<>(a); } a）Arrays.asL
Spring Security（10）——退出登录logout 234390216 logout Spring Security 退出登录 logout-url LogoutFilter
要实现退出登录的功能我们需要在http元素下定义logout元素，这样Spring Security将自动为我们添加用于处理退出登录的过滤器LogoutFilter到FilterChain。当我们指定了http元素的auto-config属性为true时logout定义是会自动配置的，此时我们默认退出登录的URL为“/j_spring_secu
透过源码学前端之 Backbone 三 Model 逐行分析JS源代码 backbone 源码分析 js学习
Backbone 分析第三部分 Model 概述： Model 提供了数据存储，将数据以JSON的形式保存在 Model的 attributes里，但重点功能在于其提供了一套功能强大，使用简单的存、取、删、改数据方法，并在不同的操作里加了相应的监听事件，如每次修改添加里都会触发 change，这在据模型变动来修改视图时很常用，并且与collection建立了关联。
SpringMVC源码总结（七）mvc:annotation-driven中的HttpMessageConverter 乒乓狂魔 springMVC
这一篇文章主要介绍下HttpMessageConverter整个注册过程包含自定义的HttpMessageConverter，然后对一些HttpMessageConverter进行具体介绍。 HttpMessageConverter接口介绍： public interface HttpMessageConverter<T> { /** * Indicate
分布式基础知识和算法理论 bluky999 算法 zookeeper 分布式一致性哈希 paxos
分布式基础知识和算法理论 BY [email protected] 本文永久链接：http://nodex.iteye.com/blog/2103218 在大数据的背景下，不管是做存储，做搜索，做数据分析，或者做产品或服务本身，面向互联网和移动互联网用户，已经不可避免地要面对分布式环境。笔者在此收录一些分布式相关的基础知识和算法理论介绍，在完善自我知识体系的同
Android Studio的.gitignore以及gitignore无效的解决 bell0901 android gitignore
　　github上.gitignore模板合集，里面有各种.gitignore ： https://github.com/github/gitignore 　　自己用的Android Studio下项目的.gitignore文件，对github上的android.gitignore添加了　　　　　　# OSX files　　　　　　//mac os下　　　　　　.DS_Store
成为高级程序员的10个步骤 tomcat_oracle 编程
What 软件工程师的职业生涯要历经以下几个阶段：初级、中级，最后才是高级。这篇文章主要是讲如何通过 10 个步骤助你成为一名高级软件工程师。 Why 得到更多的报酬！因为你的薪水会随着你水平的提高而增加提升你的职业生涯。成为了高级软件工程师之后，就可以朝着架构师、团队负责人、CTO 等职位前进历经更大的挑战。随着你的成长，各种影响力也会提高。
mongdb在linux下的安装 xtuhcy mongodb linux
一、查询linux版本号： lsb_release -a LSB Version: :base-4.0-amd64:base-4.0-noarch:core-4.0-amd64:core-4.0-noarch:graphics-4.0-amd64:graphics-4.0-noarch:printing-4.0-amd64:printing-4.0-noa