zsc09_leaf

Neon 指令集 ARMv7/v8 对比

原文：http://community.arm.com/groups/android-community/blog/2015/03/27/arm-neon-programming-quick-reference

ARM NEON programming quick reference

1 Introduction

This article aims to introduce ARM NEON technology. Hope that beginners can get started with NEON programming quickly after reading the article. The article will also inform users which documents can be consulted if more detailed information is needed.

2 NEON overview

This section describes the NEON technology and supplies some background knowledge.

2.1 What is NEON?

NEON technology is an advanced SIMD (Single Instruction, Multiple Data) architecture for the ARM Cortex-A series processors. It can accelerate multimedia and signal processing algorithms such as video encoder/decoder, 2D/3D graphics, gaming, audio and speech processing, image processing, telephony, and sound.

NEON instructions perform "Packed SIMD" processing:

Registers are considered as vectors of elements of the same data type
Data types can be: signed/unsigned 8-bit, 16-bit, 32-bit, 64-bit, single-precision floating-point on ARM 32-bit platform, both single-precision floating-point and double-precision floating-point on ARM 64-bit platform.
Instructions perform the same operation in all lanes

2.2 History of ARM Adv SIMD

ARMv6[i]

SIMD extension

ARMv7-A

NEON

ARMv8-A AArch64

NEON

Operates on 32-bit general purpose ARM registers
8-bit/16-bit integer
2x16-bit/4x8-bit operations per instruction

Separate register bank, 32x64-bit NEON registers
8/16/32/64-bit integer
Single precision floating point
Up to 16x8-bit operations per instruction

Separate register bank, 32x128-bit NEON registers
8/16/32/64-bit integer
Single precision floating point
double precision floating point, both of them are IEEE compliance
Up to 16x8-bit operations per instruction

[i] The ARM Architecture Version 6 (ARMv6) David Brash: page 13

2.3 Why use NEON

NEON provides:

Support for both integer and floating point operations ensures the adaptability of a broad range of applications, from codecs to High Performance Computing to 3D graphics.
Tight coupling to the ARM processor provides a single instruction stream and a unified view of memory, presenting a single development platform target with a simpler tool flow[ii]

3 ARMv7/v8 comparison

ARMv8-A is a fundamental change to the ARM architecture. It supports the 64-bit Execution state called “AArch64”, and a new 64-bit instruction set “A64”. To provide compatibility with the ARMv7-A (32-bit architecture) instruction set, a 32-bit variant of ARMv8-A “AArch32” is provided. Most of existing ARMv7-A code can be run in the AArch32 execution state of ARMv8-A.

This section compares the NEON-related features of both the ARMv7-A and ARMv8-A architectures. In addition, general purpose ARM registers and ARM instructions, which are used often for NEON programming, will also be mentioned. However, the focus is still on the NEON technology.

3.1 Register

ARMv7-A and AArch32 have the same general purpose ARM registers – 16 x 32-bit general purpose ARM registers (R0-R15).

ARMv7-A and AArch32 have 32 x 64-bit NEON registers (D0-D31). These registers can also be viewed as 16x128-bit registers (Q0-Q15). Each of the Q0-Q15 registers maps to a pair of D registers, as shown in the following figure.

注：V7a 有32个64位的D寄存器[D0-D31], 16个128位的Q寄存器 [Q0-Q15] ,一个Q对应2个D(2个D公用Q的高64位和低64位)。

AArch64 by comparison, has 31 x 64-bit general purpose ARM registers and 1 special register having different names, depending on the context in which it is used. These registers can be viewed as either 31 x 64-bit registers (X0-X30) or as 31 x 32-bit registers (W0-W30).

注：ARMv8 有31 个64位寄存器,1个不同名字的特殊寄存器,用途取决于上下文, 因此我们可以看成 31个64位的X寄存器或者31个32位的W寄存器(X寄存器的低32位)

AArch64 has 32 x 128-bit NEON registers (V0-V31). These registers can also be viewed as 32-bit Sn registers or 64-bit Dn registers.

注：ARMv8有32个128位的V寄存器，相似的，我们同样可以看成是32个32位的S寄存器或者32个64位的D寄存器。

3.2 Instruction set[iii]

The following figure illustrates the relationship between ARMv7-A, ARMv8-A AArch32 and ARMv8-A AArch64 instruction set.

The ARMv8-A AArch32 instruction set consists of A32 (ARM instruction set, a 32-bit fixed length instruction set) and T32 (Thumb instruction set, a 16-bit fixed length instruction set; Thumb2 instruction set, 16 or 32-bit length instruction set). It is a superset of the ARMv7-A instruction set, so that it retains the backwards compatibility necessary to run existing software. There are some additions to A32 and T32 to maintain alignment with the A64 instruction set, including NEON division, and the Cryptographic Extension instructions. NEON double precision floating point (IEEE compliance) is also supported.

3.3 NEON instruction format

This section describes the changes to the NEON instruction syntax.

3.3.1 ARMv7-A/AArch32 instruction syntax[iv]

All mnemonics for ARMv7-A/AAArch32 NEON instructions (as with VFP) begin with the letter “V”. Instructions are generally able to operate on different data types, with this being specified in the instruction encoding. The size is indicated with a suffix to the instruction. The number of elements is indicated by the specified register size and data type of operation. Instructions have the following general format:

V{}{}{}{.

}{}, src1, src2

Where:

- modifiers

Q: The instruction uses saturating arithmetic, so that the result is saturated within the range of the specified data type, such as VQABS, VQSHL etc.
H: The instruction will halve the result. It does this by shifting right by one place (effectively a divide by two with truncation), such as VHADD, VHSUB.
D: The instruction doubles the result, such as VQDMULL, VQDMLAL, VQDMLSL and VQ{R}DMULH
R: The instruction will perform rounding on the result, equivalent to adding 0.5 to the result before truncating, such as VRHADD, VRSHR.

- the operation (for example, ADD, SUB, MUL).

- Shape.

Neon data processing instructions are typically available in Normal, Long, Wide and Narrow variants.

Long (L): instructions operate on double-word vector operands and produce a quad-word vector result. The result elements are twice the width of the operands, and of the same type. Lengthening instructions are specified using an L appended to the instruction.

Wide (W): instructions operate on a double-word vector operand and a quad-word vector operand, producing a quad-word vector result. The result elements and the first operand are twice the width of the second operand elements. Widening instructions have a W appended to the instruction.

Narrow (N): instructions operate on quad-word vector operands, and produce a double-word vector result. The result elements are half the width of the operand elements. Narrowing instructions are specified using an N appended to the instruction.

- Condition, used with IT instruction

<.dt> - Data type, such as s8, u8, f32 etc.

- Destination

- Source operand 1

- Source operand 2

Note: {} represents and optional parameter.

For example:

VADD.I8 D0, D1, D2

VMULL.S16 Q2, D8, D9

For more information, please refer to the documents listed in the Appendix.

3.3.2 AArch64 NEON instruction syntax[v]

In the AArch64 execution state, the syntax of NEON instruction has changed. It can be described as follows:

{}{<suffix>} Vd., Vn., Vm.

Where:

- prefix, such as using S/U/F/P to represent signed/unsigned/float/bool data type.

– operation, such as ADD, AND etc.

<suffix> - suffix

P: “pairwise” operations, such as ADDP.
V: the new reduction (across-all-lanes) operations, such as FMAXV.
2：new widening/narrowing “second part” instructions, such as ADDHN2, SADDL2.

ADDHN2: add two 128-bit vectors and produce a 64-bit vector result which is stored as high 64-bit part of NEON register.

SADDL2: add two high 64-bit vectors of NEON register and produce a 128-bit vector result.

- data type, 8B/16B/4H/8H/2S/4S/2D. B represents byte (8-bit). H represents half-word (16-bit). S represents word (32-bit). D represents a double-word (64-bit).

For example:

UADDLP V0.8H, V0.16B

FADD V0.4S, V0.4S, V0.4S

For more information, please refer to the documents listed in the Appendix.

3.4 NEON instructions[vi]

The following table compares the ARMv7-A, AArch32 and AArch64 NEON instruction set.

“√” indicates that the AArch32 NEON instruction has the same format as ARMv7-A NEON instruction.

“Y” indicates that the AArch64 NEON instruction has the same functionality as ARMv7-A NEON instructions, but the format is different. Please check the ARMv8-A ISA document.

If you are familiar with the ARMv7-A NEON instructions, there is a simple way to map the NEON instructions of ARMv7-A and AArch64. It is to check the NEON intrinsics document, so that you can find the AArch64 NEON instruction according to the intrinsics instruction.

New or changed functionality is highlighted.

	ARMv7-A	AArch32	AArch64
logical and compare	VAND, VBIC, VEOR, VORN, and VORR (register)	√	Y
	VBIC and VORR (immediate)	√	Y
	VBIF, VBIT, and VBSL	√	Y
	VMOV, VMVN (register)	√	Y
	VACGE and VACGT	√	Y
	VCEQ, VCGE, VCGT, VCLE, and VCLT	√	Y
	VTST	√	Y
general data processing	VCVT (between fixed-point or integer, and floating-point)	√	Y
	VCVT (between half-precision and single-precision floating-point)	√	Y
	n/a	n/a	FCVTXN(double to single-precision)
	VDUP	√	Y
	VEXT	√	Y
	VMOV, VMVN (immediate)	√	Y
	VMOVL, V{Q}MOVN, VQMOVUN	√	Y
	VREV	√	Y
	VSWP	√	n/a
	VTBL, VTBX	√	Y
	VTRN	√	TRN1, TRN2
	VUZP, VZIP	√	UZP1,UZP2, ZIP, ZIP2
	n/a	n/a	INS
	n/a	VRINTA, VRINM, VRINTN, VRINTP, VRINTR, VRINTX, VRINTZ	FRINTA, FRINTI, FRINTM, FRINTN, FRINTP, FRINTX, FRINTZ
shift	VSHL, VQSHL, VQSHLU, and VSHLL (by immediate)	√	Y
	V{Q}{R}SHL (by signed variable)	√	Y
	V{R}SHR	√	Y
	V{R}SHRN	√	Y
	V{R}SRA	√	Y
	VQ{R}SHR{U}N	√	Y
	VSLI and VSRI	√	Y
general arithmetic	VABA{L} and VABD{L}	√	Y
	V{Q}ABS and V{Q}NEG	√	Y
	V{Q}ADD, VADDL, VADDW, V{Q}SUB, VSUBL, and VSUBW	√	Y
	n/a	n/a	SUQADD, USQADD
	V{R}ADDHN and V{R}SUBHN	√	Y
	V{R}HADD and VHSUB	√	Y
	VPADD{L}, VPADAL	√	Y
	VMAX, VMIN, VPMAX, and VPMIN	√	Y
	n/a	n/a	FMAXNMP, FMINNMP
	VCLS, VCLZ, and VCNT	√	Y
	VRECPE and VRSQRTE	√	Y
	VRECPS and VRSQRTS	√	Y
	n/a	n/a	FRECPX
			RBIT
			FSQRT
			ADDV
			SADDLV, UADDLV
			SMAXV,UMAXV,FMAXV
			FMAXNMV
			SMINV,UMINV,FMINV
			FMINNMV
multiply	VMUL{L}, VMLA{L}, and VMLS{L}	√	There isn’t float MLA/MLS
	VMUL{L}, VMLA{L}, and VMLS{L} (by scalar)	√	Y
	VFMA, VFMS	√	Y
	VQDMULL, VQDMLAL, and VQDMLSL (by vector or by scalar)	√	Y
	VQ{R}DMULH (by vector or by scalar)	√	Y
	n/a	n/a	FMULX
	n/a	n/a	FDIV
load and store	VLDn/VSTn(n=1, 2, 3, 4)	√	Y
load and store	VPUSH/VPOP	√	n/a
Crypto Extension	n/a	PMULL, PMULL2	PMULL, PMULL2
		AESD, AESE	AESD, AESE
		AESIMC, AESMC	AESIMC, AESMC
		SHA1C, SHA1H, SHA1M, SHA1P	SHA1C, SHA1H, SHA1M, SHA1P
		SHA1SU0, SHA1SU1	SHA1SU0, SHA1SU1
		SHA256H, SHA256H2	SHA256H, SHA256H2
		SHA256SU0, SHA256SU1	SHA256SU0, SHA256SU1

4 NEON programming basics

There are four ways of using NEON[vii]:

NEON optimized libraries
Vectorizing compilers
NEON intrinsics
NEON assembly

4.1 Libraries

The users can call the NEON optimized libraries directly in their program. Currently, you can use the following libraries:

OpenMax DL

This provides the recommended approach for accelerating AV codecs and supports signal processing and color space conversions.

Ne10

It is ARM’s open source project. Currently, the Ne10 library provides some math, image processing and FFT function. The FFT implementation is faster than other open source FFT implementations.

4.2 Vectorizing compilers

Adding vectorizing options in GCC can help C code to generate NEON code. GNU GCC gives you a wide range of options that aim to increase the speed, or reduce the size of the executable files they generate. For each line in your source code there are generally many possible choices of assembly instructions that could be used. The compiler must trade-off a number of resources, such as registers, stack and heap space, code size (number of instructions), compilation time, ease of debug, and number of cycles per instruction in order to produce an optimized image file.

4.3 NEON intrinsics

NEON intrinsics provides a C function call interface to NEON operations, and the compiler will automatically generate relevant NEON instructions allowing you to program once and run on either an ARMv7-A or ARMv8-A platform. If you intend to use the AArch64 specific NEON instructions, you can use the (__aarch64__) macro definition to separate these codes, as in the following example.

NEON intrinsics example:

//add for float array. assumed that count is multiple of 4

#include

void add_float_c(float* dst, float* src1, float* src2, int count)

{

int i;

for (i = 0; i < count; i++)

dst[i] = src1[i] + src2[i];

}

void add_float_neon1(float* dst, float* src1, float* src2, int count)

{

int i;

for (i = 0; i < count; i += 4)

{

float32x4_t in1, in2, out;

in1 = vld1q_f32(src1);

src1 += 4;

in2 = vld1q_f32(src2);

src2 += 4;

out = vaddq_f32(in1, in2);

vst1q_f32(dst, out);

dst += 4;

// The following is only an example describing how to use AArch64 specific NEON

// instructions.

#if defined (__aarch64__)

float32_t tmp = vaddvq_f32(in1);

#endif

}

Checking disassembly, you can find vld1/vadd/vst1 NEON instruction on ARMv7-A platform and ldr/fadd/str NEON instruction on ARMv8-A platform.

4.4 NEON assembly

There are two ways to write NEON assembly:

Assembly files
Inline assembly

4.4.1 Assembly files

You can use ".S" or “.s” as the file suffix. The only difference is that C/C ++ preprocessor will process .S files first. C language features such as macro definitions can be used.

When writing NEON assembly in a separate file, you need to pay attention to saving the registers. For both ARMv7 and ARMv8, the following registers must be saved:

	ARMv7-A/AArch32	AArch64[viii]
General purpose registers	R0-R3 parameters R4-R11 need to be saved R12 IP R13(SP) R14(LR) need to be saved R0 for return value	X0-X7 parameters X8-X18 X19-X28 need to be saved X29(FP) need to be saved X30(LR) X0, X1 for return value
NEON registers	D8-D15 need to be saved	D part of V8-V15 need to be saved
Stack alignment	64-bit alignment	128-bit alignment[ix]
Stack push/pop	PUSH/POP Rn list VPUSH/VPOP Dn list	LDP/STP register pair

The following is an example of ARM v7-A and ARM v8-A NEON assembly.

//header void add_float_neon2(float* dst, float* src1, float* src2, int count);
//assembly code in .S file
ARMv7-A/AArch32	AArch64
.text .syntax unified .align 4 .global add_float_neon2 .type add_float_neon2, %function .thumb .thumb_func add_float_neon2: .L_loop: vld1.32 {q0}, [r1]! vld1.32 {q1}, [r2]! vadd.f32 q0, q0, q1 subs r3, r3, #4 vst1.32 {q0}, [r0]! bgt .L_loop bx lr	.text .align 4 .global add_float_neon2 .type add_float_neon2, %function add_float_neon2: .L_loop: ld1 {v0.4s}, [x1], #16 ld1 {v1.4s}, [x2], #16 fadd v0.4s, v0.4s, v1.4s subs x3, x3, #4 st1 {v0.4s}, [x0], #16 bgt .L_loop ret

For more examples, see: https://github.com/projectNe10/Ne10/tree/master/modules/dsp

4.4.2 Inline assembly

You can use NEON inline assembly directly in C/C++ code.

Pros:

The procedure call standard is simple. You do not need to save registers manually.
You can use C / C ++ variables and functions, so it can be easily integrated into C / C ++ code.

Cons:

Inline assembly has a complex syntax.
NEON assembly code is embedded in C/C ++ code, and it’s not easily ported to other platforms.

Example:

// ARMv7-A/AArch32

void add_float_neon3(float* dst, float* src1, float* src2, int count)

{

asm volatile (

"1: \n"

"vld1.32 {q0}, [%[src1]]! \n"

"vld1.32 {q1}, [%[src2]]! \n"

"vadd.f32 q0, q0, q1 \n"

"subs %[count], %[count], #4 \n"

"vst1.32 {q0}, [%[dst]]! \n"

"bgt 1b \n"

: [dst] "+r" (dst)

: [src1] "r" (src1), [src2] "r" (src2), [count] "r" (count)

: "memory", "q0", "q1"

);

}

// AArch64

void add_float_neon3(float* dst, float* src1, float* src2, int count)

{

asm volatile (

"1: \n"

"ld1 {v0.4s}, [%[src1]], #16 \n"

"ld1 {v1.4s}, [%[src2]], #16 \n"

"fadd v0.4s, v0.4s, v1.4s \n"

"subs %[count], %[count], #4 \n"

"st1 {v0.4s}, [%[dst]], #16 \n"

"bgt 1b \n"

: [dst] "+r" (dst)

: [src1] "r" (src1), [src2] "r" (src2), [count] "r" (count)

: "memory", "v0", "v1"

);

}

4.5 NEON intrinsics and assembly

NEON intrinsics and assembly are the commonly used NEON. The following table describes the pros and cons of these two approaches:

	NEON assembly	NEON intrinsic
Performance	Always shows the best performance for thespecified platform for an experienced developer.	Depends heavily on the toolchain used
Portability	The different ISAs (ARMv7-A/AArch32 and AArch64) have different assembly implementations. Even for the same ISA, the assembly might need to be fine-tuned to achieve ideal performance between different micro architectures.	Program once and run on different ISA’s. The compiler may also grant performance fine-tuning for different micro-architectures.
Maintainability	Hard to read/write compared to C.	Similar to C code, it’s easy to read/write.

This is a simple summary. When applying NEON to more complex scenarios, there will be many special cases. This will be described in a future article ARM NEON Optimization.

With the above information, you can choose a NEON implementation and start your NEON programming journey.

For more reference documentation, please check the appendix.

Appendix: NEON reference document

[i] The ARM Architecture Version 6 (ARMv6) David Brash: page 13

[ii] ARM Cortex-A Series Programmer’s Guide Version 4.0: page 7-5

[iii] http://www.arm.com/zh/products/processors/instruction-set-architectures/armv8-architecture.php

[iv] ARM® Compiler toolchain Version 5.02 Assembler Reference: Chapter 4

NEON and VFP Programming

ARM Cortex™-A Series Version: 4.0 Programmer’s Guide: 7.2.4 NEON instruction set

[v] ARMv8 Instruction Set Overview: 5.8 Advanced SIMD

[vi] ARMv8 Instruction Set Overview: 5.8.25 AArch32 Equivalent Advanced SIMD Mnemonics

[vii] http://www.arm.com/zh/products/processors/technologies/neon.php

[viii]Procedure Call Standard for the ARM 64-bit Architecture (AArch64) : 5 THE BASE PROCEDURE CALL STANDARD

[ix] Procedure Call Standard for the ARM 64-bit Architecture (AArch64) : 5.2.2 The Stack

2462 查看标签： neon , simd

后续补充翻译未完待续。。。

你可能感兴趣的:(Neon 指令集 ARMv7/v8 对比)

从0到500+，我是如何利用自媒体赚钱？一列脚印
运营公众号半个多月，从零基础的小白到现在慢慢懂了一些运营的知识。做好公众号是很不容易的，要做很多事情；排版、码字、引流…通通需要自己解决，业余时间全都花费在这上面涨这么多粉丝是真的不容易，对比知乎大佬来说，我们这种没资源，没人脉，还没钱的小透明来说，想要一个月涨粉上万，怕是今天没睡醒（不过你有的方法，算我piapia打脸）至少我是清醒的，自己慢慢努力，实现我的万粉目标！大家快来围观、支持我吧！孩子
libyuv之linux编译 jaronho Linux linux 运维服务器
文章目录一、下载源码二、编译源码三、注意事项1、银河麒麟系统（aarch64）（1）解决armv8-a+dotprod+i8mm指令集支持问题（2）解决armv9-a+sve2指令集支持问题一、下载源码到GitHub网站下载https://github.com/lemenkov/libyuv源码，或者用直接用git克隆到本地，如：gitclonehttps://github.com/lemenko
ARM中断处理过程落汤老狗嵌入式linux
一、前言本文主要以ARM体系结构下的中断处理为例，讲述整个中断处理过程中的硬件行为和软件动作。具体整个处理过程分成三个步骤来描述：1、第二章描述了中断处理的准备过程2、第三章描述了当发生中的时候，ARM硬件的行为3、第四章描述了ARM的中断进入过程4、第五章描述了ARM的中断退出过程本文涉及的代码来自3.14内核。另外，本文注意描述ARM指令集的内容，有些sourcecode为了简短一些，删除了T
心有蓝天白云，爱情便会晴空万里，然后有花香有鸟鸣有美好的未来曹十二吖
丁南的婚姻，来自于一场她对生命的对比。她曾经说过，当她最爱的母亲用生命去逼迫她结婚的时候，她曾一度不理解到愤怒，甚至于想过用轻生来对抗母亲的不理智。庆幸的是，丁南是一个自我调节能力非常强的人，她想如果我连死亡都不怕，还怕不能经营好一段婚姻吗？抱着这样的念头，24年没有谈过恋爱的她，用短短三个月的时间，完成了少女到女人的蜕变。她曾经说过：“我要把自己最珍贵的东西留给自己命中注定的那个人。”闺蜜几人中
《Python数据分析实战终极指南》 xjt921122 python 数据分析开发语言
对于分析师来说，大家在学习Python数据分析的路上，多多少少都遇到过很多大坑**，有关于技能和思维的**：Excel已经没办法处理现有的数据量了，应该学Python吗？找了一大堆Python和Pandas的资料来学习，为什么自己动手就懵了？跟着比赛类公开数据分析案例练了很久，为什么当自己面对数据需求还是只会数据处理而没有分析思路？学了对比、细分、聚类分析，也会用PEST、波特五力这类分析法，为啥
【六】阿伟开始搭建Kafka学习环境能源恒观中间件学习 kafka spring
阿伟开始搭建Kafka学习环境概述上一篇文章阿伟学习了Kafka的核心概念，并且把市面上流行的消息中间件特性进行了梳理和对比，方便大家在学习过程中进行对比学习，最后梳理了一些Kafka使用中经常遇到的Kafka难题以及解决思路，经过上一篇的学习我相信大家对Kafka有了初步的认识，本篇将继续学习Kafka。一、安装和配置学习一项技术首先要搭建一套服务，而Kafka的运行主要需要部署jdk、zook
ARM V8 base instruction -- Debug instructions xiaozhiwise Assembly arm
/**Debuginstructions*/BRK#imm16进入monitormodedebug，那里有on-chipdebugmonitorcodeHLT#imm16进入haltmodedebug，连接有外部调试硬件
Armv8.3 体系结构扩展--原文版代码改变世界ctw ARM-TEE-Android armv8 嵌入式 arm架构安全架构芯片 Trustzone Secureboot
快速链接:.ARMv8/ARMv9架构入门到精通-[目录]付费专栏-付费课程【购买须知】:个人博客笔记导读目录(全部)TheArmv8.3architectureextensionTheArmv8.3architectureextensionisanextensiontoArmv8.2.Itaddsmandatoryandoptionalarchitecturalfeatures.Somefeat
ARMv8 Debug __pop_ ARMv8 ARM64 架构 linux 运维
内容来自DEN0024A_v8_architecture_PG.pdf本质ARMv8Debug是什么历史在ARMv4开始被引入,并已发展成一系列广泛的调试(debug1)和跟踪(trace)功能ARMv6和ARMv7-a新增了自托管调试(debug2)和性能评测(trace-enhance)ARMv8处理器提供硬件功能侵入式:调试工具能够对核心活动提供显著级别的控制非侵入式:以非侵入性方式收集有关
ARMV8体系结构简介：概述简单同学 ARMV8体系结构 ARMV8
1.前言本文主要概括的介绍ARMV8体系结构定义了哪些内容，概括的说：ARM体系结构定义了PE的行为，不会定义具体的实现ARM体系结构也定义了debug体系结构和trace体系结构ARM体系结构采用RISC指令集（1）长度一致的寄存器；（2）load/store架构，数据处理操作只能对寄存器内容进行处理，不会直接对内存的内容进行处理；（3）简单寻址方式，load/store地址来源于寄存器或指令域
python获取子进程返回值_Python对进程Multiprocessing子进程返回值 weixin_39752157 python获取子进程返回值
在实际使用多进程的时候，可能需要获取到子进程运行的返回值。如果只是用来存储，则可以将返回值保存到一个数据结构中；如果需要判断此返回值，从而决定是否继续执行所有子进程，则会相对比较复杂。另外在Multiprocessing中，可以利用Process与Pool创建子进程，这两种用法在获取子进程返回值上的写法上也不相同。这篇中，我们直接上代码，分析多进程中获取子进程返回值的不同用法，以及优缺点。初级用法
好事多磨红豆_
今天从店长那里学到了一个词，好事多磨。今天所发生的事情也让我深刻的领会了这个词的意义。原本定好今晚签合同，上午房东突然打来电话说他不想租给那两个女孩了。也就是我的客户，房东说，有其他人帮他找到了一个租客，价格就是他定的价格，不还价，并且押一付三。这一对比，我的客户一点优势都没有。可是客户明天就要搬家了，这个时候，上哪给客户找房子去，客户也上班，没有时间看房。听完房东的话，跟房东也商量了很久，房东很
现在做自媒体还赚钱吗，普通人怎样做自媒体赚钱？氧惠好物
短视频平台很多，但真正能赚到钱的不多，选好阵地盆满钵满，选错阵地颗粒无收也可以做氧惠APP分享赚钱，2023新型淘客平台，收益还不错氧惠（全网优惠上氧惠）——是与以往完全不同的抖客+淘客app！2023全新模式，我的直推也会放到你下面，注册送V8等级，欢迎各位团队长体验！也期待你的加入。氧惠邀请码166666，注册就帮你推广，一起做到百万团队！氧惠怎么使用1复制淘宝（其它平台）商品链接，淘口令，标
午饭吃米好还是吃面好？第二梦想
1，午饭后和同事谈论午饭是吃米好还是吃面好，记得这个话题在网上曾经有过激烈地争论。2，米和面作为主食，都是通过碳水化合物来提供能量，通过数据对比两者在热量、碳水化合物、脂肪、蛋白质各方面都是十分接近的。3，那为什么有的人就觉得吃面才有舒服的饱腹感，有的人就觉得吃米才好消化呢？应该是长期饮食习惯不同导致的差异。4，我觉得中午吃米饭好，根据自身经验吃米饭极少吃撑，而吃面则十有八九下午都会嗳气，也有多次
我们一起喵喵喵米菲兴哥
2021-4-16星期五晴天今天忙碌了2件事情，车险和接种疫苗。对比平安的车险，电销的保险是优惠不少，还送电子门锁（不含安装费用），等会儿查核电子门锁的价格。今天在公司接种疫苗，上次公司安排到社区接种，有点心虚，没有去。这次安排到公司的，就接种吧。早晚要接种的，这次安排这么好，上班时间接种疫苗，直接干呢。下次的接种时间已经安排好啦。刚开始还感觉有点怕怕，皮肤消毒过后，就只有凉凉的感觉，护士的手一接
ruby和python哪个好学 hakesashou python基础知识 ruby python 开发语言
Ruby和python都挺好学的。建议学习Python，语法的话，Python相对更简洁。而且Python应用场合更广泛，运维、网站开发、数据处理、科学研究都可以。Ruby和Python十分相似，有很多共同点，但也有一些不同之外，以下是Python和Ruby的对比：1、Python和Ruby都是面向对象的语言，都是动态和灵活的。二者的主要区别在于他们解决问题的方式。Ruby提供了不同的方法，而Py
小红书和知乎哪个平台更适合种草?小红书和知乎平台区别氧惠评测
这篇文章主要介绍了小红书和知乎哪个平台更适合种草?，小红书和知乎平台区别的相关资料，小编觉得这篇文章对于那些还不了解小红书和知乎平台对比方面知识的小伙伴来说很有参考性，一起来看看吧购物、看电影、点外卖、用氧惠APP！更优惠！氧惠（全网优惠上氧惠）——是与以往完全不同的抖客+淘客app！2022全新模式，我的直推也会放到你下面，送1:1超级补贴(邀请好友自购多少，你就推广得多少，非常厉害)，欢迎各位
Kafka 基础与架构理解 StaticKing KAFKA kafka
目录前言Kafka基础概念消息队列简介：Kafka与传统消息队列（如RabbitMQ、ActiveMQ）的对比Kafka的组件Kafka的工作原理：消息的生产、分发、消费流程Kafka系统架构Kafka的分布式架构设计Leader-Follower机制与数据复制Log-basedStorage和持久化Broker间通信协议Zookeeper在Kafka中的角色总结前言Kafka是一个分布式的消息系
小说《101所》09：官司（中）一言莫辩
经过合同、沙盘和现场对比，李天明觉得外部环境的变化，可以打打官司，至少还有沙盘模型作为证据，虽然合同里声明不能作为的合同的条款，但外部环境足以影响到是否购买底楼的房子，而且这是开发商提供的格式合同，该条款明显规避了开发商的责任，签订合同时没有特别的提示，李天明记得当初自学法律时，记得特别清楚，书上举的例子是保险合同的免责条款。慎重起见，李天明专门咨询了法院和律师朋友，虽然没有得到确切的答复，但是找
遥感图像分割系统：融合空间金字塔池化（FocalModulation)改进YOLOv8 xuehaisj YOLO 人工智能计算机视觉 yolov8
1.研究背景与意义项目参考AAAIAssociationfortheAdvancementofArtificialIntelligence研究背景与意义遥感图像分割是遥感技术领域中的一个重要研究方向，它的目标是将遥感图像中的不同地物或地物类别进行有效的分割和识别。随着遥感技术的不断发展和遥感图像数据的大规模获取，遥感图像分割在农业、城市规划、环境监测等领域具有广泛的应用前景。然而，由于遥感图像的特
2023-06-15 小金119
我对访谈阶段的认识：第一、【要放松】对比了李娜老师和曼姨的访谈，我感受到曼姨的放松、从容。访谈就是逐步了解个案，不是带着必须要完成任务去交谈。是谈着谈着就发现了主要问题，而不是带着［访谈阶段要完成相关任务］的压力去访谈。催眠师比个案还紧张的话，个案就不好敞开了。催眠师很轻松的和对方聊天，对方也会放松，容易敞开。第二、【专注并好奇】催眠师一直专注在个案身上，对个案保持好奇。对于个案的答案，能继续提出
中国为什么没有发展出具有影响力的宗教？ llSteven
关于这个话题，我就想笼统地随便聊聊，文章里的内容会稍显片面，有兴趣地小伙伴我们私底下聊，我就随便扯扯，说说有意思的。Reference我就不写了，麻烦！在中国，宗教在我们的历史进程中没有很大的影响。中国的传统宗教有两个，佛教和道教，但信仰者的比例很低。到了现在，可以说中国不是一个有宗教信仰的国家。或者我猜测，大多数人会说信仰科学。对比西方，相信大家都知道就不用多说了。说个有意思的，2012年的时候
华为云分布式缓存服务DCS与开源服务差异对比 hcinfo_18 redis使用华为云 Redis5.0 分布式缓存服务 Redis客户端
分布式缓存服务DCS提供单机、主备、集群等丰富的实例类型，满足用户高读写性能及快速数据访问的业务诉求。支持丰富的实例管理操作，帮助用户省去运维烦恼。用户可以聚焦于业务逻辑本身，而无需过多考虑部署、监控、扩容、安全、故障恢复等方面的问题。DCS基于开源Redis、Memcached向用户提供一定程度定制化的缓存服务，因此，除了拥有开源服务缓存数据库的优秀特性，DCS提供更多实用功能。一、与开源Red
京粉怎么给自己返利?京粉自己下单有佣金吗日常购物小技巧
大家好，我是花桃APP商品推荐官：美美，今天给各位说说京粉怎么给自己返利?京粉自己下单有佣金吗。京粉怎么给自己返利？京粉自己下单有佣金吗京粉怎么给自己返利？京粉自己下单买有佣金吗？怎么实现单个账号自推自买？说【京粉返利】之前给大家推荐一款返利APP，【全网返利最高哦!可以对比一下自己在用的返利软件】都是有内部返利和优惠券的，应用商店搜索下载花桃APP即可查询返利佣金。【官方邀请码：00028】目前
从简单到复杂：三种工厂模式的对比与应用技术拾光者设计模式 java 设计模式简单工厂模式抽象工厂模式工厂方法模式
在软件设计中，创建型设计模式用于处理对象创建的复杂性。本文将对比三种常见的创建型设计模式：简单工厂模式、工厂方法模式和抽象工厂模式。一，简单工厂模式定义：简单工厂模式（SimpleFactoryPattern）定义了一个工厂类，该类可以根据传入的参数决定创建哪一种产品实例。结构：产品（Product）：定义产品的接口。具体产品（ConcreteProduct）：实现具体产品。工厂（Factory）
odoo 开源版/企业版/社区版的对比分析 lijianhua_9712 odoo odoo
odoo的三个版本1开源版开发者odoo限制功能版本优点功能稳定，bug少缺点限制功能，进销存勉强可用2企业版开发者odoo中型企业功能优点功能稳定，bug少缺点授权费用昂贵3社区版开发者社区(1700余名专家）大型企业功能优点功能丰富，社区不受odoo公司控制，社区开发者基本都是资深erp技术专家，增加了大量细致功能缺点存在一些bug为什么用odoo社区版，不用odoo企业版呢1odoo企业版是
452期：2022年吉林养老金方案公布，三降一升一持平，基础养老金高优势大社保小龙虾
社保知识，小龙虾每日分享第452期，欢迎关注！续陕西首先默默调整2022年养老金后，吉林也开始进行了养老金的调整（河南的公告不算，没有任何具体调整比例！）下面随小龙虾一起看看吉林的调整方案，具体参照吉人社联【2022】95号文件！一、具体方案对比，三降一升1、定额调整2022年吉林的定额调整金额是每人每月增加30元，相对于去年的36元，下降了6元，下降幅度16.67%2、挂钩调整：工龄挂钩调整工龄
详解C语言中的循环语句埋头编程~ C语言 c语言开发语言
文章目录1.前言2.while循环2.1if和whlie的对比2.2while语句的工作机制2.3while循环的实践3.for循环3.1for循环语法3.2for循环的工作机制3.3for循环实践4dowhile循环4.1dowhlie循环语法4.2dowhile循环的工作机理4.3dowhile循环实践5.break和continue语句5.1break举例5.2continue举例6.got
淘宝延长收货可以延长多久，淘宝的延长收货能延迟几天日常购物小技巧
在淘宝上购物虽然很方便，但是等待收货的过程是非常煎熬的，这对于一些心急的人来说非常不友好，而且有的时候因为各种原因还会导致货物不能按时送达，这个时候为了防止货物出现问题可以选择延长收货来确保收到货物。那么淘宝延长收货可以延长多久，淘宝的延迟收货能延迟几天？说【淘宝延长收货】之前给大家推荐一款返利APP，【全网返利最高哦!可以对比一下自己在用的返利软件】都是有内部返利和优惠券的，应用商店搜索下载花桃
架构师备考的一些思考（四） kiba518
前言对于数学，我们之前学的是对的，但不是真的，所以我们没有数学思维。对于计算机，我们学校教的是对的，但不是真的，所以仅仅从学校学习知识的应届毕业生，不论985,211，本科，专科都一样，都是一张白纸，啥也不会。案例分析案例分析是5选3，第一题必答。问题一的类型架构风格对比问题二的类型质量属性填写问题三的类型ER图分析问题类型四场景分析，此类型题比较多。案例分析主要是结合我们之前介绍的内容和自身的经
apache 安装linux windows 墙头上一根草 apache inux windows
linux安装Apache 有两种方式一种是手动安装通过二进制的文件进行安装，另外一种就是通过yum 安装，此中安装方式，需要物理机联网。以下分别介绍两种的安装方式通过二进制文件安装Apache需要的软件有apr,apr-util,pcre 1，安装 apr 下载地址：htt
fill_parent、wrap_content和match_parent的区别 Cb123456 match_parent fill_parent
fill_parent、wrap_content和match_parent的区别: 1）fill_parent 设置一个构件的布局为fill_parent将强制性地使构件扩展，以填充布局单元内尽可能多的空间。这跟Windows控件的dockstyle属性大体一致。设置一个顶部布局或控件为fill_parent将强制性让它布满整个屏幕。 2） wrap_conte
网页自适应设计天子之骄 html css 响应式设计页面自适应
网页自适应设计网页对浏览器窗口的自适应支持变得越来越重要了。自适应响应设计更是异常火爆。再加上移动端的崛起，更是如日中天。以前为了适应不同屏幕分布率和浏览器窗口的扩大和缩小，需要设计几套css样式，用js脚本判断窗口大小，选择加载。结构臃肿，加载负担较大。现笔者经过一定时间的学习，有所心得，故分享于此，加强交流，共同进步。同时希望对大家有所
[sql server] 分组取最大最小常用sql 一炮送你回车库 SQL Server
--分组取最大最小常用sql--测试环境if OBJECT_ID('tb') is not null drop table tb;gocreate table tb( col1 int, col2 int, Fcount int)insert into tbselect 11,20,1 union allselect 11,22,1 union allselect 1
ImageIO写图片输出到硬盘 3213213333332132 java image
package awt; import java.awt.Color; import java.awt.Font; import java.awt.Graphics; import java.awt.image.BufferedImage; import java.io.File; import java.io.IOException; import javax.imagei
自己的String动态数组宝剑锋梅花香 java 动态数组数组
数组还是好说，学过一两门编程语言的就知道，需要注意的是数组声明时需要把大小给它定下来，比如声明一个字符串类型的数组：String str[]=new String[10]; 但是问题就来了，每次都是大小确定的数组，我需要数组大小不固定随时变化怎么办呢？动态数组就这样应运而生，龙哥给我们讲的是自己用代码写动态数组，并非用的ArrayList 看看字符
pinyin4j工具类 darkranger .net
pinyin4j工具类Java工具类 2010-04-24 00:47:00 阅读69 评论0 字号：大中小引入pinyin4j-2.5.0.jar包: pinyin4j是一个功能强悍的汉语拼音工具包，主要是从汉语获取各种格式和需求的拼音，功能强悍，下面看看如何使用pinyin4j。本人以前用AscII编码提取工具，效果不理想，现在用pinyin4j简单实现了一个。功能还不是很完美，
StarUML学习笔记----基本概念 aijuans UML建模
介绍StarUML的基本概念，这些都是有效运用StarUML?所需要的。包括对模型、视图、图、项目、单元、方法、框架、模型块及其差异以及UML轮廓。模型、视与图（Model, View and Diagram） &
Activiti最终总结 avords Activiti id 工作流
1、流程定义ID：ProcessDefinitionId，当定义一个流程就会产生。 2、流程实例ID：ProcessInstanceId，当开始一个具体的流程时就会产生，也就是不同的流程实例ID可能有相同的流程定义ID。 3、TaskId，每一个userTask都会有一个Id这个是存在于流程实例上的。 4、TaskDefinitionKey和（ActivityImpl activityId
从省市区多重级联想到的，react和jquery的差别 bee1314 jquery UI react
在我们的前端项目里经常会用到级联的select，比如省市区这样。通常这种级联大多是动态的。比如先加载了省，点击省加载市，点击市加载区。然后数据通常ajax返回。如果没有数据则说明到了叶子节点。针对这种场景，如果我们使用jquery来实现，要考虑很多的问题，数据部分，以及大量的dom操作。比如这个页面上显示了某个区，这时候我切换省，要把市重新初始化数据，然后区域的部分要从页面
Eclipse快捷键大全 bijian1013 java eclipse 快捷键
Ctrl+1 快速修复(最经典的快捷键,就不用多说了)Ctrl+D: 删除当前行 Ctrl+Alt+↓ 复制当前行到下一行(复制增加)Ctrl+Alt+↑ 复制当前行到上一行(复制增加)Alt+↓ 当前行和下面一行交互位置(特别实用,可以省去先剪切,再粘贴了)Alt+↑ 当前行和上面一行交互位置(同上)Alt+← 前一个编辑的页面Alt+→ 下一个编辑的页面(当然是针对上面那条来说了)Alt+En
js 笔记函数征客丶 JavaScript
一、函数的使用 1.1、定义函数变量 var vName = funcation(params){ } 1.2、函数的调用函数变量的调用： vName(params); 函数定义时自发调用：(function(params){})(params); 1.3、函数中变量赋值 var a = 'a'; var ff
【Scala四】分析Spark源代码总结的Scala语法二 bit1129 scala
1. Some操作在下面的代码中，使用了Some操作：if (self.partitioner == Some(partitioner))，那么Some(partitioner)表示什么含义？首先partitioner是方法combineByKey传入的变量， Some的文档说明： /** Class `Some[A]` represents existin
java 匿名内部类 BlueSkator java匿名内部类
组合优先于继承 Java的匿名类，就是提供了一个快捷方便的手段，令继承关系可以方便地变成组合关系继承只有一个时候才能用，当你要求子类的实例可以替代父类实例的位置时才可以用继承。在Java中内部类主要分为成员内部类、局部内部类、匿名内部类、静态内部类。内部类不是很好理解，但说白了其实也就是一个类中还包含着另外一个类如同一个人是由大脑、肢体、器官等身体结果组成，而内部类相
盗版win装在MAC有害发热，苹果的东西不值得买，win应该不用 ljy325 游戏 apple windows XP OS
Mac mini 型号: MC270CH-A RMB:5,688 Apple 对windows的产品支持不好,有以下问题: 1.装完了xp,发现机身很热虽然没有运行任何程序！貌似显卡跑游戏发热一样，按照那样的发热量,那部机子损耗很大,使用寿命受到严重的影响! 2.反观安装了Mac os的展示机，发热量很小，运行了1天温度也没有那么高 &nbs
读《研磨设计模式》-代码笔记-生成器模式-Builder bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ /** * 生成器模式的意图在于将一个复杂的构建与其表示相分离，使得同样的构建过程可以创建不同的表示（GoF） * 个人理解： * 构建一个复杂的对象，对于创建者（Builder）来说，一是要有数据来源(rawData)，二是要返回构
JIRA与SVN插件安装 chenyu19891124 SVN jira
JIRA安装好后提交代码并要显示在JIRA上，这得需要用SVN的插件才能看见开发人员提交的代码。 1.下载svn与jira插件安装包，解压后在安装包(atlassian-jira-subversion-plugin-0.10.1) 2.解压出来的包里下的lib文件夹下的jar拷贝到(C:\Program Files\Atlassian\JIRA 4.3.4\atlassian-jira\WEB
常用数学思想方法 comsci 工作
对于搞工程和技术的朋友来讲，在工作中常常遇到一些实际问题，而采用常规的思维方式无法很好的解决这些问题，那么这个时候我们就需要用数学语言和数学工具，而使用数学工具的前提却是用数学思想的方法来描述问题。。下面转帖几种常用的数学思想方法，仅供学习和参考函数思想　　把某一数学问题用函数表示出来，并且利用函数探究这个问题的一般规律。这是最基本、最常用的数学方法
pl/sql集合类型 daizj oracle 集合 type pl/sql
--集合类型 /* 单行单列的数据，使用标量变量单行多列数据，使用记录单列多行数据，使用集合（。。。） *集合：类似于数组也就是。pl/sql集合类型包括索引表（pl/sql table）、嵌套表（Nested Table）、变长数组（VARRAY）等 */ /* --集合方法 &n
[Ofbiz]ofbiz初用 dinguangx 电商 ofbiz
从github下载最新的ofbiz（截止2015-7-13），从源码进行ofbiz的试用 1. 加载测试库 ofbiz内置derby，通过下面的命令初始化测试库 ./ant load-demo (与load-seed有一些区别) 2. 启动内置tomcat ./ant start 或 ./startofbiz.sh 或 java -jar ofbiz.jar &
结构体中最后一个元素是长度为0的数组 dcj3sjt126com c gcc
在Linux源代码中，有很多的结构体最后都定义了一个元素个数为0个的数组，如/usr/include/linux/if_pppox.h中有这样一个结构体： struct pppoe_tag { __u16 tag_type; __u16 tag_len; &n
Linux cp 实现强行覆盖 dcj3sjt126com linux
发现在Fedora 10 /ubutun 里面用cp -fr src dest，即使加了-f也是不能强行覆盖的，这时怎么回事的呢？一两个文件还好说，就输几个yes吧，但是要是n多文件怎么办，那还不输死人呢？下面提供三种解决办法。方法一我们输入alias命令，看看系统给cp起了一个什么别名。 [root@localhost ~]# aliasalias cp=’cp -i’a
Memcached(一)、HelloWorld frank1234 memcached
一、简介高性能的架构离不开缓存，分布式缓存中的佼佼者当属memcached，它通过客户端将不同的key hash到不同的memcached服务器中，而获取的时候也到相同的服务器中获取，由于不需要做集群同步，也就省去了集群间同步的开销和延迟，所以它相对于ehcache等缓存来说能更好的支持分布式应用，具有更强的横向伸缩能力。二、客户端选择一个memcached客户端，我这里用的是memc
Search in Rotated Sorted Array II hcx2013 search
Follow up for "Search in Rotated Sorted Array":What if duplicates are allowed? Would this affect the run-time complexity? How and why? Write a function to determine if a given ta
Spring4新特性——更好的Java泛型操作API jinnianshilongnian spring4 generic type
Spring4新特性——泛型限定式依赖注入 Spring4新特性——核心容器的其他改进 Spring4新特性——Web开发的增强 Spring4新特性——集成Bean Validation 1.1(JSR-349)到SpringMVC Spring4新特性——Groovy Bean定义DSL Spring4新特性——更好的Java泛型操作API Spring4新
CentOS安装JDK liuxingguome centos
1、行卸载原来的： [root@localhost opt]# rpm -qa | grep java tzdata-java-2014g-1.el6.noarch java-1.7.0-openjdk-1.7.0.65-2.5.1.2.el6_5.x86_64 java-1.6.0-openjdk-1.6.0.0-11.1.13.4.el6.x86_64 [root@localhost
二分搜索专题2-在有序二维数组中搜索一个元素 OpenMind 二维数组算法二分搜索
1,设二维数组p的每行每列都按照下标递增的顺序递增。用数学语言描述如下：p满足 (1),对任意的x1，x2，y，如果x1<x2,则p(x1,y)<p(x2,y); (2),对任意的x，y1,y2, 如果y1<y2,则p(x,y1)<p(x,y2); 2,问题：给定满足1的数组p和一个整数k，求是否存在x0,y0使得p(x0,y0)=k? 3,算法分析： (
java 随机数 Math与Random SaraWon java Math Random
今天需要在程序中产生随机数，知道有两种方法可以使用，但是使用Math和Random的区别还不是特别清楚，看到一篇文章是关于的，觉得写的还挺不错的，原文地址是 http://www.oschina.net/question/157182_45274?sort=default&p=1#answers 产生1到10之间的随机数的两种实现方式： //Math Math.roun
oracle创建表空间 tugn oracle
create temporary tablespace TXSJ_TEMP tempfile 'E:\Oracle\oradata\TXSJ_TEMP.dbf' size 32m autoextend on next 32m maxsize 2048m extent m
使用Java8实现自己的个性化搜索引擎 yangshangchuan java superword 搜索引擎 java8 全文检索
需要对249本软件著作实现句子级别全文检索，这些著作均为PDF文件，不使用现有的框架如lucene，自己实现的方法如下： 1、从PDF文件中提取文本，这里的重点是如何最大可能地还原文本。提取之后的文本，一个句子一行保存为文本文件。 2、将所有文本文件合并为一个单一的文本文件，这样，每一个句子就有一个唯一行号。 3、对每一行文本进行分词，建立倒排表，倒排表的格式为：词=包含该词的总行数N=行号