longlongway2012

D3d Shader Instruction

https://www.3dbrew.org/wiki/GPU/Shader_Instruction_Set

1Overview
2Nomenclature
3Instruction formats
4Instructions
5Operand descriptors
6Relative addressing
7Comparison operator
8Conditions
9Registers
10Floating-Point Behavior
11Control Flow

Overview[edit]

A compiled shader binary is comprised of two parts : the main instruction sequence and the operand descriptor table. These are both sent to the GPU around the same time but using separate GPU Commands. Instructions (such as format 1 instruction) may reference operand descriptors. When such is the case, the operand descriptor ID is the offset, in words, of the descriptor within the table. Both instructions and descriptors are coded in little endian. Basic implementations of the following specification can be found at [1] and [2]. The instruction set seems to have been heavily inspired by Microsoft's vs_3_0 [3] and the Direct3D shader code [4]. Please note that this page is being written as the instruction set is reverse engineered; as such it may very well contain mistakes.

Debug information found in the code.bin of "Ironfall: Invasion" suggests that there may not be more than 512 instructions and 128 operand descriptors in a shader.

Nomenclature[edit]

opcode names with I appended to them are the same as their non-I version, except they use the inverted instruction format, giving 7 bits to SRC2 (and access to uniforms) and 5 bits to SRC1

opcode names with U appended to them are the same as their non-U version, except they are executed conditionally based on the value of a uniform boolean.

opcode names with C appended to them are the same as their non-C version, except they are executed conditionally based on a logical expression specified in the instruction.

Instruction formats[edit]

Format 1 : (used for register operations)

Offset	Size (bits)	Description
0x0	0x7	Operand descriptor ID (DESC)
0x7	0x5	Source 2 register (SRC2)
0xC	0x7	Source 1 register (SRC1)
0x13	0x2	Address register index for SRC1 (IDX_1)
0x15	0x5	Destination register (DST)
0x1A	0x6	Opcode

Format 1i : (used for register operations)

Offset	Size (bits)	Description
0x0	0x7	Operand descriptor ID (DESC)
0x7	0x7	Source 2 register (SRC2)
0xE	0x5	Source 1 register (SRC1)
0x13	0x2	Address register index for SRC2 (IDX_2)
0x15	0x5	Destination register (DST)
0x1A	0x6	Opcode

Format 1u : (used for unary register operations)

Offset	Size (bits)	Description
0x0	0x7	Operand descriptor ID (DESC)
0xC	0x7	Source 1 register (SRC1)
0x13	0x2	Address register index for SRC1 (IDX_1)
0x15	0x5	Destination register (DST)
0x1A	0x6	Opcode

Format 1c : (used for comparison operations)

Offset	Size (bits)	Description
0x0	0x7	Operand descriptor ID (DESC)
0x7	0x5	Source 2 register (SRC2)
0xC	0x7	Source 1 register (SRC1)
0x13	0x2	Address register index for SRC1 (IDX_1)
0x15	0x3	Comparison operator for Y (CMPY)
0x18	0x3	Comparison operator for X (CMPX)
0x1B	0x5	Opcode

Format 2 : (used for flow control instructions)

Offset	Size (bits)	Description
0x0	0x8	Number of instructions (NUM)
0xA	0xC	Destination offset (in words) (DST)
0x16	0x2	Condition boolean operator (CONDOP)
0x18	0x1	Y reference bit (REFY)
0x19	0x1	X reference bit (REFX)
0x1A	0x6	Opcode

Format 3 : (used for uniform-based conditional flow control instructions)

Offset	Size (bits)	Description
0x0	0x8	Number of instructions ? (NUM)
0xA	0xC	Destination offset (in words) (DST)
0x16	0x4	Uniform ID (BOOL/INT)
0x1A	0x6	Opcode

Format 4 : (used for SETEMIT)

Offset	Size (bits)	Description
0x16	0x1	Winding flag (FLAG_WINDING)
0x17	0x1	Primitive emit flag (FLAG_PRIMEMIT)
0x18	0x2	Vertex ID (VTXID)
0x1A	0x6	Opcode

Format 5 : (used for MAD)

Offset	Size (bits)	Description
0x0	0x5	Operand descriptor ID (DESC)
0x5	0x5	Source 3 register (SRC3)
0xA	0x7	Source 2 register (SRC2)
0x11	0x5	Source 1 register (SRC1)
0x16	0x2	Address register index for SRC2 (IDX_2)
0x18	0x5	Destination register (DST)
0x1D	0x3	Opcode

Format 5i : (used for MADI)

Offset	Size (bits)	Description
0x0	0x5	Operand descriptor ID (DESC)
0x5	0x7	Source 3 register (SRC3)
0xC	0x5	Source 2 register (SRC2)
0x11	0x5	Source 1 register (SRC1)
0x16	0x2	Address register index for SRC3 (IDX_3)
0x18	0x5	Destination register (DST)
0x1D	0x3	Opcode

Instructions[edit]

Unless noted otherwise, SRC1 and SRC2 refer to their respectively indexed float[4] registers (after swizzling). Similarly, DST refers to its indexed register modulo destination component masking, i.e. an expression like DST=SRC1 might actually just set DST.y to SRC1.y.

Opcode	Format	Name	Description
0x00	1	ADD	Adds two vectors component by component; DST[i] = SRC1[i]+SRC2[i] for all i
0x01	1	DP3	Computes dot product on 3-component vectors; DST = SRC1.SRC2
0x02	1	DP4	Computes dot product on 4-component vectors; DST = SRC1.SRC2
0x03	1	DPH	Computes dot product on a 3-component vector with 1.0 appended to it and a 4-component vector; DST = SRC1.SRC2 (with SRC1 homogenous)
0x04	1	DST	Equivalent to Microsoft's dst instruction: DST = {1, SRC1[1]*SRC2[1], SRC1[2], SRC2[3]}
0x05	1u	EX2	Computes SRC1's first component exponent with base 2; DST[i] = EXP2(SRC1[0]) for all i
0x06	1u	LG2	Computes SRC1's first component logarithm with base 2; DST[i] = LOG2(SRC1[0]) for all i
0x07	1u	LITP	Appears to be related to Microsoft's lit instruction; DST = clamp(SRC1, min={0, -127.9961, 0, 0}, max={inf, 127.9961, 0, inf}); n.b.: 127.9961 = 0x7FFF / 0x100
0x08	1	MUL	Multiplies two vectors component by component; DST[i] = SRC1[i].SRC2[i] for all i
0x09	1	SGE	Sets output if SRC1 is greater than or equal to SRC2; DST[i] = (SRC1[i] >= SRC2[i]) ? 1.0 : 0.0 for all i
0x0A	1	SLT	Sets output if SRC1 is strictly less than SRC2; DST[i] = (SRC1[i] < SRC2[i]) ? 1.0 : 0.0 for all i
0x0B	1u	FLR	Computes SRC1's floor component by component; DST[i] = FLOOR(SRC1[i]) for all i
0x0C	1	MAX	Takes the max of two vectors, component by component; DST[i] = MAX(SRC1[i], SRC2[i]) for all i
0x0D	1	MIN	Takes the min of two vectors, component by component; DST[i] = MIN(SRC1[i], SRC2[i]) for all i
0x0E	1u	RCP	Computes the reciprocal of the vector's first component; DST[i] = 1/SRC1[0] for all i
0x0F	1u	RSQ	Computes the reciprocal of the square root of the vector's first component; DST[i] = 1/sqrt(SRC1[0]) for all i
0x10	?	???	?
0x11	?	???	?
0x12	1u	MOVA	Move to address register; Casts the float uniform given by SRC1 to an integer (truncating the fractional part) and assigns the result to (a0.x, a0.y, _, _), respecting the destination component mask.
0x13	1u	MOV	Moves value from one register to another; DST = SRC1.
0x14	?	???	?
0x15	?	???	?
0x16	?	???	?
0x17	?	???	?
0x18	1i	DPHI	Computes dot product on a 3-component vector with 1.0 appended to it and a 4-component vector; DST = SRC1.SRC2 (with SRC1 homogenous)
0x19	1i	DSTI	DST with sources swapped.
0x1A	1i	SGEI	Sets output if SRC1 is greater than or equal to SRC2; DST[i] = (SRC1[i] >= SRC2[i]) ? 1.0 : 0.0 for all i
0x1B	1i	SLTI	Sets output if SRC1 is strictly less than SRC2; DST[i] = (SRC1[i] < SRC2[i]) ? 1.0 : 0.0 for all i
0x1C	?	???	?
0x1D	?	???	?
0x1E	?	???	?
0x1F	?	???	?
0x20	0	BREAK	Breaks out of LOOP block; do not use while in nested IF/CALL block inside LOOP block.
0x21	0	NOP	Does literally nothing.
0x22	0	END	Signals the shader unit that processing for this vertex/primitive is done.
0x23	2	BREAKC	If condition (see below for details) is true, then breaks out of LOOP block.
0x24	2	CALL	Jumps to DST and executes instructions until it reaches DST+NUM instructions
0x25	2	CALLC	If condition (see below for details) is true, then jumps to DST and executes instructions until it reaches DST+NUM instructions, else does nothing.
0x26	3	CALLU	Jumps to DST and executes instructions until it reaches DST+NUM instructions if BOOL is true
0x27	3	IFU	If condition BOOL is true, then executes instructions until DST, then jumps to DST+NUM; else, jumps to DST.
0x28	2	IFC	If condition (see below for details) is true, then executes instructions until DST, then jumps to DST+NUM; else, jumps to DST
0x29	3	LOOP	Loops over the code between itself and DST (inclusive), performing INT.x+1 iterations in total. First, aL is initialized to INT.y. After each iteration, aL is incremented by INT.z.
0x2A	0 (no param)	EMIT	(geometry shader only) Emits a vertex (and primitive if FLAG_PRIMEMIT was set in the corresponding SETEMIT). SETEMIT must be called before this.
0x2B	4	SETEMIT	(geometry shader only) Sets VTXID, FLAG_WINDING and FLAG_PRIMEMIT for the next EMIT instruction. VTXID is the ID of the vertex about to be emitted within the primitive, while FLAG_PRIMEMIT is zero if we are just emitting a single vertex and non-zero if are emitting a vertex and primitive simultaneously. FLAG_WINDING controls the output primitive's winding. Note that the output vertex buffer (which holds 4 vertices) is notcleared when the primitive is emitted, meaning that vertices from the previous primitive can be reused for the current one. (this is still a working hypothesis and unconfirmed)
0x2C	2	JMPC	If condition (see below for details) is true, then jumps to DST, else does nothing.
0x2D	3	JMPU	If condition BOOL is true, then jumps to DST, else does nothing. Having bit 0 of NUM = 1 will invert the test, jumping if BOOL is false instead.
0x2E-0x2F	1c	CMP	Sets booleans cmp.x and cmp.y based on the operand's x and y components and the CMPX and CMPY comparison operators respectively. See below for details about operators. It's unknown whether CMP respects the destination component mask or not.
0x30-0x37	5i	MADI	Multiplies two vectors and adds a third one component by component; DST[i] = SRC3[i] + SRC2[i].SRC1[i] for all i; this is not an FMA, the intermediate result is rounded
0x38-0x3F	5	MAD	Multiplies two vectors and adds a third one component by component; DST[i] = SRC3[i] + SRC2[i].SRC1[i] for all i; this is not an FMA, the intermediate result is rounded

Operand descriptors[edit]

Sizes below are in bits, not bytes.

Offset	Size	Description
0x0	0x4	Destination component mask. Bit 3 = x, 2 = y, 1 = z, 0 = w.
0x4	0x1	Source 1 negation bit
0x5	0x8	Source 1 component selector
0xD	0x1	Source 2 negation bit
0xE	0x8	Source 2 component selector
0x16	0x1	Source 3 negation bit
0x17	0x8	Source 3 component selector

Component selector :

Offset	Size	Description
0x0	0x2	Component 3 value
0x2	0x2	Component 2 value
0x4	0x2	Component 1 value
0x6	0x2	Component 0 value

Value	Component
0x0	x
0x1	y
0x2	z
0x3	w

The component selector enables swizzling. For example, component selector 0x1B is equivalent to .xyzw, while 0x55 is equivalent to .yyyy.

Depending on the current shader opcode, source components are disabled implicitly by setting the destination component mask. For example, ADD o0.xy, r0.xyzw, r1.xyzw will not make use of r0's or r1's z/w components, while DP4 o0.xy, r0.xyzw, r1.xyzw will use all input components regardless of the used destination component mask.

Relative addressing[edit]

There are 3 address registers: a0.x, a0.y and aL (loop counter). For format 1 instructions, when IDX != 0, the value of the corresponding address register is added to SRC1's value. For example, if IDX = 2, a0.y = 3 and SRC1 = c8, then instead SRC1+a0.y = c11 will be used for the instruction. It is only possible to use address registers with vector uniform registers, attempting to use them with input attribute or temporary registers results in the address register being ignored (i.e. read as zero).

a0.x and a0.y are set manually through the MOVA instruction by rounding a float value to integer precision. Hence, they may take negative values.

aL can only be set indirectly by the LOOP instruction. It is still accessible and valid after exiting a LOOP block, though.

Comparison operator[edit]

CMPX/CMPY raw value	Operator name	Expression
0x0	EQ	src1 == src2
0x1	NE	src1 != src2
0x2	LT	src1 < src2
0x3	LE	src1 <= src2
0x4	GT	src1 > src2
0x5	GE	src1 >= src2
0x6	??	true ?
0x7	??	true ?

6 and 7 seem to always return true.

Conditions[edit]

A number of format 2 instructions are executed conditionally. These conditions are based on two boolean registers which can be set with CMP : cmp.x and cmp.y.

Conditional instructions include 3 parameters : CONDOP, REFX and REFY. REFX and REFY are reference values which are tested for equality against cmp.x and cmp.y, respectively. CONDOP describes how the final truth value is constructed from the results of the two tests. There are four conditional expression formats :

CONDOP raw value	Expression	Description
0x0	cmp.x == REFX \|\| cmp.y == REFY	OR
0x1	cmp.x == REFX && cmp.y == REFY	AND
0x2	cmp.x == REFX	X
0x3	cmp.y == REFY	Y

Registers[edit]

Input attribute registers (v0-v7?) store the per-vertex data given by the CPU and hence are read-only.

Output attribute registers (o0-o6) hold the data to be passed to the later GPU stages and are write-only. Each of the output attribute register components is assigned a semantic by setting the corresponding GPU_Internal_Registers.

Uniform registers hold user-specified data which is constant throughout all processed vertices. There are 96 float[4] uniform registers (c0-c95), eight boolean registers (b0-b7), and four int[4] registers (i0-i3).

Temporary registers (r0-r15) can be used for intermediate calculations and can both be read and written.

Many shader instructions which take float arguments have only 5 bits available for the second argument. They may hence only refer to input attributes or temporary registers. In particular, it's not possible to pass two float[4] uniforms to these instructions.

It appears that writing twice to the same output register can cause problems (e.g. GPU hangs).

DST mapping :

DST raw value	Register name	Description
0x0-0x6	o0-o6	Output registers.
0x10-0x1F	r0-r15	Temporary registers.

SRC mapping :

SRC1 raw value	Register name	Description
0x0-0x7	v0-v7	Input attribute registers.
0x10-0x1F	r0-r15	Temporary registers.
0x20-0x7F	c0-c95	Vector uniform registers.

Floating-Point Behavior[edit]

The PICA200 is not IEEE-compliant. It has positive and negative infinities and NaN, but does not seem to have negative 0. Input and output subnormals are flushed to +0. The internal floating point format seems to be the same as used in shader binaries: 1 sign bit, 7 exponent bits, 16 (explicit) mantissa bits. Several instructions also have behavior that differs from the IEEE functions. Here are the results from some tests done on hardware (s = largest subnormal, n = smallest positive normal):

Computation	Result	Notes
inf * 0	0	Including inside MUL, MAD, DP4, etc.
NaN * 0	NaN
+inf - +inf	NaN	Indicates +inf is real inf, not FLT_MAX
rsq(rcp(-inf))	+inf	Indicates that there isn't -0.0.
rcp(-0)	+inf	no -0 so differs from IEEE where rcp(-0) = -inf
rcp(0)	+inf
rcp(+inf)	0
rcp(NaN)	NaN
rsq(-0)	+inf	no -0 so differs from IEEE where rsq(-0) = -inf
rsq(-2)	NaN
rsq(+inf)	0
rsq(-inf)	NaN
rsq(NaN)	NaN
max(0, +inf)	+inf
max(0, -inf)	-inf
max(0, NaN)	NaN	max violates IEEE but match GLSL spec
max(NaN, 0)	0
max(-inf, +inf)	+inf
min(0, +inf)	0
min(0, -inf)	-inf
min(0, NaN)	NaN	min violates IEEE but match GLSL spec
min(NaN, 0)	0
min(-inf, +inf)	-inf
cmp(s, 0)	false	cmp does not flush input subnormals
max(s, 0)	s	max does not flush input or output subnormals
mul(s, 2)	0	input subnormals are flushed in arithmetic instructions
mul(n, 0.5)	0	output subnormals are flushed in arithmetic instructions

1.0 can be multiplied 63 times by 0.5 until the result compares equal zero. This is consistent with a 7-bit exponent and output subnormal flushing.

Control Flow[edit]

Control flow is implemented using four independent stacks:

4-deep CALL stack
8-deep IF stack
4-deep LOOP stack

All stacks are initially empty. After every instruction but before JMP takes effect, the PC is incremented and a copy is sent to each stack. Each stack is checked against its copy of the PC. If an entry is popped from the stack, the copied PC is updated and used for the next check of this stack, although the IF/LOOP stacks can each only pop one entry per instruction, whereas the CALL stack is checked again until it doesn't match or the stack is empty. The updated PC copy with the highest priority wins: LOOP (highest), IF, CALL, JMP, original PC (lowest).

Special cases:

JMP overwrites the PC *after* the stacks checks (and only if no stack was popped).
Executing a BREAK on an empty LOOP stack hangs the GPU.
A stack overflow discards the oldest element, so you could think of it as a queue or a ring buffer.
If the CALL stack is popped four times in a row, the fourth update to its copy of the PC is missed (the third PC update will be propagated). Probably a hardware bug.

Windows 图形显示驱动开发-WDDM 3.0功能- D3D12 视频编码（二）程序员王马 windows图形显示驱动开发驱动开发
D3D12视频编码回调函数驱动程序实现以下回调函数以支持D3D12视频编码。创建表示视频编码器的驱动程序对象：PFND3D12DDI_CALCPRIVATEVIDEOENCODERSIZE_0082_0会计算D3D运行时需要为驱动程序对象分配的内存量。PFND3D12DDI_CREATEVIDEOENCODER_0082_0创建保存视频编码会话状态的实际视频编码器对象。创建表示视频编码器堆的驱动程
Windows 图形显示驱动开发-WDDM 3.2- WDDM 功能的内核模式测试程序员王马 windows图形显示驱动开发驱动开发
概述在某些情况下，引入了基于WDDM或MCDM的新计算设备，并且这些设备的驱动程序不支持D3D运行时。为了帮助验证此类驱动程序，将功能添加到Dxgkrnl，以便仅使用内核模式thunk进行验证;也就是说，无需涉及D3D运行时和用户模式驱动程序（UMD）。此基础结构还允许使用精确设置测试WDDM功能，而无需通过D3D运行时或UMD，这可能会使事情复杂化。引入了DDI，以便在给定的一组命令的内核模式下
UE发生GPU崩溃D3D丢失，真的跟硬件有关系。虚幻叫兽 UE虚幻引擎MetaHuman ue5 GPU崩溃
先说一下我的配置：2022年4月全新台式机。i912900kfDDR560003070ti8G读写7000m的M.2win11，最新显卡驱动，GameReady和Studio都试过。===但是BUT===UE5每天GPU崩溃几十次，UE4比较少见。按说我这配置还可以吧，鲁大师全国排名六百多（4月8日），二百三十多万分，也算够用。但我没说运行哪个UE程序导致的GPU崩溃。也许你看出来了，问题就出在8
《DirectX 12 3D游戏开发实战》读书笔记1：数学基础 tikris 3d 游戏 c++矩阵线性代数
文章目录学习内容内容关于浮点类型误差解决方案参数与D3D数据结构向量类型XMVECTOR与XMFLOATn：XMVECTOR与XMFLOATn的相互转化：取得某个分量或者将某个分量转换为XMVECTOR类型：参数向量特点：表示方法：运算求模：单位化(规范化、标准化等同义)：正交化：加(减)法：乘法：其他函数杂项点常向量矩阵矩阵的传参矩阵的初始化XMMATRIX和XMFLOAT4X4的转换运算矩阵的
dx12 龙书第六章学习笔记 -- 利用Direct3D绘制几何体帅狗狗灬 DirectX 笔记学习 c++游戏
1.顶点与输入布局：除了空间位置,D3D的顶点还可以存储其他属性数据,且D3D允许我们自行构建顶点格式①第一步：创建一个结构体来容纳选定的顶点数据structVertex1{XMFLOAT3Pos;XMFLOAT4Color;};structVertex2{XMFLOAT3Pos;XMFLOAT3Normal;XMFLOAT2Tex0;XMFLOAT2Tex1;};//成员使用XMFLOATn而不
图形世界分裂的两派——理清D3D和OpenGL的脉络 iteye_15898 c/c++数据库游戏
转载自：http://www.iieeg.com/newscon.php?id=8388计算机三维图形是指将用数据描述的三维空间通过计算转换成二维图像并显示或打印出来的技术，API(ApplicationProgrammingInterface)即“应用程序接口”是连接应用程序与操作系统、实现对计算机硬件控制的纽带，Direct3D和OpenGL是目前的两大3D图形API，要在你的3D显卡上进行3
UnityShader实例09:Stencil Buffer&Stencil Test lupeng0330 unity3D shader实例笔记 unity stencil 模板缓冲深度测试 shader
StencilBuffer&StencilTest在开始前先吐槽下unity的官方文档，说实话关于stencil，官方文档真的是可以不要了，除了记流水账般解释了下各个参数的作用，作为例子的shader也是让人一头雾水，整个文档看下来，你发觉stencil是用来干嘛的，怎么操作，仍然不知道。好在unity的shaderlab和D3D，OpenGL等shader语言是一致的，还可以从它们的相关解释来了
贴图问题，opengl，linux，windows，消除锯齿，摩尔纹，yuv 还是 rgb qianbo_insist c++高级技巧音视频和c++java 物联网 ffmpeg linux 运维服务器
1消除锯齿和摩尔纹windows下使用d3d是很方便的，基本不用设置很多东西，就可以做到，所以windows上最好使用d3d。但是linux上有所不同。摩尔条纹是两条线或两个物体之间以恒定的角度和频率发生干涉的视觉结果，而锯齿是在缩小的情况下，画面计算引起，这两个事物都必须消除。使用opengl在linux上做opengl和windows上有所不同吗，事实上，是这样的，我们在渲染的时候，如何做到反
解决UE5出现GPU发生崩溃，或D3D设备已移除獨孤記憶
在cmd处输入regaddHKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\GraphicsDrivers/vTdrDelay/tREG_DWORD/d120/f到英伟达官网下载更新至最新版studio驱动到主板官网下载最新版bios并更新，同时关闭超频操作。
TA百人计划学习笔记 1.1 渲染流水线 yoi啃码磕了牙游戏游戏引擎
源视频链接【技术美术百人计划】图形1.1渲染流水线_哔哩哔哩_bilibilippt1100-渲染管线简介-v4总流程可完全编程控制顶点着色器，曲面细分着色器，几何着色器，片元着色器可配置但不能编程裁剪，逐片元操作GPU固定实现屏幕映射，三角形设置，三角形遍历不可编程不可配置顶点数据，屏幕图像1.应用阶段疑惑点渲染模式2.几何阶段注意投影坐标系有别与其他坐标系，是独立的OpenGL和D3D的差异Z
深入解析d3dcompiler_47.dll文件及其丢失的修复方法 sheng12345678rui windows dll丢失 dll文件 dll dll修复
一、d3dcompiler_47.dll是什么文件？d3dcompiler_47.dll是DirectXSDK中的一个动态链接库文件，它是用于编译DirectX着色器的工具之一。DirectX是由微软公司开发的一种多媒体编程接口，它提供了一系列的API和工具，用于开发游戏和多媒体应用程序。而d3dcompiler_47.dll则是其中的一个重要组件，它负责将着色器代码编译成可执行的程序。二、d3d
学习笔记30——DirectX框架一念白发学习笔记 c++游戏程序游戏引擎
首先，这一节开始就要接触DX了，希望大家能够把前面讲的游戏程序框架、数学基础和渲染管线相关的内容，能够有一个很好的掌握。然后今天正式开启咱们的旅途！这里D3D是需要环境配置的，因为我的环境就是按照X_Jun教程搭建的，所以你直接按照他教程中写的环境配置一步步跟着走就行了新建项目(directx11.tech)我在这里强调几个点：第一如果你要按照我写的教程去学习，这里你暂且不要拿X_Jun的01项目
基于Direct3D实现简单的粒子系统 ntwilford DirectX 学习日记 direct3d null float winapi class parameters
这是一个基于D3D的基本的粒子系统，能够实现一些基本的效果，如：雨、雪、烟花等。代码很少，只有一个头文件和一个CPP文件，便于研究粒子系统的原理。EpParticleSystem.h:#ifndef_EPPARTICLESYSTEM_H_#define_EPPARTICLESYSTEM_H_#include#includestructEpParticle{D3DXVECTOR3pos;floatp
RHCSA-Vim 的用法 Michael_XiaoQ linux RHEL7 linux vim
如何从命令模式进入插入模式：#A/a/O/o/i/Ii：小写是插入光标所在位置前一个字母I:大写是在光标所在行的开始插入a：在光标所在位置的后一个字母开始插入A:在光标所在行的末端插入o：光标所在行的下一行开始插入O：光标所在行的上一行开始插入无论小O还是大O都是光标另起一行命令模式下：x：删除单个字符u：代表撤销undodd：删除光标所在行。---这个删除实际上是剪切3dd/d3d：包括光标在内
SDL2的学习之路＜一＞创建基本窗口 forever_hdm c++游戏学习 c++游戏引擎
SDL2的学习之路工作之余的爱好，自己玩了下几个游戏开发的引擎，也自己基于d3d写了个简单的引擎，还去玩了下UE4这种成熟完善的引擎，玩的多了，记不住了，来记录且分享下，希望跟大家一起交流成长，废话不多，注重简洁明了(我懒)SDL2的下载官网下载地址:http://www.libsdl.org需要注意的是，除了基本的sdl库，还需要另外两个非常有用的库，sdl2_image和sdl2_ttf。这两
pscc2019计算机中丢失d3d,Solved: Photoshop CC 2019 (20.0) crash d3dcompiler_47.dll ... - Adobe Support Comm... 邢哲wanderer
Hiall,Thismorning,ourCreativeCloudsoftwarehavesomeupdates(becauseCC2019),andPhotoshopCCnowdoesn'trun.ItisonWin764bits(version6.1,number7601,servicepack1).IhavethiserroronPhotoshoplaunch:"Can'tstartsof
计算机缺失d3dcompiler_47.dll解决方案,如何修复电脑缺失d3d文件 a555333820 windows dll文件丢失 c++开发语言 dll丢失
在计算机系统中，DLL文件（动态链接库）是一种重要的共享库，它包含了可被多个程序使用的代码和数据。然而，当某些DLL文件丢失或损坏时，可能会导致程序无法正常运行。本文将介绍四种解决D3DCompiler_47.dll缺失的方法。方法1.使用dll修复工具来修复D3DCompiler_47.dll可以通过百度或许微软官网搜索dll修复程序文件或者打开电脑浏览器在浏览器顶部栏目输入：dll修复文件.s
D3D中的模板缓存（1） Jaz_Chu Direct 3d游戏编程测试 direct3d reference less
模板缓存是一个离屏缓存，我们能够用它来完成一些特效。模板缓存与后台缓存和深度缓存有相同的定义，因此在模板缓存中的像素与后台缓存和深度缓存中的像素是相协调的。就象名字所说，模板缓存就象一个模板它允许我们刷新渲染后缓存的某个部分。举例，当要实现一个镜子时，我们只需要简单地反射一个物体细节到镜子平面上；然而，我们仅仅想只绘制镜子里的反射结果。我们能用模板缓存来渲染它，图8.1清楚的显示了这一点。模板缓存
【游戏逆向】D3D HOOK实现透视讲解 douluo998 游戏 3d
实现目的：目前大部分游戏通过Direct3D实现3D效果，通过挂钩相应函数，可以实现3D透视，屏幕挂字效果。而透视，屏蔽特定效果，设置透明在很多游戏（特别是FPS）中发挥着巨大的作用！实现思路：[D3D]DirectX的功能都是以COM组件的形式提供的。在Direct3D中，主要通过采取以下操作来实现编程：调用适当的函数获取接口指针；调用接口的方法（成员函数）来完成所需功能；用完接口后，调用Rel
d3dcompiler_47.dll是什么文件？游戏确实d3dcompiler_47.dll的常用解决方法 2301_77698200 dll修复教程 dll修复游戏 windows
d3dcompiler_47.dll是一个动态链接库（DLL）文件，属于MicrosoftDirectX软件组件的一部分。它主要负责处理DirectX中的图形和多媒体内容，以确保游戏和应用程序能够正常运行。d3dcompiler_47.dll的主要功能是将DirectXAPI转换为特定于硬件的指令，从而实现高效的游戏性能和高质量的图形渲染。在一些基于DirectX11的游戏和应用程序中，如果d3d
191005 七三二十一Q
十点多从宾馆出来，到方山不过十二点多。漫漫下午时光，有雨有风，等着许嵩。漫长下午时光，在游戏风云度过。游戏风云里，头文字D3D，我认识了南京本地车队的老头；劲舞团，见识了各路牛鬼蛇神。南京本地Saturday乐队、布衣乐队、国风专场、许嵩、Hebe都给我留下深刻记忆。等了七八个小时，许嵩出场的那一瞬间，我站在人群后狂喊怒吼，他在的三四十分钟，让我彻底理解了为什么会有那样追星的女孩。中途有个彩蛋，原
正交投影矩阵的推导荆楚闲人 OpenGL 矩阵正投影
目录1.说明2.预备知识3.OpenGL正交投影变换4.D3D正交投影变换5.M3G正交投影变换6.结束语1.说明关于OPenGL透视投影矩阵的推导，参见《OPengGL透视投影矩阵的推导》。2.预备知识之前我们在《深入探索透视投影变换》以及《深入探索透视投影变换（续）》中研究了OpenGL、D3D以及M3G的透视投影变换的原理以及生成方法。这些方法在当前的主流图形API中得到了普遍使用。但关于投
D3D绘制精确到屏幕上每一个像素萧戈 D3D 3d
最近项目上有一个需求，就是要求我们绘制的图像必须精确到屏幕上的每个像素。这就要求我们对d3d的绘制有比较深入的认识，比如：我们已经设置好了3D变换中视图矩阵和投影矩阵（无论是透视变换还是正交变换），此时我们就已经在3D空间中确定了图像的边界（可以根据视图矩阵和投影矩阵计算出图像上下左右边界的位置），我们这里假设边界为left,top,right,bottom。如果此时我们设置好窗口的视口区域，比如
d3dcompiler_47.dll缺失怎么修复，推荐这4个修复方案 2301_77698200 dll修复教程 dll修复 3d microsoft
运行游戏，图片处理软件，如photoshop等计算机报错“由于找不到d3dcompiler_47.dll”是什么原因？d3dcompiler_47.dll是与MicrosoftDirectX相关的动态链接库文件。这个文件通常是游戏或图形相关软件需要的组件之一，缺少它可能会导致应用程序无法正常运行。计算机报错情景如图所示：当您在运行某些应用程序或游戏时计算机报错“无法启动此程序，因为计算机丢失d3d
d3dcompiler_47.dll缺失怎么修复，分享几种快速修复方法 askah6644 电脑经验分享 microsoft dll修复 windows dll
当我们打开电脑软件或许游戏时候，如果电脑计算机中丢失了d3dcompiler_47.dll就会报错，丢失d3dcompiler_47.dll“”或许找不到d3dcompiler_47.dll等等提示。它主要用于编写和编译Direct3D11的着色器程序，是Direct3D11中非常重要的组成部分。很多游戏，图形处理软件比如ps等丢失d3dcompiler_47.dll都会造成无法启动运行。当d3d
VTK学习之激光点云动态库封装（排水管道）光谷码农 VTK c++激光雷达点云管道检测
目前各行各业都应用了激光点云，包括目前非常火的自动驾驶行业，本人目前在排水管道检测行业，因此封装了应用于排水管道的点云库。激光雷达测得点云数据存储下来后，解析出坐标点，然后传递到函数入口中，即可获得三维点云模型。处理点云数据的工具有很多，这里没有直接采用OpenGL和D3D，而选择了封装得比较好，容易上手的vtk，本示例是基于vtk9.0+vs2019，封装好的库使用C#进行调用测试。废话不多说，
Modern Drawcall API 离原春草
突然发现，D3D的RenderAPI中蕴藏了很多可以用于做性能优化的参数，而平时自己对这块的了解不多，基础不是非常扎实，因此专门开一篇文章来进行学习与总结。DXRenderAPI1.DrawPrimitive大概的使用逻辑：//设置vertexbufferg_pd3dDevice->SetStreamSource(0,g_pVB,0,sizeof(CUSTEMVERTEX));//设置顶点格式，此
###2018-09-17安装AirTest出现 failed to create D3D shaders解决三个梨涡三个酒窝
安装AirTest程序后Windows7启动报错如何解决image.pngimage.pngimage.png把显卡驱动更换为VGA图形适配器重启即可解决image.png
D3D中的粒子系统（4） weixin_34258078
14.3具体的粒子系统：雪、火、粒子枪现在让我们用cParticleSystem类开始一个具体的粒子系统，为了说明用意，这些系统的设计很简单，没有用到cParticleSystem类所提供的所有灵活性。我们实现雪、火、粒子枪系统。雪系统模拟下落的雪花，火系统模拟看上去像火焰的爆炸，粒子枪系统从照相机位置向对面发射出粒子（用键盘）。14.3.1例子程序：雪雪系统类定义如下：classcSnow:pu
D3D绘制图元理论基础 pizi0475 Direct3D 图形引擎技术理论图形图像存储数据结构工作 manager 文档 vb
在前面部分，我介绍了D3D的初始化和固定渲染流水线。这一章，将它们用于实践。我们需要解决的事情是：1、在D3D中如何存储顶点和索引数据；2、怎样使用渲染状态来改变渲染结果；3、学习怎样渲染场景；4、学习怎样用D3DXCreate*函数创建更多的3D物体。一、顶点缓冲区、索引缓冲区的概念顶点缓冲区是一块连续的存储了顶点数据的内存。索引缓冲区是一块连续的存储了索引数据的内存。我们使用顶点缓冲区和索引缓
用MiddleGenIDE工具生成hibernate的POJO（根据数据表生成POJO类） AdyZhang POJO eclipse Hibernate MiddleGenIDE
推荐:MiddlegenIDE插件, 是一个Eclipse 插件. 用它可以直接连接到数据库, 根据表按照一定的HIBERNATE规则作出BEAN和对应的XML ，用完后你可以手动删除它加载的JAR包和XML文件! 今天开始试着使用
.9.png Cb123456 android
“点九”是andriod平台的应用软件开发里的一种特殊的图片形式，文件扩展名为：.9.png 　　智能手机中有自动横屏的功能,同一幅界面会在随着手机(或平板电脑)中的方向传感器的参数不同而改变显示的方向,在界面改变方向后,界面上的图形会因为长宽的变化而产生拉伸,造成图形的失真变形。　　我们都知道android平台有多种不同的分辨率，很多控件的切图文件在被放大拉伸后，边
算法的效率天子之骄算法效率复杂度最坏情况运行时间大O阶平均情况运行时间
算法的效率效率是速度和空间消耗的度量。集中考虑程序的速度，也称运行时间或执行时间，用复杂度的阶(O)这一标准来衡量。空间的消耗或需求也可以用大O表示，而且它总是小于或等于时间需求。以下是我的学习笔记： 1.求值与霍纳法则，即为秦九韶公式。 2.测定运行时间的最可靠方法是计数对运行时间有贡献的基本操作的执行次数。运行时间与这个计数成正比。
java数据结构何必如此 java 数据结构
Java 数据结构 Java工具包提供了强大的数据结构。在Java中的数据结构主要包括以下几种接口和类：枚举（Enumeration）位集合（BitSet）向量（Vector）栈（Stack）字典（Dictionary）哈希表（Hashtable）属性（Properties）以上这些类是传统遗留的，在Java2中引入了一种新的框架-集合框架(Collect
MybatisHelloWorld 3213213333332132
//测试入口TestMyBatis package com.base.helloworld.test; import java.io.IOException; import org.apache.ibatis.io.Resources; import org.apache.ibatis.session.SqlSession; import org.apache.ibat
Java|urlrewrite|URL重写|多个参数 7454103 java xml Web 工作
个人工作经验！如有不当之处，敬请指点 1.0 web -info 目录下建立 urlrewrite.xml 文件类似如下： <?xml version="1.0" encoding="UTF-8" ?> <!DOCTYPE u
达梦数据库+ibatis darkranger sql mysql ibatis SQL Server
--插入数据方面如果您需要数据库自增... 那么在插入的时候不需要指定自增列. 如果想自己指定ID列的值, 那么要设置 set identity_insert 数据库名.模式名.表名; ----然后插入数据; example: create table zhabei.test( id bigint identity(1,1) primary key, nam
XML 解析四种方式 aijuans android
XML现在已经成为一种通用的数据交换格式,平台的无关性使得很多场合都需要用到XML。本文将详细介绍用Java解析XML的四种方法。 XML现在已经成为一种通用的数据交换格式,它的平台无关性,语言无关性,系统无关性,给数据集成与交互带来了极大的方便。对于XML本身的语法知识与技术细节,需要阅读相关的技术文献,这里面包括的内容有DOM(Document Object
spring中配置文件占位符的使用 avords
1.类 <?xml version="1.0" encoding="UTF-8"?><!DOCTYPE beans PUBLIC "-//SPRING//DTD BEAN//EN" "http://www.springframework.o
前端工程化-公共模块的依赖和常用的工作流 bee1314 webpack
题记：一个人的项目，还有工程化的问题嘛？我们在推进模块化和组件化的过程中，肯定会不断的沉淀出我们项目的模块和组件。对于这些沉淀出的模块和组件怎么管理？另外怎么依赖也是个问题？你真的想这样嘛？ var BreadCrumb = require(‘../../../../uikit/breadcrumb’); //真心ugly。
上司说「看你每天准时下班就知道你工作量不饱和」，该如何回应？ bijian1013 项目管理沟通 IT职业规划
问题：上司说「看你每天准时下班就知道你工作量不饱和」，如何回应正常下班时间6点，只要是6点半前下班的，上司都认为没有加班。 Eno-Bea回答，注重感受，不一定是别人的虽然我不知道你具体从事什么工作与职业，但是我大概猜测，你是从事一项不太容易出现阶段性成果的工作
TortoiseSVN，过滤文件征客丶 SVN
环境： TortoiseSVN 1.8 配置：在文件夹空白处右键选择 TortoiseSVN -> Settings 在 Global ignote pattern 中添加要过滤的文件：多类型用英文空格分开 *name ：过滤所有名称为 name 的文件或文件夹 *.name ：过滤所有后缀为 name 的文件或文件夹 --------
【Flume二】HDFS sink细说 bit1129 Flume
1. Flume配置 a1.sources=r1 a1.channels=c1 a1.sinks=k1 ###Flume负责启动44444端口 a1.sources.r1.type=avro a1.sources.r1.bind=0.0.0.0 a1.sources.r1.port=44444 a1.sources.r1.chan
The Eight Myths of Erlang Performance bookjovi erlang
erlang有一篇guide很有意思： http://www.erlang.org/doc/efficiency_guide 里面有个The Eight Myths of Erlang Performance： http://www.erlang.org/doc/efficiency_guide/myths.html Myth: Funs are sl
java多线程网络传输文件(非同步)-2008-08-17 ljy325 java 多线程 socket
利用 Socket 套接字进行面向连接通信的编程。客户端读取本地文件并发送；服务器接收文件并保存到本地文件系统中。使用说明:请将TransferClient, TransferServer, TempFile三个类编译，他们的类包是FileServer. 客户端: 修改TransferClient: serPort, serIP, filePath, blockNum,的值来符合您机器的系
读《研磨设计模式》-代码笔记-模板方法模式 bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ import java.sql.Connection; import java.sql.DriverManager; import java.sql.PreparedStatement; import java.sql.ResultSet;
配置心得 chenyu19891124 配置
时间就这样不知不觉的走过了一个春夏秋冬，转眼间来公司已经一年了，感觉时间过的很快，时间老人总是这样不停走，从来没停歇过。作为一名新手的配置管理员，刚开始真的是对配置管理是一点不懂，就只听说咱们公司配置主要是负责升级，而具体该怎么做却一点都不了解。经过老员工的一点点讲解，慢慢的对配置有了初步了解，对自己所在的岗位也慢慢的了解。做了一年的配置管理给自总结下： 1.改变从一个以前对配置毫无
对“带条件选择的并行汇聚路由问题”的再思考 comsci 算法工作软件测试嵌入式领域模型
2008年上半年，我在设计并开发基于”JWFD流程系统“的商业化改进型引擎的时候，由于采用了新的嵌入式公式模块而导致出现“带条件选择的并行汇聚路由问题”(请参考2009-02-27博文)，当时对这个问题的解决办法是采用基于拓扑结构的处理思想，对汇聚点的实际前驱分支节点通过算法预测出来，然后进行处理，简单的说就是找到造成这个汇聚模型的分支起点，对这个起始分支节点实际走的路径数进行计算，然后把这个实际
Oracle 10g 的clusterware 32位下载地址 daizj oracle
Oracle 10g 的clusterware 32位下载地址 http://pan.baidu.com/share/link?shareid=531580&uk=421021908 http://pan.baidu.com/share/link?shareid=137223&uk=321552738 http://pan.baidu.com/share/l
非常好的介绍：Linux定时执行工具cron dongwei_6688 linux
Linux经过十多年的发展，很多用户都很了解Linux了，这里介绍一下Linux下cron的理解，和大家讨论讨论。cron是一个Linux 定时执行工具，可以在无需人工干预的情况下运行作业，本文档不讲cron实现原理，主要讲一下Linux定时执行工具cron的具体使用及简单介绍。新增调度任务推荐使用crontab -e命令添加自定义的任务（编辑的是/var/spool/cron下对应用户的cr
Yii assets目录生成及修改 dcj3sjt126com yii
assets的作用是方便模块化，插件化的，一般来说出于安全原因不允许通过url访问protected下面的文件，但是我们又希望将module单独出来，所以需要使用发布，即将一个目录下的文件复制一份到assets下面方便通过url访问。 assets设置对应的方法位置 \framework\web\CAssetManager.php assets配置方法在m
mac工作软件推荐 dcj3sjt126com mac
mac上的Terminal + bash ＋ screen组合现在已经非常好用了，但是还是经不起iterm＋zsh＋tmux的冲击。在同事的强烈推荐下，趁着升级mac系统的机会，顺便也切换到iterm＋zsh＋tmux的环境下了。我为什么要要iterm2 切换过来也是脑袋一热的冲动，我也调查过一些资料，看了下iterm的一些优点： * 兼容性好，远程服务器 vi 什么的低版本能很好兼
Memcached(三)、封装Memcached和Ehcache frank1234 memcached ehcache spring ioc
本文对Ehcache和Memcached进行了简单的封装，这样对于客户端程序无需了解ehcache和memcached的差异，仅需要配置缓存的Provider类就可以在二者之间进行切换，Provider实现类通过Spring IoC注入。 cache.xml <?xml version="1.0" encoding="UTF-8"?>
Remove Duplicates from Sorted List II hcx2013 remove
Given a sorted linked list, delete all nodes that have duplicate numbers, leaving only distinct numbers from the original list. For example,Given 1->2->3->3->4->4->5,
Spring4新特性——注解、脚本、任务、MVC等其他特性改进 jinnianshilongnian spring4
Spring4新特性——泛型限定式依赖注入 Spring4新特性——核心容器的其他改进 Spring4新特性——Web开发的增强 Spring4新特性——集成Bean Validation 1.1(JSR-349)到SpringMVC Spring4新特性——Groovy Bean定义DSL Spring4新特性——更好的Java泛型操作API Spring4新
MySQL安装文档 liyong0802 mysql
工作中用到的MySQL可能安装在两种操作系统中，即Windows系统和Linux系统。以Linux系统中情况居多。安装在Windows系统时与其它Windows应用程序相同按照安装向导一直下一步就即，这里就不具体介绍，本文档只介绍Linux系统下MySQL的安装步骤。 Linux系统下安装MySQL分为三种：RPM包安装、二进制包安装和源码包安装。二
使用VS2010构建HotSpot工程 p2p2500 HotSpot OpenJDK VS2010
1. 下载OpenJDK7的源码： http://download.java.net/openjdk/jdk7 http://download.java.net/openjdk/ 2. 环境配置 ▶
Oracle实用功能之分组后列合并 seandeng888 oracle 分组实用功能合并
1 实例解析由于业务需求需要对表中的数据进行分组后进行合并的处理，鉴于Oracle10g没有现成的函数实现该功能，且该功能如若用JAVA代码实现会比较复杂，因此，特将SQL语言的实现方式分享出来，希望对大家有所帮助。如下：表test 数据如下： ID,SUBJECTCODE,DIMCODE,VALUE 1&nbs
Java定时任务注解方式实现 tuoni java spring jvm xml jni
Spring 注解的定时任务，有如下两种方式：第一种： <?xml version="1.0" encoding="UTF-8"?> <beans xmlns="http://www.springframework.org/schema/beans" xmlns:xsi="http
11大Java开源中文分词器的使用方法和分词效果对比 yangshangchuan word分词器 ansj分词器 Stanford分词器 FudanNLP分词器 HanLP分词器
本文的目标有两个： 1、学会使用11大Java开源中文分词器 2、对比分析11大Java开源中文分词器的分词效果本文给出了11大Java开源中文分词的使用方法以及分词结果对比代码，至于效果哪个好，那要用的人结合自己的应用场景自己来判断。 11大Java开源中文分词器，不同的分词器有不同的用法，定义的接口也不一样，我们先定义一个统一的接口： /** * 获取文本的所有分词结果, 对比