Compact Normal Storage for Small G-Buffers

http://aras-p.info/texts/CompactNormalStorage.html

Intro

Various deferred shading/lighting approaches or image postprocessing effects need to store normals as part of their G-buffer. Let’s figure out a compact storage method for view space normals. In my case, main target is minimalist G-buffer, where depth and normals are packed into a single 32 bit (8 bits/channel) render texture. I try to minimize error and shader cycles to encode/decode.

Now of course, 8 bits/channel storage for normals can be not enough for shading, especially if you want specular (low precision & quantization leads to specular “wobble” when camera or objects move). However, everything below should Just Work (tm) for 10 or 16 bits/channel integer formats. For 16 bits/channel half-float formats, some of the computations are not necessary (e.g. bringing normal values into 0..1 range).

If you know other ways to store/encode normals, please let me know in the comments!

Various normal encoding methods and their comparison below. Notes:

Error images are: 1-pow(dot(n1,n2),1024) and abs(n1-n2)*30, where n1 is actual normal, andn2 is normal encoded into a texture, read back & decoded. MSE and PSNR is computed on the difference (abs(n1-n2)) image.
Shader code is HLSL. Compiled into ps_3_0 by d3dx9_42.dll (February 2010 SDK).
Radeon GPU performance numbers from AMD’s GPU ShaderAnalyzer 1.53, using Catalyst 9.12 driver.
GeForce GPU performance numbers from NVIDIA’s NVShaderPerf 2.0, using 174.74 driver.

Note: there was an error!

Original version of my article had some stupidity: encoding shaders did not normalize the incoming per-vertex normal. This resulted in quality evaluation results being somewhat wrong. Also, if normal is assumed to be normalized, then three methods in original article (Sphere Map, Cry Engine 3 and Lambert Azimuthal) are in fact completely equivalent. The old version is still available for the sake of integrity of the internets.

Test Playground Application

Here is a small Windows application I used to test everything below: NormalEncodingPlayground.zip(4.8MB, source included).

It requires GPU with Shader Model 3.0 support. When it writes fancy shader reports, it expects AMD’s GPUShaderAnalyzer and NVIDIA’s NVShaderPerf to be installed. Source code should build with Visual C++ 2008.

Baseline: store X&Y&Z

Just to set the basis, store all three components of the normal. It’s not suitable for our quest, but I include it here to evaluate “base” encoding error (which happens here only because of quantization to 8 bits per component).

Encoding, Error to Power, Error * 30 images below. MSE: 0.000008; PSNR: 51.081 dB.

Encoding	Decoding
half4 encode (half3 n, float3 view) { return half4(n.xyz*0.5+0.5,0); }	half3 decode (half4 enc, float3 view) { return enc.xyz*2-1; }
ps_3_0 def c0, 0.5, 0, 0, 0 dcl_texcoord_pp v0.xyz mad_pp oC0, v0.xyzx, c0.xxxy, c0.xxxy	ps_3_0 def c0, 2, -1, 0, 0 dcl_texcoord2 v0.xy dcl_2d s0 texld_pp r0, v0, s0 mad_pp oC0.xyz, r0, c0.x, c0.y mov_pp oC0.w, c0.z
1 ALU Radeon HD 2400: 1 GPR, 1.00 clk Radeon HD 3870: 1 GPR, 1.00 clk Radeon HD 5870: 1 GPR, 0.50 clk GeForce 6200: 1 GPR, 1.00 clk GeForce 7800GT: 1 GPR, 1.00 clk GeForce 8800GTX: 6 GPR, 8.00 clk	2 ALU, 1 TEX Radeon HD 2400: 1 GPR, 1.00 clk Radeon HD 3870: 1 GPR, 1.00 clk Radeon HD 5870: 1 GPR, 0.50 clk GeForce 6200: 1 GPR, 1.00 clk GeForce 7800GT: 1 GPR, 1.00 clk GeForce 8800GTX: 6 GPR, 10.00 clk

Encoding

Decoding

half4 encode (half3 n, float3 view)

{

    return half4(n.xyz*0.5+0.5,0);

}

half3 decode (half4 enc, float3 view)

{

    return enc.xyz*2-1;

}

ps_3_0

def c0, 0.5, 0, 0, 0

dcl_texcoord_pp v0.xyz

mad_pp oC0, v0.xyzx, c0.xxxy, c0.xxxy

ps_3_0

def c0, 2, -1, 0, 0

dcl_texcoord2 v0.xy

dcl_2d s0

texld_pp r0, v0, s0

mad_pp oC0.xyz, r0, c0.x, c0.y

mov_pp oC0.w, c0.z

1 ALU

Radeon HD 2400: 1 GPR, 1.00 clk

Radeon HD 3870: 1 GPR, 1.00 clk

Radeon HD 5870: 1 GPR, 0.50 clk

GeForce 6200: 1 GPR, 1.00 clk

GeForce 7800GT: 1 GPR, 1.00 clk

GeForce 8800GTX: 6 GPR, 8.00 clk

2 ALU, 1 TEX

Radeon HD 2400: 1 GPR, 1.00 clk

Radeon HD 3870: 1 GPR, 1.00 clk

Radeon HD 5870: 1 GPR, 0.50 clk

GeForce 6200: 1 GPR, 1.00 clk

GeForce 7800GT: 1 GPR, 1.00 clk

GeForce 8800GTX: 6 GPR, 10.00 clk

Method #1: store X&Y, reconstruct Z

Used by Killzone 2 among others (PDF link).

Encoding, Error to Power, Error * 30 images below. MSE: 0.013514; PSNR: 18.692 dB.

Pros:

Very simple to encode/decode

Cons:

Normal can point away from the camera. My test scene setup actually has that. See Resistance 2 Prelighting paper (PDF link) for explanation.

Encoding	Decoding
half4 encode (half3 n, float3 view) { return half4(n.xy*0.5+0.5,0,0); }	half3 decode (half2 enc, float3 view) { half3 n; n.xy = enc*2-1; n.z = sqrt(1-dot(n.xy, n.xy)); return n; }
ps_3_0 def c0, 0.5, 0, 0, 0 dcl_texcoord_pp v0.xy mad_pp oC0, v0.xyxx, c0.xxyy, c0.xxyy	ps_3_0 def c0, 2, -1, 1, 0 dcl_texcoord2 v0.xy dcl_2d s0 texld_pp r0, v0, s0 mad_pp r0.xy, r0, c0.x, c0.y dp2add_pp r0.z, r0, -r0, c0.z mov_pp oC0.xy, r0 rsq_pp r0.x, r0.z rcp_pp oC0.z, r0.x mov_pp oC0.w, c0.w
1 ALU Radeon HD 2400: 1 GPR, 1.00 clk Radeon HD 3870: 1 GPR, 1.00 clk Radeon HD 5870: 1 GPR, 0.50 clk GeForce 6200: 1 GPR, 1.00 clk GeForce 7800GT: 1 GPR, 1.00 clk GeForce 8800GTX: 5 GPR, 7.00 clk	7 ALU, 1 TEX Radeon HD 2400: 1 GPR, 1.00 clk Radeon HD 3870: 1 GPR, 1.00 clk Radeon HD 5870: 1 GPR, 0.50 clk GeForce 6200: 1 GPR, 4.00 clk GeForce 7800GT: 1 GPR, 3.00 clk GeForce 8800GTX: 5 GPR, 15.00 clk

Encoding

Decoding

half4 encode (half3 n, float3 view)

{

    return half4(n.xy*0.5+0.5,0,0);

}

half3 decode (half2 enc, float3 view)

{

    half3 n;

    n.xy = enc*2-1;

    n.z = sqrt(1-dot(n.xy, n.xy));

    return n;

}

ps_3_0

def c0, 0.5, 0, 0, 0

dcl_texcoord_pp v0.xy

mad_pp oC0, v0.xyxx, c0.xxyy, c0.xxyy

ps_3_0

def c0, 2, -1, 1, 0

dcl_texcoord2 v0.xy

dcl_2d s0

texld_pp r0, v0, s0

mad_pp r0.xy, r0, c0.x, c0.y

dp2add_pp r0.z, r0, -r0, c0.z

mov_pp oC0.xy, r0

rsq_pp r0.x, r0.z

rcp_pp oC0.z, r0.x

mov_pp oC0.w, c0.w

1 ALU

Radeon HD 2400: 1 GPR, 1.00 clk

Radeon HD 3870: 1 GPR, 1.00 clk

Radeon HD 5870: 1 GPR, 0.50 clk

GeForce 6200: 1 GPR, 1.00 clk

GeForce 7800GT: 1 GPR, 1.00 clk

GeForce 8800GTX: 5 GPR, 7.00 clk

7 ALU, 1 TEX

Radeon HD 2400: 1 GPR, 1.00 clk

Radeon HD 3870: 1 GPR, 1.00 clk

Radeon HD 5870: 1 GPR, 0.50 clk

GeForce 6200: 1 GPR, 4.00 clk

GeForce 7800GT: 1 GPR, 3.00 clk

GeForce 8800GTX: 5 GPR, 15.00 clk

Method #3: Spherical Coordinates

It is possible to use spherical coordinates to encode the normal. Since we know it’s unit length, we can just store the two angles.

Suggested by Pat Wilson of Garage Games: GG blog post. Other mentions: MJP’s blog, GarageGames thread, Wolf Engel’s blog, gamedev.net forum thread.

Encoding, Error to Power, Error * 30 images below. MSE: 0.000062; PSNR: 42.042 dB.

Pros:

Suitable for normals in general (not necessarily view space)

Cons:

Uses trig instructions (quite heavy on ALU). Possible to replace some of that with texture lookups though.

Encoding	Decoding
#define kPI 3.1415926536f half4 encode (half3 n, float3 view) { return half4( (half2(atan2(n.y,n.x)/kPI, n.z)+1.0)*0.5, 0,0); }	half3 decode (half2 enc, float3 view) { half2 ang = enc2-1; half2 scth; sincos(ang.x kPI, scth.x, scth.y); half2 scphi = half2(sqrt(1.0 - ang.yang.y), ang.y); return half3(scth.yscphi.x, scth.x*scphi.x, scphi.y); }
ps_3_0 def c0, 0.999866009, 0, 1, 3.14159274 def c1, 0.0208350997, -0.0851330012, 0.180141002, -0.330299497 def c2, -2, 1.57079637, 0.318309873, 0.5 dcl_texcoord_pp v0.xyz add_pp r0.xy, -v0_abs, v0_abs.yxzw cmp_pp r0.xz, r0.x, v0_abs.xyyw, v0_abs.yyxw cmp_pp r0.y, r0.y, c0.y, c0.z rcp_pp r0.z, r0.z mul_pp r0.x, r0.x, r0.z mul_pp r0.z, r0.x, r0.x mad_pp r0.w, r0.z, c1.x, c1.y mad_pp r0.w, r0.z, r0.w, c1.z mad_pp r0.w, r0.z, r0.w, c1.w mad_pp r0.z, r0.z, r0.w, c0.x mul_pp r0.x, r0.x, r0.z mad_pp r0.z, r0.x, c2.x, c2.y mad_pp r0.x, r0.z, r0.y, r0.x cmp_pp r0.y, v0.x, -c0.y, -c0.w add_pp r0.x, r0.x, r0.y add_pp r0.y, r0.x, r0.x add_pp r0.z, -v0.x, v0.y cmp_pp r0.zw, r0.z, v0.xyxy, v0.xyyx cmp_pp r0.zw, r0, c0.xyyz, c0.xyzy mul_pp r0.z, r0.w, r0.z mad_pp r0.x, r0.z, -r0.y, r0.x mul_pp r0.x, r0.x, c2.z mov_pp r0.y, v0.z add_pp r0.xy, r0, c0.z mul_pp oC0.xy, r0, c2.w mov_pp oC0.zw, c0.y	ps_3_0 def c0, 2, -1, 0.5, 1 def c1, 6.28318548, -3.14159274, 1, 0 dcl_texcoord2 v0.xy dcl_2d s0 texld_pp r0, v0, s0 mad_pp r0.xy, r0, c0.x, c0.y mad r0.x, r0.x, c0.z, c0.z frc r0.x, r0.x mad r0.x, r0.x, c1.x, c1.y sincos_pp r1.xy, r0.x mad_pp r0.x, r0.y, -r0.y, c0.w mul_pp oC0.zw, r0.y, c1 rsq_pp r0.x, r0.x rcp_pp r0.x, r0.x mul_pp oC0.xy, r1, r0.x
26 ALU Radeon HD 2400: 1 GPR, 17.00 clk Radeon HD 3870: 1 GPR, 4.25 clk Radeon HD 5870: 2 GPR, 0.95 clk GeForce 6200: 2 GPR, 12.00 clk GeForce 7800GT: 2 GPR, 9.00 clk GeForce 8800GTX: 9 GPR, 43.00 clk	17 ALU, 1 TEX Radeon HD 2400: 1 GPR, 17.00 clk Radeon HD 3870: 1 GPR, 4.25 clk Radeon HD 5870: 2 GPR, 0.95 clk GeForce 6200: 2 GPR, 7.00 clk GeForce 7800GT: 1 GPR, 5.00 clk GeForce 8800GTX: 6 GPR, 23.00 clk

Encoding

Decoding

#define kPI 3.1415926536f

half4 encode (half3 n, float3 view)

{

    return half4(

      (half2(atan2(n.y,n.x)/kPI, n.z)+1.0)*0.5,

      0,0);

}

half3 decode (half2 enc, float3 view)

{

    half2 ang = enc*2-1;

    half2 scth;

    sincos(ang.x * kPI, scth.x, scth.y);

    half2 scphi = half2(sqrt(1.0 - ang.y*ang.y), ang.y);

    return half3(scth.y*scphi.x, scth.x*scphi.x, scphi.y);

}

ps_3_0

def c0, 0.999866009, 0, 1, 3.14159274

def c1, 0.0208350997, -0.0851330012,

    0.180141002, -0.330299497

def c2, -2, 1.57079637, 0.318309873, 0.5

dcl_texcoord_pp v0.xyz

add_pp r0.xy, -v0_abs, v0_abs.yxzw

cmp_pp r0.xz, r0.x, v0_abs.xyyw, v0_abs.yyxw

cmp_pp r0.y, r0.y, c0.y, c0.z

rcp_pp r0.z, r0.z

mul_pp r0.x, r0.x, r0.z

mul_pp r0.z, r0.x, r0.x

mad_pp r0.w, r0.z, c1.x, c1.y

mad_pp r0.w, r0.z, r0.w, c1.z

mad_pp r0.w, r0.z, r0.w, c1.w

mad_pp r0.z, r0.z, r0.w, c0.x

mul_pp r0.x, r0.x, r0.z

mad_pp r0.z, r0.x, c2.x, c2.y

mad_pp r0.x, r0.z, r0.y, r0.x

cmp_pp r0.y, v0.x, -c0.y, -c0.w

add_pp r0.x, r0.x, r0.y

add_pp r0.y, r0.x, r0.x

add_pp r0.z, -v0.x, v0.y

cmp_pp r0.zw, r0.z, v0.xyxy, v0.xyyx

cmp_pp r0.zw, r0, c0.xyyz, c0.xyzy

mul_pp r0.z, r0.w, r0.z

mad_pp r0.x, r0.z, -r0.y, r0.x

mul_pp r0.x, r0.x, c2.z

mov_pp r0.y, v0.z

add_pp r0.xy, r0, c0.z

mul_pp oC0.xy, r0, c2.w

mov_pp oC0.zw, c0.y

ps_3_0

def c0, 2, -1, 0.5, 1

def c1, 6.28318548, -3.14159274, 1, 0

dcl_texcoord2 v0.xy

dcl_2d s0

texld_pp r0, v0, s0

mad_pp r0.xy, r0, c0.x, c0.y

mad r0.x, r0.x, c0.z, c0.z

frc r0.x, r0.x

mad r0.x, r0.x, c1.x, c1.y

sincos_pp r1.xy, r0.x

mad_pp r0.x, r0.y, -r0.y, c0.w

mul_pp oC0.zw, r0.y, c1

rsq_pp r0.x, r0.x

rcp_pp r0.x, r0.x

mul_pp oC0.xy, r1, r0.x

26 ALU

Radeon HD 2400: 1 GPR, 17.00 clk

Radeon HD 3870: 1 GPR, 4.25 clk

Radeon HD 5870: 2 GPR, 0.95 clk

GeForce 6200: 2 GPR, 12.00 clk

GeForce 7800GT: 2 GPR, 9.00 clk

GeForce 8800GTX: 9 GPR, 43.00 clk

17 ALU, 1 TEX

Radeon HD 2400: 1 GPR, 17.00 clk

Radeon HD 3870: 1 GPR, 4.25 clk

Radeon HD 5870: 2 GPR, 0.95 clk

GeForce 6200: 2 GPR, 7.00 clk

GeForce 7800GT: 1 GPR, 5.00 clk

GeForce 8800GTX: 6 GPR, 23.00 clk

Method #4: Spheremap Transform

Spherical environment mapping (indirectly) maps reflection vector to a texture coordinate in [0..1] range. The reflection vector can point away from the camera, just like our view space normals. Bingo! See Siggraph 99 notes for sphere map math. Normal we want to encode is R, resulting values are (s,t).

If we assume that incoming normal is normalized, then there are methods derived from elsewhere that end up being exactly equivalent:

Used in Cry Engine 3, presented by Martin Mittring in “A bit more Deferred” presentation (PPT link, slide 13). For Unity, I had to negate Z component of view space normal to produce good results, I guess Unity’s and Cry Engine’s coordinate systems are different. The code would be:

half2 encode (half3 n, float3 view)

{

    half2 enc = normalize(n.xy) * (sqrt(-n.z*0.5+0.5));

    enc = enc*0.5+0.5;

    return enc;

}

half3 decode (half4 enc, float3 view)

{

    half4 nn = enc*half4(2,2,0,0) + half4(-1,-1,1,-1);

    half l = dot(nn.xyz,-nn.xyw);

    nn.z = l;

    nn.xy *= sqrt(l);

    return nn.xyz * 2 + half3(0,0,-1);

}

Lambert Azimuthal Equal-Area projection (Wikipedia link). Suggested by Sean Barrett in comments for this article. The code would be:

half2 encode (half3 n, float3 view)

{

    half f = sqrt(8*n.z+8);

    return n.xy / f + 0.5;

}

half3 decode (half4 enc, float3 view)

{

    half2 fenc = enc*4-2;

    half f = dot(fenc,fenc);

    half g = sqrt(1-f/4);

    half3 n;

    n.xy = fenc*g;

    n.z = 1-f/2;

    return n;

}

Encoding, Error to Power, Error * 30 images below. MSE: 0.000016; PSNR: 48.071 dB.

Pros:

Quality pretty good!
Quite cheap to encode/decode.
Similar derivation used by Cry Engine 3, so it must be good :)

Cons:

Encoding	Decoding
half4 encode (half3 n, float3 view) { half p = sqrt(n.z*8+8); return half4(n.xy/p + 0.5,0,0); }	half3 decode (half2 enc, float3 view) { half2 fenc = enc4-2; half f = dot(fenc,fenc); half g = sqrt(1-f/4); half3 n; n.xy = fencg; n.z = 1-f/2; return n; }
ps_3_0 def c0, 8, 0.5, 0, 0 dcl_texcoord_pp v0.xyz mad_pp r0.x, v0.z, c0.x, c0.x rsq_pp r0.x, r0.x mad_pp oC0.xy, v0, r0.x, c0.y mov_pp oC0.zw, c0.z	ps_3_0 def c0, 4, -2, 0, 1 def c1, 0.25, 0.5, 1, 0 dcl_texcoord2 v0.xy dcl_2d s0 texld_pp r0, v0, s0 mad_pp r0.xy, r0, c0.x, c0.y dp2add_pp r0.z, r0, r0, c0.z mad_pp r0.zw, r0.z, -c1.xyxy, c1.z rsq_pp r0.z, r0.z mul_pp oC0.zw, r0.w, c0.xywz rcp_pp r0.z, r0.z mul_pp oC0.xy, r0, r0.z
4 ALU Radeon HD 2400: 2 GPR, 3.00 clk Radeon HD 3870: 2 GPR, 1.00 clk Radeon HD 5870: 2 GPR, 0.50 clk GeForce 6200: 1 GPR, 4.00 clk GeForce 7800GT: 1 GPR, 2.00 clk GeForce 8800GTX: 5 GPR, 12.00 clk	8 ALU, 1 TEX Radeon HD 2400: 2 GPR, 3.00 clk Radeon HD 3870: 2 GPR, 1.00 clk Radeon HD 5870: 2 GPR, 0.50 clk GeForce 6200: 1 GPR, 6.00 clk GeForce 7800GT: 1 GPR, 3.00 clk GeForce 8800GTX: 6 GPR, 15.00 clk

Encoding

Decoding

half4 encode (half3 n, float3 view)

{

    half p = sqrt(n.z*8+8);

    return half4(n.xy/p + 0.5,0,0);

}

half3 decode (half2 enc, float3 view)

{

    half2 fenc = enc*4-2;

    half f = dot(fenc,fenc);

    half g = sqrt(1-f/4);

    half3 n;

    n.xy = fenc*g;

    n.z = 1-f/2;

    return n;

}

ps_3_0

def c0, 8, 0.5, 0, 0

dcl_texcoord_pp v0.xyz

mad_pp r0.x, v0.z, c0.x, c0.x

rsq_pp r0.x, r0.x

mad_pp oC0.xy, v0, r0.x, c0.y

mov_pp oC0.zw, c0.z

ps_3_0

def c0, 4, -2, 0, 1

def c1, 0.25, 0.5, 1, 0

dcl_texcoord2 v0.xy

dcl_2d s0

texld_pp r0, v0, s0

mad_pp r0.xy, r0, c0.x, c0.y

dp2add_pp r0.z, r0, r0, c0.z

mad_pp r0.zw, r0.z, -c1.xyxy, c1.z

rsq_pp r0.z, r0.z

mul_pp oC0.zw, r0.w, c0.xywz

rcp_pp r0.z, r0.z

mul_pp oC0.xy, r0, r0.z

4 ALU

Radeon HD 2400: 2 GPR, 3.00 clk

Radeon HD 3870: 2 GPR, 1.00 clk

Radeon HD 5870: 2 GPR, 0.50 clk

GeForce 6200: 1 GPR, 4.00 clk

GeForce 7800GT: 1 GPR, 2.00 clk

GeForce 8800GTX: 5 GPR, 12.00 clk

8 ALU, 1 TEX

Radeon HD 2400: 2 GPR, 3.00 clk

Radeon HD 3870: 2 GPR, 1.00 clk

Radeon HD 5870: 2 GPR, 0.50 clk

GeForce 6200: 1 GPR, 6.00 clk

GeForce 7800GT: 1 GPR, 3.00 clk

GeForce 8800GTX: 6 GPR, 15.00 clk

Method #7: Stereographic Projection

What the title says: use Stereographic Projection (Wikipedia link), plus rescaling so that “practically visible” range of normals maps into unit circle (regular stereographic projection maps sphere to circle of infinite size). In my tests, scaling factor of 1.7777 produced best results; in practice it depends on FOV used and how much do you care about normals that point away from the camera.

Suggested by Sean Barrett and Ignacio Castano in comments for this article.

Encoding, Error to Power, Error * 30 images below. MSE: 0.000038; PSNR: 44.147 dB.

Pros:

Quality pretty good!
Quite cheap to encode/decode.

Cons:

Encoding	Decoding
half4 encode (half3 n, float3 view) { half scale = 1.7777; half2 enc = n.xy / (n.z+1); enc /= scale; enc = enc*0.5+0.5; return half4(enc,0,0); }	half3 decode (half4 enc, float3 view) { half scale = 1.7777; half3 nn = enc.xyzhalf3(2scale,2scale,0) + half3(-scale,-scale,1); half g = 2.0 / dot(nn.xyz,nn.xyz); half3 n; n.xy = gnn.xy; n.z = g-1; return n; }
ps_3_0 def c0, 1, 0.281262308, 0.5, 0 dcl_texcoord_pp v0.xyz add_pp r0.x, c0.x, v0.z rcp r0.x, r0.x mul_pp r0.xy, r0.x, v0 mad_pp oC0.xy, r0, c0.y, c0.z mov_pp oC0.zw, c0.w	ps_3_0 def c0, 3.55539989, 0, -1.77769995, 1 def c1, 2, -1, 0, 0 dcl_texcoord2 v0.xy dcl_2d s0 texld_pp r0, v0, s0 mad_pp r0.xyz, r0, c0.xxyw, c0.zzww dp3_pp r0.z, r0, r0 rcp r0.z, r0.z add_pp r0.w, r0.z, r0.z mad_pp oC0.z, r0.z, c1.x, c1.y mul_pp oC0.xy, r0, r0.w mov_pp oC0.w, c0.y
5 ALU Radeon HD 2400: 2 GPR, 4.00 clk Radeon HD 3870: 2 GPR, 1.00 clk Radeon HD 5870: 2 GPR, 0.50 clk GeForce 6200: 1 GPR, 2.00 clk GeForce 7800GT: 1 GPR, 2.00 clk GeForce 8800GTX: 5 GPR, 12.00 clk	7 ALU, 1 TEX Radeon HD 2400: 2 GPR, 4.00 clk Radeon HD 3870: 2 GPR, 1.00 clk Radeon HD 5870: 2 GPR, 0.50 clk GeForce 6200: 1 GPR, 4.00 clk GeForce 7800GT: 1 GPR, 4.00 clk GeForce 8800GTX: 6 GPR, 12.00 clk

Encoding

Decoding

half4 encode (half3 n, float3 view)

{

    half scale = 1.7777;

    half2 enc = n.xy / (n.z+1);

    enc /= scale;

    enc = enc*0.5+0.5;

    return half4(enc,0,0);

}

half3 decode (half4 enc, float3 view)

{

    half scale = 1.7777;

    half3 nn =

        enc.xyz*half3(2*scale,2*scale,0) +

        half3(-scale,-scale,1);

    half g = 2.0 / dot(nn.xyz,nn.xyz);

    half3 n;

    n.xy = g*nn.xy;

    n.z = g-1;

    return n;

}

ps_3_0

def c0, 1, 0.281262308, 0.5, 0

dcl_texcoord_pp v0.xyz

add_pp r0.x, c0.x, v0.z

rcp r0.x, r0.x

mul_pp r0.xy, r0.x, v0

mad_pp oC0.xy, r0, c0.y, c0.z

mov_pp oC0.zw, c0.w

ps_3_0

def c0, 3.55539989, 0, -1.77769995, 1

def c1, 2, -1, 0, 0

dcl_texcoord2 v0.xy

dcl_2d s0

texld_pp r0, v0, s0

mad_pp r0.xyz, r0, c0.xxyw, c0.zzww

dp3_pp r0.z, r0, r0

rcp r0.z, r0.z

add_pp r0.w, r0.z, r0.z

mad_pp oC0.z, r0.z, c1.x, c1.y

mul_pp oC0.xy, r0, r0.w

mov_pp oC0.w, c0.y

5 ALU

Radeon HD 2400: 2 GPR, 4.00 clk

Radeon HD 3870: 2 GPR, 1.00 clk

Radeon HD 5870: 2 GPR, 0.50 clk

GeForce 6200: 1 GPR, 2.00 clk

GeForce 7800GT: 1 GPR, 2.00 clk

GeForce 8800GTX: 5 GPR, 12.00 clk

7 ALU, 1 TEX

Radeon HD 2400: 2 GPR, 4.00 clk

Radeon HD 3870: 2 GPR, 1.00 clk

Radeon HD 5870: 2 GPR, 0.50 clk

GeForce 6200: 1 GPR, 4.00 clk

GeForce 7800GT: 1 GPR, 4.00 clk

GeForce 8800GTX: 6 GPR, 12.00 clk

Method #8: Per-pixel View Space

If we compute view space per-pixel, then Z component of a normal can never be negative. Then just store X&Y, and compute Z.

Suggested by Yuriy O’Donnell on Twitter.

Encoding, Error to Power, Error * 30 images below. MSE: 0.000134; PSNR: 38.730 dB.

Pros:

Cons:

Quite heavy on ALU

Encoding	Decoding
float3x3 make_view_mat (float3 view) { view = normalize(view); float3 x,y,z; z = -view; x = normalize (float3(z.z, 0, -z.x)); y = cross (z,x); return float3x3 (x,y,z); } half4 encode (half3 n, float3 view) { return half4(mul (make_view_mat(view), n).xy0.5+0.5,0,0); } half3 decode (half4 enc, float3 view) { half3 n; n.xy = enc2-1; n.z = sqrt(1+dot(n.xy,-n.xy)); n = mul(n, make_view_mat(view)); return n; }
ps_3_0 def c0, 1, -1, 0, 0.5 dcl_texcoord_pp v0.xyz dcl_texcoord1 v1.xyz mov r0.x, c0.z nrm r1.xyz, v1 mov r1.w, -r1.z mul r0.yz, r1.xxzw, c0.xxyw dp2add r0.w, r1.wxzw, r0.zyzw, c0.z rsq r0.w, r0.w mul r0.xyz, r0, r0.w mul r2.xyz, -r1.zxyw, r0 mad r1.xyz, -r1.yzxw, r0.yzxw, -r2 dp2add r0.x, r0.zyzw, v0.xzzw, c0.z dp3 r0.y, r1, v0 mad_pp oC0.xy, r0, c0.w, c0.w mov_pp oC0.zw, c0.z	ps_3_0 def c0, 2, -1, 1, 0 dcl_texcoord1 v0.xyz dcl_texcoord2 v1.xy dcl_2d s0 mov r0.y, c0.w nrm r1.xyz, v0 mov r1.w, -r1.z mul r0.xz, r1.zyxw, c0.yyzw dp2add r0.w, r1.wxzw, r0.xzzw, c0.w rsq r0.w, r0.w mul r0.xyz, r0, r0.w mul r2.xyz, -r1.zxyw, r0.yzxw mad r2.xyz, -r1.yzxw, r0.zxyw, -r2 texld_pp r3, v1, s0 mad_pp r3.xy, r3, c0.x, c0.y mul r2.xyz, r2, r3.y mad r0.xyz, r3.x, r0, r2 dp2add_pp r0.w, r3, -r3, c0.z rsq_pp r0.w, r0.w rcp_pp r0.w, r0.w mad_pp oC0.xyz, r0.w, -r1, r0 mov_pp oC0.w, c0.w
17 ALU Radeon HD 2400: 3 GPR, 11.00 clk Radeon HD 3870: 3 GPR, 2.75 clk Radeon HD 5870: 2 GPR, 0.80 clk GeForce 6200: 4 GPR, 12.00 clk GeForce 7800GT: 4 GPR, 8.00 clk GeForce 8800GTX: 8 GPR, 24.00 clk	21 ALU, 1 TEX Radeon HD 2400: 3 GPR, 11.00 clk Radeon HD 3870: 3 GPR, 2.75 clk Radeon HD 5870: 2 GPR, 0.80 clk GeForce 6200: 3 GPR, 12.00 clk GeForce 7800GT: 3 GPR, 9.00 clk GeForce 8800GTX: 12 GPR, 29.00 clk

Encoding

Decoding

float3x3 make_view_mat (float3 view)

{

    view = normalize(view);

    float3 x,y,z;

    z = -view;

    x = normalize (float3(z.z, 0, -z.x));

    y = cross (z,x);

    return float3x3 (x,y,z);

}

half4 encode (half3 n, float3 view)

{

    return half4(mul (make_view_mat(view), n).xy*0.5+0.5,0,0);

}

half3 decode (half4 enc, float3 view)

{

    half3 n;

    n.xy = enc*2-1;

    n.z = sqrt(1+dot(n.xy,-n.xy));

    n = mul(n, make_view_mat(view));

    return n;

}

ps_3_0

def c0, 1, -1, 0, 0.5

dcl_texcoord_pp v0.xyz

dcl_texcoord1 v1.xyz

mov r0.x, c0.z

nrm r1.xyz, v1

mov r1.w, -r1.z

mul r0.yz, r1.xxzw, c0.xxyw

dp2add r0.w, r1.wxzw, r0.zyzw, c0.z

rsq r0.w, r0.w

mul r0.xyz, r0, r0.w

mul r2.xyz, -r1.zxyw, r0

mad r1.xyz, -r1.yzxw, r0.yzxw, -r2

dp2add r0.x, r0.zyzw, v0.xzzw, c0.z

dp3 r0.y, r1, v0

mad_pp oC0.xy, r0, c0.w, c0.w

mov_pp oC0.zw, c0.z

ps_3_0

def c0, 2, -1, 1, 0

dcl_texcoord1 v0.xyz

dcl_texcoord2 v1.xy

dcl_2d s0

mov r0.y, c0.w

nrm r1.xyz, v0

mov r1.w, -r1.z

mul r0.xz, r1.zyxw, c0.yyzw

dp2add r0.w, r1.wxzw, r0.xzzw, c0.w

rsq r0.w, r0.w

mul r0.xyz, r0, r0.w

mul r2.xyz, -r1.zxyw, r0.yzxw

mad r2.xyz, -r1.yzxw, r0.zxyw, -r2

texld_pp r3, v1, s0

mad_pp r3.xy, r3, c0.x, c0.y

mul r2.xyz, r2, r3.y

mad r0.xyz, r3.x, r0, r2

dp2add_pp r0.w, r3, -r3, c0.z

rsq_pp r0.w, r0.w

rcp_pp r0.w, r0.w

mad_pp oC0.xyz, r0.w, -r1, r0

mov_pp oC0.w, c0.w

17 ALU

Radeon HD 2400: 3 GPR, 11.00 clk

Radeon HD 3870: 3 GPR, 2.75 clk

Radeon HD 5870: 2 GPR, 0.80 clk

GeForce 6200: 4 GPR, 12.00 clk

GeForce 7800GT: 4 GPR, 8.00 clk

GeForce 8800GTX: 8 GPR, 24.00 clk

21 ALU, 1 TEX

Radeon HD 2400: 3 GPR, 11.00 clk

Radeon HD 3870: 3 GPR, 2.75 clk

Radeon HD 5870: 2 GPR, 0.80 clk

GeForce 6200: 3 GPR, 12.00 clk

GeForce 7800GT: 3 GPR, 9.00 clk

GeForce 8800GTX: 12 GPR, 29.00 clk

Performance Comparison

GPU performance comparison in a single table:

Encoding, GPU cycles
	#1: X & Y	#3: Spherical	#4: Spheremap	#7: Stereo	#8: PPView
Radeon HD2400	1.00	17.00	3.00	4.00	11.00
Radeon HD5870	0.50	0.95	0.50	0.50	0.80
GeForce 6200	1.00	12.00	4.00	2.00	12.00
GeForce 8800	7.00	43.00	12.00	12.00	24.00
Decoding, GPU cycles
Radeon HD2400	1.00	17.00	3.00	4.00	11.00
Radeon HD5870	0.50	0.95	0.50	1.00	0.80
GeForce 6200	4.00	7.00	6.00	4.00	12.00
GeForce 8800	15.00	23.00	15.00	12.00	29.00
Encoding, D3D ALU+TEX instruction slots
SM3.0	1	26	4	5	17
Decoding, D3D ALU+TEX instruction slots
SM3.0	8	18	9	8	22

Quality Comparison

Quality comparison in a single table. PSNR based, higher numbers are better.

Method	PSNR, dB
#1: X & Y	18.629
#3: Spherical	42.042
#4: Spheremap	48.071
#7: Stereographic	44.147
#8: Per pixel view	38.730

Changelog

2010 03 25: Added Method #8: Per-pixel View Space. Suggested by Yuriy O’Donnell.
2010 03 24: Stop! Everything before was wrong! Old article moved here.
2009 08 12: Added Method #7: Stereographic projection. Suggested by Sean Barrett and Ignacio Castano.
2009 08 12: Optimized Method #5, suggested by Steve Hill.
2009 08 08: Added power difference images.
2009 08 07: Optimized Method #4: Sphere map. Suggested by Irenee Caroulle.
2009 08 07: Added Method #6: Lambert Azimuthal Equal Area. Suggested by Sean Barrett.
2009 08 05: Added Method #5: Cry Engine 3. Suggested by Steve Hill.
2009 08 05: Improved quality of Method #3a: round values in texture LUT.
2009 08 05: Added MSE and PSNR values for all methods.
2009 08 04: Added Method #3a: Spherical Coordinates w/ texture LUT.
2009 08 04: Method #1: 1-dot(n.xy,n.xy) is slightly better than 1-n.x*n.x-n.y*n.y (better pipelining on NV and ATI). Suggested by Arseny “zeux” Kapoulkine.

JVM探秘之旅：从class文件到垃圾回收的魔法世界 zhysunny Java那些事 jvm java
目录第一章：垃圾回收算法进化史JDK7时代：SerialGC（老式吸尘器）JDK8默认：ParallelGC（多线程清洁队）✈️JDK11+新宠：G1GC（智能分拣机器人）JDK12+实验品：Shenandoah（低延迟特工）⚡JDK15+新贵：ZGC（太空时代科技）第二章：GC算法原理实验室1.标记-清除（Mark-Sweep）2.标记-整理（Mark-Compact）3.复制算法（Copyin
Apache Doris 3.0.6 版本正式发布数据库apache
亲爱的社区小伙伴们，ApacheDoris3.0.6版本已于2025年06月16日正式发布。该版本进一步提升了系统的性能及稳定性，欢迎大家下载体验。GitHub下载官网下载行为变更禁止Unique表使用时序Compaction#49905存算分离场景下AutoBucket单分桶容量调整为10GB#50566新特性Lakehouse支持访问AWSS3TableBuckets中的Iceberg表格式详
windows使用mingw+cmake编译二维码生成库libqrencode 百口可乐__ Windows GNU/Linux 付费 windows linux microsoft
libqrencode介绍LibqrencodeisafastandcompactlibraryforencodingdatainaQRCodesymbol,a2DsymbologythatcanbescannedbyhandyterminalssuchasamobilephonewithCCD.ThecapacityofQRCodeisupto7000digitsor4000characters
【weaviate】分布式数据写入之LSM树深度解析：读写放大的权衡
文章目录一、LSM树的设计哲学：写优化的根本动机1、传统B+树存储的性能瓶颈2、LSM树的根本性创新二、写入路径的深度技术分析1、WAL机制的精密设计2、MemTable的数据结构3、刷盘（Flush）过程的技术细节三、Compaction策略：LSM树性能优化的核心机制1、为什么LSM树必须要Compaction？LSM树设计带来的必然问题2、Compaction理论2.1、Compaction
Apache Doris 3.0.6 版本正式发布 SelectDB技术团队 apache 大数据极速分析实时分析数据分析
亲爱的社区小伙伴们，ApacheDoris3.0.6版本已于2025年06月16日正式发布。该版本进一步提升了系统的性能及稳定性，欢迎大家下载体验。GitHub下载官网下载行为变更禁止Unique表使用时序Compaction存算分离场景下AutoBucket单分桶容量调整为10GB新特性Lakehouse支持访问AWSS3TableBuckets中的Iceberg表格式详情请参考文档：Icebe
[JAVA高频考点-面试题] Java 中有哪些垃圾回收算法算法大师 java 算法开发语言华为od
华为OD面试真题精选专栏：华为OD面试真题精选目录:2025华为OD面试手撕代码真题目录以及八股文真题目录文章目录华为OD面试真题精选本文为专栏附赠题，不一定是华为od面试真题Java中的垃圾回收算法详解1.标记-清除算法（Mark-Sweep）2.标记-整理算法（Mark-Compact）3.复制算法（Copying）4.分代收集算法（GenerationalCollection）5.增量式垃圾
【C#】C#八股文 manqi_ c#unity
目录1概述1.1GC（GarbageCollection）1.1.1为什么需要GC？1.1.2GC的工作原理工作原理什么是Root？GC算法：Mark-Compact标记压缩算法GC优化：Generational分代算法1.1.3GC的触发时间1.1.4如何减少垃圾回收1.1.5手动回收1.1.6需要特殊清理的类型*1.2内存1.2.1分区1.2.2为什么栈比堆快？1.2.3.NET&CLR*1.
大数据、数据挖掘技术收集（Vivo互联网技术） XiaoQiong.Zhang 数据挖掘大数据
Hudi在vivo湖仓一体的落地实践用户行为分析模型实践（四）——留存分析模型用户行为分析模型实践（三）——H5通用分析模型用户行为分析模型实践（二）——漏斗分析模型用户行为分析模型实践（一）——路径分析模型AB实验遇到用户不均匀怎么办？——vivo游戏中心业务实践经验分享HBaseCompaction原理与线上调优实践vivo游戏黑产反作弊实践Kafka实时数据即席查询应用与实践Hive和Spa
CSS预处理器 Sass/Scss 繁星学编程 CSS css sass scss
文章目录介绍Sass是什么Scss是什么Scss与Sass异同为什么使用Sass?Sass安装NPM安装(推荐使用)Windows上安装MacOSX(Homebrew)安装Sass转化为CSS转化步骤自动编译编译输出的CSS格式:nested：嵌套（默认格式）:compact：紧凑:expanded：扩展:compressed：压缩Sass语法注释变量`$`嵌套1.选择器嵌套2.父选择器`&`3.
Windows CE系统全面介绍及其与其他Windows嵌入式版本的差异轻栈OS工坊嵌入式Windows系统嵌入式操作系统 windows CE系统 winCE
WindowsCE系统全面介绍及其与其他Windows嵌入式版本的差异一、WindowsCE的起源与核心特性WindowsCE（后更名为WindowsEmbeddedCompact）是微软于1996年推出的嵌入式操作系统，专为资源受限设备设计，其核心目标是提供轻量化、模块化且支持实时任务的系统解决方案。与传统的WindowsNT内核不同，WindowsCE采用独立的CE内核，支持ARM、MIPS、
网络安全的几种攻击方法网络安全-老纪 web安全网络数据库
攻击方法挂马:就是在别人的网站文件里面放入网页木马或者是将代码潜入到对方正常的网页文件里，以使浏览者中马。挖洞:指漏洞挖掘。加壳:就是利用特殊的算法，将EXE可执行程序或者DLL动态连接库文件的编码进行改变（比如实现压缩、加密），以达到缩小文件体积或者加密程序编码，甚至是躲过杀毒软件查杀的目的。目前较常用的壳有UPX，ASPack、PePack、PECompact、UPack、免疫007、木马彩衣
LabVIEW实时系统数据监控与本地存储 LabVIEW开发 LabVIEW知识 LabVIEW知识
基于LabVIEWReal-Time模块，面向工业自动化、嵌入式测控等场景，提供实时数据采集、监控与本地存储的完整实现路径。通过分层任务调度、TDMS文件格式应用及跨平台兼容性设计，确保系统在实时性、可靠性与数据管理效率间达到平衡。文中以CompactRIO为例，阐述从工程搭建到功能实现的全流程，并对比传统方案差异，为工程师提供可复用的技术框架。核心功能实现2工程初始化与硬件配置项目架构新建Lab
MyBatis联表查询越来越无动于衷 mybatis 数据库 java
数据库表结构CREATETABLE`teacher`(`id`int(11)NOTNULLAUTO_INCREMENT,`tname`varchar(255)DEFAULTNULL,PRIMARYKEY(`id`)USINGBTREE)ENGINE=InnoDBAUTO_INCREMENT=3DEFAULTCHARSET=utf8ROW_FORMAT=COMPACT;CREATETABLE`stu
Kafka 的日志清理策略：delete 和 compact WZMeiei 大数据 kafka 分布式
Kafkadelete日志清理策略（日志删除）原理：按照一定保留策略，直接删除不符合条件的日志分段。Kafka把topic的一个partition大文件分成多个小文件段，通过这种方式，能方便地定期清除或删除已消费完的文件，以减少磁盘占用。保留策略按时间删除：设定一个时间阈值，删除修改时间在该时间之前的日志。比如设置log.retention.hours=1，就表示只保存1小时内的日志，超出1小时的
Tomcat的调优一盏盏洺灯 tomcat java
目录一.JVM1.1JVM的组成1.2运行时数据区域的组成二.垃圾回收2.1如何确认垃圾1.引用计数法2.根搜索算法2.2垃圾回收基本算法1.标记-清除算法（Mark-Sweep）2.标记-压缩算法（Mark-Compact）3.复制算法（Copying）4.多种算法总结2.3分代堆内存GC策略2.3.1堆内存分代三.java内存调整相关参数3.1JVM内存常用相关参数3.2查看JVM内存分配情况
论文复现Pushing and Grasping Policies（3） qq_50857609 linux 机器人
一、测试Test1原句：Compactscenariowherethetargetobjectisoccludedwithstructuredclutter,Thetermianloutputissavedtoatextfilesothatcanbelaterusedforevaluation.翻译：目标对象被结构化的杂乱遮挡的紧凑场景，终端输出被保存到文本文件中，以便稍后用于评估。在~/Copp
前端开发规范：CSS 代码规范指南易风920 前端开发规范 css 代码规范前端
CSS代码规范指南代码风格代码格式化样式书写一般有两种：一种是紧凑格式(Compact).web{display:block;width:50px;}一种是展开格式（Expanded）.web{display:block;width:50px;}团队约定:统一使用展开格式书写样式代码大小写样式选择器，属性名，属性值关键字全部使用小写字母书写，属性字符串允许使用大小写。/*推荐*/.web{disp
Etcd 压缩整理富士康质检员张全蛋 ETCD 大数据
etcd数据存储在实际生产中使用ETCD存储元数据，起初集群规模不大的时候元数据信息不多没有发现什么问题。随着集群规模越来越大，可能引发存储问题。—auto-compaction-retention由于ETCD数据存储多版本数据，随着写入的主键增加历史版本需要定时清理，默认的历史数据是不会清理的，数据达到2G就不能写入，必须要清理压缩历史数据才能继续写入。根据业务需求，在上生产环境之前就提前确定，
flink写doris时的优化别这么骄傲 flink 大数据
1.概念doris并不擅长高频、小量数据的导入；因为doris每一次数据导入都会在be节点上生成数据文件；如果高频导入小量数据，就会在存储层产生大量的小文件（必然会影响到后续的查询效率，也会对系统产生更多的compaction操作压力）而flink是实时不断地往doris中插入数据，所以很容易出现上述问题；怎么办？有两个办法：在flink中先做一些按时间开窗后的轻度聚合，降低写入的数据量（在先fl
vue运行报错Ineffective mark-compacts near heap limit Allocation failed-JavaScript heap out of memory 我即将远走丶或许也能高飞 javascript vue.js 前端
项目运行的时候突然报：Ineffectivemark-compactsnearheaplimitAllocationfailed-JavaScriptheapoutofmemory解决办法：1、快捷键Win+R打开运行窗口，运行npminstall-gincrease-memory-limit2、在项目文件夹运行increase-memory-limit此时运行numrundev项目又会报错：no
【Kafka基础】topics命令行操作大全：高级命令解析（1） IT成长日记 Kafka探索之旅 kafka 分布式 topics 高级命令行操作
1创建压缩主题（LogCompaction）/export/home/kafka_zk/kafka_2.13-2.7.1/bin/kafka-topics.sh--create\--bootstrap-server192.168.10.33:9092\--topiccomtopic\--partitions3\--replication-factor2\--configcleanup.policy
JVM GC四大算法 coding_-_半生 jvm 算法 java
JVMGC四大算法文章目录JVMGC四大算法GC四大算法一、引用计数法二、复制算法（COPY）三、标记清除算法（MARK-SWEEP）四、标记整理算法（MARK-COMPACT）五、总结GC四大算法一、引用计数法描述：给每一个对象分配一个计数器，用于记录对象是否被引用，被引用一次，计数进行+1优点：方便直接判断对象是否能够回收缺点：使用计数器需要消耗一定的内存，且每一次计数的修改同样需要消耗内存致
Hive 3.1 在 metastore 运行的 remote threads houzhizhen hive hive hadoop 数据仓库
Remotethreads是仅当Hivemetastore作为单独的服务运行是启动，请求需要开启compactor。有以下几种：1.AcidOpenTxnsCounterService统计当前open的事务数从表TXNS中统计状态为open的事务。此事务数量可以再hivemetrics中。2.AcidHouseKeeperService定期调用txnHandler.performTimeOuts(
Mysql行格式DYNAMIC和COMPACT区别 yyytucj mysql 数据库
MySQL的InnoDB存储引擎支持多种行格式，其中DYNAMIC和COMPACT是两种常见的行格式，它们各自有着不同的特性和应用场景。下面将详细对比这两种行格式的主要区别，以便于在设计数据库时做出合适的选择。COMPACT行格式COMPACT是MySQL5.0之后引入的一种行记录存储方式，旨在提高数据页的利用率，使每个数据页能够存储更多的行记录。COMPACT格式的特点包括：变长字段处理：对于V
垃圾回收算法努力的小钟算法
文章目录一、引用计数(ReferenceCounting)二、标记-清除(Mark-Sweep)三、标记-整理(Mark-Compact)四、分代回收(Generational)一、引用计数(ReferenceCounting)原理：每个对象维护引用计数，当计数归零时释放内存。C++示例：#includeclassRefCounted{intcount=0;public:voidaddRef(){
CPCI机箱阿尔泰科技4槽2U CPCI测控机箱后IO走线 CPCIC7604A 北京阿尔泰科技厂家 CPCI测控机箱科技工业自动化测控机箱工业机箱 CPCI机箱
品牌：阿尔泰科技型号：CPCIC-7604A概述：阿尔泰科技CPCIC-7604A是一款4槽CPCI机箱，该机箱为标准2U高度、支持19”机柜安装，符合PICMG2.1标准规范，提供了一个system插槽、三个外设插槽，支持80mm后I/O卡，以满足用户灵活多样测控应用需求。产品特点：◆机箱整体为2U高度金属结构◆4槽6UCompactPCI64位/66MHz高速总线无源背板◆带P3、P4和P5后
C#——垃圾回收(GC) 面向大象编程 C#c#开发语言面向对象编程
文章目录前言一、垃圾回收是什么二、好处三、GC过程1.GC条件2.GC步骤3.Mark-Compact标记压缩算法4.Generational分代算法5.FinalizationQueue和FreachableQueue四、托管和非托管资源1.托管资源2.非托管资源五、GC注意事项参考前言C#的垃圾回收网上有很多博客进行讲解，这里摘录一部分较好的讲解，同时建议直接使用微软官方文档，万变不离其宗一、
C# GC原理 palawind
root为全局变量的引用静态对象的引用对所有对象检查。判断应用程序是否可以访问，即是否有活动根第0带从未被标记为回收的新分配对象第1带上一次垃圾回收未被标记第2代一次以上垃圾回收未被标记不是单纯的引用计数而是标记。从root出发。找到所有reachableobject（被引用了的对象）。标记。释放。重新整理地址连续引用计数对于闭环a->b->c->d->a是无法回收Mark-Compact标记压缩
关于Mysql 中 Row size too large (＞ 8126) 错误的解决和理解 m0_74824025 mysql 数据库
提示：啰嗦一嘴，数据库的任何操作和验证前，一定要记得先备份！！！不会有错；文章目录问题发现一、问题导致的可能原因1、页大小2、行格式2.1compact格式2.2Redundant格式2.3Dynamic格式2.4Compressed格式3、BLOB和TEXT列二、解决办法1、修改页大小（不推荐）2、修改行格式3、修改数据类型为BLOB和TEXT列4、其他优化方式（可以参考使用）4.1合理设置数据
HBase的合并操作 b1gx HBase
compact的作用flush操作会将memstore的数据落地为一个个StoreFile（HFile），那么随着时间的增长在HDFS上面就会有很多的HFile文件，这样对读操作会产生比较大的影响（读操作会对HFile进行归并查询），并且对DataNode的压力也会比较大。为了降低对读操作的影响，可以对这些HFile进行compact操作，但是compact操作会产生大量的IO，所以可以看出com
312个免费高速HTTP代理IP（能隐藏自己真实IP地址） yangshangchuan 高速免费 superword HTTP代理
124.88.67.20:843 190.36.223.93:8080 117.147.221.38:8123 122.228.92.103:3128 183.247.211.159:8123 124.88.67.35:81 112.18.51.167:8123 218.28.96.39:3128 49.94.160.198:3128 183.20
pull解析和json编码百合不是茶 android pull解析 json
n.json文件: [{name:java,lan:c++,age:17},{name:android,lan:java,age:8}] pull.xml文件 <?xml version="1.0" encoding="utf-8"?> <stu> <name>java
[能源与矿产]石油与地球生态系统 comsci 能源
按照苏联的科学界的说法,石油并非是远古的生物残骸的演变产物,而是一种可以由某些特殊地质结构和物理条件生产出来的东西,也就是说,石油是可以自增长的.... 那么我们做一个猜想: 石油好像是地球的体液,我们地球具有自动产生石油的某种机制,只要我们不过量开采石油,并保护好
类与对象浅谈沐刃青蛟 java 基础
类，字面理解，便是同一种事物的总称，比如人类，是对世界上所有人的一个总称。而对象，便是类的具体化，实例化，是一个具体事物，比如张飞这个人，就是人类的一个对象。但要注意的是：张飞这个人是对象，而不是张飞，张飞只是他这个人的名字，是他的属性而已。而一个类中包含了属性和方法这两兄弟，他们分别用来描述对象的行为和性质（感觉应该是
新站开始被收录后，我们应该做什么？ IT独行者 PHP seo
新站开始被收录后，我们应该做什么？百度终于开始收录自己的网站了，作为站长，你是不是觉得那一刻很有成就感呢，同时，你是不是又很茫然，不知道下一步该做什么了？至少我当初就是这样，在这里和大家一份分享一下新站收录后，我们要做哪些工作。至于如何让百度快速收录自己的网站，可以参考我之前的帖子《新站让百
oracle 连接碰到的问题文强chu oracle
Unable to find a java Virtual Machine－－安装64位版Oracle11gR2后无法启动SQLDeveloper的解决方案作者：草根IT网来源：未知人气：813标签：导读：安装64位版Oracle11gR2后发现启动SQLDeveloper时弹出配置java.exe的路径，找到Oracle自带java.exe后产生的路径“C:\app\用户名\prod
Swing中按ctrl键同时移动鼠标拖动组件（类中多借口共享同一数据）小桔子 java 继承 swing 接口监听
都知道java中类只能单继承，但可以实现多个接口，但我发现实现多个接口之后，多个接口却不能共享同一个数据，应用开发中想实现：当用户按着ctrl键时，可以用鼠标点击拖动组件，比如说文本框。编写一个监听实现KeyListener,NouseListener,MouseMotionListener三个接口，重写方法。定义一个全局变量boolea
linux常用的命令 aichenglong linux 常用命令
1 startx切换到图形化界面 2 man命令:查看帮助信息 man 需要查看的命令,man命令提供了大量的帮助信息,一般可以分成4个部分 name:对命令的简单说明 synopsis:命令的使用格式说明 description:命令的详细说明信息 options:命令的各项说明 3 date:显示时间语法：date [OPTION]... [+FORMAT]
eclipse内存优化 AILIKES java eclipse jvm jdk
一基本说明在JVM中，总体上分2块内存区,默认空余堆内存小于 40%时，JVM就会增大堆直到-Xmx的最大限制；空余堆内存大于70%时，JVM会减少堆直到-Xms的最小限制。 1)堆内存(Heap memory):堆是运行时数据区域，所有类实例和数组的内存均从此处分配,是Java代码可及的内存，是留给开发人
关键字的使用探讨百合不是茶关键字
//关键字的使用探讨/*访问关键词private 只能在本类中访问public 只能在本工程中访问protected 只能在包中和子类中访问默认的只能在包中访问*//*final 类方法变量 final 类不能被继承 final 方法不能被子类覆盖，但可以继承 final 变量只能有一次赋值，赋值后不能改变 final 不能用来修饰构造方法*///this()
JS中定义对象的几种方式 bijian1013 js
1. 基于已有对象扩充其对象和方法(只适合于临时的生成一个对象)： <html> <head> <title>基于已有对象扩充其对象和方法(只适合于临时的生成一个对象)</title> </head> <script> var obj = new Object();
表驱动法实例 bijian1013 java 表驱动法 TDD
获得月的天数是典型的直接访问驱动表方式的实例，下面我们来展示一下： MonthDaysTest.java package com.study.test; import org.junit.Assert; import org.junit.Test; import com.study.MonthDays; public class MonthDaysTest { @T
LInux启停重启常用服务器的脚本 bit1129 linux
启动，停止和重启常用服务器的Bash脚本，对于每个服务器，需要根据实际的安装路径做相应的修改 #! /bin/bash Servers=(Apache2, Nginx, Resin, Tomcat, Couchbase, SVN, ActiveMQ, Mongo); Ops=(Start, Stop, Restart); currentDir=$(pwd); echo
【HBase六】REST操作HBase bit1129 hbase
HBase提供了REST风格的服务方便查看HBase集群的信息，以及执行增删改查操作 1. 启动和停止HBase REST 服务 1.1 启动REST服务前台启动（默认端口号8080） [hadoop@hadoop bin]$ ./hbase rest start 后台启动 hbase-daemon.sh start rest 启动时指定
大话zabbix 3.0设计假设 ronin47
What’s new in Zabbix 2.0? 去年开始使用Zabbix的时候，是1.8.X的版本，今年Zabbix已经跨入了2.0的时代。看了2.0的release notes，和performance相关的有下面几个： :: Performance improvements::Trigger related da
http错误码大全 byalias http协议 javaweb
响应码由三位十进制数字组成，它们出现在由HTTP服务器发送的响应的第一行。响应码分五种类型，由它们的第一位数字表示： 1）1xx：信息，请求收到，继续处理 2）2xx：成功，行为被成功地接受、理解和采纳 3）3xx：重定向，为了完成请求，必须进一步执行的动作 4）4xx：客户端错误，请求包含语法错误或者请求无法实现 5）5xx：服务器错误，服务器不能实现一种明显无效的请求
J2EE设计模式-Intercepting Filter bylijinnan java 设计模式数据结构
Intercepting Filter类似于职责链模式有两种实现其中一种是Filter之间没有联系，全部Filter都存放在FilterChain中，由FilterChain来有序或无序地把把所有Filter调用一遍。没有用到链表这种数据结构。示例如下： package com.ljn.filter.custom; import java.util.ArrayList;
修改jboss端口 chicony jboss
修改jboss端口 %JBOSS_HOME%\server\{服务实例名}\conf\bindingservice.beans\META-INF\bindings-jboss-beans.xml 中找到 <!-- The ports-default bindings are obtained by taking the base bindin
c++ 用类模版实现数组类 CrazyMizzz C++
最近c++学到数组类，写了代码将他实现，基本具有vector类的功能 #include<iostream> #include<string> #include<cassert> using namespace std; template<class T> class Array { public: //构造函数
hadoop dfs.datanode.du.reserved 预留空间配置方法 daizj hadoop 预留空间
对于datanode配置预留空间的方法为：在hdfs-site.xml添加如下配置 <property> <name>dfs.datanode.du.reserved</name> <value>10737418240</value>
mysql远程访问的设置 dcj3sjt126com mysql 防火墙
第一步: 激活网络设置你需要编辑mysql配置文件my.cnf. 通常状况，my.cnf放置于在以下目录： /etc/mysql/my.cnf (Debian linux) /etc/my.cnf （Red Hat Linux/Fedora Linux) /var/db/mysql/my.cnf (FreeBSD) 然后用vi编辑my.cnf，修改内容从以下行： [mysqld] 你所需要: 1
ios 使用特定的popToViewController返回到相应的Controller dcj3sjt126com controller
1、取navigationCtroller中的Controllers NSArray * ctrlArray = self.navigationController.viewControllers; 2、取出后，执行， [self.navigationController popToViewController:[ctrlArray objectAtIndex:0] animated:YES
Linux正则表达式和通配符的区别 eksliang 正则表达式通配符和正则表达式的区别通配符
转载请出自出处：http://eksliang.iteye.com/blog/1976579 首先得明白二者是截然不同的通配符只能用在shell命令中,用来处理字符串的的匹配。判断一个命令是否为bash shell(linux 默认的shell)的内置命令 type -t commad 返回结果含义 file 表示为外部命令 alias 表示该
Ubuntu Mysql Install and CONF gengzg Install
http://www.navicat.com.cn/download/navicat-for-mysql Step1: 下载Navicat ，网址：http://www.navicat.com/en/download/download.html Step2：进入下载目录，解压压缩包：tar -zxvf navicat11_mysql_en.tar.gz
批处理，删除文件bat huqiji windows dos
@echo off ::演示：删除指定路径下指定天数之前（以文件名中包含的日期字符串为准）的文件。 ::如果演示结果无误，把del前面的echo去掉，即可实现真正删除。 ::本例假设文件名中包含的日期字符串（比如：bak-2009-12-25.log） rem 指定待删除文件的存放路径 set SrcDir=C:/Test/BatHome rem 指定天数 set DaysAgo=1
跨浏览器兼容的HTML5视频音频播放器天梯梦 html5
HTML5的video和audio标签是用来在网页中加入视频和音频的标签，在支持html5的浏览器中不需要预先加载Adobe Flash浏览器插件就能轻松快速的播放视频和音频文件。而html5media.js可以在不支持html5的浏览器上使video和audio标签生效。 How to enable <video> and <audio> tags in
Bundle自定义数据传递 hm4123660 android Serializable 自定义数据传递 Bundle Parcelable
我们都知道Bundle可能过put****()方法添加各种基本类型的数据，Intent也可以通过putExtras(Bundle)将数据添加进去，然后通过startActivity()跳到下一下Activity的时候就把数据也传到下一个Activity了。如传递一个字符串到下一个Activity 把数据放到Intent
C＃：异步编程和线程的使用（.NET 4.5 ） powertoolsteam .net 线程 C#异步编程
异步编程和线程处理是并发或并行编程非常重要的功能特征。为了实现异步编程，可使用线程也可以不用。将异步与线程同时讲，将有助于我们更好的理解它们的特征。本文中涉及关键知识点 1. 异步编程 2. 线程的使用 3. 基于任务的异步模式 4. 并行编程 5. 总结异步编程什么是异步操作？异步操作是指某些操作能够独立运行，不依赖主流程或主其他处理流程。通常情况下，C＃程序
spark 查看 job history 日志 Stark_Summer 日志 spark history job
SPARK_HOME/conf 下: spark-defaults.conf 增加如下内容 spark.eventLog.enabled true spark.eventLog.dir hdfs://master:8020/var/log/spark spark.eventLog.compress true spark-env.sh 增加如下内容 export SP
SSH框架搭建 wangxiukai2015eye spring Hibernate struts
MyEclipse搭建SSH框架 Struts Spring Hibernate 1、new一个web project。 2、右键项目，为项目添加Struts支持。选择Struts2 Core Libraries -<MyEclipes-Library> 点击Finish。src目录下多了struts

Compact Normal Storage for Small G-Buffers

Intro

Note: there was an error!

Test Playground Application

Baseline: store X&Y&Z

Method #1: store X&Y, reconstruct Z

Method #3: Spherical Coordinates

Method #4: Spheremap Transform

Method #7: Stereographic Projection

Method #8: Per-pixel View Space

Performance Comparison

Quality Comparison

Changelog

你可能感兴趣的:(compact)