ilvu999

Reversing MS VC++Part I: Exception Handling

摘要

MS VC++ 是Win32平台上最广泛使用的编译器，因此熟悉它的内部工作机制对于Win32逆向爱好者非常重要。能够理解编译器生成的附加（glue）代码有助于快速理解程序员写的实际代码。同样也有助于恢复程序的高级结构。

在这个两部分组成的系列文章的Part I中，我会专注于栈的结构，异常处理和由MSVC编译出的程序的相关结构。前提是假设你对汇编器，寄存器，调用习惯有一定程度的熟悉。

术语：

· 栈帧：堆栈上由一个函数占用的一段。通常包括函数参数，返回到调用者的地址，保存的寄存器值，局部变量和这个函数中的其它特定数据。在X86（以及其它大多数架构）中调用者和被调用者的栈帧是连续的。

· 帧指针：它是一个寄存器或者变量，指向栈帧内部的一个固定地址。通常栈帧内所有数据都是以相对于这个指针的地址引用的。在X86上通常是ebp，并且指向返回地址的下一个位置。

· 对象。一个C++类的实例。

· 可展开对象。由auto storage－class指示符修饰的局部对象，它分配在栈上，并且当超出域作用范围（scope）时需要析构。

· 栈展开。当发生异常，控制离开对象域作用范围（scope）时会导致对象的自动析构，就是栈展开。

有两种类型的异常可以用在C或C++程序中。

SEH异常（Structured Exception Handling）。也被叫做Win32异常或系统异常。它们已经被著名的Matt Pietrek[1]解释的非常详尽。它们只能被用在C程序中。编译器级的支持包括关键字__try, __except,__finally和其它一些。

C++异常（有时候也叫做EH）。它基于SEH实现，C++异常允许抛出和捕获任意类型的异常。C++的一个非常重要的特点是在异常处理过程中自动的栈展开，并且MSVC使用了一种非常复杂的底层框架来确保它在任何情况都能正常运作。

在下面的图例中，内存地址从上到下增加，所以栈是“增长”的。这也是IDA采用的描述栈的方法，但和几乎其它所有描述相反。

基本的帧布局

最基本的栈帧布局如下，

...

Local variables

Other saved registers

Saved ebp

Return address

Function arguments

...

注意：如果允许了忽略帧指针 (frame pointeromission)，则saved ebp可能不存在。

SEH

在使用了编译器级SEH (__try/__except/__finally)的时候，栈的布局变得有一点复杂。

SEH3Stack Layout

当在某函数中没有__except块（只有__finally）时，不再使用saved ebp。Scopetable是一个记录（record）的数组，每个record描述了一个__try块，以及块之间的关系。

struct_SCOPETABLE_ENTRY {

DWORD EnclosingLevel;

void* FilterFunc;

void* HandlerFunc;

}

更多的SEH实现细节请看[1]。为了恢复try块，请注意观察try块的层次变量是如何更新的。每一个try块都分配了一个唯一的数作为标识，scopetable表中条目（entry）间的关系则描述了try块的嵌套关系。例如，如果scopetable的第i项的EnclosingLevel等于j，则表示try块j包围了try块i。函数体自身被认为拥有级别-1。请参看附录1作为例子。

Buffer Overrun Protection

Whidbey(MSVC2005)编译器为SEH帧增加了一些缓冲区溢出（overrun）保护。完整的栈帧布局如下：

SEH4Stack Layout

GS cookie只有在编译时打开/GS参数才存在。EH cookie总是存在。SEH4 scopetable基本和SEH3一样，只是加了一个头，

struct _EH4_SCOPETABLE {

DWORD GSCookieOffset;

DWORD GSCookieXOROffset;

DWORD EHCookieOffset;

DWORD EHCookieXOROffset;

_EH4_SCOPETABLE_RECORD ScopeRecord[1];

};

struct _EH4_SCOPETABLE_RECORD {

DWORD EnclosingLevel;

long (*FilterFunc)();

union {

void (*HandlerAddress)();

void (*FinallyFunc)();

};

GSCookieOffset =-2 意味着没有使用GScookie。 EH cookie总是存在。偏移量是相对于ebp的。检查按照下列方式进行： (ebp+CookieXOROffset) ^ [ebp+CookieOffset] == _security_cookie。指向栈中scopetable的指针同样也和__security_cookie进行了异或。而且，在SEH4中最外层的级别是-2，而不是SEH3的-1。

C++异常模块实现

当函数采用C++异常处理（try/catch）或者有可展开对象时，情形更加复杂。

C++EH Stack Layout

EH handler对每个函数都不相同（SEH正好相反），通常像这样，

(VC7+)

mov eax, OFFSET __ehfuncinfo

jmp ___CxxFrameHandler

__ehfuncinfo是一个类型为FuncInfo的结构体，它完整地描述了所有 try/catch块和所有可展开对象。

struct FuncInfo {

// compiler version.

// 0x19930520: up to VC6, 0x19930521: VC7.x(2002-2003), 0x19930522: VC8(2005)

DWORD magicNumber;

// number of entries in unwind table

int maxState;

// table of unwind destructors

UnwindMapEntry* pUnwindMap;

// number of try blocks in the function

DWORD nTryBlocks;

// mapping of catch blocks to try blocks

TryBlockMapEntry* pTryBlockMap;

// not used on x86

DWORD nIPMapEntries;

// not used on x86

void* pIPtoStateMap;

// VC7+ only, expected exceptions list (function "throw"specifier)

ESTypeList* pESTypeList;

// VC8+ only, bit 0 set if function was compiled with /EHs

int EHFlags;

};

Unwind map和SHE的scopetable类似，但没有过滤(filter)函数。

structUnwindMapEntry {

int toState; // targetstate

void (*action)(); // action toperform (unwind funclet address)

};

Try块描述子，描述了一个try块及其相关的catch块，

struct TryBlockMapEntry{

int tryLow;

int tryHigh; // this try {}covers states ranging from tryLow to tryHigh

int catchHigh; // highest stateinside catch handlers of this try

int nCatches; // number of catchhandlers

HandlerType* pHandlerArray; //catch handlers table

};

Catch块描述子，描述了一个try块的某一个catch块（因为一个try可以同时有几个catch块）。

structHandlerType {

// 0x01: const, 0x02: volatile, 0x08:reference

DWORD adjectives;

// RTTI descriptor of the exception type.0=any (ellipsis)

TypeDescriptor* pType;

// ebp-based offset of the exception objectin the function stack.

// 0 = no object (catch by type)

int dispCatchObj;

// address of the catch handler code.

// returns address where to continuesexecution (i.e. code after the try block)

void* addressOfHandler;

};

可预期异常链表(expected exceptions)（默认情况下，MSVC实现了它但没有打开，可以用/d1ESrt使之生效）。

struct ESTypeList {

// number of entries in the list

int nCount;

// list of exceptions; it seems only pType field in HandlerType is used

HandlerType* pTypeArray;

};

RTTI类型描述子。描述了单个的C++类型。在这里用它来匹配抛出的异常类型。

struct TypeDescriptor {

// vtable of type_info class

const void * pVFTable;

// used to keep thedemangled name returned by type_info::name()

void* spare;

// mangled type name, e.g.".H" = "int", ".?AUA@@" = "struct A",".?AVA@@" = "class A"

char name[0];

};

不似SEH，每个try块并没有一个与之相关的状态值。编译器不仅在进入和退出try块时修改状态值，还在每次构造和析构对象时修改。这样它就有可能在发生异常时知道哪个对象需要展开。你仍然可以通过检查与之关联的状态范围和由catch handler返回的地址来恢复try块的边界（参看附录2）。

抛出C++异常

Throw语句被转换为对_CxxThrowException()的调用，后者才真正的抛出一个Win32异常，以及异常代码0xE06D7363('msc'|0xE0000000)。可自定义的Win32异常参数包括指向异常对象的指针，和它的ThrowInfo结构，使用该结构可以让异常处理程序（handler）检查catch处理程序（handler）期待的类型和抛出异常的类型是否匹配。

struct ThrowInfo {

// 0x01: const, 0x02: volatile

DWORD attributes;

// exception destructor

void (*pmfnUnwind)();

// forward compatibility handler

int (*pForwardCompat)();

// list of types that can catch this exception.

// i.e. the actual type and all its ancestors.

CatchableTypeArray* pCatchableTypeArray;

};

struct CatchableTypeArray {

// number of entries in the following array

int nCatchableTypes;

CatchableType* arrayOfCatchableTypes[0];

};

下面描述了一个可以捕获该异常的类型。

struct CatchableType {

// 0x01: simple type (can be copied by memmove), 0x02: can be caught byreference only, 0x04: has virtual bases

DWORD properties;

// see above

TypeDescriptor* pType;

// how to cast the thrown object to this type

PMD thisDisplacement;

// object size

int sizeOrOffset;

// copy constructor address

void (*copyFunction)();

};

// Pointer-to-member descriptor.

struct PMD {

// member offset

int mdisp;

// offset of the vbtable (-1 if not a virtual base)

int pdisp;

// offset to the displacement value inside the vbtable

int vdisp;

};

在下一篇文章中我们会更加深入。

Prologs and Epilogs

相对于在函数体内生成代码来建立栈帧的方法，编译器可能会选择调用特定的prolog和epilog函数。它们有若干变种，每一种用于特定的函数类型。

Name	Type	EH Cookie	GS Cookie	Catch Handlers
_SEH_prolog/_SEH_epilog	SEH3	-	-
_SEH_prolog4/_SEH_epilog4 S	EH4	+	-
_SEH_prolog4_GS/_SEH_epilog4_GS	SEH4	+	+
_EH_prolog	C++ EH	-	-	+/-
_EH_prolog3/_EH_epilog3	C++ EH	+	-	-
_EH_prolog3_catch/_EH_epilog3	C++ EH	+	-	+
_EH_prolog3_GS/_EH_epilog3_GS	C++ EH	+	+	-
_EH_prolog3_catch_GS/_EH_epilog3_catch_GS	C++ EH	+	+	+

SEH2

显然，在过去它用于MSVC 1.XX编译器（由crtdll.dll导出）。可能会在一些老的NT程序中碰到它。

...

Saved edi

Saved esi

Saved ebx

Next SEH frame

Current SEH handler (__except_handler2)

Pointer to the scopetable

Try level

Saved ebp (of this function)

Exception pointers

Local variables

Saved ESP

Local variables

Callee EBP

Return address

Function arguments

...

Appendix I: SEH 样例

让我们思考下面的反汇编代码。

func1 proc near

_excCode = dword ptr -28h

buf = byte ptr -24h

_saved_esp = dword ptr -18h

_exception_info = dword ptr -14h

_next = dword ptr -10h

_handler = dword ptr -0Ch

_scopetable = dword ptr -8

_trylevel = dword ptr -4

str = dword ptr 8

push ebp

mov ebp, esp

push -1

push offset _func1_scopetable

push offset _except_handler3

mov eax, large fs:0

push eax

mov large fs:0, esp

add esp, -18h

push ebx

push esi

push edi

;--- end of prolog ---

mov [ebp+_trylevel], 0;trylevel -1 -> 0: beginning of try block 0

mov [ebp+_trylevel], 1;trylevel 0 -> 1: beginning of try block 1

mov large dword ptr ds:123,456

mov [ebp+_trylevel], 0;trylevel 1 -> 0: end of try block 1

jmp short _endoftry1

_func1_filter1: ; __except() filter oftry block 1

mov ecx, [ebp+_exception_info]

mov edx,[ecx+EXCEPTION_POINTERS.ExceptionRecord]

mov eax,[edx+EXCEPTION_RECORD.ExceptionCode]

mov [ebp+_excCode], eax

mov ecx, [ebp+_excCode]

xor eax, eax

cmp ecx,EXCEPTION_ACCESS_VIOLATION

setz al

retn

_func1_handler1: ; beginning of handlerfor try block 1

mov esp, [ebp+_saved_esp]

push offset aAccessViolatio ;"Access violation"

call _printf

add esp, 4

mov [ebp+_trylevel], 0;trylevel 1 -> 0: end of try block 1

_endoftry1:

mov edx, [ebp+str]

push edx

lea eax, [ebp+buf]

push eax

call _strcpy

add esp, 8

mov [ebp+_trylevel], -1 ;trylevel 0 -> -1: end of try block 0

call _func1_handler0 ; execute __finally of try block 0

jmp short _endoftry0

_func1_handler0: ; __finally handler oftry block 0

push offset aInFinally ;"in finally"

call _puts

add esp, 4

retn

_endoftry0:

;--- epilog ---

mov ecx, [ebp+_next]

mov large fs:0, ecx

pop edi

pop esi

pop ebx

mov esp, ebp

pop ebp

retn

func1 endp

_func1_scopetable

;try block 0

dd-1 ;EnclosingLevel

dd0 ;FilterFunc

ddoffset _func1_handler0 ;HandlerFunc

;try block 1

dd0 ;EnclosingLevel

ddoffset _func1_filter1 ;FilterFunc

ddoffset _func1_handler1 ;HandlerFunc

Try块0没有filter，因此它的handler是一个__finally块。Try块1的EnclosingLevel是0，所以它被置于try块0内部。考虑到这些，我们就可以试着重构出函数的结构：

    void func1 (char* str)

      char buf[12];

      __try // try block 0

         __try // try block 1

           *(int*)123=456;

         __except(GetExceptCode() == EXCEPTION_ACCESS_VIOLATION)

            printf("Access violation");

         strcpy(buf,str);

      __finally

         puts("in finally");

Appendix II: C++异常样例

func1 proc near

_a1 = dword ptr -24h

_exc = dword ptr -20h

e = dword ptr -1Ch

a2 = dword ptr -18h

a1 = dword ptr -14h

_saved_esp = dword ptr -10h

_next = dword ptr -0Ch

_handler = dword ptr -8

_state = dword ptr -4

push ebp

mov ebp, esp

push 0FFFFFFFFh

push offset func1_ehhandler

mov eax, large fs:0

push eax

mov large fs:0, esp

push ecx

sub esp, 14h

push ebx

push esi

push edi

mov [ebp+_saved_esp], esp

;--- end of prolog ---

lea ecx, [ebp+a1]

call A::A(void)

mov [ebp+_state], 0 ; state -1 -> 0: a1 constructed

mov [ebp+a1], 1 ; a1.m1 = 1

mov byte ptr [ebp+_state], 1 ;state 0 -> 1: try {

lea ecx, [ebp+a2]

call A::A(void)

mov [ebp+_a1], eax

mov byte ptr [ebp+_state], 2 ;state 2: a2 constructed

mov [ebp+a2], 2 ; a2.m1 = 2

mov eax, [ebp+a1]

cmp eax, [ebp+a2] ; a1.m1 == a2.m1?

jnz short loc_40109F

mov [ebp+_exc], offsetaAbc ; _exc = "abc"

push offset __TI1?PAD ; char *

lea ecx, [ebp+_exc]

push ecx

call _CxxThrowException ; throw "abc";

loc_40109F:

mov byte ptr [ebp+_state], 1 ;state 2 -> 1: destruct a2

lea ecx, [ebp+a2]

call A::~A(void)

jmp short func1_try0end

; catch (char * e)

func1_try0handler_pchar:

mov edx, [ebp+e]

push edx

push offset aCaughtS ;"Caught %s\n"

call ds:printf ;

add esp, 8

mov eax, offset func1_try0end

retn

; catch (...)

func1_try0handler_ellipsis:

push offset aCaught___ ;"Caught ...\n"

call ds:printf

add esp, 4

mov eax, offset func1_try0end

retn

func1_try0end:

mov [ebp+_state], 0 ; state 1 -> 0: }//try

push offset aAfterTry ;"after try\n"

call ds:printf

add esp, 4

mov [ebp+_state], -1 ; state 0 -> -1: destruct a1

lea ecx, [ebp+a1]

call A::~A(void)

;--- epilog ---

mov ecx, [ebp+_next]

mov large fs:0, ecx

pop edi

pop esi

pop ebx

mov esp, ebp

pop ebp

retn

func1 endp

func1_ehhandler proc near

mov eax, offset func1_funcinfo

jmp __CxxFrameHandler

func1_ehhandler endp

func1_funcinfo

dd19930520h ; magicNumber

dd4 ; maxState

ddoffset func1_unwindmap ; pUnwindMap

dd1 ; nTryBlocks

ddoffset func1_trymap ; pTryBlockMap

dd0 ; nIPMapEntries

dd0 ; pIPtoStateMap

dd0 ; pESTypeList

func1_unwindmap

dd-1

ddoffset func1_unwind_1tobase ; action

dd0 ; toState

dd0 ; action

dd1 ; toState

ddoffset func1_unwind_2to1 ; action

dd0 ; toState

dd0 ; action

func1_trymap

dd1 ; tryLow

dd 2 ; tryHigh

dd3 ; catchHigh

dd2 ; nCatches

ddoffset func1_tryhandlers_0 ; pHandlerArray

dd0

func1_tryhandlers_0

dd 0 ; adjectives

dd offset char * `RTTI Type Descriptor' ;pType

dd -1Ch ; dispCatchObj

dd offset func1_try0handler_pchar ;addressOfHandler

dd 0 ; adjectives

dd 0 ; pType

dd 0 ; dispCatchObj

dd offset func1_try0handler_ellipsis ;addressOfHandler

func1_unwind_1tobase proc near

a1 = byte ptr -14h

lea ecx, [ebp+a1]

call A::~A(void)

retn

func1_unwind_1tobase endp

func1_unwind_2to1 proc near

a2 = byte ptr -18h

lea ecx, [ebp+a2]

call A::~A(void)

retn

func1_unwind_2to1 endp

我们看看能找到些什么。FuncInfo结构的maxState域是4，表示我们在unwindmap中有4项，从0到3。通过检查这个map，我们看到下列动作在栈展开中被执行：

state 3 -> state 0 (noaction)

state 2 -> state 1 (destructa2)

state 1 -> state 0 (noaction)

state 0 -> state -1(destruct a1)

再看看try map，我们可以推断状态1和2对应于try块，状态3对应于catch块。这样，从状态0转换到1指明了try块的开始，从1到0表示try块执行完毕。从函数代码，我们也可以看到从-1到0是构造a1，从1到2是构造a2。所以状态图应该象这样：

那箭头1到3从何而来？我们在函数代码中看不到，在FuncInfo也看不到，因为它是异常handler完成的。如果一个异常发生在try块内部，异常handler首先展开栈到tryLow表示的状态（这里指状态1），然后在调用catch handler前设置状态值为tryHigh+1（2+1=3）。

这个try块有两个catchhandlers。第一个指定了一个期待的异常类型（char*），并从栈中获得异常对象e（-1Ch=e）。第二个没有指定类型（比如那个省略号）。它们都返回用于恢复执行流的地址，例如，刚好在try块后面的那个地址。现在，我们恢复的函数代码如下：

     void func1 ()

      A a1;

      a1.m1 = 1;

      try {

        A a2;

        a2.m1 = 2;

        if (a1.m1 == a1.m2) throw "abc";

      catch(char* e)

        printf("Caught %s\n",e);

      catch(...)

        printf("Caught ...\n");

      printf("after try\n");

Appendix III: IDC Helper Script

我写过一个IDC脚本用于辅助逆向MSVC程序。它在整个程序中搜索典型的SEH/EH代码序列，并标注出所有相关的结构和域。类似于栈变量，异常处理程序，异常类型等等都被标注了出来。它还试图修复有时候会被IDA错误判定的函数边界。你可以从这里下载。

Links and References

[1] Matt Pietrek. A Crash Course on the Depths of Win32 StructuredException Handling.
http://www.microsoft.com/msj/0197/exception/exception.aspx
Still THE definitive guide on the implementation of SEH in Win32.

[2] Brandon Bray. Security Improvements to the Whidbey Compiler.
http://blogs.msdn.com/branbray/archive/2003/11/11/51012.aspx
Short description on changes in the stack layout for cookie checks.

[3] Chris Brumme. The Exception Model.
http://blogs.msdn.com/cbrumme/archive/2003/10/01/51524.aspx
Mostly about .NET exceptions, but still contains a good deal of informationabout SEH and C++ exceptions.

[4] Vishal Kochhar. How a C++ compiler implements exception handling.
http://www.codeproject.com/cpp/exceptionhandler.asp
An overview of C++ exceptions implementation.

[5] Calling Standard for Alpha Systems. Chapter 5. Event Processing.
http://www.cs.arizona.edu/computer.help/policy/DIGITAL_unix/AA-PY8AC-TET1_html/callCH5.html
Win32 takes a lot from the way Alpha handles exceptions and this manual has avery detailed description on how it happens.

Structure definitions and flag values were also recovered from the followingsources:

VC8 CRT debug information (many structure definitions)
VC8 assembly output (/FAs)
VC8 WinCE CRT source

//////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////

Reversing Microsoft Visual C++ Part I: Exception Handling

原文链接http://www.openrce.org/articles/full_view/21

Abstract

Microsoft Visual C++ is the most widely used compiler for Win32 so it is important for the Win32 reverser to be familiar with its inner working. Being able to recognize the compiler-generated glue code helps to quickly concentrate on the actual code written by the programmer. It also helps in recovering the high-level structure of the program.

In part I of this 2-part article (see also: Part II: Classes, Methods and RTTI), I will concentrate on the stack layout, exception handling and related structures in MSVC-compiled programs. Some familiarity with assembler, registers, calling conventions etc. is assumed.

Terms:

Stack frame: A fragment of the stack segment used by a function. Usually contains function arguments, return-to-caller address, saved registers, local variables and other data specific to this function. On x86 (and most other architectures) caller and callee stack frames are contiguous.
Frame pointer: A register or other variable that points to a fixed location inside the stack frame. Usually all data inside the stack frame is addressed relative to the frame pointer. On x86 it's usually ebp and it usually points just below the return address.
Object: An instance of a (C++) class.
Unwindable Object: A local object with auto storage-class specifier that is allocated on the stack and needs to be destructed when it goes out of scope.
Stack UInwinding: Automatic destruction of such objects that happens when the control leaves the scope due to an exception.

There are two types of exceptions that can be used in a C or C++ program.

SEH exceptions (from "Structured Exception Handling"). Also known as Win32 or system exceptions. These are exhaustively covered in the famous Matt Pietrek article[1]. They are the only exceptions available to C programs. The compiler-level support includes keywords __try, __except, __finally and a few others.
C++ exceptions (sometimes referred to as "EH"). Implemented on top of SEH, C++ exceptions allow throwing and catching of arbitrary types. A very important feature of C++ is automatic stack unwinding during exception processing, and MSVC uses a pretty complex underlying framework to ensure that it works properly in all cases.

In the following diagrams memory addresses increase from top to bottom, so the stack grows "up". It's the way the stack is represented in IDA and opposite to the most other publications.

Basic Frame Layout

The most basic stack frame looks like following:

...
Local variables
Other saved registers
Saved ebp
Return address
Function arguments
...

Note: If frame pointer omission is enabled, saved ebp might be absent.

SEH

In cases where the compiler-level SEH (__try/__except/__finally) is used, the stack layout gets a little more complicated.

Reversing MS VC++Part I: Exception Handling_第1张图片

SEH3 Stack Layout

When there are no __except blocks in a function (only __finally), Saved ESP is not used. Scopetable is an array of records which describe each __try block and relationships between them:

struct _SCOPETABLE_ENTRY {
DWORD EnclosingLevel;
void* FilterFunc;
void* HandlerFunc;
}

For more details on SEH implementation see[1]. To recover try blocks watch how the try level variable is updated. It's assigned a unique number per try block, and nesting is described by relationship between scopetable entries. E.g. if scopetable entry i has EnclosingLevel=j, then try block j encloses try block i. The function body is considered to have try level -1. See Appendix 1 for an example.

Buffer Overrun Protection

The Whidbey (MSVC 2005) compiler adds some buffer overrun protection for the SEH frames. The full stack frame layout in it looks like following:

Reversing MS VC++Part I: Exception Handling_第2张图片

SEH4 Stack Layout

The GS cookie is present only if the function was compiled with /GS switch. The EH cookie is always present. The SEH4 scopetable is basically the same as SEH3 one, only with added header:

struct _EH4_SCOPETABLE {
DWORD GSCookieOffset;
DWORD GSCookieXOROffset;
DWORD EHCookieOffset;
DWORD EHCookieXOROffset;
_EH4_SCOPETABLE_RECORD ScopeRecord[1];
};
struct _EH4_SCOPETABLE_RECORD {
DWORD EnclosingLevel;
long (*FilterFunc)();
union {
void (*HandlerAddress)();
void (*FinallyFunc)();
};
};

GSCookieOffset = -2 means that GS cookie is not used. EH cookie is always present. Offsets are ebp relative. Check is done the following way: (ebp+CookieXOROffset) ^ [ebp+CookieOffset] == _security_cookie Pointer to the scopetable in the stack is XORed with the _security_cookie too. Also, in SEH4 the outermost scope level is -2, not -1 as in SEH3.

C++ Exception Model Implementation

When C++ exceptions handling (try/catch) or unwindable objects are present in the function, things get pretty complex.

Reversing MS VC++Part I: Exception Handling_第3张图片

C++ EH Stack Layout

EH handler is different for each function (unlike the SEH case) and usually looks like this:

(VC7+)
mov eax, OFFSET __ehfuncinfo
jmp ___CxxFrameHandler

__ehfuncinfo is a structure of type FuncInfo which fully describes all try/catch blocks and unwindable objects in the function.

struct FuncInfo {
// compiler version.
// 0x19930520: up to VC6, 0x19930521: VC7.x(2002-2003), 0x19930522: VC8 (2005)
DWORD magicNumber;
// number of entries in unwind table
int maxState;
// table of unwind destructors
UnwindMapEntry* pUnwindMap;
// number of try blocks in the function
DWORD nTryBlocks;
// mapping of catch blocks to try blocks
TryBlockMapEntry* pTryBlockMap;
// not used on x86
DWORD nIPMapEntries;
// not used on x86
void* pIPtoStateMap;
// VC7+ only, expected exceptions list (function "throw" specifier)
ESTypeList* pESTypeList;
// VC8+ only, bit 0 set if function was compiled with /EHs
int EHFlags;
};

Unwind map is similar to the SEH scopetable, only without filter functions:

struct UnwindMapEntry {
int toState; // target state
void (*action)(); // action to perform (unwind funclet address)
};

Try block descriptor. Describes a try{} block with associated catches.

struct TryBlockMapEntry {
int tryLow;
int tryHigh; // this try {} covers states ranging from tryLow to tryHigh
int catchHigh; // highest state inside catch handlers of this try
int nCatches; // number of catch handlers
HandlerType* pHandlerArray; //catch handlers table
};

Catch block descriptor. Describes a single catch() of a try block.

struct HandlerType {
// 0x01: const, 0x02: volatile, 0x08: reference
DWORD adjectives;
// RTTI descriptor of the exception type. 0=any (ellipsis)
TypeDescriptor* pType;
// ebp-based offset of the exception object in the function stack.
// 0 = no object (catch by type)
int dispCatchObj;
// address of the catch handler code.
// returns address where to continues execution (i.e. code after the try block)
void* addressOfHandler;
};

List of expected exceptions (implemented but not enabled in MSVC by default, use /d1ESrt to enable).

struct ESTypeList {
// number of entries in the list
int nCount;
// list of exceptions; it seems only pType field in HandlerType is used
HandlerType* pTypeArray;
};

RTTI type descriptor. Describes a single C++ type. Used here to match the thrown exception type with catch type.

struct TypeDescriptor {
// vtable of type_info class
const void * pVFTable;
// used to keep the demangled name returned by type_info::name()
void* spare;
// mangled type name, e.g. ".H" = "int", ".?AUA@@" = "struct A", ".?AVA@@" = "class A"
char name[0];
};

Unlike SEH, each try block doesn't have a single associated state value. The compiler changes the state value not only on entering/leaving a try block, but also for each constructed/destroyed object. That way it's possible to know which objects need unwinding when an exception happens. You can still recover try blocks boundaries by inspecting the associated state range and the addresses returned by catch handlers (see Appendix 2).

Throwing C++ Exceptions

throw statements are converted into calls of _CxxThrowException(), which actually raises a Win32 (SEH) exception with the code 0xE06D7363 ('msc'|0xE0000000). The custom parameters of the Win32 exception include pointers to the exception object and its ThrowInfo structure, using which the exception handler can match the thrown exception type against the types expected by catch handlers.

struct ThrowInfo {
// 0x01: const, 0x02: volatile
DWORD attributes;
// exception destructor
void (*pmfnUnwind)();
// forward compatibility handler
int (*pForwardCompat)();
// list of types that can catch this exception.
// i.e. the actual type and all its ancestors.
CatchableTypeArray* pCatchableTypeArray;
};
struct CatchableTypeArray {
// number of entries in the following array
int nCatchableTypes;
CatchableType* arrayOfCatchableTypes[0];
};

Describes a type that can catch this exception.

struct CatchableType {
// 0x01: simple type (can be copied by memmove), 0x02: can be caught by reference only, 0x04: has virtual bases
DWORD properties;
// see above
TypeDescriptor* pType;
// how to cast the thrown object to this type
PMD thisDisplacement;
// object size
int sizeOrOffset;
// copy constructor address
void (*copyFunction)();
};
// Pointer-to-member descriptor.
struct PMD {
// member offset
int mdisp;
// offset of the vbtable (-1 if not a virtual base)
int pdisp;
// offset to the displacement value inside the vbtable
int vdisp;
};

We'll delve more into this in the next article.

Prologs and Epilogs

Instead of emitting the code for setting up the stack frame in the function body, the compiler might choose to call specific prolog and epilog functions instead. There are several variants, each used for specific function type:

Name	Type	EH Cookie	GS Cookie	Catch Handlers
_SEH_prolog/_SEH_epilog	SEH3	-	-
_SEH_prolog4/_SEH_epilog4 S	EH4	+	-
_SEH_prolog4_GS/_SEH_epilog4_GS	SEH4	+	+
_EH_prolog	C++ EH	-	-	+/-
_EH_prolog3/_EH_epilog3	C++ EH	+	-	-
_EH_prolog3_catch/_EH_epilog3	C++ EH	+	-	+
_EH_prolog3_GS/_EH_epilog3_GS	C++ EH	+	+	-
_EH_prolog3_catch_GS/_EH_epilog3_catch_GS	C++ EH	+	+	+

SEH2

Apparently was used by MSVC 1.XX (exported by crtdll.dll). Encountered in some old NT programs.

...
Saved edi
Saved esi
Saved ebx
Next SEH frame
Current SEH handler (__except_handler2)
Pointer to the scopetable
Try level
Saved ebp (of this function)
Exception pointers
Local variables
Saved ESP
Local variables
Callee EBP
Return address
Function arguments
...

Appendix I: Sample SEH Program

Let's consider the following sample disassembly.

func1 proc near
_excCode = dword ptr -28h
buf = byte ptr -24h
_saved_esp = dword ptr -18h
_exception_info = dword ptr -14h
_next = dword ptr -10h
_handler = dword ptr -0Ch
_scopetable = dword ptr -8
_trylevel = dword ptr -4
str = dword ptr 8
push ebp
mov ebp, esp
push -1
push offset _func1_scopetable
push offset _except_handler3
mov eax, large fs:0
push eax
mov large fs:0, esp
add esp, -18h
push ebx
push esi
push edi
; --- end of prolog ---
mov [ebp+_trylevel], 0 ;trylevel -1 -> 0: beginning of try block 0
mov [ebp+_trylevel], 1 ;trylevel 0 -> 1: beginning of try block 1
mov large dword ptr ds:123, 456
mov [ebp+_trylevel], 0 ;trylevel 1 -> 0: end of try block 1
jmp short _endoftry1
_func1_filter1: ; __except() filter of try block 1
mov ecx, [ebp+_exception_info]
mov edx, [ecx+EXCEPTION_POINTERS.ExceptionRecord]
mov eax, [edx+EXCEPTION_RECORD.ExceptionCode]
mov [ebp+_excCode], eax
mov ecx, [ebp+_excCode]
xor eax, eax
cmp ecx, EXCEPTION_ACCESS_VIOLATION
setz al
retn
_func1_handler1: ; beginning of handler for try block 1
mov esp, [ebp+_saved_esp]
push offset aAccessViolatio ; "Access violation"
call _printf
add esp, 4
mov [ebp+_trylevel], 0 ;trylevel 1 -> 0: end of try block 1
_endoftry1:
mov edx, [ebp+str]
push edx
lea eax, [ebp+buf]
push eax
call _strcpy
add esp, 8
mov [ebp+_trylevel], -1 ; trylevel 0 -> -1: end of try block 0
call _func1_handler0 ; execute __finally of try block 0
jmp short _endoftry0
_func1_handler0: ; __finally handler of try block 0
push offset aInFinally ; "in finally"
call _puts
add esp, 4
retn
_endoftry0:
; --- epilog ---
mov ecx, [ebp+_next]
mov large fs:0, ecx
pop edi
pop esi
pop ebx
mov esp, ebp
pop ebp
retn
func1 endp
_func1_scopetable
;try block 0
dd -1 ;EnclosingLevel
dd 0 ;FilterFunc
dd offset _func1_handler0 ;HandlerFunc
;try block 1
dd 0 ;EnclosingLevel
dd offset _func1_filter1 ;FilterFunc
dd offset _func1_handler1 ;HandlerFunc

The try block 0 has no filter, therefore its handler is a __finally{} block. EnclosingLevel of try block 1 is 0, so it's placed inside try block 0. Considering this, we can try to reconstruct the function structure:

void func1 (char* str)
{
char buf[12];
__try // try block 0
{
__try // try block 1
{
*(int*)123=456;
}
__except(GetExceptCode() == EXCEPTION_ACCESS_VIOLATION)
{
printf("Access violation");
}
strcpy(buf,str);
}
__finally
{
puts("in finally");
}
}

Appendix II: Sample Program with C++ Exceptions

func1 proc near
_a1 = dword ptr -24h
_exc = dword ptr -20h
e = dword ptr -1Ch
a2 = dword ptr -18h
a1 = dword ptr -14h
_saved_esp = dword ptr -10h
_next = dword ptr -0Ch
_handler = dword ptr -8
_state = dword ptr -4
push ebp
mov ebp, esp
push 0FFFFFFFFh
push offset func1_ehhandler
mov eax, large fs:0
push eax
mov large fs:0, esp
push ecx
sub esp, 14h
push ebx
push esi
push edi
mov [ebp+_saved_esp], esp
; --- end of prolog ---
lea ecx, [ebp+a1]
call A::A(void)
mov [ebp+_state], 0 ; state -1 -> 0: a1 constructed
mov [ebp+a1], 1 ; a1.m1 = 1
mov byte ptr [ebp+_state], 1 ; state 0 -> 1: try {
lea ecx, [ebp+a2]
call A::A(void)
mov [ebp+_a1], eax
mov byte ptr [ebp+_state], 2 ; state 2: a2 constructed
mov [ebp+a2], 2 ; a2.m1 = 2
mov eax, [ebp+a1]
cmp eax, [ebp+a2] ; a1.m1 == a2.m1?
jnz short loc_40109F
mov [ebp+_exc], offset aAbc ; _exc = "abc"
push offset __TI1?PAD ; char *
lea ecx, [ebp+_exc]
push ecx
call _CxxThrowException ; throw "abc";
loc_40109F:
mov byte ptr [ebp+_state], 1 ; state 2 -> 1: destruct a2
lea ecx, [ebp+a2]
call A::~A(void)
jmp short func1_try0end
; catch (char * e)
func1_try0handler_pchar:
mov edx, [ebp+e]
push edx
push offset aCaughtS ; "Caught %s\n"
call ds:printf ;
add esp, 8
mov eax, offset func1_try0end
retn
; catch (...)
func1_try0handler_ellipsis:
push offset aCaught___ ; "Caught ...\n"
call ds:printf
add esp, 4
mov eax, offset func1_try0end
retn
func1_try0end:
mov [ebp+_state], 0 ; state 1 -> 0: }//try
push offset aAfterTry ; "after try\n"
call ds:printf
add esp, 4
mov [ebp+_state], -1 ; state 0 -> -1: destruct a1
lea ecx, [ebp+a1]
call A::~A(void)
; --- epilog ---
mov ecx, [ebp+_next]
mov large fs:0, ecx
pop edi
pop esi
pop ebx
mov esp, ebp
pop ebp
retn
func1 endp
func1_ehhandler proc near
mov eax, offset func1_funcinfo
jmp __CxxFrameHandler
func1_ehhandler endp
func1_funcinfo
dd 19930520h ; magicNumber
dd 4 ; maxState
dd offset func1_unwindmap ; pUnwindMap
dd 1 ; nTryBlocks
dd offset func1_trymap ; pTryBlockMap
dd 0 ; nIPMapEntries
dd 0 ; pIPtoStateMap
dd 0 ; pESTypeList
func1_unwindmap
dd -1
dd offset func1_unwind_1tobase ; action
dd 0 ; toState
dd 0 ; action
dd 1 ; toState
dd offset func1_unwind_2to1 ; action
dd 0 ; toState
dd 0 ; action
func1_trymap
dd 1 ; tryLow
dd 2 ; tryHigh
dd 3 ; catchHigh
dd 2 ; nCatches
dd offset func1_tryhandlers_0 ; pHandlerArray
dd 0
func1_tryhandlers_0
dd 0 ; adjectives
dd offset char * `RTTI Type Descriptor' ; pType
dd -1Ch ; dispCatchObj
dd offset func1_try0handler_pchar ; addressOfHandler
dd 0 ; adjectives
dd 0 ; pType
dd 0 ; dispCatchObj
dd offset func1_try0handler_ellipsis ; addressOfHandler
func1_unwind_1tobase proc near
a1 = byte ptr -14h
lea ecx, [ebp+a1]
call A::~A(void)
retn
func1_unwind_1tobase endp
func1_unwind_2to1 proc near
a2 = byte ptr -18h
lea ecx, [ebp+a2]
call A::~A(void)
retn
func1_unwind_2to1 endp

Let's see what we can find out here. The maxState field in FuncInfo structure is 4 which means we have four entries in the unwind map, from 0 to 3. Examining the map, we see that the following actions are executed during unwinding:

state 3 -> state 0 (no action)
state 2 -> state 1 (destruct a2)
state 1 -> state 0 (no action)
state 0 -> state -1 (destruct a1)

Checking the try map, we can infer that states 1 and 2 correspond to the try block body and state 3 to the catch blocks bodies. Thus, change from state 0 to state 1 denotes the beginning of try block, and change from 1 to 0 its end. From the function code we can also see that -1 -> 0 is construction of a1, and 1 -> 2 is construction of a2. So the state diagram looks like this:

Reversing MS VC++Part I: Exception Handling_第4张图片

Where did the arrow 1->3 come from? We cannot see it in the function code or FuncInfo structure since it's done by the exception handler. If an exception happens inside try block, the exception handler first unwinds the stack to the tryLow value (1 in our case) and then sets state value to tryHigh+1 (2+1=3) before calling the catch handler.

The try block has two catch handlers. The first one has a catch type (char*) and gets the exception object on the stack (-1Ch = e). The second one has no type (i.e. ellipsis catch). Both handlers return the address where to resume execution, i.e. the position just after the try block. Now we can recover the function code:

void func1 ()
{
A a1;
a1.m1 = 1;
try {
A a2;
a2.m1 = 2;
if (a1.m1 == a1.m2) throw "abc";
}
catch(char* e)
{
printf("Caught %s\n",e);
}
catch(...)
{
printf("Caught ...\n");
}
printf("after try\n");
}

Appendix III: IDC Helper Scripts

I wrote an IDC script to help with the reversing of MSVC programs. It scans the whole program for typical SEH/EH code sequences and comments all related structures and fields. Commented are stack variables, exception handlers, exception types and other. It also tries to fix function boundaries that are sometimes incorrectly determined by IDA. You can download it from MS SEH/EH Helper.

Links and References

[1] Matt Pietrek. A Crash Course on the Depths of Win32 Structured Exception Handling.
http://www.microsoft.com/msj/0197/exception/exception.aspx
Still THE definitive guide on the implementation of SEH in Win32.

[2] Brandon Bray. Security Improvements to the Whidbey Compiler.
http://blogs.msdn.com/branbray/archive/2003/11/11/51012.aspx
Short description on changes in the stack layout for cookie checks.

[3] Chris Brumme. The Exception Model.
http://blogs.msdn.com/cbrumme/archive/2003/10/01/51524.aspx
Mostly about .NET exceptions, but still contains a good deal of information about SEH and C++ exceptions.

[4] Vishal Kochhar. How a C++ compiler implements exception handling.
http://www.codeproject.com/cpp/exceptionhandler.asp
An overview of C++ exceptions implementation.

[5] Calling Standard for Alpha Systems. Chapter 5. Event Processing.
http://www.cs.arizona.edu/computer.help/policy/DIGITAL_unix/AA-PY8AC-TET1_html/callCH5.html
Win32 takes a lot from the way Alpha handles exceptions and this manual has a very detailed description on how it happens.

Structure definitions and flag values were also recovered from the following sources:

VC8 CRT debug information (many structure definitions)
VC8 assembly output (/FAs)
VC8 WinCE CRT source

你可能感兴趣的:(Reversing MS VC++Part I: Exception Handling)

一文搞懂MYSQL、SQL、SQLServer、SQLyog的区别和联系码上就位 mysql sql sqlserver
1.SQL（StructuredQueryLanguage）定义：SQL是结构化查询语言，主要用于管理和操作关系型数据库（RDBMS）。它是一种标准化语言，由ISO和ANSI定义，支持数据的查询、插入、更新、删除和数据库结构的定义。特点：通用性：SQL是关系型数据库的通用语言，支持多种数据库系统，如MySQL、SQLServer、Oracle、PostgreSQL等。功能性：DDL（数据定义语言）
西门子自动化冗余系统通过多层次冗余设计 D-海漠网络
西门子自动化冗余系统通过多层次冗余设计（包括PLC、电源、网络、从站及I/O模块）来确保系统的高可用性和稳定性。以下是具体实现方法及技术要点：一、PLC冗余设计硬件冗余架构冗余CPU配置：采用S7-1500R/H系列冗余CPU（如1515R或1517H），主备CPU通过冗余连接（X1接口）同步数据和程序，主CPU故障时备CPU无缝接管，切换时间可低至300ms614。同步机制：主备CPU通过同步链
知识蒸馏：从软标签压缩到推理能力迁移的工程实践(基于教师-学生模型的高效压缩技术与DeepSeek合成数据创新) AI仙人掌人工智能 AI 人工智能深度学习语言模型机器学习
知识蒸馏通过迁移教师模型（复杂）的知识到学生模型（轻量），实现模型压缩与性能平衡。核心在于利用教师模型的软标签（概率分布）替代独热编码标签，学生模型不仅学习到教师模型输出数据的类别信息，还能够捕捉到类别之间的相似性和关系，从而提升其泛化能力核心概念知识蒸馏的核心目标是实现从教师模型到学生模型的知识迁移。在实际应用中，无论是大规模语言模型（LLMs）还是其他类型的神经网络模型，都会通过softmax
【打卡d5】快速排序归并排序吧啦吧啦吡叭卜排序算法算法 java
快速排序算法模板——模板题AcWing785.快速排序voidquick_sort(intq[],intl,intr){if(l>=r)return;inti=l-1,j=r+1,x=q[(l+r)/2];while(ix);if(i=r)return;intmid=（l+r）>>1;merge_sort(q,l,mid);merge_sort(q,mid+1,r);intk=0,i=l,j=mi
MySQL主从同步面试核心20问：从原理到实战深度拆解 dblens 数据库管理和开发工具 mysql mysql 面试 android
一、核心原理篇1.主从同步基础流程（必考）答：主库：事务提交后生成binlog，由Dump线程发送给从库从库：I/O线程：接收binlog写入relaylog，受slave_net_timeout控制网络超时（默认3600秒）SQL线程：解析relaylog执行SQL，单线程设计是经典瓶颈核心文件：master.info（连接信息）、relay-log.info（执行进度）2.异步复制vs半同步复
字符串哈希从入门到精通 LIUJH1233 C++哈希算法算法 c++数据结构
一、基本概念字符串哈希是将任意长度的字符串映射为固定长度的哈希值（通常为整数）的技术，核心目标是实现O(1)时间的子串快速比较和高效查询。其本质是通过数学运算将字符串转换为唯一性较高的数值，例如：其中P为基数(根据题目)，M为大质数，s[i]为字符的ASCII值。二.一般哈希实现一般哈希的实现有两种方式：通俗的讲叫：1.蹲茅坑法2.拉拉链法2.1蹲茅坑法假设你现在要处理19与12（mod7）你会发
提到一个项目的“验证LOV”属性？提到lov和list项目有什么区别？思维导图代码示例（java 架构) 用心去追梦 list java 架构
验证LOV（ListofValues）属性在OracleForms中，LOV(ListofValues)是一种用于显示可供选择的值列表的组件。它通常与字段或项关联，允许用户从预定义的选项列表中选择一个值，而不是手动输入。验证LOV属性确保用户只能从LOV提供的选项中选择值，从而增强了数据输入的准确性和一致性。验证LOV属性定义：当设置为“是”时，表示该字段必须从LOV中选择值；如果用户尝试输入不在
基于粒子滤波与卡尔曼滤波的锂离子电池放电时间预测与使用特征研究算法如诗电池建模(RUL BC)粒子滤波锂离子电池放电时间预测
基于粒子滤波与卡尔曼滤波的锂离子电池放电时间预测与使用特征研究一、研究背景与意义锂离子电池作为现代储能系统的核心组件，其放电时间（End-of-DischargeTime,EOD）的准确预测对电池管理系统（BMS）的可靠性和安全性至关重要。传统方法（如安时积分法）易受噪声、温度漂移等因素干扰，而基于状态估计的滤波算法（粒子滤波/PF、卡尔曼滤波/KF）通过动态更新模型参数，能显著提升预测精度。二、
YashanDB配置资源管理数据库
YashanDB资源管理通过内置高级包DBMS_RESOURCE_MANAGER和相关配置参数提供对物理资源的配置能力。启用资源管理创建资源使用组调用DBMS_RESOURCE_MANAGER.CREATE_CONSUMER_GROUP创建资源使用组。--创建名为LOW_GROUP和HIGH_GROUP的资源使用组，只有SYS用户才有权限执行EXECDBMS_RESOURCE_MANAGER.CR
哇！5.2秒进入应用界面！Linux快速启动方案分享，基于全志T113-i国产平台 Tronlong创龙工业级核心板全志T113 嵌入式开发国产ARM 工业核心板
本文主要介绍基于创龙科技TLT113-EVM评估板（基于全志T113-i）的系统快速启动显示Qt界面、LVGL界面案例，适用开发环境如下。Windows开发环境：Windows764bit、Windows1064bit虚拟机：VMware15.5.5Linux开发环境：Ubuntu18.04.464bitU-Boot：U-Boot-2018.07Kernel：Linux-5.4.61、Linux-
【华为OD机试】日志采集系统--C语言 weixin_51635462 华为od c语言开发语言
#includeintinput[1000]={0};intfun(intx){intret=0;for(inti=0;i100){printf("100");}else{for(intj=1;j100){max_score=max_score>(100-fun(j))?max_score:(100-fun(j));break;}else{max_score=max_score>(cnt-fun(
LeetCode 2610. 转换二维数组迪小莫学AI 每日算法 leetcode 算法数据结构
LeetCode2610.转换二维数组题目描述给定一个整数数组nums，请你创建一个满足以下条件的二维数组：二维数组应该只包含数组nums中的元素。二维数组中的每一行都包含不同的整数。二维数组的行数应尽可能少。返回结果数组。如果存在多种答案，则返回其中任何一种。注意：二维数组的每一行上可以存在不同数量的元素。示例示例1：输入：nums=[1,3,4,1,2,3,1]输出：[[1,3,4,2],[1
docker 安装elasticsearch kibana，设置密码 biguojun docker elasticsearch kibana
安装elasticsearchdockerpulldocker.elastic.co/elasticsearch/elasticsearch:7.17.28dockerrun-d--namedocker-es-e"ES_JAVA_OPTS=-Xms512m-Xmx512m"-e"discovery.type=single-node"-vD:\docker\es\data:/usr/share/el
ActiveMQ学习总结（10）——ActiveMQ采用Spring注解方式发送和监听一杯甜酒 ActiveMQ
对于ActiveMQ消息的发送，原声的api操作繁琐，而且如果不进行二次封装，打开关闭会话以及各种创建操作也是够够的了。那么，Spring提供了一个很方便的去收发消息的框架，springjms。整合Spring后，代码不仅变得非常优雅，而且易用性和扩展性更好。1.maven依赖org.apache.xbeanxbean-spring3.16org.springframeworkspring-jms
算法-动态规划-最大子数组和程序员南飞算法动态规划 leetcode java 开发语言数据结构职场和发展
力扣题目：53.最大子数组和53.描述：给你一个整数数组nums，请你找出一个具有最大和的连续子数组（子数组最少包含一个元素），返回其最大和。子数组是数组中的一个连续部分。示例1：输入：nums=[-2,1,-3,4,-1,2,1,-5,4]输出：6解释：连续子数组 [4,-1,2,1]的和最大，为 6。示例2：输入：nums=[1]输出：1示例3：输入：nums=[5,4,-1,7,8]输出：2
算法-合并区间程序员南飞算法数据结构职场和发展 java 动态规划
力扣题目：56.合并区间-力扣（LeetCode）题目描述：以数组intervals表示若干个区间的集合，其中单个区间为intervals[i]=[starti,endi]。请你合并所有重叠的区间，并返回一个不重叠的区间数组，该数组需恰好覆盖输入中的所有区间。示例1：输入：intervals=[[1,3],[2,6],[8,10],[15,18]]输出：[[1,6],[8,10],[15,18]]
Java删除特定下标数组元素程序员南飞 Java 数组删除元素字符串遍历
15:16:06publicstaticvoidmain(String[]args){//数组创建以后长度不变，定义新的数组添加长度//删除特定下标数组String[]array1=newString[]{"a","b","b","c","d"};//删除第二个bintkey=2;String[]array2=newString[array1.length-1];for(inti=0;i=key)
面试经典算法150题系列-除自身以外数组的乘积 betterManchester 面试经典算法题150题算法面试 java
除自身以外数组的乘积给你一个整数数组nums，返回数组answer，其中answer[i]等于nums中除nums[i]之外其余各元素的乘积。题目数据保证数组nums之中任意元素的全部前缀元素和后缀的乘积都在32位整数范围内。请不要使用除法，且在O(n)时间复杂度内完成此题。示例1:输入:nums=[1,2,3,4]输出:[24,12,8,6]示例2:输入:nums=[-1,1,0,-3,3]输出
JAVA：网络编程 Socket 的技术指南拾荒的小海螺 JAVA java 网络开发语言
1、简述JavaNIO（Non-blockingI/O）是一种基于通道（Channel）和缓冲区（Buffer）的I/O模型，支持非阻塞通信和多路复用，适合高并发场景。相比传统的阻塞I/O（BIO），NIO更高效，因为它避免了线程被阻塞，降低了系统资源消耗。代码样例：https://gitee.com/lhdxhl/springboot-example.git核心组件：Channel（通道）：数据
算法通关----除自己自身以外数组乘积 fang4084 算法通关算法
题目来源：leetcode--238题目内容：给你一个整数数组nums，返回数组answer，其中answer[i]等于nums中除nums[i]之外其余各元素的乘积。题目数据保证数组nums之中任意元素的全部前缀元素和后缀的乘积都在32位整数范围内。请不要使用除法，且在O(n)时间复杂度内完成此题。示例1:输入:nums=[1,2,3,4]输出:[24,12,8,6]示例2:输入:nums=[-
python-leetcode-除自身以外数组的乘积 Joyner2018 python leetcode 算法职场和发展
238.除自身以外数组的乘积-力扣（LeetCode）classSolution:defproductExceptSelf(self,nums:List[int])->List[int]:n=len(nums)#初始化结果数组answer=[1]*n#计算前缀乘积prefix=1foriinrange(n):answer[i]=prefixprefix*=nums[i]#计算后缀乘积，同时更新结果
ActiveMQ监听器在MQ重启后不再监听问题四脚小蜗 ActiveMq activemq
应用的监听器注解@JmsListener(destination="TopicName",containerFactory="FactoryName")工厂代码@BeanJmsListenerContainerFactoryFactoryName(ConnectionFactoryconnectionFactory){SimpleJmsListenerContainerFactoryfactory
Dyn-VQA：含1452动态问题的视觉问答数据集，需灵活提供知识检索方案，查询、工具与检索时间皆可变。数据集
2024-11-05，由阿里巴巴集团创建Dyn-VQA数据集，它包含三种类型的“动态”问题，需要复杂的知识检索策略，这些问题的查询、工具和时间都是可变的。这个数据集的创建对于推动mRAG研究和解决现有VQA数据集无法充分反映启发式mRAGs在获取复杂知识方面的刚性问题具有重要意义。数据集地址：Dyn-VQA|多模态检索数据集|自然语言处理数据集一、研究背景：在多模态大型语言模型（MLLMs）中，解
VMware ESXi 8.0U3d 发布下载 - 领先的裸机 Hypervisor esxi
VMwareESXi8.0U3d-领先的裸机Hypervisor同步发布Dell(戴尔)、HPE(慧与)、Lenovo(联想)、IEITSYSTEMS(浪潮信息)、Cisco(思科)、Fujitsu(富士通)、Hitachi(日立)、NEC(日电)、Huawei(华为)、xFusion(超聚变)OEM定制版请访问原文链接：https://sysin.org/blog/vmware-esxi-8-u
力扣刷题笔记_动态规划爬楼梯问题 yma16 csp算法题目学习
题目描述假设你正在爬楼梯。需要n阶你才能到达楼顶。每次你可以爬1或2个台阶。你有多少种不同的方法可以爬到楼顶呢？注意：给定n是一个正整数。示例一输入：2输出：2解释：有两种方法可以爬到楼顶。方法一：1阶+1阶方法二：2阶示例二输入：3输出：3解释：有三种方法可以爬到楼顶。方法一：1阶+1阶+1阶方法二：1阶+2阶方法三：2阶+1阶动态规划它的最优解可以从其子问题的最优解来有效地构建。第i阶可以由以
深度学习框架PyTorch——从入门到精通（4）数据转换 Fansv587 Torch框架学习深度学习 pytorch 人工智能 python 经验分享
转换（Transforms）很多时候，数据并不总是以训练机器学习算法所需的最终处理形式出现。所以我们需要使用变换对数据进行一些处理，使其适合训练。所有TorchVision数据集都有两个参数——transform来修改特征，target_transform来修改标签——接受包含转换逻辑的可调用项。torchvision.transform模块提供了几个开箱即用的转换。FashionMNIST数据集
使用 DeepSeek-R1 为 RAG 运行本地 Gradio 应用程序呱牛 do IT 人工智能 deepseek
让我们使用Gradio构建一个简单的演示应用程序，以使用DeepSeek-R1查询和分析文档。第1步：先决条件在深入研究实现之前，我们确保已安装以下工具和库：Python3.8+Python3.8+版Langchain：用于构建由大型语言模型（）LLMs提供支持的应用程序的框架，支持轻松检索、推理和工具集成Chromadb：一个高性能的向量数据库，专为高效的相似性搜索和嵌入存储而设计。Gradio
宝石组合第十五届蓝桥杯大赛软件赛省赛C/C++ 大学 B 组 Geometry Fu 蓝桥杯蓝桥杯 c语言 c++
宝石组合题目来源第十五届蓝桥杯大赛软件赛省赛C/C++大学B组原题链接蓝桥杯宝石组合https://www.lanqiao.cn/problems/19711/learning/问题描述P10426[蓝桥杯2024省B]宝石组合题目描述在一个神秘的森林里，住着一个小精灵名叫小蓝。有一天，他偶然发现了一个隐藏在树洞里的宝藏，里面装满了闪烁着美丽光芒的宝石。这些宝石都有着不同的颜色和形状，但最引人注目
JavaScript 性能优化实战【详细指南】 AI筑梦师 JavaScript javascript 性能优化开发语言
#JavaScript性能优化实战#JavaScript性能优化实战JavaScript作为现代Web开发的核心技术，其性能优化涉及多个层面，包括计算效率、DOM操作、异步处理、内存管理、网络请求优化等。随着Web发展，越来越多的新技术（如WebAssembly、OffscreenCanvas、StreamsAPI、V8TurboFan优化等）正在提升JavaScript的性能。本指南涵盖从基础优
10-29 插入学生总学分表(MSSQL) 拿下pta500题 sqlserver 数据结构数据库 mssql
本题目要求编写Insert语句，计算每位同学获得的总学分，并将所有学生的总学分按学号升序排序后一起插入到totalcredit表中。注意：1）当某门课程成绩在60分以上时才能合计计入总学分2）如果某学生尚未选修任何课程时，总学分计为0，并插入到totalcredit表中。3）执行Insert语句之前，totalcredit表中没有任何记录。提示：MSSQLServer评测SQL语句。inserti
rust的指针作为函数返回值是直接传递，还是先销毁后创建？ wudixiaotie 返回值
这是我自己想到的问题，结果去知呼提问，还没等别人回答，我自己就想到方法实验了。。 fn main() { let mut a = 34; println!("a's addr:{:p}", &a); let p = &mut a; println!("p's addr:{:p}", &a
java编程思想 -- 数据的初始化百合不是茶 java 数据的初始化
1.使用构造器确保数据初始化 /* *在ReckInitDemo类中创建Reck的对象 */ public class ReckInitDemo { public static void main(String[] args) { //创建Reck对象 new Reck(); } }
[航天与宇宙]为什么发射和回收航天器有档期 comsci
地球的大气层中有一个时空屏蔽层,这个层次会不定时的出现,如果该时空屏蔽层出现,那么将导致外层空间进入的任何物体被摧毁,而从地面发射到太空的飞船也将被摧毁... 所以,航天发射和飞船回收都需要等待这个时空屏蔽层消失之后,再进行 &
linux下批量替换文件内容商人shang linux 替换
1、网络上现成的资料　　格式: sed -i "s/查找字段/替换字段/g" `grep 查找字段 -rl 路径` 　　linux sed 批量替换多个文件中的字符串　　sed -i "s/oldstring/newstring/g" `grep oldstring -rl yourdir` 　　例如：替换/home下所有文件中的www.admi
网页在线天气预报 oloz 天气预报
网页在线调用天气预报 <%@ page language="java" contentType="text/html; charset=utf-8" pageEncoding="utf-8"%> <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transit
SpringMVC和Struts2比较杨白白 springMVC
1. 入口 spring mvc的入口是servlet，而struts2是filter（这里要指出，filter和servlet是不同的。以前认为filter是servlet的一种特殊），这样就导致了二者的机制不同，这里就牵涉到servlet和filter的区别了。参见：http://blog.csdn.net/zs15932616453/article/details/8832343 2
refuse copy, lazy girl! 小桔子 copy
妹妹坐船头啊啊啊啊！都打算一点点琢磨呢。文字编辑也写了基本功能了。。今天查资料，结果查到了人家写得完完整整的。我清楚的认识到： 1.那是我自己觉得写不出的高度 2.如果直接拿来用，很快就能解决问题 3.然后就是抄咩~~ 4.肿么可以这样子，都不想写了今儿个，留着作参考吧！拒绝大抄特抄，慢慢一点点写！
apache与php整合 aichenglong php apache web
一 apache web服务器 1 apeche web服务器的安装 1)下载Apache web服务器 2)配置域名(如果需要使用要在DNS上注册) 3)测试安装访问http://localhost/验证是否安装成功 2 apache管理 1)service.msc进行图形化管理 2)命令管理，配
Maven常用内置变量 AILIKES maven
Built-in properties ${basedir} represents the directory containing pom.xml ${version} equivalent to ${project.version} (deprecated: ${pom.version}) Pom/Project properties Al
java的类和对象百合不是茶 JAVA面向对象类对象
java中的类： java是面向对象的语言，解决问题的核心就是将问题看成是一个类，使用类来解决 java使用 class 类名来创建类，在Java中类名要求和构造方法，Java的文件名是一样的创建一个A类： class A{ } java中的类：将某两个事物有联系的属性包装在一个类中，再通
JS控制页面输入框为只读 bijian1013 JavaScript
在WEB应用开发当中，增、删除、改、查功能必不可少，为了减少以后维护的工作量，我们一般都只做一份页面，通过传入的参数控制其是新增、修改或者查看。而修改时需将待修改的信息从后台取到并显示出来，实际上就是查看的过程，唯一的区别是修改时，页面上所有的信息能修改，而查看页面上的信息不能修改。因此完全可以将其合并，但通过前端JS将查看页面的所有信息控制为只读，在信息量非常大时，就比较麻烦。
AngularJS与服务器交互 bijian1013 JavaScript AngularJS $http
对于AJAX应用（使用XMLHttpRequests）来说，向服务器发起请求的传统方式是：获取一个XMLHttpRequest对象的引用、发起请求、读取响应、检查状态码，最后处理服务端的响应。整个过程示例如下： var xmlhttp = new XMLHttpRequest(); xmlhttp.onreadystatechange
[Maven学习笔记八]Maven常用插件应用 bit1129 maven
常用插件及其用法位于：http://maven.apache.org/plugins/ 1. Jetty server plugin 2. Dependency copy plugin 3. Surefire Test plugin 4. Uber jar plugin 1. Jetty Pl
【Hive六】Hive用户自定义函数(UDF) bit1129 自定义函数
1. 什么是Hive UDF Hive是基于Hadoop中的MapReduce，提供HQL查询的数据仓库。Hive是一个很开放的系统，很多内容都支持用户定制，包括：文件格式：Text File，Sequence File 内存中的数据格式： Java Integer/String, Hadoop IntWritable/Text 用户提供的 map/reduce 脚本：不管什么
杀掉nginx进程后丢失nginx.pid，如何重新启动nginx ronin47 nginx 重启 pid丢失
nginx进程被意外关闭，使用nginx -s reload重启时报如下错误：nginx: [error] open() “/var/run/nginx.pid” failed (2: No such file or directory)这是因为nginx进程被杀死后pid丢失了，下一次再开启nginx -s reload时无法启动解决办法：nginx -s reload 只是用来告诉运行中的ng
UI设计中我们为什么需要设计动效 brotherlamp UI ui教程 ui视频 ui资料 ui自学
随着国际大品牌苹果和谷歌的引领，最近越来越多的国内公司开始关注动效设计了，越来越多的团队已经意识到动效在产品用户体验中的重要性了，更多的UI设计师们也开始投身动效设计领域。但是说到底，我们到底为什么需要动效设计？或者说我们到底需要什么样的动效？做动效设计也有段时间了，于是尝试用一些案例，从产品本身出发来说说我所思考的动效设计。一、加强体验舒适度嗯，就是让用户更加爽更加爽的用你的产品。
Spring中JdbcDaoSupport的DataSource注入问题 bylijinnan java spring
参考以下两篇文章： http://www.mkyong.com/spring/spring-jdbctemplate-jdbcdaosupport-examples/ http://stackoverflow.com/questions/4762229/spring-ldap-invoking-setter-methods-in-beans-configuration Sprin
数据库连接池的工作原理 chicony 数据库连接池
随着信息技术的高速发展与广泛应用，数据库技术在信息技术领域中的位置越来越重要，尤其是网络应用和电子商务的迅速发展，都需要数据库技术支持动态Web站点的运行，而传统的开发模式是：首先在主程序（如Servlet、Beans）中建立数据库连接；然后进行SQL操作，对数据库中的对象进行查询、修改和删除等操作；最后断开数据库连接。使用这种开发模式，对
java 关键字 CrazyMizzz java
关键字是事先定义的，有特别意义的标识符，有时又叫保留字。对于保留字，用户只能按照系统规定的方式使用，不能自行定义。 Java中的关键字按功能主要可以分为以下几类：（1）访问修饰符 public,private,protected p
Hive中的排序语法 daizj 排序 hive order by DISTRIBUTE BY sort by
Hive中的排序语法 2014.06.22 ORDER BY hive中的ORDER BY语句和关系数据库中的sql语法相似。他会对查询结果做全局排序，这意味着所有的数据会传送到一个Reduce任务上，这样会导致在大数量的情况下，花费大量时间。与数据库中 ORDER BY 的区别在于在hive.mapred.mode = strict模式下，必须指定 limit 否则执行会报错。
单态设计模式 dcj3sjt126com 设计模式
单例模式（Singleton）用于为一个类生成一个唯一的对象。最常用的地方是数据库连接。使用单例模式生成一个对象后，该对象可以被其它众多对象所使用。 <?phpclass Example{ // 保存类实例在此属性中 private static&
svn locked dcj3sjt126com Lock
post-commit hook failed (exit code 1) with output: svn: E155004: Working copy 'D:\xx\xxx' locked svn: E200031: sqlite: attempt to write a readonly database svn: E200031: sqlite: attempt to write a
ARM寄存器学习 e200702084 数据结构 C++c C#F#
无论是学习哪一种处理器，首先需要明确的就是这种处理器的寄存器以及工作模式。 ARM有37个寄存器，其中31个通用寄存器，6个状态寄存器。 1、不分组寄存器（R0-R7）不分组也就是说说，在所有的处理器模式下指的都时同一物理寄存器。在异常中断造成处理器模式切换时，由于不同的处理器模式使用一个名字相同的物理寄存器，就是
常用编码资料 gengzg 编码
List<UserInfo> list=GetUserS.GetUserList(11); String json=JSON.toJSONString(list); HashMap<Object,Object> hs=new HashMap<Object, Object>(); for(int i=0;i<10;i++) {
进程 vs. 线程 hongtoushizi 线程 linux 进程
我们介绍了多进程和多线程，这是实现多任务最常用的两种方式。现在，我们来讨论一下这两种方式的优缺点。首先，要实现多任务，通常我们会设计Master-Worker模式，Master负责分配任务，Worker负责执行任务，因此，多任务环境下，通常是一个Master，多个Worker。如果用多进程实现Master-Worker，主进程就是Master，其他进程就是Worker。如果用多线程实现
Linux定时Job：crontab -e 与 /etc/crontab 的区别 Josh_Persistence linux crontab
一、linux中的crotab中的指定的时间只有5个部分：* * * * * 分别表示：分钟，小时，日，月，星期，具体说来：第一段代表分钟 0—59 第二段代表小时 0—23 第三段代表日期 1—31 第四段代表月份 1—12 第五段代表星期几，0代表星期日 0—6 如： */1 * * * * 每分钟执行一次。 *
KMP算法详解 hm4123660 数据结构 C++算法字符串 KMP
字符串模式匹配我们相信大家都有遇过，然而我们也习惯用简单匹配法（即Brute-Force算法)，其基本思路就是一个个逐一对比下去，这也是我们大家熟知的方法，然而这种算法的效率并不高，但利于理解。假设主串s="ababcabcacbab",模式串为t="
枚举类型的单例模式 zhb8015 单例模式
E.编写一个包含单个元素的枚举类型[极推荐]。代码如下： public enum MaYun {himself; //定义一个枚举的元素，就代表MaYun的一个实例private String anotherField;MaYun() {//MaYun诞生要做的事情//这个方法也可以去掉。将构造时候需要做的事情放在instance赋值的时候：/** himself = MaYun() {*
Kafka+Storm+HDFS ssydxa219 storm
cd /myhome/usr/stormbin/storm nimbus &bin/storm supervisor &bin/storm ui &Kafka+Storm+HDFS整合实践kafka_2.9.2-0.8.1.1.tgzapache-storm-0.9.2-incubating.tar.gzKafka安装配置我们使用3台机器搭建Kafk
Java获取本地服务器的IP 中华好儿孙 java Web 获取服务器ip地址
System.out.println("getRequestURL:"+request.getRequestURL()); System.out.println("getLocalAddr:"+request.getLocalAddr()); System.out.println("getLocalPort:&quo