snsn1984

Mapping High-Level Constructs to LLVM IR

Original article: http://llvm.lyngvig.org/Articles/Mapping-High-Level-Constructs-to-LLVM-IR

Introduction

A Quick Primer

Some Useful LLVM Tools

Mapping Basic Constructs to LLVM IR

Global Variables

Local Variables

Constants

Constant Expressions

Size-Of Computations

Function Prototypes

Function Definitions

Simple Public Functions

Simple Private Functions

Functions with a Variable Number of Parameters

Exception-Aware Functions

Function Pointers

Casts

Bitwise Casts

Zero-Extending Casts (Unsigned Upcasts)

Sign-Extending Casts (Signed Upcasts)

Truncating Casts (Signed and Unsigned Downcasts)

Floating-Point Extending Casts (Float Upcasts)

Floating-Point Truncating Casts (Float Downcasts)

Pointer-to-Integer Casts

Integer-to-Pointer Casts

Address-Space Casts (Pointer Casts)

Incomplete Structure Types

Structures

Nested Structures

Unions

Structure Expressions

Getting a Pointer to a Structure Member

Mapping Control Structures to LLVM IR

Mapping Advanced Constructs to LLVM IR

Lambda Functions

Closures

Generators

Mapping Exception Handling to LLVM IR

Exception Handling by Propagated Return Value

Setjmp/Longjmp Exception Handling

Zero Cost Exception Handling

Resources

Mapping Object-Oriented Constructs to LLVM IR

Classes

Virtual Methods

Single Inheritance

Multiple Inheritance

Virtual Inheritance

Interfaces

Boxing and Unboxing

Class Equivalence Test

Class Inheritance Test

The New Operator

The Instance New Operator

The Array New Operator

Interoperating with a Runtime Library

Interfacing to the Operating System

How to Interface to POSIX Operating Systems

Sample POSIX "Hello World" Application

How to Interface to the Windows Operating System

Sample Windows "Hello World" Application

Resources

Epilogue

Appendix A: How to Implement a String Type in LLVM

Appendix B: Task List

Introduction

In this document we will take a look at how to map various classic high-level programming language constructs to LLVM IR. The purpose of the document is to make the learning curve less steep for aspiring LLVM users.

For the sake of simplicity, we'll be working with a 32-bit target machine so that pointers and word-sized operands are 32 bits wide.

Also, for the sake of readability we do not mangle (encode) names. Rather, they are given simple, easy-to-read names that reflect their purpose. A production compiler for any language that supports overloading would generally need to mangle the names so as to avoid conflicts between symbols.

A Quick Primer

Here are a few things that you should know before reading this document:

LLVM IR is not machine code, but sort of the step just above assembly.
LLVM IR is highly typed so expect to be told when you do something wrong.
LLVM IR does not differentiate between signed and unsigned integers.
LLVM IR assumes two's complement signed integers so that, say, trunc works equally well on signed and unsigned integers.
Global symbols begin with an at sign (@).
Local symbols begin with a percent symbol (%).
All symbols must be declared or defined.
Don't worry that the LLVM IR at times can seem somewhat lengthy when it comes to expressing something; the optimizer will ensure the output is well optimized and you'll often see two or three LLVM IR instructions be coalesced into a single machine code instruction.
If in doubt, consult the Language Reference. If there is a conflict between the Language Reference and this document, this document is wrong!
All LLVM IR examples are presented without a data layout and without a target triple. You need to add those yourself, if you want to actually build and run the samples. Get them from Clang for your platform.

Some Useful LLVM Tools

The most important LLVM tools for use with this article are as follows:

Name	Function	Reads	Writes	Arguments
clang	C Compiler	.c	.ll	-c -emit-llvm -S
clang++	C++ Compiler	.cpp	.ll	-c -emit-llvm -S
llvm-dis	Disassembler	.bc	.ll
opt	Optimizer	.bc/.ll	same
llc	IR Compiler	.ll	.s

While you are playing around with generating or writing LLVM IR, you may want to add the option -fsanitize=undefined to Clang/Clang++ if you use either of them. This option makes Clang/Clang++ insert run-time checks in places where it would normally output a ud2 instruction. This will likely save you some trouble if you happen to generate undefined LLVM IR. Please notice that this option only works for C and C++ compiles.

Mapping Basic Constructs to LLVM IR

In this chapter, we'll look at the most basic and simple constructs that are part of nearly all imperative/OOP languages out there.

Global Variables

Global variables are trivial to implement in LLVM IR:

 
    int variable = 14;

    int main()
    {
        return variable;
    }

Becomes:

 
    @variable = global i32 14

    define i32 @main() nounwind {
        %1 = load i32* @variable
        ret i32 %1
    }

Please notice that LLVM views global variables as pointers, so you must explicitly dereference the global variable using the load instruction when accessing its value. Likewise, you must explicitly store the value of a global variable using the store instruction.

Local Variables

There are two kinds of local variables in LLVM:

Register-allocated local variables (temporaries).
Stack-allocated local variables.

The former is created by introducing a new symbol for the variable:

 
    %1 = some computation

The latter is created by allocating the variable on the stack:

    %2 = alloca i32

Please notice that alloca yields a pointer to the allocated type. As is generally the case in LLVM, you must explicitly use a load or store instruction to read or write the value respectively.

The use of alloca allows for a neat trick that can simplify your code generator in some cases. The trick is to explicitly allocate all mutable variables, including arguments, on the stack, initialize them with the appropriate initial value, and then operate on the stack as if that was your end goal. You then run the "memory to register promotion" pass on your code as part of the optimization phase. This will make LLVM store as many of the stack variables in registers as it possibly can. That way you don't have to ensure that the generated program is in SSA form but can generate code without having to worry about this aspect of the code generation.

This trick is also described in chapter 7.4, Mutable Variables in Kaleidoscope, in the OCaml tutorial on the LLVM website.

Constants

There are two different kinds of constants:

Constants that do not occupy allocated memory.
Constants that do occupy allocated memory.

The former are always expanded inline by the compiler as there is no LLVM IR equivalent of those. In other words, the compiler simply inserts the constant value wherever it is being used in a computation:

 
    %1 = add i32 %0, 17     ; 17 is an inlined constant

Constants that do occupy memory are defined using the constant keyword:

 
    @hello = internal constant [6 x i8] c"hello\00"
    %struct = type { i32, i8 }
    @struct_constant = internal constant %struct { i32 16, i8 4 }

Such a constant is really a global variable whose visibility can be limited with private or internal so that it is invisible outside the current module.

Constant Expressions

TODO: Document the various forms of constant expressions that exist and how they can be very useful. For instance, getelementptr constant expressions are almost unavoidable in all but the simplest programs.

Size-Of Computations

Even though the compiler ought to know the exact size of everything in use (for statically checked languages), it can at times be convenient to ask LLVM to figure out the size of a structure for you. This is done with the following little snippet of code:

 
    %Struct = type { i8, i32, i8* }
    @Struct_size = constant i32 ptrtoint (%Struct* getelementptr (%Struct* null, i32 1) to i32)

@Struct_size will now contain the size of the structure %Struct. The trick is to compute the offset of the second element in the zero-based array starting at null and that way get the size of the structure.
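For comparison, the same idea can be sketched in C++. This is only an illustration under stated assumptions: the struct `Struct` mirrors the IR type above, the helper names `size_by_element_offset` and `size_demo` are made up for this example, and a real two-element array is used instead of a null base pointer because pointer arithmetic on a null pointer is undefined behavior in C++.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Mirrors %Struct = type { i8, i32, i8* } from the IR above.
struct Struct {
    std::int8_t  a;
    std::int32_t b;
    char*        c;
};

// Hypothetical helper: the distance in bytes between element 0 and
// element 1 of an array is the size of one element, padding included.
std::size_t size_by_element_offset() {
    Struct pair[2] = {};
    const char* first  = reinterpret_cast<const char*>(&pair[0]);
    const char* second = reinterpret_cast<const char*>(&pair[1]);
    return static_cast<std::size_t>(second - first);
}

// Hypothetical demo: the offset trick agrees with the built-in sizeof.
void size_demo() {
    assert(size_by_element_offset() == sizeof(Struct));
}
```

The value depends on the target's padding rules, which is exactly why it can be convenient to let the compiler compute it.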

Function Prototypes

A function prototype, aka a profile, is translated into an equivalent declare declaration in LLVM IR:

 
    int Bar(int value);
  

Becomes:

 
    declare i32 @Bar(i32 %value)

Or you can leave out the descriptive parameter name:

 
    declare i32 @Bar(i32)
  

Function Definitions

The translation of function definitions depends on a range of factors: the calling convention in use, whether the function is exception-aware or not, and whether the function is to be publicly available outside the module.

Simple Public Functions

The most basic model is:

 
    int Bar(void)
    {
        return 17;
    }

Becomes:

 
    define i32 @Bar() nounwind {
        ret i32 17
    }

Simple Private Functions

A static function is a function private to a module that cannot be referenced from outside of the defining module:

 
    define private i32 @Foo() nounwind {
        ret i32 17
    }

Functions with a Variable Number of Parameters

To call a so-called vararg function, you first need to define or declare it using the ellipsis (...) and then you need to make use of a special syntax for function calls that allows you to explicitly list the types of the parameters of the function that is being called. This "hack" exists to let a call site override the type of the function being called, as is needed for a function with variable parameters. Please notice that you only need to specify the return type once, not twice as you'd have to do if it was a true cast:

 
    declare i32 @printf(i8*, ...) nounwind

    @.text = internal constant [20 x i8] c"Argument count: %d\0A\00"

    define i32 @main(i32 %argc, i8** %argv) nounwind {
        ; printf("Argument count: %d\n", argc)
        %1 = call i32 (i8*, ...)* @printf(i8* getelementptr([20 x i8]* @.text, i32 0, i32 0), i32 %argc)
        ret i32 0
    }
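As a source-level counterpart, here is a minimal C++ sketch of both sides of a vararg call. The function names `sum_ints`, `argument_count_message`, and `vararg_demo` are hypothetical and exist only for this illustration; `std::snprintf` is used instead of `printf` so that the formatted text can be inspected.

```cpp
#include <cassert>
#include <cstdarg>
#include <cstdio>
#include <string>

// Hypothetical vararg function: sums `count` int arguments.
// The ellipsis plays the same role as "..." in the IR declaration.
int sum_ints(int count, ...) {
    va_list args;
    va_start(args, count);
    int total = 0;
    for (int i = 0; i < count; ++i) {
        total += va_arg(args, int);
    }
    va_end(args);
    return total;
}

// Calls a vararg library function with the same format string as the
// IR example, writing into a buffer instead of standard output.
std::string argument_count_message(int argc) {
    char buffer[64];
    std::snprintf(buffer, sizeof buffer, "Argument count: %d\n", argc);
    return std::string(buffer);
}

// Hypothetical demo exercising both helpers.
void vararg_demo() {
    assert(sum_ints(3, 1, 2, 3) == 6);
    assert(argument_count_message(3) == "Argument count: 3\n");
}
```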

Exception-Aware Functions

A function that is aware of being part of a larger scheme of exception-handling is called an exception-aware function. Depending upon the type of exception handling being employed, the function may either return a pointer to an exception instance, create a setjmp/longjmp frame, or simply specify the uwtable (for UnWind Table) attribute. These cases will all be covered in great detail in the chapter on Exception Handling below.

Function Pointers

Function pointers are expressed almost like in C and C++:

 
    int (*Function)(char *buffer);
  

Becomes:

 
    @Function = global i32(i8*)* null
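To show how such a global function pointer is typically used, here is a small C++ sketch. The target function `buffer_length` and the wrapper `call_through_pointer` are assumptions made for this example; only the declaration of `Function` comes from the text above.

```cpp
#include <cassert>
#include <cstring>

// The global function pointer from the example; null until assigned,
// just like the "null" initializer in the IR.
int (*Function)(char* buffer) = nullptr;

// Hypothetical target with a matching signature.
int buffer_length(char* buffer) {
    return static_cast<int>(std::strlen(buffer));
}

// Hypothetical wrapper: calls through the pointer, or returns -1 if
// nothing has been assigned yet.
int call_through_pointer(char* buffer) {
    if (Function == nullptr) {
        return -1;
    }
    return Function(buffer);
}
```

In IR terms, the call through the pointer is a load of the global followed by a call on the loaded value.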
  

Casts

There are nine different types of casts:

Bitwise casts (type casts).
Zero-extending casts (unsigned upcasts).
Sign-extending casts (signed upcasts).
Truncating casts (signed and unsigned downcasts).
Floating-point extending casts (float upcasts).
Floating-point truncating casts (float downcasts).
Pointer-to-integer casts (todo: Document pointer-to-integer casts).
Integer-to-pointer casts (todo: Document integer-to-pointer casts).
Address-space casts (pointer casts).

Bitwise Casts

A bitwise cast (bitcast) reinterprets a given bit pattern without changing any bits in the operand. For instance, you could make a bitcast of a pointer to byte into a pointer to some structure as follows:

 
    #include <stddef.h>

    typedef struct
    {
        int a;
    } Foo;

    extern void *malloc(size_t size);
    extern void free(void *value);

    void allocate()
    {
        Foo *foo = (Foo *) malloc(sizeof(Foo));
        foo->a = 12;
        free(foo);
    }

Becomes:

 
    %Foo = type { i32 }

    declare i8* @malloc(i32)
    declare void @free(i8*)

    define void @allocate() nounwind {
        %1 = call i8* @malloc(i32 4)
        %foo = bitcast i8* %1 to %Foo*
        %2 = getelementptr %Foo* %foo, i32 0, i32 0
        store i32 12, i32* %2
        call void @free(i8* %1)
        ret void
    }

Zero-Extending Casts (Unsigned Upcasts)

To upcast an unsigned value like in the example below:

 
    uint8 byte = 117;
    uint32 word;

    void main()
    {
        /* The compiler automatically upcasts the byte to a word. */
        word = byte;
    }

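You use the zext instruction:

    @byte = global i8 117
    @word = global i32 0

    define void @main() nounwind {
        %1 = load i8* @byte
        %2 = zext i8 %1 to i32
        store i32 %2, i32* @word
        ret void
    }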

Sign-Extending Casts (Signed Upcasts)

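To upcast a signed value, you replace the zext instruction with the sext instruction and everything else works just like in the previous section:

    @char = global i8 -17
    @int = global i32 0

    define void @main() nounwind {
        %1 = load i8* @char
        %2 = sext i8 %1 to i32
        store i32 %2, i32* @int
        ret void
    }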

Truncating Casts (Signed and Unsigned Downcasts)

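Both signed and unsigned integers use the same instruction, trunc, to reduce the size of the number in question. This is because LLVM IR assumes that all signed integer values are in two's complement format, so trunc is sufficient to handle both cases:

    @int = global i32 -1
    @char = global i8 0

    define void @main() nounwind {
        %1 = load i32* @int
        %2 = trunc i32 %1 to i8
        store i8 %2, i8* @char
        ret void
    }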
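The effect of the three integer casts described above can be checked from C++. The sketch below is an illustration only; the function names `zero_extend`, `sign_extend`, `truncate_to_byte`, and `integer_cast_demo` are hypothetical, and fixed-width types from `<cstdint>` stand in for i8 and i32.

```cpp
#include <cassert>
#include <cstdint>

// Corresponds to zext: the upper 24 bits become zero.
std::uint32_t zero_extend(std::uint8_t value) {
    return static_cast<std::uint32_t>(value);
}

// Corresponds to sext: the upper 24 bits copy the sign bit.
std::int32_t sign_extend(std::int8_t value) {
    return static_cast<std::int32_t>(value);
}

// Corresponds to trunc: only the low 8 bits are kept. Unsigned types are
// used so that the C++ conversion is well defined in every standard version.
std::uint8_t truncate_to_byte(std::uint32_t value) {
    return static_cast<std::uint8_t>(value);
}

// Hypothetical demo of the bit patterns involved.
void integer_cast_demo() {
    assert(zero_extend(0xF0) == 0x000000F0u);
    assert(sign_extend(-16) == -16);
    assert(static_cast<std::uint32_t>(sign_extend(-16)) == 0xFFFFFFF0u);
    assert(truncate_to_byte(0x12345678u) == 0x78);
    // The bit pattern of -1 truncates to all ones, signed or not.
    assert(truncate_to_byte(static_cast<std::uint32_t>(-1)) == 0xFF);
}
```

Note that 0xF0 and -16 share the same 8-bit pattern; only the choice between zero extension and sign extension decides what the widened value means.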

Floating-Point Extending Casts (Float Upcasts)

Floating-point numbers can be extended using the fpext instruction:

 
    float small = 1.25;
    double large;

    void main()
    {
        /* The compiler inserts an implicit float upcast. */
        large = small;
    }

Becomes:

 
    @small = global float 1.25
    @large = global double 0.0

    define void @main() nounwind {
        %1 = load float* @small
        %2 = fpext float %1 to double
        store double %2, double* @large
        ret void
    }

Floating-Point Truncating Casts (Float Downcasts)

Likewise, a floating point number can be truncated to a smaller size:

 
    @large = global double 1.25
    @small = global float 0.0

    define void @main() nounwind {
        %1 = load double* @large
        %2 = fptrunc double %1 to float
        store float %2, float* @small
        ret void
    }
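The difference between the two floating-point casts is easiest to see by round-tripping values. The following C++ sketch uses hypothetical helper names (`extend_to_double`, `truncate_to_float`, `float_cast_demo`); it assumes the usual IEEE 754 float and double formats.

```cpp
#include <cassert>

// Corresponds to fpext: every float is exactly representable as a double.
double extend_to_double(float value) {
    return static_cast<double>(value);
}

// Corresponds to fptrunc: the value is rounded to the nearest float,
// so precision may be lost.
float truncate_to_float(double value) {
    return static_cast<float>(value);
}

// Hypothetical demo: 1.25 survives the round trip, 0.1 does not.
void float_cast_demo() {
    assert(extend_to_double(1.25f) == 1.25);
    assert(truncate_to_float(1.25) == 1.25f);
    assert(extend_to_double(truncate_to_float(0.1)) != 0.1);
}
```

The value 1.25 is exact in both formats, which is why the examples above can use it without losing anything in either direction.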

Pointer-to-Integer Casts

todo: Document pointer-to-integer casts.

Integer-to-Pointer Casts

todo: Document integer-to-pointer casts.

Address-Space Casts (Pointer Casts)

todo: Find a useful example of an address-space cast, using the addrspacecast instruction, to be included here.

Incomplete Structure Types

Incomplete types are very useful for hiding the details of what fields a given structure has. A well-designed C interface can be made so that no details of the structure are revealed to the client, so that the client cannot inspect or modify private members inside the structure:

 
    void Bar(struct Foo *);
  

Becomes:

 
    %Foo = type opaque
    declare void @Bar(%Foo*)
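The pattern can be sketched end to end in C++ as follows. Everything except the incomplete type `Foo` is an assumption made for illustration: the functions `Foo_create`, `Foo_value`, `Foo_destroy`, and `client_roundtrip` are hypothetical names, and the implementation part would normally live in a separate source file.

```cpp
#include <cassert>

// --- What the client sees: an incomplete type and functions that
// --- only ever take or return pointers to it.
struct Foo;
Foo* Foo_create(int value);
int  Foo_value(const Foo* foo);
void Foo_destroy(Foo* foo);

// Client code compiles without knowing any field of Foo.
int client_roundtrip(int value) {
    Foo* foo = Foo_create(value);
    int result = Foo_value(foo);
    Foo_destroy(foo);
    return result;
}

// --- Implementation side: only here is the structure completed.
struct Foo {
    int value;
};

Foo* Foo_create(int value) { return new Foo{value}; }
int  Foo_value(const Foo* foo) { return foo->value; }
void Foo_destroy(Foo* foo) { delete foo; }
```

Because the client only handles pointers, the opaque type never needs a size on the client side, which is what makes `type opaque` sufficient in the IR.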

Structures

LLVM IR already includes the concept of structures so there isn't much to do:

 
    struct Foo
    {
        size_t _length;
    };

It is only a matter of discarding the actual field names and then indexing with numerals starting from zero (size_t maps to i32 on the 32-bit target assumed in this document):

    %Foo = type { i32 }

Nested Structures

Nested structures are straightforward:

 
    %Object = type {
        %Object*,      ; 0: above; the parent pointer
        i32            ; 1: value; the value of the node
    }

Unions

Unions are getting more and more rare as the years have shown that they are quite dangerous to use, especially the C variant that does not have a selector field to indicate which of the union's variants is valid. Some may still have a legacy reason to use unions. In fact, LLVM does not support unions at all:

 
    union Foo
    {
        int a;
        char *b;
        double c;
    };

    Foo Union;

Becomes this when run through Clang++:

 
    %union.Foo = type { double }
    @Union = global %union.Foo { double 0.0 }

What happened here? Where did the other union members go? The answer is that in LLVM there are no unions; there are only structs that can be cast into whichever type the front-end wants to cast the struct into. So to access the above union from LLVM IR, you'd use the bitcast instruction to cast a pointer to the "union" into whatever pointer you'd want it to be:

 
    %1 = bitcast %union.Foo* @Union to i32*
    store i32 1, i32* %1
    %2 = bitcast %union.Foo* @Union to i8**
    store i8* null, i8** %2

This may seem strange, but the truth is that a union is nothing more than a piece of memory that is being accessed using different implicit pointer casts.

If you want to support unions in your front-end language, you should simply allocate the total size of the union (i.e. the size of the largest member) and then generate code to reinterpret the allocated memory as needed.

The cleanest approach might be to simply allocate a range of bytes (i8), possibly with alignment padding at the end, and then cast whenever you access the structure. That way you'd be sure you did everything properly all the time.
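A minimal C++ sketch of that byte-range approach is shown below. The names `FooStorage`, `store_as`, `load_as`, and `union_demo` are hypothetical, and `std::memcpy` is used for the reinterpretation because it is the well-defined way to do this in C++; a front-end emitting LLVM IR would use pointer casts instead, as shown above.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstring>

// Size and alignment of the largest member of union Foo { int; char*; double; }.
constexpr std::size_t kFooSize =
    std::max({sizeof(int), sizeof(char*), sizeof(double)});
constexpr std::size_t kFooAlign =
    std::max({alignof(int), alignof(char*), alignof(double)});

// The "union" is nothing more than a suitably aligned range of bytes.
struct FooStorage {
    alignas(kFooAlign) unsigned char bytes[kFooSize];
};

// Reinterpret the storage as T when writing.
template <typename T>
void store_as(FooStorage& storage, T value) {
    static_assert(sizeof(T) <= kFooSize, "member must fit in the storage");
    std::memcpy(storage.bytes, &value, sizeof(T));
}

// Reinterpret the storage as T when reading.
template <typename T>
T load_as(const FooStorage& storage) {
    static_assert(sizeof(T) <= kFooSize, "member must fit in the storage");
    T value;
    std::memcpy(&value, storage.bytes, sizeof(T));
    return value;
}

// Hypothetical demo: the same bytes hold an int, then a pointer, then a double.
void union_demo() {
    FooStorage storage = {};
    store_as<int>(storage, 1);
    assert(load_as<int>(storage) == 1);
    store_as<char*>(storage, nullptr);
    assert(load_as<char*>(storage) == nullptr);
    store_as<double>(storage, 2.5);
    assert(load_as<double>(storage) == 2.5);
}
```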

Structure Expressions

As already mentioned, structure members are referenced by index rather than by name in LLVM IR. And at no point do you need to, or should you, compute the offset of a given structure member yourself. The getelementptr instruction is available to compute a pointer to any structure member with no overhead (the getelementptr instruction is typically coalesced into the actual load or store instruction).

Getting a Pointer to a Structure Member

The C++ code below illustrates various things you might want to do:

 
    struct Foo
    {
        int a;
        char *b;
        double c;
    };

    int main(void)
    {
        Foo foo;
        char **bptr = &foo.b;

        Foo bar[100];
        bar[17].c = 0.0;

        return 0;
    }

Becomes:

 
    %Foo = type {
        i32,        ; 0: a
        i8*,        ; 1: b
        double      ; 2: c
    }

    define i32 @main() nounwind {
        ; Foo foo
        %foo = alloca %Foo
        ; char **bptr = &foo.b
        %1 = getelementptr %Foo* %foo, i32 0, i32 1

        ; Foo bar[100]
        %bar = alloca %Foo, i32 100
        ; bar[17].c = 0.0
        %2 = getelementptr %Foo* %bar, i32 17, i32 2
        store double 0.0, double* %2

        ret i32 0
    }
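To make the address arithmetic behind the second getelementptr concrete, the C++ sketch below reproduces it by hand and compares the result with what the compiler computes. This is purely illustrative, since the whole point of getelementptr is that you never write this arithmetic yourself; the helper names `offset_of_c`, `offset_matches`, and `member_offset_demo` are hypothetical.

```cpp
#include <cassert>
#include <cstddef>

struct Foo {
    int    a;   // index 0
    char*  b;   // index 1
    double c;   // index 2
};

// Byte offset described by "getelementptr %Foo* %bar, i32 index, i32 2":
// step over `index` whole structures, then to the start of member c.
std::size_t offset_of_c(std::size_t index) {
    return index * sizeof(Foo) + offsetof(Foo, c);
}

// Checks the hand-computed offset against the address of bar[index].c.
bool offset_matches(std::size_t index) {
    static Foo bar[100];
    const char* base   = reinterpret_cast<const char*>(bar);
    const char* member = reinterpret_cast<const char*>(&bar[index].c);
    return member == base + offset_of_c(index);
}

// Hypothetical demo using the index from the example above.
void member_offset_demo() {
    assert(offset_matches(17));
}
```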

todo: Document the extractvalue and insertvalue instructions.

Mapping Control Structures to LLVM IR

todo: Add common control structures such as if, for, switch, and while.

todo: Explain the purpose of the phi instruction; show how it becomes obvious that you need it as soon as you encounter multiple blocks that contribute a value through different temporaries. In a way, phi ought to have been called "join" as it sort of joins up subexpressions from different basic blocks.

Mapping Advanced Constructs to LLVM IR

In this chapter, we'll look at various non-OOP constructs that are highly useful and are becoming more and more widespread in use.

Lambda Functions

A lambda function is an anonymous function with the added spice that it may freely refer to the local variables (including argument variables) in the containing function. Lambdas are implemented just like Pascal's nested functions, except the compiler is responsible for generating an internal name for the lambda function. There are a few different ways of implementing lambda functions (see Wikipedia on Nested Functions for more information).

 
    int foo(int a)
    {
        auto function = [=](int x) { return x + a; };
        return function(10);
    }
  

Here the "problem" is that the lambda function references a local variable of the caller, namely a, even though the lambda function is a function of its own. This can be solved easily by passing the local variable in as an implicit argument to the lambda function:

 
    define internal i32 @lambda(i32 %a, i32 %x) alwaysinline nounwind {
        %1 = add i32 %a, %x
        ret i32 %1
    }

    define i32 @foo(i32 %a) nounwind {
        %1 = call i32 @lambda(i32 %a, i32 10)
        ret i32 %1
    }

Alternatively, if the lambda function uses more than a few variables, you can wrap them up in a structure and pass a pointer to that structure to the lambda function:

 
    int foo(int a, int b)
    {
        int c = integer_parse();
        auto function = [=](int x) { return (a + b - c) * x; };
        return function(10);
    }
  

Becomes:

 
    %Lambda_Arguments = type {
        i32,        ; 0: a (argument)
        i32,        ; 1: b (argument)
        i32         ; 2: c (local)
    }

    define i32 @lambda(%Lambda_Arguments* %args, i32 %x) nounwind {
        %1 = getelementptr %Lambda_Arguments* %args, i32 0, i32 0
        %a = load i32* %1
        %2 = getelementptr %Lambda_Arguments* %args, i32 0, i32 1
        %b = load i32* %2
        %3 = getelementptr %Lambda_Arguments* %args, i32 0, i32 2
        %c = load i32* %3
        %4 = add i32 %a, %b
        %5 = sub i32 %4, %c
        %6 = mul i32 %5, %x
        ret i32 %6
    }

    declare i32 @integer_parse()

    define i32 @foo(i32 %a, i32 %b) nounwind {
        %args = alloca %Lambda_Arguments
        %1 = getelementptr %Lambda_Arguments* %args, i32 0, i32 0
        store i32 %a, i32* %1
        %2 = getelementptr %Lambda_Arguments* %args, i32 0, i32 1
        store i32 %b, i32* %2
        %c = call i32 @integer_parse()
        %3 = getelementptr %Lambda_Arguments* %args, i32 0, i32 2
        store i32 %c, i32* %3
        %4 = call i32 @lambda(%Lambda_Arguments* %args, i32 10)
        ret i32 %4
    }

Obviously there are some possible variations over this theme:

You could pass all implicit arguments as explicit arguments.
You could pass all implicit arguments explicitly in a structure.
You could pass in a pointer to the frame of the caller and let the lambda function extract the arguments and locals from the input frame.
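The structure-based lowering can also be written out by hand in C++ and compared against a real lambda. This is a sketch under stated assumptions: the names `LambdaArguments`, `lowered_lambda`, `foo_lowered`, `foo_with_lambda`, and `lambda_demo` are hypothetical, and `c` is passed in as a parameter instead of coming from `integer_parse()` so that the example is self-contained.

```cpp
#include <cassert>

// The captured variables, wrapped up as in %Lambda_Arguments.
struct LambdaArguments {
    int a;  // 0: a (argument)
    int b;  // 1: b (argument)
    int c;  // 2: c (local)
};

// The lambda body as an ordinary function taking the captures by pointer.
int lowered_lambda(const LambdaArguments* args, int x) {
    return (args->a + args->b - args->c) * x;
}

// The containing function after lowering: fill the structure, then call.
int foo_lowered(int a, int b, int c) {
    LambdaArguments args{a, b, c};
    return lowered_lambda(&args, 10);
}

// The same function written with a real C++ lambda, for comparison.
int foo_with_lambda(int a, int b, int c) {
    auto function = [=](int x) { return (a + b - c) * x; };
    return function(10);
}

// Hypothetical demo: both forms compute the same result.
void lambda_demo() {
    assert(foo_lowered(5, 4, 2) == 70);
    assert(foo_lowered(5, 4, 2) == foo_with_lambda(5, 4, 2));
}
```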

Closures

todo: Describe closures.

Generators

A generator is a function that repeatedly yields a value in such a way that the function's state is preserved across the repeated calls of the function; this includes the function's local offset at the point it yielded a value.

The most straightforward way to implement a generator is by wrapping all of its state variables (arguments, local variables, and return values) up into an ad-hoc structure and then passing the address of that structure to the generator.

Somehow, you need to keep track of which block of the generator you are executing on each call. This can be done in various ways; the way we show here is by using LLVM's blockaddress constant expression to save the address of the next local block of code that should be executed. Other implementations use a simple state variable and then do a switch-like dispatch according to the value of the state variable. In both cases, the end result is the same: A different block of code is executed for each local block in the generator.

The important thing is to think of iterators as a sort of micro-thread that is resumed whenever the iterator is called again. In other words, we need to save the address of how far the iterator got on each pass so that it can resume as if a microscopic thread switch had occurred. So we save the address of the instruction after the return instruction so that we can resume running as if we never had returned in the first place.
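The state-variable alternative mentioned above can be sketched in plain C++. This is an illustrative sketch only, not the blockaddress-based scheme this section goes on to use; the names `FooContext`, `foo_yield`, and `generator_demo` are chosen for this example, and the generator simply yields 1, 2, and 3.

```cpp
#include <cassert>

// All generator state lives in a context structure owned by the caller.
struct FooContext {
    int state = 0;  // which block to execute on the next call
    int value = 0;  // the most recently yielded result
};

// Returns true if a value was produced, false once the generator is done.
// Each case stores the result, records the next block, and returns.
bool foo_yield(FooContext& context) {
    switch (context.state) {
    case 0:
        context.value = 1;
        context.state = 1;
        return true;
    case 1:
        context.value = 2;
        context.state = 2;
        return true;
    case 2:
        context.value = 3;
        context.state = 3;
        return true;
    default:
        return false;
    }
}

// Hypothetical demo: drain the generator and check what it produced.
void generator_demo() {
    FooContext context;
    int count = 0;
    int sum = 0;
    while (foo_yield(context)) {
        ++count;
        sum += context.value;
    }
    assert(count == 3);
    assert(sum == 6);
}
```

The switch on `state` does the job that an indirect branch on a saved block address does in the other scheme: it resumes execution at the block following the previous return.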

I resort to pseudo-C++ because C++ does not directly support generators. First we look at a very simple case then we advance on to a slightly more complex case:

 
    #include <stdio.h>

    generator int foo()
    {
        yield 1;
        yield 2;
        yield 3;
    }

    int main()
    {
        foreach (int i in foo())
            printf("Value: %d\n", i);

        return 0;
    }

 
   %foo_context 
     
   = 
     
   type 
     
   { 
   
   i8 
   *, 
          
   ; 0: block (state) 
   
   i32 
          
   ; 1: value (result) 
   
   } 
   
   define 
     
   void 
     
   @foo_setup 
   ( 
   %foo_context 
   * 
     
   %context 
   ) 
     
   nounwind 
     
   { 
   
   ; set up 'block' 
   
   %1 
     
   = 
     
   getelementptr 
     
   %foo_context 
   * 
     
   %context 
   , 
     
   i32 
     
   0 
   , 
     
   i32 
     
   0 
   
   store 
     
   i8 
   * 
     
   blockaddress 
   ( 
   @foo_yield 
   , 
     
   %.yield1 
   ) 
   , 
     
   i8 
   ** 
     
   %1 
   
   ret 
     
   void 
   
   } 
   
    ; The boolean returned indicates if a result was available or not.
    ; Once no more results are available, the caller is expected to not call
    ; the iterator again.
    define i1 @foo_yield(%foo_context* %context) nounwind {
        ; dispatch to the active generator block
        %1 = getelementptr %foo_context* %context, i32 0, i32 0
        %2 = load i8** %1
        indirectbr i8* %2, [ label %.yield1, label %.yield2, label %.yield3, label %.done ]

    .yield1:
        ; store the result value (1)
        %3 = getelementptr %foo_context* %context, i32 0, i32 1
        store i32 1, i32* %3

        ; make 'block' point to next block to execute
        %4 = getelementptr %foo_context* %context, i32 0, i32 0
        store i8* blockaddress(@foo_yield, %.yield2), i8** %4

        ret i1 1

    .yield2:
        ; store the result value (2)
        %5 = getelementptr %foo_context* %context, i32 0, i32 1
        store i32 2, i32* %5

        ; make 'block' point to next block to execute
        %6 = getelementptr %foo_context* %context, i32 0, i32 0
        store i8* blockaddress(@foo_yield, %.yield3), i8** %6

        ret i1 1

    .yield3:
        ; store the result value (3)
        %7 = getelementptr %foo_context* %context, i32 0, i32 1
        store i32 3, i32* %7

        ; make 'block' point to next block to execute
        %8 = getelementptr %foo_context* %context, i32 0, i32 0
        store i8* blockaddress(@foo_yield, %.done), i8** %8

        ret i1 1

    .done:
        ret i1 0
    }
   
    declare i32 @printf(i8*, ...) nounwind

    @.string = internal constant [11 x i8] c"Value: %d\0A\00"

    define i32 @main() nounwind {
        ; allocate and initialize generator context structure
        %context = alloca %foo_context
        call void @foo_setup(%foo_context* %context)
        br label %.head

    .head:
        ; foreach (int i in foo())
        %1 = call i1 @foo_yield(%foo_context* %context)
        br i1 %1, label %.body, label %.tail

    .body:
        %2 = getelementptr %foo_context* %context, i32 0, i32 1
        %3 = load i32* %2
        %4 = call i32 (i8*, ...)* @printf(i8* getelementptr([11 x i8]* @.string, i32 0, i32 0), i32 %3)
        br label %.head

    .tail:
        ; return 0
        ret i32 0
    }
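
For readers more comfortable with C++ than with LLVM IR, the lowering above can be sketched as an explicit state machine. This is only an illustration under stated assumptions: an integer state field and a switch stand in for the stored block address and the indirectbr instruction, and collect_foo is a hypothetical helper that does not appear in the IR.

```cpp
#include <vector>

// Mirrors %foo_context: a resume state plus the most recently yielded value.
struct FooContext {
    int state;  // assumption: an integer replaces the stored block address
    int value;  // result slot read by the caller
};

// Counterpart of @foo_setup: resume at the first yield.
void foo_setup(FooContext *context) {
    context->state = 0;
}

// Counterpart of @foo_yield: returns true while a value is available.
bool foo_yield(FooContext *context) {
    switch (context->state) {  // stands in for indirectbr
    case 0: context->value = 1; context->state = 1; return true;
    case 1: context->value = 2; context->state = 2; return true;
    case 2: context->value = 3; context->state = 3; return true;
    default: return false;     // the .done block
    }
}

// Hypothetical helper: drains the generator the way the foreach loop does.
std::vector<int> collect_foo() {
    FooContext context;
    foo_setup(&context);
    std::vector<int> values;
    while (foo_yield(&context))
        values.push_back(context.value);
    return values;
}
```

Calling collect_foo() gathers the yielded values in order, just as the .head/.body loop in @main prints them one by one.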

And now for a slightly more complex example that involves local variables:

 
    #include <stdio.h>

    generator int foo(int start, int after)
    {
      for (int index = start; index < after; index++)
      {
        if (index % 2 == 0)
          yield index + 1;
        else
          yield index - 1;
      }
    }

    int main(void)
    {
      foreach (int i in foo(0, 5))
        printf("Value: %d\n", i);

      return 0;
    }

This becomes something like this:

 
    %foo_context = type {
        i8*,      ; 0: block (state)
        i32,      ; 1: start (argument)
        i32,      ; 2: after (argument)
        i32,      ; 3: index (local)
        i32       ; 4: value (result)
    }

    define void @foo_setup(%foo_context* %context, i32 %start, i32 %after) nounwind {
        ; set up 'block'
        %1 = getelementptr %foo_context* %context, i32 0, i32 0
        store i8* blockaddress(@foo_yield, %.init), i8** %1

        ; set up 'start'
        %2 = getelementptr %foo_context* %context, i32 0, i32 1
        store i32 %start, i32* %2

        ; set up 'after'
        %3 = getelementptr %foo_context* %context, i32 0, i32 2
        store i32 %after, i32* %3

        ret void
    }
   
    define i1 @foo_yield(%foo_context* %context) nounwind {
        ; dispatch to the active generator block
        %1 = getelementptr %foo_context* %context, i32 0, i32 0
        %2 = load i8** %1
        indirectbr i8* %2, [ label %.init, label %.loop_close, label %.end ]

    .init:
        ; copy argument 'start' to the local variable 'index'
        %3 = getelementptr %foo_context* %context, i32 0, i32 1
        %start = load i32* %3
        %4 = getelementptr %foo_context* %context, i32 0, i32 3
        store i32 %start, i32* %4
        br label %.head

    .head:
        ; for (; index < after; )
        %5 = getelementptr %foo_context* %context, i32 0, i32 3
        %index = load i32* %5
        %6 = getelementptr %foo_context* %context, i32 0, i32 2
        %after = load i32* %6
        %again = icmp slt i32 %index, %after
        br i1 %again, label %.loop_begin, label %.exit

    .loop_begin:
        %7 = srem i32 %index, 2
        %8 = icmp eq i32 %7, 0
        br i1 %8, label %.even, label %.odd

    .even:
        ; store 'index + 1' in 'value'
        %9 = add i32 %index, 1
        %10 = getelementptr %foo_context* %context, i32 0, i32 4
        store i32 %9, i32* %10

        ; make 'block' point to the end of the loop (after the yield)
        %11 = getelementptr %foo_context* %context, i32 0, i32 0
        store i8* blockaddress(@foo_yield, %.loop_close), i8** %11

        ret i1 1

    .odd:
        ; store 'index - 1' in 'value'
        %12 = sub i32 %index, 1
        %13 = getelementptr %foo_context* %context, i32 0, i32 4
        store i32 %12, i32* %13

        ; make 'block' point to the end of the loop (after the yield)
        %14 = getelementptr %foo_context* %context, i32 0, i32 0
        store i8* blockaddress(@foo_yield, %.loop_close), i8** %14

        ret i1 1

    .loop_close:
        ; increment 'index'
        %15 = getelementptr %foo_context* %context, i32 0, i32 3
        %16 = load i32* %15
        %17 = add i32 %16, 1
        store i32 %17, i32* %15
        br label %.head

    .exit:
        ; make 'block' point to the %.end label
        %x = getelementptr %foo_context* %context, i32 0, i32 0
        store i8* blockaddress(@foo_yield, %.end), i8** %x
        br label %.end

    .end:
        ret i1 0
    }
   
    declare i32 @printf(i8*, ...) nounwind

    @.string = internal constant [11 x i8] c"Value: %d\0A\00"

    define i32 @main() nounwind {
        ; allocate and initialize generator context structure
        %context = alloca %foo_context
        call void @foo_setup(%foo_context* %context, i32 0, i32 5)
        br label %.head

    .head:
        ; foreach (int i in foo(0, 5))
        %1 = call i1 @foo_yield(%foo_context* %context)
        br i1 %1, label %.body, label %.tail

    .body:
        %2 = getelementptr %foo_context* %context, i32 0, i32 4
        %3 = load i32* %2
        %4 = call i32 (i8*, ...)* @printf(i8* getelementptr([11 x i8]* @.string, i32 0, i32 0), i32 %3)
        br label %.head

    .tail:
        ret i32 0
    }

Another possible way of doing the above would be to generate an LLVM IR function for each state and then store a function pointer in the context structure, which is updated whenever a new state/function needs to be invoked.
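
A rough C++ sketch of that alternative, applied to the foo(start, after) generator, might look as follows. It is an illustration under assumptions rather than a definitive lowering: the names StateFn, foo_init, foo_head, foo_loop_close, foo_end and collect_foo_range are all hypothetical, and the context stores a function pointer where the IR above stores a block address.

```cpp
#include <vector>

struct FooContext;
using StateFn = bool (*)(FooContext *);  // hypothetical: one function per state

struct FooContext {
    StateFn next;   // function to invoke on the next call to foo_yield
    int start;      // argument
    int after;      // argument
    int index;      // local variable kept alive across yields
    int value;      // result slot read by the caller
};

bool foo_end(FooContext *) { return false; }

bool foo_loop_close(FooContext *context);

// Loop test plus loop body: either yields a value or finishes the generator.
bool foo_head(FooContext *context) {
    if (context->index >= context->after) {
        context->next = foo_end;         // later calls keep returning false
        return false;
    }
    context->value = (context->index % 2 == 0) ? context->index + 1
                                               : context->index - 1;
    context->next = foo_loop_close;      // resume after the yield
    return true;
}

// State entered when resuming after a yield: increment and re-test.
bool foo_loop_close(FooContext *context) {
    ++context->index;
    return foo_head(context);
}

// Initial state: copy 'start' into 'index', then run the loop test.
bool foo_init(FooContext *context) {
    context->index = context->start;
    return foo_head(context);
}

void foo_setup(FooContext *context, int start, int after) {
    context->next = foo_init;
    context->start = start;
    context->after = after;
}

// The dispatcher shrinks to a single indirect call.
bool foo_yield(FooContext *context) { return context->next(context); }

// Hypothetical helper: drains the generator the way the foreach loop does.
std::vector<int> collect_foo_range(int start, int after) {
    FooContext context;
    foo_setup(&context, start, after);
    std::vector<int> values;
    while (foo_yield(&context))
        values.push_back(context.value);
    return values;
}
```

The trade-off is that each resume point becomes a separate function, so values that live across a yield must still be kept in the context structure, exactly as in the block-address version.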

Mapping Exception Handling to LLVM IR

Exceptions can be implemented in one of three ways:

The simple way, by using a propagated return value.
The bulky way, by using setjmp and longjmp.
The efficient way, by using a zero-cost exception ABI.

Note that many self-respecting compiler developers won't accept the first method as a proper way of handling exceptions. However, it is unbeatable in terms of simplicity and can help people understand that implementing exceptions does not need to be very difficult.

The second method is used by some production compilers, but it has large overhead both in terms of code bloat and the cost of a try-catch statement (because all CPU registers are saved using setjmp whenever a try statement is encountered).

The third method is very advanced, but in return it adds no cost to execution paths where no exception is thrown. It is the de facto "right" way of implementing exceptions, whether you like it or not, and LLVM supports it directly.

In the three sections below, we'll take this sample and transform it:

 
    #include <stdio.h>
    #include <stdlib.h>

    class Foo
    {
    public:
      int GetLength() const
      {
        return _length;
      }

      void SetLength(int value)
      {
        _length = value;
      }

    private:
      int _length;
    };

    int Bar(bool fail)
    {
      Foo foo;
      foo.SetLength(17);
      if (fail)
        throw new Exception("Exception requested by caller");
      foo.SetLength(24);
      return foo.GetLength();
    }

    int main(int argc, const char *argv[])
    {
      int result;

      try
      {
        /* The program throws an exception if an argument is specified. */
        bool fail = (argc >= 2);

        /* Let callee decide if an exception is thrown. */
        int value = Bar(fail);

        result = EXIT_SUCCESS;
      }
      catch (Exception *that)
      {
        printf("Error: %s\n", that->GetText());
        result = EXIT_FAILURE;
      }
      catch (...)
      {
        puts("Internal error: Unhandled exception detected");
        result = EXIT_FAILURE;
      }

      return result;
    }

Exception Handling by Propagated Return Value

This method is a compiler-generated way of implicitly checking each function's return value. Its main advantage is that it is simple - at the cost of many mostly unproductive checks of return values. The great thing about this method is that it readily interfaces with a host of languages and environments - it is all a matter of returning a pointer to an exception.
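
Before looking at the IR, the idea can be sketched at source level. The snippet below is a simplified illustration, not the exact lowering: the Exception struct is a hypothetical stand-in for the richer class used in this chapter, and run is a hypothetical wrapper that plays the role of the try/catch statement.

```cpp
#include <string>

// Hypothetical minimal exception type; the chapter's Exception class is richer.
struct Exception {
    const char *text;
};

// Returns nullptr on success or a heap-allocated exception on failure.
// The "real" return value travels through the out-parameter instead.
Exception *Bar(bool fail, int *result) {
    if (fail)
        return new Exception{"Exception requested by caller"};
    *result = 24;
    return nullptr;
}

// Roughly what a compiler would emit around a call inside a try block:
// an implicit null check on the returned exception pointer.
std::string run(bool fail) {
    int value = 0;
    if (Exception *that = Bar(fail, &value)) {
        // catch (Exception *that)
        std::string message = std::string("Error: ") + that->text;
        delete that;
        return message;
    }
    return "ok: " + std::to_string(value);
}
```

Every call site pays for the null check whether or not an exception occurs, which is the cost mentioned above.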

The C++ sample from the introduction to this chapter maps to the following code:

 
    ;********************* External and Utility functions *********************

    declare i8* @malloc(i32) nounwind
    declare void @free(i8*) nounwind
    declare i32 @printf(i8* noalias nocapture, ...) nounwind
    declare i32 @puts(i8* noalias nocapture) nounwind
   
    ;***************************** Object class *******************************

    %Object_vtable_type = type {
        %Object_vtable_type*,    ; 0: above: parent class vtable pointer
        i8*                      ; 1: class: class name (usually mangled)
        ; virtual methods would follow here
    }

    @.Object_class_name = private constant [7 x i8] c"Object\00"

    @.Object_vtable = private constant %Object_vtable_type {
        %Object_vtable_type* null,    ; This is the root object of the object hierarchy
        i8* getelementptr([7 x i8]* @.Object_class_name, i32 0, i32 0)
    }

    %Object = type {
        %Object_vtable_type*     ; 0: vtable: class vtable pointer (always non-null)
        ; class data members would follow here
    }

    ; returns true if the specified object is identical to or derived from the
    ; class with the specified name.
    define i1 @Object_IsA(%Object* %object, i8* %name) nounwind {
    .init:
        ; if (object == null) return false
        %0 = icmp ne %Object* %object, null
        br i1 %0, label %.once, label %.exit_false

    .once:
        %1 = getelementptr %Object* %object, i32 0, i32 0
        br label %.body

    .body:
        ; if (vtable->class == name)
        %2 = phi %Object_vtable_type** [ %1, %.once ], [ %7, %.next ]
        %3 = load %Object_vtable_type** %2
        %4 = getelementptr %Object_vtable_type* %3, i32 0, i32 1
        %5 = load i8** %4
        %6 = icmp eq i8* %5, %name
        br i1 %6, label %.exit_true, label %.next

    .next:
        ; vtable = vtable->above
        %7 = getelementptr %Object_vtable_type* %3, i32 0, i32 0
        %8 = load %Object_vtable_type** %7

        ; while (vtable != null)
        %9 = icmp ne %Object_vtable_type* %8, null
        br i1 %9, label %.body, label %.exit_false

    .exit_true:
        ret i1 true

    .exit_false:
        ret i1 false
    }
   
    ;*************************** Exception class ******************************

    %Exception_vtable_type = type {
        %Object_vtable_type*,    ; 0: parent class vtable pointer
        i8*                      ; 1: class name
        ; virtual methods would follow here.
    }

    @.Exception_class_name = private constant [10 x i8] c"Exception\00"

    @.Exception_vtable = private constant %Exception_vtable_type {
        %Object_vtable_type* @.Object_vtable,    ; the parent of this class is the Object class
        i8* getelementptr([10 x i8]* @.Exception_class_name, i32 0, i32 0)
    }

    %Exception = type {
        %Exception_vtable_type*,    ; 0: the vtable pointer
        i8*                         ; 1: the _text member
    }

    define void @Exception_Create_String(%Exception* %this, i8* %text) nounwind {
        ; set up vtable
        %1 = getelementptr %Exception* %this, i32 0, i32 0
        store %Exception_vtable_type* @.Exception_vtable, %Exception_vtable_type** %1

        ; save input text string into _text
        %2 = getelementptr %Exception* %this, i32 0, i32 1
        store i8* %text, i8** %2

        ret void
    }

    define i8* @Exception_GetText(%Exception* %this) nounwind {
        %1 = getelementptr %Exception* %this, i32 0, i32 1
        %2 = load i8** %1
        ret i8* %2
    }
   
    ;******************************* Foo class ********************************

    %Foo = type { i32 }

    define void @Foo_Create_Default(%Foo* %this) nounwind {
        %1 = getelementptr %Foo* %this, i32 0, i32 0
        store i32 0, i32* %1
        ret void
    }

    define i32 @Foo_GetLength(%Foo* %this) nounwind {
        %1 = getelementptr %Foo* %this, i32 0, i32 0
        %2 = load i32* %1
        ret i32 %2
    }

    define void @Foo_SetLength(%Foo* %this, i32 %value) nounwind {
        %1 = getelementptr %Foo* %this, i32 0, i32 0
        store i32 %value, i32* %1
        ret void
    }
   
    ;********************************* Bar function ***************************

    @.message1 = internal constant [30 x i8] c"Exception requested by caller\00"

    define %Exception* @Bar(i1 %fail, i32* %result) nounwind {
        ; Allocate Foo instance
        %foo = alloca %Foo
        call void @Foo_Create_Default(%Foo* %foo)

        ; foo.SetLength(17)
        call void @Foo_SetLength(%Foo* %foo, i32 17)

        ; if (fail)
        %1 = icmp eq i1 %fail, true
        br i1 %1, label %.if_begin, label %.if_close

    .if_begin:
        ; throw new Exception(...)
        %2 = call i8* @malloc(i32 8)
        %3 = bitcast i8* %2 to %Exception*
        %4 = getelementptr [30 x i8]* @.message1, i32 0, i32 0
        call void @Exception_Create_String(%Exception* %3, i8* %4)
        ret %Exception* %3

    .if_close:
        ; foo.SetLength(24)
        call void @Foo_SetLength(%Foo* %foo, i32 24)

        ; return foo.GetLength() through the 'result' out-parameter
        %5 = call i32 @Foo_GetLength(%Foo* %foo)
        store i32 %5, i32* %result
        ret %Exception* null
    }
   
    ;********************************* Main program ***************************

    @.message2 = internal constant [11 x i8] c"Error: %s\0A\00"

    @.message3 = internal constant [45 x i8] c"Internal error: Unhandled exception detected\00"

    define i32 @main(i32 %argc, i8** %argv) nounwind {
        ; "try" keyword expands to nothing.

        ; Body of try block.

        ; fail = (argc >= 2)
        %fail = icmp sge i32 %argc, 2

        ; Function call.
        %1 = alloca i32
        %2 = call %Exception* @Bar(i1 %fail, i32* %1)
        %3 = icmp ne %Exception* %2, null
        br i1 %3, label %.catch_block, label %.exit

    .catch_block:
        %4 = bitcast %Exception* %2 to %Object*
        %5 = getelementptr [10 x i8]* @.Exception_class_name, i32 0, i32 0
        %6 = call i1 @Object_IsA(%Object* %4, i8* %5)
        br i1 %6, label %.catch_exception, label %.catch_all

    .catch_exception:
        %7 = getelementptr [11 x i8]* @.message2, i32 0, i32 0
        %8 = call i8* @Exception_GetText(%Exception* %2)
        %9 = call i32 (i8*, ...)* @printf(i8* %7, i8* %8)
        br label %.exit

    .catch_all:
        %10 = getelementptr [45 x i8]* @.message3, i32 0, i32 0
        %11 = call i32 @puts(i8* %10)
        br label %.exit

    .exit:
        %result = phi i32 [ 0, %0 ], [ 1, %.catch_exception ], [ 1, %.catch_all ]
        ret i32 %result
    }

Setjmp/Longjmp Exception Handling

The basic idea behind the setjmp and longjmp exception handling scheme is that you save the CPU state whenever you encounter a try keyword and then do a longjmp whenever you throw an exception. If there are few try blocks in the program, as is typically the case, the cost of this method is not as high as it might seem. Often, however, there are implicit exception handlers because local resources, such as class instances allocated on the stack, must be released, and then the cost can become quite high.

Setjmp/longjmp exception handling is often abbreviated SjLj for SetJmp/LongJmp.
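
The mechanism itself can be sketched in a few lines of C-style C++. This is a deliberately reduced illustration under assumptions: current_handler, pending_message, throw_message and run are hypothetical names, only trivially destructible locals are used (jumping past C++ destructors with longjmp is undefined behavior), and the handler chain is tracked with a single saved pointer.

```cpp
#include <csetjmp>

// Hypothetical runtime state: innermost active handler and the thrown payload.
static std::jmp_buf *current_handler = nullptr;
static const char *pending_message = nullptr;

// "throw": record the payload and jump to the innermost try block.
[[noreturn]] void throw_message(const char *message) {
    pending_message = message;
    std::longjmp(*current_handler, 1);
}

int Bar(bool fail) {
    if (fail)
        throw_message("Exception requested by caller");
    return 24;
}

// Returns 0 when Bar completes normally and 1 when the catch path ran.
int run(bool fail) {
    std::jmp_buf env;
    std::jmp_buf *previous = current_handler;  // not modified after setjmp
    current_handler = &env;                    // entering the try block

    if (setjmp(env) == 0) {
        Bar(fail);                             // body of the try block
        current_handler = previous;            // leaving the try block normally
        return 0;
    }

    // Reached through longjmp: this is the catch block.
    current_handler = previous;
    return 1;
}
```

Note that setjmp runs on every entry to the try block, even when nothing is thrown, which is where the overhead of this scheme comes from.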

The C++ sample from the introduction to this chapter translates into something like this:

 
    ; jmp_buf is very platform specific, this is for illustration only...
    %jmp_buf = type { i32 }

    declare i32 @setjmp(%jmp_buf* %env)
    declare void @longjmp(%jmp_buf* %env, i32 %val)
   
    ;********************* External and Utility functions *********************

    declare i8* @malloc(i32) nounwind
    declare void @free(i8*) nounwind
    declare i32 @printf(i8* noalias nocapture, ...) nounwind
    declare i32 @puts(i8* noalias nocapture) nounwind
   
    ;***************************** Object class *******************************

    %Object_vtable_type = type {
        %Object_vtable_type*,    ; 0: above: parent class vtable pointer
        i8*                      ; 1: class: class name (usually mangled)
        ; virtual methods would follow here
    }

    @.Object_class_name = private constant [7 x i8] c"Object\00"

    @.Object_vtable = private constant %Object_vtable_type {
        %Object_vtable_type* null,    ; This is the root object of the object hierarchy
        i8* getelementptr([7 x i8]* @.Object_class_name, i32 0, i32 0)
    }

    %Object = type {
        %Object_vtable_type*     ; 0: vtable: class vtable pointer (always non-null)
        ; class data members would follow here
    }

    ; returns true if the specified object is identical to or derived from the
    ; class with the specified name.
    define i1 @Object_IsA(%Object* %object, i8* %name) nounwind {
    .init:
        ; if (object == null) return false
        %0 = icmp ne %Object* %object, null
        br i1 %0, label %.once, label %.exit_false

    .once:
        %1 = getelementptr %Object* %object, i32 0, i32 0
        br label %.body

    .body:
        ; if (vtable->class == name)
        %2 = phi %Object_vtable_type** [ %1, %.once ], [ %7, %.next ]
        %3 = load %Object_vtable_type** %2
        %4 = getelementptr %Object_vtable_type* %3, i32 0, i32 1
        %5 = load i8** %4
        %6 = icmp eq i8* %5, %name
        br i1 %6, label %.exit_true, label %.next

    .next:
        ; vtable = vtable->above
        %7 = getelementptr %Object_vtable_type* %3, i32 0, i32 0
        %8 = load %Object_vtable_type** %7

        ; while (vtable != null)
        %9 = icmp ne %Object_vtable_type* %8, null
        br i1 %9, label %.body, label %.exit_false

    .exit_true:
        ret i1 true

    .exit_false:
        ret i1 false
    }
   
   ;*************************** Exception class ******************************

   %Exception_vtable_type = type {
      %Object_vtable_type*,      ; 0: parent class vtable pointer
      i8*                        ; 1: class name
      ; virtual methods would follow here.
   }

   @.Exception_class_name = private constant [10 x i8] c"Exception\00"

   @.Exception_vtable = private constant %Exception_vtable_type {
      %Object_vtable_type* @.Object_vtable,   ; the parent of this class is the Object class
      i8* getelementptr([10 x i8]* @.Exception_class_name, i32 0, i32 0)
   }

   %Exception = type {
      %Exception_vtable_type*,   ; 0: the vtable pointer
      i8*                        ; 1: the _text member
   }

   define void @Exception_Create_String(%Exception* %this, i8* %text) nounwind {
      ; set up vtable
      %1 = getelementptr %Exception* %this, i32 0, i32 0
      store %Exception_vtable_type* @.Exception_vtable, %Exception_vtable_type** %1

      ; save input text string into _text
      %2 = getelementptr %Exception* %this, i32 0, i32 1
      store i8* %text, i8** %2

      ret void
   }

   define i8* @Exception_GetText(%Exception* %this) nounwind {
      %1 = getelementptr %Exception* %this, i32 0, i32 1
      %2 = load i8** %1
      ret i8* %2
   }
   
   ;******************************* Foo class ********************************

   %Foo = type { i32 }

   define void @Foo_Create_Default(%Foo* %this) nounwind {
      %1 = getelementptr %Foo* %this, i32 0, i32 0
      store i32 0, i32* %1
      ret void
   }

   define i32 @Foo_GetLength(%Foo* %this) nounwind {
      %1 = getelementptr %Foo* %this, i32 0, i32 0
      %2 = load i32* %1
      ret i32 %2
   }

   define void @Foo_SetLength(%Foo* %this, i32 %value) nounwind {
      %1 = getelementptr %Foo* %this, i32 0, i32 0
      store i32 %value, i32* %1
      ret void
   }
   
   ;********************************* Bar function ***************************

   @.message1 = internal constant [30 x i8] c"Exception requested by caller\00"

   define i32 @Bar(%jmp_buf* %throw, i1 %fail) nounwind {
      ; Allocate Foo instance
      %foo = alloca %Foo
      call void @Foo_Create_Default(%Foo* %foo)

      call void @Foo_SetLength(%Foo* %foo, i32 17)

      ; if (fail)
      %1 = icmp eq i1 %fail, true
      br i1 %1, label %.if_begin, label %.if_close

   .if_begin:
      ; throw new Exception(...)
      %2 = call i8* @malloc(i32 8)
      %3 = bitcast i8* %2 to %Exception*
      %4 = getelementptr [30 x i8]* @.message1, i32 0, i32 0
      call void @Exception_Create_String(%Exception* %3, i8* %4)
      %5 = ptrtoint %Exception* %3 to i32
      call void @longjmp(%jmp_buf* %throw, i32 %5)
      ; we never get here
      br label %.if_close

   .if_close:
      ; foo.SetLength(24)
      call void @Foo_SetLength(%Foo* %foo, i32 24)
      %6 = call i32 @Foo_GetLength(%Foo* %foo)
      ret i32 %6
   }
   
   ;********************************* Main program ***************************

   @.message2 = internal constant [11 x i8] c"Error: %s\0A\00"
   @.message3 = internal constant [45 x i8] c"Internal error: Unhandled exception detected\00"

   define i32 @main(i32 %argc, i8** %argv) nounwind {
      ; "try" keyword expands to a call to @setjmp
      %env = alloca %jmp_buf
      %status = call i32 @setjmp(%jmp_buf* %env)
      %1 = icmp eq i32 %status, 0
      br i1 %1, label %.body, label %.catch_block

   .body:
      ; Body of try block.
      ; fail = (argc >= 2)
      %fail = icmp uge i32 %argc, 2

      ; Function call.
      %2 = call i32 @Bar(%jmp_buf* %env, i1 %fail)
      br label %.exit

   .catch_block:
      %3 = inttoptr i32 %status to %Object*
      %4 = getelementptr [10 x i8]* @.Exception_class_name, i32 0, i32 0
      %5 = call i1 @Object_IsA(%Object* %3, i8* %4)
      br i1 %5, label %.catch_exception, label %.catch_all

   .catch_exception:
      %6 = inttoptr i32 %status to %Exception*
      %7 = getelementptr [11 x i8]* @.message2, i32 0, i32 0
      %8 = call i8* @Exception_GetText(%Exception* %6)
      %9 = call i32 (i8*, ...)* @printf(i8* %7, i8* %8)
      br label %.exit

   .catch_all:
      %10 = getelementptr [45 x i8]* @.message3, i32 0, i32 0
      %11 = call i32 @puts(i8* %10)
      br label %.exit

   .exit:
      %result = phi i32 [ 0, %.body ], [ 1, %.catch_exception ], [ 1, %.catch_all ]
      ret i32 %result
   }

Zero Cost Exception Handling

todo: Explain how to implement exception handling using zero cost exception handling.

Resources

Compiler Internals - Exception Handling.
Exception Handling in C without C++.
How a C++ Compiler Implements Exception Handling.
DWARF standard - Exception Handling.
Itanium C++ ABI.

Mapping Object-Oriented Constructs to LLVM IR

In this chapter we'll look at various object-oriented constructs and see how they can be mapped to LLVM IR.

Classes

A class is nothing more than a structure with an associated set of functions that take an implicit first parameter, namely a pointer to the structure. Therefore, it is trivial to map a class to LLVM IR:

 
   #include <stddef.h>

   class Foo
   {
   public:
      Foo()
      {
         _length = 0;
      }

      size_t GetLength() const
      {
         return _length;
      }

      void SetLength(size_t value)
      {
         _length = value;
      }

   private:
      size_t _length;
   };

We first transform this code into two separate pieces:

1. The structure definition.
2. The list of methods, including the constructor.

 
   ; The structure definition for class Foo.
   %Foo = type { i32 }

   ; The default constructor for class Foo.
   define void @Foo_Create_Default(%Foo* %this) nounwind {
      %1 = getelementptr %Foo* %this, i32 0, i32 0
      store i32 0, i32* %1
      ret void
   }

   ; The Foo::GetLength() method.
   define i32 @Foo_GetLength(%Foo* %this) nounwind {
      %1 = getelementptr %Foo* %this, i32 0, i32 0
      ; load through the member pointer %1; %this has type %Foo*, not i32*
      %2 = load i32* %1
      ret i32 %2
   }

   ; The Foo::SetLength() method.
   define void @Foo_SetLength(%Foo* %this, i32 %value) nounwind {
      %1 = getelementptr %Foo* %this, i32 0, i32 0
      store i32 %value, i32* %1
      ret void
   }

Then we make sure that the constructor (Foo_Create_Default) is invoked whenever an instance of the structure is created:

   Foo foo;

   %foo = alloca %Foo
   call void @Foo_Create_Default(%Foo* %foo)
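
The same lowering can be written out by hand in C++ to make the implicit first parameter visible. This is only a sketch: the function names mirror the LLVM IR above, the field is narrowed to 32 bits just as the IR does, and lowered_foo_demo is a hypothetical helper added for illustration.

```cpp
#include <cassert>
#include <cstdint>

// Hand-lowered equivalent of class Foo: a plain struct plus free functions
// that take the former implicit "this" pointer as an explicit first argument.
struct Foo {
    std::int32_t length;  // corresponds to Foo::_length (i32 in the IR)
};

// The default constructor.
void Foo_Create_Default(Foo* self) { self->length = 0; }

// Foo::GetLength()
std::int32_t Foo_GetLength(const Foo* self) { return self->length; }

// Foo::SetLength()
void Foo_SetLength(Foo* self, std::int32_t value) { self->length = value; }

// Hypothetical helper: what "Foo foo;" followed by method calls turns into.
std::int32_t lowered_foo_demo() {
    Foo foo;                   // %foo = alloca %Foo
    Foo_Create_Default(&foo);  // call void @Foo_Create_Default(%Foo* %foo)
    assert(Foo_GetLength(&foo) == 0);
    Foo_SetLength(&foo, 17);
    return Foo_GetLength(&foo);
}
```

Nothing in the struct itself records which functions belong to it; the association exists only in the naming convention and in the calls the compiler emits.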

Virtual Methods

A virtual method is no more than a compiler-controlled function pointer. Each virtual method is recorded in the vtable, which is a structure of all the function pointers needed by a given class:

 
   class Foo
   {
   public:
      virtual int GetLengthTimesTwo() const
      {
         return _length * 2;
      }

      void SetLength(int value)
      {
         _length = value;
      }

   private:
      int _length;
   };

   int main()
   {
      Foo foo;
      foo.SetLength(4);
      return foo.GetLengthTimesTwo();
   }

This becomes:

 
   %Foo_vtable_type = type { i32(%Foo*)* }

   %Foo = type { %Foo_vtable_type*, i32 }

   define i32 @Foo_GetLengthTimesTwo(%Foo* %this) nounwind {
      %1 = getelementptr %Foo* %this, i32 0, i32 1
      %2 = load i32* %1
      %3 = mul i32 %2, 2
      ret i32 %3
   }

   @Foo_vtable_data = global %Foo_vtable_type {
      i32(%Foo*)* @Foo_GetLengthTimesTwo
   }

   define void @Foo_Create_Default(%Foo* %this) nounwind {
      %1 = getelementptr %Foo* %this, i32 0, i32 0
      store %Foo_vtable_type* @Foo_vtable_data, %Foo_vtable_type** %1
      %2 = getelementptr %Foo* %this, i32 0, i32 1
      store i32 0, i32* %2
      ret void
   }

   define void @Foo_SetLength(%Foo* %this, i32 %value) nounwind {
      %1 = getelementptr %Foo* %this, i32 0, i32 1
      store i32 %value, i32* %1
      ret void
   }

   define i32 @main(i32 %argc, i8** %argv) nounwind {
      %foo = alloca %Foo
      call void @Foo_Create_Default(%Foo* %foo)
      call void @Foo_SetLength(%Foo* %foo, i32 4)
      %1 = getelementptr %Foo* %foo, i32 0, i32 0
      %2 = load %Foo_vtable_type** %1
      %3 = getelementptr %Foo_vtable_type* %2, i32 0, i32 0
      %4 = load i32(%Foo*)** %3
      %5 = call i32 %4(%Foo* %foo)
      ret i32 %5
   }

Please notice that some C++ compilers store _vtable at a negative offset into the structure so that things like memset(this, 0, sizeof(*this)) work, even though such commands should always be avoided in an OOP context.
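
To see that a vtable really is just a structure of function pointers, the mechanism can be spelled out by hand in C++ without using the virtual keyword. The sketch below assumes the vtable pointer is stored as the first field; the names mirror the LLVM IR in this section, and virtual_call_demo is a hypothetical helper.

```cpp
#include <cassert>

struct Foo;  // forward declaration: the vtable type refers to Foo

// One function pointer per virtual method.
struct Foo_vtable_type {
    int (*GetLengthTimesTwo)(const Foo* self);
};

struct Foo {
    const Foo_vtable_type* vtable;  // field 0: vtable pointer
    int length;                     // field 1: _length
};

int Foo_GetLengthTimesTwo(const Foo* self) { return self->length * 2; }

// A single table shared by all instances of the class.
const Foo_vtable_type Foo_vtable_data = { &Foo_GetLengthTimesTwo };

// The constructor installs the vtable pointer before touching the members.
void Foo_Create_Default(Foo* self) {
    self->vtable = &Foo_vtable_data;
    self->length = 0;
}

void Foo_SetLength(Foo* self, int value) { self->length = value; }

// Hypothetical helper: a virtual call is "load vtable, load slot, call".
int virtual_call_demo() {
    Foo foo;
    Foo_Create_Default(&foo);
    Foo_SetLength(&foo, 4);
    return foo.vtable->GetLengthTimesTwo(&foo);
}
```

A derived class would point its instances at a different table holding its own overrides, which is why the call site never needs to know the dynamic type.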

Single Inheritance

Single inheritance is very straightforward: the "structures" (classes) are laid out in memory one after another, base class first, in declaration order.

 
   class Base
   {
   public:
      void SetA(int value)
      {
         _a = value;
      }

   private:
      int _a;
   };

   class Derived: public Base
   {
   public:
      void SetB(int value)
      {
         SetA(value);
         _b = value;
      }

   protected:
      int _b;
   };

Here, _a and _b will be laid out to follow one another in memory, so that inheriting from a class is simply a matter of declaring the base class as the first member of the inheriting class:

 
   %Base = type {
      i32         ; '_a' in class Base
   }

   define void @Base_SetA(%Base* %this, i32 %value) nounwind {
      %1 = getelementptr %Base* %this, i32 0, i32 0
      store i32 %value, i32* %1
      ret void
   }

   %Derived = type {
      i32,        ; '_a' from class Base
      i32         ; '_b' from class Derived
   }

   define void @Derived_SetB(%Derived* %this, i32 %value) nounwind {
      %1 = bitcast %Derived* %this to %Base*
      call void @Base_SetA(%Base* %1, i32 %value)
      %2 = getelementptr %Derived* %this, i32 0, i32 1
      store i32 %value, i32* %2
      ret void
   }

The members of the base class thus simply become the leading members of the type declaration for the derived class.

The compiler must then insert the appropriate type casts whenever the derived class is referenced as its base class, as shown above with the bitcast operator.
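
The reason a plain pointer cast is enough can be checked in C++ by modelling the base class as the first member of the derived structure. This is a minimal sketch assuming a nested-struct layout, which occupies memory the same way as the flattened { i32, i32 } type above; single_inheritance_demo is a hypothetical helper.

```cpp
#include <cassert>
#include <cstddef>

struct Base {
    int a;  // '_a' in class Base
};

void Base_SetA(Base* self, int value) { self->a = value; }

struct Derived {
    Base base;  // the base class sub-object comes first
    int b;      // '_b' from class Derived
};

// The upcast is free only because the base sub-object sits at offset zero.
static_assert(offsetof(Derived, base) == 0, "base sub-object must be at offset 0");

void Derived_SetB(Derived* self, int value) {
    Base_SetA(&self->base, value);  // the "bitcast": same address, other type
    self->b = value;
}

// Hypothetical helper exercising the layout.
int single_inheritance_demo() {
    Derived d;
    Derived_SetB(&d, 5);
    // A Derived pointer and a pointer to its Base part hold the same address.
    assert(static_cast<void*>(&d) == static_cast<void*>(&d.base));
    return d.base.a + d.b;
}
```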

Multiple Inheritance

Multiple inheritance is not that difficult, either; it is merely a question of laying out the multiply inherited "structures" in order inside each derived class.

 
   class BaseA
   {
   public:
      void SetA(int value)
      {
         _a = value;
      }

   private:
      int _a;
   };

   class BaseB: public BaseA
   {
   public:
      void SetB(int value)
      {
         SetA(value);
         _b = value;
      }

   private:
      int _b;
   };

   class Derived:
      public BaseA,
      public BaseB
   {
   public:
      void SetC(int value)
      {
         SetB(value);
         _c = value;
      }

   private:
      int _c;
   };

This is equivalent to the following LLVM IR:

 
   %BaseA = type {
      i32         ; '_a' from BaseA
   }

   define void @BaseA_SetA(%BaseA* %this, i32 %value) nounwind {
      %1 = getelementptr %BaseA* %this, i32 0, i32 0
      store i32 %value, i32* %1
      ret void
   }

   %BaseB = type {
      i32,        ; '_a' from BaseA
      i32         ; '_b' from BaseB
   }

   define void @BaseB_SetB(%BaseB* %this, i32 %value) nounwind {
      %1 = bitcast %BaseB* %this to %BaseA*
      call void @BaseA_SetA(%BaseA* %1, i32 %value)
      %2 = getelementptr %BaseB* %this, i32 0, i32 1
      store i32 %value, i32* %2
      ret void
   }

   %Derived = type {
      i32,        ; '_a' from BaseA
      i32,        ; '_b' from BaseB
      i32         ; '_c' from Derived
   }

   define void @Derived_SetC(%Derived* %this, i32 %value) nounwind {
      %1 = bitcast %Derived* %this to %BaseB*
      call void @BaseB_SetB(%BaseB* %1, i32 %value)
      %2 = getelementptr %Derived* %this, i32 0, i32 2
      store i32 %value, i32* %2
      ret void
   }

The compiler then supplies the needed type casts and pointer arithmetic whenever a Derived object is referenced as an instance of BaseB. Please notice that all it takes is a bitcast from one class to another as well as an adjustment of the last argument to getelementptr.
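
The pointer arithmetic becomes necessary as soon as a base class does not start at offset zero. The sketch below is an illustration under a different assumption than the example above: BaseA and BaseB are hypothetical, mutually independent structures, so the BaseB part of Derived begins after the BaseA part. Derived_to_BaseB and multiple_inheritance_demo are names invented for this sketch.

```cpp
#include <cassert>
#include <cstddef>

struct BaseA { int a; };
struct BaseB { int b; };

struct Derived {
    BaseA base_a;  // first base: offset 0
    BaseB base_b;  // second base: offset sizeof(BaseA)
    int c;
};

void BaseB_SetB(BaseB* self, int value) { self->b = value; }

// Upcast to the second base: add the sub-object offset to the object address.
BaseB* Derived_to_BaseB(Derived* self) {
    char* raw = reinterpret_cast<char*>(self);
    return reinterpret_cast<BaseB*>(raw + offsetof(Derived, base_b));
}

// Hypothetical helper exercising the adjusted pointer.
int multiple_inheritance_demo() {
    Derived d = {{1}, {2}, 3};
    BaseB_SetB(Derived_to_BaseB(&d), 40);
    assert(Derived_to_BaseB(&d) == &d.base_b);
    return d.base_a.a + d.base_b.b + d.c;  // 1 + 40 + 3
}
```

In LLVM IR the same adjustment would typically be a getelementptr selecting the second field, followed by a call that passes the resulting pointer as this.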

Virtual Inheritance

Virtual inheritance is actually quite simple, as it dictates that identical base classes are to be merged into a single occurrence. For instance, given this:

 
   class BaseA
   {
   public:
      int a;
   };

   // BaseB and BaseC must inherit virtually from BaseA for the two
   // BaseA sub-objects to be merged in Derived.
   class BaseB: public virtual BaseA
   {
   public:
      int b;
   };

   class BaseC: public virtual BaseA
   {
   public:
      int c;
   };

   class Derived:
      public BaseB,
      public BaseC
   {
      int d;
   };

Derived will only contain a single instance of BaseA even if its inheritance graph dictates that it should have two instances. The result looks something like this:

 
   class Derived
   {
   public:
      int a;
      int b;
      int c;
      int d;
   };

The second instance of a is thus merged with the first, because otherwise Derived would contain multiple instances of BaseA, which would clearly cause confusion and ambiguity.
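
The merging can be observed directly in C++. The following sketch assumes that both intermediate classes inherit virtually from BaseA, which is what makes the compiler share a single BaseA sub-object; virtual_inheritance_demo is a hypothetical helper.

```cpp
#include <cassert>

struct BaseA { int a = 0; };
struct BaseB : virtual BaseA { int b = 0; };
struct BaseC : virtual BaseA { int c = 0; };
struct Derived : BaseB, BaseC { int d = 0; };

// Hypothetical helper: writes 'a' through the BaseB view of the object and
// reads it back through the BaseC view.
int virtual_inheritance_demo() {
    Derived obj;
    BaseB& as_b = obj;
    BaseC& as_c = obj;
    as_b.a = 7;
    // Both views refer to the very same BaseA::a member.
    assert(&as_b.a == &as_c.a);
    return as_c.a;
}
```

Without the virtual keyword on the two intermediate classes, the two addresses would differ and a plain obj.a would be rejected as ambiguous.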

Interfaces

An interface is nothing more than a base class with no data members, where all the methods are pure virtual (i.e. have no body).

As such, we've already described how to convert an interface to LLVM IR - it is done precisely the same way that you convert a virtual member function to LLVM IR.
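
As a small illustration, here is what such an interface looks like in C++. Shape, Square and area_through_interface are hypothetical names chosen for this sketch; they do not appear elsewhere in this document.

```cpp
#include <cassert>

// An interface: no data members, only pure virtual methods (plus a virtual
// destructor so that implementations can be destroyed through the interface).
struct Shape {
    virtual ~Shape() {}
    virtual int Area() const = 0;
};

// An implementation supplies the data and the method bodies.
struct Square : Shape {
    explicit Square(int side) : side_(side) {}
    int Area() const override { return side_ * side_; }

private:
    int side_;
};

// Callers only see the interface; the call is dispatched through the vtable.
int area_through_interface(const Shape& shape) { return shape.Area(); }
```

When lowered, Shape contributes nothing but a vtable type, and Square's vtable instance fills the Area slot with a pointer to its own implementation.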

Boxing and Unboxing

Boxing is the process of converting a non-object primitive value into an object. It is as easy as it sounds. You create a wrapper class which you instantiate and initialize with the non-object value.

Unboxing is the reverse of boxing: You downgrade a full object to a mere scalar value by retrieving the boxed value from the box object.

It is important to notice that changes to the boxed value do not affect the original value and vice versa. The code below illustrates both steps:

 
   @Boxee = global i32 17

   %Integer = type { i32 }

   define void @Integer_Create(%Integer* %this, i32 %value) nounwind {
      ; you might set up a vtable and associated virtual methods here
      %1 = getelementptr %Integer* %this, i32 0, i32 0
      store i32 %value, i32* %1
      ret void
   }

   define i32 @Integer_GetValue(%Integer* %this) nounwind {
      %1 = getelementptr %Integer* %this, i32 0, i32 0
      %2 = load i32* %1
      ret i32 %2
   }

   define i32 @main() nounwind {
      ; box @Boxee in an instance of %Integer
      %1 = load i32* @Boxee
      %2 = alloca %Integer
      call void @Integer_Create(%Integer* %2, i32 %1)

      ; unbox @Boxee from an instance of %Integer
      %3 = call i32 @Integer_GetValue(%Integer* %2)

      ret i32 0
   }
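
The independence of the box and the original value is easy to demonstrate in C++. This sketch mirrors the names used in the IR above; boxing_demo is a hypothetical helper.

```cpp
#include <cassert>

// The box: a wrapper object holding a copy of the primitive value.
struct Integer {
    int value;
};

void Integer_Create(Integer* self, int value) { self->value = value; }

int Integer_GetValue(const Integer* self) { return self->value; }

// Hypothetical helper showing that boxing copies the value.
int boxing_demo() {
    int boxee = 17;
    Integer box;
    Integer_Create(&box, boxee);           // boxing copies the value
    boxee = 99;                            // changing the original...
    assert(Integer_GetValue(&box) == 17);  // ...leaves the box untouched
    return Integer_GetValue(&box);         // unboxing
}
```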

Class Equivalence Test

There are two ways of doing this:

If you can guarantee that each class has a unique vtable, you can simply compare the pointers to the vtable.
If you cannot guarantee that each class has a unique vtable (because different vtables may have been merged by the linker), you need to add a unique field to the vtable so that you can compare that instead.

The first variant goes roughly as follows (assuming identical strings aren't merged by the compiler, something that they are most of the time):

 
   bool equal = (typeid(first) == typeid(other));
  

todo: Finish up class equivalence test sample.

As far as I know, RTTI is simply done by adding two fields to the _vtable structure: parent and signature. The former is a pointer to the vtable of the parent class and the latter is the mangled (encoded) name of the class. To see if a given class is another class, you simply compare the signature fields. To see if a given class is a derived class of some other class, you simply walk the chain of parent fields, while checking if you have found a matching signature.
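
The scheme just described can be sketched in C++ as follows. The field names parent and signature follow the paragraph above, while VTable, SameClass and IsA are names assumed for this sketch. It compares signatures with strcmp rather than by pointer, which is a deliberate assumption so that the test does not depend on identical strings being merged.

```cpp
#include <cassert>
#include <cstring>

// The two RTTI fields at the start of every vtable.
struct VTable {
    const VTable* parent;   // vtable of the parent class, null for the root
    const char* signature;  // (mangled) name of the class
};

const VTable Object_vtable = { nullptr, "Object" };
const VTable Exception_vtable = { &Object_vtable, "Exception" };

struct Object {
    const VTable* vtable;  // always non-null
};

// Class equivalence test: compare the signature fields.
bool SameClass(const Object& x, const Object& y) {
    return std::strcmp(x.vtable->signature, y.vtable->signature) == 0;
}

// Class inheritance test: walk the chain of parent fields.
bool IsA(const Object* object, const char* name) {
    if (object == nullptr) return false;
    for (const VTable* vt = object->vtable; vt != nullptr; vt = vt->parent) {
        if (std::strcmp(vt->signature, name) == 0) return true;
    }
    return false;
}
```

The walk costs one comparison per ancestor, so its running time grows with the depth of the class hierarchy.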

Class Inheritance Test

A class inheritance test is a question of the form:

Is class X identical to or derived from class Y?

To answer that question, we can use one of two methods:

The naive implementation where we search upwards in the chain of parents.
The faster implementation where we search a preallocated list of parents.

The naive implementation is documented in the first two exception handling examples as the Object_IsA function.

todo: Document the faster class inheritance test implementation.

The New Operator

The new operator is generally nothing more than a type-safe version of the C malloc function - in some implementations of C++, they may even be called interchangeably without causing unseen or unwanted side-effects.

The Instance New Operator

All calls of the form new X are mapped into:

 
   declare i8* @malloc(i32) nounwind

   %X = type { i8 }

   define void @X_Create_Default(%X* %this) nounwind {
      %1 = getelementptr %X* %this, i32 0, i32 0
      store i8 0, i8* %1
      ret void
   }

   define void @main() nounwind {
      %1 = call i8* @malloc(i32 1)
      %2 = bitcast i8* %1 to %X*
      call void @X_Create_Default(%X* %2)
      ret void
   }

Calls of the form new X(Y, Z) are the same, except Y and Z are passed into the constructor as arguments.
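
The two steps, allocation followed by construction, can be written explicitly in C++ with malloc and placement new. This is a sketch under the assumption that plain malloc is the allocator, as in the IR above; new_X and delete_X are hypothetical helper names.

```cpp
#include <cassert>
#include <cstdlib>
#include <new>

struct X {
    unsigned char value;
    X() : value(0) {}  // plays the role of @X_Create_Default
};

// "new X" spelled out: allocate raw memory, then run the constructor on it.
X* new_X() {
    void* raw = std::malloc(sizeof(X));  // call i8* @malloc(i32 1)
    if (raw == nullptr) return nullptr;
    return new (raw) X();                // bitcast + constructor call
}

// The matching "delete": run the destructor, then release the memory.
void delete_X(X* x) {
    if (x == nullptr) return;
    x->~X();
    std::free(x);
}
```

Unlike this sketch, a standard new expression reports allocation failure by throwing an exception rather than by returning a null pointer.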

The Array New Operator

New operations involving arrays are equally simple. The code new X[100] is mapped into a loop that initializes each array element in turn:

 
   declare i8* @malloc(i32) nounwind

   %X = type { i32 }

   define void @X_Create_Default(%X* %this) nounwind {
      %1 = getelementptr %X* %this, i32 0, i32 0
      store i32 0, i32* %1
      ret void
   }

   define void @main() nounwind {
      %n = alloca i32                  ; %n = ptr to the number of elements in the array
      store i32 100, i32* %n

      %i = alloca i32                  ; %i = ptr to the loop index into the array
      store i32 0, i32* %i

      %1 = load i32* %n                ; %1 = *%n
      %2 = mul i32 %1, 4               ; %2 = %1 * sizeof(X)
      %3 = call i8* @malloc(i32 %2)    ; %3 = malloc(100 * sizeof(X))
      %4 = bitcast i8* %3 to %X*       ; %4 = (X*) %3
      br label %.loop_head

   .loop_head:                         ; for (; %i < %n; %i++)
      %5 = load i32* %i
      %6 = load i32* %n
      %7 = icmp slt i32 %5, %6
   br 
     
   i1 
     
   %7 
   , 
     
   label 
     
   %.loop_body 
   , 
     
   label 
     
   %.loop_tail 
   
   .loop_body 
   : 
   
   %8 
     
   = 
     
   getelementptr 
     
   %X 
   * 
     
   %4 
   , 
     
   i32 
     
   %5 
   
   call 
     
   void 
     
   @X_Create_Default 
   ( 
   %X 
   * 
     
   %8 
   ) 
   
   %9 
     
   = 
     
   add 
     
   i32 
     
   %5 
   , 
     
   1 
   
   store 
     
   i32 
     
   %9 
     
   i32 
   * 
     
   %i 
   
   br 
     
   label 
     
   %.loop_head 
   
   .loop_tail 
   : 
   
   ret 
     
   void 
   
   }
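
For comparison, here is a minimal C++ sketch of the same allocate-then-construct loop, again using plain functions rather than the new operator. The helper name new_X_array is hypothetical, and the null and negative-count checks are additions of mine that the IR listing does not perform.

```cpp
#include <cstdint>
#include <cstdlib>

struct X { std::int32_t value; };   // mirrors %X = type { i32 }

// Mirrors @X_Create_Default.
void X_Create_Default(X* self) { self->value = 0; }

// new X[n]: allocate n * sizeof(X) bytes, then construct each element in turn.
X* new_X_array(std::int32_t n) {
    if (n < 0) return nullptr;                        // assumption: reject negative counts
    std::size_t bytes = static_cast<std::size_t>(n) * sizeof(X);
    X* items = static_cast<X*>(std::malloc(bytes));
    if (items == nullptr) return nullptr;             // allocation failed (or n == 0 on some platforms)
    for (std::int32_t i = 0; i < n; ++i)              // same loop as .loop_head / .loop_body
        X_Create_Default(&items[i]);
    return items;
}
```

Calling new_X_array(100) corresponds to new X[100]; the caller releases the storage with free.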

Interoperating with a Runtime Library

It is common to provide a set of run-time support functions written in a language other than LLVM IR, and it is trivially easy to interface to such a run-time library. The uses of malloc and free in the examples in this document are instances of calling externally defined run-time functions.

The advantage of a custom, non-IR run-time library function is that it can be optimized by hand to provide the best possible performance under certain criteria. A custom non-IR run-time library function can also make explicit use of native instructions that are foreign to the LLVM infrastructure.

The advantage of IR run-time library functions is that they can be run through the optimizer and thereby also be inlined automatically.
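
As a small sketch of the non-IR side, here is a run-time support function written in C++ and exported with C linkage so that generated IR can refer to it by its plain name. The function rt_add_saturating is hypothetical and exists only for this example; the IR declaration shown in the comment is what a front end would have to emit to call it.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical run-time helper: 32-bit addition that clamps instead of wrapping.
// extern "C" disables C++ name mangling, so the IR side can declare it as:
//
//     declare i32 @rt_add_saturating(i32, i32)
//
extern "C" std::int32_t rt_add_saturating(std::int32_t a, std::int32_t b) {
    // Do the addition in 64 bits so the intermediate result cannot overflow.
    std::int64_t wide = static_cast<std::int64_t>(a) + static_cast<std::int64_t>(b);
    if (wide > std::numeric_limits<std::int32_t>::max())
        return std::numeric_limits<std::int32_t>::max();
    if (wide < std::numeric_limits<std::int32_t>::min())
        return std::numeric_limits<std::int32_t>::min();
    return static_cast<std::int32_t>(wide);
}
```

From the IR point of view this function is no different from malloc or free: it is declared, called, and resolved by the linker.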

Interfacing to the Operating System

I'll divide this chapter into two sections:

How to Interface to POSIX Operating Systems.
How to Interface to the Windows Operating System.

How to Interface to POSIX Operating Systems

On POSIX, the C run-time library is always present, so it makes a lot of sense to call its functions directly.

Sample POSIX "Hello World" Application

On POSIX, it is really very easy to create the Hello world program:

    declare i32 @puts(i8* nocapture) nounwind

    @.hello = private unnamed_addr constant [13 x i8] c"hello world\0A\00"

    define i32 @main(i32 %argc, i8** %argv) {
        %1 = getelementptr [13 x i8]* @.hello, i32 0, i32 0
        call i32 @puts(i8* %1)
        ret i32 0
    }

How to Interface to the Windows Operating System

On Windows, the C run-time library is mostly considered relevant only to the C and C++ languages. Instead, there is a plethora (thousands) of standard system interfaces that any client application may use.

Sample Windows "Hello World" Application

Hello world on Windows is nowhere near as straightforward as on POSIX:

    target datalayout = "e-p:32:32:32-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-f80:128:128-v64:64:64-v128:128:128-a0:0:64-f80:32:32-n8:16:32-S32"
    target triple = "i686-pc-win32"

    %struct._OVERLAPPED = type { i32, i32, %union.anon, i8* }
    %union.anon = type { %struct.anon }
    %struct.anon = type { i32, i32 }

    declare dllimport x86_stdcallcc i8* @"\01_GetStdHandle@4"(i32) #1

    declare dllimport x86_stdcallcc i32 @"\01_WriteFile@20"(i8*, i8*, i32, i32*, %struct._OVERLAPPED*) #1

    @hello = internal constant [13 x i8] c"Hello world\0A\00"

    define i32 @main(i32 %argc, i8** %argv) nounwind {
        ; The calls carry x86_stdcallcc so that they match the declarations.
        %1 = call x86_stdcallcc i8* @"\01_GetStdHandle@4"(i32 -11)    ; -11 = STD_OUTPUT_HANDLE
        %2 = getelementptr [13 x i8]* @hello, i32 0, i32 0
        %3 = call x86_stdcallcc i32 @"\01_WriteFile@20"(i8* %1, i8* %2, i32 12, i32* null, %struct._OVERLAPPED* null)
        ; todo: Check that %1 is not equal to -1 (INVALID_HANDLE_VALUE)
        ret i32 0
    }

    attributes #1 = { "less-precise-fpmad"="false" "no-frame-pointer-elim"="true" "no-frame-pointer-elim-non-leaf"
        "no-infs-fp-math"="false" "no-nans-fp-math"="false" "stack-protector-buffer-size"="8" "unsafe-fp-math"="false"
        "use-soft-float"="false"
    }

Resources

This chapter lists some resources that may be of interest to the reader:

Modern Compiler Implementation in Java, 2nd Edition.
Alex Darby's series of articles on low-level stuff.

Epilogue

If you discover any errors in this document or you need more information than given here, please write to the author, Mikael Lyngvig.

Please also remember that you can learn a lot by using the -emit-llvm option to the clang++ compiler. This gives you a chance to see a live production compiler in action and precisely how it does things.
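
As a small illustration, something like the following can be fed to clang++ to inspect the IR it produces. The file name square.cpp and the function square are made up for this example, and the exact output depends on the clang version and the target, so none is shown here.

```cpp
// square.cpp (hypothetical file name)
//
// To see the IR that clang++ generates for this file, run:
//
//     clang++ -S -emit-llvm square.cpp -o square.ll
//
// -S asks for textual output and -emit-llvm selects LLVM IR instead of
// native assembly, so square.ll ends up as a human-readable IR file.

// extern "C" keeps the symbol name unmangled, which makes the function
// easy to find as @square in the generated .ll file.
extern "C" int square(int x) {
    return x * x;
}
```

Comparing the output with and without optimization flags such as -O2 is a quick way to see which constructs the optimizer removes or rewrites.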

Appendix A: How to Implement a String Type in LLVM

There are two ways to implement a string type in LLVM:

To write the implementation in LLVM IR.
To write the implementation in a higher-level language that generates IR.

I'd personally much prefer to use the second method, but for the sake of the example, I'll go ahead and illustrate a simple but useful string type in LLVM IR. It assumes a 32-bit architecture, so please replace all occurrences of i32 with i64 if you are targeting a 64-bit architecture.

We'll be making a dynamic, mutable string type that can be appended to and could also be inserted into, converted to lower case, and so on, depending on which support functions are defined to operate on the string type.

It all boils down to making a suitable type definition for the class and then defining a rich set of functions to operate on the type definition:

    ; The actual type definition for our 'String' type.
    %String = type {
        i8*,     ; 0: buffer; pointer to the character buffer
        i32,     ; 1: length; the number of chars in the buffer
        i32,     ; 2: maxlen; the maximum number of chars in the buffer
        i32      ; 3: factor; the number of chars to preallocate when growing
    }

    define fastcc void @String_Create_Default(%String* %this) nounwind {
        ; Initialize 'buffer'.
        %1 = getelementptr %String* %this, i32 0, i32 0
        store i8* null, i8** %1

        ; Initialize 'length'.
        %2 = getelementptr %String* %this, i32 0, i32 1
        store i32 0, i32* %2

        ; Initialize 'maxlen'.
        %3 = getelementptr %String* %this, i32 0, i32 2
        store i32 0, i32* %3

        ; Initialize 'factor'.
        %4 = getelementptr %String* %this, i32 0, i32 3
        store i32 16, i32* %4

        ret void
    }

    declare i8* @malloc(i32)
    declare void @free(i8*)
    declare i8* @memcpy(i8*, i8*, i32)

    define fastcc void @String_Delete(%String* %this) nounwind {
        ; Check if we need to call 'free'.
        %1 = getelementptr %String* %this, i32 0, i32 0
        %2 = load i8** %1
        %3 = icmp ne i8* %2, null
        br i1 %3, label %free_begin, label %free_close

    free_begin:
        call void @free(i8* %2)
        br label %free_close

    free_close:
        ret void
    }

    define fastcc void @String_Resize(%String* %this, i32 %value) {
        ; %output = malloc(%value)
        %output = call i8* @malloc(i32 %value)

        ; todo: check return value

        ; %buffer = this->buffer
        %1 = getelementptr %String* %this, i32 0, i32 0
        %buffer = load i8** %1

        ; %length = this->length
        %2 = getelementptr %String* %this, i32 0, i32 1
        %length = load i32* %2

        ; memcpy(%output, %buffer, %length)
        %3 = call i8* @memcpy(i8* %output, i8* %buffer, i32 %length)

        ; free(%buffer)
        call void @free(i8* %buffer)

        ; this->buffer = %output
        store i8* %output, i8** %1

        ; this->maxlen = %value (needed so that the growth check in String_Add_Char keeps working)
        %4 = getelementptr %String* %this, i32 0, i32 2
        store i32 %value, i32* %4

        ret void
    }

    define fastcc void @String_Add_Char(%String* %this, i8 %value) {
        ; Check if we need to grow the string.
        %1 = getelementptr %String* %this, i32 0, i32 1
        %length = load i32* %1
        %2 = getelementptr %String* %this, i32 0, i32 2
        %maxlen = load i32* %2

        ; if length == maxlen:
        %3 = icmp eq i32 %length, %maxlen
        br i1 %3, label %grow_begin, label %grow_close

    grow_begin:
        %4 = getelementptr %String* %this, i32 0, i32 3
        %factor = load i32* %4
        %5 = add i32 %maxlen, %factor
        ; The call uses fastcc to match the definition of String_Resize.
        call fastcc void @String_Resize(%String* %this, i32 %5)
        br label %grow_close

    grow_close:
        %6 = getelementptr %String* %this, i32 0, i32 0
        %buffer = load i8** %6
        %7 = getelementptr i8* %buffer, i32 %length
        store i8 %value, i8* %7
        %8 = add i32 %length, 1
        store i32 %8, i32* %1

        ret void
    }
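
To make the intent of the IR easier to follow, here is a minimal C++ sketch of the same string type with the same four support functions. It is my own restatement, not the author's code: the abort on allocation failure is an assumption (the IR listing leaves that check as a todo), and the sketch records the new capacity in maxlen after resizing, since the length == maxlen growth check relies on it.

```cpp
#include <cstdint>
#include <cstdlib>
#include <cstring>

struct String {
    char*         buffer;   // 0: pointer to the character buffer
    std::uint32_t length;   // 1: the number of chars in the buffer
    std::uint32_t maxlen;   // 2: the maximum number of chars in the buffer
    std::uint32_t factor;   // 3: the number of chars to preallocate when growing
};

void String_Create_Default(String* self) {
    self->buffer = nullptr;
    self->length = 0;
    self->maxlen = 0;
    self->factor = 16;
}

void String_Delete(String* self) {
    if (self->buffer != nullptr)
        std::free(self->buffer);
}

void String_Resize(String* self, std::uint32_t value) {
    char* output = static_cast<char*>(std::malloc(value));
    if (output == nullptr) std::abort();            // assumption: give up on allocation failure
    if (self->length > 0)                           // avoid memcpy from a null buffer
        std::memcpy(output, self->buffer, self->length);
    std::free(self->buffer);                        // free(nullptr) is a no-op
    self->buffer = output;
    self->maxlen = value;                           // remember the new capacity
}

void String_Add_Char(String* self, char value) {
    if (self->length == self->maxlen)               // buffer is full: grow by 'factor'
        String_Resize(self, self->maxlen + self->factor);
    self->buffer[self->length] = value;
    self->length += 1;
}
```

Growing by a fixed factor keeps the example short; a production string type would more likely grow geometrically to keep appends amortized constant time.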

Appendix B: Task List

This chapter serves as an informal task list and is to be updated as new to-do items are completed:

How to enable debug information? (Line and Function, Variable)
How to interface with a garbage collector? (link to existing docs)
How to express a custom calling convention? (link to existing docs)
Representing constructors, destructors, finalization
How to examine the stack at runtime? How to modify it? (i.e. reflection, interjection)
Representing subtyping checks (with full alias info), TBAA, struct-path TBAA.
How to exploit inlining (external, vs within LLVM)?
How to express array bounds checks for best optimization?
How to express null pointer checks?
How to express domain specific optimizations? (i.e. lock elision, or matrix math simplification) (link to existing docs)
How to optimize call dispatch or field access in dynamic languages? (ref new patchpoint intrinsics for inline call caching and field access caching)

todo: Ask various front-end implementors (Rust, Haskell (GHC), Rubinius, and more) to review and/or contribute so as to make the document great.
