LiSteven

PLT redirection through shared object injection...

Download source - 6.02 KB

1. Introduction
2. Prerequisites
3. Brief introduction to the ELF format

3.1 Historical notes
3.2 ELF structure

3.2.1 The ELF header
3.2.2 Program header and segments
3.2.3 The section header
3.2.4 The string table
3.2.5 Let's write some code...
3.2.6 The symbol table
3.2.7 Relocation table
3.2.8 Dynamic section
3.2.9 Hash tables
3.2.10 Let's write more code...

4. ELF loading

4.1 Program headers
4.2 Dynamic section
4.3 The PLT

5. Building up our lab...
6. PLT: a practical example
7. Conclusions
8. References

Introduction

In the last few weeks, I've found myself playing around with Linux for various reasons, so the first thing that I thought was to apply some of the well known Win32 methods to this Operating System, most of them from the point of view of a reverser. This is the first part of a two-part article which will deal with code injection under Linux. The technique presented here resembles a well known Win32 technique: inject a DLL (shared object in Linux and UNIX-like systems) into a running process, thus being able to hook some of the process' imported functions. There are two ways which may lead us to code injection:

Using the LD_PRELOAD method (this requires restarting the process into which we are injecting our shared object).
Injecting a stub into the target process which loads the required library.

Of course, as you may have guessed, the second way requires the presence of "libdl" in the address space of the target process, but that's a reasonable requirement: most modern software for Linux and UNIX is quite complex, and most of it will have libdl mapped into its address space. We will use the second method since our goal is to inject a shared object into a running process. Regarding the hook technique, we will use PLT redirection, which is a well known technique under Linux/UNIX to hook functions imported from other libraries in a simple and elegant way.

This first part teaches the basics of the ELF format (the standard object format in most Linux and UNIX-like Operating Systems) with some code examples. It serves as an introduction to the next part, which will show the actual technique; it is nevertheless required reading, since it builds up the necessary concepts.

Prerequisites

Before moving further down the article, it's best that you take a look at the ELF manuals. They are a required reading to understand most of what I'm writing. I'll give you just a brief introduction to the ELF format, but there is so much to write about it that we might forget our goal with this article (which is not about the ELF format, but is about shared object injection). In various parts of the text, I will recall the manuals, so let's see "how" you should read them. You have to download two manuals, the first is the general one, the second is the x86-specific one. If you take a look at the general one, you will see that it is missing some parts, which you will find in the specific one (there is an additional ELF manual for each platform which supports the ELF format). The missing parts start with a marker, so you will notice them.

You should have a good knowledge of the C programming language and you should know how to use GCC and GDB. A minimal knowledge of the x86 assembly language is required. This is the reference system:

xubuntu 8.10 with kernel 2.6.27-generic
GCC 4.3.2
GNU GDB 6.8-debian
GNU readelf (GNU Binutils for Ubuntu) 2.18.93.20081009
GNU objdump (GNU Binutils for Ubuntu) 2.18.93.20081009

Brief introduction to the ELF format

In this chapter, the principal aspects of the ELF format will be covered from a programmer's point of view. This chapter will not be a raw copy of the ELF manuals, but it will just introduce you to some aspects of the format, so you are required to read the manuals. This chapter will deal with the various structures of the ELF format and how to read them. Dynamic linking will be explained in the next chapter.

Historical notes

The ELF format (which is an acronym for Executable and Linking Format) is one of the many formats that describe the structure of an object file (a file which contains compiled code) from a logical and/or physical point of view. Object file formats similar to ELF include: the PE format (Portable Executable, used mainly on Microsoft platforms), the Mach-O (used on OSX), the COFF, and the a.out. From a historical point of view, the COFF and the a.out were very influential for the development of the newer formats such as the PE (which derives from COFF) and the ELF (which takes some of its aspects from the a.out format). As you may guess from the name, the ELF format allows both executables and libraries to be described.

The ELF format was introduced to replace the a.out and COFF formats on UNIX systems, becoming a standard as of 1999. Today, most non-Microsoft platforms use the ELF format, since it adapts well to most platforms. Moreover, the "ld" linker can be instructed (through the use of custom ld-scripts) to build custom ELF files which can meet almost every requirement you may have. Examples of non-UNIX platforms using the ELF format include PSP, PS2, PS3, Wii, Dreamcast, BeOS, Haiku, AmigaOS, MorphOS, and SymbianOS (which actually uses a format derived from ELF). This flexibility comes from the fact that most of the ELF main structures are not bound to a specific platform (e.g., the format (fields and their size) of the relocation structure is independent of the platform used, but its contents are highly platform-dependent).

ELF structure

The ELF structure is similar to other image (which is a synonym for object file) formats: it has a header, followed by sections which describe the contents of various segments of memory. In the following paragraphs, we will make use of the "readelf" utility on a test executable, namely "/bin/ls" (which you should have in your system...).

The ELF header

When working with a file format, the first thing that comes to mind is its header, which is the most important part of a file. Regarding the ELF format, the header contains the architecture for which the file was built, its endianness (the byte order), which in most cases is bound to the architecture (there are some exceptions: modern ARMs and PPCs can be set either to big-endian or to little-endian), the number of sections contained in the file, the number of segments (a single segment contains one or more sections), etc. Using the C structure syntax (in the same way as the manual), let's take a look at the header:

#define EI_NIDENT (16)

typedef struct
{
  unsigned char e_ident[EI_NIDENT]; /* Magic number and other info */
  Elf32_Half    e_type;             /* Object file type */
  Elf32_Half    e_machine;          /* Architecture */
  Elf32_Word    e_version;          /* Object file version */
  Elf32_Addr    e_entry;            /* Entry point virtual address */
  Elf32_Off     e_phoff;            /* Program header table file offset */
  Elf32_Off     e_shoff;            /* Section header table file offset */
  Elf32_Word    e_flags;            /* Processor-specific flags */
  Elf32_Half    e_ehsize;           /* ELF header size in bytes */
  Elf32_Half    e_phentsize;        /* Program header table entry size */
  Elf32_Half    e_phnum;            /* Program header table entry count */
  Elf32_Half    e_shentsize;        /* Section header table entry size */
  Elf32_Half    e_shnum;            /* Section header table entry count */
  Elf32_Half    e_shstrndx;         /* Section header string table index */
} Elf32_Ehdr;

where these are the important fields:

e_ident: Allows the identification of the file. It is an array of 16 bytes, and it is the only part of the file which does not depend on byte ordering. The first four bytes are the ELF signature, and they hold the following values: {0x7F, 'E', 'L', 'F'}, indexed by the constants EI_MAG0, EI_MAG1, EI_MAG2, EI_MAG3. The next bytes identify the class of the ELF object (i.e., whether we're dealing with a 32 bit or a 64 bit ELF), the byte ordering, and other parameters which we don't care about. The header file, elf.h, defines two constants which index the class byte and the endianness byte; they are EI_CLASS and EI_DATA. Dealing with an x86 architecture will lead to e_ident[EI_CLASS] = ELFCLASS32 and e_ident[EI_DATA] = ELFDATA2LSB, as will be shown a few paragraphs below.
e_entry: This field holds the address which is the entry point of the image; that is, the first instruction to be executed by the system once the image has been completely loaded into memory. For executable files, this field will hold a virtual address (since the executables are always loaded at the same base address). For libraries (.so files, or shared objects), it will hold a relative virtual address (i.e., a virtual address minus the base address).
e_phoff: A physical offset from the beginning of the file which tells where the program header table resides. So, to read the program header table, you just have to "lseek" to this offset. A program header describes a segment, which is a collection of sections (described by the section header).
e_shoff: Same as above, but this holds the section header table offset.
e_phentsize: Size of one entry of the program header table.
e_shentsize: Size of one entry of the section header table.
e_phnum: Number of program headers.
e_shnum: Number of section headers.
e_shstrndx: An index into the section header table; the section header at this index describes the string table that must be used to get the section names (string tables will be covered a few paragraphs below).

As you may have seen, header fields are defined using custom data types, e.g., Elf32_Word, Elf32_Half, etc., which identify the size of a field unambiguously. Let's clarify this further: the size of an ELF word is always 32 bit (on both 32 bit and 64 bit architectures), so Elf32_Word is an unsigned 32 bit integer. The same goes for Elf64_Word. This is true for data words (which are always 32 bit for the ELF format), but not for addresses and file offsets (which are 32 bit on a 32 bit architecture, and 64 bit on a 64 bit one), so there are the types Elf32_Addr and Elf32_Off, which are unsigned 32 bit integers; on a 64 bit architecture there will be Elf64_Addr and Elf64_Off, which are unsigned 64 bit integers. Elf32_Half and Elf64_Half will always be half the size of an ELF word; that is, they will be unsigned 16 bit integers. Here are the typedefs:

/* Type for a 16-bit quantity.  */
typedef uint16_t Elf32_Half;
typedef uint16_t Elf64_Half;

/* Types for signed and unsigned 32-bit quantities.  */
typedef uint32_t Elf32_Word;
typedef int32_t  Elf32_Sword;
typedef uint32_t Elf64_Word;
typedef int32_t  Elf64_Sword;

/* Types for signed and unsigned 64-bit quantities.  */
typedef uint64_t Elf32_Xword;
typedef int64_t  Elf32_Sxword;
typedef uint64_t Elf64_Xword;
typedef int64_t  Elf64_Sxword;

/* Type of addresses.  */
typedef uint32_t Elf32_Addr;
typedef uint64_t Elf64_Addr;

/* Type of file offsets.  */
typedef uint32_t Elf32_Off;
typedef uint64_t Elf64_Off;

/* Type for section indices, which are 16-bit quantities.  */
typedef uint16_t Elf32_Section;
typedef uint16_t Elf64_Section;

/* Type for version symbol information.  */
typedef Elf32_Half Elf32_Versym;
typedef Elf64_Half Elf64_Versym;

Let's take a look at the output of "readelf" for our binary:

quake2@quake2-desktop:~$ readelf -h /bin/ls
ELF Header:
  Magic:   7f 45 4c 46 01 01 01 00 00 00 00 00 00 00 00 00 
  Class:                             ELF32
  Data:                              2's complement, little endian
  Version:                           1 (current)
  OS/ABI:                            UNIX - System V
  ABI Version:                       0
  Type:                              EXEC (Executable file)
  Machine:                           Intel 80386
  Version:                           0x1
  Entry point address:               0x8049b20
  Start of program headers:          52 (bytes into file)
  Start of section headers:          95096 (bytes into file)
  Flags:                             0x0
  Size of this header:               52 (bytes)
  Size of program headers:           32 (bytes)
  Number of program headers:         9
  Size of section headers:           40 (bytes)
  Number of section headers:         28
  Section header string table index: 27

From this output, it is clear that "/bin/ls" is a 32 bit executable for the x86 platform.

Program header and segments

The program header is the fundamental structure of the ELF format. A program header describes a segment, which holds one or more sections. It contains instructions for the loader on how to map a segment of the file into memory. Some program headers have a special meaning, which will be discussed later. It is important to note that an ELF image without a program header cannot be loaded into memory by the system loader (an intermediate object file, usually with a .o extension, does not need a program header, since it is not loaded into memory), so a loadable ELF image must have at least one program header of type PT_LOAD. Once an ELF image is fully loaded into memory, the program header is the only structure which contains reliable information; section headers lose their meaning (they might be present in the memory image, but you must not rely on them). Here's the structure of the program header:

typedef struct
{
  Elf32_Word    p_type;   /* Segment type */
  Elf32_Off     p_offset; /* Segment file offset */
  Elf32_Addr    p_vaddr;  /* Segment virtual address */
  Elf32_Addr    p_paddr;  /* Segment physical address */
  Elf32_Word    p_filesz; /* Segment size in file */
  Elf32_Word    p_memsz;  /* Segment size in memory */
  Elf32_Word    p_flags;  /* Segment flags */
  Elf32_Word    p_align;  /* Segment alignment */
} Elf32_Phdr;

p_type: This field holds the type of the segment. There can be more than one segment of each type (but this is not valid for all types; e.g., there can be only one segment of type PT_DYNAMIC and one of type PT_INTERP). Take a look at the ELF manual for a description of the possible types. Of course, each different type of program header holds different kinds of information.
p_offset: File offset at which the segment begins.
p_vaddr: In-memory virtual address at which the segment begins.
p_paddr: In-memory physical address at which the segment begins; this has no meaning on most modern architectures, and usually holds the same value as p_vaddr.
p_filesz: Size of the segment in the file (on-disk size); this might be different from the memory size (typically for the data segment, whose .bss portion takes no space in the file).
p_memsz: In-memory size of the segment.
p_flags: A bit-mask which holds the usual memory protection flags (read, write, execute); take a look at the manual.
p_align: Segment alignment; if this field is not equal to 0 or 1, then p_vaddr and p_offset must be congruent modulo p_align, i.e., p_vaddr % p_align = p_offset % p_align.

Let's remark the fact that, within an image file, the program header holds the important data to allow the loader to load the image; section headers are just helpers and might not be present at all in an image file. Let's take a look at the output from "readelf":

quake2@quake2-desktop:~$ readelf -l /bin/ls

Elf file type is EXEC (Executable file)
Entry point 0x8049b20
There are 9 program headers, starting at offset 52

Program Headers:
  Type           Offset   VirtAddr   PhysAddr   FileSiz MemSiz  Flg Align
  PHDR           0x000034 0x08048034 0x08048034 0x00120 0x00120 R E 0x4
  INTERP         0x000154 0x08048154 0x08048154 0x00013 0x00013 R   0x1
      [Requesting program interpreter: /lib/ld-linux.so.2]
  LOAD           0x000000 0x08048000 0x08048000 0x16eb4 0x16eb4 R E 0x1000
  LOAD           0x016ef0 0x0805fef0 0x0805fef0 0x003a0 0x0081c RW  0x1000
  DYNAMIC        0x016f04 0x0805ff04 0x0805ff04 0x000e8 0x000e8 RW  0x4
  NOTE           0x000168 0x08048168 0x08048168 0x00020 0x00020 R   0x4
  GNU_EH_FRAME   0x016dec 0x0805edec 0x0805edec 0x0002c 0x0002c R   0x4
  GNU_STACK      0x000000 0x00000000 0x00000000 0x00000 0x00000 RW  0x4
  GNU_RELRO      0x016ef0 0x0805fef0 0x0805fef0 0x00110 0x00110 R   0x1

 Section to Segment mapping:
  Segment Sections...
   00     
   01     .interp 
   02     .interp .note.ABI-tag .hash .gnu.hash .dynsym .dynstr 
          .gnu.version .gnu.version_r .rel.dyn 
          .rel.plt .init .plt .text .fini .rodata .eh_frame_hdr .eh_frame 
   03     .ctors .dtors .jcr .dynamic .got .got.plt .data .bss 
   04     .dynamic 
   05     .note.ABI-tag 
   06     .eh_frame_hdr 
   07     
   08     .ctors .dtors .jcr .dynamic .got

From this output, it is evident that a single program header describes a single segment, which may hold several sections (e.g., program headers 2, 3, 8); moreover, a section may appear in multiple segments (e.g., the .dynamic section appears in 3, 4, and 8). Notice that the PT_LOAD segments (which contain the code and data sections) have a page-sized alignment (0x1000).

The section header

The section header holds information which describes a portion of the on-disk structure of the file. A section might be as small as a few bytes (e.g., the .interp section), or it may be as big as the .text section, which holds almost all the executable code. Sections are a way to logically subdivide an ELF file into smaller portions. If you're familiar with Microsoft's PE, keep in mind that an ELF section has nothing to do with a PE section; a PE section roughly corresponds to an ELF segment. Here's the structure:

typedef struct
{
  Elf32_Word    sh_name;      /* Section name (string tbl index) */
  Elf32_Word    sh_type;      /* Section type */
  Elf32_Word    sh_flags;     /* Section flags */
  Elf32_Addr    sh_addr;      /* Section virtual addr at execution */
  Elf32_Off     sh_offset;    /* Section file offset */
  Elf32_Word    sh_size;      /* Section size in bytes */
  Elf32_Word    sh_link;      /* Link to another section */
  Elf32_Word    sh_info;      /* Additional section information */
  Elf32_Word    sh_addralign; /* Section alignment */
  Elf32_Word    sh_entsize;   /* Entry size if section holds table */
} Elf32_Shdr;

sh_name: The name of the section. This is an index inside a string table. A string table is a collection of null terminated strings delimited by null bytes; e.g., "\0name1\0name2\0name3\0\0"; the next paragraph will be about them.
sh_type: The section type. There can be more than one section of the same type, with a few exceptions (e.g., there can be only one SHT_SYMTAB and only one SHT_DYNSYM). Sections with type SHT_PROGBITS contain actual initialized code/data; the ones with type SHT_NOBITS contain uninitialized data (which does not require file storage, e.g., the .bss section).
sh_flags: Section flags. A bit-mask of section attributes: writable (SHF_WRITE), occupies memory at run time (SHF_ALLOC), and contains executable code (SHF_EXECINSTR).
sh_addr: Virtual address of the section. Once the image has been mapped in memory, it will be a virtual address inside a segment.
sh_offset: The offset from the start of the file at which the section is located; this field is a physical offset.
sh_size: The size of the section in bytes. For most sections, this is the space occupied in the file; for SHT_NOBITS sections, which take no space in the file, it is the size the section will have once mapped in memory.
sh_link: This is a special field. Its contents depend on the section type. For SHT_SYMTAB and SHT_DYNSYM, this field holds a section table index, which gives the section containing the string table with the names of the symbols; take a look at the manual for other types.
sh_info: The content of this field depends on the section type.
sh_entsize: If a table is contained within the section, then this field gives the size, in bytes, of one entry in the table.

So, let's see the usual output from "readelf":

quake2@quake2-desktop:~$ readelf -S /bin/ls
There are 28 section headers, starting at offset 0x17378:

Section Headers:
  [Nr] Name              Type            Addr     Off    Size   ES Flg Lk Inf Al
  [ 0]                   NULL            00000000 000000 000000 00      0   0  0
  [ 1] .interp           PROGBITS        08048154 000154 000013 00   A  0   0  1
  [ 2] .note.ABI-tag     NOTE            08048168 000168 000020 00   A  0   0  4
  [ 3] .hash             HASH            08048188 000188 000330 04   A  5   0  4
  [ 4] .gnu.hash         GNU_HASH        080484b8 0004b8 00005c 04   A  5   0  4
  [ 5] .dynsym           DYNSYM          08048514 000514 000690 10   A  6   1  4
  [ 6] .dynstr           STRTAB          08048ba4 000ba4 0004af 00   A  0   0  1
  [ 7] .gnu.version      VERSYM          08049054 001054 0000d2 02   A  5   0  2
  [ 8] .gnu.version_r    VERNEED         08049128 001128 0000d0 00   A  6   3  4
  [ 9] .rel.dyn          REL             080491f8 0011f8 000028 08   A  5   0  4
  [10] .rel.plt          REL             08049220 001220 0002e8 08   A  5  12  4
  [11] .init             PROGBITS        08049508 001508 000030 00  AX  0   0  4
  [12] .plt              PROGBITS        08049538 001538 0005e0 04  AX  0   0  4
  [13] .text             PROGBITS        08049b20 001b20 01145c 00  AX  0   0 16
  [14] .fini             PROGBITS        0805af7c 012f7c 00001c 00  AX  0   0  4
  [15] .rodata           PROGBITS        0805afa0 012fa0 003e4c 00   A  0   0 32
  [16] .eh_frame_hdr     PROGBITS        0805edec 016dec 00002c 00   A  0   0  4
  [17] .eh_frame         PROGBITS        0805ee18 016e18 00009c 00   A  0   0  4
  [18] .ctors            PROGBITS        0805fef0 016ef0 000008 00  WA  0   0  4
  [19] .dtors            PROGBITS        0805fef8 016ef8 000008 00  WA  0   0  4
  [20] .jcr              PROGBITS        0805ff00 016f00 000004 00  WA  0   0  4
  [21] .dynamic          DYNAMIC         0805ff04 016f04 0000e8 08  WA  6   0  4
  [22] .got              PROGBITS        0805ffec 016fec 000008 04  WA  0   0  4
  [23] .got.plt          PROGBITS        0805fff4 016ff4 000180 04  WA  0   0  4
  [24] .data             PROGBITS        08060180 017180 000110 00  WA  0   0 32
  [25] .bss              NOBITS          080602a0 017290 00046c 00  WA  0   0 32
  [26] .gnu_debuglink    PROGBITS        00000000 017290 000008 00      0   0  1
  [27] .shstrtab         STRTAB          00000000 017298 0000df 00      0   0  1
Key to Flags:
  W (write), A (alloc), X (execute), M (merge), S (strings)
  I (info), L (link order), G (group), x (unknown)
  O (extra OS processing required) o (OS specific), p (processor specific)

As you can see from the output, the section table starts with a null entry. Some sections have an address field equal to 0: these sections will not be mapped into memory (take a look at the previous program header table, you won't find them). From the output, you can see that there are two string tables. Take a look at their addresses: one is different from zero, the other is zero. This means that one string table will be discarded while mapping the file in memory; the one which is not discarded will hold the names of the dynamic symbols (.dynsym) which are used by the dynamic linker to resolve runtime relocations. Take note of the index of the last section; it is the same index that you will find in the field e_shstrndx of the ELF header: the .shstrtab section holds the names of the ELF sections.

String table

The string table, strictly speaking, is not a structure, but rather is an un-ordered collection of strings. There's not much to say about string tables; just keep in mind that when you see a name field inside an ELF structure, it will always be an index into a string table. Which string table? It depends on the context; but, whenever a string table is encountered, there is always a way to know it. Let's take a look at a memory dump of the last section:

quake2@quake2-desktop:~$ readelf -x 27 /bin/ls

Hex dump of section '.shstrtab':
  0x00000000 002e7368 73747274 6162002e 696e7465 ..shstrtab..inte
  0x00000010 7270002e 6e6f7465 2e414249 2d746167 rp..note.ABI-tag
  0x00000020 002e676e 752e6861 7368002e 64796e73 ..gnu.hash..dyns
  0x00000030 796d002e 64796e73 7472002e 676e752e ym..dynstr..gnu.
  0x00000040 76657273 696f6e00 2e676e75 2e766572 version..gnu.ver
  0x00000050 73696f6e 5f72002e 72656c2e 64796e00 sion_r..rel.dyn.
  0x00000060 2e72656c 2e706c74 002e696e 6974002e .rel.plt..init..
  0x00000070 74657874 002e6669 6e69002e 726f6461 text..fini..roda
  0x00000080 7461002e 65685f66 72616d65 5f686472 ta..eh_frame_hdr
  0x00000090 002e6568 5f667261 6d65002e 63746f72 ..eh_frame..ctor
  0x000000a0 73002e64 746f7273 002e6a63 72002e64 s..dtors..jcr..d
  0x000000b0 796e616d 6963002e 676f7400 2e676f74 ynamic..got..got
  0x000000c0 2e706c74 002e6461 7461002e 62737300 .plt..data..bss.
  0x000000d0 2e676e75 5f646562 75676c69 6e6b00   .gnu_debuglink.

It's just a collection of strings (the value on the left-most column is an offset): plain and simple.

Let's write some code...

We've reached the point where we can start writing some code, which will show the ELF header and will perform sanity checks on it. The full source can be found in the attachment.

#include <sys/types.h>
#include <sys/stat.h>
#include <fcntl.h>
#include <unistd.h> /* read(), close() */
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <errno.h>
#include <elf.h>

/* sanity checks: ELF signature, 32 bit class, little-endian data */
int elf_is_valid(Elf32_Ehdr *elf_hdr)
{
    if( (elf_hdr->e_ident[EI_MAG0] != 0x7F) ||
        (elf_hdr->e_ident[EI_MAG1] != 'E') ||
        (elf_hdr->e_ident[EI_MAG2] != 'L') ||
        (elf_hdr->e_ident[EI_MAG3] != 'F') )
    {
        return 0;
    }

    if(elf_hdr->e_ident[EI_CLASS] != ELFCLASS32)
        return 0;

    if(elf_hdr->e_ident[EI_DATA] != ELFDATA2LSB)
        return 0;

    return 1;
}

static char *elf_types[] =
{
    "ET_NONE", "ET_REL", "ET_EXEC", "ET_DYN", "ET_CORE", "ET_NUM"
};

char *get_elf_type(Elf32_Ehdr *elf_hdr)
{
    if(elf_hdr->e_type > 5)
        return NULL;

    return elf_types[elf_hdr->e_type];
}

int print_elf_header(Elf32_Ehdr *elf_hdr)
{
    char *sz_elf_type = NULL;

    if(!elf_hdr)
        return 0;

    printf("ELF header information\n");

    sz_elf_type = get_elf_type(elf_hdr);
    if(sz_elf_type)
        printf("- Type: %s\n", sz_elf_type);
    else
        printf("- Type: %04x\n", elf_hdr->e_type);

    printf("- Version: %u\n", elf_hdr->e_version);
    printf("- Entrypoint: 0x%08x\n", elf_hdr->e_entry);
    printf("- Program header table offset: 0x%08x\n", elf_hdr->e_phoff);
    printf("- Section header table offset: 0x%08x\n", elf_hdr->e_shoff);
    printf("- Flags: 0x%08x\n", elf_hdr->e_flags);
    printf("- ELF header size: %d\n", elf_hdr->e_ehsize);
    printf("- Program header size: %d\n", elf_hdr->e_phentsize);
    printf("- Program header entries: %d\n", elf_hdr->e_phnum);
    printf("- Section header size: %d\n", elf_hdr->e_shentsize);
    printf("- Section header entries: %d\n", elf_hdr->e_shnum);
    printf("- Section string table index: %d\n", elf_hdr->e_shstrndx);

    return 1;
}

int main(int argc, char *argv[])
{
    int fd_elf = -1;
    u_char *p_base = NULL;
    struct stat elf_stat;
    Elf32_Ehdr *p_ehdr = NULL;

    if(argc < 2)
    {
        printf("Usage: %s </path/to/file>\n", argv[0]);
        return 1;
    }

    fd_elf = open(argv[1], O_RDONLY);
    if(fd_elf == -1)
    {
        fprintf(stderr, "Could not open %s: %s\n", argv[1], strerror(errno));
        return 1;
    }

    if(fstat(fd_elf, &elf_stat) == -1)
    {
        fprintf(stderr, "Could not stat %s: %s\n", argv[1], strerror(errno));
        close(fd_elf);
        return 1;
    }

    //a file shorter than the ELF header cannot be a valid image
    if(elf_stat.st_size < (off_t)sizeof(Elf32_Ehdr))
    {
        fprintf(stderr, "Invalid ELF file\n");
        close(fd_elf);
        return 1;
    }

    p_base = (u_char *)calloc(elf_stat.st_size, sizeof(u_char));
    if(!p_base)
    {
        fprintf(stderr, "Not enough memory\n");
        close(fd_elf);
        return 1;
    }

    if(read(fd_elf, p_base, elf_stat.st_size) != elf_stat.st_size)
    {
        fprintf(stderr, "Error while reading file: %s\n", strerror(errno));
        free(p_base);
        close(fd_elf);
        return 1;
    }

    close(fd_elf);

    //the ELF header sits at the very beginning of the file
    p_ehdr = (Elf32_Ehdr *)p_base;
    if(elf_is_valid(p_ehdr))
        print_elf_header(p_ehdr);
    else
        fprintf(stderr, "Invalid ELF file\n");

    free(p_base);
    return 0;
}

The code is quite simple. As you can see, the ELF header is at the beginning of the file, so we just declare a pointer to the start of the allocated buffer (which contains the file). Take a look at the elf_is_valid function, which performs the sanity checks.

Let's see another source, which this time will also show the section header table and the program header table:

//the includes, elf_is_valid() and print_elf_header() are the same as in the previous listing

static char *ptypes[] =
{
    "PT_NULL", "PT_LOAD", "PT_DYNAMIC", "PT_INTERP", "PT_NOTE", "PT_SHLIB", "PT_PHDR"
};

int print_program_header(Elf32_Phdr *phdr, unsigned int index)
{
    if(!phdr)
        return 0;

    printf("Program header %u\n", index);
    if(phdr->p_type <= 6)
        printf("- Type: %s\n", ptypes[phdr->p_type]);
    else
        printf("- Type: %08x\n", phdr->p_type);

    printf("- Offset: %08x\n", phdr->p_offset);
    printf("- Virtual Address: %08x\n", phdr->p_vaddr);
    printf("- Physical Address: %08x\n", phdr->p_paddr);
    printf("- File size: %u\n", phdr->p_filesz);
    printf("- Memory size: %u\n", phdr->p_memsz);
    printf("- Flags: %08x\n", phdr->p_flags);
    printf("- Alignment: %08x\n", phdr->p_align);

    return 1;
}

static char *stypes[] =
{
    "SHT_NULL", "SHT_PROGBITS", "SHT_SYMTAB", "SHT_STRTAB", "SHT_RELA", "SHT_HASH",
    "SHT_DYNAMIC", "SHT_NOTE", "SHT_NOBITS", "SHT_REL", "SHT_SHLIB", "SHT_DYNSYM"
};

int print_section_header(Elf32_Shdr *shdr, unsigned int index, char *strtable)
{
    if(!shdr)
        return 0;

    printf("Section header: %u\n", index);
    printf("- Name index: %u\n", shdr->sh_name);
    //as you can see, we're using sh_name as an index into the string table
    printf("- Name: %s\n", strtable + shdr->sh_name);

    if(shdr->sh_type <= 11)
        printf("- Type: %s\n", stypes[shdr->sh_type]);
    else
        printf("- Type: %04x\n", shdr->sh_type);

    printf("- Flags: %08x\n", shdr->sh_flags);
    printf("- Address: %08x\n", shdr->sh_addr);
    printf("- Offset: %08x\n", shdr->sh_offset);
    printf("- Size: %08x\n", shdr->sh_size);
    printf("- Link: %08x\n", shdr->sh_link);
    printf("- Info: %08x\n", shdr->sh_info);
    printf("- Address alignment: %08x\n", shdr->sh_addralign);
    printf("- Entry size: %08x\n", shdr->sh_entsize);

    return 1;
}

int main(int argc, char *argv[])
{
    int fd_elf = -1;
    u_char *p_base = NULL;
    char *p_strtable = NULL;
    struct stat elf_stat;
    Elf32_Ehdr *p_ehdr = NULL;
    Elf32_Phdr *p_phdr = NULL;
    Elf32_Shdr *p_shdr = NULL;
    int i;

    if(argc < 2)
    {
        printf("Usage: %s </path/to/file>\n", argv[0]);
        return 1;
    }

    fd_elf = open(argv[1], O_RDONLY);
    if(fd_elf == -1)
    {
        fprintf(stderr, "Could not open %s: %s\n", argv[1], strerror(errno));
        return 1;
    }

    if(fstat(fd_elf, &elf_stat) == -1)
    {
        fprintf(stderr, "Could not stat %s: %s\n", argv[1], strerror(errno));
        close(fd_elf);
        return 1;
    }

    p_base = (u_char *)calloc(elf_stat.st_size, sizeof(u_char));
    if(!p_base)
    {
        fprintf(stderr, "Not enough memory\n");
        close(fd_elf);
        return 1;
    }

    if(read(fd_elf, p_base, elf_stat.st_size) != elf_stat.st_size)
    {
        fprintf(stderr, "Error while reading file: %s\n", strerror(errno));
        free(p_base);
        close(fd_elf);
        return 1;
    }

    close(fd_elf);

    p_ehdr = (Elf32_Ehdr *)p_base;
    if(elf_is_valid(p_ehdr))
    {
        print_elf_header(p_ehdr);

        printf("\n");

        //to reach the section header table and the program header table
        //we simply add the offset of these tables to the base address
        p_phdr = (Elf32_Phdr *)(p_base + p_ehdr->e_phoff);
        p_shdr = (Elf32_Shdr *)(p_base + p_ehdr->e_shoff);

        //this is the first example of string table usage: the e_shstrndx field
        //holds an index into the section header table,
        //which is addressed by p_shdr. The section's
        //sh_offset field will hold the offset
        //of the string table; to get the actual pointer
        //we just have to add it to the base address
        p_strtable = (char *)(p_base + p_shdr[p_ehdr->e_shstrndx].sh_offset);

        for(i = 0; i < p_ehdr->e_phnum; i++)
        {
            print_program_header(&p_phdr[i], i);
        }

        for(i = 0; i < p_ehdr->e_shnum; i++)
        {
            print_section_header(&p_shdr[i], i, p_strtable);
        }
    }
    else
        printf("Invalid ELF file\n");

    free(p_base);
    return 0;
}

The interesting parts have been commented.

Symbol table

The symbol table is a fundamental structure of the ELF format. As the name says, it's a table which contains an array of symbols. The ELF manual provides us an elegant definition for the symbol table:

"An object file's symbol table holds information needed to locate and relocate a program's symbolic definitions and references."

Let's take an example, anticipating the dynamic linking process. When an executable needs a function defined elsewhere (usually inside a shared object), the link editor creates an entry in the dynamic symbol table whose value is 0 (the symbol's value is not the same thing as the symbol's name) and whose name is the name of the external function needed by the executable. Together with the entry in the symbol table, an additional entry is created in the relocation table, which holds additional information on how to resolve the symbol (this will be explained later). At load time, the dynamic linker looks at the loaded libraries, searching for one that defines a symbol with the same name as the symbol used by the executable. The value of the symbol found in one of the loaded libraries gives the actual function's entry point. This way, the dynamic linker "resolves" a symbol which refers to an external one. Here is the symbol table structure:

typedef struct {
  Elf32_Word    st_name;     /* Symbol name (string tbl index) */
  Elf32_Addr    st_value;    /* Symbol value */
  Elf32_Word    st_size;     /* Symbol size */
  unsigned char st_info;     /* Symbol type and binding */
  unsigned char st_other;    /* Symbol visibility */
  Elf32_Section st_shndx;    /* Section index */
} Elf32_Sym;

st_name: The name of the symbol; of course, it's an index into a string table. The string table to use depends, as usual, on the context.
st_value: The value of the symbol; it may be an address, an offset, an index, etc... It depends on the symbol type.
st_size: The size of the symbol; it is meaningful only for some kinds of symbols: e.g., for a symbol of type OBJECT, it will hold the object's size in bytes.
st_info: Actually, this is a packed byte. It's made up of two nibbles: the low one contains the symbol type, the high one contains the binding (local, global, weak). The file elf.h defines some useful macros to manipulate this field:

ELF32_ST_BIND(i): returns the binding of the symbol.
ELF32_ST_TYPE(i): returns the type of the symbol.
ELF32_ST_INFO(b,t): packs b and t into a single byte; b is the binding and t is the symbol type.

st_other: According to the ELF specification, the value of this field is undefined. On Linux systems, it holds the symbol visibility (e.g., DEFAULT, HIDDEN).
st_shndx: The index of the section which holds the symbol described by this entry.

The symbol table is quite a complex structure, so it's better if you take a look at the ELF manual. Let's see the output from "readelf":

quake2@quake2-desktop:~$ readelf -W -s /bin/ls

Symbol table '.dynsym' contains 105 entries:
   Num:    Value  Size Type    Bind   Vis      Ndx Name
[...]
    23: 00000000   198 FUNC    GLOBAL DEFAULT  UND strncpy@GLIBC_2.0 (2)
    24: 00000000    35 FUNC    GLOBAL DEFAULT  UND freecon
    25: 00000000    88 FUNC    GLOBAL DEFAULT  UND memset@GLIBC_2.0 (2)
    26: 00000000   441 FUNC    GLOBAL DEFAULT  UND __libc_start_main@GLIBC_2.0 (2)
    27: 00000000    68 FUNC    GLOBAL DEFAULT  UND mempcpy@GLIBC_2.1 (5)
    28: 00000000    80 FUNC    GLOBAL DEFAULT  UND __memcpy_chk@GLIBC_2.3.4 (6)
    29: 00000000   186 FUNC    GLOBAL DEFAULT  UND _obstack_begin@GLIBC_2.0 (2)
    30: 00000000    19 FUNC    GLOBAL DEFAULT  UND _exit@GLIBC_2.0 (2)
    31: 00000000   441 FUNC    GLOBAL DEFAULT  UND strrchr@GLIBC_2.0 (2)
    32: 00000000   336 FUNC    GLOBAL DEFAULT  UND __assert_fail@GLIBC_2.0 (2)
    33: 00000000    29 FUNC    GLOBAL DEFAULT  UND bindtextdomain@GLIBC_2.0 (2)
    34: 00000000   597 FUNC    GLOBAL DEFAULT  UND mbrtowc@GLIBC_2.0 (2)
    35: 00000000    62 FUNC    GLOBAL DEFAULT  UND gettimeofday@GLIBC_2.0 (2)
    36: 00000000    64 FUNC    GLOBAL DEFAULT  UND __ctype_toupper_loc@GLIBC_2.3 (7)
    37: 00000000    69 FUNC    GLOBAL DEFAULT  UND __lxstat64@GLIBC_2.2 (3)
    38: 00000000   446 FUNC    GLOBAL DEFAULT  UND _obstack_newchunk@GLIBC_2.0 (2)
    39: 00000000   102 FUNC    GLOBAL DEFAULT  UND __overflow@GLIBC_2.0 (2)
    40: 00000000    73 FUNC    GLOBAL DEFAULT  UND dcgettext@GLIBC_2.0 (2)
    41: 00000000   100 FUNC    GLOBAL DEFAULT  UND sigaction@GLIBC_2.0 (2)
    42: 00000000   351 FUNC    GLOBAL DEFAULT  UND strverscmp@GLIBC_2.1 (5)
    43: 00000000   152 FUNC    GLOBAL DEFAULT  UND opendir@GLIBC_2.0 (2)
    44: 00000000    71 FUNC    GLOBAL DEFAULT  UND getopt_long@GLIBC_2.0 (2)
    45: 00000000    64 FUNC    GLOBAL DEFAULT  UND ioctl@GLIBC_2.0 (2)
    46: 00000000    64 FUNC    GLOBAL DEFAULT  UND __ctype_b_loc@GLIBC_2.3 (7)
    47: 00000000   226 FUNC    GLOBAL DEFAULT  UND iswcntrl@GLIBC_2.0 (2)
    48: 00000000    50 FUNC    GLOBAL DEFAULT  UND isatty@GLIBC_2.0 (2)
    49: 00000000   539 FUNC    GLOBAL DEFAULT  UND fclose@GLIBC_2.1 (5)
    50: 00000000    25 FUNC    GLOBAL DEFAULT  UND mbsinit@GLIBC_2.0 (2)
    51: 00000000    54 FUNC    GLOBAL DEFAULT  UND _setjmp@GLIBC_2.0 (2)
    52: 00000000    56 FUNC    GLOBAL DEFAULT  UND tcgetpgrp@GLIBC_2.0 (2)
    53: 00000000    60 FUNC    GLOBAL DEFAULT  UND mktime@GLIBC_2.0 (2)
    54: 00000000   222 FUNC    GLOBAL DEFAULT  UND readdir64@GLIBC_2.2 (3)
    55: 00000000    70 FUNC    GLOBAL DEFAULT  UND memcpy@GLIBC_2.0 (2)
    56: 00000000    76 FUNC    GLOBAL DEFAULT  UND strtoul@GLIBC_2.0 (2)
    57: 00000000   175 FUNC    GLOBAL DEFAULT  UND strlen@GLIBC_2.0 (2)
    58: 00000000   299 FUNC    GLOBAL DEFAULT  UND getpwuid@GLIBC_2.0 (2)
    59: 00000000   186 FUNC    GLOBAL DEFAULT  UND acl_extended_file@ACL_1.0 (8)
    60: 00000000  1931 FUNC    GLOBAL DEFAULT  UND setlocale@GLIBC_2.0 (2)
    61: 00000000    37 FUNC    GLOBAL DEFAULT  UND strcpy@GLIBC_2.0 (2)
    62: 00000000   148 FUNC    GLOBAL DEFAULT  UND raise@GLIBC_2.0 (2)
    63: 00000000   178 FUNC    GLOBAL DEFAULT  UND fwrite_unlocked@GLIBC_2.1 (5)
    64: 00000000   293 FUNC    GLOBAL DEFAULT  UND clock_gettime@GLIBC_2.2 (9)
    65: 00000000   123 FUNC    GLOBAL DEFAULT  UND getfilecon
    66: 00000000    98 FUNC    GLOBAL DEFAULT  UND closedir@GLIBC_2.0 (2)
    67: 00000000   403 FUNC    GLOBAL DEFAULT  UND fwrite@GLIBC_2.0 (2)
    68: 00000000   174 FUNC    GLOBAL DEFAULT  UND sigprocmask@GLIBC_2.0 (2)
    69: 00000000    32 FUNC    GLOBAL DEFAULT  UND __stack_chk_fail@GLIBC_2.4 (10)
    70: 00000000    42 FUNC    GLOBAL DEFAULT  UND __fpending@GLIBC_2.2 (3)
    71: 00000000   123 FUNC    GLOBAL DEFAULT  UND lgetfilecon
    72: 00000000   223 FUNC    GLOBAL DEFAULT  UND error@GLIBC_2.0 (2)
    73: 00000000   299 FUNC    GLOBAL DEFAULT  UND getgrgid@GLIBC_2.0 (2)
    74: 00000000    75 FUNC    GLOBAL DEFAULT  UND __strtoull_internal@GLIBC_2.0 (2)
    75: 00000000   115 FUNC    GLOBAL DEFAULT  UND sigaddset@GLIBC_2.0 (2)
[...]

As you can see, the executable "/bin/ls" has only a dynamic symbol table (".dynsym"), i.e., symbols that are resolved by the dynamic linker. You may have noticed that some symbols have "@GLIBC_2.2" or a similar suffix appended to their names: these strings are not part of the name; they are appended by "readelf", which parses GNU's versioning information. If you want to know how to read the versioning information, take a look at the source of "readelf", which can be found in the "binutils" package.

Relocation table

The relocation table holds an array of entries, each of which describes how to relocate part of the code/data sections. Let's try to understand the relocation process better. Various instructions within the code section refer to objects (variables and/or functions) which reside somewhere else (within the code section or in another place). To access these objects, an absolute address is needed. This is not a problem for executables that are linked to run at a fixed base address, but libraries can be loaded at any base address, so we need a way to locate these objects by "manipulating" the absolute address which references them. This is exactly what relocations do: each one names a location and tells the dynamic linker (or the loader) how to adjust the address stored there. There are various kinds of relocations, but for this article, the important ones are those for the x86 architecture. There are two big categories of relocation entries: RELA and REL. On the x86 architecture, only REL relocations are used (whilst on x64, only RELA relocations are used). Here is the REL relocation structure:

typedef struct {
  Elf32_Addr    r_offset;    /* Address */
  Elf32_Word    r_info;      /* Relocation type and symbol index */
} Elf32_Rel;

r_offset: Offset to which the relocation must be applied.
r_info: A word which contains the relocation type and, if the relocation is associated with a symbol, the index of that symbol in the symbol table being used.

The interesting type of relocation for this article will be R_386_JMP_SLOT, which will be described in the last paragraph of the last chapter. It's time for an example which deals with relocations. The shared object used is "libhook.so", which will be built in the next part of the article, but this is just an example, so don't worry if you can't reproduce it (you should try to reproduce it using another library, as an exercise). Let's take a look at the first few relocations from "libhook.so":

quake2@quake2-desktop:~/elfinj$ readelf -r libhook.so

Relocation section '.rel.dyn' at offset 0x3b4 contains 49 entries:
 Offset     Info    Type            Sym.Value  Sym. Name
00000679  00000008 R_386_RELATIVE   
0000069d  00000008 R_386_RELATIVE   
000006b8  00000008 R_386_RELATIVE   
000006c5  00000008 R_386_RELATIVE   
000006ca  00000008 R_386_RELATIVE

From this output, it is clear that the relocation type being used is R_386_RELATIVE, so from the ELF manual:

R_386_RELATIVE: The link editor creates this relocation type for dynamic linking. Its offset member gives a location within a shared object that contains a value representing a relative address. The dynamic linker computes the corresponding virtual address by adding the virtual address at which the shared object was loaded to the relative address. Relocation entries for this type must specify 0 for the symbol table index.

So, the r_offset field holds the address to which the relocation must be applied. Let's choose the relocation with offset 0x000006b8. Here's the content at this address:

6b5:    c7 04 24 81 0a 00 00     movl   $0xa81,(%esp)
6bc:    e8 fc ff ff ff           call   6bd <inithooklib+0x2d>

The offset referenced by the relocation is the immediate operand of the "movl" instruction, which, rewritten in Intel syntax, reads "mov dword ptr [esp], 0x00000A81". The instruction starts at 0x6B5 and its opcode, ModR/M, and SIB bytes take three bytes, so the 32-bit immediate starts at offset 0x6B8, which is exactly the relocation's r_offset. From the description, it is clear that the value stored there must be added to the image base (that is, the address at which the image of the library resides in memory) to form a valid absolute address. This kind of relocation does not have any symbol associated with it. The relocation process is performed by the dynamic linker while it's preparing the memory image of the ELF file. Have a look at the ELF manual for a complete list of valid relocation types for the x86 architecture.

Dynamic section

The last structure which will be described is the dynamic section. This section holds information for the dynamic linker, in particular, about how to retrieve dynamic symbols once the image has been loaded, the libraries required to load the image, the dynamic relocations, etc... Usually, the dynamic section is contained in its own section of type SHT_DYNAMIC; once the ELF has been loaded into memory, the dynamic section can be retrieved from the program header of type PT_DYNAMIC. As usual, the dynamic section is a table made up of various entries. The table ends with a NULL entry. Let's see the structure:

typedef struct {
  Elf32_Sword    d_tag;      /* Dynamic entry type */
  union {
      Elf32_Word d_val;      /* Integer value */
      Elf32_Addr d_ptr;      /* Address value */
  } d_un;
} Elf32_Dyn;

d_tag: The type of the entry; the interesting ones for this article are DT_PLTRELSZ, DT_SYMTAB, DT_STRTAB, DT_REL, and DT_JMPREL, so be sure to check them in the ELF manual and to understand how they work. The entry of type DT_SYMTAB gives the symbol table which holds the dynamic symbols, and the one of type DT_STRTAB gives the string table which holds the names of the dynamic symbols. So, as mentioned before, there is no ambiguity on which symbol table/string table to use.
d_val: An integer value, such as a size in bytes (e.g., for DT_PLTRELSZ).
d_ptr: The address of the object identified by the tag. If we're dealing with an executable, this will be a virtual address; otherwise, it will be a relative virtual address (it needs to be added to the image base).

Here's the dynamic section of "/bin/ls":

quake2@quake2-desktop:~$ readelf -d /bin/ls

Dynamic section at offset 0x16f04 contains 24 entries:
  Tag        Type                         Name/Value
 0x00000001 (NEEDED)                     Shared library: [librt.so.1]
 0x00000001 (NEEDED)                     Shared library: [libselinux.so.1]
 0x00000001 (NEEDED)                     Shared library: [libacl.so.1]
 0x00000001 (NEEDED)                     Shared library: [libc.so.6]
 0x0000000c (INIT)                       0x8049508
 0x0000000d (FINI)                       0x805af7c
 0x00000004 (HASH)                       0x8048188
 0x6ffffef5 (GNU_HASH)                   0x80484b8
 0x00000005 (STRTAB)                     0x8048ba4
 0x00000006 (SYMTAB)                     0x8048514
 0x0000000a (STRSZ)                      1199 (bytes)
 0x0000000b (SYMENT)                     16 (bytes)
 0x00000015 (DEBUG)                      0x0
 0x00000003 (PLTGOT)                     0x805fff4
 0x00000002 (PLTRELSZ)                   744 (bytes)
 0x00000014 (PLTREL)                     REL
 0x00000017 (JMPREL)                     0x8049220
 0x00000011 (REL)                        0x80491f8
 0x00000012 (RELSZ)                      40 (bytes)
 0x00000013 (RELENT)                     8 (bytes)
 0x6ffffffe (VERNEED)                    0x8049128
 0x6fffffff (VERNEEDNUM)                 3
 0x6ffffff0 (VERSYM)                     0x8049054
 0x00000000 (NULL)                       0x0

Of course, the dynamic section does not hold any section index, since once the image is loaded into memory, sections lose their meaning, so there are only (relative) virtual addresses. From the dynamic section of "/bin/ls", it can be seen that this executable requires four libraries to run, namely: "librt.so.1", "libselinux.so.1", "libacl.so.1", and "libc.so.6".

Hash tables

If you take a deeper look at the sections list, you will find a section of type SHT_HASH. This means that the section holds a hash table. Hash tables are used for fast symbol lookup, but they're not used in this article, so there's nothing to care about, except one thing: each hash table has a field called nchain (take a look at Fig. 5-11 on page 94 of the general manual), which is equal to the total number of symbol names that have been hashed. So, this field gives the total number of symbols, and it will be used in the second part to perform symbol lookup (to know when to stop searching for a particular symbol).

Let's write more code...

The brief description of the ELF format has ended, so it's time to see more snippets of code. Here they are:

static char *btypes[] = { "STB_LOCAL", "STB_GLOBAL", "STB_WEAK" };
static char *symtypes[] = { "STT_NOTYPE", "STT_OBJECT", "STT_FUNC", "STT_SECTION", "STT_FILE" };

void print_bind_type(u_char info)
{
    u_char bind = ELF32_ST_BIND(info);

    if(bind <= 2)
        printf("- Bind type: %s\n", btypes[bind]);
    else
        printf("- Bind type: %d\n", bind);
}

void print_sym_type(u_char info)
{
    u_char type = ELF32_ST_TYPE(info);

    if(type <= 4)
        printf("- Symbol type: %s\n", symtypes[type]);
    else
        printf("- Symbol type: %d\n", type);
}

int print_sym_table(u_char *filebase, Elf32_Shdr *section, char *strtable)
{
    Elf32_Sym *symbols;
    size_t sym_size = section->sh_entsize;
    size_t cur_size = 0;

    if(section->sh_type == SHT_SYMTAB)
        printf("Symbol table\n");
    else
        printf("Dynamic symbol table\n");

    if(sym_size != sizeof(Elf32_Sym))
    {
        printf("There's something evil with symbol table...\n");
        return 0;
    }

    //skip the first entry, which is the reserved null symbol
    symbols = (Elf32_Sym *)(filebase + section->sh_offset);
    symbols++;
    cur_size += sym_size;

    //a while loop (instead of do/while) avoids reading past the end
    //of a table which contains only the null symbol
    while(cur_size < section->sh_size)
    {
        printf("- Name index: %d\n", symbols->st_name);
        printf("- Name: %s\n", strtable + symbols->st_name);
        printf("- Value: 0x%08x\n", symbols->st_value);
        printf("- Size: 0x%08x\n", symbols->st_size);

        print_bind_type(symbols->st_info);
        print_sym_type(symbols->st_info);

        printf("- Section index: %d\n", symbols->st_shndx);
        cur_size += sym_size;
        symbols++;
    }

    return 1;
}

int main(int argc, char *argv[])
{
    int fd_elf = -1;
    u_char *p_base = NULL;
    char *p_strtable = NULL;
    struct stat elf_stat;
    Elf32_Ehdr *p_ehdr = NULL;
    Elf32_Phdr *p_phdr = NULL;
    Elf32_Shdr *p_shdr = NULL;
    int i;

    if(argc < 2)
    {
        printf("Usage: %s </path/to/file>\n", argv[0]);
        return 1;
    }

    fd_elf = open(argv[1], O_RDONLY);
    if(fd_elf == -1)
    {
        fprintf(stderr, "Could not open %s: %s\n", argv[1], strerror(errno));
        return 1;
    }

    if(fstat(fd_elf, &elf_stat) == -1)
    {
        fprintf(stderr, "Could not stat %s: %s\n", argv[1], strerror(errno));
        close(fd_elf);
        return 1;
    }

    p_base = (u_char *)calloc(sizeof(u_char), elf_stat.st_size);
    if(!p_base)
    {
        fprintf(stderr, "Not enough memory\n");
        close(fd_elf);
        return 1;
    }

    if(read(fd_elf, p_base, elf_stat.st_size) != elf_stat.st_size)
    {
        fprintf(stderr, "Error while reading file: %s\n", strerror(errno));
        free(p_base);
        close(fd_elf);
        return 1;
    }

    close(fd_elf);

    p_ehdr = (Elf32_Ehdr *)p_base;
    if(elf_is_valid(p_ehdr))
    {
        print_elf_header(p_ehdr);

        printf("\n");
        p_phdr = (Elf32_Phdr *)(p_base + p_ehdr->e_phoff);
        p_shdr = (Elf32_Shdr *)(p_base + p_ehdr->e_shoff);
        p_strtable = (char *)(p_base + p_shdr[p_ehdr->e_shstrndx].sh_offset);

        for(i = 0; i < p_ehdr->e_phnum; i++)
        {
            print_program_header(&p_phdr[i], i);
        }

        for(i = 0; i < p_ehdr->e_shnum; i++)
        {
            print_section_header(&p_shdr[i], i, p_strtable);

            if(p_shdr[i].sh_type == SHT_SYMTAB || p_shdr[i].sh_type == SHT_DYNSYM)
            {
                printf("This section holds a symbol table...\n");

                //being a symbol table, the field sh_link of the section header
                //will hold an index into the section table which gives the
                //section containing the string table
                print_sym_table(p_base, &p_shdr[i],
                  (char *)(p_base + p_shdr[p_shdr[i].sh_link].sh_offset));
            }
        }
    }
    else
        printf("Invalid ELF file\n");

    free(p_base);
    return 0;
}

As an exercise, you should write the code which prints out the dynamic section (by now, you should know how to do it).

ELF loading

This section is a brief description of the loading process of an ELF image. It is mainly about the PLT, so much more attention will be given to that topic, but it also presents a general overview of the loading process. For detailed information, you should read the manual (which gives a very good description of the loading process, with various examples of memory configurations).

Program headers

Program headers describe how to map one or more sections of the file into memory (e.g., the PT_LOAD type), and can hold information that might be useful at runtime after the loading process has ended. Some program headers have a special role:

PT_INTERP: This segment describes a memory area which holds a string. This string is an absolute path to a file, which is a program/library that works in conjunction with the system loader to actually load the ELF. On ELF files built for Linux, this program header holds:

quake2@quake2-desktop:~$ readelf -l /bin/ls
[...]
  INTERP         0x000154 0x08048154 0x08048154 0x00013 0x00013 R   0x1
      [Requesting program interpreter: /lib/ld-linux.so.2]
[...]

The program header is requesting the collaboration of /lib/ld-linux.so.2, which is the ELF loader in Linux:

quake2@quake2-desktop:~$ /lib/ld-linux.so.2
Usage: ld.so [OPTION]... EXECUTABLE-FILE [ARGS-FOR-PROGRAM...]
You have invoked `ld.so', the helper program for shared library executables.
This program usually lives in the file `/lib/ld.so', and special directives
in executable files using ELF shared libraries tell the system's program
loader to load the helper program from this file. This helper program loads
the shared libraries needed by the program executable, prepares the program
to run, and runs it. You may invoke this helper program directly from the
command line to load and run an ELF executable file; this is like executing
that file itself, but always uses this helper program from the file you
specified, instead of the helper program file specified in the executable
file you run. This is mostly of use for maintainers to test new versions
of this helper program; chances are you did not intend to run this program.

  --list                list all dependencies and how they are resolved
  --verify              verify that given object really is a dynamically linked
                        object we can handle
  --library-path PATH   use given PATH instead of content of the environment
                        variable LD_LIBRARY_PATH
  --inhibit-rpath LIST  ignore RUNPATH and RPATH information in object names
                        in LIST

PT_LOAD: This segment type holds information on how to map sections which usually contain data and/or code.
PT_DYNAMIC: This segment holds information for the linker on where to find symbols, string tables, relocations, etc... Usually, its content is the same as in the dynamic section.

Dynamic section

The dynamic section holds all the information needed to dynamically link the executable/library. A few words on dynamic linking: it's the process that applies relocations to the ELF image and, if lazy binding is used, resolves the remaining external symbols later, when they are first used. The dynamic section also holds information not strictly needed for dynamic linking, such as the entry points of the initialization/finalization functions. The next paragraph will be entirely on the PLT, which is the mechanism through which external functions get resolved at runtime. Here's a list of the important dynamic entry types:

DT_JMPREL: This field holds the address of a table containing an array of relocation entries related only to the PLT. This way, if lazy binding is enabled, the linker will ignore these relocations during loading.
DT_PLTREL: This field holds the type of the PLT relocations. For the x86 architecture, there can be only REL relocations.
DT_PLTRELSZ: The total size, in bytes, of the table addressed by DT_JMPREL.
DT_NEEDED: The name of a library that is required to load the image. As usual, this field is an offset into a string table, the one addressed by the DT_STRTAB dynamic entry.
DT_INIT: This field holds the address of a function to call on initialization. It is executed before "main" if the image is an executable; otherwise, it is executed before control goes back to the main executable if the image is a library.
DT_FINI: This field holds the address of a function to be called on finalization. The finalization functions are called in the reverse order of the initialization ones.

The PLT

The Procedure Linkage Table is a table in which the various entries are made up of code blocks. It's the principal component which allows the dynamic linker to resolve external functions. Here's an example: suppose that you write a program which references a function defined inside a library:

int main()
{
    int res = external_function(3,4);
    return 0;
}

Of course, to invoke the function, you need to know its absolute address, being an external one. The absolute address cannot be hardcoded by the linker, because usually libraries are loaded at different base addresses, so absolute addresses have no meaning for them. This situation is overcome by the use of the PLT, so when you call an external function, the following code is generated:

push 0x04
push 0x03
call external_function@plt
add esp, 8
[...]

; this is a PLT entry
external_function@plt (address 0xXXXXXX00):
    ; reloc_address is just a memory location
    external_function@plt+0x00: jmp dword ptr [reloc_address]
    ; reloc_offset is a byte offset (not an index) into the relocation table
    external_function@plt+0x06: push reloc_offset
    ; resolve_function is a function that will resolve the external symbol
    external_function@plt+0x0B: jmp resolve_function

; this is what you find at reloc_address, data is displayed using dwords
reloc_address: XXXXXX06 ........

What's happening here is that when the program reaches "call", it transfers execution to a PLT entry. The first instruction executed there is a "jmp", which transfers execution to the address stored at the location "reloc_address", which, as you can see, is the address of the instruction following the first "jmp" in the PLT entry. So, back again in the PLT, a byte offset is pushed onto the stack, and then execution is transferred to a function which, taking the last pushed value off the stack, will resolve the external symbol. By now, you might think that this procedure is painfully slow, with all those cache-killing jumps. But, let's go one step ahead and look at what's happening after the external symbol has been resolved:

push 0x04
push 0x03
call external_function@plt
add esp, 8
[...]

external_function@plt:
    external_function@plt+0x00: jmp dword ptr [reloc_address]
    external_function@plt+0x06: push reloc_offset
    external_function@plt+0x0B: jmp resolve_function

reloc_address: BFF31337 ........

Something has changed, hasn't it? After the external symbol has been resolved, the memory location addressed by "reloc_address" no longer contains the address of the instruction following the first jmp, but the actual entry point of the external function, so all the jmp-craziness is done only the first time. If you did not understand something, then read the manual carefully. The PLT is the most important thing in this two-part article, and the next part will deal much more with it, so be sure to know how it works. Anyway, by the end of this part, there will be a practical example using GDB on how the PLT works.

Building up our lab...

After the brief introduction to the ELF format, it's time to start working on preparing our "laboratory", that is, building a bunch of sample ELFs that we will use in the next part. We will build three ELFs: one executable, and two libraries, one of which will be the library that will get injected into the executable and will hook the function inside the other library (which is used by the executable directly). Let's start building the first two image files that we will use: the executable and the library used by it. Here's the source of the library, only the .c file is shown:

#include "libdummy.h"

int dummy_add(int a, int b)
{
    return a+b;
}

Compile and link it:

gcc -fPIC -c libdummy.c
ld -shared -soname libdummy.so.1 -o libdummy.so.1.0 -lc libdummy.o

Now, let's update the cache with "ldconfig" (if you move the library to /usr/lib or any other system path, you can omit the "-n ." parameter):

ldconfig -v -n .

and create the symbolic link needed by the linker, so we can link with "-ldummy":

ln -sf libdummy.so.1 libdummy.so

Here's the executable which makes use of the library just built:

#include <stdio.h>
#include "libdummy.h"

int main()
{
    int a,b;
    int res = 0;

    printf("Enter the first number: ");
    scanf("%d", &a);
    printf("Enter the second number: ");
    scanf("%d", &b);
    res = dummy_add(a,b);
    printf("Result is: %d\n", res);
    return 0;
}

Compile and link it:

gcc -o dummyelf dummyelf.c -L. -ldummy

Don't forget to run the executable to check that everything works (if you did not move libdummy.so.1.0 to /usr/lib, you should set LD_LIBRARY_PATH: "export LD_LIBRARY_PATH=.:$LD_LIBRARY_PATH").

That's all for now; in the next chapter, we will explore the PLT. We will build the second library in the next part of this article.

PLT: A practical example

The PLT has been discussed in much detail, but there's nothing better than a real example. So, let's play around with the executable just built. First of all, let's debug the executable with GDB, setting a breakpoint on the "dummy_add" call (keep in mind that your addresses may be different):

quake2@quake2-desktop:~/elfinj$ gdb dummyelf
GNU gdb 6.8-debian
Copyright (C) 2008 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http:>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law. Type "show copying"
and "show warranty" for details.
This GDB was configured as "i486-linux-gnu"...
(gdb) disassemble main
Dump of assembler code for function main:
[...]
0x08048507 <main+99>:    call   0x80483cc <dummy_add@plt>
[...]
End of assembler dump.
(gdb) break *0x08048507
Breakpoint 1 at 0x8048507
(gdb)

Start the program, let it run until it hits the breakpoint, then step into the call:

(gdb) display/i $pc
(gdb) run
Starting program: /home/quake2/elfinj/dummyelf 
Enter the first number: 23
Enter the second number: 23

Breakpoint 1, 0x08048507 in main ()
1: x/i $pc
0x8048507 <main+99>:    call   0x80483cc <dummy_add@plt>
Current language:  auto; currently asm
(gdb) stepi
0x080483cc in dummy_add@plt ()
1: x/i $pc
0x80483cc <dummy_add@plt>:    jmp    *0x804a00c
(gdb)

So, as expected, the first instruction is a jmp to the value addressed by 0x804a00c; let's see what's contained there:

(gdb) print /x *0x804a00c
$1 = 0x80483d2
(gdb)

It holds 0x80483d2, which is the address of the instruction following the first jmp; this can be checked by disassembling the instructions around eip:

(gdb) disassemble
Dump of assembler code for function dummy_add@plt:
0x080483cc <dummy_add@plt+0>:    jmp    *0x804a00c
0x080483d2 <dummy_add@plt+6>:    push   $0x18
0x080483d7 <dummy_add@plt+11>:    jmp    0x804838c <_init+48>
End of assembler dump.
(gdb)

As I told you, the instruction following the jmp pushes onto the stack a byte offset into the relocation table; let's check if this is true: take the base address of the relocation table (the value of the DT_JMPREL dynamic entry, 0x8048334 here) and add the offset to it:

(gdb) print /x *(0x8048334+0x18)
$4 = 0x804a00c
(gdb)

And, if you check the Elf32_Rel structure, you will see that the first field has Elf32_Addr as its type, which is a 32-bit unsigned integer and holds the address to which the relocation must be applied. That address is 0x804a00c, the very location the first jmp reads from. The dynamic linker will replace the value at 0x804a00c with the actual address of the dummy_add function:

0x80483d7 <dummy_add@plt+11>:    jmp    0x804838c <_init+48>
(gdb) step
Single stepping until exit from function dummy_add@plt, 
which has no line number information.
0xb8095168 in dummy_add () from ./libdummy.so.1
1: x/i $pc
0xb8095168 <dummy_add>:    push   %ebp
(gdb) print /x *0x804a00c
$1 = 0xb8095168
(gdb)

The memory location 0x804a00c now holds the virtual address of dummy_add, that is, 0xb8095168.

Conclusions

This tutorial should have introduced the reader to the basics of the ELF format and how the system (in this case, Linux) loads it. It is by no means intended to replace the ELF manuals, which are required reading for the next part, where things will get far more complicated. I decided to split the article so that, if you are already familiar with the ELF format, you can skip straight to the second part. Also, with so much code and other space-consuming sections, a single article would have made for a really long HTML file, which is quite unpleasant. Things should be simple and straightforward.

The next part will deal with the actual injection and hooking. The topics covered will be: the ptrace interface with examples, code injection (general view), shared object injection and the limitations of the technique, and PLT hooking (which will use all the concepts learned so far). The next part is expected to come out in the next few weeks (I hope no more than two), depending on my schedule (I'm a university student with a full-time job, so it's quite difficult to find free time :) ). I hope you enjoyed this first part... see you in the next one!

References

Here are the tools and documents used in this article:

The GNU Compiler Collection (GCC)
The GNU Debugger (GDB)
The Netwide Assembler (NASM)
GNU's binutils
The ELF format specifications (for every platform)
The ELF format specifications for ELF64 (for every platform)
The ELF format specifications for the x86 architecture
The ELF format specifications for the x64 architecture
Brief description of the AT&T syntax, with the NASM version of most statements

History

10/11/2008: First version.

License

This article, along with any associated source code and files, is licensed under The BSD License.

