Please indicate the source: http://blog.csdn.net/gaoxiangnumber1
Welcome to my github: https://github.com/gaoxiangnumber1
63.1 Overview
- Traditional blocking I/O model: A process performs I/O on one file descriptor at a time, and each I/O system call blocks until the data is transferred.
- Disk files are a special case. The kernel employs the buffer cache to speed disk I/O requests.
- A write() to a disk returns as soon as the requested data has been transferred to the kernel buffer cache, rather than waiting until the data is written to disk(unless O_SYNC flag was specified when opening the file).
- A read() transfers data from the buffer cache to a user buffer, and if the required data is not in the buffer cache, then the kernel puts the process to sleep while a disk read is performed.
- Some applications need to be able to do one or both of the following:
- Check whether I/O is possible on a file descriptor without blocking if it is not possible.
- Monitor multiple file descriptors to see if I/O is possible on any of them.
- Three techniques partially address these needs: nonblocking I/O, and the use of multiple processes or threads.
- If we place a file descriptor in nonblocking mode by enabling the O_NONBLOCK open file status flag, then an I/O system call that can’t be immediately completed returns an error instead of blocking. Nonblocking I/O can be employed with pipes, FIFOs, sockets, terminals, pseudo-terminals, and some other types of devices. Nonblocking I/O allows us to periodically check(“poll”) whether I/O is possible on a file descriptor.
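- For example, a minimal sketch of enabling O_NONBLOCK on an already-open descriptor and then polling it with read(). Here fd is assumed to be already open and buf an existing char array; error checking is abbreviated:
#include <fcntl.h>
#include <errno.h>
#include <unistd.h>
/* Inside some function, with fd already open and buf an existing char array: */
int flags = fcntl(fd, F_GETFL);             /* Fetch current open file status flags */
fcntl(fd, F_SETFL, flags | O_NONBLOCK);     /* Enable nonblocking mode */
ssize_t numRead = read(fd, buf, sizeof(buf));
if(numRead == -1 && (errno == EAGAIN || errno == EWOULDBLOCK))
{
    /* No input available yet; do other work and poll again later */
}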
- If we don’t want a process to block when performing I/O on a file descriptor, we can create a new process to perform the I/O. The parent process can then carry on to perform other tasks, while the child process blocks until the I/O is complete. If we need to handle I/O on multiple file descriptors, we can create one child for each descriptor. The problems are expense and complexity. Creating and maintaining processes places a load on the system, and the child processes will need to use some form of IPC to inform the parent about the status of I/O operations.
- Using multiple threads instead of processes is less demanding of resources, but the threads will probably still need to communicate information to one another about the status of I/O operations, and the programming can be complex, especially if we are using thread pools to minimize the number of threads used to handle large numbers of simultaneous clients.(One place where threads can be useful is if the application needs to call a third-party library that performs blocking I/O. An application can avoid blocking in this case by making the library call in a separate thread.)
- Because of the limitations of both nonblocking I/O and the use of multiple threads or processes, one of the following alternatives is preferable:
- I/O multiplexing allows a process to simultaneously monitor multiple file descriptors to find out whether I/O is possible on any of them. select() and poll() perform I/O multiplexing.
- Signal-driven I/O is a technique whereby a process requests that the kernel send it a signal when input is available or data can be written on a specified file descriptor. The process can then carry on performing other activities, and is notified when I/O becomes possible via receipt of the signal. When monitoring large numbers of file descriptors, signal-driven I/O provides better performance than select() and poll().
- epoll is a Linux-specific feature.
Like the I/O multiplexing APIs, epoll allows a process to monitor multiple file descriptors to see if I/O is possible on any of them.
Like signal-driven I/O, epoll provides better performance when monitoring large numbers of file descriptors.
- I/O multiplexing, signal-driven I/O, and epoll are all methods of achieving the same result: monitoring one or several file descriptors simultaneously to see if they are ready to perform I/O(to be precise, to see whether an I/O system call could be performed without blocking). The transition of a file descriptor into a ready state is triggered by some type of I/O event(the arrival of input, the completion of a socket connection and so on). None of these techniques performs I/O. They merely tell us that a file descriptor is ready.
- One I/O model that we don’t describe in this chapter is POSIX asynchronous I/O(AIO). POSIX AIO allows a process to queue an I/O operation to a file and then later be notified when the operation is complete.
Advantage: The initial I/O call returns immediately, so that the process is not tied up waiting for data to be transferred to the kernel or for the operation to complete. This allows the process to perform other tasks in parallel with the I/O(which may include queuing further I/O requests).
Which technique?
- select() and poll() are standard interfaces that have been present on UNIX for many years.
- Advantage: portability.
- Disadvantage: they don’t scale well when monitoring large numbers(hundreds or thousands) of file descriptors.
- Advantage of epoll: it allows an application to efficiently monitor large numbers of file descriptors.
Disadvantage: it is Linux-specific.
- Signal-driven I/O allows an application to efficiently monitor large numbers of file descriptors. But epoll provides advantages over signal-driven I/O:
- Avoid the complexities of dealing with signals.
- Ability to specify the kind of monitoring that we want to perform(e.g., ready for reading/writing).
- Ability to select either level-triggered or edge-triggered notification(Section 63.1.1).
- select() and poll() are more portable, while signal-driven I/O and epoll deliver better performance. For some applications, it is worthwhile writing an abstract software layer for monitoring file descriptor events. With such a layer, portable programs can employ epoll on Linux, and fall back to the use of select() or poll() on other systems.
- libevent is a software layer that provides an abstraction for monitoring file descriptor events. It can employ any of the techniques: select(), poll(), signal-driven I/O, or epoll, as well as the Solaris specific /dev/poll interface or the BSD kqueue interface.
63.1.1 Level-Triggered and Edge-Triggered Notification
- Level-triggered notification: A file descriptor is considered to be ready if it is possible to perform an I/O system call without blocking.
- Edge-triggered notification: Notification is provided if there is I/O activity(e.g., new input) on a file descriptor since it was last monitored.
- epoll can employ both level-triggered notification(the default) and edge-triggered notification.
How does the notification model affect the way we design a program?
- When we employ level-triggered notification, we can check the readiness of a file descriptor at any time. This means that when we determine that a file descriptor is ready(e.g., it has input available), we can perform I/O on the descriptor, and then repeat the monitoring operation to check if the descriptor is still ready(e.g., it still has more input available), in which case we can perform more I/O, and so on.
Because the level-triggered model allows us to repeat the I/O monitoring operation at any time, it is not necessary to perform as much I/O as possible(e.g., read as many bytes as possible) on the file descriptor(or even perform any I/O at all) each time we are notified that a file descriptor is ready.
- When we employ edge-triggered notification, we receive notification only when an I/O event occurs. We don’t receive any further notification until another I/O event occurs. Furthermore, when an I/O event is notified for a file descriptor, we usually don’t know how much I/O is possible(e.g., how many bytes are available for reading). Therefore, programs that employ edge-triggered notification are usually designed according to the following rules:
- After notification of an I/O event, the program should(at some point) perform as much I/O as possible(e.g., read as many bytes as possible) on the corresponding file descriptor. If the program fails to do this, then it might miss the opportunity to perform some I/O, because it would not be aware of the need to operate on the file descriptor until another I/O event occurred. This could lead to spurious data loss or blockages in a program.
We said “at some point” because sometimes it may not be desirable to perform all of the I/O immediately after we determine that the file descriptor is ready. The problem is that we may starve other file descriptors of attention if we perform a large amount of I/O on one file descriptor(Section 63.4.6).
- If the program employs a loop to perform as much I/O as possible on the file descriptor, and the descriptor is marked as blocking, then eventually an I/O system call will block when no more I/O is possible. For this reason, each monitored file descriptor is normally placed in nonblocking mode, and after notification of an I/O event, I/O operations are performed repeatedly until the relevant system call(e.g., read() or write()) fails with the error EAGAIN or EWOULDBLOCK.
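- A sketch of the resulting canonical edge-triggered read loop (fd is assumed to be an open descriptor already placed in nonblocking mode):
#include <errno.h>
#include <unistd.h>
/* Inside some function, after a notification that fd is ready for reading: */
char buf[4096];
for(;;)
{
    ssize_t numRead = read(fd, buf, sizeof(buf));
    if(numRead > 0)
    {
        /* Process the numRead bytes just read, then loop to read more */
        continue;
    }
    if(numRead == 0)
    {
        break;      /* End-of-file */
    }
    if(errno == EAGAIN || errno == EWOULDBLOCK)
    {
        break;      /* Input drained; wait for the next notification */
    }
    break;          /* Some other error; handle it appropriately */
}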
63.1.2 Employing Nonblocking I/O with Alternative I/O Models
- Nonblocking I/O(the O_NONBLOCK flag) is often used in conjunction with the I/O models described in this chapter. Examples of why this can be useful are:
- As explained in the previous section, nonblocking I/O is usually employed in conjunction with I/O models that provide edge-triggered notification of I/O events.
- If multiple processes (or threads) are performing I/O on the same open file description, then, from a particular process’s point of view, a descriptor’s readiness may change between the time the descriptor was notified as being ready and the time of the subsequent I/O call. Consequently, a blocking I/O call could block, thus preventing the process from monitoring other file descriptors. (This can occur for all of the I/O models that we describe in this chapter, regardless of whether they employ level-triggered or edge-triggered notification.)
- Even after a level-triggered API such as select() or poll() informs us that a file descriptor for a stream socket is ready for writing, if we write a large enough block of data in a single write() or send(), then the call will nevertheless block.
- In rare cases, level-triggered APIs such as select() and poll() can return spurious readiness notifications—they can falsely inform us that a file descriptor is ready. This could be caused by a kernel bug or be expected behavior in an uncommon scenario.
- Section 16.6 of UNP describes one example of spurious readiness notifications on BSD systems for a listening socket. If a client connects to a server’s listening socket and then resets the connection, a select() performed by the server between these two events will indicate the listening socket as being readable, but a subsequent accept() that is performed after the client’s reset will block.
63.2 I/O Multiplexing
- I/O multiplexing allows us to simultaneously monitor multiple file descriptors to see if I/O is possible on any of them. We can perform I/O multiplexing using select() or poll() to monitor file descriptors for regular files, terminals, pseudo-terminals, pipes, FIFOs, sockets, and some types of character devices.
63.2.1 The select() System Call
#include <sys/time.h>
#include <sys/select.h>
#include <sys/types.h>
#include <unistd.h>
int select(int nfds, fd_set *readfds, fd_set *writefds, fd_set *exceptfds, struct timeval *timeout);
Return: number of ready file descriptors, 0 on timeout, -1 on error
- nfds, readfds, writefds, and exceptfds arguments specify the file descriptors that select() is to monitor.
- timeout can be used to set an upper limit on the time for which select() will block.
File descriptor sets
- readfds, writefds, and exceptfds are pointers to file descriptor sets that use the data type fd_set. These arguments are used as follows:
- readfds is the set of file descriptors to be tested to see if input is possible;
- writefds is the set of file descriptors to be tested to see if output is possible;
- exceptfds is the set of file descriptors to be tested to see if an exceptional condition has occurred. An exceptional condition occurs in just two circumstances on Linux:
-1- A state change occurs on a pseudo-terminal slave connected to a master that is in packet mode(Section 64.5).
-2- Out-of-band data is received on a stream socket(Section 61.13.1).
- fd_set data type is implemented as a bit mask. All manipulation of file descriptor sets is done via four macros: FD_ZERO(), FD_SET(), FD_CLR(), and FD_ISSET().
#include <sys/time.h>
#include <sys/select.h>
#include <sys/types.h>
#include <unistd.h>
void FD_ZERO(fd_set *fdset);
void FD_SET(int fd, fd_set *fdset);
void FD_CLR(int fd, fd_set *fdset);
int FD_ISSET(int fd, fd_set *fdset);
Return: true(1) if fd is in fdset, or false(0) otherwise
- FD_ZERO() initializes the set pointed to by fdset to be empty.
FD_SET() adds the file descriptor fd to the set pointed to by fdset.
FD_CLR() removes the file descriptor fd from the set pointed to by fdset.
FD_ISSET() returns true if the file descriptor fd is a member of the set pointed to by fdset.
- A file descriptor set has a maximum size FD_SETSIZE, which is 1024 on Linux. If we want to change this limit, we must modify the definition in the glibc header files. If we need to monitor large numbers of descriptors, then using epoll is preferable to the use of select().
- readfds, writefds, and exceptfds are all value-result. Before the call to select(), the fd_set structures pointed to by these arguments must be initialized(using FD_ZERO() and FD_SET()) to contain the set of file descriptors of interest. select() modifies each of these structures and on return, they contain the set of file descriptors that are ready. The structures can then be examined using FD_ISSET().
- If we are not interested in a particular class of events, then the corresponding fd_set argument can be specified as NULL.
- nfds is set one greater than the highest file descriptor number included in any of the three file descriptor sets. This argument allows select() to be efficient since the kernel knows not to check whether file descriptor numbers higher than this value are part of each file descriptor set.
The timeout argument
- timeout can be specified as
- NULL: select() blocks indefinitely;
- A pointer to a timeval structure.
struct timeval
{
    time_t tv_sec;          /* Seconds */
    suseconds_t tv_usec;    /* Microseconds */
};
- If both fields of timeout are 0, then select() doesn’t block; it polls the specified file descriptors to see which ones are ready and returns immediately. Otherwise, timeout specifies an upper limit on the time for which select() is to wait.
- Although the timeval structure affords microsecond precision, the accuracy of the call is limited by the granularity of the software clock(Section 10.6).
- When timeout is NULL, or points to a structure containing nonzero fields, select() blocks until one of the following occurs:
- at least one of the file descriptors specified in readfds, writefds, or exceptfds becomes ready;
- the call is interrupted by a signal handler;
- the amount of time specified by timeout has passed.
- On Linux, if select() returns because one or more file descriptors became ready, and if timeout was non-NULL, then select() updates the structure to which timeout points to indicate how much time remained until the call would have timed out. Most other UNIX systems don’t modify this structure. Portable applications that employ select() within a loop should always ensure that the structure pointed to by timeout is initialized before each select(), and should ignore the information returned in the structure after the call.
- On Linux, if select() is interrupted by a signal handler(so that it fails with the error EINTR), then the structure is modified to indicate the time remaining until a timeout would have occurred.
- If we use the Linux-specific personality() system call to set a personality that includes the STICKY_TIMEOUTS personality bit, then select() doesn’t modify the structure pointed to by timeout.
Return value from select()
- -1 indicates that an error occurred. Possible errors include EBADF and EINTR. EBADF indicates that one of the file descriptors in readfds, writefds, or exceptfds is invalid(e.g., not currently open).
EINTR indicates that the call was interrupted by a signal handler.(select() is never automatically restarted if interrupted by a signal handler.)
- 0 means that the call timed out before any file descriptor became ready. In this case, each of the returned file descriptor sets will be empty.
- A positive return value indicates that one or more file descriptors is ready. The return value is the number of ready descriptors. In this case, each of the returned file descriptor sets must be examined(using FD_ISSET()) in order to find out which I/O events occurred. If the same file descriptor is specified in more than one of readfds, writefds, and exceptfds, it is counted multiple times if it is ready for more than one event.
Example program
- The first command-line argument specifies the timeout for select(), in seconds. If a hyphen(-) is specified here, then select() is called with a timeout of NULL, meaning block indefinitely. Each of the remaining command-line arguments specifies the number of a file descriptor to be monitored, followed by letters indicating the operations for which the descriptor is to be checked. The letters we can specify here are r(ready for read) and w(ready for write).
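- The t_select listing itself is not reproduced in these notes; its core is a select() call along the following lines (a sketch, monitoring a single already-open descriptor fd for reading with a 10-second timeout):
/* Inside some function, with fd already open: */
fd_set readfds;
FD_ZERO(&readfds);
FD_SET(fd, &readfds);
struct timeval timeout;
timeout.tv_sec = 10;        /* Seconds */
timeout.tv_usec = 0;        /* Microseconds */
int ready = select(fd + 1, &readfds, NULL, NULL, &timeout);
if(ready == -1)
{
    /* Handle error */
}
else if(ready == 0)
{
    /* Timed out; no descriptor became ready */
}
else if(FD_ISSET(fd, &readfds))
{
    /* fd is ready for reading */
}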
- First example: make a request to monitor file descriptor 0 for input with a 10-second timeout:
$ ./t_select 10 0r
#Press Enter, so that a line of input is available on file descriptor 0
ready = 1
0: r
timeout after select(): 8.003
$ #Next shell prompt is displayed
- The output shows us that select() determined that file descriptor 0 was ready for reading and the timeout was modified. The final line of output, consisting of just the shell $ prompt, appeared because the t_select program didn’t read the newline character that made file descriptor 0 ready, and so that character was read by the shell, which responded by printing another prompt.
- Next example: monitor file descriptor 0 for input with a timeout of 0 seconds:
$ ./t_select 0 0r
ready = 0
timeout after select(): 0.000
- The select() call returned immediately, and found no file descriptor was ready.
- Next example: monitor two file descriptors: descriptor 0, to see if input is available, and descriptor 1, to see if output is possible. In this case, we specify the timeout as NULL(the first command-line argument is a hyphen):
$ ./t_select - 0r 1w
ready = 1
0:
1: w
- The select() call returned immediately, informing us that output was possible on file descriptor 1.
63.2.2 The poll() System Call
- The difference between select() and poll() lies in how we specify the file descriptors to be monitored.
- select(): provide three sets, each marked to indicate the file descriptors of interest.
- poll(): provide a list of file descriptors, each marked with the set of events of interest.
#include <poll.h>
int poll(struct pollfd fds[], nfds_t nfds, int timeout);
Returns number of ready file descriptors, 0 on timeout, or -1 on error
The pollfd array
- fds lists the file descriptors to be monitored by poll(). This argument is an array of pollfd structures, defined as follows:
struct pollfd
{
    int fd;                 /* File descriptor */
    short events;           /* Requested events bit mask */
    short revents;          /* Returned events bit mask */
};
- nfds specifies the number of items in the fds array. The nfds_t data type is an unsigned integer type.
- The events and revents fields of the pollfd structure are bit masks. The caller initializes events to specify the events to be monitored for the file descriptor fd. Upon return from poll(), revents is set to indicate which of those events actually occurred for this file descriptor.
- Table 63-2 lists the bits that may appear in the events and revents fields.
- The first five bits (POLLIN, POLLRDNORM, POLLRDBAND, POLLPRI, and POLLRDHUP) are concerned with input events.
- The next three bits (POLLOUT, POLLWRNORM, and POLLWRBAND) are concerned with output events.
- The next three bits (POLLERR, POLLHUP, and POLLNVAL) are set in the revents field to return additional information about the file descriptor. If specified in the events field, these three bits are ignored.
- The final bit is unused by poll() on Linux. On systems providing STREAMS devices, POLLMSG indicates that a message containing a SIGPOLL signal has reached the head of the stream.
- It is permissible to specify events as 0 if we are not interested in events on a particular file descriptor. Furthermore, specifying a negative value for the fd field (e.g., negating its value, if it is nonzero) causes the corresponding events field to be ignored and the revents field always to be returned as 0. Either of these techniques can be used to disable monitoring of a single file descriptor, without needing to rebuild the entire fds list.
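- For example, a sketch of disabling and later reenabling entry i by negating its fd field (this works only for nonzero descriptor numbers, so descriptor 0 needs different handling, e.g., setting events to 0):
fds[i].fd = -fds[i].fd;     /* Disable: poll() now ignores this entry */
/* ... intervening calls to poll() skip entry i ... */
fds[i].fd = -fds[i].fd;     /* Reenable the entry */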
Points to note about the Linux implementation of poll()
- Synonymous: POLLIN = POLLRDNORM; POLLOUT = POLLWRNORM
- POLLRDBAND is generally unused: it is ignored in the events field and not set in revents. The only place where it is set is in code implementing the obsolete DECnet networking protocol. There are no circumstances in which POLLWRBAND is set when POLLOUT and POLLWRNORM are not also set.
- POLLRDBAND and POLLWRBAND are meaningful on implementations that provide System V STREAMS(which Linux does not). Under STREAMS, a message can be assigned a nonzero priority, and such messages are queued to the receiver in decreasing order of priority, in a band ahead of normal(priority 0) messages.
- The _XOPEN_SOURCE feature test macro must be defined in order to obtain the definitions of the constants POLLRDNORM, POLLRDBAND, POLLWRNORM, and POLLWRBAND from <poll.h>.
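- The transcript below is from the book’s poll_pipes program (Listing 63-2, not reproduced here), which creates the number of pipes given by its first command-line argument, writes a byte to a randomly chosen pipe the number of times given by its second argument, and then uses poll() to discover which read ends are readable. Its core poll() usage looks roughly like this (a sketch; pfds, pipeFds, and numPipes are assumed to be set up by the caller):
#include <stdio.h>
#include <poll.h>
/* Sketch: monitor the read ends of numPipes pipes with poll().
   pipeFds[j][0] is the read end of pipe j. */
for(int j = 0; j < numPipes; j++)
{
    pfds[j].fd = pipeFds[j][0];
    pfds[j].events = POLLIN;
}
int ready = poll(pfds, numPipes, 0);        /* Timeout of 0: nonblocking check */
printf("poll() returned: %d\n", ready);
for(int j = 0; j < numPipes; j++)
{
    if(pfds[j].revents & POLLIN)
    {
        printf("Readable: %d\n", pfds[j].fd);
    }
}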
$ ./poll_pipes 10 3
Writing to fd: 4 (read fd: 3)
Writing to fd: 14 (read fd: 13)
Writing to fd: 14 (read fd: 13)
poll() returned: 2
Readable: 3
Readable: 13
63.2.3 When Is a File Descriptor Ready?
- SUSv3 says that a file descriptor(with O_NONBLOCK clear) is considered to be ready if a call to an I/O function would not block, regardless of whether the function would actually transfer data. select() and poll() tell us whether an I/O operation would not block, rather than whether it would successfully transfer data.
- We show this information in tables containing two columns:
- select() column indicates whether a file descriptor is marked as readable(r), writable(w), or having an exceptional condition(x).
- poll() column indicates the bit(s) returned in the revents field. In these tables, we omit mention of POLLRDNORM, POLLWRNORM, POLLRDBAND, and POLLWRBAND because they convey no useful information beyond that provided by POLLIN, POLLOUT, POLLHUP, and POLLERR.
Regular files
- File descriptors that refer to regular files are always marked as readable and writable by select(), and returned with POLLIN and POLLOUT set in revents for poll(), for the following reasons:
- A read() will always immediately return data, end-of-file, or an error(e.g., the file was not opened for reading).
- A write() will always immediately transfer data or fail with some error.
Terminals and pseudo-terminals
- When one half of a pseudo-terminal pair is closed, the revents setting returned by poll() for the other half of the pair depends on the implementation. On Linux, at least the POLLHUP flag is set.
Pipes and FIFOs
- Table 63-4 summarizes the details for the read end of a pipe or FIFO. The “Data in pipe?” column indicates whether the pipe has at least 1 byte of data available for reading. In this table, we assume that POLLIN was specified in the events field for poll().
- On some other UNIX implementations, if the write end of a pipe is closed, instead of returning with POLLHUP set, poll() returns with the POLLIN bit set(since a read() will return immediately with end-of-file). Portable applications should check to see if either bit is set in order to know if a read() will block.
- Table 63-5 summarizes the details for the write end of a pipe. In this table, we assume that POLLOUT was specified in the events field for poll().
- The “Space for PIPE_BUF bytes?” column indicates whether the pipe has room to atomically write PIPE_BUF bytes without blocking. This is the criterion on which Linux considers a pipe ready for writing. Some other UNIX implementations use the same criterion; others consider a pipe writable if even a single byte can be written. (In Linux 2.6.10 and earlier, the capacity of a pipe is the same as PIPE_BUF, which means that a pipe is considered unwritable if it contains even a single byte of data.)
- On some other UNIX implementations, if the read end of a pipe is closed, instead of returning with POLLERR set, poll() returns with either the POLLOUT bit or the POLLHUP bit set. Portable applications need to check to see if any of these bits is set to determine if a write() will block.
Sockets
- Table 63-6 summarizes the behavior of select() and poll() for sockets. This table covers just the common cases, not all possible scenarios.
For the poll() column, we assume that events was specified as (POLLIN | POLLOUT | POLLPRI).
For the select() column, we assume that the file descriptor is being tested to see if input is possible, output is possible, or an exceptional condition occurred(i.e., the file descriptor is specified in all three sets passed to select()).
- The Linux poll() behavior for UNIX domain sockets after a peer close() differs from that shown in Table 63-6. poll() additionally returns POLLHUP in revents.
- The Linux-specific POLLRDHUP flag: This flag(actually in the form of EPOLLRDHUP) is designed primarily for use with the edge-triggered mode of epoll(Section 63.4). It is returned when the remote end of a stream socket connection has shut down the writing half of the connection. The use of this flag allows an application that uses the epoll edge-triggered interface to employ simpler code to recognize a remote shutdown.(The alternative is for the application to note that the POLLIN flag is set and then perform a read(), which indicates the remote shutdown with a return of 0.)
63.2.4 Comparison of select() and poll()
Implementation details
- Within the Linux kernel, select() and poll() both employ the same set of kernel-internal poll routines. These poll routines are distinct from the poll() system call itself. Each routine returns information about the readiness of a single file descriptor. This readiness information takes the form of a bit mask whose values correspond to the bits returned in the revents field by the poll() system call(Table 63-2).
- The implementation of the poll() system call involves calling the kernel poll routine for each file descriptor and placing the resulting information in the corresponding revents field.
- To implement select(), a set of macros is used to convert the information returned by the kernel poll routines into the corresponding event types returned by select():
#define POLLIN_SET (POLLRDNORM | POLLRDBAND | POLLIN | POLLHUP | POLLERR)
#define POLLOUT_SET (POLLWRBAND | POLLWRNORM | POLLOUT | POLLERR)
#define POLLEX_SET (POLLPRI)
- These macro definitions reveal the semantic correspondence between the information returned by select() and poll(). (If we look at the select() and poll() columns in the tables in Section 63.2.3, we see that the indications provided by each system call are consistent with the above macros.) The only additional information we need to complete the picture is that poll() returns POLLNVAL in the revents field if one of the monitored file descriptors was closed at the time of the call, while select() returns -1 with errno set to EBADF.
API differences
- The use of the fd_set data type places an upper limit(FD_SETSIZE) on the range of file descriptors that can be monitored by select(). By default, this limit is 1024 on Linux, and changing it requires recompiling the application. By contrast, poll() places no intrinsic limit on the range of file descriptors that can be monitored.
- Because the fd_set arguments of select() are value-result, we must reinitialize them if making repeated select() calls from within a loop. By using separate events(input) and revents(output) fields, poll() avoids this requirement.
- The timeout precision afforded by select()(microseconds) is greater than that afforded by poll()(milliseconds).(The accuracy of the timeouts of both of these system calls is nevertheless limited by the software clock granularity.)
- If one of the file descriptors being monitored was closed, then poll() informs us exactly which one, via the POLLNVAL bit in the corresponding revents field. By contrast, select() merely returns -1 with errno set to EBADF, leaving us to determine which file descriptor is closed by checking for an error when performing an I/O system call on the descriptor. However, this is typically not an important difference, since an application can usually keep track of which file descriptors it has closed.
Portability
- Both interfaces are standardized by SUSv3 and available on contemporary implementations.
- However, there is some variation in the behavior of poll() across implementations, as noted in Section 63.2.3.
Performance
- The performance of poll() and select() is similar if either of the following is true:
- The range of file descriptors to be monitored is small(i.e., the maximum file descriptor number is low).
- A large number of file descriptors are being monitored, but they are densely packed(i.e., most or all of the file descriptors from 0 up to some limit are being monitored).
- The performance of select() and poll() can differ noticeably if the set of file descriptors to be monitored is sparse; that is, the maximum file descriptor number, N, is large, but only one or a few descriptors in the range 0 to N are being monitored. In this case, poll() can perform better than select().
- We can understand the reasons for this by considering the arguments passed to the two system calls.
- With select(), we pass one or more file descriptor sets and an integer, nfds, which is one greater than the maximum file descriptor to be examined in each set. The nfds argument has the same value, regardless of whether we are monitoring all file descriptors in the range 0 to(nfds - 1) or only the descriptor(nfds - 1). In both cases, the kernel must examine nfds elements in each set in order to check exactly which file descriptors are to be monitored.
- When using poll(), we specify only the file descriptors of interest to us, and the kernel checks only those descriptors.
63.2.5 Problems with select() and poll()
- select() and poll() suffer problems when monitoring a large number of file descriptors:
- On each call to select() or poll(), the kernel must check all of the specified file descriptors to see if they are ready. When monitoring a large number of file descriptors that are in a densely packed range, the time required for this operation greatly outweighs the time required for the next two operations.
- In each call to select() or poll(), the program must pass a data structure to the kernel describing all of the file descriptors to be monitored, and, after checking the descriptors, the kernel returns a modified version of this data structure to the program.(Furthermore, for select(), we must initialize the data structure before each call.)
For poll(), the size of the data structure increases with the number of file descriptors being monitored, and the task of copying it from user to kernel space and back again consumes a noticeable amount of CPU time when monitoring many file descriptors.
For select(), the size of the data structure is fixed by FD_SETSIZE, regardless of the number of file descriptors being monitored.
- After the call to select() or poll(), the program must inspect every element of the returned data structure to see which file descriptors are ready.
- The consequence of the above points is that the CPU time required by select() and poll() increases with the number of file descriptors being monitored(Section 63.4.5). This creates problems for programs that monitor large numbers of file descriptors.
- The poor scaling performance of select() and poll() stems from a simple limitation of these APIs: typically, a program makes repeated calls to monitor the same set of file descriptors; however, the kernel doesn’t remember the list of file descriptors to be monitored between successive calls.
- Signal-driven I/O and epoll are both mechanisms that allow the kernel to record a persistent list of file descriptors in which a process is interested. Doing this eliminates the performance scaling problems of select() and poll(), yielding solutions that scale according to the number of I/O events that occur, rather than according to the number of file descriptors being monitored. So, signal-driven I/O and epoll provide superior performance when monitoring large numbers of file descriptors.
63.3 Signal-Driven I/O
- With I/O multiplexing, a process makes a system call(select() or poll()) in order to check whether I/O is possible on a file descriptor.
With signal-driven I/O, a process requests that the kernel send it a signal when I/O is possible on a file descriptor. The process can then perform any other activity until I/O is possible, at which time the signal is delivered to the process.
- To use signal-driven I/O, a program performs the following steps:
- Establish a handler for the signal delivered by the signal-driven I/O mechanism. By default, this notification signal is SIGIO.
- Set the owner of the file descriptor—that is, the process or process group that is to receive signals when I/O is possible on the file descriptor. Typically, we make the calling process the owner. The owner is set using an fcntl() F_SETOWN operation of the following form:
fcntl(fd, F_SETOWN, pid);
- Enable nonblocking I/O by setting the O_NONBLOCK open file status flag.
- Enable signal-driven I/O by turning on the O_ASYNC open file status flag. This can be combined with the previous step, since they both require the use of the fcntl() F_SETFL operation(Section 5.3), as in the following example:
flags = fcntl(fd, F_GETFL);
fcntl(fd, F_SETFL, flags | O_ASYNC | O_NONBLOCK);
- The calling process can now perform other tasks. When I/O becomes possible, the kernel generates a signal for the process and invokes the signal handler established in step 1.
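- Putting the steps together, a minimal sketch of enabling signal-driven I/O on standard input (the handler only sets a flag, which the main program then polls, as in the demo program described below):
#include <signal.h>
#include <fcntl.h>
#include <unistd.h>

static volatile sig_atomic_t gotSigio = 0;

static void sigioHandler(int sig)
{
    gotSigio = 1;   /* Async-signal-safe: just record that I/O is possible */
}

static void enableSigio(void)
{
    struct sigaction sa;
    sigemptyset(&sa.sa_mask);
    sa.sa_flags = SA_RESTART;
    sa.sa_handler = sigioHandler;
    sigaction(SIGIO, &sa, NULL);                /* Step 1: establish handler first */
    fcntl(STDIN_FILENO, F_SETOWN, getpid());    /* Step 2: set the owner */
    int flags = fcntl(STDIN_FILENO, F_GETFL);
    fcntl(STDIN_FILENO, F_SETFL, flags | O_ASYNC | O_NONBLOCK);     /* Steps 3 and 4 */
}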
- Signal-driven I/O provides edge-triggered notification(Section 63.1.1). This means that once the process has been notified that I/O is possible, it should perform as much I/O(e.g., read as many bytes) as possible. Assuming a nonblocking file descriptor, this means executing a loop that performs I/O system calls until a call fails with the error EAGAIN or EWOULDBLOCK.
- Signal-driven I/O can be employed with file descriptors for sockets, terminals, pseudo-terminals, pipes, FIFOs, inotify file descriptors and certain other types of devices.
- The program in Listing 63-3 performs the steps described above for enabling signal-driven I/O on standard input, and then places the terminal in cbreak mode (Section 62.6.3), so that input is available a character at a time. The program then enters an infinite loop, performing the “work” of incrementing a variable, cnt, while waiting for input to become available. Whenever input becomes available, the SIGIO handler sets a flag, gotSigio, that is monitored by the main program. When the main program sees that this flag is set, it reads all available input characters and prints them along with the current value of cnt.
- If a hash character (#) is read in the input, the program terminates. Example output when we type the x character a number of times, followed by a hash (#) character:
$ ./demo_sigio
cnt=37; read x
cnt=100; read x
cnt=159; read x
cnt=223; read x
cnt=288; read x
cnt=333; read #
Establish the signal handler before enabling signal-driven I/O
- Because the default action of SIGIO is to terminate the process, we should enable the handler for SIGIO before enabling signal-driven I/O on a file descriptor. If we enable signal-driven I/O before establishing the SIGIO handler, then there is a time window during which, if I/O becomes possible, delivery of SIGIO will terminate the process.
Setting the file descriptor owner
- We set the file descriptor owner using an fcntl() operation of the following form:
fcntl(fd, F_SETOWN, pid);
- We may specify that either a single process or all of the processes in a process group are to be signaled when I/O is possible on the file descriptor. If pid is positive, it is interpreted as a process ID. If pid is negative, its absolute value specifies a process group ID.
- Typically, pid is specified as the process ID of the calling process(so that the signal is sent to the process that has the file descriptor open). It is possible to specify another process or a process group(e.g., the caller’s process group), and signals will be sent to that target, subject to the permission checks described in Section 20.5, where the sending process is considered to be the process that does the F_SETOWN.
- The fcntl() F_GETOWN operation returns the ID of the process or process group that is to receive signals when I/O is possible on a specified file descriptor:
id = fcntl(fd, F_GETOWN);
if(id == -1)
Exit("fcntl");
- A process group ID is returned as a negative number by this call.
- A limitation in the system call convention employed on some Linux architectures (e.g., x86) means that if a file descriptor is owned by a process group ID less than 4096, then, instead of returning that ID as a negative function result from the fcntl() F_GETOWN operation, glibc misinterprets it as a system call error. Consequently, the fcntl() wrapper function returns -1, and errno contains the (positive) process group ID. This is a consequence of the fact that the kernel system call interface indicates errors by returning a negative errno value as a function result, and there are a few cases where it is necessary to distinguish such results from a successful call that returns a valid negative value.
- To make this distinction, glibc interprets negative system call returns in the range -1 to -4095 as indicating an error, copies this(absolute) value into errno, and returns -1 as the function result for the application program. This technique is generally sufficient for dealing with the few system call service routines that can return a valid negative result; the fcntl() F_GETOWN operation is the only practical case where it fails. This limitation means that an application that uses process groups to receive “I/O possible” signals(which is unusual) can’t reliably use F_GETOWN to discover which process group owns a file descriptor.
- Since glibc version 2.11, the fcntl() wrapper function fixes the problem of F_GETOWN with process group IDs less than 4096. It does this by implementing F_GETOWN in user space using the F_GETOWN_EX operation(Section 63.3.2), which is provided by Linux 2.6.32 and later.
63.3.1 When Is “I/O Possible” Signaled?
Terminals and pseudo-terminals
- For terminals and pseudo-terminals, a signal is generated whenever new input becomes available, even if previous input has not yet been read.
- “Input possible” is also signaled if an end-of-file condition occurs on a terminal(but not on a pseudo-terminal).
- There is no “output possible” signaling for terminals. A terminal disconnect is also not signaled. Linux provides “output possible” signaling for the slave side of a pseudo-terminal. This signal is generated whenever input is consumed on the master side of the pseudo-terminal.
Pipes and FIFOs
- For the read end of a pipe or FIFO, a signal is generated in these circumstances:
- Data is written to the pipe(even if there was already unread input available).
- The write end of the pipe is closed.
- For the write end of a pipe or FIFO, a signal is generated in these circumstances:
- A read from the pipe increases the amount of free space in the pipe so that it is now possible to write PIPE_BUF bytes without blocking.
- The read end of the pipe is closed.
Sockets
- Signal-driven I/O works for datagram sockets in both the UNIX and the Internet domains. A signal is generated in the following circumstances:
- An input datagram arrives on the socket(even if there were already unread datagrams waiting to be read).
- An asynchronous error occurs on the socket.
- Signal-driven I/O works for stream sockets in both the UNIX and the Internet domains. A signal is generated in the following circumstances:
- A new connection is received on a listening socket.
- A TCP connect() request completes; that is, the active end of a TCP connection entered the ESTABLISHED state. The analogous condition is not signaled for UNIX domain sockets.
- New input is received on the socket(even if there was already unread input available).
- The peer closes its writing half of the connection using shutdown(), or closes its socket altogether using close().
- Output is possible on the socket(e.g., space has become available in the socket send buffer).
- An asynchronous error occurs on the socket.
inotify file descriptors
- A signal is generated when the inotify file descriptor becomes readable, that is, when an event occurs for one of the files monitored by the inotify file descriptor.
63.3.2 Refining the Use of Signal-Driven I/O
- In applications that need to simultaneously monitor large numbers(i.e., thousands) of file descriptors, signal-driven I/O can provide better performance by comparison with select() and poll(). The kernel “remembers” the list of file descriptors to be monitored, and signals the program only when I/O events actually occur on those descriptors. So, the performance of a program employing signal-driven I/O scales according to the number of I/O events that occur, rather than the number of file descriptors being monitored.
- To take advantage of signal-driven I/O, we must perform two steps:
- Employ a Linux-specific fcntl() operation, F_SETSIG, to specify a real-time signal that should be delivered instead of SIGIO when I/O is possible on a file descriptor.
- Specify the SA_SIGINFO flag when using sigaction() to establish the handler for the real-time signal employed in the previous step(Section 21.4).
- The fcntl() F_SETSIG operation specifies an alternative signal that should be delivered instead of SIGIO when I/O is possible on a file descriptor:
if(fcntl(fd, F_SETSIG, sig) == -1)
Exit("fcntl");
- The F_GETSIG operation performs the converse of F_SETSIG, retrieving the signal currently set for a file descriptor:
sig = fcntl(fd, F_GETSIG);
if(sig == -1)
Exit("fcntl");
- In order to obtain the definitions of the F_SETSIG and F_GETSIG constants from <fcntl.h>, we must define the _GNU_SOURCE feature test macro.
- The Linux-specific fcntl() F_SETOWN_EX operation (available since Linux 2.6.32) is like F_SETOWN, but additionally allows the target of “I/O possible” signals to be specified as a thread, rather than just a process or process group. Its third argument is a pointer to a structure of the following form:
struct f_owner_ex
{
int type;
pid_t pid;
};
- The type field defines the meaning of the pid field, and has one of the following values:
- F_OWNER_PGRP
The pid field specifies the ID of a process group that is to be the target of “I/O possible” signals. Unlike with F_SETOWN, a process group ID is specified as a positive value.
- F_OWNER_PID
The pid field specifies the ID of a process that is to be the target of “I/O possible” signals.
- F_OWNER_TID
The pid field specifies the ID of a thread that is to be the target of “I/O possible” signals. The ID specified in pid is a value returned by clone() or gettid().
- The F_GETOWN_EX operation is the converse of the F_SETOWN_EX operation. It uses the f_owner_ex structure pointed to by the third argument of fcntl() to return the settings defined by a previous F_SETOWN_EX operation.
- Because the F_SETOWN_EX and F_GETOWN_EX operations represent process group IDs as positive values, F_GETOWN_EX doesn’t suffer the problem described earlier for F_GETOWN when using process group IDs less than 4096.
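- A sketch of using F_SETOWN_EX to direct “I/O possible” signals to just the calling thread. Here fd is assumed to be already open; _GNU_SOURCE is required for the F_SETOWN_EX and F_OWNER_TID definitions, and the gettid() wrapper is provided by glibc only since version 2.30 (on older glibc, use syscall(SYS_gettid)):
#define _GNU_SOURCE
#include <fcntl.h>
#include <unistd.h>
/* Inside some function, with fd already open: */
struct f_owner_ex owner;
owner.type = F_OWNER_TID;
owner.pid = gettid();       /* Target only the calling thread */
if(fcntl(fd, F_SETOWN_EX, &owner) == -1)
{
    /* Handle error */
}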
63.4 The epoll API
- The primary advantages of epoll:
- The performance of epoll scales better than select() and poll() when monitoring large numbers of file descriptors.
- epoll permits either level-triggered or edge-triggered notification. select() and poll() provide only level-triggered notification, and signal-driven I/O provides only edge-triggered notification.
- The performance of epoll and signal-driven I/O is similar. But epoll has advantages over signal-driven I/O:
- We avoid the complexities of signal handling(e.g., signal-queue overflow).
- We have greater flexibility in specifying what kind of monitoring we want to perform (e.g., checking a descriptor for readiness for reading, writing, or both).
- The central data structure of epoll is an epoll instance, which is referred to via an open file descriptor. This file descriptor is not used for I/O; instead, it is a handle for kernel data structures that serve two purposes:
- recording a list of file descriptors that this process has declared an interest in monitoring—the interest list; and
- maintaining a list of file descriptors that are ready for I/O—the ready list. The ready list is a subset of the interest list.
- For each file descriptor monitored by epoll, we can specify a bit mask indicating events that we are interested in knowing about. These bit masks correspond to the bit masks used with poll().
- epoll consists of three system calls:
- epoll_create(): Create an epoll instance and returns a file descriptor referring to the instance.
- epoll_ctl(): Manipulate the interest list associated with an epoll instance. Using epoll_ctl(), we can add a new file descriptor to the list, remove an existing descriptor from the list, and modify the mask that determines which events are to be monitored for a descriptor.
- epoll_wait(): Return items from the ready list associated with an epoll instance.
63.4.1 Creating an epoll Instance: epoll_create()
#include <sys/epoll.h>
int epoll_create(int size);
int epoll_create1(int flags);
Returns file descriptor on success, or -1 on error
- epoll_create() creates a new epoll instance whose interest list is initially empty.
- size specifies the number of file descriptors that we expect to monitor via the epoll instance. This argument is not an upper limit, but rather a hint to the kernel about how to initially dimension internal data structures.
- Since Linux 2.6.8, size is ignored: the kernel dynamically sizes the required data structures without needing the hint. However, size must still be greater than zero, in order to ensure backward compatibility when new epoll applications are run on older kernels.
- epoll_create() returns a file descriptor referring to the new epoll instance. This file descriptor is used to refer to the epoll instance in other epoll system calls. When the file descriptor is no longer required, it should be closed using close().
- Multiple file descriptors may refer to the same epoll instance as a consequence of calls to fork() or descriptor duplication using dup() or similar. When all file descriptors referring to an epoll instance are closed, the instance is destroyed and its associated resources are released back to the system.
- epoll_create1() performs the same task as epoll_create(), but drops the obsolete size argument and adds a flags argument that can be used to modify the behavior of the system call.
- One flag is supported: EPOLL_CLOEXEC, which causes the kernel to enable the close-on-exec flag(FD_CLOEXEC) for the new file descriptor.
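- For example, creating an epoll instance with the close-on-exec flag set atomically (avoiding the race inherent in a separate fcntl() FD_CLOEXEC call in a multithreaded program):
int epfd = epoll_create1(EPOLL_CLOEXEC);    /* Close-on-exec set at creation */
if(epfd == -1)
{
    /* Handle error */
}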
- Linux 3.7 added a new epoll_ctl() operation, EPOLL_CTL_DISABLE, intended to let multithreaded applications safely disable monitoring of a file descriptor. (This operation was later removed, and is not present in current mainline kernels.)
63.4.2 Modifying the epoll Interest List: epoll_ctl()
#include <sys/epoll.h>
int epoll_ctl(int epfd, int op, int fd, struct epoll_event *ev);
Returns 0 on success, or -1 on error
- epoll_ctl() modifies the interest list of the epoll instance referred to by the file descriptor epfd.
- fd identifies which of the file descriptors in the interest list is to have its settings modified. It can be a file descriptor for a pipe, FIFO, socket, POSIX message queue, inotify instance, terminal, device, or another epoll descriptor (i.e., we can build a hierarchy of monitored descriptors). But fd can’t be a file descriptor for a regular file or a directory (the error EPERM results).
- op specifies the operation to be performed, and has one of the following values:
- EPOLL_CTL_ADD
Add fd to the interest list for epfd. The set of events that we are interested in monitoring for fd is specified in the buffer pointed to by ev. If we attempt to add a file descriptor that is already in the interest list, epoll_ctl() fails with the error EEXIST.
- EPOLL_CTL_MOD
Modify the events setting for the file descriptor fd, using the information specified in the buffer pointed to by ev. If we attempt to modify the settings of a file descriptor that is not in the interest list for epfd, epoll_ctl() fails with the error ENOENT.
- EPOLL_CTL_DEL
Remove fd from the interest list for epfd. ev is ignored for this operation. If we attempt to remove a file descriptor that is not in the interest list for epfd, epoll_ctl() fails with the error ENOENT. Closing a file descriptor automatically removes it from all of the epoll interest lists of which it is a member.
- ev is a pointer to a structure of type epoll_event, defined as follows:
struct epoll_event
{
    uint32_t events;        /* epoll events bit mask */
    epoll_data_t data;      /* User data */
};
- The data field of the epoll_event structure is typed as follows:
typedef union epoll_data
{
    void *ptr;              /* Pointer to user-defined data */
    int fd;                 /* File descriptor number */
    uint32_t u32;
    uint64_t u64;
} epoll_data_t;
- ev specifies settings for the file descriptor fd as follows:
- events is a bit mask specifying the set of events that we are interested in monitoring for fd(next section).
- data is a union, one of whose members can be used to specify information that is passed back to the calling process(via epoll_wait()) if fd later becomes ready.
#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>
#include <sys/epoll.h>
void Exit(char *string)
{
    printf("%s\n", string);
    exit(1);
}
int main()
{
    int epfd = epoll_create(1);
    if(epfd == -1)
    {
        Exit("epoll_create error");
    }
    struct epoll_event ev;
    int fd = STDIN_FILENO;  /* Monitor standard input for readability */
    ev.data.fd = fd;        /* Passed back by epoll_wait() when fd is ready */
    ev.events = EPOLLIN;
    if(epoll_ctl(epfd, EPOLL_CTL_ADD, fd, &ev) == -1)
    {
        Exit("epoll_ctl error");
    }
    exit(0);
}
The max_user_watches limit
- Because each file descriptor registered in an epoll interest list requires a small amount of non-swappable kernel memory, the kernel provides an interface that defines a limit on the total number of file descriptors that each user can register in all epoll interest lists. The value of this limit can be viewed and modified via max_user_watches, a Linux-specific file in the /proc/sys/fs/epoll directory. The default value of this limit is calculated based on available system memory(see the epoll(7) manual page).
63.4.3 Waiting for Events: epoll_wait()
#include <sys/epoll.h>
int epoll_wait(int epfd, struct epoll_event *evlist, int maxevents, int timeout);
Returns number of ready file descriptors, 0 on timeout, or -1 on error
- epoll_wait() returns information about ready file descriptors from the epoll instance referred to by the file descriptor epfd. A single epoll_wait() call can return information about multiple ready file descriptors.
- The information about ready file descriptors is returned in the array of epoll_event structures pointed to by evlist. The evlist array is allocated by the caller, and the number of elements it contains is specified in maxevents.
- Each item in the array evlist returns information about a single ready file descriptor. The events field returns a mask of the events that have occurred on this descriptor. The data field returns whatever value was specified in ev.data when we registered interest in this file descriptor using epoll_ctl().
- The data field provides the only mechanism for finding out the number of the file descriptor associated with this event. When we make the epoll_ctl() call that places a file descriptor in the interest list, we should either set ev.data.fd to the file descriptor number (as in Listing 63-4) or set ev.data.ptr to point to a structure that contains the file descriptor number.
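- A sketch of the ev.data.ptr alternative: register a pointer to a caller-defined per-descriptor structure (connection_t is a hypothetical name here), and recover it from the returned event:
#include <stdlib.h>
#include <sys/epoll.h>

typedef struct
{
    int fd;                 /* The descriptor itself */
    /* ... other per-descriptor state ... */
} connection_t;

/* Inside some function, with epfd and fd already set up: */
connection_t *conn = malloc(sizeof(connection_t));
conn->fd = fd;
struct epoll_event ev;
ev.events = EPOLLIN;
ev.data.ptr = conn;         /* The kernel returns this pointer verbatim */
epoll_ctl(epfd, EPOLL_CTL_ADD, fd, &ev);

/* Later, after epoll_wait() has filled evlist[index]: */
connection_t *readyConn = evlist[index].data.ptr;
/* readyConn->fd is the descriptor associated with this event */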
- If timeout:
- = -1, block until an event occurs for one of the file descriptors in the interest list for epfd or until a signal is caught.
- = 0, perform a nonblocking check to see which events are currently available on the file descriptors in the interest list for epfd.
- > 0, block for up to timeout milliseconds, until an event occurs on one of the file descriptors in the interest list for epfd, or until a signal is caught.
- On success, epoll_wait() returns the number of items that have been placed in the array evlist, or 0 if no file descriptors were ready within the interval specified by timeout.
On error, epoll_wait() returns -1, with errno set to indicate the error.
- In a multithreaded program, it is possible for one thread to use epoll_ctl() to add file descriptors to the interest list of an epoll instance that is already being monitored by epoll_wait() in another thread. These changes to the interest list will be taken into account immediately, and the epoll_wait() call will return readiness information about the newly added file descriptors.
epoll events
- The bit values that can be specified in ev.events when we call epoll_ctl() and that are placed in the evlist[].events fields returned by epoll_wait() are shown in Table 63-8: EPOLLIN, EPOLLPRI, EPOLLRDHUP, and EPOLLOUT parallel the poll() bits of the same names; EPOLLET and EPOLLONESHOT modify how notification is performed; and EPOLLERR and EPOLLHUP are returned only via epoll_wait().
- When specified as input to epoll_ctl() or returned as output via epoll_wait(), these bits convey exactly the same meaning as the corresponding poll() event bits.
The EPOLLONESHOT flag
- By default, once a file descriptor is added to an epoll interest list using the epoll_ctl() EPOLL_CTL_ADD operation, it remains active(i.e., subsequent calls to epoll_wait() will inform us whenever the file descriptor is ready) until we explicitly remove it from the list using the epoll_ctl() EPOLL_CTL_DEL operation.
- If we want to be notified only once about a particular file descriptor, then we can specify the EPOLLONESHOT flag in the ev.events value passed in epoll_ctl(). If this flag is specified, after the next epoll_wait() call that informs us that the corresponding file descriptor is ready, the file descriptor is marked inactive in the interest list, and we won’t be informed about its state by future epoll_wait() calls. If desired, we can subsequently reenable monitoring of this file descriptor using the epoll_ctl() EPOLL_CTL_MOD operation. (We can’t use EPOLL_CTL_ADD operation for this purpose, because the inactive file descriptor is still part of the epoll interest list.)
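- A sketch of the one-shot pattern (epfd and fd are assumed to be set up as before): the descriptor goes quiet after one notification, and EPOLL_CTL_MOD rearms it once we have finished handling that event:
struct epoll_event ev;
ev.events = EPOLLIN | EPOLLONESHOT;
ev.data.fd = fd;
epoll_ctl(epfd, EPOLL_CTL_ADD, fd, &ev);    /* Initial registration */

/* epoll_wait() reports fd ready at most once; handle the I/O, then: */

ev.events = EPOLLIN | EPOLLONESHOT;
epoll_ctl(epfd, EPOLL_CTL_MOD, fd, &ev);    /* Rearm with MOD, not ADD */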
Example program
- As command-line arguments, this program expects the pathnames of one or more terminals or FIFOs. The program performs the following steps:
- Create an epoll instance.
- Open each of the files named on the command line for input and add the resulting file descriptor to the interest list of the epoll instance, specifying the set of events to be monitored as EPOLLIN.
- Execute a loop that calls epoll_wait() to monitor the interest list of the epoll instance and handles the returned events from each call. Note the following points about this loop:
- After the epoll_wait() call, the program checks for an EINTR return, which may occur if the program was stopped by a signal in the middle of the epoll_wait() call and then resumed by SIGCONT. If this occurs, the program restarts the epoll_wait() call.
- If the epoll_wait() call was successful, the program uses a further loop to check each of the ready items in evlist. For each item in evlist, the program checks the events field for the presence of not just EPOLLIN, but also EPOLLHUP and EPOLLERR. These latter events can occur if the other end of a FIFO was closed or a terminal hangup occurred. If EPOLLIN was returned, then the program reads some input from the corresponding file descriptor and displays it on standard output. Otherwise, if either EPOLLHUP or EPOLLERR occurred, the program closes the corresponding file descriptor and decrements the counter of open files (numOpenFds).
- The loop terminates when all open file descriptors have been closed(i.e., when numOpenFds equals 0).
- The following shell session logs demonstrate the use of the program in Listing 63-5. We use two terminal windows. In one window, we use the program in Listing 63-5 to monitor two FIFOs for input.(Each open of a FIFO for reading by this program will complete only after another process has opened the FIFO for writing, as described in Section 44.7.) In the other window, we run instances of cat(1) that write data to these FIFOs.
- (The initial part of the session, in which the monitoring program opens the two FIFOs and is then suspended, is omitted here.) With the monitoring program suspended, we generate input on both FIFOs, and close the write end of one of them:
qqq
Type Control-D to terminate “cat > q”
$ fg %1
cat >p
ppp
- Now we resume our monitoring program by bringing it into the foreground, at which point epoll_wait() returns two events:
$ fg
./epoll_input p q
About to epoll_wait()
Ready: 2
fd=4; events: EPOLLIN
read 4 bytes: ppp

fd=5; events: EPOLLIN EPOLLHUP
read 4 bytes: qqq

closing fd 5
About to epoll_wait()
- The two blank lines in the above output are the newlines that were read by the instances of cat, written to the FIFOs, and then read and echoed by our monitoring program.
- Now we type Control-D in the second terminal window in order to terminate the remaining instance of cat, which causes epoll_wait() to once more return, this time with a single event:
Type Control-D to terminate “cat >p”
Ready: 1
fd=4; events: EPOLLHUP
closing fd 4
All file descriptors closed; bye
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <sys/types.h>
#include <sys/stat.h>
#include <fcntl.h>
#include <unistd.h>
#include <errno.h>
#include <sys/epoll.h>

#define MAX_BUF 1000 /* Maximum bytes fetched by a single read() */
#define MAX_EVENTS 5 /* Maximum events returned by one epoll_wait() call */

void Exit(char *string) /* Print an error message and terminate */
{
    fprintf(stderr, "%s\n", string);
    exit(EXIT_FAILURE);
}

int main(int argc, char *argv[])
{
    int epfd, ready, fd, nread, index, numOpenFds;
    struct epoll_event ev;
    struct epoll_event evlist[MAX_EVENTS];
    char buf[MAX_BUF];

    if(argc < 2 || strcmp(argv[1], "--help") == 0)
    {
        printf("%s file...\n", argv[0]);
        exit(EXIT_FAILURE);
    }

    epfd = epoll_create(1); /* The size argument is only a hint; it must be greater than 0 */
    if(epfd == -1)
    {
        Exit("epoll_create error");
    }

    /* Open each file named on the command line and add the resulting
       descriptor to the interest list of the epoll instance */
    for(index = 1; index < argc; ++index)
    {
        fd = open(argv[index], O_RDONLY);
        if(fd == -1)
        {
            Exit("open error");
        }
        printf("Opened \"%s\" on fd %d\n", argv[index], fd);

        ev.events = EPOLLIN; /* Only interested in input events */
        ev.data.fd = fd; /* Recorded so we can identify the fd when epoll_wait() returns */
        if(epoll_ctl(epfd, EPOLL_CTL_ADD, fd, &ev) == -1)
        {
            Exit("epoll_ctl error");
        }
    }

    numOpenFds = argc - 1;
    while(numOpenFds > 0) /* Loop until all file descriptors are closed */
    {
        printf("About to epoll_wait()\n");
        ready = epoll_wait(epfd, evlist, MAX_EVENTS, -1);
        if(ready == -1)
        {
            if(errno == EINTR)
            {
                continue; /* Interrupted by a signal; restart epoll_wait() */
            }
            else
            {
                Exit("epoll_wait error");
            }
        }
        printf("Ready: %d\n", ready);

        /* Deal with the returned list of events */
        for(index = 0; index < ready; ++index)
        {
            printf(" fd=%d; events: %s%s%s\n", evlist[index].data.fd,
                (evlist[index].events & EPOLLIN) ? "EPOLLIN " : "",
                (evlist[index].events & EPOLLHUP) ? "EPOLLHUP " : "",
                (evlist[index].events & EPOLLERR) ? "EPOLLERR " : "");
            if(evlist[index].events & EPOLLIN)
            {
                nread = read(evlist[index].data.fd, buf, MAX_BUF);
                if(nread == -1)
                {
                    Exit("read error");
                }
                printf("read %d bytes: %.*s\n", nread, nread, buf);
            }
            else if(evlist[index].events & (EPOLLHUP | EPOLLERR))
            {
                /* We reach here only if EPOLLIN was not set, so any
                   outstanding input has already been consumed by
                   earlier loop iterations; now close the descriptor */
                printf(" closing fd %d\n", evlist[index].data.fd);
                if(close(evlist[index].data.fd) == -1)
                {
                    Exit("close error");
                }
                numOpenFds--;
            }
        }
    }
    printf("All file descriptors closed; bye\n");
    exit(EXIT_SUCCESS);
}
63.4.4 A Closer Look at epoll Semantics
- Figure 5-2(page 95) shows the relationship between file descriptors, open file descriptions, and the system-wide file i-node table.
- When we create an epoll instance using epoll_create(), the kernel creates a new in-memory i-node and open file description, and allocates a new file descriptor in the calling process that refers to the open file description. The interest list for an epoll instance is associated with the open file description, not with the epoll file descriptor.
- This has the following consequences:
- If we duplicate an epoll file descriptor using dup()(or similar), then the duplicated descriptor refers to the same epoll interest and ready lists as the original descriptor. We may modify the interest list by specifying either file descriptor as the epfd argument in a call to epoll_ctl(). Similarly, we can retrieve items from the ready list by specifying either file descriptor as the epfd argument in a call to epoll_wait().
- The preceding point also applies after a call to fork(). The child inherits a duplicate of the parent’s epoll file descriptor, and this duplicate descriptor refers to the same epoll data structures.
- When we perform an epoll_ctl() EPOLL_CTL_ADD operation, the kernel adds an item to the epoll interest list that records both the number of the monitored file descriptor and a reference to the corresponding open file description. For the purpose of epoll_wait() calls, the kernel monitors the open file description. This means that we must refine our earlier statement that when a file descriptor is closed, it is automatically removed from any epoll interest lists of which it is a member.
- The refinement is this: an open file description is removed from the epoll interest list once all file descriptors that refer to it have been closed. This means that if we create duplicate descriptors referring to an open file(using dup()(or similar) or fork()), then the open file will be removed only after the original descriptor and all of the duplicates have been closed.
- Suppose we execute the code shown in Listing 63-6(sketched below, since the listing is not reproduced in these notes). The epoll_wait() call in this code will tell us that the file descriptor fd1 is ready(i.e., evlist[0].data.fd == fd1), even though fd1 has been closed. This is because there is still one open file descriptor, fd2, referring to the open file description contained in the epoll interest list.
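- A sketch of that scenario(my own fragment; it assumes an epoll instance epfd, an open file descriptor fd1, and the MAX_EVENTS and Exit() definitions from the example program above):
struct epoll_event ev;
struct epoll_event evlist[MAX_EVENTS];
int fd2, ready;

ev.events = EPOLLIN;
ev.data.fd = fd1;
if(epoll_ctl(epfd, EPOLL_CTL_ADD, fd1, &ev) == -1)
{
    Exit("epoll_ctl error");
}
/* Suppose that fd1 now becomes ready for input */
fd2 = dup(fd1); /* fd2 refers to the same open file description as fd1 */
close(fd1); /* The open file description stays in the interest list, via fd2 */
ready = epoll_wait(epfd, evlist, MAX_EVENTS, -1);
if(ready == -1)
{
    Exit("epoll_wait error");
}
/* Here evlist[0].data.fd == fd1, even though fd1 has been closed */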
- A similar scenario occurs when two processes hold duplicate descriptors for the same open file description(typically after fork()), and the process performing the epoll_wait() has closed its file descriptor, but the other process still holds the duplicate descriptor open.
63.4.5 Performance of epoll Versus I/O Multiplexing
- From Table 63-9 in the book(not reproduced here), which compares the times that poll(), select(), and epoll require to monitor N file descriptors, we see that as the number of file descriptors to be monitored grows large, poll() and select() perform poorly. The performance of epoll hardly declines as N grows large.
- The reasons why select() and poll() perform poorly when monitoring large numbers of file descriptors are given in Section 63.2.5. Now we look at the reasons why epoll performs better:
- On each call to select() or poll(), the kernel must check all of the file descriptors specified in the call.
When we mark a descriptor to be monitored with epoll_ctl(), the kernel records this fact in a list associated with the underlying open file description, and whenever an I/O operation that makes the file descriptor ready is performed, the kernel adds an item to the ready list for the epoll descriptor. (An I/O event on a single open file description may cause multiple file descriptors associated with that description to become ready.) Subsequent epoll_wait() calls simply fetch items from the ready list.
- Each time we call select() or poll(), we pass a data structure to the kernel that identifies all of the file descriptors that are to be monitored, and, on return, the kernel passes back a data structure describing the readiness of all of these descriptors.
With epoll, we use epoll_ctl() to build up a data structure in kernel space that lists the set of file descriptors to be monitored. Once this data structure has been built, each later call to epoll_wait() doesn’t need to pass any information about file descriptors to the kernel, and the call returns information about only those descriptors that are ready.
- Additionally, for select(), we must initialize the input data structure prior to each call; for both select() and poll(), we must inspect the returned data structure to find out which of the N file descriptors are ready.
- Very roughly, we can say that for large values of N(the number of file descriptors being monitored), the performance of select() and poll() scales linearly with N. epoll scales(linearly) according to the number of I/O events that occur.
- epoll is thus efficient in a scenario that is common in servers that handle many simultaneous clients: of the many file descriptors being monitored, most are idle; only a few descriptors are ready.
63.4.6 Edge-Triggered Notification
- By default, epoll provides level-triggered notification. That is, epoll tells us whether an I/O operation can be performed on a file descriptor without blocking. This is the same type of notification as is provided by poll() and select().
- epoll also allows for edge-triggered notification: a call to epoll_wait() tells us if there has been I/O activity on a file descriptor since the previous call to epoll_wait()(or since the descriptor was opened, if there was no previous call).
- Using epoll with edge-triggered notification is similar to signal-driven I/O, except that if multiple I/O events occur, epoll coalesces them into a single notification returned via epoll_wait(); with signal-driven I/O, multiple signals may be generated.
- To employ edge-triggered notification, we specify the EPOLLET flag in ev.events when calling epoll_ctl():
struct epoll_event ev;
ev.data.fd = fd;
ev.events = EPOLLIN | EPOLLET; /* Request edge-triggered notification */
if(epoll_ctl(epfd, EPOLL_CTL_ADD, fd, &ev) == -1)
{
    Exit("epoll_ctl error");
}
- Suppose we use epoll to monitor a socket for input(EPOLLIN). The following steps occur:
- Input arrives on the socket.
- We perform an epoll_wait(). This call will tell us that the socket is ready, regardless of whether we are employing level-triggered or edge-triggered notification.
- We perform a second call to epoll_wait().
- If we employ level-triggered notification: the second epoll_wait() call will inform us that the socket is ready.
If we employ edge-triggered notification: the second epoll_wait() call will block, because no new input has arrived since the previous call to epoll_wait().
- As noted in Section 63.1.1, edge-triggered notification is usually employed in conjunction with nonblocking file descriptors. The general framework for using edge-triggered epoll notification is as follows:
- Make all file descriptors that are to be monitored nonblocking.
- Build the epoll interest list using epoll_ctl().
- Handle I/O events using the following loop:
a) Retrieve a list of ready descriptors using epoll_wait().
b) For each file descriptor that is ready, process I/O until the relevant system call(e.g., read(), write(), recv(), send(), or accept()) returns with the error EAGAIN or EWOULDBLOCK. (A sketch of this step follows the list.)
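- A minimal sketch of step b)(my own fragment; it assumes a nonblocking descriptor fd that epoll_wait() has just reported ready, plus the MAX_BUF and Exit() definitions from the example program above; processInput() is a hypothetical application function):
char buf[MAX_BUF];
ssize_t nread;

/* With edge-triggered notification we must consume all available input now,
   since no further notification will arrive until new input occurs */
for(;;)
{
    nread = read(fd, buf, MAX_BUF);
    if(nread > 0)
    {
        /* processInput(buf, nread); hypothetical application processing */
        continue;
    }
    if(nread == 0) /* End of file: the peer closed the connection */
    {
        break;
    }
    if(errno == EAGAIN || errno == EWOULDBLOCK)
    {
        break; /* All currently available input has been consumed */
    }
    Exit("read error"); /* Some other error */
}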
Preventing file-descriptor starvation when using edge-triggered notification
- Suppose that we are monitoring multiple file descriptors using edge-triggered notification, and that a ready file descriptor has a large amount(perhaps an endless stream) of input available. If, after detecting that this file descriptor is ready, we attempt to consume all of the input using nonblocking reads, then we risk starving the other file descriptors of attention(i.e., it may be a long time before we again check them for readiness and perform I/O on them).
- One solution is for the application to maintain a list of file descriptors that have been notified as being ready, and execute a loop(sketched after this list) that continuously performs the following actions:
- Monitor the file descriptors using epoll_wait() and add ready descriptors to the application list. If any file descriptors are already registered as being ready in the application list, then the timeout for this monitoring step should be small or 0, so that if no new file descriptors are ready, the application can quickly proceed to the next step and service any file descriptors that are already known to be ready.
- Perform a limited amount of I/O on those file descriptors registered as being ready in the application list(perhaps cycling through them in round-robin fashion, rather than always starting from the beginning of the list after each call to epoll_wait()). A file descriptor can be removed from the application list when the relevant nonblocking I/O system call fails with the EAGAIN or EWOULDBLOCK error.
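- The following fragment sketches that loop(my own sketch; epfd, MAX_EVENTS, and Exit() are as in the example program above, while readyList and the helpers listIsEmpty(), addReady(), and serviceNextRoundRobin() are hypothetical application-level functions, not library calls):
struct epoll_event evlist[MAX_EVENTS];
int nready, i, timeout;

for(;;)
{
    /* Step 1: block only if no descriptors are already known to be ready */
    timeout = listIsEmpty(readyList) ? -1 : 0;
    nready = epoll_wait(epfd, evlist, MAX_EVENTS, timeout);
    if(nready == -1 && errno != EINTR)
    {
        Exit("epoll_wait error");
    }
    for(i = 0; i < nready; ++i)
    {
        addReady(readyList, evlist[i].data.fd); /* Remember ready descriptors */
    }

    /* Step 2: perform a limited amount of I/O on the next ready descriptor,
       cycling through readyList in round-robin fashion; the helper removes a
       descriptor once nonblocking I/O fails with EAGAIN or EWOULDBLOCK */
    serviceNextRoundRobin(readyList);
}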
- This approach offers other benefits in addition to preventing file-descriptor starvation. For example, we can include other steps in the above loop, such as handling timers and accepting signals with sigwaitinfo()(or similar).
- Starvation considerations can also apply when using signal-driven I/O, since it also presents an edge-triggered notification mechanism. By contrast, starvation considerations don’t necessarily apply in applications employing a level-triggered notification mechanism. This is because we can employ blocking file descriptors with level-triggered notification and use a loop that continuously checks descriptors for readiness, and then performs some I/O on the ready descriptors before once more checking for ready file descriptors.
63.5 Waiting on Signals and File Descriptors
63.5.1 The pselect() System Call
63.5.2 The Self-Pipe Trick
63.6 Summary
- select() and poll() I/O multiplexing calls simultaneously monitor multiple file descriptors to see if I/O is possible on any of the descriptors. With both system calls, we pass a complete list of to-be-checked file descriptors to the kernel on each system call, and the kernel returns a modified list indicating which descriptors are ready. The fact that complete file descriptor lists are passed and checked on each call means that select() and poll() perform poorly when monitoring large numbers of file descriptors.
- Signal-driven I/O allows a process to receive a signal when I/O is possible on a file descriptor. Linux allows us to change the signal used for notification, and if we instead employ a real-time signal, then multiple notifications can be queued, and the signal handler can use its siginfo_t argument to determine the file descriptor and event type that generated the signal.
- The performance advantage of epoll(and signal-driven I/O) derives from the fact that the kernel “remembers” the list of file descriptors that a process is monitoring(by contrast with select() and poll(), where each system call must again tell the kernel which file descriptors to check).
epoll has notable advantages over the use of signal-driven I/O: we avoid the complexities of dealing with signals and can specify which types of I/O events(e.g., input or output) are to be monitored.
- With a level-triggered notification model, we are informed whether I/O is currently possible on a file descriptor.
Edge-triggered notification informs us whether I/O activity has occurred on a file descriptor since it was last monitored. Edge-triggered notification is usually employed in conjunction with nonblocking I/O.
Further information
- A particularly interesting online resource is at http://www.kegel.com/c10k.html. This web page explores the issues facing developers of web servers designed to simultaneously serve tens of thousands of clients.
Exercises(Redo)