PatentTips - Enhanced I/O Performance in a Multi-Processor System Via Interrupt Affinity Schemes

BACKGROUND OF THE INVENTION

This relates to Input/Output (I/O) performance in a host system having multiple processors, and more particularly, to efficient usage of multiple processors in handling I/O completions by using interrupt affinity schemes that associate various interrupts for I/O completions to their corresponding processors for processing.

Most data centers have bottleneck areas that impact application performance and service delivery to users. One of those bottlenecks could be poor I/O performance in a host or server, which usually results in increased response time and latency, as additional activity or application workload including transactions or file access is formed and queued. Particularly, in a host system having multiple processors, each processor can be executing multiple host applications, which frequently causes a large number of I/O commands from different processors to be serviced. In addition, the interrupts resulting from completion of those I/O commands need to be processed timely enough for each processor that has requested the I/O to be aware of the completions in order to proceed with its assigned applications. Without proper coordination, poor I/O performance in a multi-CPU system can cause significant time delay that would almost defeat the purpose of using multiple processors to expedite application or transaction processing.

Among existing multi-processor systems, there are various solutions to improve I/O performance, such as designating a particular processor out of the multiple processors for handling all interrupts arising from any I/O transactions. However, none of these solutions can achieve system-wide efficiency in minimizing time for processing interrupts in connection with I/O performance in multi-processor systems.

SUMMARY OF THE INVENTION

Embodiments of the present invention relate to improving Input/Output (I/O) performance in a host system having multiple CPUs. In one embodiment, a method for improving Input/Output (I/O) performance in a multi-processor system comprises: creating an interrupt affinity scheme having associations between a plurality of processors, interrupt identifiers and I/O channels; generating an interrupt upon completion of an I/O command; and sending said interrupt from a particular I/O channel of said I/O channels to a particular processor of said processors in accordance with said interrupt affinity scheme, said interrupt having an interrupt identifier associated with said particular processor and said particular I/O channel. This method can comprise further steps of identifying a first mapping scheme having a first group of associations between said processors and said interrupt identifiers; creating a second mapping scheme in accordance with said first mapping scheme, said second mapping scheme having a second group of associations between said interrupt identifiers and said I/O channels; and including said first and second mapping schemes in said interrupt affinity scheme.

In another embodiment, a method for improving CPU usage in handling Input/Output (I/O) performance comprises: identifying an interrupt affinity scheme in a system having a number of processors, said interrupt affinity scheme comprising associations between said processors and a number of interrupt identifiers to be requested for generating interrupts upon I/O completions; and associating said interrupt identifiers with a number of I/O channels in accordance with said interrupt affinity scheme such that interrupts sent from said I/O channels are evenly distributed to each of said processors for processing.

Yet another embodiment of the invention provides a method of improving CPU usage in handling Input/Output (I/O) performances in a multi-processor system, which comprises: detecting a total number of interrupt identifiers available in said system, each interrupt identifier to be used for generating an interrupt upon an I/O completion; for each interrupt identifier, creating a worker kernel thread for handling interrupts having the interrupt identifier; and binding each created worker kernel thread to a unique processor among multiple processors in said system, said unique processor associated with the interrupt identifier corresponding to the worker kernel thread according to an interrupt mapping scheme comprising associations between different processors and said interrupt identifiers.

According to an alternative embodiment of the invention, a method for processing interrupts in a multi-processor system is provided, which comprises: receiving an interrupt triggered by completion of an Input/Output (I/O) command, said interrupt having an interrupt identifier; identifying a processor from multiple processors for processing said interrupt, said processor associated with said interrupt identifier according to an interrupt affinity scheme comprising associations between said multiple processors and a number of interrupt identifiers including said interrupt identifier; and processing said interrupt at said processor.

Also, one embodiment of the invention provides a multi-processor system comprising: a host comprising multiple processors, each of said processors configured to generate Input/Output (I/O) requests and process interrupts; and a host bus adapter coupled with said host, said host bus adapter comprising having multiple I/O channels, each of said I/O channels configured to receive said I/O requests from said host, wherein said host bus adapter is configured to generate said interrupts upon completion of said I/O requests and select one of said multiple I/O channels for sending each of said interrupts back to said host in accordance with an interrupt affinity scheme comprising associations between said processors, multiple interrupt identifiers and said I/O channels. The host bus adapter of this system can be further configured to identify a first mapping scheme comprising a first group of associations between said processors and said interrupt identifiers; establish a second mapping scheme in accordance with said first mapping scheme, said second mapping scheme comprising a second group of associations between said interrupt identifiers and said I/O channels; and create said interrupt affinity scheme by incorporating said first and second mapping schemes.

Embodiments of the present invention also provide computer readable storage media comprising computer-executable instructions in which the above-described methods can be implemented. For example, one embodiment of the invention provides computer readable storage medium comprising computer-executable instructions, said instructions, when executed, causing a computer to: create an interrupt affinity scheme comprising associations between a plurality of processors, interrupt identifiers and I/O channels in a multi-processor system; generate an interrupt upon completion of an I/O command; and send said interrupt from a particular I/O channel of said I/O channels to a particular processor of said processors in accordance with said interrupt affinity scheme, said interrupt having an interrupt identifier associated with said particular processor and said particular I/O channel.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Embodiments of the present invention relate to improving Input/Output (I/O) performance in a host system having multiple CPUs. In particular, embodiments of the present invention aim to use the multiple processors efficiently by evenly distributing and loading all interrupts triggered by I/O completions among the processors, and further, to take advantage of data locality by associating each interrupt to its source processor, namely, the processor originating the I/O request that results in the interrupt. To that end, embodiments of the present invention provide various interrupt affinity schemes that associate multiple processors, interrupts, and I/O channels for sending the interrupts, which allows the interrupts to be evenly loaded among the multiple I/O channels.

Although embodiments of the invention may be described and illustrated herein using interrupt-CPU mapping schemes pre-defined by certain operating systems, such as Solaris by Sun Microsystems, Inc., to demonstrate how to create interrupt affinity schemes, it should be understood that embodiments of this invention are not so limited, but may additionally allow for creating interrupt affinity schemes in the absence of such pre-provided mapping schemes. In addition, although embodiments of the invention may be described and illustrated herein in terms of implementation in certain hardware components such as a host bus adapter and an I/O controller hub, it should be understood that embodiments of the invention can be implemented in variable ways depending on specific structures of different multi-processor systems.

FIG. 1 is a block diagram illustrating an exemplary configuration of a multi-processor system 10 in which the overall I/O (Input/Output) performance and CPU usage can be improved according to various embodiments of the present invention. As shown in FIG. 1, the multi-processor system 10comprises, at a high level, a host 100, coupled with a Host Bus Adapter (HBA) 150, which is configured to communicate with a Storage Area Network (SAN) 160 that is attached to a number of computer storage devices 170, such as hard disks, tape libraries, and optical jukeboxes. A SAN, such as the SAN 160 in FIG. 1, is usually utilized to attach remote computer devices to different servers, such as the host 100 in FIG. 1, so that those remote storage devices can be easily accessed as if they are local to the servers.

PatentTips - Enhanced I/O Performance in a Multi-Processor System Via Interrupt Affinity Schemes

To further facilitate communications, including transmission of data or commands for data between the SAN 160 and the host 100, an adapter or host controller, such as the HBA 150, is typically introduced in the system to assist with certain tasks, such as processing I/O commands, generating interrupts in response to I/O completions, reading data into a host memory through DMA (Direct Memory Access) actions, and so forth. As shown in FIG. 1, a number of Fibre Channel (FC) links 130 or other data links are employed for establishing transmission connections between the HBA 150and the SAN 160, which typically support different transport protocols including without limitation Fibre Channel (FC) protocol, Small Computer System Interface (SCSI) protocol, Fibre Channel over Ethernet (FCoE) protocol, and ATA over Ethernet (AoE) protocol. On the other hand, the HBA150 is coupled to the host 100 as either an integrated or separate component. The HBA 150 is configured to communicate with the host 100 over a host bus, such as a PCI (Peripheral Component Interconnect) bus, a PCI-E (Peripheral Component Interconnect Express) bus 120 shown in FIG. 1, or any other type of host bus known in the art.

Typically, a simplified I/O process works as follows: the host 100 sends an I/O request to the HBA 150 over the PCI-E bus 120 for data to be retrieved from a remote storage device into a memory of the host 100 (or in a reverse direction), and the HBA 150, after retrieving the data through the SAN 160 and performing a DMA (Direct Memory Access) action to write data in the memory of the host 100, would respond by generating an interrupt to notify the host 100 of the I/O completion. Given the large number of I/O processes between the host 100 and HBA 150, an interrupt controller, such as an I/O APIC (Input/Output Advanced Programmable Interrupt Controller) Hub 140 in FIG. 1, can be used to manage I/O completions and corresponding interrupts. In one embodiment, the I/O APIC Hub 140 is configured to, upon receiving each interrupt from the HBA150, determine which processor, among the multiple processors of the host 100, should receive and process the interrupt.

As shown in FIG. 1, the host 100 comprises multiple processors, such as CPU₀101a, CPU₁101b, CPU₂101c, and CPU₃101d. Although only four processors are depicted in FIG. 1, it should be understood that the host 100 may comprise any number of processors depending on the specific system configuration. In one embodiment, each processor is coupled with a LAPIC (Local Advanced Programmable Interrupt Controller), such as LAPIC₀102a, LAPIC₁102b, LAPIC₂102c, or LAPIC₃102d, that has access to a local cache, for example, Cache₀112a, Cache₁112b, Cache₂112c,or Cache₃112d. Each LAPIC is configured to handle interrupts received from the HBA 150 by accessing its local cache storing the most frequently-used data and/or instructions for processing the interrupts. Alternatively, without the LAPIC, each processor can be configured to execute software programs or codes stored in the memory for processing interrupts received from the HBA 150.

FIG. 1 depicts a number of memories, including Memory₀103a, Memory₁103b, Memory₂103c and Memory₃103d, each of which is coupled with a processor and/or associated LAPIC. These memories can be separate or consecutive memory units in the host 100, or represent addresses of memory space that is physically close to their corresponding processor. Each memory is configured to receive and store data from various host applications or from outside the host 100, such as from the remote storage devices 170, and provide access to such data for its related processor. It should be understood that the number of CPU, LAPIC, cache and memory shown in FIG. 1 is for illustration purposes only, and can be increased or reduced as needed in actual implementation.

The host 100 also includes one or more applications to be executed in the host, such as Application₀104a, Application₁104b, Application₂104c,and Application₃104d illustrated in FIG. 1. These applications can range from a local software application (e.g., an Oracle accounting application) to a web-based application (e.g., online data entry). In operation, each application can be assigned to and handled by a designated processor. For example, Application₀104a can be assigned to CPU₀101a for execution. While Application₀104a is being executed at CPU₀, certain data may be needed from a remote hard disk, which will trigger an I/O process for purposes of obtaining such data. The I/O process starts with an I/O request from CPU₀. In one embodiment, such an I/O request is transmitted to the HBA 150 and more particularly, to one of the multiple I/O channels therein, as shown in FIG. 2. In response to the I/O request, the HBA 150 retrieves the data from the relevant storage device over SAN 160 and writes the data into a certain address of the host memory, which, for example, can be any one of the illustrated Memory₀103a, Memory₁103b, Memory₂103cand Memory₃103d. Once the I/O operation is completed, the HBA 150 generates an interrupt and submits the interrupt to the I/O APIC Hub 140 so that ultimately the host 100 is notified of the I/O completion and can access the required data. Such an interrupt can be one of following types of interrupts that are supported by most current operating systems, such as Solaris provided by Sun Microsystem Inc., including (1) a conventional or legacy interrupt that is signaled using one or more external interrupt pins that are wired "out of band" (i.e., separate from main lines of the host bus), (2) a Message-Signaled Interrupt (MSI) that is an "in-band" message implemented as writing a particular value in a particular address, and (3) an Extended Message-Signaled Interrupt (MSI-X) that is an enhanced version of MSI with additional advantages such as an increased number of messages, address independency, etc.

In most existing systems and methods, the host 100 would designate a particular processor, CPU₃, for example, to handle all interrupts sent from the HBA 150, regardless of the source of each interrupt, i.e., which processor originally requested the I/O corresponding to that interrupt. Thus, for example, whether CPU₀or CPU₁has requested the performance of an I/O operation, once that I/O operation is completed, the triggered interrupt would always be sent back to CPU₃for preliminary handling or processing before CPU₀is notified. As a result of such an arrangement, certain coordination or synchronization is required between CPU₀and CPU₃or CPU₀and CPU₃in order for the I/O completion message to be delivered to CPU₀or CPU₁. In addition, when the I/O request and resulting interrupt are originated from CPU₀, data or instructions necessary for processing the interrupt were stored in Cache₀to which the designated CPU₃does not have direct local access. This requires CPU₃to first locate the proper hardware cache that includes the interrupt related information (i.e., "warm cache"), thereby causing additional delay in processing the interrupt. When there are a large number of I/O completions, the designated processor for handling all interrupts can easily become the bottleneck, as all other processors have to wait for their I/O responses before they can proceed with their pending applications. Therefore, despite the existence of multiple processors, the total number of I/O requests that can be processed by the system would be limited to the capacity of the single processor designated for handling interrupts. Such imbalanced usage of different CPUs significantly compromises the overall system efficiency.

Currently, various solutions have been introduced to balance the usage of all CPUs in a multi-CPU system by assigning or distributing interrupts to different processors. For example, Solaris, an OS (Operating System) provided by Sun Microsystem, Inc., defines an affinity or mapping between multiple CPUs and different interrupts. Specifically, the system associates one or more interrupts, each having a unique identifier, to a particular CPU among the multiple CPUs. As a result of such association, when the OS receives an interrupt, the system can determine from the unique interrupt ID which corresponding CPU should be used for handling the interrupt. By evenly distributing interrupts to different processors, the system can achieve a balanced loading on each CPU.

FIG. 2 is a block diagram showing an exemplary configuration of a Host Bus Adapter (HBA) 250 and associated driver 220 for performing I/O operations in a multi-processor system, such as the one illustrated in FIG. 1, according to various embodiments of the present invention. As shown in FIG. 2, the HBA 250 comprises multiple I/O channels, i.e., I/O channels 252a-d. Each I/O channel is configured to carry demands from the driver220 to the HBA 250, or conversely, responses (e.g., interrupts) from the HBA 250 to the driver 220. As aforementioned, the HBA 250 is also configured to communicate with a SAN using, for example, the FC links 230 shown in FIG. 2. The HBA 250 also comprises a processor 254 and a memory 256 coupled to the processor through a bus (not shown). The processor 256 can be any conventional processor such as an Intel® Pentium® or Core Duo™ microprocessor by Intel Corporation. The memory 256 can be dynamic or static random access memory (RAM). In one embodiment, the memory 256 is configured to store data as well as computer-executable instructions for executing certain processes or one or more steps therein, such as the flowchart diagrams illustrated in FIGS. 5 and 6. As can be understood by a person of ordinary skill of art, such computer-executable instructions are written in a computer programming language. In operation, the processor 254 can access the computer-executable instructions in the memory 256 for performing the methods described herein.

PatentTips - Enhanced I/O Performance in a Multi-Processor System Via Interrupt Affinity Schemes

The driver 220 usually comprises software code to be executed by the host computer. In one embodiment, the driver 200 is configured to initialize the settings in the HBA 250, such as configurations of each of the I/O channels 252a-d that typically define what type of devices are associated with each I/O channel, or what type of commands are to be carried by each channel, or what type of protocol is to be supported by each channel. For example, I/O channel 252d can be pre-configured and reserved for transmitting SCSI commands. It should be understood that although only four I/O channels are shown in FIG. 2, the HBA 250 can be configured with any number of I/O channels in different implementations.

In FIG. 2, four exemplary interrupts 240a-d are illustrated, i.e., Interrupt 0, Interrupt 1, Interrupt 2, Interrupt 3, and each interrupt has a unique identifier (e.g., 0, 1, 2, or 3). It should be understood that the interrupt IDs in FIG. 2 are for illustration only and various forms of identifiers can be used for different types of interrupts. According to a pre-defined mapping or affinity scheme, such as the one provided in a Solaris system, each interrupt can be associated with a particular CPU in a group of CPUs, such as CPUs 201a-d. For example, as illustrated in FIG. 2; Interrupt 0 is assigned to CPU₀, Interrupt 1 to CPU₁, Interrupt 2 to CPU₂, and Interrupt 3 to CPU₃. In operation, upon an I/O completion, the HBA 250 requests an interrupt ID and generates an interrupt accordingly. The generated interrupt will be sent back to the host via one of the multiple I/O channels in the HBA 250. As an example, if the generated interrupt is in the form of Interrupt 0, the host operating system can determine from the interrupt ID, i.e., zero (0), that CPU₀should be the processor to process this interrupt. Likewise, if the interrupt is Interrupt 3, the operating system can determine from the interrupt ID being three (3) that CPU₃should be the processor to process this interrupt. However, if the HBA 250 is given the same interrupt ID each time for generating an interrupt, every interrupt will be sent back to the same CPU associated with that interrupt ID, which would result in one processor being overly busy as if it is fully designated for processing interrupts. Also, ideally, if the completed I/O was initially requested by CPU₀, then CPU₀should be the processor to handle the corresponding interrupt in order to take the advantage of warm cache or data locality. That requires the HBA 250 to generate the interrupt using an interrupt ID of zero. However, the HBA would not know which processor initiated the I/O request or which interrupt ID is to be used for matching the right processor. Using the example illustrated in FIG. 2, there is only 25% chance of such matching when the HBA 250 can randomly assign any one of the four CPUs to a received I/O completion. Therefore, the existing interrupt-processor affinity scheme is insufficient for evenly distributing interrupts among multiple processors or automatically sending an interrupt to its source processor, namely, the processor that requested the I/O triggering the interrupt.

As aforementioned, any one of the I/O channels 252a-d can be used for passing I/O requests from different processors of the host to the HBA and sending responses/interrupts from the HBA back to the host. In either direction, there are multiple I/O channels to choose from, which add the uncertainty or difficulty in tracking down the source processor of each I/O request and destination processor for each interrupt. For example, an I/O request can be received from I/O channel 252a, and the interrupt responsive to the I/O completion can be sent through I/O channel 252c. In addition, without knowing how frequently each channel is being or will be used for carrying the interrupts, the HBA may overload one particular channel. One approach is to pre-configure the I/O channels to the extent that they each are associated with different types of devices, data commands or communication protocols to be utilized in completing the I/O operation. As a result of such a configuration, when an I/O request is received, depending on which types of devices, data commands or communication protocols need to be used in servicing the I/O request, the driver220 can identify the associated channel for passing the request to the HBA 250. Likewise, when the I/O operation is completed, depending on which types of devices, data commands or communication protocols are used in the I/O performance, the HBA 250 can identify the associated I/O channel for sending back the response or interrupt. This way, the I/O requests and corresponding interrupts for the same types of devices, data commands or communication protocols will always share the same I/O channel. For example, as illustrated in FIG. 2, the I/O channels 252a and 252b are programmed for FCP (Fibre Channel Protocol) commands, the I/O channel 252c for IP (Internet Protocol) commands, and the I/O channel 252d for SCSI (Small Computer System Interface) commands. If a received I/O command is a SCSI command, the I/O channel 252d will be selected for sending this I/O command to the HBA 250, and once the I/O operation is completed, the resulting interrupt will be sent back to the host over the same I/O channel 252d.

The above-described approach works well when there is an even distribution of I/O completions among different types of devices, data commands or communication protocols. In operation, however, there may be a large number of I/O operations for SCSI commands, and as such, the I/O channel 252d designated for SCSI commands will be heavily loaded with I/O requests and responses. Accordingly, a better solution is needed for efficient usage of multiple I/O channels, interrupts and processors in a multi-CPU system.

FIGS. 3a-b provide exemplary mapping schemes 300a-b that establish an affinity between different I/O channels, interrupts and processors for improving I/O performance in a multi-CPU system as illustrated in FIG. 1 according to one embodiment of the present invention. Both mapping schemes, 300a in FIGS. 3a and 300b in FIG. 3b, include a first mapping between a number of CPU IDs 302 and a number of interrupt IDs 304, and a second mapping between the interrupt IDs 304 and multiple I/O channel IDs 306. Typically, the mapping or association between the CPU IDs and interrupt IDs are set up by the operating system of a multi-CPU system, such as Solaris, when the system is initialized. In that mapping process, the OS detects a total number of processors in the system and a total number of interrupt IDs allocated for a specific instance of a device, and assigns one or more interrupts to one processor. In one implementation, a data table is created to store each pair of a CPU ID and associated interrupt ID. In an ideal situation, the mapping between the CPUs and interrupts can be one-to-one, meaning each interrupt is assigned to a unique CPU for processing. However, because the number of interrupts often exceeds the number of processors, one CPU can be assigned to process multiple interrupts. As shown in FIGS. 3a-b, the three interrupts, Interrupt 0, Interrupt 1 and Interrupt 2, are associated with the same processor, CPU₀, while Interrupt 3 is assigned to CPU₁, and Interrupt 4 is assigned to CPU₂.

PatentTips - Enhanced I/O Performance in a Multi-Processor System Via Interrupt Affinity Schemes

In one embodiment, the mapping or affinity between the interrupts and different I/O channels is established by the HBA 250 and associated driver220. When the driver 220 is initialized to configure the I/O channels in the HBA 250, a copy of interrupt-CPU mapping or association scheme is saved and used for establishing the affinity between the I/O channels and interrupt IDs. Again, ideally, a one-to-one mapping between each unique I/O channel and each unique interrupt is desirable, but because the number of I/O channels oftentimes exceeds the number of interrupts, one or more I/O channels can be assigned to share one interrupt ID. For example, in FIG. 3a the I/O channels 0-2 share the same interrupt ID, Interrupt 0, and the I/O channels 3, 5 share the same interrupt ID, Interrupt 1.

There are variable ways to establish the I/O-interrupt affinity. For example, FIG. 3a demonstrates an I/O-interrupt affinity scheme without considering the pre-defined interrupt-processor scheme, while FIG. 3b provides another I/O-interrupt mapping scheme that takes into consideration the pre-defined interrupt-processor scheme. As will be described in detail below, with the mapping scheme illustrated in FIG. 3b, the multiple processors in the system can have a more balanced load of interrupts from different I/O channels.

FIG. 3a shows a random mapping of multiple I/O channels to different interrupt IDs without taking into consideration the CPU-Interrupt association information. As seen in FIG. 3a, three channels, i.e., I/O channels 0-2, are mapped to Interrupt 0, two channels, i.e., I/O channels 3 and 5, are mapped to Interrupt 1, and I/O channel 4 is mapped to Interrupt 2. Because all these three interrupts, Interrupts 0-2, are mapped to or associated with the same processor, CPU₀, this processor will be loaded with interrupts received from six channels (I/O channels 0-5) in total. CPU₀can be heavily loaded, especially compared with the other processor, CPU₁, which will receive interrupts from only two channels, i.e., I/O channels 6 and 7, according to the mapping scheme 300a in FIG. 3a. This would cause an imbalanced usage of CPUs and inefficient handling of I/O requests.

In contrast, the Interrupt-I/O mapping scheme in FIG. 3b is based on the knowledge of the CPU-Interrupt association. Since CPU₀is known to have been designated for processing interrupts having IDs of 0-2, while CPU₁is only designated for one interrupt ID of 3 and CPU₂is for Interrupt 4 only, the HBA and driver can assign or map fewer I/O channels to interrupt IDs 0-2 than interrupt IDs 3 or 4 so that interrupts received from different I/O channels can be evenly distributed among the multiple processors. For example, as illustrated in FIG. 3b, for Interrupts 0-2, only one I/O channel is mapped to each interrupt, namely, I/O channel 0 to Interrupt 0, I/O channel 1 to Interrupt 1 and I/O channel 2 to Interrupt 2. This is different from Interrupt 3 to which three channels (I/O channels 4, 6, 7) are mapped, or Interrupt 4 to which two channels (I/O channels 3, 5) are mapped. Ultimately, CPU₀and CPU₁will each handle interrupts from three I/O channels and CPU₂will process interrupts from two I/O channels, resulting in enhanced CPU usage in a multi-CPU system.

Referring to FIG. 4, a worker kernel thread scheme 400 is provided for establishing the affinity between different I/O channels, interrupts and processors in a multi-CPU system as illustrated in FIG. 1 according to another embodiment of the present invention. Many operating systems, such as Solaris, have implemented a multi-threaded process model. Under such a model, the I/O performance including I/O completions and interrupts triggered therefrom can be viewed, at a detailed thread level, as involving multiple threads in two spaces, namely, the kernel space 410 and the user space 420 as shown in FIG. 4. In the user space 420, for example, an I/O process can be viewed as including a number of user threads 426. Each user thread 426 corresponds to a unique kernel thread 406 that is bound to a unique processor of the multiple CPUs 402a-d. Typically, all kernel threads 406 in the kernel space are managed by a dispatcher 404. In one configuration, the dispatcher 404 receives a kernel thread 406, identifies a processor that the thread is bound with, and inserts each kernel thread into a per-processor dispatch queue associated with the processor. The kernel thread usually waits in the dispatch queue until a system scheduler decides that the priority of this kernel thread becomes current and the kernel thread is ready to be serviced by its bound processor.

PatentTips - Enhanced I/O Performance in a Multi-Processor System Via Interrupt Affinity Schemes

In the context of I/O performance, a dedicated worker kernel thread can be employed by each processor to assist an interrupt thread with processing completed I/O commands. Without a dedicated worker kernel, a simplified I/O completion process, at a detailed thread level, works as follows: when an interrupt is received at a processor, it triggers the interrupt thread, which, due to its highest priority, would require the processor to stop all other threads in the middle of processing to service the interrupt thread. This is often not the best way of utilizing the processor. The use of a dedicated worker kernel thread improves the CPU usage by allowing the interrupt thread to hand over the process for any completed I/O commands to the worker thread. Specifically, once a dedicated worker kernel thread is created for the interrupt thread, it is placed in a round queue (e.g., a per-processor dispatch queue) and remains in the sleeping mode until it is woken up by the interrupt thread. When an interrupt is received at the processor, the interrupt thread performs certain operations and wakes up the dedicated worker kernel thread and hands over to it the remaining process for the I/O completion triggering the interrupt. Because the dedicated worker kernel thread has a pre-assigned priority that may or may not become current, the processor does not have to stop processing other threads in the middle. Rather, the processor can take time to service the dedicated worker kernel thread, as with all other threads waiting in the round queue.

As described above, a worker kernel thread can be bound with a unique processor, while each interrupt ID is also associated with a unique processor according to the interrupt-processor affinity already provided by the operating system. Therefore, it is desirable to create at least one worker kernel thread for all interrupt IDs associated with one processor and bind this worker kernel thread to the same processor. In one embodiment, the worker kernel threads for I/O purposes are created during system initialization when a total number of interrupt IDs are detected, and for each interrupt ID a corresponding worker kernel thread is created and further bound to a processor associated with that interrupt ID based on the CPU-Interrupt affinity already provided by the system. In an alternative embodiment, the worker kernel threads can be created and configured dynamically. In other words, instead of being pre-defined during the system initialization, a corresponding worker kernel thread is created whenever an interrupt triggered when an I/O completion is received at a processor.

In creating a worker kernel thread, kernel calls such as thread_create( ) provided by Solaris can be used. Usually when a kernel worker thread is first created via thread_create( ), the scheduling class of this thread is inherited from the thread issuing the thread_create( ) call, and the CPU assigned to the kernel thread, by default, would be the one in which the thread_create( ) code is being executed. The priority of the work kernel thread can be adjusted by specifying a value in the thread_create( ) call. The thread affinity with different processors can then be established or adjusted through kernel calls such as thread_affinity_set(kthread_id_t t, int processorid_t) and thread_affinity_clear(kthread_id_t t). It should be understood that the above-listed kernel calls are only exemplary, and there are many variations in creating worker kernel threads and establishing their affinity with different processors.

FIG. 5 is a flowchart showing an exemplary process of handling I/O requests using the mapping or affinity scheme illustrated in FIGS. 3a-baccording to various embodiments of the present invention. As shown in FIG. 5, the process starts as the system is initialized at step 510, where the CPU-Interrupt affinity or association, as illustrated in FIGS. 3a-b, is established. At step 520, the driver is initialized to configure the multiple I/O channels in the HBA so that an affinity or mapping scheme between the interrupt IDs and the channel IDs is established. This is accomplished by having a copy of the CPU-interrupt scheme that is pre-defined by the operating system at step 5202, and creating the interrupt-I/O mapping scheme according to the CPU-interrupt scheme at step 5204. The general goal of step 520 is, as described above with reference to FIGS. 3a-b, to ensure that interrupts from different I/O channels are more evenly distributed and assigned to different processors in the system.

PatentTips - Enhanced I/O Performance in a Multi-Processor System Via Interrupt Affinity Schemes

A typical I/O performance module 530 includes the following steps: receiving an I/O request for data in or out of remote storage devices in a HBA at step 532, I/O completion by the HBA at step 534, and triggering interrupts upon I/O completion and sending the interrupts back for processing at a certain CPU at step 536. In one embodiment, the I/O request includes a CPU ID indicating the source processor of the I/O request. In another embodiment, the I/O request includes an I/O channel ID to designate which I/O channel should be used for sending back the reply or interrupt corresponding to the requested I/O operation. As will be described below, without the designated return I/O channel, by default the HBA will use the same I/O channel from which the I/O request was received for sending back a reply or an interrupt message. In operation, the HBA can store information regarding each received I/O request in a data table for future reference. For instance, the HBA can refer to the stored I/O request for the source processor ID or a designated I/O channel ID in generating the interrupt.

The interrupt generation, delivery and processing step 536 can be performed in variable ways, depending on specific system configurations including different I/O channels, interrupts IDs, CPUs and their association schemes. FIG. 5 provides one exemplary process comprising steps5362-5366, the order of which can be varied in practice. As shown in FIG. 5, in generating an interrupt upon the I/O completion, the HBA first determines an interrupt ID for the interrupt at step 5362. This interrupt ID can be determined by identifying the source CPU ID included in the previously-stored I/O request and using the CPU-interrupt mapping scheme pre-stored in the HBA to identify at least one interrupt ID associated with that CPU. If no CPU ID is included in the original I/O request, the HBA can treat the processor on which the I/O was last executed as the source CPU for purposes of determining an interrupt ID. Alternatively, if the I/O request does not include the source CPU ID but a designated I/O channel ID instead, the HBA can refer to the interrupt-I/O mapping scheme created at step 520 to identify the interrupt ID associated with the designated I/O channel.

Once the interrupt ID is selected, the HBA can proceed to determine the I/O channel for sending back the interrupt to the source CPU at step 5364. The I/O channel can be determined in one of the following ways: (1) if the original I/O request includes an I/O channel ID, this previously designated channel will be used for sending the interrupt; (2) if no such I/O ID is included in the I/O request, then by default the I/O channel originally used for sending the I/O request will be used for sending the interrupt, or (3) the interrupt ID will be used to identify at least one associated I/O channel according to the interrupt-I/O mapping scheme created in the I/O channel configuration step 520 and the identified I/O channel will be used for sending the interrupt. For example, referring back to FIG. 2, if an I/O request is received from the I/O channel 252d, by default any response or interrupt upon the I/O completion will be sent back to the processor via the I/O channel 252d. But if the I/O request includes an I/O channel ID of one (1), the interrupt will be sent back from the I/O channel 252b. In the third approach, assuming an interrupt ID of three (3), i.e., Interrupt 3, is determined and in view of the mapping scheme in FIG. 3b, the I/O channel to be used can be any one of I/O channels 4, 5 and 7 associated with Interrupt 3.

At step 5366, an interrupt is generated using the interrupt ID and sent over the determined I/O channel back to the source CPU for processing. As will be understood by those skilled in the art, many variations to the above-described process can be incorporated and implemented for improving I/O performance via a mapping or affinity scheme between different channels, interrupts and processors according to various embodiments of the invention.

FIG. 6 is a flowchart showing an exemplary process of handling I/O operations by use of a worker kernel thread affinity scheme, such as the illustration in FIG. 4, according to various embodiments of the present invention. Similar to FIG. 5, the exemplary process in FIG. 6 also includes a system initialization step 610 for the operating system to set up the CPU-Interrupt affinity, a channel configuration step 620 and the I/O performance module 630. Specifically, when the I/O channels are configured at step 620, it involves the following actions: a copy of the system-provided interrupt-CPU mapping is stored in the HBA at step 6202, an interrupt-I/O mapping is created according to the CPU-interrupt affinity at step 6204, and for each interrupt ID, a corresponding worker kernel thread is created and placed in a dispatch queue of the binding processor at step 6206. As described above with reference to FIG. 5, the worker kernel threads can be dynamically configured in response to interrupts received at each processor.

PatentTips - Enhanced I/O Performance in a Multi-Processor System Via Interrupt Affinity Schemes

As with step 536 in FIG. 5, the interrupt generation, delivery and processing step 636 of FIG. 6 can be performed in variable ways, depending on specific system configurations. The exemplary process in FIG. 6 comprises steps 6362-6370, the order of which can be varied in practice. As shown in FIG. 6, in generating an interrupt upon receipt of an I/O completion, the HBA first determines an interrupt ID for the interrupt at step 6362. This interrupt ID can be determined by identifying the source CPU ID included in the previously-stored I/O request and using the CPU-interrupt mapping scheme pre-stored in the HBA to identify at least one interrupt ID associated with that CPU. If no CPU ID is included in the original I/O request, the HBA can treat the processor in which the I/O was last executed as the source CPU for purposes of determining an interrupt ID. Alternatively, if the I/O request does not include the source CPU ID but a designated I/O channel ID instead, the HBA can refer to the interrupt-I/O mapping scheme created at step 620 to identify the interrupt ID associated with the designated I/O channel. Once the interrupt ID is selected, the HBA can proceed to determine the I/O channel for sending back the interrupt to the source CPU at step 6364. The I/O channel can be determined in one of the following ways: (1) if the original I/O request includes an I/O channel ID, this previously designated channel will be used for sending the interrupt; (2) if no such I/O ID is included in the I/O request, then by default the I/O channel originally used for sending the I/O request will be used for sending the interrupt, or (3) the interrupt ID will be used to identify at least one associated I/O channel according to the interrupt-I/O mapping scheme created in the I/O channel configuration step 520 and the identified I/O channel will be used for sending the interrupt. For example, referring back to FIG. 2, if an I/O request is received from the I/O channel 252d, by default any response or interrupt upon receipt of the I/O completion will be sent back to the processor via the I/O channel 252d. But if the I/O request includes an I/O channel ID of one (1), the interrupt will be sent back from the I/O channel252b. In the third approach, assuming an interrupt ID of three (3), i.e., Interrupt 3, is determined and in view of the mapping scheme in FIG. 3b, the I/O channel to be used can be any one of I/O channels 4, 5 and 7 associated with Interrupt 3. At step 6366, an interrupt is generated using interrupt ID and sent over the determined I/O channel back to the source CPU for processing.

When the interrupt is received at the correct processor, at step 6368 the interrupt thread wakes up a worker kernel thread corresponding to the interrupt to hand over the remaining process for the completed I/O command. As aforementioned, this worker kernel thread can be pre-created for the interrupt during the system initialization step of 620 or dynamically configured as the interrupt is being received. If the worker kernel thread is pre-created, it is already placed in the dispatch queue associated with the bound processor. Otherwise the newly created worker kernel thread will be assigned and inserted in the dispatch queue at step 6370. Once the priority of the worker kernel thread becomes current, the processor will attend to and service the thread, at which time the processor is notified of the I/O completion and concludes the performance of the I/O request.

The flowchart in FIG. 6 is an exemplary process, and as will be understood by those skilled in the art, many variations can be incorporated and implemented for improving I/O performances via a mapping or affinity scheme between different channels, interrupts and processors and a worker kernel scheme according to various embodiments of the invention.

SRC=http://www.freepatentsonline.com/y2011/0087814.html

你可能感兴趣的:(performance)

MongoDB知识概括 GeorgeLin98 持久层 mongodb
MongoDB知识概括MongoDB相关概念单机部署基本常用命令索引-IndexSpirngDataMongoDB集成副本集分片集群安全认证MongoDB相关概念业务应用场景：传统的关系型数据库（如MySQL），在数据操作的“三高”需求以及应对Web2.0的网站需求面前，显得力不从心。解释：“三高”需求：①Highperformance-对数据库高并发读写的需求。②HugeStorage-对海量数
pnpm解說白总Server 服务器 kubernetes 网络运维云原生 python java
pnpm（PerformanceNodePackageManager）是一个高性能的Node.js包管理器，它旨在解决npm和yarn在处理依赖关系时可能遇到的一些问题，如重复安装相同版本的包、包的存储空间占用过大等。pnpm使用了一种称为“硬链接”和“符号链接”的文件系统技术，这使得它能够以更高效的方式存储和管理依赖项。关键特点：高效存储：pnpm使用一种称为内容可寻址存储（ContentAdd
SIPp常用脚本之三：UAC weixin_34075551 网络
UAC是作为SIP消息的发起端，可以控制消息速率什么的，方便极了。一、uac.xml;tag=[call_number]To:Call-ID:[call_id]CSeq:1INVITEContact:sip:[field0]@[local_ip]:[local_port]Max-Forwards:70Subject:PerformanceTestContent-Type:application/s
应用Visual Studio Profiler分析CPU使用情况 Rverdoser windows
使用VisualStudioProfiler分析CPU使用情况‌的步骤如下：1.‌启动CPU分析：‌在VisualStudio中打开你要分析的项目。在菜单栏中选择Debug>PerformanceProfiler，或者使用快捷键Alt+F2。在性能分析工具窗口中，选择CPUUsage选项，这将帮助你分析应用程序的CPU使用情况。2.‌运行CPU分析‌选择CPUUsage后，点击Start按钮。Vi
php工程师绩效考核表_如何对程序员绩效考核？ weixin_39637233 php工程师绩效考核表
如何对程序员绩效考核？1、什么是绩效考核？来在百度百科的解释，绩效考核(performanceexamine)，是企业绩效管理中的一个环节，是指考核主体对照工作目标和绩效标准，采用科学的考核方式，评定员工的工作任务完成情况、员工的工作职责履行程度和员工的发展情况，并且将评定结果反馈给员工的过程。常见绩效考核方法包括BSC、KPI及360度考核等。绩效考核是一项系统工程。2、绩效考核是否有用？对企业
[Kaiming]Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification MTandHJ neural networks
文章目录概主要内容PReLUKaiming初始化ForwardcaseBackwardcaseHeK,ZhangX,RenS,etal.DelvingDeepintoRectifiers:SurpassingHuman-LevelPerformanceonImageNetClassification[C].internationalconferenceoncomputervision,2015:1
「RIA学习力」《学习心理学》No.1，未闻 Nathan_2
「RIA学习力授权导师」便签输出第6期第1天《学习心理学》拆页一来自《第一章学习理论与教学导论》P9(一)学习的定义虽然本书讨论的学习理论之间存在差异，但这些理论在学习上确实有一些基本的确定性的假设。首先，它们都指出学习是人类行为表现performance，又译表现)或行为表现潜能的持久改变。这意味着学习者能够执行一些在学习发生之前不能执行的行动而且不管它们实际上是否有展示新习得行为表现的机会，这
华为云全栈可观测平台（APM）8月新功能特性华为云PaaS服务小智华为云
华为云应用性能管理服务（ApplicationPerformanceManagement，简称APM）帮助运维人员快速发现应用的性能瓶颈，以及故障根源的快速定位，为用户体验保驾护航。您无需修改代码，只需为应用安装一个APMAgent，就能够对该应用进行全方位监控，帮助您快速定位出错接口和慢接口、重现调用参数、发现系统瓶颈，从而大幅提升线上问题诊断的效率。8月APM更新了3大新特性，一起来看看吧！（
SQL Server内存性能监视工具 culuo4781 java linux python 数据库 mysql
内存压力使查询变慢(Memorypressureslowingdownqueries)ThisarticleisthesequelinaseriesaboutSQLServermonitoringtoolsandcommonperformanceissues.ThefirstarticleSQLServermonitoringtoolsfordiskI/Operformanceisabouthow
.NET Core —如何使用Redis缓存提高应用程序性能 weixin_26737625 redis java 缓存 python mysql
Redisisaverypowerfuldistributedcachingengineandoffersverylowlatencykey-valuepaircaching.Ifusedintherightbusinesscontext,Rediscansignificantlyboostapplicationperformance.Inthisarticlewewilldoawalkthrou
Python+Pytest压力测试浪里一条鱼技术分享 python 压力测试
在现代Web应用程序中，性能是至关重要的。为了确保应用程序能够在高负载下正常运行，我们需要进行性能测试。今天，应小伙伴的提问，老向老师来写一个Pytest进行压力测试的简单案例。这个案例的测试网站我们就隐藏了，不过网站的基本情况是：阿里框架：FastAdmin.net1.程序说明1.1设置测试参数首先，我做的第一件事情就是设置测试参数。代码如下#定义测试用例deftest_performance(
推荐开源项目：Fluxter - Elixir连接InfluxDB的高效桥梁江奎钰
推荐开源项目：Fluxter-Elixir连接InfluxDB的高效桥梁fluxterHigh-performanceandreliableInfluxDBwriterforElixir项目地址:https://gitcode.com/gh_mirrors/fl/fluxter项目介绍Fluxter是一款专为Elixir社区打造的轻量级工具，旨在简化与InfluxDB——高性能的时间序列数据库之间
Redis概述 AC编程
一、为什么需要NoSQLHighperformance高并发读写HugeStorage海量数据的高效率存储和访问HighScalability&&HighAvailability高可拓展性和高可用性二、NoSQL数据库的四大分类键值（Key-Value）存储列存储文档数据库图形数据库三、四类NoSQL数据库比较键值（Key-Value）存储相关产品：Redis、Voldemort、TokyoCab
Performance Tips ngugg
相关链接：https://developer.apple.com/library/archive/documentation/FileManagement/Conceptual/FileSystemProgrammingGuide/PerformanceTips/PerformanceTips.html#//apple_ref/doc/uid/TP40010672-CH7-SW1Relativet
Zookeeper简介 Daly罗 Zookeeper zookeeper 分布式云原生
1.什么是ZookeeperZooKeeperisahigh-performancecoordinationservicefordistributedapplications.Itexposescommonservices-suchasnaming,configurationmanagement,synchronization,andgroupservices-inasimpleinterface
前端性能监控、异常监控的一些记录一只小白菜~ 其他!前端异常监控性能监控
文章目录常见异常类型常用的一些异常监控的方法window.errorwindow.addEventListener('error')window.addEventListener('load')window.addEventListener('DOMContentLoaded')window.performancenavigator.sendBeacon1*1像素gifaxios请求/响应拦截器V
prometheus监控mysql jads_ prometheus mysql prometheus
1、在mysql中创建监控用户CREATEUSER'exporter'@'%'IDENTIFIEDBY'123456'WITHMAX_USER_CONNECTIONS3;GRANTPROCESS,REPLICATIONCLIENT,SELECTON*.*TO'exporter'@'%';GRANTSELECTONperformance_schema.*TO'exporter'@'%';flushp
Window Performance API TE-茶叶蛋前端项目性能优化 javascript 开发语言 ecmascript
文章目录前言WindowPerformanceAPI详细分析**1.主要接口****1.1`performance.now()`****1.2`performance.mark()`****1.3`performance.measure()`****1.4`performance.getEntriesByType()`****1.5`performance.getEntriesByName()`*
【大模型】大模型 CPU 推理之 llama.cpp szZack 大语言模型人工智能大模型人工智能 llama.cpp
【大模型】大模型CPU推理之llama.cppllama.cpp安装llama.cppMemory/DiskRequirementsQuantization测试推理下载模型测试参考llama.cpp描述Themaingoalofllama.cppistoenableLLMinferencewithminimalsetupandstate-of-the-artperformanceonawideva
XILINX AXI总线热爱学习地派大星网络 fpga开发 fpga 嵌入式硬件
简介本文主要针对XILINX使用的AXILite总线对寄存器读写的使用，首先对AXI总线做详细介绍AXI总线AXI是一种总线协议，可以挂在多个master和slave，AXI总线包括3中类型接口，介绍如下：AXI4：（Forhigh-performancememory-mappedrequirements.）主要面向高性能地址映射通信的需求，是面向地址映射的接口，允许最大256轮的数据突发传输；A
【MySQL数据库管理问答题】第5章监控 MySQL summer.335 MySQL数据库管理问答题 MySQL 数据库 mysql
目录1.MySQL服务器都提供了哪几种类型的日志文件？说明每种日志的用途。2.MySQL8.0默认启用哪两种日志记录？3.请说明常规查询日志和慢速查询日志在记录的内容上有何不同。4.如何配置才能将慢速查询日志和常规查询日志在文件和表里同时保存？5.从DBA的角度，谈一下使用Performanceschema的目的或作用？6.Performanceschema中的顶级检测组件都有哪些？7.请谈一下M
企业群集应用概述与 LVS 负载均衡详解爱吃糖的蠢猫 lvs 负载均衡运维
文章目录企业群集应用概述与LVS负载均衡详解一、企业群集应用概述1.1群集的含义1.2现有问题1.3解决方法二、企业群集分类2.1负载均衡群集（LoadBalanceCluster）2.2高可用群集（HighAvailabilityCluster）2.3高性能运算群集（HighPerformanceComputerCluster）三、负载均衡群集架构3.1第一层：负载调度器（LoadBalance
RDMA相关git 今天周一 git
perftest性能测试工具perftest：GitHub-linux-rdma/perftest:InfinibandVerbsPerformanceTestsrdma-corerdma-core：GitHub-linux-rdma/rdma-core:RDMAcoreuserspacelibrariesanddaemons
【论文阅读】GLiRA: Black-Box Membership Inference Attack via Knowledge Distillation Bosenya12 模型窃取科研学习论文阅读知识蒸馏成员推理攻击黑盒
摘要While（虽然）DeepNeuralNetworks(DNNs)havedemonstratedremarkableperformanceintasksrelatedtoperception（感知）andcontrol（控制）,therearestillseveralunresolvedconcerns（未解决的问题）regardingtheprivacyoftheirtrainingdat
SplitDB: Closing the Performance Gap for LSM-Tree-Based Key-Value Stores 简单翻译和思考 Such Devotion LSM-
来源IEEETRANSACTIONSONCOMPUTERS,VOL.73,NO.1,JANUARY2024主要内容：设计了NVM存储层用于在LSM压缩过程中衔接内存和SSD/HDDAbstract日志结构化合并树(LSM树)是现代键值存储的核心数据存储引擎。随着云计算和数据中心的发展，它的采用速度迅速加快。尽管LSM树得到了广泛的应用，但它仍然面临严重的性能问题，例如写入停顿、写入放大和读取效率低
tcp delayed ack 子羽潇潇 tcpip tcp/ip
whatisTCPdelayedACKTCPdelayedacknowledgmentisatechniqueusedbysomeimplementationsoftheTransmissionControlProtocolinanefforttoimprovenetworkperformance.Inessence,severalACKresponsesmaybecombinedtogether
go的fasthttp学习 ~kiss~ 计算机网络 golang 学习开发语言
背景介绍fasthttpwasdesignedforsomehighperformanceedgecases.Unlessyourserver/clientneedstohandlethousandsofsmalltomediumrequestspersecondandneedsaconsistentlowmillisecondresponsetimefasthttpmightnotbeforyo
学习笔记——前端页面性能指标 Garfield的子非鱼
意义Findouthowyoustackuptonewindustrybenchmarksformobilepagespeed曾提到随着页面加载时间从1秒增加到10秒，移动站点访问者跳转的概率增加了123%。相关指标计算NavigationTimingLevel2为了帮助开发者更好的衡量和改进前端页面性能，W3C性能小组引入了PerformanceNamvaigationTimeAPI（IE和Sa
Gemini 模型将被引入Performance Max 新加坡内哥谈技术人工智能
每周跟踪AI热点新闻动向和震撼发展想要探索生成式人工智能的前沿进展吗？订阅我们的简报，深入解析最新的技术突破、实际应用案例和未来的趋势。与全球数同行一同，从行业内部的深度分析和实用指南中受益。不要错过这个机会，成为AI领域的领跑者。点击订阅，与未来同行！订阅：https://rengongzhineng.io/谷歌近日宣布其性能最大化广告系列中引入了创新特性，包括融合双子星模型，旨在为广告商解锁更
IPQ9574 QCN9224 QCN9274: What is the throughput of WiFi7 cards? linuxubuntu
IPQ9574QCN9224QCN9274:WhatisthethroughputpotentialofWiFi7cards?IntherealmofWiFirouters,throughputservesasakeydeterminantofperformance.Ahigherthroughputsignifiesenhanceddatatransferrates,enablingseamle
windows下源码安装golang 616050468 golang安装 golang环境 windows
系统： 64位win7，开发环境：sublime text 2， go版本： 1.4.1 1. 安装前准备(gcc, gdb, git) golang在64位系
redis批量删除带空格的key bylijinnan redis
redis批量删除的通常做法： redis-cli keys "blacklist*" | xargs redis-cli del 上面的命令在key的前后没有空格时是可以的，但有空格就不行了： $redis-cli keys "blacklist*" 1) "blacklist:12: [email protected]
oracle正则表达式的用法 0624chenhong oracle 正则表达式
方括号表达示方括号表达式描述 [[:alnum:]] 字母和数字混合的字符 [[:alpha:]] 字母字符 [[:cntrl:]] 控制字符 [[:digit:]] 数字字符 [[:graph:]] 图像字符 [[:lower:]] 小写字母字符 [[:print:]] 打印字符 [[:punct：]] 标点符号字符 [[:space:]]
2048源码(核心算法有，缺少几个anctionbar，以后补上) 不懂事的小屁孩 2048
2048游戏基本上有四部分组成， 1：主activity，包含游戏块的16个方格，上面统计分数的模块 2：底下的gridview，监听上下左右的滑动，进行事件处理， 3：每一个卡片，里面的内容很简单，只有一个text，记录显示的数字 4：Actionbar，是游戏用重新开始，设置等功能(这个在底下可以下载的代码里面还没有实现) 写代码的流程 1：设计游戏的布局，基本是两块，上面是分
jquery内部链式调用机理换个号韩国红果果 JavaScript jquery
只需要在调用该对象合适(比如下列的setStyles)的方法后让该方法返回该对象（通过this 因为一旦一个函数称为一个对象方法的话那么在这个方法内部this（结合下面的setStyles）指向这个对象） function create(type){ var element=document.createElement(type); //this=element;
你订酒店时的每一次点击背后都是NoSQL和云计算蓝儿唯美 NoSQL
全球最大的在线旅游公司Expedia旗下的酒店预订公司，它运营着89个网站，跨越68个国家，三年前开始实验公有云，以求让客户在预订网站上查询假期酒店时得到更快的信息获取体验。云端本身是用于驱动网站的部分小功能的，如搜索框的自动推荐功能，还能保证处理Hotels.com服务的季节性需求高峰整体储能。 Hotels.com的首席技术官Thierry Bedos上个月在伦敦参加“2015 Clou
java笔记1 a-john java
1，面向对象程序设计（Object-oriented Propramming，OOP）：java就是一种面向对象程序设计。 2，对象：我们将问题空间中的元素及其在解空间中的表示称为“对象”。简单来说，对象是某个类型的实例。比如狗是一个类型，哈士奇可以是狗的一个实例，也就是对象。 3，面向对象程序设计方式的特性： 3.1 万物皆为对象。
C语言 sizeof和strlen之间的那些事 C/C++软件开发求职面试题必备考点（一） aijuans C/C++求职面试必备考点
找工作在即，以后决定每天至少写一个知识点，主要是记录，逼迫自己动手、总结加深印象。当然如果能有一言半语让他人收益，后学幸运之至也。如有错误，还希望大家帮忙指出来。感激不尽。后学保证每个写出来的结果都是自己在电脑上亲自跑过的，咱人笨，以前学的也半吊子。很多时候只能靠运行出来的结果再反过来
程序员写代码时就不要管需求了吗？ asia007 程序员不能一味跟需求走
编程也有2年了，刚开始不懂的什么都跟需求走，需求是怎样就用代码实现就行，也不管这个需求是否合理，是否为较好的用户体验。当然刚开始编程都会这样，但是如果有了2年以上的工作经验的程序员只知道一味写代码，而不在写的过程中思考一下这个需求是否合理，那么，我想这个程序员就只能一辈写敲敲代码了。我的技术不是很好，但是就不代
Activity的四种启动模式百合不是茶 android 栈模式启动 Activity的标准模式启动栈顶模式启动单例模式启动
android界面的操作就是很多个activity之间的切换,启动模式决定启动的activity的生命周期 ; 启动模式xml中配置 <activity android:name=".MainActivity" android:launchMode="standard&quo
Spring中@Autowired标签与@Resource标签的区别 bijian1013 java spring @Resource @Autowired @Qualifier
Spring不但支持自己定义的@Autowired注解，还支持由JSR-250规范定义的几个注解，如：@Resource、 @PostConstruct及@PreDestroy。 1. @Autowired @Autowired是Spring 提供的，需导入 Package:org.springframewo
Changes Between SOAP 1.1 and SOAP 1.2 sunjing Changes Enable SOAP 1.1 SOAP 1.2
JAX-WS SOAP Version 1.2 Part 0: Primer (Second Edition) SOAP Version 1.2 Part 1: Messaging Framework (Second Edition) SOAP Version 1.2 Part 2: Adjuncts (Second Edition) Which style of WSDL
【Hadoop二】Hadoop常用命令 bit1129 hadoop
以Hadoop运行Hadoop自带的wordcount为例， hadoop脚本位于/home/hadoop/hadoop-2.5.2/bin/hadoop，需要说明的是，这些命令的使用必须在Hadoop已经运行的情况下才能执行 Hadoop HDFS相关命令 hadoop fs -ls 列出HDFS文件系统的第一级文件和第一级
java异常处理（初级）白糖_ java DAO spring 虚拟机 Ajax
从学习到现在从事java开发一年多了，个人觉得对java只了解皮毛，很多东西都是用到再去慢慢学习，编程真的是一项艺术，要完成一段好的代码，需要懂得很多。最近项目经理让我负责一个组件开发，框架都由自己搭建，最让我头疼的是异常处理，我看了一些网上的源码，发现他们对异常的处理不是很重视，研究了很久都没有找到很好的解决方案。后来有幸看到一个200W美元的项目部分源码，通过他们对异常处理的解决方案，我终
记录整理-工作问题 braveCS 工作
1）那位同学还是CSV文件默认Excel打开看不到全部结果。以为是没写进去。同学甲说文件应该不分大小。后来log一下原来是有写进去。只是Excel有行数限制。那位同学进步好快啊。 2）今天同学说写文件的时候提示jvm的内存溢出。我马上反应说那就改一下jvm的内存大小。同学说改用分批处理了。果然想问题还是有局限性。改jvm内存大小只能暂时地解决问题，以后要是写更大的文件还是得改内存。想问题要长远啊
org.apache.tools.zip实现文件的压缩和解压，支持中文 bylijinnan apache
刚开始用java.util.Zip，发现不支持中文（网上有修改的方法，但比较麻烦）后改用org.apache.tools.zip org.apache.tools.zip的使用网上有更简单的例子下面的程序根据实际需求，实现了压缩指定目录下指定文件的方法 import java.io.BufferedReader; import java.io.BufferedWrit
读书笔记-4 chengxuyuancsdn 读书笔记
1、JSTL 核心标签库标签 2、避免SQL注入 3、字符串逆转方法 4、字符串比较compareTo 5、字符串替换replace 6、分拆字符串 1、JSTL 核心标签库标签共有13个，学习资料：http://www.cnblogs.com/lihuiyy/archive/2012/02/24/2366806.html 功能上分为4类： (1)表达式控制标签：out
[物理与电子]半导体教材的一个小问题 comsci 问题
各种模拟电子和数字电子教材中都有这个词汇-空穴书中对这个词汇的解释是; 当电子脱离共价键的束缚成为自由电子之后,共价键中就留下一个空位,这个空位叫做空穴我现在回过头翻大学时候的教材,觉得这个
Flashback Database --闪回数据库 daizj oracle 闪回数据库
Flashback 技术是以Undo segment中的内容为基础的，因此受限于UNDO_RETENTON参数。要使用flashback 的特性，必须启用自动撤销管理表空间。在Oracle 10g中， Flash back家族分为以下成员： Flashback Database， Flashback Drop，Flashback Query(分Flashback Query,Flashbac
简单排序:插入排序 dieslrae 插入排序
public void insertSort(int[] array){ int temp; for(int i=1;i<array.length;i++){ temp = array[i]; for(int k=i-1;k>=0;k--)
C语言学习六指针小示例、一维数组名含义，定义一个函数输出数组的内容 dcj3sjt126com c
# include <stdio.h> int main(void) { int * p; //等价于 int *p 也等价于 int* p; int i = 5; char ch = 'A'; //p = 5; //error //p = &ch; //error //p = ch; //error p = &i; //
centos下php redis扩展的安装配置3种方法 dcj3sjt126com redis
方法一 1.下载php redis扩展包代码如下复制代码 #wget http://redis.googlecode.com/files/redis-2.4.4.tar.gz 2 tar -zxvf 解压压缩包，cd /扩展包（进入扩展包然后运行phpize 一下是我环境中phpize的目录，/usr/local/php/bin/phpize (一定要
线程池(Executors) shuizhaosi888 线程池
在java类库中，任务执行的主要抽象不是Thread，而是Executor，将任务的提交过程和执行过程解耦 public interface Executor { void execute(Runnable command); } public class RunMain implements Executor{ @Override pub
openstack 快速安装笔记 haoningabc openstack
前提是要配置好yum源版本icehouse，操作系统redhat6.5 最简化安装，不要cinder和swift 三个节点 172 control节点keystone glance horizon 173 compute节点nova 173 network节点neutron control /etc/sysctl.conf net.ipv4.ip_forward =
从c面向对象的实现理解c++的对象（二） jimmee C++面向对象虚函数
1. 类就可以看作一个struct，类的方法，可以理解为通过函数指针的方式实现的，类对象分配内存时，只分配成员变量的，函数指针并不需要分配额外的内存保存地址。 2. c++中类的构造函数，就是进行内存分配(malloc)，调用构造函数 3. c++中类的析构函数，就时回收内存(free) 4. c++是基于栈和全局数据分配内存的，如果是一个方法内创建的对象，就直接在栈上分配内存了。专门在
如何让那个一个div可以拖动 lingfeng520240 html
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml
第10章高级事件（中） onestopweb 事件
index.html <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/
计算两个经纬度之间的距离 roadrunners 计算纬度 LBS 经度距离
要解决这个问题的时候，到网上查了很多方案，最后计算出来的都与百度计算出来的有出入。下面这个公式计算出来的距离和百度计算出来的距离是一致的。 /** * * @param longitudeA * 经度A点 * @param latitudeA * 纬度A点 * @param longitudeB *
最具争议的10个Java话题 tomcat_oracle java
1、Java8已经到来。什么！？ Java8 支持lambda。哇哦，RIP Scala！　　随着Java8 的发布，出现很多关于新发布的Java8是否有潜力干掉Scala的争论，最终的结论是远远没有那么简单。Java8可能已经在Scala的lambda的包围中突围，但Java并非是函数式编程王位的真正觊觎者。　　2、Java 9 即将到来　　 Oracle早在8月份就发布
zoj 3826 Hierarchical Notation(模拟) 阿尔萨斯 rar
题目链接：zoj 3826 Hierarchical Notation 题目大意：给定一些结构体，结构体有value值和key值，Q次询问，输出每个key值对应的value值。解题思路：思路很简单，写个类词法的递归函数，每次将key值映射成一个hash值，用map映射每个key的value起始终止位置，预处理完了查询就很简单了。这题是最后10分钟出的，因为没有考虑value为{}的情