Rich feature hierarchies for accurate object detection and semantic segmentation
Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik
Paper: https://openaccess.thecvf.com/content_cvpr_2014/papers/Girshick_Rich_Feature_Hierarchies_2014_CVPR_paper.pdf
Object detection performance, as measured on the canonical PASCAL VOC dataset, has plateaued in the last few years. The best-performing methods are complex ensemble systems that typically combine multiple low-level image features with high-level context. In this paper, we propose a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012—achieving a mAP of 53.3%. Our approach combines two key insights: (1) one can apply high-capacity convolutional neural networks (CNNs) to bottom-up region proposals in order to localize and segment objects and (2) when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost. Since we combine region proposals with CNNs, we call our method R-CNN: Regions with CNN features. We also present experiments that provide insight into what the network learns, revealing a rich hierarchy of image features. Source code for the complete system is available at http://www.cs.berkeley.edu/~rbg/rcnn.
Features matter. The last decade of progress on various visual recognition tasks has been based considerably on the use of SIFT [26] and HOG [7]. But if we look at performance on the canonical visual recognition task, PASCAL VOC object detection [12], it is generally acknowledged that progress has been slow during 2010-2012, with small gains obtained by building ensemble systems and employing minor variants of successful methods.
SIFT and HOG are blockwise orientation histograms, a representation we could associate roughly with complex cells in V1, the first cortical area in the primate visual pathway. But we also know that recognition occurs several stages downstream, which suggests that there might be hierarchical, multi-stage processes for computing features that are even more informative for visual recognition.
Fukushima’s “neocognitron” [16], a biologically-inspired hierarchical and shift-invariant model for pattern recognition, was an early attempt at just such a process. The neocognitron, however, lacked a supervised training algorithm. LeCun et al. [23] provided the missing algorithm by showing that stochastic gradient descent, via backpropagation, can train convolutional neural networks (CNNs), a class of models that extend the neocognitron.
CNNs saw heavy use in the 1990s (e.g., [24]), but then fell out of fashion, particularly in computer vision, with the rise of support vector machines. In 2012, Krizhevsky et al. [22] rekindled interest in CNNs by showing substantially higher image classification accuracy on the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) [9, 10]. Their success resulted from training a large CNN on 1.2 million labeled images, together with a few twists on LeCun’s CNN (e.g., max(x, 0) rectifying non-linearities and “dropout” regularization).
The significance of the ImageNet result was vigorously debated during the ILSVRC 2012 workshop. The central issue can be distilled to the following: To what extent do the CNN classification results on ImageNet generalize to object detection results on the PASCAL VOC Challenge?
We answer this question decisively by bridging the chasm between image classification and object detection. This paper is the first to show that a CNN can lead to dramatically higher object detection performance on PASCAL VOC as compared to systems based on simpler HOG-like features. (A tech report describing R-CNN first appeared at http://arxiv.org/abs/1311.2524v1 in Nov. 2013.) Achieving this result required solving two problems: localizing objects with a deep network and training a high-capacity model with only a small quantity of annotated detection data.
Unlike image classification, detection requires localizing (likely many) objects within an image. One approach frames localization as a regression problem. However, work from Szegedy et al. [31], concurrent with our own, indicates that this strategy may not fare well in practice (they report a mAP of 30.5% on VOC 2007 compared to the 58.5% achieved by our method). An alternative is to build a sliding-window detector. CNNs have been used in this way for at least two decades, typically on constrained object categories, such as faces [28, 33] and pedestrians [29]. In order to maintain high spatial resolution, these CNNs typically only have two convolutional and pooling layers. We also considered adopting a sliding-window approach. However, units high up in our network, which has five convolutional layers, have very large receptive fields (195 × 195 pixels) and strides (32×32 pixels) in the input image, which makes precise localization within the sliding-window paradigm an open technical challenge.
Instead, we solve the CNN localization problem by operating within the “recognition using regions” paradigm, as argued for by Gu et al. in [18]. At test-time, our method generates around 2000 category-independent region proposals for the input image, extracts a fixed-length feature vector from each proposal using a CNN, and then classifies each region with category-specific linear SVMs. We use a simple technique (affine image warping) to compute a fixed-size CNN input from each region proposal, regardless of the region’s shape. Figure 1 presents an overview of our method and highlights some of our results. Since our system combines region proposals with CNNs, we dub the method R-CNN: Regions with CNN features.
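To make the test-time flow concrete, here is a minimal Python sketch of that pipeline; `propose_regions`, `cnn_features`, and `nms` are hypothetical helpers standing in for selective search, the warped-region CNN forward pass, and per-class greedy non-maximum suppression.

```python
import numpy as np

def rcnn_detect(image, propose_regions, cnn_features, svm_W, svm_b, nms):
    """Minimal sketch of the R-CNN test-time pipeline. `propose_regions`,
    `cnn_features`, and `nms` are hypothetical helpers: a region-proposal
    generator, a feature extractor for one warped region, and per-class
    greedy non-maximum suppression (returning kept indices)."""
    boxes = np.asarray(propose_regions(image))                 # ~2000 class-agnostic proposals
    feats = np.stack([cnn_features(image, b) for b in boxes])  # (R, 4096)
    scores = feats @ svm_W + svm_b                             # (R, N): one linear SVM per class
    detections = []
    for cls in range(scores.shape[1]):
        for i in nms(boxes, scores[:, cls]):                   # suppress within each class
            detections.append((cls, boxes[i], scores[i, cls]))
    return detections
```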
A second challenge faced in detection is that labeled data is scarce and the amount currently available is insufficient for training a large CNN. The conventional solution to this problem is to use unsupervised pre-training, followed by supervised fine-tuning (e.g., [29]). The second major contribution of this paper is to show that supervised pretraining on a large auxiliary dataset (ILSVRC), followed by domain-specific fine-tuning on a small dataset (PASCAL), is an effective paradigm for learning high-capacity CNNs when data is scarce. In our experiments, fine-tuning for detection improves mAP performance by 8 percentage points. After fine-tuning, our system achieves a mAP of 54% on VOC 2010 compared to 33% for the highly-tuned, HOG-based deformable part model (DPM) [14, 17].
Our system is also quite efficient. The only class-specific computations are a reasonably small matrix-vector product and greedy non-maximum suppression. This computational property follows from features that are shared across all categories and that are also two orders of magnitude lower-dimensional than previously used region features (cf. [32]).
One advantage of HOG-like features is their simplicity: it’s easier to understand the information they carry (although [34] shows that our intuition can fail us). Can we gain insight into the representation learned by the CNN? Perhaps the densely connected layers, with more than 54 million parameters, are the key? They are not. We “lobotomized” the CNN and found that a surprisingly large proportion, 94%, of its parameters can be removed with only a moderate drop in detection accuracy. Instead, by probing units in the network we see that the convolutional layers learn a diverse set of rich features (Figure 3).
Understanding the failure modes of our approach is also critical for improving it, and so we report results from the detection analysis tool of Hoiem et al. [20]. As an immediate consequence of this analysis, we demonstrate that a simple bounding box regression method significantly reduces mislocalizations, which are the dominant error mode.
Before developing technical details, we note that because R-CNN operates on regions it is natural to extend it to the task of semantic segmentation. With minor modifications, we also achieve state-of-the-art results on the PASCAL VOC segmentation task, with an average segmentation accuracy of 47.9% on the VOC 2011 test set.
Our object detection system consists of three modules. The first generates category-independent region proposals. These proposals define the set of candidate detections available to our detector. The second module is a large convolutional neural network that extracts a fixed-length feature vector from each region. The third module is a set of class-specific linear SVMs. In this section, we present our design decisions for each module, describe their test-time usage, detail how their parameters are learned, and show results on PASCAL VOC 2010-12.
Region proposals. A variety of recent papers offer methods for generating category-independent region proposals. Examples include: objectness [1], selective search [32], category-independent object proposals [11], constrained parametric min-cuts (CPMC) [5], multi-scale combinatorial grouping [3], and Cireşan et al. [6], who detect mitotic cells by applying a CNN to regularly-spaced square crops, which are a special case of region proposals. While R-CNN is agnostic to the particular region proposal method, we use selective search to enable a controlled comparison with prior detection work (e.g., [32, 35]).
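For readers who want to reproduce this step today, a selective-search implementation ships with OpenCV's contrib modules; the snippet below is a sketch assuming the opencv-contrib-python package and a hypothetical input path (the original experiments used the authors' own implementation).

```python
import cv2

img = cv2.imread("input.jpg")  # hypothetical input path
ss = cv2.ximgproc.segmentation.createSelectiveSearchSegmentation()
ss.setBaseImage(img)
ss.switchToSelectiveSearchFast()  # analogous to the "fast mode" used in our experiments
rects = ss.process()              # proposals as (x, y, w, h) boxes
print(len(rects), "region proposals")
```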
Feature extraction. We extract a 4096-dimensional feature vector from each region proposal using the Caffe [21] implementation of the CNN described by Krizhevsky et al. [22]. Features are computed by forward propagating a mean-subtracted 227 × 227 RGB image through five convolutional layers and two fully connected layers. We refer readers to [21, 22] for more network architecture details.
In order to compute features for a region proposal, we must first convert the image data in that region into a form that is compatible with the CNN (its architecture requires inputs of a fixed 227 × 227 pixel size). Of the many possible transformations of our arbitrary-shaped regions, we opt for the simplest. Regardless of the size or aspect ratio of the candidate region, we warp all pixels in a tight bounding box around it to the required size. Prior to warping, we dilate the tight bounding box so that at the warped size there are exactly p pixels of warped image context around the original box (we use p = 16). Figure 2 shows a random sampling of warped training regions. The supplementary material discusses alternatives to warping.
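A minimal sketch of this dilate-then-warp step, assuming OpenCV and boxes given as (x1, y1, x2, y2); border handling is simplified here (crops are clipped to the image rather than padded with the image mean).

```python
import cv2
import numpy as np

def warp_region(image, box, out_size=227, p=16):
    """Dilate a tight box so that exactly p pixels of context surround the
    original box at the warped size, then anisotropically resize. A sketch:
    border handling is simplified relative to the paper."""
    x1, y1, x2, y2 = box
    w, h = x2 - x1, y2 - y1
    # At the output, the tight box spans out_size - 2p pixels, so p output
    # pixels of context correspond to side * p / (out_size - 2p) input pixels.
    dx = w * p / (out_size - 2 * p)
    dy = h * p / (out_size - 2 * p)
    H, W = image.shape[:2]
    x1, x2 = max(0, int(round(x1 - dx))), min(W, int(round(x2 + dx)))
    y1, y2 = max(0, int(round(y1 - dy))), min(H, int(round(y2 + dy)))
    return cv2.resize(image[y1:y2, x1:x2], (out_size, out_size))
```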
At test time, we run selective search on the test image to extract around 2000 region proposals (we use selective search’s “fast mode” in all experiments). We warp each proposal and forward propagate it through the CNN in order to read off features from the desired layer. Then, for each class, we score each extracted feature vector using the SVM trained for that class. Given all scored regions in an image, we apply a greedy non-maximum suppression (for each class independently) that rejects a region if it has an intersection-over-union (IoU) overlap with a higher scoring selected region larger than a learned threshold.
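A standard greedy NMS implementation consistent with this description, sketched in NumPy for one class (the threshold is learned per class; 0.3 below is only an illustrative default):

```python
import numpy as np

def greedy_nms(boxes, scores, iou_threshold=0.3):
    """Greedy per-class non-maximum suppression: keep the highest-scoring box,
    reject remaining boxes whose IoU with a kept box exceeds the threshold.
    `boxes` is (R, 4) as (x1, y1, x2, y2) floats."""
    order = np.argsort(scores)[::-1]  # highest score first
    keep = []
    while order.size > 0:
        i = order[0]
        keep.append(i)
        rest = order[1:]
        # Intersection of box i with all remaining boxes.
        xx1 = np.maximum(boxes[i, 0], boxes[rest, 0])
        yy1 = np.maximum(boxes[i, 1], boxes[rest, 1])
        xx2 = np.minimum(boxes[i, 2], boxes[rest, 2])
        yy2 = np.minimum(boxes[i, 3], boxes[rest, 3])
        inter = np.maximum(0, xx2 - xx1) * np.maximum(0, yy2 - yy1)
        area_i = (boxes[i, 2] - boxes[i, 0]) * (boxes[i, 3] - boxes[i, 1])
        area_r = (boxes[rest, 2] - boxes[rest, 0]) * (boxes[rest, 3] - boxes[rest, 1])
        iou = inter / (area_i + area_r - inter)
        order = rest[iou <= iou_threshold]
    return keep
```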
Run-time analysis. Two properties make detection efficient. First, all CNN parameters are shared across all categories. Second, the feature vectors computed by the CNN are low-dimensional when compared to other common approaches, such as spatial pyramids with bag-of-visual-word encodings. The features used in the UVA detection system [32], for example, are two orders of magnitude larger than ours (360k vs. 4k-dimensional).
The result of such sharing is that the time spent computing region proposals and features (13s/image on a GPU or 53s/image on a CPU) is amortized over all classes. The only class-specific computations are dot products between features and SVM weights and non-maximum suppression. In practice, all dot products for an image are batched into a single matrix-matrix product. The feature matrix is typically 2000 × 4096 and the SVM weight matrix is 4096 × N , where N is the number of classes.
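In NumPy terms (random values standing in for real features and learned weights), the per-image scoring step is a single matrix product:

```python
import numpy as np

R, D, N = 2000, 4096, 20                             # proposals, feature dim, classes
features = np.random.randn(R, D).astype(np.float32)  # stand-in CNN features
svm_W = np.random.randn(D, N).astype(np.float32)     # one column per class SVM
svm_b = np.zeros(N, dtype=np.float32)

# All per-class dot products for one image collapse into a single
# 2000 x 4096 by 4096 x N matrix product:
scores = features @ svm_W + svm_b                    # (2000, N) class scores
```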
This analysis shows that R-CNN can scale to thousands of object classes without resorting to approximate techniques, such as hashing. Even if there were 100k classes, the resulting matrix multiplication takes only 10 seconds on a modern multi-core CPU. This efficiency is not merely the result of using region proposals and shared features. The UVA system, due to its high-dimensional features, would be two orders of magnitude slower while requiring 134GB of memory just to store 100k linear predictors, compared to just 1.5GB for our lower-dimensional features.
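The memory figures follow from simple arithmetic over 4-byte floats:

```python
GiB = 2**30
classes, bytes_per_float = 100_000, 4

# Weight storage for 100k linear predictors (biases negligible):
ours = 4096    * classes * bytes_per_float / GiB  # ≈ 1.5 GiB of SVM weights
uva  = 360_000 * classes * bytes_per_float / GiB  # ≈ 134 GiB for 360k-dim features
print(f"4k-dim: {ours:.1f} GiB; 360k-dim: {uva:.1f} GiB")
```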
It is also interesting to contrast R-CNN with the recent work from Dean et al. on scalable detection using DPMs and hashing [8]. They report a mAP of around 16% on VOC 2007 at a run-time of 5 minutes per image when introducing 10k distractor classes. With our approach, 10k detectors can run in about a minute on a CPU, and because no approximations are made mAP would remain at 59% (Section 3.2).
Supervised pre-training. We discriminatively pre-trained the CNN on a large auxiliary dataset (ILSVRC 2012) with image-level annotations (i.e., no bounding box labels). Pretraining was performed using the open source Caffe CNN library [21]. In brief, our CNN nearly matches the performance of Krizhevsky et al. [22], obtaining a top-1 error rate 2.2 percentage points higher on the ILSVRC 2012 validation set. This discrepancy is due to simplifications in the training process.
Domain-specific fine-tuning. To adapt our CNN to the new task (detection) and the new domain (warped VOC windows), we continue stochastic gradient descent (SGD) training of the CNN parameters using only warped region proposals from VOC. Aside from replacing the CNN’s ImageNet-specific 1000-way classification layer with a randomly initialized 21-way classification layer (for the 20 VOC classes plus background), the CNN architecture is unchanged. We treat all region proposals with ≥ 0.5 IoU overlap with a ground-truth box as positives for that box’s class and the rest as negatives. We start SGD at a learning rate of 0.001 (1/10th of the initial pre-training rate), which allows fine-tuning to make progress while not clobbering the initialization. In each SGD iteration, we uniformly sample 32 positive windows (over all classes) and 96 background windows to construct a mini-batch of size 128. We bias the sampling towards positive windows because they are extremely rare compared to background.
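A sketch of the biased mini-batch sampling described above; `windows` and `labels` are hypothetical arrays of warped proposals and their precomputed IoU-derived labels (0 = background, 1..20 = VOC classes).

```python
import numpy as np

def sample_minibatch(windows, labels, rng, n_pos=32, n_bg=96):
    """One SGD mini-batch as in fine-tuning: 32 positives (windows with
    IoU >= 0.5 against a ground-truth box, labeled with that box's class)
    and 96 background windows, for a batch of 128. Positives are
    oversampled because they are extremely rare compared to background."""
    pos_idx = np.flatnonzero(labels > 0)
    bg_idx = np.flatnonzero(labels == 0)
    batch = np.concatenate([
        rng.choice(pos_idx, n_pos, replace=pos_idx.size < n_pos),
        rng.choice(bg_idx, n_bg, replace=False),
    ])
    return windows[batch], labels[batch]
```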
Once features are extracted and training labels are applied, we optimize one linear SVM per class. Since the training data is too large to fit in memory, we adopt the standard hard negative mining method [14, 30]. Hard negative mining converges quickly and in practice mAP stops increasing after only a single pass over all images.
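A rough sketch of that training loop, assuming hypothetical `train_svm` and `score` helpers (e.g., a liblinear wrapper and a margin scorer); implementations often also evict easy negatives from the cache.

```python
import numpy as np

def hard_negative_mining(train_svm, score, pos_feats, neg_batches, margin=-1.0):
    """Rough sketch of standard hard negative mining [14, 30]. All negatives
    cannot fit in memory, so we stream them (e.g., image by image), keep only
    those the current model scores above the margin, and retrain on the
    growing cache. `neg_batches` is assumed to be a list of feature arrays."""
    cache, w = [], None
    for neg_feats in neg_batches:
        if w is None:
            hard = neg_feats                                # first pass: everything is unscored
        else:
            hard = neg_feats[score(w, neg_feats) > margin]  # margin violators only
        if len(hard):
            cache.append(hard)
            w = train_svm(pos_feats, np.concatenate(cache))
    return w
```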
In supplementary material we discuss why the positive and negative examples are defined differently in fine-tuning versus SVM training. We also discuss why it’s necessary to train detection classifiers rather than simply use outputs from the final layer (fc8) of the fine-tuned CNN.
Following the PASCAL VOC best practices [12], we validated all design decisions and hyperparameters on the VOC 2007 dataset (Section 3.2). For final results on the VOC 2010-12 datasets, we fine-tuned the CNN on VOC 2012 train and optimized our detection SVMs on VOC 2012 trainval. We submitted test results to the evaluation server only once for each of the two major algorithm variants (with and without bounding box regression).
Table 1 shows complete results on VOC 2010. We compare our method against four strong baselines, including SegDPM [15], which combines DPM detectors with the output of a semantic segmentation system [4] and uses additional inter-detector context and image-classifier rescoring. The most germane comparison is to the UVA system from Uijlings et al. [32], since our systems use the same region proposal algorithm. To classify regions, their method builds a four-level spatial pyramid and populates it with densely sampled SIFT, Extended OpponentSIFT, and RGBSIFT descriptors, each vector quantized with 4000-word codebooks. Classification is performed with a histogram intersection kernel SVM. Compared to their multi-feature, non-linear kernel SVM approach, we achieve a large improvement in mAP, from 35.1% to 53.7% mAP, while also being much faster (Section 2.2). Our method achieves similar performance (53.3% mAP) on VOC 2011/12 test.
First-layer filters can be visualized directly and are easy to understand [22]. They capture oriented edges and opponent colors. Understanding the subsequent layers is more challenging. Zeiler and Fergus present a visually attractive deconvolutional approach in [36]. We propose a simple (and complementary) non-parametric method that directly shows what the network learned.
The idea is to single out a particular unit (feature) in the network and use it as if it were an object detector in its own right. That is, we compute the unit’s activations on a large set of held-out region proposals (about 10 million), sort the proposals from highest to lowest activation, perform nonmaximum suppression, and then display the top-scoring regions. Our method lets the selected unit “speak for itself” by showing exactly which inputs it fires on. We avoid averaging in order to see different visual modes and gain insight into the invariances computed by the unit.
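The procedure in code form, as a sketch; `region_feats` holds pool5 activations for the held-out proposals and `nms` is a hypothetical non-maximum suppression helper returning kept indices into its inputs.

```python
import numpy as np

def top_activations(unit, region_feats, boxes, nms, top_k=16):
    """Rank held-out region proposals (~10M in the paper) by one unit's
    activation, suppress near-duplicates, and return the top regions."""
    acts = region_feats[:, unit]           # the unit's activation per region
    order = np.argsort(acts)[::-1]         # highest activation first
    keep = nms(boxes[order], acts[order])  # drop overlapping regions
    return [(boxes[order[k]], acts[order[k]]) for k in keep[:top_k]]
```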
We visualize units from layer pool5, which is the max-pooled output of the network’s fifth and final convolutional layer. The pool5 feature map is 6 × 6 × 256 = 9216-dimensional. Ignoring boundary effects, each pool5 unit has a receptive field of 195 × 195 pixels in the original 227 × 227 pixel input. A central pool5 unit has a nearly global view, while one near the edge has a smaller, clipped support.
Each row in Figure 3 displays the top 16 activations for a pool5 unit from a CNN that we fine-tuned on VOC 2007 trainval. Six of the 256 functionally unique units are visualized (the supplementary material includes more). These units were selected to show a representative sample of what the network learns. In the second row, we see a unit that fires on dog faces and dot arrays. The unit corresponding to the third row is a red blob detector. There are also detectors for human faces and more abstract patterns such as text and triangular structures with windows. The network appears to learn a representation that combines a small number of class-tuned features together with a distributed representation of shape, texture, color, and material properties. The subsequent fully connected layer fc6 has the ability to model a large set of compositions of these rich features.
Performance layer-by-layer, without fine-tuning. To understand which layers are critical for detection performance, we analyzed results on the VOC 2007 dataset for each of the CNN’s last three layers. Layer pool5 was briefly described in Section 3.1. The final two layers are summarized below.
Layer fc6 is fully connected to pool5. To compute features, it multiplies a 4096 × 9216 weight matrix by the pool5 feature map (reshaped as a 9216-dimensional vector) and then adds a vector of biases. This intermediate vector is component-wise half-wave rectified (x ← max(0, x)).
Layer fc7 is the final layer of the network. It is implemented by multiplying the features computed by fc6 by a 4096 × 4096 weight matrix, and similarly adding a vector of biases and applying half-wave rectification.
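Both layers are plain affine maps followed by half-wave rectification; a NumPy sketch with random stand-ins for the learned weights:

```python
import numpy as np

# Illustrative weights; the real values come from the pre-trained network.
W6, b6 = np.random.randn(4096, 9216).astype(np.float32), np.zeros(4096, np.float32)
W7, b7 = np.random.randn(4096, 4096).astype(np.float32), np.zeros(4096, np.float32)

pool5 = np.random.randn(6, 6, 256).astype(np.float32)  # stand-in pool5 map
x = pool5.reshape(9216)             # 6 * 6 * 256 = 9216-dimensional vector

fc6 = np.maximum(0, W6 @ x + b6)    # matrix-vector product, bias, half-wave rectification
fc7 = np.maximum(0, W7 @ fc6 + b7)  # same pattern with a 4096 x 4096 matrix
```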
We start by looking at results from the CNN without fine-tuning on PASCAL, i.e. all CNN parameters were pretrained on ILSVRC 2012 only. Analyzing performance layer-by-layer (Table 2 rows 1-3) reveals that features from fc7 generalize worse than features from fc6. This means that 29%, or about 16.8 million, of the CNN’s parameters can be removed without degrading mAP. More surprising is that removing both fc7 and fc6 produces quite good results even though pool5 features are computed using only 6% of the CNN’s parameters. Much of the CNN’s representational power comes from its convolutional layers, rather than from the much larger densely connected layers. This finding suggests potential utility in computing a dense feature map, in the sense of HOG, of an arbitrary-sized image by using only the convolutional layers of the CNN. This representation would enable experimentation with sliding-window detectors, including DPM, on top of pool5 features.
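The percentages above follow from rough parameter counts (weights only; the conv-layer total is an approximation for this architecture):

```python
fc6 = 4096 * 9216        # ≈ 37.7M parameters
fc7 = 4096 * 4096        # ≈ 16.8M parameters: the removable "29%"
conv = 3.7e6             # approximate total for the five conv layers (assumption)
total = conv + fc6 + fc7
print(f"fc7: {fc7 / total:.0%} of parameters, conv layers: {conv / total:.0%}")
# -> fc7: 29% of parameters, conv layers: 6%
```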
Performance layer-by-layer, with fine-tuning. We now look at results from our CNN after having fine-tuned its parameters on VOC 2007 trainval. The improvement is striking (Table 2 rows 4-6): fine-tuning increases mAP by 8.0 percentage points to 54.2%. The boost from fine-tuning is much larger for fc6 and fc7 than for pool5, which suggests that the pool5 features learned from ImageNet are general and that most of the improvement is gained from learning domain-specific non-linear classifiers on top of them.
Comparison to recent feature learning methods. Relatively few feature learning methods have been tried on PASCAL VOC detection. We look at two recent approaches that build on deformable part models. For reference, we also include results for the standard HOG-based DPM [17].
The first DPM feature learning method, DPM ST [25], augments HOG features with histograms of “sketch token” probabilities. Intuitively, a sketch token is a tight distribution of contours passing through the center of an image patch. Sketch token probabilities are computed at each pixel by a random forest that was trained to classify 35 × 35 pixel patches into one of 150 sketch tokens or background.
The second method, DPM HSC [27], replaces HOG with histograms of sparse codes (HSC). To compute an HSC, sparse code activations are solved for at each pixel using a learned dictionary of 100 7 × 7 pixel (grayscale) atoms. The resulting activations are rectified in three ways (full and both half-waves), spatially pooled, unit ℓ2 normalized, and then power transformed (x ← sign(x)|x|^α).
All R-CNN variants strongly outperform the three DPM baselines (Table 2 rows 8-10), including the two that use feature learning. Compared to the latest version of DPM, which uses only HOG features, our mAP is more than 20 percentage points higher: 54.2% vs. 33.7%—a 61% relative improvement. The combination of HOG and sketch tokens yields 2.5 mAP points over HOG alone, while HSC improves over HOG by 4 mAP points (when compared internally to their private DPM baselines—both use nonpublic implementations of DPM that underperform the open source version [17]). These methods achieve mAPs of 29.1% and 34.3%, respectively.
We applied the excellent detection analysis tool from Hoiem et al. [20] in order to reveal our method’s error modes, understand how fine-tuning changes them, and to see how our error types compare with DPM. A full summary of the analysis tool is beyond the scope of this paper and we encourage readers to consult [20] to understand some finer details (such as “normalized AP”). Since the analysis is best absorbed in the context of the associated plots, we present the discussion within the captions of Figure 4 and Figure 5.
Based on the error analysis, we implemented a simple method to reduce localization errors. Inspired by the bounding box regression employed in DPM [14], we train a linear regression model to predict a new detection window given the pool5 features for a selective search region proposal. Full details are given in the supplementary material. Results in Table 1, Table 2, and Figure 4 show that this simple approach fixes a large number of mislocalized detections, boosting mAP by 3 to 4 points.
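The model takes the widely cited form from the paper's supplementary material: regress scale-invariant center offsets and log-space width/height scalings from pool5 features, then invert the transform at test time. A sketch with boxes given as (center x, center y, w, h):

```python
import numpy as np

def bbox_targets(P, G):
    """Regression targets: scale-invariant center offsets plus log-space
    width/height scalings between a proposal P and its ground-truth box G."""
    return np.array([(G[0] - P[0]) / P[2],
                     (G[1] - P[1]) / P[3],
                     np.log(G[2] / P[2]),
                     np.log(G[3] / P[3])])

def apply_regression(P, d):
    """Invert the transform at test time; d would come from a linear model
    (e.g., ridge regression) on the proposal's pool5 features."""
    return np.array([P[0] + P[2] * d[0],
                     P[1] + P[3] * d[1],
                     P[2] * np.exp(d[2]),
                     P[3] * np.exp(d[3])])
```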
Region classification is a standard technique for semantic segmentation, allowing us to easily apply R-CNN to the PASCAL VOC segmentation challenge. To facilitate a direct comparison with the current leading semantic segmentation system (called O2P for “second-order pooling”) [4], we work within their open source framework. O2P uses CPMC to generate 150 region proposals per image and then predicts the quality of each region, for each class, using support vector regression (SVR). The high performance of their approach is due to the quality of the CPMC regions and the powerful second-order pooling of multiple feature types (enriched variants of SIFT and LBP). We also note that Farabet et al. [13] recently demonstrated good results on several dense scene labeling datasets (not including PASCAL) using a CNN as a multi-scale per-pixel classifier.
We follow [2, 4] and extend the PASCAL segmentation training set to include the extra annotations made available by Hariharan et al. [19]. Design decisions and hyperparameters were cross-validated on the VOC 2011 validation set. Final test results were evaluated only once.
CNN features for segmentation. We evaluate three strategies for computing features on CPMC regions, all of which begin by warping the rectangular window around the region to 227 × 227. The first strategy (full) ignores the region’s shape and computes CNN features directly on the warped window, exactly as we did for detection. However, these features ignore the non-rectangular shape of the region. Two regions might have very similar bounding boxes while having very little overlap. Therefore, the second strategy (fg) computes CNN features only on a region’s foreground mask. We replace the background with the mean input so that background regions are zero after mean subtraction. The third strategy (full+fg) simply concatenates the full and fg features; our experiments validate their complementarity.
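A sketch of the three strategies, assuming the warped 227 × 227 crop and a matching warped foreground mask are already computed, and a hypothetical `cnn` feature extractor:

```python
import numpy as np

def segmentation_features(warped, warped_mask, mean, cnn, strategy="full+fg"):
    """Sketch of the three strategies on a CPMC region. `warped` is the
    227x227 warp of the region's bounding rectangle, `warped_mask` the
    region's foreground mask warped the same way, and `mean` the mean input."""
    full = cnn(warped - mean)                 # (full): shape-agnostic, as in detection
    fg_in = np.where(warped_mask[..., None], warped, mean)
    fg = cnn(fg_in - mean)                    # (fg): background is zero after mean subtraction
    if strategy == "full":
        return full
    if strategy == "fg":
        return fg
    return np.concatenate([full, fg])         # (full+fg): the complementary combination
```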
Results on VOC 2011. Table 3 shows a summary of our results on the VOC 2011 validation set compared with O2P. (See supplementary material for complete per-category results.) Within each feature computation strategy, layer fc6 always outperforms fc7 and the following discussion refers to the fc6 features. The fg strategy slightly outperforms full, indicating that the masked region shape provides a stronger signal, matching our intuition. However, full+fg achieves an average accuracy of 47.9%, our best result by a margin of 4.2% (also modestly outperforming O2P), indicating that the context provided by the full features is highly informative even given the fg features. Notably, training the 20 SVRs on our full+fg features takes an hour on a single core, compared to 10+ hours for training on O2P features.
In Table 4 we present results on the VOC 2011 test set, comparing our best-performing method, fc6 (full+fg), against two strong baselines. Our method achieves the highest segmentation accuracy for 11 out of 21 categories, and the highest overall segmentation accuracy of 47.9%, averaged across categories (but likely ties with the O2P result under any reasonable margin of error). Still better performance could likely be achieved by fine-tuning.
In recent years, object detection performance had stagnated. The best performing systems were complex ensembles combining multiple low-level image features with high-level context from object detectors and scene classifiers. This paper presents a simple and scalable object detection algorithm that gives a 30% relative improvement over the best previous results on PASCAL VOC 2012.
We achieved this performance through two insights. The first is to apply high-capacity convolutional neural networks to bottom-up region proposals in order to localize and segment objects. The second is a paradigm for training large CNNs when labeled training data is scarce. We show that it is highly effective to pre-train the network, with supervision, for an auxiliary task with abundant data (image classification) and then to fine-tune the network for the target task where data is scarce (detection). We conjecture that the “supervised pre-training/domain-specific fine-tuning” paradigm will be highly effective for a variety of data-scarce vision problems.
We conclude by noting that it is significant that we achieved these results by using a combination of classical tools from computer vision and deep learning (bottom-up region proposals and convolutional neural networks). Rather than opposing lines of scientific inquiry, the two are natural and inevitable partners.