菜菜菜菜菜菜菜

YOLOv3: An Incremental Improvement (YOLOv3 论文翻译)

英文版论文原文：https://pjreddie.com/media/files/papers/YOLOv3.pdf

YOLOv3:一个渐进的改进

YOLOv3: An Incremental Improvement

Joseph Redmon& Jinsong Zhao

华盛顿大学
University of Washington

Abstract

我们向YOLO提供一些更新！我们做了一些小的设计更改以使其更好。我们还培训了这个相当庞大的新网络。比上次要大一点，但更准确。不过速度还是很快的，请放心。 YOLOv3以320×320的速度运行时，在28.2 mAP的速度下运行时间为22毫秒，与SSD一样精确，但速度提高了三倍。当我们查看旧的.5 IOU mAP检测指标YOLOv3时，它是相当不错的。在Titan X上，它在51毫秒内可达到57：9的AP50，相比之下，RetinaNet在198毫秒内可达到57：5的AP50，性能相似，但速度快3.8倍。与往常一样，所有代码都可以在https://pjreddie.com/yolo/在线获得。

We present some updates to YOLO! We made a bunch of little design changes to make it better. We also trained this new network that’s pretty swell. It’s a little bigger than last time but more accurate. It’s still fast though, don’t worry. At 320 × 320 YOLOv3 runs in 22 ms at 28.2 mAP, as accurate as SSD but three times faster. When we look at the old .5 IOU mAP detection metric YOLOv3 is quite good. It achieves 57:9 AP50 in 51 ms on a Titan X, compared to 57:5 AP50 in 198 ms by RetinaNet, similar performance but 3.8× faster. As always, all the code is online at https://pjreddie.com/yolo/.

1. Introduction

有时候，您只需要拨入一年的电话，就知道吗？我今年没有做很多研究。在Twitter上花费了很多时间。和GAN一起玩了一点。去年[12] [1]我剩下一点动力。我设法对YOLO进行了一些改进。但是，老实说，没有什么比超级有趣的了，只是一堆小小的改进而已。我也帮助了其他人的研究。

Sometimes you just kinda phone it in for a year, you know? I didn’t do a whole lot of research this year. Spent a lot of time on Twitter. Played around with GANs a little. I had a little momentum left over from last year [12] [1]; I managed to make some improvements to YOLO. But, honestly, nothing like super interesting, just a bunch of small changes that make it better. I also helped out with other people’s research a little.

实际上，这就是今天把我们带到这里的原因。我们有一个可随时使用相机的截止日期[4]，我们需要引用我对YOLO所做的一些随机更新，但我们没有消息来源。因此，准备一份技术报告

Actually, that’s what brings us here today. We have a camera-ready deadline [4] and we need to cite some of the random updates I made to YOLO but we don’t have a source. So get ready for a TECH REPORT!

技术报告的优点在于它们不需要介绍，大家都知道我们为什么在这里。因此，本导论的结尾将为本文的其余部分指明路标。首先，我们将告诉您与YOLOv3达成的交易。然后，我们将告诉您我们的做法。我们还将告诉您一些我们尝试过的无效的事情。最后，我们将考虑所有这些。

The great thing about tech reports is that they don’t need intros, y’all know why we’re here. So the end of this introduction will signpost for the rest of the paper. First we’ll tell you what the deal is with YOLOv3. Then we’ll tell you how we do. We’ll also tell you about some things we tried that didn’t work. Finally we’ll contemplate what this all means.

2. The Deal

因此，这是与YOLOv3达成的交易：我们大多从别人那里吸取了好主意。我们还培训了一个新的分类器网络，该网络要比其他分类器更好。我们将带您从头开始学习整个系统，以便您可以全部了解。

So here’s the deal with YOLOv3: We mostly took good ideas from other people. We also trained a new classifier network that’s better than the other ones. We’ll just take you through the whole system from scratch so you can understand it all.

图1.我们根据Focal Loss论文[9]修改了该图。 YOLOv3的运行速度明显快于其他具有可比性能的检测方法。从M40或Titan X来看，它们基本上是相同的GPU。

Figure 1. We adapt this figure from the Focal Loss paper [9]. YOLOv3 runs significantly faster than other detection methods with comparable performance. Times from either an M40 or Titan X, they are basically the same GPU.

2.1. 边界框预测

2.1. Bounding Box Prediction

遵循YOLO9000，我们的系统使用尺寸簇作为锚定框来预测边界框[15]。网络为每个边界框 $t_x$ , $t_y$ , $t_w$ , $t_h$ 预测4个坐标。如果单元格从图像的左上角偏移 $c_x, c_y)$ ，并且先验边界框的宽度和高度为 $p_w$ , $p_h$ ，则预测对应于：

Following YOLO9000 our system predicts bounding boxes using dimension clusters as anchor boxes [15]. The network predicts 4 coordinates for each bounding box, $t_x$ , $t_y$ , $t_w$ , $t_h$ . If the cell is offset from the top left corner of the image by $c_x, c_y)$ and the bounding box prior has width and height $p_w$ , $p_h$ , then the predictions correspond to:

$b_x=\sigma(t_x)+c_x$
$b_y=\sigma(t_y)+c_y$
$b_w=p_we^{t_w}$
$b_h=p_he^{t_h}$

在训练期间，我们使用平方误差损失之和。如果某个坐标预测的地面真实值为 $\hat{t}_*$ ，则我们的梯度为地面真实值（从地面真实框计算得出）减去我们的预测： $\hat{t}_* − t_*$ 。通过倒转上述公式，可以很容易地计算出地面真实值。

During training we use sum of squared error loss. If the ground truth for some coordinate prediction is $\hat{t}_*$ our gradient is the ground truth value (computed from the ground truth box) minus our prediction: $\hat{t}_* − t_*$ . This ground truth value can be easily computed by inverting the equations above.

图2.具有尺寸先验和位置预测的边界框。我们预测盒子的宽度和高度为与簇质心的偏移量。我们使用sigmoi函数预测盒子相对于过滤器应用位置的中心坐标。这个数字公然自夸[15]。

Figure 2. Bounding boxes with dimension priors and location prediction. We predict the width and height of the box as offsets from cluster centroids. We predict the center coordinates of the box relative to the location of filter application using a sigmoi function. This figure blatantly self-plagiarized from [15].

YOLOv3使用逻辑回归预测每个边界框的客观性得分。如果边界框先验与地面真值对象的重叠量大于任何其他边界框先验，则应为1。如果边界框先验不是最好的，但是与地面真实对象的重叠超过某个阈值，我们将忽略预测[17]。我们使用的阈值为：5。与[17]不同，我们的系统仅为每个地面真值对象分配一个边界框。如果没有将边界框先验分配给地面真理对象，则不会对坐标或类别预测造成任何损失，而只会造成客观性的损失。

YOLOv3 predicts an objectness score for each bounding box using logistic regression. This should be 1 if the bounding box prior overlaps a ground truth object by more than any other bounding box prior. If the bounding box prior is not the best but does overlap a ground truth object by more than some threshold we ignore the prediction, following [17]. We use the threshold of :5. Unlike [17] our system only assigns one bounding box prior for each ground truth object. If a bounding box prior is not assigned to a ground truth object it incurs no loss for coordinate or class predictions, only objectness.

2.2. Class Prediction

每个框使用多标签分类预测边界框可能包含的类。我们不使用softmax，因为我们发现它不需要良好的性能，而是仅使用独立的逻辑分类器。在训练期间，我们使用二进制交叉熵损失进行类别预测。

Each box predicts the classes the bounding box may contain using multilabel classification. We do not use a softmax as we have found it is unnecessary for good performance, instead we simply use independent logistic classifiers. During training we use binary cross-entropy loss for the class predictions.

当我们移至开放图像数据集[7]等更复杂的领域时，这种表达方式会有所帮助。在此数据集中，有许多重叠的标签（即“女人”和“人”）。使用softmax会假设每个盒子只有一个类，而通常并非如此。多标签方法可以更好地对数据建模。

This formulation helps when we move to more complex domains like the Open Images Dataset [7]. In this dataset there are many overlapping labels (i.e. Woman and Person). Using a softmax imposes the assumption that each box has exactly one class which is often not the case. A multilabel approach better models the data.

2.3. Predictions Across Scales

YOLOv3预测3种不同比例的盒子。我们的系统使用相似的概念从金字塔尺度中提取特征，以金字塔网络为特征[8]。从基本特征提取器中，我们添加了几个卷积层。这些中的最后一个预测3D张量编码边界框，客观性和类预测。在我们用COCO [10]进行的实验中，我们预测每个尺度上有3个盒子，因此对于4个边界框偏移，1个客观性预测和80个类预测，张量为 $\times (4 + 1 + 80)]$ 。

YOLOv3 predicts boxes at 3 different scales. Our system extracts features from those scales using a similar concept to feature pyramid networks [8]. From our base feature extractor we add several convolutional layers. The last of these predicts a 3-d tensor encoding bounding box, objectness, and class predictions. In our experiments with COCO [10] we predict 3 boxes at each scale so the tensor is $\times (4 + 1 + 80)]$ for the 4 bounding box offsets, 1 objectness prediction, and 80 class predictions.

接下来，我们从先前的2层中获取特征图，并将其上采样2倍。我们还从网络中较早的地方获取了一个特征图，并使用串联将其与我们的上采样特征合并。这种方法使我们能够从上采样的特征中获取更有意义的语义信息，并从较早的特征图中获取更细粒度的信息。然后，我们再添加一些卷积层来处理此组合特征图，并最终预测出相似的张量，尽管现在的大小是原来的两倍。

Next we take the feature map from 2 layers previous and upsample it by 2×. We also take a feature map from earlier in the network and merge it with our upsampled features using concatenation. This method allows us to get more meaningful semantic information from the upsampled features and finer-grained information from the earlier feature map. We then add a few more convolutional layers to process this combined feature map, and eventually predict a similar tensor, although now twice the size.

我们再执行一次相同的设计，以预测最终比例的盒子。因此，我们对第3层的预测受益于所有先前的计算以及网络早期的细粒度功能。

We perform the same design one more time to predict boxes for the final scale. Thus our predictions for the 3rd scale benefit from all the prior computation as well as fine-grained features from early on in the network.

我们仍然使用k均值聚类来确定边界框先验。我们只是随意选择了9个聚类和3个比例，然后将这些聚类在各个比例之间平均分配。在COCO数据集上，9个聚类为： $(10 \times 13), (16 \times 30), (33 \times 23), (30 \times 61); (62 \times 45), (59 \times 119), (116 \times 90), (156 \times 198), (373 \times 326)$ 。

We still use k-means clustering to determine our bounding box priors. We just sort of chose 9 clusters and 3 scales arbitrarily and then divide up the clusters evenly across scales. On the COCO dataset the 9 clusters were: $(10 \times 13), (16 \times 30), (33 \times 23), (30 \times 61); (62 \times 45), (59 \times 119), (116 \times 90), (156 \times 198), (373 \times 326)$ .

2.4. Feature Extractor

我们使用一个新的网络来执行特征提取。我们的新网络是YOLOv2，Darknet-19中使用的网络与新的残留网络内容之间的一种混合方法。我们的网络使用了连续的3×3和1×1卷积层，但现在也有了一些快捷连接，并且规模更大。它有53个卷积层，所以我们称它为…等待它… Darknet-53！

We use a new network for performing feature extraction. Our new network is a hybrid approach between the network used in YOLOv2, Darknet-19, and that newfangled residual network stuff. Our network uses successive 3 × 3 and 1 × 1 convolutional layers but now has some shortcut connections as well and is significantly larger. It has 53 convolutional layers so we call it… wait for it… Darknet-53!

Table 1. Darknet-53.

这个新网络比Darknet-19强大得多，但仍比ResNet-101或ResNet-152高效。这是一些ImageNet结果：

This new network is much more powerful than Darknet-19 but still more efficient than ResNet-101 or ResNet-152. Here are some ImageNet results:

每个网络都经过相同设置的训练，并以256×256的单作物精度进行测试。运行时间是在Titan X上以256×256进行测量的。因此Darknet-53与最新的分类器具有同等的性能，但浮点运算更少，速度更高。 Darknet-53比ResNet-101更好，且速度是1：5倍。 Darknet-53具有与ResNet-152相似的性能，并且快2倍。

表2.骨干的比较。精度，数十亿次操作，每秒十亿次浮点操作以及各种网络的FPS。

Table 2. Comparison of backbones. Accuracy, billions of operations, billion floating point operations per second, and FPS for various networks.

Each network is trained with identical settings and tested at 256×256, single crop accuracy. Run times are measured on a Titan X at 256 × 256. Thus Darknet-53 performs on par with state-of-the-art classifiers but with fewer floating point operations and more speed. Darknet-53 is better than ResNet-101 and 1:5× faster. Darknet-53 has similar performance to ResNet-152 and is 2× faster.

Darknet-53还实现了每秒最高的测量浮点运算。这意味着网络结构可以更好地利用GPU，从而使其评估效率更高，从而速度更快。这主要是因为ResNets层太多了，效率也不高。

Darknet-53 also achieves the highest measured floating point operations per second. This means the network structure better utilizes the GPU, making it more efficient to evaluate and thus faster. That’s mostly because ResNets have just way too many layers and aren’t very efficient.

2.5. Training

我们仍然会训练完整的图像，而不会进行任何艰苦的负面挖掘。我们使用多尺度培训，大量数据扩充，批处理规范化以及所有标准内容。我们使用Darknet神经网络框架进行培训和测试[14]。

We still train on full images with no hard negative mining or any of that stuff. We use multi-scale training, lots of data augmentation, batch normalization, all the standard stuff. We use the Darknet neural network framework for training and testing [14].

3. How We Do

YOLOv3很好！参见表3。就COCO而言，平均平均AP度量标准很奇怪，与SSD变体相当，但速度提高了3倍。不过，在此指标上，它仍然比其他模型（例如RetinaNet）要落后很多。

YOLOv3 is pretty good! See table 3. In terms of COCOs weird average mean AP metric it is on par with the SSD variants but is 3× faster. It is still quite a bit behind other models like RetinaNet in this metric though.

但是，当我们以IOU =：5（或图表中的AP50）查看mAP的“旧”检测指标时，YOLOv3非常强大。它几乎与RetinaNet相当，并且远远超过SSD变体。这表明YOLOv3是一个非常强大的检测器，擅长于为物体制造体面的盒子。但是，随着IOU阈值的增加，性能会显着下降，这表明YOLOv3难以使盒子与对象完美对齐。

However, when we look at the “old” detection metric of mAP at IOU= :5 (or AP50 in the chart) YOLOv3 is very strong. It is almost on par with RetinaNet and far above the SSD variants. This indicates that YOLOv3 is a very strong detector that excels at producing decent boxes for objects. However, performance drops significantly as the IOU threshold increases indicating YOLOv3 struggles to get the boxes perfectly aligned with the object.

过去，YOLO一直在努力处理小物件。但是，现在我们看到了这种趋势的逆转。通过新的多尺度预测，我们看到YOLOv3具有相对较高的APS性能。但是，它在中型和大型对象上的性能相对较差。要深入了解这一点，还需要进行更多调查。

In the past YOLO struggled with small objects. However, now we see a reversal in that trend. With the new multi-scale predictions we see YOLOv3 has relatively high APS performance. However, it has comparatively worse performance on medium and larger size objects. More investigation is needed to get to the bottom of this.

当我们在AP50度量标准上绘制精度与速度的关系时（参见图5），我们看到YOLOv3比其他检测系统具有明显的优势。即更快，更好。

When we plot accuracy vs speed on the AP50 metric (see figure 5) we see YOLOv3 has significant benefits over other detection systems. Namely, it’s faster and better.

4.我们尝试过的无效的事情

4. Things We Tried That Didn’t Work

在开发YOLOv3时，我们尝试了很多东西。很多都行不通。这是我们能记住的东西。

We tried lots of stuff while we were working on YOLOv3. A lot of it didn’t work. Here’s the stuff we can remember.

锚框 $x$ , $y$ 偏移量预测。我们尝试使用普通锚框预测机制，在该机制中，您可以使用线性激活将 $x$ , $y$ 偏移量预测为框宽度或高度的倍数。我们发现此公式降低了模型的稳定性，并且效果不佳。

Anchor box $x$ , $y$ offset predictions. We tried using the normal anchor box prediction mechanism where you predict the $x$ , $y$ offset as a multiple of the box width or height using a linear activation. We found this formulation decreased model stability and didn’t work very well.

线性 $x$ , $y$ 预测而非逻辑预测。我们尝试使用线性激活来直接预测 $x$ , $y$ 偏移量，而不是逻辑激活。这导致mAP下降了两点

Linear $x$ , $y$ predictions instead of logistic. We tried using a linear activation to directly predict the $x$ , $y$ offset instead of the logistic activation. This led to a couple point drop in mAP

失焦。我们尝试使用焦点损失。它降低了我们的mAP大约2点。 YOLOv3可能已经对焦点损失试图解决的问题具有鲁棒性，因为它具有独立的客观性预测和条件类预测。因此，对于大多数示例而言，分类预测不会带来损失吗？或者其他的东西？我们不太确定。

Focal loss. We tried using focal loss. It dropped our mAP about 2 points. YOLOv3 may already be robust to the problem focal loss is trying to solve because it has separate objectness predictions and conditional class predictions. Thus for most examples there is no loss from the class predictions? Or something? We aren’t totally sure.

双IOU阈值和真值分配。更快的RCNN在训练期间使用两个IOU阈值。如果预测与基本事实的重叠量为0.7，则为正例；由[.3-.7]的预测将被忽略，对于所有基本实物，小于0.3则为否定例。我们尝试了类似的策略，但未取得良好的效果。

Dual IOU thresholds and truth assignment. Faster RCNN uses two IOU thresholds during training. If a prediction overlaps the ground truth by .7 it is as a positive example, by [.3−.7] it is ignored, less than .3 for all ground truth objects it is a negative example. We tried a similar strategy but couldn’t get good results.

我们非常喜欢我们目前的表述，似乎至少是局部最优。这些技术中的某些可能最终会产生良好的结果，也许它们只需要进行一些调整即可稳定训练。

We quite like our current formulation, it seems to be at a local optima at least. It is possible that some of these techniques could eventually produce good results, perhaps they just need some tuning to stabilize the training.

表3.我很认真地只是从[9]中偷走了所有这些表格，它们花了很长时间才能从头开始制作。好的，YOLOv3一切正常。请记住，RetinaNet的图像处理时间要长3：8倍。 YOLOv3比SSD变种要好得多，可与AP50指标上的最新模型相媲美。

Table 3. I’m seriously just stealing all these tables from [9] they take soooo long to make from scratch. Ok, YOLOv3 is doing alright. Keep in mind that RetinaNet has like 3:8× longer to process an image. YOLOv3 is much better than SSD variants and comparable to state-of-the-art models on the AP50 metric.

图3.再次根据[9]进行改编，这次显示了在mAP上以0.5 IOU度量标准的速度/精度折衷。您可以说YOLOv3很好，因为它很高而且离左边很远。你可以引用自己的论文吗？猜猜谁会尝试，这个家伙！[16]。哦，我忘了，我们还修复了YOLOv2中的数据加载错误，该错误通过2 mAP的帮助而得以解决。只是潜入这里不放弃布局。

Figure 3. Again adapted from the [9], this time displaying speed/accuracy tradeoff on the mAP at .5 IOU metric. You can tell YOLOv3 is good because it’s very high and far to the left. Can you cite your own paper? Guess who’s going to try, this guy ! [16]. Oh, I forgot, we also fix a data loading bug in YOLOv2, that helped by like 2 mAP. Just sneaking this in here to not throw off layout.

5. What This All Means

YOLOv3是一个很好的检测器。快速，准确。在.5至.95 IOU度量标准之间的COCO平均AP效果不佳。但是，对于.5 IOU的旧检测指标而言，这非常好。

YOLOv3 is a good detector. It’s fast, it’s accurate. It’s not as great on the COCO average AP between .5 and .95 IOU metric. But it’s very good on the old detection metric of .5 IOU.

为什么我们仍要转换指标？原始的COCO论文只是这样一个含糊的句子：“评估服务器完成后，将添加对评估指标的完整讨论”。 Russakovsky等人的报告指出，人类很难区分.3和.5的IOU！ “训练人员视觉检查IOU为0.3的边界框并将其与IOU 0.5的边界框区别开来是非常困难的。” [18]如果人类很难分辨出差异，那么这有多重要？

Why did we switch metrics anyway? The original COCO paper just has this cryptic sentence: “A full discussion of evaluation metrics will be added once the evaluation server is complete”. Russakovsky et al report that that humans have a hard time distinguishing an IOU of .3 from .5! “Training humans to visually inspect a bounding box with IOU of 0.3 and distinguish it from one with IOU 0.5 is surprisingly difficult.” [18] If humans have a hard time telling the difference, how much does it matter?

但是也许更好的问题是：“既然有了探测器，我们将如何处理这些探测器？”许多从事这项研究的人都在Google和Facebook上。我想至少我们知道该技术掌握得很好，并且绝对不会被用来收集您的个人信息并将其出售给…。等等，您是在说这正是它的用途？？哦。

But maybe a better question is: “What are we going to do with these detectors now that we have them?” A lot of the people doing this research are at Google and Facebook. I guess at least we know the technology is in good hands and definitely won’t be used to harvest your personal information and sell it to… wait, you’re saying that’s exactly what it will be used for?? Oh.

好吧，那些为视觉研究投入大量资金的人是军人，他们从未做过任何可怕的事情，例如用新技术杀死许多人，等等… 1

Well the other people heavily funding vision research are the military and they’ve never done anything horrible like killing lots of people with new technology oh wait…1

我非常希望大多数使用计算机视觉的人都在用它做快乐的好事，例如计算国家公园中斑马的数量[13]或在猫徘徊在房子周围时追踪它们的猫[19]。]。但是计算机视觉已经被质疑使用，作为研究人员，我们有责任至少考虑我们的工作可能造成的危害并想办法减轻它。我们欠世界那么多。

I have a lot of hope that most of the people using computer vision are just doing happy, good stuff with it, like counting the number of zebras in a national park [13], or tracking their cat as it wanders around their house [19]. But computer vision is already being put to questionable use and as researchers we have a responsibility to at least consider the harm our work might be doing and think of ways to mitigate it. We owe the world that much.

最后，不要@我。（因为我终于退出了Twitter）。

In closing, do not @ me. (Because I finally quit Twitter).

References

[1] Analogy. Wikipedia, Mar 2018. 1

[2] M. Everingham, L. Van Gool, C. K. Williams, J. Winn, and A. Zisserman. The pascal visual object classes (voc) challenge. International journal of computer vision, 88(2):303–
338, 2010. 6

[3] C.-Y. Fu, W. Liu, A. Ranga, A. Tyagi, and A. C. Berg. Dssd: Deconvolutional single shot detector. arXiv preprint arXiv:1701.06659, 2017. 3

[4] D. Gordon, A. Kembhavi, M. Rastegari, J. Redmon, D. Fox, and A. Farhadi. Iqa: Visual question answering in interactive environments. arXiv preprint arXiv:1712.03316, 2017. 1

[5] K. He, X. Zhang, S. Ren, and J. Sun. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016. 3

[6] J. Huang, V. Rathod, C. Sun, M. Zhu, A. Korattikara, A. Fathi, I. Fischer, Z. Wojna, Y. Song, S. Guadarrama, et al. Speed/accuracy trade-offs for modern convolutional object detectors. 3

[7] I. Krasin, T. Duerig, N. Alldrin, V. Ferrari, S. Abu-El-Haija, A. Kuznetsova, H. Rom, J. Uijlings, S. Popov, A. Veit, S. Belongie, V. Gomes, A. Gupta, C. Sun, G. Chechik, D. Cai, Z. Feng, D. Narayanan, and K. Murphy. Openimages: A public dataset for large-scale multi-label and multi-class image classification. Dataset available from https://github.com/openimages, 2017. 2

[8] T.-Y. Lin, P. Dollar, R. Girshick, K. He, B. Hariharan, and S. Belongie. Feature pyramid networks for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2117–2125, 2017. 2, 3

[9] T.-Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollar. ´ Focal loss for dense object detection. arXiv preprint arXiv:1708.02002, 2017. 1, 3, 4

[10] T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, and C. L. Zitnick. Microsoft coco: Com- ´ mon objects in context. In European conference on computer vision, pages 740–755. Springer, 2014. 2

[11] W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.- Y. Fu, and A. C. Berg. Ssd: Single shot multibox detector. In European conference on computer vision, pages 21–37. Springer, 2016. 3

[12] I. Newton. Philosophiae naturalis principia mathematica. William Dawson & Sons Ltd., London, 1687. 1

[13] J. Parham, J. Crall, C. Stewart, T. Berger-Wolf, and D. Rubenstein. Animal population censusing at scale with citizen science and photographic identification. 2017. 4

[14] J. Redmon. Darknet: Open source neural networks in c. http://pjreddie.com/darknet/, 2013–2016. 3

[15] J. Redmon and A. Farhadi. Yolo9000: Better, faster, stronger. In Computer Vision and Pattern Recognition (CVPR), 2017 IEEE Conference on, pages 6517–6525. IEEE, 2017. 1, 2, 3

[16] J. Redmon and A. Farhadi. Yolov3: An incremental improvement. arXiv, 2018. 4

[17] S. Ren, K. He, R. Girshick, and J. Sun. Faster r-cnn: Towards real-time object detection with region proposal networks. arXiv preprint arXiv:1506.01497, 2015. 2

[18] O. Russakovsky, L.-J. Li, and L. Fei-Fei. Best of both worlds: human-machine collaboration for object annotation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2121–2131, 2015. 4

[19] M. Scott. Smart camera gimbal bot scanlime:027, Dec 2017. 4

[20] A. Shrivastava, R. Sukthankar, J. Malik, and A. Gupta. Beyond skip connections: Top-down modulation for object detection. arXiv preprint arXiv:1612.06851, 2016. 3

[21] C. Szegedy, S. Ioffe, V. Vanhoucke, and A. A. Alemi. Inception-v4, inception-resnet and the impact of residual connections on learning. 2017. 3

反驳

Rebuttal

图4.零轴图表可能在理论上更诚实……我们仍然可以使用变量来使自己看起来不错！

Figure 4. Zero-axis charts are probably more intellectually honest… and we can still screw with the variables to make ourselves look good!

我们要感谢Reddit评论员，同事，电子邮件发送者，以及走廊上的欢呼声，感谢他们的可爱，由衷的话。如果您像我一样，正在审查ICCV，那么我们知道您可能还会阅读其他37篇论文，您将不可避免地推迟到最后一周，然后在该领域中有一些传奇人物通过电子邮件向您发送有关您应该如何完成的论文这些评论只是不清楚他们在说什么，也许他们来自未来？无论如何，如果没有您过去自己过去所做的所有工作，这篇论文将不会变成及时的事情，但是只有一点点前进，直到现在为止都没有。而且，如果您发了推文，我不会知道。只是在说。

We would like to thank the Reddit commenters, labmates, emailers, and passing shouts in the hallway for their lovely, heartfelt words. If you, like me, are reviewing for ICCV then we know you probably have 37 other papers you could be reading that you’ll invariably put off until the last week and then have some legend in the field email you about how you really should finish those reviews execept it won’t entirely be clear what they’re saying and maybe they’re from the future? Anyway, this paper won’t have become what it will in time be without all the work your past selves will have done also in the past but only a little bit further forward, not like all the way until now forward. And if you tweeted about it I wouldn’t know. Just sayin.

审稿人＃2 AKA丹·格罗斯曼（笑的是谁呢）坚持认为，我在这里指出我们的图不是一个而是两个非零的原点。丹，您说的完全正确，这是因为看起来比承认我们自己都在战胜2-3％的行动计划更好。但是这是要求的图形。我也加入了FPS，因为当我们在FPS上绘图时，我们看起来就像是超级棒。

Reviewer #2 AKA Dan Grossman (lol blinding who does that) insists that I point out here that our graphs have not one but two non-zero origins. You’re absolutely right Dan, that’s because it looks way better than admitting to ourselves that we’re all just here battling over 2-3% mAP. But here are the requested graphs. I threw in one with FPS too because we look just like super good when we plot on FPS.

评论者4在Reddit上的AKA JudasAdventus写道：“有趣的阅读，但反对MSCOCO指标的论点似乎有些虚弱”。好吧，我一直都知道你会成为打开我犹大的人。您知道在进行项目时是如何进行的，而且只能顺利进行，因此您必须找出某种方法来证明您所做的工作真的很酷吗？我基本上是想这样做，并且对COCO指标大加抨击。但是，既然我已经放完了这座山丘，我不妨死在它上面。

Reviewer #4 AKA JudasAdventus on Reddit writes “Entertaining read but the arguments against the MSCOCO metrics seem a bit weak”. Well, I always knew you would be the one to turn on me Judas. You know how when you work on a project and it only comes out alright so you have to figure out some way to justify how what you did actually was pretty cool? I was basically trying to do that and I lashed out at the COCO metrics a little bit. But now that I’ve staked out this hill I may as well die on it.

看到问题了，mAP已经有点坏了，因此对其进行更新也许可以解决一些问题，或者至少说明为什么更新版本在某种程度上更好。这就是我遇到的最大问题是缺乏合理性。对于PASCAL VOC，将IOU阈值“故意设置得较低，以解决地面真实数据中边界框中的不准确性” [2]。 COCO的标签是否比VOC更好？绝对有可能，因为COCO带有分割蒙版，也许标签更值得信赖，因此我们不必担心准确性。但同样，我的问题是缺乏正当性。

See here’s the thing, mAP is already sort of broken so an update to it should maybe address some of the issues with it or at least justify why the updated version is better in some way. And that’s the big thing I took issue with was the lack of justification. For PASCAL VOC, the IOU threshold was ”set deliberately low to account for inaccuracies in bounding boxes in the ground truth data“ [2]. Does COCO have better labelling than VOC? This is definitely possible since COCO has segmentation masks maybe the labels are more trustworthy and thus we aren’t as worried about inaccuracy. But again, my problem was the lack of justification.

COCO度量标准强调更好的边界框，但强调必须意味着它不再强调其他内容，在这种情况下，是分类准确性。是否有充分的理由认为更精确的边界框比更好的分类更重要？未分类的示例比稍微移动的边界框更明显。

The COCO metric emphasizes better bounding boxes but that emphasis must mean it de-emphasizes something else, in this case classification accuracy. Is there a good reason to think that more precise bounding boxes are more important than better classification? A miss-classified example is much more obvious than a bounding box that is slightly shifted.

mAP已经搞砸了，因为重要的是按类别排序。例如，如果您的测试集仅包含这两个图像，则根据mAP，产生这些结果的两个检测器就如常：

mAP is already screwed up because all that matters is per-class rank ordering. For example, if your test set only has these two images then according to mAP two detectors that produce these results are JUST AS GOOD:

图5.根据这两个图像的mAP，这两个假设检测器是完美的。他们俩都是完美的。完全相等。

Figure 5. These two hypothetical detectors are perfect according to mAP over these two images. They are both perfect. Totally equal.

现在，这显然是对mAP问题的过分夸张，但是我想我最近重新定义的一点是，“现实世界”中的人们所关心的与我们当前的度量标准之间存在如此明显的差异。要提出新的指标，我们应该关注这些差异。另外，例如，它已经是平均精度了，我们甚至可以将COCO指标称为平均平均年龄精度吗？

Now this is OBVIOUSLY an over-exaggeration of the problems with mAP but I guess my newly retconned point is that there are such obvious discrepancies between what people in the “real world” would care about and our current metrics that I think if we’re going to come up with new metrics we should focus on these discrepancies. Also, like, it’s already mean average precision, what do we even call the COCO metric, average mean average precision?

这是一个建议，给人们真正关心的是图像和检测器，检测器对图像中的对象进行查找和分类的程度如何。摆脱每个类别的AP而仅执行全球平均精度又如何呢？还是对每个图像进行AP计算并求平均值？

Here’s a proposal, what people actually care about is given an image and a detector, how well will the detector find and classify objects in the image. What about getting rid of the per-class AP and just doing a global average precision? Or doing an AP calculation per-image and averaging over that?

无论如何，盒子都是愚蠢的，我可能是面具的真正信徒，但我无法让YOLO来学习它们。

Boxes are stupid anyway though, I’m probably a true believer in masks except I can’t get YOLO to learn them.

你可能感兴趣的:(英文论文翻译)

【练习】PAT 乙 1078 字符串压缩与解压柠石榴输入输出 PAT 题解有阻碍算法 c++
题目文本压缩有很多种方法，这里我们只考虑最简单的一种：把由相同字符组成的一个连续的片段用这个字符和片段中含有这个字符的个数来表示。例如ccccc就用5c来表示。如果字符没有重复，就原样输出。例如aba压缩后仍然是aba。解压方法就是反过来，把形如5c这样的表示恢复为ccccc。本题需要你根据压缩或解压的要求，对给定字符串进行处理。这里我们简单地假设原始字符串是完全由英文字母和空格组成的非空字符串。
2025-03-15 学习记录--C/C++-PTA 练习3-4 统计字符小呀小萝卜儿学习-C/C++学习 c语言
合抱之木，生于毫末；九层之台，起于累土；千里之行，始于足下。一、题目描述⭐️练习3-4统计字符本题要求编写程序，输入10个字符，统计其中英文字母、空格或回车、数字字符和其他字符的个数。输入格式:输入为10个字符。最后一个回车表示输入结束，不算在内。输出格式:在一行内按照letter=英文字母个数,blank=空格或回车个数,digit=数字字符个数,other=其他字符个数的格式输出。输入样例:a
Python正则表达式（re模块） qq742234984 python 正则表达式 mysql
Python正则表达式（re模块）概述正则表达式Python正则表达式re模块re.match方法常用的匹配规则-匹配字符常用的匹配规则-匹配字符数量常用的匹配规则-原生字符串常用的匹配规则-匹配开头结尾常用的匹配规则-分组匹配re.compile方法re.search方法re.findall方法re.sub方法re.split方法贪婪模式与非贪婪模式概述案例概述正则表达式英文名为RegularE
OpenAI Agents SDK 中文文档中文教程（1） wtsolutions openai agents sdk openai agents sdk python 中文文档教程
英文文档原文详见OpenAIAgentsSDKhttps://openai.github.io/openai-agents-python/本文是OpenAI-agents-sdk-python使用翻译软件翻译后的中文文档/教程。分多个帖子发布，帖子的目录如下：(1)OpenAI代理SDK，介绍及快速入门(2)OpenAIagentssdk,agents，运行agents，结果，流，工具，交接目录O
CNBr活化琼脂糖凝胶4B，CNBr-Activated Sepharose 4B 陕西星贝爱科 CNBr活化琼脂糖凝胶4B
CNBr活化琼脂糖凝胶4B是一种用于固定含伯胺配基的预活化填料，以下是其详细介绍：基本信息中文名称：溴化氰活化琼脂糖凝胶4B英文名称：CNBr-ActivatedSepharose4B外观：白色浆状物，放置可分层/白色粉末状固体基架：4%琼脂糖颗粒大小：45~165μm偶联官能团：伯氨基储存条件：2~8℃，100%丙酮特点反应条件温和：与蛋白等生物大分子的反应条件温和，可直接偶联生物大分子，不需偶
久违了，那书本的墨香！--- 闻华为新宇兄加班过劳致死有感 weixin_30765505
毕业两年了，如果不是试着去回顾，不会感觉时间流淌的这么快！我从来不否认自己怀旧，也时常在闲暇之余莫名地怀念那过去的人、景和事，尤其是那种在书海中浸淫的味道，那些纸面散发出的墨香，常引我心醉而神往！可是如今却成了那匆匆忙忙上班族中的普通一员，美其名曰：“白领”，实际上只不过去自欺欺人、聊以自慰的“头衔”罢了。每天超过十二小时面对电脑，专业人士提醒的“辐射量”我们自是无暇统计的，而眼前由英文字母、数字
基于AI算法实现的情感倾向分析的方法程序员奇奇计算机毕设人工智能算法
完整代码：https://download.csdn.net/download/pythonyanyan/87430621背景目前，情感倾向分析的方法主要分为两类：一种是基于情感词典的方法；一种是基于机器学习的方法，如基于大规模语料库的机器学习。前者需要用到标注好的情感词典，英文的词典有很多，中文主要有知网整理的情感词典Hownet和台湾大学整理发布的NTUSD两个情感词典，还有哈工大信息检索研究
华为OD机试 - 字符串消除 - 栈Stack（Python/JS/C/C++ 2024 C卷 100分）哪吒华为od python javascript
华为OD机试2024E卷题库疯狂收录中，刷题点这里专栏导读本专栏收录于《华为OD机试真题（Python/JS/C/C++）》。刷的越多，抽中的概率越大，私信哪吒，备注华为OD，加入华为OD刷题交流群，每一题都有详细的答题思路、详细的代码注释、3个测试用例、为什么这道题采用XX算法、XX算法的适用场景，发现新题目，随时更新。一、题目描述游戏规则：输入一个只包含英文字母的字符串,字符串中的两个字母如果
信息技术基础专有名词和计算机硬件学习笔记 learning-striving 信息技术学习笔记信息技术计算机硬件
信息技术常见专有名词信息技术基础课程中常见的专有名词英文缩写或简称及其详细含义，按领域分类整理：硬件与存储CPU(CentralProcessingUnit)中央处理器，负责执行计算机指令和处理数据。GPU(GraphicsProcessingUnit)图形处理器，专用于处理图形和并行计算。RAM(RandomAccessMemory)随机存取存储器，临时存储运行中的程序和数据。ROM(Read-
【模拟面试】计算机考研复试集训（第二天） Albert Edison 计算机考研复试高频考点面试考研职场和发展 c++数据结构算法操作系统
文章目录前言一、专业面试1、OSI参考模型和TCP/IP模型的主要区别是什么？简述各层功能2、什么是瀑布模型？其优缺点是什么？3、什么是递归？使用时需注意什么？4、监督学习与无监督学习的核心区别是什么？请举例说明典型算法5、你在项目中遇到过哪些技术挑战？是如何解决的？二、英文口语1、Canyoutellusaboutatimeyouworkedinateamandfacedchallenges?H
【头歌C语言程序与设计】数据类型与基本操作畅游星辰大海 #头歌C语言程序设计 c语言
目录写在前面正文第1关：数值与字符的通用性实验第2关：转义字符实验第3关：浮点数实验第4关：数值类型综合实验写在最后写在前面本文代码是我自己所作，本人水平有限，可能部分代码看着不够简练，运行效率不高,但都能运行成功。另外，如果想了解更多，请订阅专栏头歌C语言程序与设计正文第1关：数值与字符的通用性实验本关任务：了解C语言中字符型和整型的通用性，根据提示，输出字母p-Q的数值大小，理解英文姓名排序方
牛客练习赛135——小柒的逆序对(2) KyollBM 算法数据结构
这里还得说一下，调换一个排列中任意两个不同的数，该排列的逆序数奇偶会改变题目：思路：这道题的数据给的很大，如果我们用树状数组维护前缀和都没用，但是我们观察到英文字符只有26个，那我们可以开一个二维数组g[i][j]表示ij字符对有多少个如何维护这个数组呢，其实也很简单，遍历s每个字符c，同时开一个数组储存26个字符对于字符c，先遍历26个字符y，将g[y][c]加上y的个数，结束后再将c的数量加一
探究Visual Studio中的乱码问题 L-Super 杂记 visual studio ide
关于乱码，没遇到皆大欢喜，遇到了头痛不已。在VisualStudio中程序遇到乱码，需要明确三个概念，那么问题就好解决了。三个字符集概念源码字符集MSVC中/source-charset即源代码文本文件的字符集，NodePad++、记事本、VSCode这样类似的文本编辑器，可以打开源文件看一下你的字符集（文件编码）。源代码文本文件是以二进制的形式存在硬盘里的，无论中文英文都一样，当你输入一个汉字后
java24种设计模式目录,为大家整理最全的24种设计模式详解，必收藏高补 java24种设计模式目录
设计模式六大原则单一职责原则一个方法尽可能做一件事情，一般来说不应该让一个方法承担多个职责。单一职责原则的英文名称是SingleResponsibilityPrinciple，简称是SRP。单一职责原则的定义是：应该有且仅有一个原因引起类的变更。SRP的原话解释是：Thereshouldneverbemorethanonereasonforaclasstochange.单一职责原则提出了一个编写程
PTA:空心字母金字塔悦悦子a啊 C语言PTA习题 c++算法
输入一个大写的英文字母，输出空心的字母金字塔。输入格式:一个大写英文字母。输出格式:一个空心的大写英文字母金字塔，其中第1层的“A”在第1行的第40列，列从1开始计数。输入样例:E输出样例:ABBCCDDEEEEEEEEE代码如下：#includeusingnamespacestd;intmain(){chara;cin>>a;intn=a-'A';charb='A';if(a=='A'){for
新科研神器！这回读英文论文真跟读中文没两样了量子位
原创关注前沿科技量子位大模型时代，读论文这事儿真是越来越爽了~你敢信，这样式儿的论文并非中文原版，而是出自翻译软件之手的翻译版。原文长这样：不仅译文流畅，公式图表也丝毫不乱，原模原样清晰美观不说，各种图注表头该翻译也都能翻译到位。并且在大模型加持之下，有什么疑点划线引用直接就能问，再也不怕没人一起讨论最新前沿科技进展，被导师一问一个不吱声了。都说搞科研英语必须过硬，但毕竟作为非母语者，想要如阅读中
2024架构设计师论文题目数字化信息化智能化解决方案 2024架构
论文1大数据lamda架构1、简要说明你参开发的软件项目,吸你所承担的主要作2、lamada体系架构将数据流分为批处理层(对应的英文、加速层文、服务层。简要叙这三个层次的用途和特点3、详细阐述你参与开发的软件项目如何基于lamada体系架构进行大数据处理的架构论文2模型驱动架构设计方法及其用1、简要说明你参与分析和研发的软件项目,吸你所承担的要工作2、简要阐述采用模型驱动架构思想进行软件开发的全过
static关键字直面秃头恐惧 Java java
1.含义static的英文本义是静态的，在java语法中，static既可以修饰成员变量又可以修饰成员方法。被static修饰的成员变量叫作静态成员变量，被static修饰的方法叫作静态成员方法。需要注意的是，在Java中，静态成员(类成员)不属于某个具体的对象，可以被类的所有对象共享(访问、修改)。2.使用2.1static修饰成员变量被static修饰的成员，储存在方法区当中，它的生命周期伴随
AI编程方法第二弹：边提问边调整 leeshuqing AI编程 AI编程
AI编程的提问词非常类似于传统搜索引擎中的检索词，虽然采取了自然语言表示，但是在获取结果的策略上却很一致。因为用户在一开始可能并不非常清楚AI编程工具如何理解用户的提问，因此输出结果可能并不能完全满足用户要求，此时用户可以不断的根据生成结果，动态的灵活的调整提问，使之不断趋近于自己满意的结果。比如，对于“Python”等任意英文单词，允许用户指定总宽度后，通过自动填充空格，使之总宽度尽可能等于该宽
java日记1（小白常见的错误） xxxlllli java
一.找不到文件出现这种情况的话1.检查自己文件名是否输入正确2.检查文件所在目录是否正确二.主类名和文件名不一致例如该文件名是lianxi而主类名为lianxi01，应把两者统一三.缺少分号根据提示添加分号即可（分号要英文模式下的！！！）
论文摘要生成器：用TextRank算法实现文献关键信息提取 Atlas Shepherd python 算法自然语言处理 python 信息可视化
我们基于python代码，使用PyQt5创建图形用户界面（GUI），同时支持中英文两种语言的文本论文文献关键信息提取。PyQt5：用于创建GUI应用程序。jieba：中文分词库，用于中文文本的处理。re：正则表达式模块，用于文本清理和句子分割。numpy：提供数值计算能力，如数组操作、矩阵运算等，主要用于TextRank算法的实现。importsysimportreimportjiebaimpor
[网络]IP地址详解逻辑与&& 云计算与运维 #网络与协议 tcp/ip 网络网络协议
一、IP介绍IP是英文InternetProtocol的缩写，意思是“网络之间互连的协议”，也就是为计算机网络相互连接进行通信而设计的协议。在因特网中，它是能使连接到网上的所有计算机网络实现相互通信的一套规则，规定了计算机在因特网上进行通信时应当遵守的规则。任何厂家生产的计算机系统，只要遵守IP协议就可以与因特网互连互通。正是因为有了IP协议，因特网才得以迅速发展成为世界上最大的、开放的计算机通信
解密代理IP：住宅、ISP 与双 ISP 代理大起底 IPFLY代理 tcp/ip 接口隔离原则网络协议
在跨境电商、社媒营销和数据采集等领域，IP代理是突破地域限制、提升效率的必备工具。住宅代理、ISP代理和双ISP代理看似相似，实则大不相同。本文将为你拆解这三类代理的区别，助你选对工具，事半功倍。住宅代理：以假乱真的网络“伪装者”住宅代理，英文名为ResidentialProxy，其IP地址源自互联网服务提供商（ISP）分配给家庭用户的地址。这些IP地址与真实的物理地址紧密相连，在网络活动中，它们
ComPDFKit - 专业的PDF文档处理SDK pdf格式转换ocr文档注释
ComPDFKit提供专业、全平台支持的PDF开发库，包括Windows、Mac、Linux、Android、iOS、Web平台。开发者可以快速、灵活整合PDF功能到各开发平台的软件、程序、系统中。丰富的功能，多种开发语言，灵活的部署方案可供选择，满足您对PDF文档的所有需求。联系方式：中文官网:https://www.compdf.com/zh-cn英文官网：https://www.compdf
网上发现的一个《Flash&flex大全》 merryken as3 flash/flex flash flex actionscript 引擎游戏 adobe
官方在线帮助（没标英文的都是中文）用于AdobeFlashPlatform的ActionScript3.0参考更多参考使用这样的链接下载离线版：http://help.adobe.com/en_US/FlashPlatform/reference/actionscript/3/standalone.zip中文离线版将上面的en_US改为zh_CN(注意大小写)用于AdobeFlashProfess
Flash&flex大全 gebizhihu Flex flash flex actionscript 引擎游戏 adobe
官方在线帮助（没标英文的都是中文）用于AdobeFlashPlatform的ActionScript3.0参考更多参考使用这样的链接下载离线版：http://help.adobe.com/en_US/FlashPlatform/reference/actionscript/3/standalone.zip中文离线版将上面的en_US改为zh_CN(注意大小写)用于AdobeFlashProfess
【AI赋能】蓝耘赋能通义万相2.1：AI创作新时代的强力引擎星落无尘人工智能 AIGC
通义万相2.1的强大功能与特性通义万相2.1拥有多项突破性能力，使其在众多AI生成模型中脱颖而出。它支持文生视频、图生视频、视频编辑、文生图和视频生音频等多项任务，是真正意义上的多模态生成模型。在视频生成方面，通义万相2.1推出极速版和专业版两个版本，在权威的VBenchLeaderboard评测榜单上以84.7%的总分登顶。其首创的中文文字生成功能，为视频添加具有电影级效果的中英文文字特效变得轻
论文阅读-秦汉时期北方边疆组织的空间互动模式与直道的定位（中国） MilkLeong 论文阅读空间计算
论文英文题目：AspatialinteractionmodelofQin-HanDynastyorganisationonthenorthernfrontierandthelocationoftheZhidaohighway(China)发表于：journalofarchaeologicalscience，影响因子：3.030论文主要是使用空间互动模型来对秦汉时期的北方边疆直道进行定位和重建。分析
第20周：Pytorch文本分类入门 weixin_46620278 pytorch 分类人工智能
目录前言一、前期准备1.1环境安装导入包1.2加载数据1.3构建词典1.4生成数据批次和迭代器二、准备模型2.1定义模型2.2定义示例2.3定义训练函数与评估函数三、训练模型3.1拆分数据集并运行模型3.2使用测试数据集评估模型总结前言本文为[365天深度学习训练营]中的学习记录博客原作者：[K同学啊]说在前面本周任务：了解文本分类的基本流程、学习常用数据清洗方法、学习如何使用jieba实现英文分
理解 C# 泛型接口中的协变与逆变（抗变）幻凌风 NET
一、协变和逆变是什么？先从字面上理解协变(Covariance)、逆变(Contravariance)。co-是英文中表示“协同”、“合作”的前缀，协变的字面意思就是“与变化的方向相同”。contra-是英文中表示“相反”的前缀，逆变的字面意思就是是“与变化方向相反”。那么问题来了，这里的变化方向指的是什么？C#中对于对象（即对象引用），仅存在一种隐式类型转换，即子类型的对象引用到父类型的对象引用
Maven Array_06 eclipse jdk maven
Maven Maven是基于项目对象模型(POM)，信息来管理项目的构建，报告和文档的软件项目管理工具。 Maven 除了以程序构建能力为特色之外，还提供高级项目管理工具。由于 Maven 的缺省构建规则有较高的可重用性，所以常常用两三行 Maven 构建脚本就可以构建简单的项目。由于 Maven 的面向项目的方法，许多 Apache Jakarta 项目发文时使用 Maven，而且公司
ibatis的queyrForList和queryForMap区别 bijian1013 java ibatis
一.说明 iBatis的返回值参数类型也有种：resultMap与resultClass，这两种类型的选择可以用两句话说明之： 1.当结果集列名和类的属性名完全相对应的时候，则可直接用resultClass直接指定查询结果类
LeetCode[位运算] - #191 计算汉明权重 Cwind java 位运算 LeetCode Algorithm 题解
原题链接：#191 Number of 1 Bits 要求：写一个函数，以一个无符号整数为参数，返回其汉明权重。例如，‘11’的二进制表示为'00000000000000000000000000001011', 故函数应当返回3。汉明权重：指一个字符串中非零字符的个数；对于二进制串，即其中‘1’的个数。难度：简单分析：将十进制参数转换为二进制，然后计算其中1的个数即可。 “
浅谈java类与对象 15700786134 java
java是一门面向对象的编程语言，类与对象是其最基本的概念。所谓对象，就是一个个具体的物体，一个人，一台电脑，都是对象。而类，就是对象的一种抽象，是多个对象具有的共性的一种集合，其中包含了属性与方法，就是属于该类的对象所具有的共性。当一个类创建了对象，这个对象就拥有了该类全部的属性，方法。相比于结构化的编程思路，面向对象更适用于人的思维
linux下双网卡同一个IP 被触发 linux
转自： http://q2482696735.blog.163.com/blog/static/250606077201569029441/ 由于需要一台机器有两个网卡，开始时设置在同一个网段的IP，发现数据总是从一个网卡发出，而另一个网卡上没有数据流动。网上找了下，发现相同的问题不少：一、关于双网卡设置同一网段IP然后连接交换机的时候出现的奇怪现象。当时没有怎么思考、以为是生成树
安卓按主页键隐藏程序之后无法再次打开肆无忌惮_ 安卓
遇到一个奇怪的问题，当SplashActivity跳转到MainActivity之后，按主页键，再去打开程序，程序没法再打开（闪一下），结束任务再开也是这样，只能卸载了再重装。而且每次在Log里都打印了这句话"进入主程序"。后来发现是必须跳转之后再finish掉SplashActivity 本来代码： // 销毁这个Activity fin
通过cookie保存并读取用户登录信息实例知了ing JavaScript html
通过cookie的getCookies()方法可获取所有cookie对象的集合；通过getName()方法可以获取指定的名称的cookie；通过getValue()方法获取到cookie对象的值。另外，将一个cookie对象发送到客户端，使用response对象的addCookie()方法。下面通过cookie保存并读取用户登录信息的例子加深一下理解。（1）创建index.jsp文件。在改
JAVA 对象池矮蛋蛋 java ObjectPool
原文地址： http://www.blogjava.net/baoyaer/articles/218460.html Jakarta对象池 ☆为什么使用对象池恰当地使用对象池化技术，可以有效地减少对象生成和初始化时的消耗，提高系统的运行效率。Jakarta Commons Pool组件提供了一整套用于实现对象池化
ArrayList根据条件+for循环批量删除的方法 alleni123 java
场景如下： ArrayList<Obj> list Obj-> createTime, sid. 现在要根据obj的createTime来进行定期清理。（释放内存） ------------------------- 首先想到的方法就是 for(Obj o:list){ if(o.createTime-currentT>xxx){
阿里巴巴“耕地宝”大战各种宝百合不是茶平台战略
“耕地保”平台是阿里巴巴和安徽农民共同推出的一个 “首个互联网定制私人农场”，“耕地宝”由阿里巴巴投入一亿，主要是用来进行农业方面，将农民手中的散地集中起来不仅加大农民集体在土地上面的话语权，还增加了土地的流通与利用率，提高了土地的产量，有利于大规模的产业化的高科技农业的发展，阿里在农业上的探索将会引起新一轮的产业调整，但是集体化之后农民的个体的话语权将更少，国家应出台相应的法律法规保护
Spring注入有继承关系的类（1） bijian1013 java spring
一个类一个类的注入 1.AClass类 package com.bijian.spring.test2; public class AClass { String a; String b; public String getA() { return a; } public void setA(Strin
30岁转型期你能否成为成功人士 bijian1013 成功
很多人由于年轻时走了弯路，到了30岁一事无成，这样的例子大有人在。但同样也有一些人，整个职业生涯都发展得很优秀，到了30岁已经成为职场的精英阶层。由于做猎头的原因，我们接触很多30岁左右的经理人，发现他们在职业发展道路上往往有很多致命的问题。在30岁之前，他们的职业生涯表现很优秀，但从30岁到40岁这一段，很多人
[Velocity三]基于Servlet+Velocity的web应用 bit1129 velocity
什么是VelocityViewServlet 使用org.apache.velocity.tools.view.VelocityViewServlet可以将Velocity集成到基于Servlet的web应用中，以Servlet+Velocity的方式实现web应用 Servlet + Velocity的一般步骤 1.自定义Servlet，实现VelocityViewServl
【Kafka十二】关于Kafka是一个Commit Log Service bit1129 service
Kafka is a distributed, partitioned, replicated commit log service.这里的commit log如何理解？ A message is considered "committed" when all in sync replicas for that partition have applied i
NGINX + LUA实现复杂的控制 ronin47 lua nginx 控制
安装lua_nginx_module 模块 lua_nginx_module 可以一步步的安装，也可以直接用淘宝的OpenResty Centos和debian的安装就简单了。。这里说下freebsd的安装： fetch http://www.lua.org/ftp/lua-5.1.4.tar.gz tar zxvf lua-5.1.4.tar.gz cd lua-5.1.4 ma
java-14.输入一个已经按升序排序过的数组和一个数字，在数组中查找两个数，使得它们的和正好是输入的那个数字 bylijinnan java
public class TwoElementEqualSum { /** * 第 14 题：题目：输入一个已经按升序排序过的数组和一个数字，在数组中查找两个数，使得它们的和正好是输入的那个数字。要求时间复杂度是 O(n) 。如果有多对数字的和等于输入的数字，输出任意一对即可。例如输入数组 1 、 2 、 4 、 7 、 11 、 15 和数字 15 。由于
Netty源码学习-HttpChunkAggregator-HttpRequestEncoder-HttpResponseDecoder bylijinnan java netty
今天看Netty如何实现一个Http Server org.jboss.netty.example.http.file.HttpStaticFileServerPipelineFactory： pipeline.addLast("decoder", new HttpRequestDecoder()); pipeline.addLast(&quo
java敏感词过虑-基于多叉树原理 cngolon 违禁词过虑替换违禁词敏感词过虑多叉树
基于多叉树的敏感词、关键词过滤的工具包，用于java中的敏感词过滤 1、工具包自带敏感词词库，第一次调用时读入词库，故第一次调用时间可能较长，在类加载后普通pc机上html过滤5000字在80毫秒左右，纯文本35毫秒左右。 2、如需自定义词库，将jar包考入WEB-INF工程的lib目录，在WEB-INF/classes目录下建一个 utf-8的words.dict文本文件，
多线程知识 cuishikuan 多线程
T1，T2，T3三个线程工作顺序，按照T1，T2，T3依次进行 public class T1 implements Runnable{ @Override
spring整合activemq dalan_123 java spring jms
整合spring和activemq需要搞清楚如下的东东1、ConnectionFactory分： a、spring管理连接到activemq服务器的管理ConnectionFactory也即是所谓产生到jms服务器的链接 b、真正产生到JMS服务器链接的ConnectionFactory还得
MySQL时间字段究竟使用INT还是DateTime？ dcj3sjt126com mysql
环境：Windows XPPHP Version 5.2.9MySQL Server 5.1 第一步、创建一个表date_test（非定长、int时间） CREATE TABLE `test`.`date_test` (`id` INT NOT NULL AUTO_INCREMENT ,`start_time` INT NOT NULL ,`some_content`
Parcel: unable to marshal value dcj3sjt126com marshal
在两个activity直接传递List<xxInfo>时，出现Parcel: unable to marshal value异常。在MainActivity页面（MainActivity页面向NextActivity页面传递一个List<xxInfo>）： Intent intent = new Intent(this, Next
linux进程的查看上（ps） eksliang linux ps linux ps -l linux ps aux
ps:将某个时间点的进程运行情况选取下来转载请出自出处：http://eksliang.iteye.com/admin/blogs/2119469 http://eksliang.iteye.com ps 这个命令的man page 不是很好查阅，因为很多不同的Unix都使用这儿ps来查阅进程的状态，为了要符合不同版本的需求，所以这个
为什么第三方应用能早于System的app启动 gqdy365 System
Android应用的启动顺序网上有一大堆资料可以查阅了，这里就不细述了，这里不阐述ROM启动还有bootloader，软件启动的大致流程应该是启动kernel -> 运行servicemanager 把一些native的服务用命令启动起来（包括wifi, power, rild, surfaceflinger, mediaserver等等）-> 启动Dalivk中的第一个进程Zygot
App Framework发送JSONP请求(3) hw1287789687 jsonp 跨域请求发送jsonp ajax请求越狱请求
App Framework 中如何发送JSONP请求呢? 使用jsonp,详情请参考:http://json-p.org/ 如何发送Ajax请求呢? (1)登录 /*** * 会员登录 * @param username * @param password */ var user_login=function(username,password){ // aler
发福利，整理了一份关于“资源汇总”的汇总 justjavac 资源
觉得有用的话，可以去github关注：https://github.com/justjavac/awesome-awesomeness-zh_CN 通用 free-programming-books-zh_CN 免费的计算机编程类中文书籍精彩博客集合 hacke2/hacke2.github.io#2 ResumeSample 程序员简历
用 Java 技术创建 RESTful Web 服务 macroli java 编程 Web REST
转载：http://www.ibm.com/developerworks/cn/web/wa-jaxrs/ JAX-RS (JSR-311) 【 Java API for RESTful Web Services 】是一种 Java™ API，可使 Java Restful 服务的开发变得迅速而轻松。这个 API 提供了一种基于注释的模型来描述分布式资源。注释被用来提供资源的位
CentOS6.5-x86_64位下oracle11g的安装详细步骤及注意事项超声波 oracle linux
前言：这两天项目要上线了，由我负责往服务器部署整个项目，因此首先要往服务器安装oracle，服务器本身是CentOS6.5的64位系统，安装的数据库版本是11g，在整个的安装过程中碰到很多的坑，不过最后还是通过各种途径解决并成功装上了。转别写篇博客来记录完整的安装过程以及在整个过程中的注意事项。希望对以后那些刚刚接触的菜鸟们能起到一定的帮助作用。安装过程中可能遇到的问题（注
HttpClient 4.3 设置keeplive 和 timeout 的方法 supben httpclient
ConnectionKeepAliveStrategy kaStrategy = new DefaultConnectionKeepAliveStrategy() { @Override public long getKeepAliveDuration(HttpResponse response, HttpContext context) { long keepAlive
Spring 4.2新特性-@Import注解的升级 wiselyman spring 4
3.1 @Import @Import注解在4.2之前只支持导入配置类在4.2,@Import注解支持导入普通的java类,并将其声明成一个bean 3.2 示例演示java类 package com.wisely.spring4_2.imp; public class DemoService { public void doSomethin