薛铁钢

论文翻译《Dense Relation Distillation with Context-aware Aggregation for Few-Shot Object Detection》

论文地址：https://arxiv.org/abs/2103.17115
代码地址：https://github.com/hzhupku/DCNet

Abstract
1.Introduction
2.Related Work
- 2.1. General Object Detection
- 2.2. Few-Shot Learning
- 2.3. Few-Shot Object Detection
3. Method
- 3.1. Preliminaries.
- 3.2. DCNet
- 3.2.1. Dense Relation Distillation Module
- 3.2.2. Context-aware Feature Aggregation
- 3.3. Learning Strategy
4. Experiments
- 4.1. Datasets and Settings
4.2. Experiments on PASCAL VOC
- - 4.2.1 Comparisons with State-of-the-art Methods
  - 4.2.2 Ablation Study
  - 4.2.3 Qualitative Results
- 4.3. Experiments on MS COCO
5. Conclusions

Abstract

Conventional deep learning based methods for object detection require a large amount of bounding box annotations for training, which is expensive to obtain such high quality annotated data. Few-shot object detection, which learns to adapt to novel classes with only a few annotated examples, is very challenging since the fine-grained feature of novel object can be easily overlooked with only a few data available. In this work, aiming to fully exploit features of annotated novel object and capture fine-grained features of query object, we propose Dense Relation Distillation with Context-aware Aggregation (DCNet) to tackle the few-shot detection problem. Built on the meta-learning based framework, Dense Relation Distillation module targets at fully exploiting support features, where support features and query feature are densely matched, covering all spatial locations in a feed-forward fashion. The abundant usage of the guidance information endows model the capability to handle common challenges such as appearance changes and occlusions. Moreover, to better capture scale-aware features, Context-aware Aggregation module adaptively harnesses features from different scales for a more comprehensive feature representation. Extensive experiments illustrate that our proposed approach achieves state-of-the-art results on PASCAL VOC and MS COCO datasets. Code will be made available at https://github.com/hzhupku/DCNet

传统的基于深度学习的物体检测方法需要大量的边界框注释来进行训练，获取如此高质量的标注数据代价高昂。小样本目标检测在只有少数注释实例的情况下学习适应新的类别，这是一种非常具有挑战性的方法，因为在只有少量可用数据的情况下，新对象的细粒度特征很容易被忽略。在本文中，为了充分利用带注释的新对象的特征，获取查询对象的细粒度特征，我们提出了基于上下文感知聚合的密集关系蒸馏法（DCNet） 来解决小样本目标检测的问题。密集关系蒸馏模块建立在基于元学习的框架之上，旨在充分利用支撑特征（支撑特征和查询特征是密集匹配的），以前馈方式覆盖所有空间位置。指导信息的大量使用使模型有能力外观变化和遮挡等常见的挑战。此外，为了更好地捕捉尺度感知的特征，上下文感知聚合模块自适应地利用不同尺度的特征，以获得更全面的特征表示。大量的实验表明，我们提出的方法在PASCAL VOC和MS COCO数据集上取得了最先进的结果。代码将展示在https://github.com/hzhupku/DCNet。

1.Introduction

With the success of deep convolutional neural works, object detection has made great progress these years [20, 23, 8]. The success of deep CNNs, however, heavily relies on large-scale datasets such as ImageNet [2] that enable the training of deep models. When the labeled data becomes scarce, CNNs can severely overfit and fail to generalize. While in contrast, human beings have exhibited strong performance in learning a new concept with only a few examples available. Since some object categories naturally have scarce examples or bounding box annotations are laborsome to obtain such as medical data. These problems have triggered increasing attentions to deal with learning models with limited examples. Few-shot learning aims to train models to generalize well with a few examples provided. However, most existing few-shot learning works focus on image classification [29, 26, 27] problem and only a few focus on few-shot object detection problem. Since object detection not only requires class prediction, but also demands localization of the object, making it much more difficult than few-shot classification task.

随着深度卷积神经工作的成功，目标检测在近年来取得了很大进展[20,23,8]。然而，深度CNN的成功在很大程度上依赖于大规模的数据集，如ImageNet[2]，它支持深度模型的训练。当标记数据变得稀缺时，CNN会严重过拟合，无法进行泛化。与此相反，人类在学习一个新概念时，即使面对很少的例子，也能表现出很强的学习能力。由于某些对象类别的示例自然稀少，或者边界框注释很难获取，例如医疗数据。这些问题引起了人们对有限例学习模型的关注。小样本学习的目的是在提供了少数示例的情况下训练模型的泛化能力。然而，现有的小样本学习工作侧重于图像分类问题[29,26,27]，只有少数侧重于小样本目标检测问题。这是因为目标检测不仅需要进行类别预测，而且需要对目标进行定位，这比小样本分类任务要困难得多。

Figure 1. Two challenges for few-shot object detection. a) Appearance changes between support and query images are common, which results in a misleading manner. b) Occlusion problem brings about incomplete feature representation, causing false classification and missing detection.

图1。小样本目标检测的两个挑战。a)支撑图像和查询图像之间的外观变化很常见，这会导致误导。b)遮挡问题导致特征表示不完整，造成错误分类和漏检。

Prior studies in few-shot object detection mainly consist of two groups. Most of them [13, 35, 34] adopt a meta learning [5] based framework to perform feature reweighting for a class-specific prediction. While Wang et al.[31] adopt a two-stage fine-tuning approach with only finetuning the last layer of detectors and achieve state-of-the-artperformance. Wu et al. [33] also use similar strategy and focus on the scale variation problem in few-shot detection.

以往的小样本目标检测研究主要分为两大类。其中大部分[13,35,34]采用基于元学习的[5]框架，对特定类的预测进行特征重加权。而Wang等人[31]采用两阶段微调方法，只对最后一层检测器进行微调，取得了最先进的性能。Wu等人[33]也采用了类似的策略，关注了小样本目标检测中的尺度变化问题。

However, aforementioned methods often suffer from several drawbacks due to the challenging nature of few-shot object detection. Firstly, relations between support features and query feature are hardly fully explored in previous few-shot detection works, where global pooling operation on support features is mostly adopted to modulate the query branch, which is prone to loss of detailed local context. Specifically, appearance changes and occlusions are common for objects, as shown Fig. 1. Without enough discriminative information provided, the model is obstructed from learning critical features for class and bounding box predictions. Secondly, although scale variation problem has been widely studied in prior works [17, 15, 33], it remains a serious obstacle in few-shot detection tasks. Under few-shot settings, feature extractor with scale-aware modifications is inclined to overfitting, leading to a deteriorated performance for both base and novel classes.

然而，由于小样本目标检测的挑战性，上述方法往往存在一些缺点。首先，在以往的小样本目标检测工作中，支撑特征与查询特征之间的关系几乎没有得到充分的探讨，多采用对支撑特征的全局池化操作来调节查询分支，这很容易造成局部上下文细节的丢失。具体来说，如图1所示，物体的外观变化和遮挡是常见的。如果不提供足够的判别信息，模型就会受到阻碍，无法学习关键的特征来进行类别和边界框的预测。其次，尽管尺度变化问题在前人的工作中得到了广泛的研究[17,15,33]，但它仍然是小样本目标检测任务中的一个严重障碍。在小样本设置下，具有尺度感知修改的特征提取器可能会过拟合，导致基类和新类性能下降。

In order to alleviate the above issues, we first propose the dense relation distillation module to fully exploit support set. Given a query image and a few support images from novel classes, the shared feature learner extracts query feature and support features for subsequent matching procedure. Intuitively, the criteria that determines whether query object and support object belong to the same category mainly measures how much feature similarity they share in common. When appearance changes or occlusions occur, local detailed features are dominant for matching candidate objects and template ones. Hence, instead of obtaining global representations of support set, we propose a dense relation distillation mechanism where query and support features are matched in a pixel-wise level. Specifically, key and value maps are produced from features, which serve as encoding visual semantics for matching and containing detailed appearance information for decoding respectively. With local information of support set effectively retrieved for guidance, the performance can be significantly boosted, especially in extremely low-shot scenarios.

为了缓解上述问题，我们首先提出了密集关系蒸馏模块，以充分利用支持集。给定一个查询图像和一些来自新类的支撑图像，共享特征学习器提取查询特征和支撑特征，用于后续的匹配过程。直观地说，判断查询对象和支撑对象是否属于同一类别的标准主要是衡量它们有多少共同的特征相似性。当出现外观变化或遮挡时，局部细节特征在匹配查询对象和模板对象时占主导地位。因此，我们没有获得支撑集的全局表示，而是提出了一种密集关系蒸馏机制，在这种机制中，查询特征和支撑特征是在一个像素级的水平上匹配的。具体而言，从特征中生成键映射和值映射，分别作为匹配的视觉语义编码和解码的详细外观信息。通过有效地检索支持集的局部信息进行制导，可以显著提高性能，特别是在极少样本场景下。

Furthermore, for the purpose of mitigating the scale variation problem, we design the context-aware feature aggregation module to capture essential cues for different scales during RoI pooling. Since directly modifying feature extractor could result in overfitting, we choose to perform adjustment from a more flexible perspective. Recognition of objects with different scales requires different levels of contextual information, while the fixed pooling resolution may bring about loss of substantial context information. Hence, an adaptive aggregation mechanism that allocates specific attention to local and global features simultaneously could help preserve contextual information for different scales of objects. Therefore, instead of performing RoI pooling with one fixed resolution, we choose three different pooling reso-lutions to capture richer context features. Then an attention mechanism is introduced to adaptively aggregate output features to present a more comprehensive representation.

此外，为了缓解尺度变化问题，我们设计了基于上下文感知的特征聚合模块，以捕获RoI池化过程中不同尺度的基本线索。由于直接修改特征提取器可能导致过拟合，我们选择从更灵活的角度进行调整。不同尺度目标的识别需要不同层次的上下文信息，而固定的池化分辨率可能会导致大量上下文信息的丢失。因此，一种同时对局部和全局特征分配特定注意力的自适应聚合机制可以帮助保留不同尺度物体的上下文信息。因此，我们没有使用一个固定分辨率进行RoI池化，而是选择三个不同的池化分辨率来捕获更丰富的上下文特征。然后，引入注意机制来自适应地聚合输出特征，以呈现更全面的表示。

The contributions of this paper can be summarized as follows:

We propose a dense relation distillation module for few-shot detection problem, which targets at fully exploiting support information to assist the detection process for objects from novel classes.

We propose an adaptive context-aware feature aggregation module to better capture global and local features to alleviate scale variation problem, boosting the performance of few-shot detection.

Extensive experiments illustrate that our approach has achieved a consistent improvement on PASCAL VOC and MS COCO datasets. Specially, our approach achieves better performance than the state-of-the-art methods on the two datasets.

本文的研究成果主要体现在以下几个方面:
1.我们提出了一种针对小样本目标检测问题的密集关系蒸馏模块，其目标是充分利用支撑信息来辅助新类目标的检测过程。
2. 我们提出了一种自适应的上下文感知特征聚合模块，以更好地捕捉全局和局部特征，以缓解尺度变化问题，提高小样本目标检测的性能。
3.大量的实验表明，我们的方法在PASCAL VOC和MS COCO数据集上都取得了改进。特别地，我们的方法在这两个数据集上取得了比最先进方法更好的性能。

2.Related Work

2.1. General Object Detection

Deep learning based object detection can be mainly divided into two categories: one-stage and two-stage detectors. One-stage detector YOLO series [20, 21, 22] provide a proposal-free framework, which uses a single convolutional network to directly perform class and bounding box predictions. SSD [18] uses default boxes to adjust to various object shapes. On the other hand, RCNN and its variants [7, 9, 6, 23, 8] fall into the second category. These methods first extract class-agnostic region proposals of the potential objects from a given image. The generated boxes are then further refined and classified into different categories by subsequent modules. Moreover, many works are proposed to handle scale variance [17, 15, 24, 25]. Compared to one-stage methods, two-stage methods are slower but exhibit better performance. In our work, we adopt Faster R-CNN as the base detector.

基于深度学习的目标检测主要分为单阶段检测器和两阶段检测器两大类。两阶段检测器YOLO系列[20,21,22]提供了一种无提议框架，它使用一个卷积网络直接执行类和边界框预测。SSD[18]使用默认框来适应各种物体形状。另一方面，RCNN及其变体[7,9,6,23,8]属于第二类。这些方法首先从给定的图像中提取潜在目标的类无关的区域提议框。然后，生成的方框被后续模块进一步细化并划分为不同的类别。此外，许多研究提出了处理尺度方差的方法[17,15,24,25]。与单阶段方法相比，两阶段方法速度较慢，但表现出更好的性能。在我们的工作中，我们采用Faster R-CNN作为基检测器。

2.2. Few-Shot Learning

Few-shot learning aims to learn transferable knowledge that can be generalized to new classes with scarce examples. Bayesian inference is utilized in [4] to generalize knowledge from a pretrained model to perform one-shot learning. Meta-learning based methods have been prevalent in few-shot learning these days. Metric learning based methods [16, 29, 26, 27] have achieved state-of-the-art performance in few-shot classification tasks. Matching Network [29] encodes input into deep neural features and performs weighted nearest neighbor matching to classify query images. Our proposed method is also based on matching mechanism. Prototypical Network [26] represents each class with one prototype which is a feature vector. Relation Network [27] learns a distance metric to compare the target image with a few labeled images. While optimization based methods [19, 5] are proposed for fast adaptation to new few-shot task. [11] proposes a cross-attention mechanism to learn correlations between support and query images. Above methods are focusing on the few-shot classification task while few-shot object detection problem is relatively under-explored.

小样本学习的目的是学习可转移的知识，可以推广到新的类和稀缺的例子。[4]中使用贝叶斯推理从预训练的模型中归纳出知识来执行一次学习（one-shot learning）。目前，基于元学习的方法在一次学习中非常流行。基于度量学习的方法[16,29,26,27]在小样本分类任务中取得了最先进的性能。匹配网络[29]将输入编码为深度神经特征，并进行加权最近邻匹配对查询图像进行分类。我们提出的方法也是基于匹配机制。原型网络（Prototypical Network）[26]用一个原型代表每个类别，其中一个原型是一个特征向量。关系网络[27]学习一个距离度量来比较目标图像与一些标记图像。而基于优化的方法[19,5]则被提出用于快速适应新的小样本任务。[11]提出了一种交叉注意机制来学习支持和查询图像之间的相关性。上述方法主要针对的是小样本分类任务，而对小样本目标检测问题的研究相对较少。

2.3. Few-Shot Object Detection

Few-shot object detection aims to detect object from novel classes with only a few annotated training examples provided. LSTD [1] and RepMet [14] adopt a general transfer learning framework which reduces overfitting by adapting pretrained detectors to few-shot scenarios. Recently, Meta YOLO [13] designs a novel few-shot detection model with YOLO v2 [21] that learns generalizable meta features and automatically reweights the features for novel classes by producing classspecific activating coefficients from support examples. Meta R-CNN [35] and FsDetView [34] perform similar process with base detector as Faster R-CNN. TFA [31] simply performs two-stage finetuning approach by only finetuning the classifier on the second stage and achieves better performance. MPSR [33] proposes multiscale positive sample refinement to handle scale variance problem. CoAE [12] proposes non-local RPN and focuses on one-shot detection from the view of tracking by comparing itself with other tracking methods, while our method performs cross-attention on features extracted by the backbone in a more straightforward way and targets at few-shot detection task. FSOD [3] proposes attention-RPN, multi-relation detector and contrastive training strategy to detect novel object. In our work, we adopt the similar meta-learning based framework as Meta R-CNN and further improve the performance. Moreover, with our proposed method, the class-specific prediction procedure can be successfully removed, simplifying the overall process.

小样本目标检测的目的是在只提供少数有注释的训练实例的情况下检测新类别的物体。LSTD[1]和RepMet[14]采用了一种通用的迁移学习框架，通过调整预训练的检测器来减少过拟合，以适应小样本的场景。最近，Meta YOLO[13]使用YOLO v2[21]设计了一种新颖的小样本目标检测模型，该模型学习了可通用的元特征，并通过从支持实例中生成特定类别的激活系数，自动为新类调整特征的权重。Meta R-CNN[35]和FsDetView[34]执行与Faster R-CNN相似的基本检测器过程。TFA[31]简单地执行了两阶段的微调方法，只在第二阶段对分类器进行微调，并取得了更好的性能。MPSR[33]提出了多尺度正样本细化方法来处理尺度方差问题。CoAE[12]提出了非局部RPN，通过与其他跟踪方法的比较，从跟踪的角度关注一次检测，而我们的方法更直接地对backbone提取的特征进行交叉注意，针对的是小样本检测任务。FSOD[3]提出了attention-RPN、多关系检测器和对比训练策略来检测新目标。在我们的工作中，我们采用了类似于Meta R-CNN的基于元学习的框架，进一步提高了性能。此外，通过我们提出的方法，可以成功地去除特定类别的预测程序，简化了整个过程。

3. Method

3.1. Preliminaries.

Problem Definition. Following setting in [13, 35], object classes are divided into base classes $C_{base}$ with abundant annotated data and novel classes $C_{novel}$ with only a few annotated samples, where $C_{base}$ and $C_{novel}$ have no intersection. We aim to obtain a few-shot detection model with the ability to detect objects from both base and novel classes in testing by leveraging generalizable knowledge from base classes. The number of instances per category for novel classes is set as k (i.e., k-shot).

问题的定义。 根据[13,35]的设置，对象类分为基类 $C_{base}$ 和新类 $C_{novel}$ ，基类 $C_{base}$ 有丰富的标注数据，新类 $C_{novel}$ 只有少量的标注样本，其中 $C_{base}$ 和 $C_{novel}$ 没有交集。我们的目标是利用基类的可泛化知识，获得一个能够检测基类和新类对象的小样本目标检测模型。每个新类类别的实例数设置为k(即k-shot)。

We align the training scheme with the episodic paradigm [29] for few-shot scenario. Given a k-shot learning task, each episode is constructed by sampling: 1) a support set containing image-mask pairs for different classes $S={x_{i},y_{i}}_{i=1}^{N}$ , where $x_{i}\in\mathbb R^{h \times w \times 3}$ is an RGB image, $y_{i} \in\mathbb R^{h \times w}$ is a binary mask for objects of class i in the support image generated from bounding box annotations and N is the number of classes in the training set; 2) a query image q and annotations m for the training classes in the query image. The input to the model is the support pairs and query image, the output is detection prediction for query image.

我们的训练方案和[29]的适用场景相一致，用于小样本场景。给定一个k-shot学习任务，每个episode都是通过采样构建的：1)一个包含不同类的图像-掩码对的支持集 $S={x_{i},y_{i}}_{i=1}^{N}$ ，其中 $x_{i}\in\mathbb R^{h \times w \times 3}$ 是RGB图像， $y_{i} \in\mathbb R^{h \times w}$ 是由边界框注释生成的支持图像中第i类对象的二进制掩码，N是训练集中的类数；2)查询图像q和查询图像中训练类别的注释m。模型的输入是支撑对和查询图像，输出是对查询图像的预测结果。

Basic Object Detection. The choice of base detectors is varied. [13] utlizes YOLO v2 [21] which is a one-stage detector, while [35] adopts Faster R-CNN [23] which is a two-stage detector and provides consistently better results. Therefore, we also adopt Faster R-CNN as our base detector which consists of a feature extractor, region proposal network (RPN) and the detection head (RoI head).

基本的对象检测。 基础检测器的选择是多种多样的。[13]使用YOLO v2[21]，这是一个单阶段检测器，而[35]采用Faster R-CNN[23]，这是两阶段检测器，通常来讲效果更好。因此，我们也采用Faster R-CNN作为我们的基础检测器，它由一个特征提取器、区域建议网络(RPN)和检测头(RoI头)组成。

Feature Reweighting for Detection. We choose Meta-RCNN [35] as our baseline method. Formally, let I denote an input query image, ${I_{si}, M_{si}\}|_{i=1}^{N}$ denote support images and masks converted from bounding-box annotations, where N is the number of training classes. RoI features $z^j|_{j=1}^n$ is generated by the RoI pooling layer (n is the number of RoIs) and class-specific vectors $w_{i} \in\mathbb R^C, i =1, 2, ..., N$ are produced with a reweighting module which shares its backbone parameters with the feature extractor, where C is the feature dimension. Then class-specific feature $z_i$ is achieved with:
$z_{i} = z \otimes w_{i}, i = 1, 2,……, N, \tag{1}$
where $\otimes$ denotes channel-wise multiplication. Then classspecific prediction is performed to output the detection results. Based on this methodology, we further make a significant improvement and simplify the prediction procedure by removing the class-specific prediction.

检测的特征重加权。 我们选择Meta R-CNN[35]作为基线方法。形式上，让 I 表示一个输入查询图像， ${I_{si}, M_{si}\}|_{i=1}^{N}$ 表示支撑图像和由边界框注释转换的掩码，其中N是训练类的数量。RoI特征$z^j|_{j=1}n由RoI池化层（n为RoI个数）和类特定向量 $w_{i} \in\mathbb R^C, i =1, 2, ..., N$ 生成， N通过与特征提取器共享其backbone参数的重加权模块产生，其中C为特征维。然后用以下方法获得类特定的特征 $z_i$ ：
$z_{i} = z \otimes w_{i}, i = 1, 2,……, N, \tag{1}$
其中 $\otimes$ 表示逐通道相乘。然后进行分类预测，输出检测结果。在此方法的基础上，我们进一步进行了显著的改进，并通过删除特定于类的预测来简化预测过程。

3.2. DCNet

As illustrated in Fig. 2, we present the Dense Relation Distillation (DRD) module with Context-aware Feature Aggregation (CFA) module to fully exploit support features and capture essential context information. The two proposed components form the final model DCNet. We will first depict the architecture of the proposed DRD module. Then we will bring out the details of the CFA module.

如图2所示，我们提出了密集关系蒸馏（DRD）模块和上下文感知特征聚合（CFA）模块，以充分利用支撑特征并捕获基本的上下文信息。这两个组件组成了最终的模型DCNet。我们将首先介绍DRD模块的体系结构，再介绍CFA模块的细节。

3.2.1. Dense Relation Distillation Module

Figure 2. The overall framework of our proposed DCNet. For training, the input for each episode consists of a query image and N support image-mask pairs from N classes. The shared feature extractor first produces query feature and support features. Then, the dense relation distillation (DRD) module performs dense feature match to activate co-exisiting features of input query. With proposals produced by RPN, context-aware feature aggregation (CFA) module adaptively harnesses features generated with different scales of pooling operations, capturing different levels of features for a more comprehensive representation

图2。我们提出的DCNet的总体框架。对于训练，每个episode的输入由一个查询图像和来自N个类的N个支撑图像掩码对组成。共享特征提取器首先产生查询特征和支撑特征。然后，密集关系蒸馏模块（DRD）进行密集特征匹配，激活输入查询的共存特征。根据RPN提出的建议，上下文感知特征聚合(CFA)模块自适应地利用由不同尺度的池化操作生成的特征，捕获不同级别的特征，以实现更全面的表示。

Key and Value Embedding. Given a query image and support set, query and support features are produced by feeding them into the shared feature extractor. The input of the dense relation distillation (DRD) module is the query feature and support features. Both parts are first encoded into pairs of key and value maps through the dedicated deep en-coders. The query encoder and support encoder adopt the same structure while not sharing parameters.

键和值的嵌入。 给定查询图像和支撑集，通过将查询图像和支撑集输入共享特征提取器来生成查询和支撑特征。密集关系精馏(DRD)模块的输入是查询特征和支撑特征。这两个部分首先通过专用的深度编码器被编码为一对键和值。查询编码器和支撑编码器采用相同的结构，但是不共享参数。

The encoder takes one or multiple feature as input and outputs two feature maps for each input feature: key and value with two parallel $\times 3$ convolution layers, which serve as reducing the dimension of the input feature to save computation cost. Specifically, key maps are used for measuring the similarities between query feature and support features, which help determine where to retrieve relevant support values. Therefore, key maps are learned to encode visual semantics for matching and value maps store detailed information for recognition. Hence, for query feature, the output is a pair of key and value maps: $k_{q} \in\mathbb R^{C/8 \times H \times W}$ , $v_{q}\in\mathbb R^{C/2 \times H \times W}$ , where C is the feature dimension, H is the height, and W is the width of input feature map. For support features, each of the features is independently encoded into key and value maps, the output is $k_{s} \in\mathbb R^{N\times C/8 \times H \times W}$ , $v_{s}\in\mathbb R^{N\times C/2 \times H \times W}$ , where N is the number of target classes (also the number of support samples). The generated key and value maps are further fed into the relation distillation part where keys maps of query and support are densely matched for addressing target objects.

编码器将一个或多个特征作为输入，并为每个输入特征输出两个特征：键和值，具有两个并行的 $\times 3$ 卷积层，用于降低输入特征的维数以节省计算成本。具体来说，键映射用于度量查询特征和支撑特征之间的相似性，这有助于确定在哪里检索相关的支持值。因此，键映射用于编码视觉语义进行匹配，值映射存储了详细的信息用于识别。因此，对于查询特征，输出是一对键值映射： $k_{q} \in\mathbb R^{C/8 \times H \times W}$ ， $v_{q}\in\mathbb R^{C/2 \times H \times W}$ ，其中C为特征维数，H为高度，W为输入特征映射的宽度。对于支撑特征，每个特征都独立编码为键和值，输出为 $k_{s} \in\mathbb R^{N\times C/8 \times H \times W}$ ， $v_{s}\in\mathbb R^{N\times C/2 \times H \times W}$ ，其中N为目标类的数量（即支撑样本的数量）。生成的键和值映射被进一步送入到关系蒸馏模块，其中查询和支撑的键映射被密集匹配以寻址目标对象。

Relation Distillation. After acquiring the key/value maps of query and support features, relation distillation is performed. As illustrated in Fig. 2, soft weights for value maps of support features are computed via measuring the similarities between key maps of query feature and support features. The pixel-wise similarity is performed in a non-local manner, formulated as: $F(k_{qi}, k_{sj}) = φ(k_{qi})^T φ'(k_{sj}), \tag{2}$ where i and j are the index of the query and support location, $φ, φ_{0}$ denote two different linear transformations with parameters learned via back propagation during training process, forming a dynamically learned similarity function. After computing the similarity of pixel features, we perform softmax normalization to output the final weight W : $W_{ij} = \frac{exp(F(k_{qi}, k_{sj}))}{\sum_{i}exp(F(k_{qi}, k_{sj}))}.\tag{3}$ Then the value of the support features are retrieved by a weighted summation with the soft weights produced and then it is concatenated with the value map of query feature. Hence, the final output is formulated as: $concat[v_q, W * v_s],\tag{4}$ where ∗ denotes matrix inner-product. Noted that there are N support features, which brings N key-value pairs. We perform summation over N output results to obtain the final result, which is a refined query feature, activated by support features where there are co-existing classes of objects in query and support images.

关系蒸馏。 获取查询和支撑特征的键/值映射后，进行关系蒸馏。如图2所示，通过测量查询特征与支撑特征关键映射的相似度，计算支撑特征值映射的软权重。像素级别的相似性以非局部的方式进行，表述为： $F(k_{qi}, k_{sj}) = φ(k_{qi})^T φ'(k_{sj}), \tag{2}$
其中i和j为查询和支撑位置的索引， $φ, φ_{0}$ 表示两个不同的线性变换，在训练过程中通过反向传播学习参数，形成一个动态学习的相似函数。计算出像素特征的相似度后，进行softmax归一化，输出最终权重W： $W_{ij} = \frac{exp(F(k_{qi}, k_{sj}))}{\sum_{i}exp(F(k_{qi}, k_{sj}))}.\tag{3}$
然后将软权重进行加权求和，得到支撑特征的值，将其与查询特征的值映射相连接，因此，最终输出的公式为： $concat[v_q, W * v_s],\tag{4}$ 其中 ∗ 表示矩阵内积。注意到有N个支撑特征，带来了N个键值对。我们对 N 个输出结果进行求和以获得最终结果，这是一个细化的查询特征，由支撑特征激活，这些支撑特征的类别是查询和支撑图像集中共存的。

Previous trials [13, 35, 34] utilize class-wise vectors generated by global pooling of support features to modulate the query feature, which guide the feature learning from a holistic view. However, since appearance changes or occlusions are common in natural images, the holistic feature may be misleading when objects of the same class vary much between query and support samples. Also, when most parts of the objects are unseen due to the occlusions, the retrieval of local detailed features becomes substantial, which former methods completely neglect. Hence, equipped with the dense relation distillation module, pixel-level relevant information can be distilled from support features. As long as there exist some common characteristics, the pixels of query features belonging to the co-existing objects between query and support samples will be further activated, providing a robust modulated feature to facilitate the prediction of class and bounding-box.

之前的实验[13,35,34]利用支撑特征的全局池化生成的分类向量来调整查询特征，从整体角度指导特征学习。然而，由于外观变化或遮挡在自然图像中是常见的，当同一类对象在查询和支撑样本之间变化很大时，整体特征可能会产生误导。另外，当大部分物体由于遮挡而看不见时，局部细节特征的检索变得非常重要，而以往的方法完全忽略了这一点。因此，本模型提出了密集关系蒸馏模块，可以从支撑特征中提取像素级的相关信息。只要存在一些共同的特征，属于查询样本和支撑样本之间共存对象的查询特征的像素将被进一步激活，提供一个鲁棒的调整特征，便于类和边界框的预测。

Our distillation method can be seen as an extension of the non-local self-attention mechanism [28, 30]. However, instead of performing self-attention, we specially design the relation distillation model to realize information retrieval from support features to modulate the query feature, which can be treated as a cross attention.

我们的蒸馏方法可以被视为非局部自注意机制的延伸[28,30]。但是，我们没有使用自注意力机制，而是专门设计了密集关系蒸馏模型，从支撑特征中实现信息检索，以调整查询特征，这一手段可以被看作是交叉注意力机制。

3.2.2. Context-aware Feature Aggregation

Figure 3. Illustration of context-aware feature aggregation. Attention mechanism is adopted to adaptively aggregate different features, where the weights are normalized with softmax function.

图3。上下文感知特征聚合的说明。采用注意机制对不同特征进行自适应聚合，其中 GAP 代表全局平均池化，Linear 代表全连接层，权重用softmax函数归一化。

After performing dense relation distillation, DRD module has fulfilled its duty. The refined query feature is subsequently fed into RPN where region proposals are output. Taking proposals and feature as input, RoI Align module performs feature extraction for final class prediction and bounding-box regression. Normally, pooling operation is implemented with a fixed resolution 8 in our original implementation, which is likely to cause information loss during training. For general object detection, this kind of information loss can be remedied with large scale of training data, while the problem becomes severe in few-shot detection scenarios with only a few training data available, which is inclined to induce a misleading detection results. Moreover, with scale variation amplified due to the few-shot nature, the model tends to lose the generalization ability to novel classes with adequate adaption to different scales. To this end, we propose Context-aware Feature Aggregation (CFA) module. Instead of using a fixed resolution 8, we empirically choose 4, 8 and 12 three resolutions and perform parallel pooling operation to obtain a more comprehensive feature representation. The larger resolution tends to focus on local detailed context information specially for smaller objects, while the smaller resolution targets at capturing holis-tic information to benefit the recognition of larger objects, providing a simple and flexible way to alleviate the scale variation problem.

在执行密集关系蒸馏之后，DRD模块已经完成了它的职责。细化后的查询特征随后被送入RPN，在RPN中输出区域建议框。RoI Align模块以proposals和特征为输入，进行特征提取，最终进行类预测和边界框回归。在我们原来的实现中，池操作通常是用固定的分辨率8来实现的，这很容易在训练过程中造成信息丢失。对于一般的目标检测，这种信息丢失可以通过大规模的训练数据来弥补，而在训练数据较少的小样本目标检测场景中，这种信息丢失问题会变得严重，容易导致检测结果的误导。由于小样本的特性，尺度变化被放大，模型往往会失去对新类别的泛化能力。为此，我们提出了上下文感知的特征聚合(CFA)模块。我们不使用固定的分辨率8，而是根据经验地选择4，8和12三种分辨率，并进行并行池化操作，以获得更全面的特征表示。较大的分辨率倾向于关注较小对象的局部详细上下文信息，而小分辨率的目标是捕捉整体信息，有利于较大的物体的识别，为缓解尺度变化问题提供了一种简单而灵活的方法。

Since each generated feature contains different level of semantic information. With the intention to efficiently aggregate features generated from different scales of RoI pooling, we further propose an attention mechanism to adaptively fuse the pooling results. As illustrated in Fig. 3, we add an attention branch for each feature which consists of two blocks. The first block contains a global average pooling. The second one contains two consecutive fc layers. Afterwards, we add a softmax normalization to the generated weights for balancing the contribution of each feature. Then the final output of the aggregated feature is the weighted summation of the three features.

因为每个生成的特征包含不同层次的语义信息。为了有效地聚合不同规模RoI池化生成的特征，我们进一步提出了一种自适应融合池化结果的注意机制。如图3所示，我们为每个特征添加一个注意力分支，它由两个块组成。第一个块包含一个全局平均池。第二个包含两个连续的fc层。在这之后，我们为生成的权重添加一个softmax归一化，以平衡每个特征的贡献。那么聚合特征的最终输出就是三个特征的加权和。

3.3. Learning Strategy

Figure 4. Demonstration of learning strategy of meta-learning based few-shot detection framework. The meta learner aims to acquire meta information and help the model to generalize to novel classes.

图4. 基于元学习的小样本目标检测框架的学习策略演示。元学习器的目的是获取元信息，帮助模型推广到新的类。

As illustrated in Fig. 4, we follow the training paradigm in [13, 35, 34], which consists of meta-training and meta fine-tuning. In the phase of meta-training, abundant annotated data from base classes is provided. We jointly train the feature extractor, dense relation distillation module, context-aware feature aggregation module and other basic components of detection model. In meta fine-tuning phase, we train the model on both base and novel classes. As only k labeled bounding-boxes are available for the novel classes, to balance between samples from base and novel classes, we also include k boxes for each base class. The training procedure is the same as the meta-training phase but with fewer iterations for model to converge.

如图4所示，我们遵循[13,35,34]中的训练范式，包括元训练和元微调。在元训练阶段，提供了来自基类的大量注释数据。我们联合训练特征提取器、密集关系蒸馏模块、上下文感知特征聚合模块和检测模型的其他基本组件。在元微调阶段，我们在基类和新类上训练模型。由于新类只有k个标记的边界框可用，为了平衡基类和新类的样本，我们还为每个基类包括k个框。训练过程与元训练阶段相同，但模型收敛的迭代次数更少。

4. Experiments

In this section, we first introduce the implementation details and experimental configurations in Sec. 4.1. Then we present our detailed experimental analysis on PASCAL VOC dataset in Sec. 4.2 together with ablation studies and qualitative results. Finally, results on COCO dataset will be presented in Sec. 4.3.

在本节中，我们首先介绍第4.1节中的实现细节和实验配置。然后，在第4.2节中介绍了我们对PASCAL VOC数据集的详细实验分析，以及消融研究和定性结果。最后，COCO数据集的结果将在第4.3节中给出。

4.1. Datasets and Settings

Following the instructions in [13], we construct the few-shot detection datasets for fair comparison with other state-of-the-art methods. Moreover, to achieve a more stable few-shot detection results, we perform 10 random runs with different randomly sampled shots. Hence, all the results in the experiments is averaged results by 10 random runs.

根据[13]中的说明，我们构建了小样本检测数据集，以便与其他先进方法进行比较。此外，为了获得更稳定的小样本检测结果，我们对不同的随机采样数量进行了10次随机运行。因此，所有的实验结果都是10次随机运行的平均结果。

PASCAL VOC. For PASCAL VOC dataset, we train our model on the VOC 2007 trainval and VOC 2012 trainval sets and test the model on VOC 2007 test set. The evaluation metric is the mean Average Precision (mAP). Both the trainval sets are split by object categories, where 5 are randomly chosen as novel classes and the left 15 are base classes. We use the same split as [13], where novel classes for four splits are {“bird”, “bus”, “cow”, “motorbike” (“mbike”), “sofa”}, {“aeroplane”(“aero”, “bottle”, “cow”, “horse”, “sofa”}, {“boat”, “cat”, “motorbike”, “sheep”, “sofa”}, respectively. For few-shot object detection experiments, the few-shot dataset consists of images where k object instances are available for each category and k is set as 1/3/5/10.

PASCAL VOC. 对于PASCAL VOC数据集，我们在VOC 2007 trainval集和VOC 2012 trainval集上训练模型，并在VOC 2007测试集上测试模型。评价指标是平均精度(mAP)。这两个训练集都按对象类别划分，其中5个随机选择为新类，其余15个为基类。我们使用与[13]相同的拆分，其中四个拆分的新类分别是{" bird “,” bus “,” cow “,” motorbike " (" mbike “),” sofa “}， {” aeroplane " (" aero ", " bottle “,” cow ", " horse ", " sofa “}， {” boat “,” cat “,” motorbike ", " sheep “,” sofa "}。对于小样本目标检测实验，小样本数据集由图像组成，其中每个类别有k个对象实例，k设置为1/3/5/10。

COCO. MS COCO dataset has 80 object categories, where the 20 categories overlapped with PASCAL VOC are set to be novel classes. 5000 images from the validation set noted as minival are used for evaluation while the left images in the train and validation set are used for training. The process of constructing few-shot dataset is similar to PASCAL VOC dataset and k is set as 10/30.

COCO. COCO数据集有80个对象类别，其中与PASCAL VOC重叠的20个类别被设置为新类。来自minival验证集的5000张图像用于评估，而训练集和验证集中剩下的图像用于训练。构建小样本数据集的过程类似于PASCAL VOC数据集，k设为10/30。

Implementation Details. We perform training and testing process on images with a single scale. The shorter side of the query image is resized to 800 pixels and longer sides are less than 1333 pixels while maintaining the aspect ratio. The support image is resized to a squared image of 256 × 256. We adopt ResNet-101 [10] as feature extractor and RoI Align [8] as RoI feature extractor. The weights of
the backbone is pre-trained on ImageNet [2]. After training on base classes, only the last fully-connected layer (for classification) is removed and replaced by a new one randomly initialized. It is worth noting that all parts of the model participate in learning process in the second meta fine-tuning phase without any freeze operation. We train our model with a mini-batch size as 4 with 2 GPUs. We utilize the
SGD optimizer with the momentum of 0.9, and weight decay of 0.0001. For meta-training on PASCAL VOC, models are trained for 240k, 8k, and 4k iterations with learning rates of 0.005, 0.0005 and 0.00005 respectively. For meta fine-tuning on PASCAL VOC, models are trained for 1300, 400 and 300 iterations with learning rates as 0.005, 0.0005 and 0.00005 respectively. As for MS COCO dataset, during
meta-training, models are trained for 56k, 14k and 10k iterations with learning rates of 0.005, 0.0005 and 0.00005 respectively. And during meta fine-tuning, model are trained for 2800, 700 and 500 iteration for 10-shot fine-tuning and 5600, 1400 and 1000 iterations for 30-shot fine-tuning.

实验细节。 我们对单一尺度的图像进行训练和测试。查询图像较短的边被调整为800像素，较长的边小于1333像素，同时保持长宽比。支持图像被调整为256 × 256的平方图像。我们采用ResNet-101[10]作为特征提取器，RoI Align[8]作为RoI特征提取器。Backbone的权重在ImageNet[2]上进行预训练。在基类上训练之后，只有最后一个全连接层(用于分类)被移除，并被一个随机初始化的新层所取代。值得注意的是，在第二个元微调阶段，模型的所有部分都参与了学习过程，没有任何冻结操作。我们设置mini-batch大小为4，2个GPU来训练模型。使用SGD优化器，momentum 为0.9，weight decay 为0.0001。在PASCAL VOC上进行元训练时，训练模型240k、8k和4k迭代，学习率分别为0.005、0.0005和0.00005。为了在PASCAL VOC上进行元微调，对模型进行1300、400和300次迭代训练，学习率分别为0.005、0.0005和0.00005。对于MS COCO数据集，在元训练过程中，对模型进行56k、14k和10k迭代训练，学习率分别为0.005、0.0005和0.00005。在元微调过程中，训练模型2800、700和500次迭代进行10次微调，训练模型5600、1400和1000次迭代进行30次微调。

Baseline Method. Since we adopt Faster-RCNN as base detector, we choose Meta R-CNN [35] as the baseline method. Moreover, we implement it by ourselves for a more fair comparison.

基线方法。 由于我们采用Faster-RCNN作为基础检测器，我们选择Meta R-CNN[35]作为基线方法。此外，我们自己实现了它，以进行更公平的比较。

4.2. Experiments on PASCAL VOC

In this section, we conduct experiments on PASCAL VOC dataset. We first compare our method with the state-of-the-art methods. Then we carry out ablation studies to perform comprehensive analysis of the components of our proposed DCNet. Finally, some qualitative results are presented to provide an intuitive view of the validity of our method. For all the experiments, we run 10 trials with random support data and report the averaged performance.

在本节中，我们在PASCAL VOC数据集上进行实验。首先将我们的方法与最先进的方法进行比较。然后进行消融研究，对我们提出的DCNet的组成部分进行综合分析。最后，给出了一些定性的结果，以便对我们方法的有效性提供一个直观的看法。对于所有的实验，我们使用随机支持数据运行10个试验，并报告平均性能。

4.2.1 Comparisons with State-of-the-art Methods

In Table 1, we compare our method with former state-of-the-art methods which mostly report results with multiple random runs. Our proposed DCNet achieves state-of-the-art results on almost all the splits with different shots and out-performs previous methods by a large margin. Specifically, in extremely low-shot settings (i.e. 1-shot), our method out-performs others by about 10% in split 1 and 3, providing a convincing proof that our DCNet is able to capture local detailed information to overcome the variations brought by the randomly sampled training shots.

在表1中，将我们的方法与以前最先进的方法进行比较，这些方法大多报告了多次随机运行的结果。我们提出的DCNet在几乎所有不同样本的分割上都取得了最先进的结果，并且大大优于以前的方法。具体来说，在极小样本设置(即1个样本)下，我们的方法在分割1和分割3中比其他方法高出约10%，提供了一个令人信服的证据，证明我们的DCNet能够捕获局部详细信息，克服随机采样训练样本带来的变化。

4.2.2 Ablation Study

We present results of comprehensive ablation studies to analyze the effectiveness of various components of the proposed DCNet. All ablation studies are conducted on the PASCAL VOC 2007 test set with the first novel splits. All results are averaged over 10 random runs.

我们提出了综合消融研究的结果，以分析所提出的DCNet的各个组成部分的有效性。所有的消融实验都是在PASCAL VOC 2007测试集上进行的，并采用了第一种新的分割方式。所有结果均为10次随机运行的平均值。

Impact of dense relation distillation module. We conduct experiments to validate the superiority of the proposed dense relation distillation (DRD) module. Specifically, we implement the baseline method for meta-learning based few-shot detection Meta R-CNN with class-specific prediction for the final box classification and regression. While the DRD module requires no extra class-specific process-
ing. As shown in line 1 and 2 of Table 2, DCNet w/o CFA equals to Faster R-CNN equipped with DRD module, our proposed DRD module achieves consistent improvement on all novel splits with all shots number, which effectively demonstrates the supremacy of the relation distillation mechanism over the baseline method. Moreover, the improvement over baseline is significant when the shot number is low, which proves that the DRD module successfully exploits useful information from limited support data.

密集关系蒸馏模块的影响。 通过实验验证了所提出的密集关系蒸馏DRD)模块的优越性。具体而言，我们实现了基于元学习的小样本目标检测Meta R-CNN的基线方法，并对最终的目标框分类和回归进行了特定类别的预测。而DRD模块不需要额外的类特定处理。如表2的第1和2行所示，DCNet w/o CFA等于配置了DRD模块的Faster R-CNN，我们提出的DRD模块在所有新分割和所有样本数数上都实现了一致的改进，这有效地证明了关系蒸馏机制优于基线方法。此外，当样本数量较小时，对基线的改进是显著的，这证明DRD模块成功地利用了有限的支持数据中的有用信息。

Table 1. Few-shot object detection performance on VOC 2007 test set of PASCAL VOC dataset. We report the mAP with IoU threshold 0.5 (AP50) under three different splits for five novel classes. * denotes the results averaged over multiple random runs.

表1。PASCAL VOC数据集的VOC 2007测试集上的小样本目标检测性能。我们报告了在五个新类别的三种不同划分下，IoU阈值为0.5（AP50）的mAP。*表示多次随机运行的结果的平均值。

Figure 5. (a). Visualizations of features before and after dense relation distillation module. (b). Visualizations of effect of context-aware feature aggregation module.

图5。(a).密集关系蒸馏模块前后的特征可视化。(b).上下文聚合模块效果的可视化。

Table 2. Ablation study to evaluate the effectiveness of different components in our proposed method. The mAP with IoU threhold 0.5 (AP50) is reported. * denotes CFA module with attention aggregation fashion. † denotes our implementation.

表2。消融研究，以评估我们提出的方法中不同组件的有效性。报告了IoU达到0.5时的mAP（AP50）。*表示具有注意力聚集方式的CFA模块。†表示我们的实施。

Table 3. The impact of different RoI pooling resolutions. The experiments are conducted on VOC 2007 test set of PASCAL VOC dataset with novel split1 and AP50 on 10-shot task averaged from 10 random runs is reported.

表3。不同RoI池化的影响。在PASCAL VOC数据集的VOC 2007测试集上，采用新型split1和AP50对10次随机运行中平均的10次任务进行了实验。

Impact of context-aware feature aggregation module.
We carry out experiments to evaluate the validity of the proposed context-aware feature aggregation (CFA) module. Specifically, RoI features generated from parallel branches are aggregated with a simple summation. From line 1 and 3 of the table, with the introduction of CFA module, Meta R-CNN achieves notable gains over the baseline. Since CFA module targets at preserving detailed information in a scale-aware manner, different levels of detailed features can be retrieved to assist the prediction process.

上下文感知特性聚合模块的影响。
我们通过实验来评估所提出的上下文感知特征聚合(CFA)模块的有效性。具体来说，从并行分支生成的RoI特征通过简单的求和进行聚合。从表2的第一行和第三行，随着CFA模块的引入，Meta R-CNN比基线取得了显著的进步。由于CFA模块的目标是以规模意识的方式保留详细信息，不同层次的详细特征可以被检索，以帮助预测过程。

Impact of different RoI pooling resolutions.
To further evaluate the impact of different RoI pooling resolutions, we perform explicit experiments to show the detailed perfor-mance. As shown in Table 3, solely adopting larger pooling resolution could yield better performance. However, only when aggregating features generated with all three resolutions, the best performance could be obtained.

不同RoI池化解决方案的影响。
为了进一步评估不同RoI池化的影响，我们进行了显式实验来显示详细的性能。如表3所示，仅采用较大的池化分辨率可以产生更好的性能。然而，只有在聚合所有三种分辨率产生的特征时，才能获得最佳性能。

Impact of attentive aggregation fashion for CFA module. Based on the plain CFA module, we further propose an attention-based aggregation mechanism to adaptively fuse different RoI features. As presented in line 3 and line 4 of Table 2, the attention aggregation mechanism can further boost the performance of the model, which promotes the plain CFA module with a more comprehensive feature representation, effectively balancing the contributions of each extracted features. Finally, with the combination of DRD module and CFA module, we present DCNet, which achieves the best performance according to Table 2.

聚合方式对CFA模块的影响。 在普通CFA模块的基础上，我们进一步提出了一种基于注意力的聚合机制，以自适应融合不同的RoI特征。如表2第3行和第4行所示，注意力聚合机制可以进一步提升模型的性能，促进普通CFA模块具有更全面的特征表示，有效地平衡了每个提取的特征的贡献。最后，通过DRD模块和CFA模块的结合，我们提出了DCNet，其性能达到了表2所示的最佳。

4.2.3 Qualitative Results

To further comprehend the effect of dense relation distillation (DRD) module, we visualize features before and after DRD module. As shown in Fig. 5 (a), after relation distillation, query features can be activated to facilitate the subsequent detection procedure. Moreover, different from former meta-learning based methods which performs prediction in a class-wise manner, our proposed DRD module can model relations between query and support features in all classes at the same time as shown in the second line of Fig. 5 (a). The DRD module enables the model to focus more on the query objects under the guidance of support information. Additionally, we also visualize the effect of CFA module presented in Fig. 5 (b). With a relatively large or small query object as input, DCNet w/o CFA suffers from false classification or missing detection , while the introduction of CFA module could effectively resolve this issue.

为了进一步理解密集关系蒸馏(DRD)模块的效果，我们可视化了DRD模块前后的特征。如图5 (a)所示，在进行关系蒸馏后，可以激活查询特征，方便后续的检测过程。此外，与以往基于元学习的基于类的预测方法不同，我们提出的DRD模块可以同时建模所有类中的查询和支持特征之间的关系，如图5 (a)的第二行所示。DRD模块使模型在支持信息的引导下更关注查询对象。此外，我们还可视化了CFA模块的效果，如图5 (b)所示。当输入的查询对象比较大或比较小时，DCNet w/o CFA存在误分类或漏检的问题，而CFA模块的引入可以有效地解决这一问题。

4.3. Experiments on MS COCO

We evaluate 10/30-shot setups on MS COCO benchmark and report the averaged performance with the standard COCO metrics over 10 runs with random shots. The results on novel classes can be seen in Table 4. Despite the challenging nature of COCO dataset with large number of categories, our proposed DCNet achieves state-of-the-art performance on most of the metrics.

我们在MS COCO基准上评估了10/30次样本设置，并报告了在10次随机样本中使用标准COCO指标的平均表现。新类的结果如表4所示。尽管COCO数据集具有大量类别的挑战性，我们提出的DCNet在大多数指标上实现了最先进的性能。

5. Conclusions

In this paper, we have presented the Dense Relation Distillation Network with Context-aware Aggregation (DCNet) to tackle few-shot object detection problem. Dense relation distillation module adopts dense matching strategy between query and support features to fully exploit support information. Furthermore, context-aware feature aggregation module adaptively harnesses features from different scales to produce a more comprehensive feature representation. The ablation experiments demonstrate the effectiveness of each component of DCNet. Our proposed DCNet achieves state-of-the-art results on two benchmark datasets, i.e. PASCAL VOC and MS COCO.

本文提出了一种基于上下文感知聚合的密集关系蒸馏网络(DCNet)来解决少镜头目标检测问题。密集关系蒸馏模块采用查询与支持特征之间的密集匹配策略，充分利用支持信息。此外，上下文感知的特征聚合模块自适应地利用来自不同尺度的特征，产生更全面的特征表示。消融实验验证了DCNet各组成的有效性。我们提出的DCNet在两个基准数据集上实现了最先进的结果，即PASCAL VOC和MS COCO。

你可能感兴趣的:(小样本目标检测,论文翻译与阅读,目标检测,计算机视觉,深度学习)

android视频缓存框架 [AndroidVideoCache](https://github.com/danikula/AndroidVideoCache) 源码解析与评估 MrJarvisDong third party 源码
文章目录android视频缓存框架[AndroidVideoCache](https://github.com/danikula/AndroidVideoCache)源码解析与评估引言使用方式关键类解析HttpProxyCacheServer代理缓存服务类**java.net.ProxySelector**代理选择Pinger判断本地serverSocket是否存活GetRequest封装用于获取
一、Python入门基础 MeyrlNotFound python 开发语言
1.Python简介与环境搭建•了解Python的历史、特点和应用领域Python的历史Python是一种高级编程语言，由GuidovanRossum于1989年发明。Python语言的设计目标是让代码易读、易写、易维护，从而提高开发效率和代码质量。自其诞生以来，Python已从一个简单的系统管理工具发展成为一种广泛应用于多个领域的编程语言。Python的特点1.简单易学：Python的语法简洁明
基于JAVA中的spring框架和jsp实现自然灾害论坛平台项目【附项目源码+论文说明】大雄是个程序员项目实践自然灾害论坛平台 java 项目源码 spring 毕业设计课程设计网页设计
摘要在上个世纪末期，也就是20世纪末，随着计算机技术的发展与进步和数据库方面的知识在互联网的大力运用，互联网技术以及网站技术在网上的大力推广，网上论坛（自然灾害论坛）也逐渐在网兴起，它的出现帮助了网上各种特定的群体进行一个在线的知识传递与信息的交流。本计算机自然灾害论坛设计，采用了JSP（JAVA）技术和MYSQL数据库开发，尝试实现了自然灾害论坛的基本功能以及帮助我们掌握了论坛技术的核心特点。该
众多主播都在用的超有趣桌面小宠物！开开心心_Every 宠物 virtualenv eclipse python django pygame java
BongocatMver是一款主播直播必备萌系插件，是一款开源软件。软件由国外一个高中生kuroni开发出来，让手鼓猫中的手臂可以跟随鼠标，按键的操作而发生动作。萌系的猫咪造型以及键盘映射的交互动画，十分适合游戏主播、绘画主播、音游主播在直播时使用的虚拟造型插件，可以给你的直播间或视频带来无限的元气。软件采用Live2d模型来实现自定义形状，用户可以根据自己的设定来更换不同形状的猫。精准的面部捕捉
npm error gyp info 计算机辅助工程 npm 前端 node.js
在使用npm安装Node.js包时，可能会遇到各种错误，其中gyp错误是比较常见的一种。gyp是Node.js的一个工具，用于编译C++代码。这些错误通常发生在需要编译原生模块的npm包时。下面是一些常见的原因和解决方法：常见原因及解决方法Python未安装或版本不兼容：Node.js使用Python来运行gyp。确保你的系统上安装了Python，并且版本与node-gyp兼容。通常推荐使用Pyt
如何实现具备自动重连与心跳检测的WebSocket客户端 FFF-X websocket 网络协议网络
本文介绍如何通过原生WebSocketAPI封装一个具备自动重连、心跳检测、错误恢复等能力的稳健客户端。适用于需要长连接的实时通讯场景（如聊天室、实时数据监控等）。核心功能亮点自动重连机制-指数退避策略重连心跳保活-双向检测连接活性消息可靠性-失败消息自动重发异常处理-错误分类处理机制状态管理-精准控制连接生命周期关键优化点说明事件监听优化改用addEventListener替代onopen等属性
【R语言2】Introduction to R 基础知识复习小测试 Pop quiz 不二程序猿 r语言开发语言数据挖掘
【R语言】基础知识点Popquiz前言Question1Question2Question3Question4Question5Question6Question7Question8Question9Question10是兄弟就砍一刀！答案前言在这里会有10道题，每一道都是对R语言的基础了解。有单选题和填空题，答案在最下面。填空题可以放到Rstudio里运行得出答案。Question1Whicho
Java架构师成长之路 hweiyu00 分享 spring 微服务 spring cloud java
概述本教程主要从6个方面，全面讲解Java技术栈的知识。1.性能调优深入理解MySQL底层原理、索引逻辑，数据结构与算法。使用Explain进行优化分析MVCC原理剖析日志机制解析2.框架源码掌握Spring底层原理带你手写一个Spring解析IOC、AOP源码、以及事务原理3.并发编程剖析Java底层锁机制CAS、JUC工具使用、AQS源码分析以及并发的集合类的讲解4.分布式开发剖析分布式中使用
LangChain组件Tools/Toolkits详解（5）——返回产出artifact 龙焰智能 langchain artifact ToolCall BaseTool 工具产物 ToolMessages
LangChain组件Tools/Toolkits详解（5）——返回产出artifact本篇摘要14.LangChain组件Tools/Toolkits详解14.5返回产出artifact14.5.1定义工具14.5.2使用ToolCall调用工具14.5.3与模型一起使用14.5.4从子例化BaseTool返回参考文献本章目录如下：《LangChain组件Tools/Toolkits详解（1）—
Java面试高频问题深度解析：JVM、锁机制、SQL优化与并发处理 Debug Your Career 面试 java 面试 jvm
问题列表Java中如何实现一个工作流引擎？Bean的作用域有哪些？JVM中的锁机制是如何工作的？三个方法分别被synchronized锁住，方法a调用方法b，b能获取到a的锁吗？会有什么问题？SQL优化时，EXPLAIN中需要关注哪些关键点？什么是覆盖索引？SELECT*一定不会命中索引吗？SELECT*和SELECT全字段在性能上有区别吗？什么是回表？它与索引有什么关系？100万数据分给10个线
JS基础-事件模型(事件&事件流&自定义事件&事件冒泡/代理) LYFlied html&浏览器 javascript 事件模型事件流前端面试
文章目录一、事件与事件流二、事件模型1.DOM0级模型2.IE事件模型3.DOM2级模型4.DOM3级事件处理方式三、事件对象四、事件绑定与解除1.事件绑定1.1对象.on事件名字=事件处理函数1.2.对象.addEventListener("没有on的事件名字",事件处理函数,false)3.对象.attachEvent("有on的事件名字",事件处理函数);2.解除绑定五、EventWrapp
QEMU源码全解析 —— CPU虚拟化（12）蓝天居士 QEMU/KVM QEMU KVM CPU虚拟化
接前一篇文章：本文内容参考：《趣谈Linux操作系统》——刘超，极客时间《QEMU/KVM》源码解析与应用——李强，机械工业出版社《深度探索Linux系统虚拟化原理与实现》——王柏生谢广军，机械工业出版社特此致谢！三、KVM模块初始化介绍1.KVM简介与源码组织结构KVM全称为Kernel-BasedVirtualMachine，中文译为基于内核的虚拟化技术。KVM是由以色列初创公司Qumrane
Android一个APP里面最少有几个线程积跬步DEV Android 开发实战大全 Android
Android应用启动时，默认会创建一个进程，该进程中最少包含5个系统自动创建的线程，具体如下：Main线程（主线程/UI线程）负责处理用户交互、UI更新等核心操作，所有与界面相关的逻辑必须在此线程执行。若在此线程执行耗时操作（如网络请求），会导致界面卡顿甚至触发ANR（应用无响应）。FinalizerDaemon线程（终结者守护线程）当对象重写了finalize()方法时，该线程负责将这些对象放
华为云赋能智能制造，助力图扑软件构造数字孪生场景 36Kr网科技华为云制造 big data
出行手机查看交通方案、物业管理的智能可视勘察管控、疫情地图提前预知危害……这些曾经存在于科幻片中的高科技场景一一在现代生活得到了应用与普及，其背后的数据可视化应用，正贯穿于当今大数据时代的各行各业，成为人们洞察数据内涵的有力工具，推动数字经济发展驶入“快车道”。数字经济发展的背后，是大数据时趋势下各地区积极贯彻国家数字经济发展战略的时代精神；高效便捷管控的背后，是云端平台各大企业的互助共赢；高质精
ESP32-C6助力设备互联互通，Wi-Fi6无线通信方案，物联网交互联动深圳启明云端科技 WiFi6 ESP32-C6 乐鑫物联网无线方案
在物联网飞速发展的今天，连接技术的革新成为推动行业进步的关键力量。Wi-Fi6技术的出现，犹如一颗璀璨的新星，为物联网设备带来了前所未有的高效与低耗体验。乐鑫推出的ESP32-C6作为首款支持Wi-Fi6的SoC，集成了2.4GHzWi-Fi6、Bluetooth5(LE)和802.15.4协议，这一组合使其具备了行业领先的射频性能。其支持的上行、下行正交频分多址（OFDMA）接入和下行多用户多输
Linux系统编程：目录操作、文件权限与库管理网恋东雪莲被骗114514 linux 运维服务器
Linux系统编程：目录操作、文件权限与库管理目录的读取在Linux系统编程中，目录操作是常见的任务之一。以下是用于目录操作的核心函数及其用法：1.opendir功能：打开一个目录，返回指向目录流的指针。原型：#includeDIR*opendir(constchar*name);参数：name：目录路径字符串。返回值：成功：返回DIR*指针；失败返回NULL。示例：DIR*dir=opendir
Uni-App 双栏联动滚动组件开发详解 (电梯导航) FFF-X uni-app
本文基于提供的代码实现一个左右联动的滚动组件，以下是详细的代码解析与实现原理说明：{{item}}{{section.title}}{{para}}exportdefault{//组件参数定义props:{leftData:{//左侧导航数据type:Array,default:()=>['章节1','章节2','章节3','章节4','章节5','章节6'],},rightData:{//右侧内
QEMU与KVM架构三境界虚拟化架构开发语言
完整架构图，来自QEMU官网QEMU与KVM架构总体上分为3部分。VMXroot模式的应用层（左上）VMXroot模式的内核层（左下）虚拟机的运行（右上）VMXroot相对于VMXnon-root模式，CPU引入了硬件虚拟化指令后有了这些概念，VMXroot可以理解为宿主机模式，VMXnon-root可以理解为虚拟机模式虚拟机运行在VMXnon-root模式下VMXroot模式与未引入VT-x之前
《Oracle DBA入门实战：十大高频问题详解与避坑指南》鸿·蒙数据库 Oracle数据库 DBA入门数据库管理 IT技术干货学习笔记
OracleDBA入门作业十问十答本文为OracleDBA入门作业整理，涵盖工具使用、配置管理及权限控制等核心知识点，适合新手快速上手。如有疑问或补充，欢迎评论区交流！1.DBA常用工具有哪些？OracleUniversalInstaller(OUI)用途：安装、升级或删除软件组件。OracleDatabaseConfigurationAssistant(DBCA)用途：通过图形界面创建、删除或修
【商城实战(55)】商城数据库备份：策略与实操指南奔跑吧邓邓子商城实战商城实战数据库备份 MySQL 策略与实操
【商城实战】专栏重磅来袭！这是一份专为开发者与电商从业者打造的超详细指南。从项目基础搭建，运用uniapp、ElementPlus、SpringBoot搭建商城框架，到用户、商品、订单等核心模块开发，再到性能优化、安全加固、多端适配，乃至运营推广策略，102章内容层层递进。无论是想深入钻研技术细节，还是探寻商城运营之道，本专栏都能提供从0到1的系统讲解，助力你打造独具竞争力的电商平台，开启电商实战
Android 中蓝牙Profile与UUID jaylkh android bluetooth
在Android中，常用的几种BluetoothProfile分别为：SPP(SerialPortProfile)、A2DP(AdvancedAudioDistributionProfile)、AVRCP(Audio/VideoRemoteControlProfile)、HID(HumanInterfaceDeviceProfile)、HFP(Hands-FreeProfile)。其中Media相
数据结构之顺序表和栈 Dust-Chasing 数据结构算法 c语言
一、顺序表1.1顺序表的概念及结构顺序表是用一段物理地址连续的存储单元依次存储数据元素的线性结构，一般情况下采用数组存储。在数组上完成数据的增删查改。1.2静态顺序表静态顺序表，即使用定长的数组来存储元素，用下面一张图就可以清楚看懂1.3动态顺序表动态顺序表：使用动态开辟的数组存储。与静态顺序表不同，动态顺序表使用的数组大小可以动态变化，从而实现更灵活的储存数据。二、动态顺序表的实现静态顺序表只适
深入理解指针（1） Dust-Chasing c语言开发语言
指针，一般是代指针变量，指针是C语言中至关重要的一部分。由于内容较多，且较难，所以我们掰开了揉碎了慢慢讲，今天我们开始先讲解字符指针，指针数组，数组指针。一、字符指针指针与数据类型相同，有多种分类inta=0;int*pd=&a;//取a的地址，并将其存入指针变量pd中doubleb=5.20;double*pb=&b;//取b的地址floatc=13.14;float*pc=&c;//取c的地址
Python基于深度学习的动物图片识别技术的研究与实现 Java老徐 Python 毕业设计 python 深度学习开发语言深度学习的动物图片识别技术 Python动物图片识别技术
博主介绍：✌程序员徐师兄、7年大厂程序员经历。全网粉丝12w+、csdn博客专家、掘金/华为云/阿里云/InfoQ等平台优质作者、专注于Java技术领域和毕业项目实战✌文末获取源码联系精彩专栏推荐订阅不然下次找不到哟2022-2024年最全的计算机软件毕业设计选题大全：1000个热门选题推荐✅Java项目精品实战案例《100套》Java微信小程序项目实战《100套》感兴趣的可以先收藏起来，还有大家
【深度学习与大模型基础】第7章-特征分解与奇异值分解 lynn-66 深度学习与大模型基础算法机器学习人工智能
一、特征分解特征分解（EigenDecomposition）是线性代数中的一种重要方法，广泛应用于计算机行业的多个领域，如机器学习、图像处理和数据分析等。特征分解将一个方阵分解为特征值和特征向量的形式，帮助我们理解矩阵的结构和性质。1.特征分解的定义对于一个n×n的方阵A，如果存在一个非零向量v和一个标量λ，使得：则称λ为矩阵A的特征值，v为对应的特征向量。特征分解将矩阵A分解为：其中：Q是由特征
不神话大模型，不做技术乌托邦，用"传统IT+AI积木"实现企业智能转型人工智能
一、开篇：AI革命的务实辩证法在技术狂热与落地鸿沟并存的AI时代，灵燕智能体开发平台提出"三轮驱动法则"：•不颠覆的智慧：MySQL、知识图谱库、MQ等传统中间件构成数字地基•不空想的创新：大模型仅承担"认知苦力"，在人类设计的思考链中定向发力•不取巧的工程：通过D2R映射、低代码工具、元数据治理实现可落地的智能装配二、核心价值：智能开发的工业流水线技术要素原子化拆解将复杂需求分解为可执行的"技术
读取一个字符串，字符串可能含有空格，将字符串逆转,原字符串与逆转字符串进行比较@C语言热心市民小汪代码练习 C语言算法学习 c语言开发语言
读取一个字符串，字符串可能含有空格，将字符串逆转原来的字符串与逆转后字符串比较相同，输出0，原字符串小于逆转后字符串输出-1，大于逆转后字符串输出1。例如输入hello，逆转后的字符串为olleh，因为hello小于olleh，所以输出-1SampleInput1helloSampleOutput1-1#include#includeintmain(){charstr[20];charreStr[
Spring Bean 的生命周期：从创建到销毁的完整解析一点多余. java 开发语言
引言：为什么需要了解SpringBean的生命周期？在Spring框架中，Bean是应用程序的核心构建块，理解其生命周期对于开发高效、稳定的应用至关重要。根据2023年JetBrains开发者调查报告，超过75%的Java开发者使用Spring框架，而Bean的生命周期管理是Spring的核心特性之一。以下数据展示了Bean生命周期的重要性：90%的Spring性能问题与Bean的初始化或销毁不当
188.HarmonyOS NEXT系列教程之列表切换案例工具类与最佳实践 harmonyos-next
温馨提示：本篇博客的详细代码已发布到git:https://gitcode.com/nutpi/HarmonyosNext可以下载运行哦！HarmonyOSNEXT系列教程之列表切换案例工具类与最佳实践效果演示1.日志工具类1.1Logger类实现classLogger{privatedomain:number;privateprefix:string;privateformat:string='
六十天前端强化训练之第二十九天之深入解析：从零构建企业级Vue项目的完整指南编程星辰海 #前端前端 Vue项目
=====欢迎来到编程星辰海的博客讲解======看完可以给一个免费的三连吗，谢谢大佬！目录一、Vite核心原理与开发优势二、项目创建深度解析三、配置体系深度剖析四、企业级项目架构设计五、性能优化实战六、开发提效技巧七、质量保障体系八、扩展阅读推荐一、Vite核心原理与开发优势1.1为什么选择Vite？Vite采用现代浏览器原生ES模块系统（NativeESM）作为开发服务器，颠覆了传统打包工具的
HttpClient 4.3与4.3版本以下版本比较 spjich java httpclient
网上利用java发送http请求的代码很多，一搜一大把，有的利用的是java.net.*下的HttpURLConnection，有的用httpclient，而且发送的代码也分门别类。今天我们主要来说的是利用httpclient发送请求。 httpclient又可分为 httpclient3.x httpclient4.x到httpclient4.3以下 httpclient4.3
Essential Studio Enterprise Edition 2015 v1新功能体验 Axiba .net
概述：Essential Studio已全线升级至2015 v1版本了！新版本为JavaScript和ASP.NET MVC添加了新的文件资源管理器控件，还有其他一些控件功能升级，精彩不容错过，让我们一起来看看吧！ syncfusion公司是世界领先的Windows开发组件提供商，该公司正式对外发布Essential Studio Enterprise Edition 2015 v1版本。新版本
[宇宙与天文]微波背景辐射值与地球温度 comsci 背景
宇宙这个庞大,无边无际的空间是否存在某种确定的,变化的温度呢? 如果宇宙微波背景辐射值是表示宇宙空间温度的参数之一,那么测量这些数值,并观测周围的恒星能量输出值,我们是否获得地球的长期气候变化的情况呢? &nbs
lvs-server 男人50 server
#!/bin/bash # # LVS script for VS/DR # #./etc/rc.d/init.d/functions # VIP=10.10.6.252 RIP1=10.10.6.101 RIP2=10.10.6.13 PORT=80 case $1 in start) /sbin/ifconfig eth2:0 $VIP broadca
java的WebCollector爬虫框架 oloz 爬虫
WebCollector主页： https://github.com/CrawlScript/WebCollector 下载：webcollector-版本号-bin.zip将解压后文件夹中的所有jar包添加到工程既可。接下来看demo package org.spider.myspider; import cn.edu.hfut.dmic.webcollector.cra
jQuery append 与 after 的区别小猪猪08
1、after函数定义和用法： after() 方法在被选元素后插入指定的内容。语法： $(selector).after(content) 实例： <html> <head> <script type="text/javascript" src="/jquery/jquery.js"></scr
mysql知识充电香水浓 mysql
索引索引是在存储引擎中实现的，因此每种存储引擎的索引都不一定完全相同，并且每种存储引擎也不一定支持所有索引类型。根据存储引擎定义每个表的最大索引数和最大索引长度。所有存储引擎支持每个表至少16个索引，总索引长度至少为256字节。大多数存储引擎有更高的限制。MYSQL中索引的存储类型有两种：BTREE和HASH，具体和表的存储引擎相关； MYISAM和InnoDB存储引擎
我的架构经验系列文章索引 agevs 架构
下面是一些个人架构上的总结，本来想只在公司内部进行共享的，因此内容写的口语化一点，也没什么图示，所有内容没有查任何资料是脑子里面的东西吐出来的因此可能会不准确不全，希望抛砖引玉，大家互相讨论。要注意，我这些文章是一个总体的架构经验不针对具体的语言和平台，因此也不一定是适用所有的语言和平台的。（内容是前几天写的，现附上索引）前端架构 http://www.
Android so lib库远程http下载和动态注册 aijuans andorid
一、背景在开发Android应用程序的实现，有时候需要引入第三方so lib库，但第三方so库比较大，例如开源第三方播放组件ffmpeg库, 如果直接打包的apk包里面, 整个应用程序会大很多.经过查阅资料和实验，发现通过远程下载so文件，然后再动态注册so文件时可行的。主要需要解决下载so文件存放位置以及文件读写权限问题。二、主要
linux中svn配置出错 conf/svnserve.conf:12: Option expected 解决方法 baalwolf option
在客户端访问subversion版本库时出现这个错误： svnserve.conf:12: Option expected 为什么会出现这个错误呢，就是因为subversion读取配置文件svnserve.conf时，无法识别有前置空格的配置文件，如### This file controls the configuration of the svnserve daemon, if you##
MongoDB的连接池和连接管理 BigCat2013 mongodb
在关系型数据库中，我们总是需要关闭使用的数据库连接，不然大量的创建连接会导致资源的浪费甚至于数据库宕机。这篇文章主要想解释一下mongoDB的连接池以及连接管理机制，如果正对此有疑惑的朋友可以看一下。通常我们习惯于new 一个connection并且通常在finally语句中调用connection的close()方法将其关闭。正巧，mongoDB中当我们new一个Mongo的时候，会发现它也
AngularJS使用Socket.IO bijian1013 JavaScript AngularJS Socket.IO
目前，web应用普遍被要求是实时web应用，即服务端的数据更新之后，应用能立即更新。以前使用的技术（例如polling）存在一些局限性，而且有时我们需要在客户端打开一个socket，然后进行通信。 Socket.IO(http://socket.io/)是一个非常优秀的库，它可以帮你实
[Maven学习笔记四]Maven依赖特性 bit1129 maven
三个模块为了说明问题，以用户登陆小web应用为例。通常一个web应用分为三个模块，模型和数据持久化层user-core, 业务逻辑层user-service以及web展现层user-web， user-service依赖于user-core user-web依赖于user-core和user-service 依赖作用范围 Maven的dependency定义
【Akka一】Akka入门 bit1129 akka
什么是Akka Message-Driven Runtime is the Foundation to Reactive Applications In Akka, your business logic is driven through message-based communication patterns that are independent of physical locatio
zabbix_api之perl语言写法 ronin47 zabbix_api之perl
zabbix_api网上比较多的写法是python或curl。上次我用java－－http://bossr.iteye.com/blog/2195679，这次用perl。for example: #!/usr/bin/perl use 5.010 ; use strict ; use warnings ; use JSON :: RPC :: Client ; use
比优衣库跟牛掰的视频流出了，兄弟连Linux运维工程师课堂实录，更加刺激，更加实在！ brotherlamp linux运维工程师 linux运维工程师教程 linux运维工程师视频 linux运维工程师资料 linux运维工程师自学
比优衣库跟牛掰的视频流出了，兄弟连Linux运维工程师课堂实录，更加刺激，更加实在！ ----------------------------------------------------- 兄弟连Linux运维工程师课堂实录-计算机基础-1-课程体系介绍1 链接：http://pan.baidu.com/s/1i3GQtGL 密码：bl65 兄弟连Lin
bitmap求哈密顿距离-给定N（1<=N<=100000）个五维的点A(x1,x2,x3,x4,x5)，求两个点X(x1,x2,x3,x4,x5)和Y( bylijinnan java
import java.util.Random; /** * 题目： * 给定N（1<=N<=100000）个五维的点A(x1,x2,x3,x4,x5)，求两个点X(x1,x2,x3,x4,x5)和Y(y1,y2,y3,y4,y5)， * 使得他们的哈密顿距离（d=|x1-y1| + |x2-y2| + |x3-y3| + |x4-y4| + |x5-y5|）最大
map的三种遍历方法 chicony map
package com.test; import java.util.Collection; import java.util.HashMap; import java.util.Iterator; import java.util.Map; import java.util.Set; public class TestMap { public static v
Linux安装mysql的一些坑 chenchao051 linux
1、mysql不建议在root用户下运行 2、出现服务启动不了，111错误，注意要用chown来赋予权限，我在root用户下装的mysql，我就把usr/share/mysql/mysql.server复制到/etc/init.d/mysqld, (同时把my-huge.cnf复制/etc/my.cnf) chown -R cc /etc/init.d/mysql
Sublime Text 3 配置 daizj 配置 Sublime Text
Sublime Text 3 配置解释(默认){// 设置主题文件“color_scheme”: “Packages/Color Scheme – Default/Monokai.tmTheme”,// 设置字体和大小“font_face”: “Consolas”,“font_size”: 12,// 字体选项：no_bold不显示粗体字，no_italic不显示斜体字，no_antialias和
MySQL server has gone away 问题的解决方法 dcj3sjt126com SQL Server
MySQL server has gone away 问题解决方法，需要的朋友可以参考下。应用程序（比如PHP）长时间的执行批量的MYSQL语句。执行一个SQL，但SQL语句过大或者语句中含有BLOB或者longblob字段。比如，图片数据的处理。都容易引起MySQL server has gone away。今天遇到类似的情景，MySQL只是冷冷的说：MySQL server h
javascript/dom:固定居中效果 dcj3sjt126com JavaScript
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml&
使用 Spring 2.5 注释驱动的 IoC 功能 e200702084 spring bean 配置管理 IOC Office
使用 Spring 2.5 注释驱动的 IoC 功能 developerWorks 文档选项将打印机的版面设置成横向打印模式打印本页将此页作为电子邮件发送将此页作为电子邮件发送级别：初级陈雄华 ([email protected]), 技术总监, 宝宝淘网络科技有限公司 2008 年 2 月 28 日 &nb
MongoDB常用操作命令 geeksun mongodb
1. 基本操作 db.AddUser(username,password) 添加用户 db.auth(usrename,password) 设置数据库连接验证 db.cloneDataBase(fromhost)
php写守护进程（Daemon） hongtoushizi PHP
转载自： http://blog.csdn.net/tengzhaorong/article/details/9764655 守护进程（Daemon）是运行在后台的一种特殊进程。它独立于控制终端并且周期性地执行某种任务或等待处理某些发生的事件。守护进程是一种很有用的进程。php也可以实现守护进程的功能。 1、基本概念 &nbs
spring整合mybatis,关于注入Dao对象出错问题 jonsvien DAO spring bean mybatis prototype
今天在公司测试功能时发现一问题：先进行代码说明： 1，controller配置了Scope="prototype"（表明每一次请求都是原子型） @resource/@autowired service对象都可以（两种注解都可以）。 2，service 配置了Scope="prototype"（表明每一次请求都是原子型）
对象关系行为模式之标识映射 home198979 PHP 架构企业应用对象关系标识映射
HELLO!架构一、概念 identity Map:通过在映射中保存每个已经加载的对象，确保每个对象只加载一次，当要访问对象的时候，通过映射来查找它们。其实在数据源架构模式之数据映射器代码中有提及到标识映射，Mapper类的getFromMap方法就是实现标识映射的实现。二、为什么要使用标识映射？在数据源架构模式之数据映射器中 //c
Linux下hosts文件详解 pda158 linux
　1、主机名：　　无论在局域网还是INTERNET上，每台主机都有一个IP地址，是为了区分此台主机和彼台主机，也就是说IP地址就是主机的门牌号。　　公网：IP地址不方便记忆，所以又有了域名。域名只是在公网（INtERNET)中存在，每个域名都对应一个IP地址，但一个IP地址可有对应多个域名。　　局域网：每台机器都有一个主机名，用于主机与主机之间的便于区分，就可以为每台机器设置主机
nginx配置文件粗解 spjich java nginx
#运行用户#user nobody;#启动进程,通常设置成和cpu的数量相等worker_processes 2;#全局错误日志及PID文件#error_log logs/error.log;#error_log logs/error.log notice;#error_log logs/error.log inf
数学函数 w54653520 java
public class S { // 传入两个整数，进行比较，返回两个数中的最大值的方法。 public int get( int num1, int nu