小张好难瘦

【论文翻译】Class-Incremental Few-Shot Object Detection

Class-Incremental Few-Shot Object Detection

论文地址：https://arxiv.org/pdf/2105.07637.pdf

摘要

Conventional detection networks usually need abundant labeled training samples, while humans can learn new concepts incrementally with just a few examples. This paper focuses on a more challenging but realistic class-incremental few-shot object detection problem (iFSD). It aims to incrementally transfer the model for novel objects from only a few annotated samples without catastrophically forgetting the previously learned ones. To tackle this problem, we propose a novel method LEAST, which can transfer with Less forgetting, fEwer training resources, And Stronger Transfer capability. Specifically, we first present the transfer strategy to reduce unnecessary weight adaptation and improve the transfer capability for iFSD. On this basis, we then integrate the knowledge distillation technique using a less resource-consuming approach to alleviate forgetting and propose a novel clustering-based exemplar selection process to preserve more discriminative features previously learned. Being a generic and effective method, LEAST can largely improve the iFSD performance on various benchmarks.

传统的检测网络通常需要大量的标记训练样本，而人类只需几个例子就可以逐步学习新概念。本文主要研究一个更具挑战性但更现实的类增量小样本目标检测问题（iFSD）。它的目标是在忘记先前学习的模型的情况下，从仅有的几个标注样本增量地传递新目标的模型。为了解决这一问题，我们提出了一种新的最小迁移方法，该方法能够以较少的遗忘、较少的训练资源和更强的迁移能力进行迁移。具体来说，我们首先提出了迁移策略，以减少不必要的重量适应，并提高iFSD的转移能力。在此基础上，我们结合知识提取技术，使用一种资源消耗较少的方法来缓解遗忘，并提出了一种新的基于聚类的样本选择过程，以保留先前学习到的更多鉴别特征。作为一种通用且有效的方法，最小二乘法可以大大提高iFSD在各种基准上的性能。

介绍

Object detection has achieved significant improvements in both speed and accuracy based on the deep Convolutional Neural Network (CNN) [Ren et al., 2015; Lin et al., 2017a; Lin et al., 2017b; Liu et al., 2018], but they are facing new practical challenges. A notable bottleneck is their heavy dependency on the large training set that contains carefully annotated images. However, on the one hand, it is hard to collect a large and sufficiently annotated dataset that covers all the required categories for most real-world problems. On the other hand, novel classes may be continually encountered after the learning stage, e.g. detecting new living species. Training a particular model whenever these novel classes emerge is infeasible.

基于深度卷积神经网络（CNN）的目标检测在速度和准确性方面都取得了显著的提高[Ren等人，2015年；Lin等人，2017a；Lin等人，2017b；Liu等人，2018年]，但他们面临着新的实际挑战。一个显著的瓶颈是它们对包含仔细标注的图像的大型训练集的严重依赖性。然而，一方面，很难收集一个包含大多数实际问题所需的所有类别的大型且有足够注释的数据集。另一方面，在学习阶段之后，可能会不断遇到新的类别，例如发现新的活物种。每当这些新课程出现时，训练一个特定的模型是不可行的。

Inspired by the human’s remarkable ability to incrementally learn novel concepts with just a few samples, classincremental few-shot object detection (iFSD) is beginning to raise research attention. Assuming there is a detector that is well pre-trained on base classes, iFSD aims to transfer it for novel classes that are sequentially observed with very few training examples while not forgetting the old ones.

受人类通过小样本逐步学习新概念的非凡能力的启发，经典增量小样本目标检测（iFSD）开始引起研究关注。假设有一个检测器对基类进行了良好的预训练，iFSD的目标是将其转移到新类，这些新类在不忘记旧类的情况下，通过很少的训练示例连续观察到。

The majority of existing works that transfer a detection model to novel classes focus on non-incremental few-shot object detection (FSD) [Karlinsky et al., 2019; Kang et al., 2019; Wang et al., 2020; Xiao and Marlet, 2020] and classincremental object detection (iOD) [Shmelkov et al., 2017; Hao et al., 2019]. Fig.1 illustrates similarity and difference among FSD, iOD and iFSD. Compared with FSD that mainly cares for detecting novel classes while ignoring base ones, iFSD needs to attack the catastrophic forgetting phenomenon [McCloskey and Cohen, 1989]. It refers to that a neural network forgets previous knowledge when learning a new task and often happens when we simply apply FSD solutions to iFSD. In contrast with iOD that transfers the detector utilizing abundant labeled samples of novel classes, iFSD is more realistic and challenging since people are only willing to annotate very few samples. Even if we have abundant novel samples, large-scale training usually needs intensive computing resources to support, such as GPU servers or clusters. How to incrementally learn novel detectors under limited training resources poses another challenge in iFSD.

将检测模型转换为新类的大多数现有工作侧重于非增量小样本物体检测（FSD）[Karlinsky等人，2019；Kang等人，2019；Wang等人，2020；Xiao和Marlet，2020]和经典增量物体检测（iOD）[Shmelkov等人，2017；Hao等人，2019]。图1说明了FSD、iOD和iFSD之间的相似性和差异。与主要关注检测新类而忽略基本类的FSD相比，iFSD需要攻击灾难性遗忘现象[McCloskey和Cohen，1989]。它指的是神经网络在学习新任务时会忘记以前的知识，并且经常发生在我们简单地将FSD解决方案应用于iFSD时。与利用新类的大量标记样本传输检测器的iOD相比，iFSD更具现实性和挑战性，因为人们只愿意注释很少的样本。即使我们有丰富的新样本，大规模训练通常也需要密集的计算资源来支持，例如GPU服务器或集群。如何在有限的训练资源下逐步学习新型探测器是iFSD面临的另一个挑战。

Figure 1: Illustration of similarity and difference among fewshot detection (FSD), class-incremental object detection (iOD), and class-incremental few-shot detection (iFSD). iFSD uses lightweight transferring to incrementally detect novel objects from a sequential data stream, in which novel classes are offered a few samples. 小样本检测（FSD）、类增量检测（iOD）和小样本类增量检测（iFSD）之间的相似性和差异说明。iFSD使用轻量级传输从序列数据流中增量检测新目标，在序列数据流中，新类提供了一些样本。

A straightforward idea to iFSD is integrating standard detection frameworks with class -incremental few-shot classifiers [Ren et al., 2019; Tao et al., 2020; Liu et al., 2020] who use new techniques to avoid forgetting based on the insight from distillation [Hinton et al., 2015], i.e. previous knowledge can be retained by not perturbing the pre-trained discriminative distribution. However, due to the complicated nature of detection tasks, we need to identify multiple objects from millions of candidate regions in one single image. The above classifiers and detection networks cannot be simply merged. The very recent work [Perez-Rua et al., 2020] proposes a class-specific weight generator to register novel classes incrementally. In each incremental novel task, it requires only a single forward pass of novel samples and does not access base classes. Although it can reduce the consumption of training resources for each novel task, it struggles to remember the knowledge learned in previous tasks and has low transfer capability to detect novel objects.

iFSD的一个直截了当的想法是将标准检测框架与类增量小样本分类器集成[Ren等人，2019年；Tao等人，2020年；Liu等人，2020年]，这些分类器使用新技术避免基于蒸馏而遗忘[Hinton等人，2015年]，即，通过不干扰预先训练的判别分布，可以保留先前的知识。然而，由于检测任务的复杂性，我们需要在一幅图像中从数百万个候选区域中识别多个目标。上述分类器和检测网络不能简单地合并。最近的工作[Perez Rua et al.，2020]提出了一种特定于类的权重生成器，以增量方式注册新类。在每个增量新任务中，它只需要一次新样本的前向传递，并且不访问基类。虽然它可以减少每个新任务的训练资源消耗，但它难以记住以前任务中学习到的知识，并且检测新对象的传输能力较低。

To attack aforementioned problems, we propose a novel iFSD method that incrementally detects novel objects with Less forgetting, fEwer training resources, And Stronger Transfer capability (LEAST). It is generic and straightforward while effectively alleviating the catastrophic forgetting and economizing the consumption of training resources. The contributions of this paper are summarized as follows:

为了解决上述问题，我们提出了一种新的iFSD方法，该方法能够以较少的遗忘、较少的训练资源和较强的传输能力（最少）增量检测新目标。它具有通用性和直观性，同时有效地缓解了灾难性遗忘，节约了训练资源的消耗。本文的贡献总结如下：

• We first give a careful analysis of current methods that can solve the iFSD problem, and then propose a new transfer strategy that decouples class-sensitive object feature extractor from the whole detector in order to obtain stronger transfer capability with less unnecessary weight adaptation.
• We integrate the knowledge distillation technique using a less resource-consuming approach in order to alleviate forgetting the previously learned knowledge.
• We propose a clustering-based exemplar selection algorithm, expected to representatively capture the distribution and intra-class variance of base classes leveraging a few exemplars.
• We conduct extensive experiments to demonstrate that our proposed LEAST can significantly outperform the state-of-the-arts in different settings.

•我们首先仔细分析了当前可以解决iFSD问题的方法，然后提出了一种新的传输策略，该策略将类敏感目标特征提取器与整个检测器解耦，以获得更强的传输能力，同时减少不必要的权重适配。
•我们采用资源消耗较少的方法集成知识蒸馏技术，以减轻对先前学习知识的遗忘。
•我们提出了一种基于聚类的示例选择算法，期望能够代表性地利用一些示例捕获基类的分布和类内差异。
•我们进行了广泛的实验，以证明我们提出的最小二乘法可以在不同环境下显著优于最新水平。

方法

3.1 Problem Definition

Let C = Cb ∪ Cn denotes the whole set of object categories. Cb is the set of base classes that have a large number of training instances, annotated with object categories and bounding boxes. Cn is the disjoint set of novel classes that have only K (usually less than 10) instances per class (i.e. K-shot detection). iFSD aims to learn a detector that can incrementally detect novel objects using K-shot per class. It may encounter a different number of novel classes in practice. Here we consider two different settings as in [Perez-Rua et al., 2020]. The typical setting for iFSD is that the novel classes are added at once with a single model transfer. In the more challenging continual iFSD setting, the novel classes are added one by one with |Cn| times model transfer.

设C=Cb∪Cn表示目标类别的整个集合。Cb是具有大量训练实例的基类集，用目标类别和边界框进行注释。Cn是一组不相交的新类，每个类只有K个（通常少于10个）实例（即K-shot检测）。iFSD的目标是学习一种检测器，该检测器可以使用每类K-shot增量检测新目标。在实践中，它可能会遇到不同数量的新类别。在这里，我们考虑两个不同的设置，如[ Perez Rua等人，2020 ]。iFSD的典型设置是通过单个模型传输立即添加新类。在更具挑战性的连续iFSD设置中，使用|Cn| 次模型传输逐个添加新类。

A general iFSD solution consists of two stages: (1) Pretrain stage: pre-train a standard detector on base classes; (2) Incremental transfer stage: transfer the pre-trained detector to novel classes without forgetting the old ones. Considering the computational requirement and the memory limit, it should be computationally-efficient without revisiting the whole base class data. Our main focus in this paper is the essential incremental transfer stage.

一般的iFSD解决方案包括两个阶段：（1）预训练阶段：在基类上预训练标准检测器；（2）增量转移阶段：在不忘记旧类的情况下，将预先训练好的检测器转移到新类。考虑到计算需求和内存限制，它应该在不重新访问整个基类数据的情况下具有计算效率。我们在本文中的主要关注点是基本增量迁移阶段。

3.2 Reducing Unnecessary Weight Adaptation

We start with a detailed analysis of current methods that can solve the iFSD problem (including exemplar-based methods mentioned in Section 2) and then propose our transfer strategy. Previous approaches can be divided into two subgroups, according to their transfer strategy in the incremental transfer stage: (1) Fix the pre-trained detector and only adapt the last layer to novel classes (denoted as FIX ALL) [Wang et al., 2020; Perez-Rua et al., 2020]. (2) Adapt the whole detector to novel classes (denoted by FIT ALL) [Yan et al., 2019; Xiao and Marlet, 2020]. The former consumes minimal resources in transferring for novel classes while has limited generalization ability since the feature extractor is fixed. The latter usually has good performance on novel classes but forgets old ones since the whole network is biased towards novel objects. It also needs more training resources than the former methods. The difference in resource consumption between these two types of approaches is straightforward, and the performance difference can be found in Tab.2, where TFA is the state-of-the-art method of FIX ALL and FSDetView is the state-of-the-art method of FIT ALL.

我们首先详细分析了当前解决iFSD问题的方法（包括第2节提到的基于示例的方法），然后提出了我们的迁移策略。根据增量迁移阶段的迁移策略，先前的方法可分为两个子组：（1）固定预先训练的检测器，并仅使最后一层适应新类（用FIX ALL表示）【Wang等人，2020年；Perez Rua等人，2020年】。（2）使整个检测器适应新的类别（用FIT ALL表示）[Yan等人，2019；Xiao和Marlet，2020]。前者对新类的传输消耗最少的资源，但由于特征提取器是固定的，因此泛化能力有限。后者通常在新类上具有良好的性能，但由于整个网络偏向于新目标，因此会忘记旧类。与以前的方法相比，它还需要更多的训练资源。这两种方法之间的资源消耗差异是显而易见的，性能差异可以在表2中找到，其中TFA是最先进的FIX ALL方法，而FSDetView是最先进的FIT ALL方法。

The low performance of FIT ALL on base classes make sense due to unnecessary weight adaptation. From the architecture’s perspective, we know that even if a neuron of the front layers changes a little, the final output may vary a lot after the feed-forward pass of a neural network. If we transfer the whole detector using limited supervision (i.e. K-shot), the discriminative distribution learned from previous classes will be further influenced as the updated layers get deeper. Besides, the front layers in a deep learning model usually learn generic features for an image and are well trained in the abundant training examples. Adaptation on them is unnecessary and may cause overfitting on the few samples.

由于不必要的权重调整，FIT ALL在基类的低性能是有意义的。从体系结构的角度来看，我们知道，即使前几层的神经元发生了一些变化，在神经网络的前馈传递之后，最终的输出可能会有很大的变化。如果我们使用有限监督（即K-shot）转移整个检测器，则随着更新层的加深，从以前的类中学习到的区分性分布将进一步受到影响。此外，深度学习模型中的前端层通常学习图像的一般特征，并在丰富的训练示例中得到良好的训练。对其进行调整是不必要的，可能会导致少数样本的过度拟合。

Based on the above consideration, we propose to separate the whole detector into class-agnostic image feature extractor (unchanged during the incremental learning stage) and classs ensitive object feature extractor (CSE) (optimized during the incremental learning stage), as is shown in Fig.2. Usually, a deep backbone in the detection network, e.g. ResNet [He et al., 2016], extracts generic features for an input image and can be regarded as a class-agnostic image feature extractor. If we update the backbone with a few novel samples, its feature extraction capability will be damaged instead, possibly resulting in the forgetting. While the object feature extractor (e.g. RPN and ROI head for a two-stage detector like FasterRCNN [Ren et al., 2015], or FPN and extra subnets for a one-stage detector like RetinaNet [Lin et al., 2017b]) is more sensitive to object categories and often extract object-specific features. It expects to be updated to learn more discriminative information of novel classes; otherwise, the detector will be hard to generalize to novel classes. Thus, we propose only to optimize CSE while keep others fixed in the incremental transfer stage (denoted as FIT CSE).

基于上述考虑，我们将整个检测器分为类无关图像特征提取器（增量学习阶段不变）和类敏感目标特征提取器（CSE）（增量学习阶段优化），如图2所示。通常，检测网络中的深层骨干网络（例如ResNet[He et al.，2016]）会提取输入图像的一般特征，可以视为类无关图像特征提取器。如果我们用一些新的样本更新骨干网络，它的特征提取能力反而会被破坏，可能导致遗忘。而目标特征提取器（例如，两级检测器（如Faster RCNN[Ren等人，2015]）的RPN和ROI头部，或一级检测器（如RetinaNet[Lin等人，2017b]）的FPN和额外子网）对目标类别更为敏感，通常提取目标特定的特征。它预计将被更新，以了解更多的新类歧视性信息；否则，检测器将很难推广到新类。因此，我们建议只优化CSE，而在增量转移阶段（表示为FIT_CSE）保持其他固定的CSE。

Figure 2: Illustration of our proposed method based on Faster R-CNN framework. Left: our clustering-based exemplar selection that is expected to have the potential to capture modes of each base class. These exemplars and few novel instances form a balanced set for the incremental transfer stage. Right: the overall architecture. The decoupled detector and distilled knowledge are used to enhance the model’s transfer capability to novel classes, without forgetting the base ones at the same time. Faster R-CNN框架提出的方法的说明。左图：我们基于集群的示例选择，有望捕获每个基类的模式。这些示例和少数新实例构成了增量转移阶段的平衡集。右图：总体架构。在不忘记基本类的情况下，使用解耦检测器和提取知识来增强模型向新类的传输能力。

The proposed transfer strategy combines the advantages of both FIX ALL and FIT ALL. Without the unnecessary weight adaptation on class-agnostic image feature extractor, it consumes fewer training resources and is more generalized to novel objects utilizing the well learned image features.

所提出的转移策略结合了FIX-ALL和FIT-ALL的优点。该算法不需要对类无关图像特征提取器进行不必要的权值调整，只需消耗较少的训练资源，并且可以更广泛地应用于利用已学习图像特征的新目标。

3.3 Less Forgetting with Knowledge Distillation

In the incremental transfer stage, a naive method is to finetune the network with standard classification loss:

在增量传输阶段，一种简单的方法是使用标准分类损耗对网络进行微调：

where X denotes all candidate regions in the few available images. y∗ is the ground-truth label for x. p(y | x，Φ) is the classification probability for observed classes including incrementally added ones, with parameters Φ. However, since base classes are not available in the incremental transfer stage, the model tends to forget the previous knowledge catastrophically. Even if we use the exemplar set to store a few old samples, the model will also forget some discriminative information for previous classes due to the limited supervision.

其中X表示少数可用图像中的所有候选区域。Y∗ 是x的ground-truth 标签。p（y | x，Φ）是观察到的类别的分类概率，包括递增增加的类别，参数为Φ。然而，由于基类在增量迁移阶段不可用，模型往往会灾难性地忘记以前的知识。即使我们使用样本集来存储一些旧样本，由于监督有限，模型也会忘记以前类的一些鉴别信息。

To attack this problem, we propose to use knowledge distillation [Hinton et al., 2015] in iFSD, inspired by its success in incremental classification [Li and Hoiem, 2016]. Although previous works [Shmelkov et al., 2017; Hao et al.,2019] have tried integrating distillation in object detection, they still need an exact copy of the pre-trained detector to compute the learned knowledge in the discriminative distribution . Here y0 belongs to the previous classes Cold. Φold denotes the parameters learned in previous tasks. In this way, much more computing resources are used than directly training the detector. In contrast, we apply knowledge distillation on positive candidate regions with pre-computed knowledge, shown in Fig.2. To be specific, the previous knowledge for positive candidate regions x ∼ Xp will be pre-computed through ground-truth bounding-boxes bgt and ROI pooling [Ren et al., 2015]. Here Xp denote the set of positive candidate regions, whose Intersection over Union (IoU) with ground truth is above α, i.e. IoU(x， bgt) > α. Similarly, it can also be pre-computed in the one-stage detector according to the anchor owning the maximum IOU with bgt. Then, it is straightforward to avoid forgetting the pre-trained knowledge:

为了解决这个问题，我们建议在iFSD中使用知识蒸馏[Hinton et al.，2015]，其灵感来自于增量分类的成功[Li和Hoiem，2016]。尽管之前的工作【Shmelkov等人，2017年；Hao等人，2019年】已经尝试将蒸馏集成到目标检测中，但他们仍然需要一个经过预训练的检测器的精确副本来计算判别分布中的学习知识。这里y0属于前面的类Cold。Φold表示在以前的任务中学习到的参数。这样，使用的计算资源比直接训练检测器要多得多。相比之下，我们使用预先计算的知识对正候选区域进行知识提取，如图2所示。具体地说，正候选区域的先前知识）x∼ Xp将通过ground-truth边界框bgt和ROI池化预先计算[Ren等人，2015]。这里Xp表示正候选区域集，其与ground-truth的联合（IoU）的交集高于α，即IoU（x，bgt）>α。类似地，它也可以在一级检测器中根据具有bgt的最大IOU的锚预先计算。然后，可以直接避免忘记预先训练的知识：

where Kullback-Leibler (KL) divergence measures the forgotten information of the new discriminative distribution for Cold with respect to the pre-trained one. Considering KL(Pold || P) = H(Pold,P) - H(Pold) and the Cross Entropy H(Pold) is irrelevant to Φ, Eq.2 is equivalent to:

其中，KL散度测量了新的 Cold判别分布相对于预训练分布的遗忘信息。考虑到KL（Pold | | P）=H（Pold，P）-H（Pold），且交叉熵H（Pold）与Φ无关，等式2等价于：

where the classification probability for distillation is produced by scaled softmax: and zi is the output logit of class i. T is the temperature, which is suggested T > 1 to encourage the network to better encode previously learned class similarities [Hinton et al., 2015]. The final loss used in iFSD becomes:

其中，蒸馏的分类概率由缩放的softmax生成：和zi是i类的输出逻辑。T是温度，建议T>1，以鼓励网络更好地编码之前学习到的类相似性[Hinton等人，2015]。iFSD中使用的最终损失为：

where Lrpn and Lloc are the same loss as in Faster R-CNN. is to balance the relative contribution of Lkd.

其中，Lrpn和Lloc的损耗与FasterR-CNN相同。是平衡Lkd的相对贡献。

3.4 More Preserving with A Few Exemplars

In order to not forget the old learned knowledge, another feasible method in the incremental transfer stage is to store a few exemplars drawn from the old training set. As discussed in Section 2, several current methods can be regarded as exemplar-based solutions for iFSD, where exemplars are selected randomly. However, the randomly selected exemplar set is unstable and can not guarantee to well represent the non-uniform data distributions of different classes. Another class-average based exemplar selection [Rebuffi et al., 2017] that aims to approximate the class mean vector is also not suitable for iFSD because of the complex scenarios and intraclass variance in the detection task [Karlinsky et al., 2019].

为了不忘记旧的学习知识，增量迁移阶段的另一个可行方法是存储从旧训练集中提取的一些样本。如第2节所述，当前的几种方法可以被视为iFSD的基于示例的解决方案，其中示例是随机选择的。然而，随机选择的样本集是不稳定的，不能保证很好地表示不同类别的非均匀数据分布。另一种基于类平均值的样本选择[Rebuffi等人，2017年]旨在近似类平均向量，但由于检测任务中的复杂场景和类内方差，也不适用于iFSD[Karlinsky等人，2019年]。

As is shown in the left of Fig.2, the data distribution of each class may have multiple modes. To better preserve the discriminative features learned on base classes, we expect the selected exemplars could have the potential to represent these modes as many as possible. Randomly selected exemplars may not be representative, and class-average based exemplars can only capture one mode. To this problem, we propose a novel clustering-based examplar selection algorithm as follows. Details can be found in Algorithm 1

如图2左侧所示，每个类别的数据分布可能有多种模式。为了更好地保留在基类上学习到的区别性特征，我们希望所选的示例能够尽可能多地表示这些模式。随机选择的示例可能不具有代表性，基于类平均的示例只能捕获一种模式。针对这个问题，我们提出了一种新的基于聚类的样本选择算法。详细信息可在算法1中找到

Each image may contain multiple instances from different classes. Thus, we need to calculate multiple features for an image. Firstly, for simplicity, each distinct category contained in an image is represented by the averaged features of instances with the same category, since the same category’s instances in a single image are probably similar. Secondly, for each c ∈ Cb, we use the k-means algorithm to cluster the features from images containing c into K clusters. K is assumed as the number of shots in order to construct a balanced fewshot dataset between base and novel classes. We will then obtain K centroids for each category. Finally, we progressively select the images that best approximates these learned centroids. Since a single image may cover clusters of different classes, we hope that using at most |Cb|∗ K images to representatively capture the discriminative features of base classes. Moreover, although we use the simple but effective clustering method (k-means) in this paper, other exemplar learning methods are worth trying in the future [Bautista et al., 2016; Mairal et al., 2008]

每个映像可能包含来自不同类的多个实例。因此，我们需要计算图像的多个特征。首先，为简单起见，图像中包含的每个不同类别由具有相同类别的实例的平均特征表示，因为单个图像中相同类别的实例可能相似。其次，对于每个c∈cb，我们使用k-means算法将包含c的图像中的特征聚类为k个聚类。为了在基类和新类之间构建一个平衡的小样本数据集，假设K为新类别数。然后，我们将获得每个类别的K个质心。最后，我们逐步选择最接近这些学习到的质心的图像。由于单个映像可能覆盖不同类的集群，因此我们希望最多使用|Cb|∗ K图像代表性地捕获基类的区别特征。此外，尽管我们在本文中使用了简单但有效的聚类方法（k-means），但其他样本学习方法在未来值得尝试【Bautista等人，2016；Mairal等人，2008】

4 实验

4.1 Experimental Setup

Dataset. We use two popular and challenging datasets for non-incremental FSD and incremental FSD (iFSD) in this paper to evaluate the detection performance, i.e. Pascal VOC [Everingham et al., 2015], and MS-COCO [Lin et al., 2014]. We use the same data splits as in the previous work [Kang et al., 2019; Wang et al., 2019; Yan et al., 2019; Wang et al., 2020] for FSD and the work [Perez-Rua et al., 2020] for iFSD, respectively. In MS-COCO, there are 80 classes in total, which include the whole 20 classes in Pascal VOC. The 60 categories disjoint with Pascal VOC are used as base classes, while the remaining 20 categories are used as novel ones. Each novel class has K 2 f1; 5; 10g samples, and 10 random sample groups are considered in this paper, following [Wang et al., 2020]. For the MS-COCO dataset, we use 5000 images as in [Kang et al., 2019] from the validation set for evaluation, and the rest images containing at least one base instance in train/val sets for pre-training. For the Pascal VOC dataset, we use the 2007 test set for testing.

数据集。在本文中，我们使用了两个流行且具有挑战性的非增量FSD和增量FSD（iFSD）数据集来评估检测性能，即Pascal VOC【Everingham等人，2015年】和MS-COCO【Lin等人，2014年】。我们分别对FSD和iFSD使用与先前工作[Kang等人，2019；Wang等人，2019；Yan等人，2019；Wang等人，2020]相同的数据分割。在MS-COCO中，总共有80个类，其中包括Pascal VOC中的全部20个类。与Pascal VOC不相交的60个类别用作基类，其余20个类别用作新类别。每个新类都有k2f1；5.本文考虑了10g样本和10个随机样本组，如下[Wang等人，2020]。对于MS-COCO数据集，我们使用验证集[Kang等人，2019]中的5000张图像进行评估，其余图像至少包含训练集/val集中的一个基本实例进行预训练。对于Pascal VOC数据集，我们使用2007测试集进行测试。

Evaluation metric. To evaluate the detection performance, we use the average precision (AP) with IOU threshold from 0.5 to 0.95 of the top 100 detections and the corresponding average recall (AR) as the evaluation metrics. For nonincremental FSD, only the performance on novel classes is tested. While for iFSD, the performance on both base and novel classes needs to be tested. To evaluate the model’s comprehensive performance between base and novel domains in a balanced way, we also report their harmonic mean value, i.e. HM(x,y) = 2xy/(x+y), same as another incremental fewshot scenario [Cermelli et al., 2020]. Some works only report the mean score of all classes, which ignores the significance of novel classes. Since the number of novel classes is usually much smaller than base classes, the simple mean score will bias a lot to base classes and then can not well evaluate the comprehensive performance.

评价指标。为了评估检测性能，我们使用前100个检测中IOU阈值为0.5到0.95的平均精度（AP）和相应的平均召回率（AR）作为评估指标。对于非增量FSD，仅测试新类的性能。而对于iFSD，基本类和新类的性能都需要测试。为了以平衡的方式评估模型在基本域和新域之间的综合性能，我们还报告了它们的调和平均值，即HM（x,y）=2xy/（x+y），与另一个增量小样本情景相同[Cermelli等人，2020]。有些论文只报告所有类别的平均分数，而忽略了新类别的重要性。由于新类的数量通常比基类小得多，简单的平均分数会对基类产生很大的偏差，从而无法很好地评估综合性能。

Implementation details. We use Faster-RCNN as our basic detection architecture and ResNet 101 as the backbone following [Wang et al., 2020]. We train the model using the SGD optimizer with a momentum of 0.9 and a weight decay of 0.0001. The parameters α and T are set to 0:7 and 20 respectively. During the pre-training stage, we train the standard Faster-RCNN on base classes for 6 epochs with a learning rate of 0.01, which is decreased by 10 after 4 epochs. We freeze the class-agnostic image feature extractor during the incremental transfer stage and then train our proposed method for 10 epochs with a learning rate of 0.001. Our code is uploaded as an attachment.

实施细节。我们使用Faster-RCNN作为我们的基本检测架构，并使用ResNet 101作为主干网[Wang等人，2020]。我们使用SGD优化器训练模型，动量为0.9，权重衰减为0.0001。参数α和T分别设置为0.7和20。在预训练阶段，我们在基类上训练了6个阶段的标准快速RCNN，学习率为0.01，4个阶段后下降了10。我们在增量传输阶段冻结类无关图像特征提取器，然后以0.001的学习率对我们提出的方法进行10个阶段的训练。我们的代码作为附件上传。

4.2 Non-Incremental Few-Shot Detection

When all novel classes are added at once in one incremental learning stage, and only novel classes are focused, the iFSD problem naturally degenerates into the vanilla FSD problem. In this case, our method (LEAST) can be regarded as an effective solution to the non-incremental FSD, along with the specific advantage of not forgetting base classes at the same time. We compare LEAST with several state-of-thearts on MS-COCO in Tab.1, which are MetaDet [Wang et al., 2019], Meta-RCNN [Yan et al., 2019], TFA [Wang et al., 2020], Attn-RPN [Fan et al., 2020], and FSDetView [Xiao and Marlet, 2020]. From Tab.1, we can observe that LEAST can achieve comparable results on novel classes with nonincremental FSD approaches. AP and AR are even a little higher than the state-of-the-art FSDetView. Since unknown objects in a test image may cover all possible categories, the class similarities and discriminative information previously learned will also benefit novel classes’ performance. It validates the effectiveness of LEAST. Furthermore, comparing with all these competitors that generally improve the detection performance of novel classes while sacrificing the performance on base ones, LEAST has a significant advantage of not forgetting previous knowledge.

当在一个增量学习阶段同时添加所有新类，并且只关注新类时，iFSD问题自然退化为普通FSD问题。在这种情况下，我们的方法（最少）可以被视为非增量FSD的有效解决方案，同时具有不忘记基类的特殊优势。我们至少将表1中MS-COCO的几种状态进行了比较，它们是MetaDet[Wang等人，2019]、Meta-RCNN[Yan等人，2019]、TFA[Wang等人，2020]、Attn RPN[Fan等人，2020]和FSDetView[Xiao和Marlet，2020]。从表1中，我们可以观察到，在使用非增量FSD方法的新类上，最小类可以获得可比的结果。AP和AR甚至比最先进的FSDetView略高。由于测试图像中的未知对象可能涵盖所有可能的类别，因此先前学习到的类相似性和鉴别信息也将有助于新类的性能。验证了最小二乘法的有效性。此外，与所有这些竞争对手相比，LEAST在提高新类的检测性能的同时牺牲了基本类的性能，它在不忘记以前的知识方面具有显著的优势。

4.3 Class-Incremental Few-Shot Detection

Typical iFSD results. We first evaluate the iFSD performance under the typical setting on MS-COCO, where all the novel classes are added at once with one incremental transfer session. In this case, as mentioned in Section 2, some methods for FSD can be naturally regarded as exemplar-based solutions for iFSD. We compare LEAST with several stateof-the-arts in Tab.2: ONCE [Perez-Rua et al., 2020], TFA [Wang et al., 2020] and FSDetView [Xiao and Marlet, 2020]. We also report our proposed method’s performance using no exemplars for a fair comparison with methods without exemplars, denoted as ‘LEAST-NE’.

典型的iFSD结果。我们首先评估了MS-COCO上典型设置下的iFSD性能，其中所有新类都是通过一个增量传输会话一次添加的。在这种情况下，如第2节所述，FSD的一些方法自然可以被视为iFSD的基于示例的解决方案。我们在表2中至少与几种最新技术进行了比较：ONCE【Perez Rua等人，2020年】、TFA【Wang等人，2020年】和FSDetView【肖和Marlet，2020年】。我们还报告了我们提出的方法在使用无示例的情况下的性能，以便与没有示例的方法（表示为“最少-NE”）进行公平比较。

From Tab.2, we have the following observations: (1) LEAST performs better on novel classes than all competitors, with a second highest performance on base ones. TFA freezes the whole feature extractor so that it obtains the best base AP that is a little higher than ours, but its performance on novel classes is explicitly limited. Compared with these methods that have a large performance gap between base and novel classes, LEAST can better balance these two domains and thus have a much higher HM value. Even without exemplars, ours can still achieve promising results on avoiding forgetting and adapting to novel classes. This can validate the effectiveness of using knowledge distillation and transferring with less unnecessary weight adaptation. (2) Using a few exemplars selected by our proposed approach is generally beneficial to iFSD. It can largely improve the performance on previous tasks, and then the HM value, which indicates that previous discriminative features are better preserved.

从表2，我们有以下观察结果：（1）在新类中，最少的表现优于所有竞争对手，在基本类中表现第二高。TFA冻结了整个特征提取器，以便获得比我们的略高的最佳基AP，但其在新类上的性能明显受限。与这些基类和新类之间存在较大性能差距的方法相比，LEAST可以更好地平衡这两个领域，因此具有更高的HM值。即使没有范例，我们仍然可以在避免遗忘和适应新课程方面取得有希望的结果。这可以验证使用知识提取和转移的有效性，同时减少不必要的权重调整。（2）使用我们提出的方法选择的一些示例通常对iFSD有益。它可以大大提高以前任务的性能，然后提高HM值，这表明以前的鉴别特征得到了更好的保留。

Continual iFSD results. We then evaluate the iFSD performance under the continual setting, where the novel classes are added one at a time with |Cn| model updates. We report the harmonic mean performance of base classes and all novel classes added so far in the bottom of Fig.3. We can see that as novel classes are processed sequentially, the performance of LEAST first decreases and then levels off. While without exemplars for base classes, LEAST-NE decreases to 0 after 15 incremental transfer stages. This means that the previously learned knowledge is catastrophically forgotten. Comparing with typical iFSD performance (i.e. AP 18.2 and AR 33.9), the final performance (i.e. AP 12.1 and AR 26.5) of continual iFSD after 20 sessions is lower. It is reasonable since fewer model updates naturally forget less previous knowledge. As ONCE does no report the harmonic mean and has no released code, it does not appear in Fig.3. Yet we can find its performance on the typical setting (i.e. AP 2.2 and AR 10.9) is lower than ours on the continual setting. As the continual setting is more challenging, we can legitimately infer that LEAST consistently outperforms it.

持续的iFSD结果。然后，我们在连续设置下评估iFSD性能，在该设置下，通过|Cn|模型更新一次添加一个新类。我们在图3的底部报告了基类和迄今为止添加的所有新类的调和平均性能。我们可以看到，随着新类的顺序处理，LEAST的性能先是下降，然后趋于平稳。在没有基类示例的情况下，经过15个增量传输阶段后，LEAST-NE将降至0。这意味着以前学到的知识被灾难性地遗忘了。与典型的iFSD表现（即AP 18.2和AR 33.9）相比，20个循环后持续iFSD的最终表现（即AP 12.1和AR 26.5）更低。这是合理的，因为较少的模型更新自然会忘记较少的以前的知识。由于ONCE没有报告谐波平均值，也没有发布代码，因此它没有出现在图3中。然而，我们可以发现其在典型设置（即AP 2.2和AR 10.9）上的性能低于我们在连续设置上的性能。由于连续设置更具挑战性，我们可以合理地推断，最不稳定的设置优于连续设置。

Cross-domain iFSD evaluation. We also evaluate the typical iFSD performance in a cross-domain setting from MSCOCO to Pascal VOC. The performance is only evaluated on novel classes since VOC contains no objects of base classes. As shown at the top of Fig.3, our method (either with or without the selected exemplar set) significantly outperforms the previous competitors in terms of both AR and AP, which verifies the efficacy of LEAST in the cross-domain setting. Comparing with the results of MS-COCO in Tab.2, the performance on VOC is higher on both AP and AR. The performance gap is reasonable since MS-COCO images contain more complex scenarios from both base and novel objects.

跨域iFSD评估。我们还评估了从MSCOCO到Pascal VOC的跨域设置中典型的iFSD性能。由于VOC不包含基类的对象，因此性能仅在新类上进行评估。如图3顶部所示，我们的方法（有或没有选择的样本集）在AR和AP两方面都显著优于之前的竞争对手，这验证了LEST在跨域设置中的有效性。与表2中MS-COCO的结果相比，AP和AR的VOC性能都更高。性能差距是合理的，因为MS-COCO图像包含来自基本目标和新目标的更复杂场景。

Ablation studies. In the ablation study, we test the influence of the proposed transfer strategy, distillation loss, and the exemplar selection method for iFSD in Tab.3. It is clear that the proposed transfer strategy FIT CSE significantly outperforms FIX ALL and FIT ALL. Meanwhile, the distillation loss (denoted by d) largely improves novel class performance since it can preserve the previously learned discriminative information. Besides capable of remembering base classes, the proposed clustering-based exemplar selection algorithm (e) performs better than random selection (er) and class-average based selection (ea) [Rebuffi et al., 2017], when we use the same number of selected exemplars for a fair comparison. It shows that the proposed selection method preserves more modes for base classes.

消融研究。在消融研究中，我们测试了表3中提出的转移策略、蒸馏损失和iFSD样本选择方法的影响。显然，建议的转移策略FIT CSE明显优于FIX ALL和FIT ALL。同时，蒸馏损失（用d表示）可以保留以前学习到的鉴别信息，因此极大地提高了新类的性能。除了能够记住基类外，当我们使用相同数量的选定样本进行公平比较时，所提出的基于聚类的样本选择算法（e）的性能优于随机选择（er）和基于类平均数的选择（ea）[Rebuffi et al.，2017]。结果表明，所提出的选择方法为基类保留了更多的模式。

5 结论

We delved into the realistic and challenging problem: classincremental few-shot object detection, which aims at incrementally detecting novel objects from just a few labeled samples while without forgetting the previously learned ones. We proposed a generic and effective method that uses relatively fewer training resources and can still have stronger transfer capability with less forgetting. Extensive experimental results under different settings verified its effectiveness.

我们深入研究了一个现实且具有挑战性的问题：经典增小样本目标检测，其目的是在不忘记先前学习的目标的情况下，从少量标记样本中增量检测新目标。我们提出了一种通用而有效的方法，该方法使用相对较少的训练资源，并且在较少遗忘的情况下仍然具有更强的传输能力。在不同环境下的大量实验结果验证了其有效性。

你可能感兴趣的:(论文,目标检测)

【论文速读】| 利用大语言模型在灰盒模糊测试中生成初始种子云起无垠论文速读/精读语言模型 p2p 人工智能
基本信息论文标题:HarnessingLargeLanguageModelsforSeedGenerationinGreyb0xFuzzing作者:WenxuanShi,YunhangZhang,XinyuXing,JunXu作者单位:NorthwesternUniversity,UniversityofUtah关键词:Greyb0xfuzzing,LargeLanguageModels,Seed
第79期 | GPTSecurity周报云起无垠 GPTSecurity AIGC gpt
GPTSecurity是一个涵盖了前沿学术研究和实践经验分享的社区，集成了生成预训练Transformer（GPT）、人工智能生成内容（AIGC）以及大语言模型（LLM）等安全领域应用的知识。在这里，您可以找到关于GPT/AIGC/LLM最新的研究论文、博客文章、实用的工具和预设指令（Prompts）。现为了更好地知悉近一周的贡献内容，现总结如下。SecurityPapers1.TrojanWhi
第60期 | GPTSecurity周报云起无垠 GPTSecurity 人工智能语言模型网络安全
GPTSecurity是一个涵盖了前沿学术研究和实践经验分享的社区，集成了生成预训练Transformer（GPT）、人工智能生成内容（AIGC）以及大语言模型（LLM）等安全领域应用的知识。在这里，您可以找到关于GPT/AIGC/LLM最新的研究论文、博客文章、实用的工具和预设指令（Prompts）。现为了更好地知悉近一周的贡献内容，现总结如下。SecurityPapers1.映射你的模型：评估
BERT详解 comli_cn 大模型笔记 bert 人工智能深度学习
1.背景结构1.1基础知识BERT（BidirectionalEncoderRepresentationsfromTransformers）是谷歌提出，作为一个Word2Vec的替代者，其在NLP领域的11个方向大幅刷新了精度，可以说是前几年来自残差网络最优突破性的一项技术了。论文的主要特点以下几点：使用了双向Transformer作为算法的主要框架，之前的模型是从左向右输入一个文本序列，或者将l
第83期 | GPTSecurity周报云起无垠 GPTSecurity 人工智能网络安全
GPTSecurity是一个涵盖了前沿学术研究和实践经验分享的社区，集成了生成预训练Transformer（GPT）、人工智能生成内容（AIGC）以及大语言模型（LLM）等安全领域应用的知识。在这里，您可以找到关于GPT/AIGC/LLM最新的研究论文、博客文章、实用的工具和预设指令（Prompts）。现为了更好地知悉近一周的贡献内容，现总结如下。SecurityPapers1.混乱中建立秩序：人
深度学习模块C2f代码详解你是狒狒吗目标检测人工智能计算机视觉 pytorch YOLO 神经网络
C2f是一个用于构建卷积神经网络（CNN）的模块，特别是在YOLOv5和YOLOv8等目标检测模型中。这个模块是一个改进的CSP（CrossStagePartial）Bottleneck结构，旨在提高计算效率和特征提取能力。下面是对C2f类的详细解释：类定义和初始化Python复制classC2f(nn.Module):“”“FasterImplementationofCSPBottleneckw
华为 Ascend 平台 YOLOv5 目标检测推理教程 Lunar* 目标检测华为 YOLO 目标检测
1.背景介绍随着人工智能技术的快速发展，目标检测在智能安防、自动驾驶、工业检测等领域中扮演了重要角色。YOLOv5是一种高效的目标检测模型，凭借其速度和精度的平衡广受欢迎。华为Ascend推理框架（ACL）是AscendCANN软件栈的核心组件，专为AscendAI加速硬件（如Atlas300I）设计，可实现高性能的深度学习推理。在本文中，我们将介绍如何基于华为AscendACL推理框架对YOLO
PLUTO：突破基于模仿学习的自动驾驶规划极限硅谷秋水机器学习自动驾驶人工智能自动驾驶人工智能机器学习计算机视觉
24年4月来自香港科技大学的论文“PLUTO:PushingtheLimitofImitationLearning-basedPlanningforAutonomousDriving”。PLUTO，突破基于模仿学习的自动驾驶规划极限。改进来自三个关键方面：一种纵向横向感知模型架构，可实现灵活多样的驾驶行为；一种创新的辅助损失计算方法，可广泛应用且可高效地进行批量计算；一种利用对比学习的训练框架，采
LargeAD：用于自动驾驶的大规模跨传感器数据预训练硅谷秋水自动驾驶计算机视觉机器学习自动驾驶人工智能机器学习计算机视觉
25年1月来自新加坡国立大学、南京航空航天、德国Bremerhaven技术大学、上海AI实验室、香港科技大学和香港大学的论文“LargeAD:Large-ScaleCross-SensorDataPretrainingforAutonomousDriving”。视觉基础模型(VFM)的最新进展彻底改变2D视觉感知，但它们在3D场景理解方面的潜力，特别是在自动驾驶应用中的潜力仍未得到充分探索。Lar
昇腾NPU推理YOLOV10目标检测（C++） weixin_51923349 c++ffmpeg opencv
1.准备工作基础环境：需要安装NPU固件驱动，CANN的包在昇腾官网下载，安装最新版就可以了。C++环境搭建链接：cplusplus/environment/catenation_environmental_guidance_CN.md·Ascend/samples-Gitee.com按照上面的链接，需要安装：presentagent,opencv,ffmpeg+acllite其中ffmpeg和o
【学术会议论文投稿】Spring Boot实战：零基础打造你的Web应用新纪元 m0_54804970 spring boot 前端后端
第七届人文教育与社会科学国际学术会议（ICHESS2024）_艾思科蓝_学术一站式服务平台更多学术会议请看：https://ais.cn/u/nuyAF3目录一、SpringBoot简介1.1SpringBoot的诞生背景1.2SpringBoot的核心特性二、搭建开发环境2.1安装Java环境2.2安装IDE2.3安装Maven或Gradle三、创建SpringBoot项目3.1使用Spring
2012广东工业大学毕业论文撰写与答辩指南永不放弃yes
本文还有配套的精品资源，点击获取简介：《2012毕业论文手册》是广东工业大学提供的毕业生论文写作与答辩的综合指导手册。它涵盖了从选题到答辩的完整流程，强调研究能力与学术水平的重要性。手册详细介绍了毕业设计的目的、意义，选题与开题报告的撰写，文献调研与引用的规范，研究方法与实验设计的科学性，论文的结构与撰写技巧，以及论文评审与答辩的准备策略。此外，它还提醒学生注意学术诚信与道德规范。通过这份手册，学
第78期 | GPTSecurity周报 aigcgpts
GPTSecurity是一个涵盖了前沿学术研究和实践经验分享的社区，集成了生成预训练Transformer（GPT）、人工智能生成内容（AIGC）以及大语言模型（LLM）等安全领域应用的知识。在这里，您可以找到关于GPT/AIGC/LLM最新的研究论文、博客文章、实用的工具和预设指令（Prompts）。现为了更好地知悉近一周的贡献内容，现总结如下。SecurityPapers1.ChatNVD：借
ACL 2024 | 美团技术团队精选论文解读美团算法人工智能
本文精选了美团技术团队被ACL2024收录的4篇论文进行解读，论文内容覆盖了训练成本优化、投机解码、代码生成优化、指令微调（IFT）等技术领域。这些论文是美团技术团队跟高校、科研机构合作的成果。希望能给从事相关研究工作的同学带来一些帮助或启发。ACL是计算语言学和自然语言处理领域最重要的顶级国际会议，由国际计算语言学协会组织，每年举办一次。据谷歌学术计算语言学刊物指标显示，ACL影响力位列第一，是
PenGymy论文阅读亚里士多没有德775 论文阅读
这里发现idea被人家先发了，没办法，资料收集的不够全面，现在来学习一下这个项目这篇论文的贡献如下：总的来说，他的主要工作是构建逼真的仿真环境，然后根据这个仿真环境生成真实的靶场，使得这个智能体能够在这个真实的环境去互动。下面来逐渐解析他的工作，我尽量详细一点1、背景和动机这种项目是在网络攻防中，攻防双方攻击者处于暗面，防御者处于明面，这时候受到攻击后应急处理多少会造成损失，那么要是可以提前预测攻
假新闻检测论文（24）A comprehensive survey of multimodal fake news detection techniques... weixin_41964296 假新闻检测自然语言处理
本文综述了利用深度学习架构和注意力机制进行假新闻检测的最新和全面的研究一介绍假新闻定义：虚假或误导性新闻，或“假新闻”，是任何捏造或故意欺骗的媒体内容。假新闻危害：它可以被利用来操纵公众情绪，传播错误信息，甚至干预政治选举。它的主要目的是扭曲、欺骗或操纵个人的信仰和观点。假新闻的形式（类型）：虚假信息在媒体上传播的形式多种多样，包括讽刺、谣言、点击诱饵、错误信息等。讽刺作品通常充满幽默，用来强调特
YOLOv8重磅升级：引入DenseOne密集网络革新主干设计，重塑YOLO目标检测性能新高度程序员杨弋 YOLO 目标检测人工智能
随着深度学习技术的不断进步，目标检测作为计算机视觉领域的重要任务之一，其性能和应用范围也在不断扩大。作为目标检测领域的佼佼者，YOLO（YouOnlyLookOnce）系列算法以其出色的性能和实时性受到了广泛关注。而最近提出的YOLOv8更是在前代版本的基础上进行了多项优化，进一步提升了检测精度和速度。然而，尽管YOLOv8已经取得了显著的进步，但在处理复杂场景和遮挡问题时，仍然存在一定的挑战。为
【YOLOv8改进- Backbone主干】YOLOv8更换主干网络之ConvNexts，纯卷积神经网络，更快更准，，降低参数量！ YOLO大师 YOLO 网络 cnn 目标检测论文阅读 yolov8
YOLOv8目标检测创新改进与实战案例专栏专栏目录：YOLOv8有效改进系列及项目实战目录包含卷积，主干注意力，检测头等创新机制以及各种目标检测分割项目实战案例专栏链接:YOLOv8基础解析+创新改进+实战案例介绍摘要视觉识别的“咆哮20年代”开始于视觉Transformer（ViTs）的引入，ViTs迅速取代了卷积神经网络（ConvNets）成为最先进的图像分类模型。然而，普通的ViT在应用于诸
springboot毕设基于java的在线学习交流平台程序+论文明思计算机毕设 spring boot 课程设计后端
本系统（程序+源码）带文档lw万字以上文末可获取一份本项目的java源码和数据库参考。系统程序文件列表开题报告内容研究背景随着互联网技术的飞速发展和全球教育资源的日益丰富，在线学习已成为人们获取知识、提升技能的重要途径。特别是在近年来，受各种因素影响，线上教育需求激增，促使在线学习交流平台不断涌现。这些平台旨在打破传统教育的时空限制，为学习者提供更加灵活、个性化的学习体验。然而，当前市场上的在线学
拿下美赛M奖之必备软件和网站！东方建模. 数学建模
目录前言：一.题目翻译与理解：DeepL+知云文献翻译二.查找文献：国内外平台结合使用三.论文撰写：Word或LaTeX+Overleaf四.公式输入与思维导图：MathType+XMind五.阅读文献与文献管理：AdobeReader+Zotero六.模型求解与编程：MATLAB+Python+Lingo七.图形绘制与结果可视化：MATLAB+Python+Origin八.流程图与示意图：亿图图
单片机实物成品-005 水质监测系统（代码+硬件+论文）学个单片机单片机实物成品单片机嵌入式硬件
水质监测系统（水温+TDS(水质)+PH+浑浊度+蜂鸣器+灯光+自动模式+手动模式+wifi传输控制+送小程序源码）本项目以软硬件结合开发的方式，选择C语言作为硬件开发技术，以STM32单片机作为核心控制板，在数据传输节点上连接GP2Y1014粉尘传感器、DHT11温湿度传感器、MQ-2烟雾传感器、SGP30甲醛传感器对空气中PM2.5含量、温湿度高低、烟雾浓度、甲醛含量进行采集，并针对异常的数据
单片机实物成品-010 智能宠物喂食系统（代码+硬件+论文）学个单片机单片机实物成品单片机宠物嵌入式硬件
项目介绍版本1：oled显示+定时投喂（舵机模拟）+声光报警+显示实时时间---演示视频：智能宠物喂食001_哔哩哔哩_bilibili1.STM32F103C8T6单片机进行数据处理2.OLED液晶显示3，按键1在数据显示界面时按下按键1切换下一个界面，在校准时间界面时按下按键1退出校准时间界面，在设置定时时间界面中如果是处于设置某个时间的状态按下按键1退出否则切换下一个页面。4.按键2数据显示
基于深度学习的人脸表情识别系统：YOLOv5 + YOLOv8 + YOLOv10 + UI界面 + 数据集 2025年数学建模美赛深度学习 YOLO ui 分类人工智能
引言随着人工智能的飞速发展，深度学习技术已广泛应用于各个领域，尤其是在计算机视觉领域。人脸识别和表情识别是其中的一个重要应用，能够在多种场景下提供重要的信息，例如安全监控、情感分析、智能客服、健康监测等。在人脸表情识别任务中，准确识别人脸的情感状态（如高兴、愤怒、悲伤等）是一个极具挑战性的任务。随着YOLO系列算法的不断进步，YOLOv5、YOLOv8和YOLOv10的推出大大提高了目标检测的精度
基于YOLOv8深度学习的人脸年龄检测识别系统 2025年数学建模美赛 YOLO 深度学习人工智能 ui 数据挖掘分类
引言随着人工智能和计算机视觉的飞速发展，人脸分析技术在年龄检测领域取得了显著进展。人脸年龄检测系统在安全监控、广告推荐、健康监测等领域有广泛应用。本文将基于YOLOv8目标检测模型和UI界面，开发一个完整的人脸年龄检测识别系统。我们将详细介绍项目的技术实现、数据集构建、模型训练以及UI设计，并附上完整代码。目录引言系统架构设计数据准备公开人脸年龄数据集数据标注格式数据目录结构模型训练YOLOv8环
基于深度学习的人脸表情识别系统：YOLOv8 + UI界面 + 数据集完整实现 2025年数学建模美赛深度学习 YOLO ui 人工智能代码
1.引言近年来，人脸表情识别在情感计算、智能人机交互、心理学研究等领域有着广泛的应用。深度学习的快速发展，使得高效、准确的人脸表情识别成为可能。通过利用卷积神经网络（CNN）和目标检测技术，可以实现实时、精准的人脸表情识别。本文将基于YOLOv8构建一个完整的人脸表情识别系统。系统集成了数据集准备、YOLOv8模型训练、实时推理以及基于PyQt5的图形用户界面（UI）。通过本文，你将学习如何实现一
第81期 | GPTSecurity周报 aigc网络安全
GPTSecurity是一个涵盖了前沿学术研究和实践经验分享的社区，集成了生成预训练Transformer（GPT）、人工智能生成内容（AIGC）以及大语言模型（LLM）等安全领域应用的知识。在这里，您可以找到关于GPT/AIGC/LLM最新的研究论文、博客文章、实用的工具和预设指令（Prompts）。现为了更好地知悉近一周的贡献内容，现总结如下。SecurityPapers1.大语言模型与代码安
第83期 | GPTSecurity周报 aigcgpts
GPTSecurity是一个涵盖了前沿学术研究和实践经验分享的社区，集成了生成预训练Transformer（GPT）、人工智能生成内容（AIGC）以及大语言模型（LLM）等安全领域应用的知识。在这里，您可以找到关于GPT/AIGC/LLM最新的研究论文、博客文章、实用的工具和预设指令（Prompts）。现为了更好地知悉近一周的贡献内容，现总结如下。SecurityPapers1.混乱中建立秩序：人
未来是计算机科学的天下,数学——计算机科学及应用未来不可或缺英伦百宝箱未来是计算机科学的天下
论文编号:YYSX006论文字数:3935,页数:05数学——计算机科学及应用未来不可或缺[摘要]：自从上世纪七八十年代，计算机科学与技术得到了迅速的发展，但是，世界起初是有了数学以后才出现计算机科学的，它是数学的延续和发展的辅助工具。首先，高等数学是计算机程序设计的奠基石，任何一个计算机程序设计都离不开数学的理论基础；其次，计算机科学的未来发展需要高等数学的辅助，计算机科学的发展过程中所使用的技
Web APP 阶段性综述预测模型的开发与应用研究 APP construction web app
WebAPP阶段性综述当前，WebAPP主要应用于电脑端，常被用于部署数据分析、机器学习及深度学习等高算力需求的任务。在医学与生物信息学领域，WebAPP扮演着重要角色。在生物信息学领域，诸多工具以WebAPP的形式呈现，相较之下，医学领域的此类应用数量相对较少。在医学和生物信息学的学术论文中，WebAPP是展示研究成果的有效工具，并且还能部署到网络上，服务于实际应用场景。ShinyAPP平台特性
GraphRAG 本地 Ollama - 知识图谱 ericliu2017 知识图谱人工智能
欢迎来到GraphRAGLocalOllama！这个存储库是对微软的GraphRAG的激动人心的改编，旨在支持使用Ollama下载的本地模型。告别昂贵的OpenAPI模型，拥抱使用Ollama进行高效、具有成本效益的本地推理！研究论文有关GraphRAG实现的更多详细信息，请参阅GraphRAG论文。论文摘要使用检索增强生成（RAG）从外部知识源中检索相关信息，使大型语言模型（LLMs）能够回答关
java杨辉三角 3213213333332132 java基础
package com.algorithm; /** * @Description 杨辉三角 * @author FuJianyong * 2015-1-22上午10:10:59 */ public class YangHui { public static void main(String[] args) { //初始化二维数组长度 int[][] y
《大话重构》之大布局的辛酸历史白糖_ 重构
《大话重构》中提到“大布局你伤不起”，如果企图重构一个陈旧的大型系统是有非常大的风险，重构不是想象中那么简单。我目前所在公司正好对产品做了一次“大布局重构”，下面我就分享这个“大布局”项目经验给大家。背景公司专注于企业级管理产品软件，企业有大中小之分，在2000年初公司用JSP/Servlet开发了一套针对中
电驴链接在线视频播放源码 dubinwei 源码电驴播放器视频 ed2k
本项目是个搜索电驴（ed2k）链接的应用,借助于磁力视频播放器（官网： http://loveandroid.duapp.com/ 开放平台），可以实现在线播放视频，也可以用迅雷或者其他下载工具下载。项目源码： http://git.oschina.net/svo/Emule,动态更新。也可从附件中下载。项目源码依赖于两个库项目，库项目一链接： http://git.oschina.
Javascript中函数的toString()方法周凡杨 JavaScript js toString function object
简述 The toString() method returns a string representing the source code of the function. 简译之，Javascript的toString()方法返回一个代表函数源代码的字符串。句法 function.
struts处理自定义异常 g21121 struts
很多时候我们会用到自定义异常来表示特定的错误情况，自定义异常比较简单，只要分清是运行时异常还是非运行时异常即可，运行时异常不需要捕获，继承自RuntimeException，是由容器自己抛出，例如空指针异常。非运行时异常继承自Exception，在抛出后需要捕获，例如文件未找到异常。此处我们用的是非运行时异常，首先定义一个异常LoginException: /** * 类描述：登录相
Linux中find常见用法示例 510888780 linux
Linux中find常见用法示例 ·find path -option [ -print ] [ -exec -ok command ] {} \; find命令的参数；
SpringMVC的各种参数绑定方式 Harry642 springMVC 绑定表单
1. 基本数据类型(以int为例，其他类似)： Controller代码： @RequestMapping("saysth.do") public void test(int count) { } 表单代码： <form action="saysth.do" method="post&q
Java 获取Oracle ROWID aijuans java oracle
A ROWID is an identification tag unique for each row of an Oracle Database table. The ROWID can be thought of as a virtual column, containing the ID for each row. The oracle.sql.ROWID class i
java获取方法的参数名 antlove java jdk parameter method reflect
reflect.ClassInformationUtil.java package reflect; import javassist.ClassPool; import javassist.CtClass; import javassist.CtMethod; import javassist.Modifier; import javassist.bytecode.CodeAtt
JAVA正则表达式匹配查找替换提取操作百合不是茶 java 正则表达式替换提取查找
正则表达式的查找;主要是用到String类中的split(); String str; str.split();方法中传入按照什么规则截取,返回一个String数组常见的截取规则: str.split("\\.")按照.来截取 str.
Java中equals()与hashCode()方法详解 bijian1013 java set equals()hashCode()
一.equals()方法详解 equals()方法在object类中定义如下： public boolean equals(Object obj) { return (this == obj); } 很明显是对两个对象的地址值进行的比较（即比较引用是否相同）。但是我们知道，String 、Math、I
精通Oracle10编程SQL(4)使用SQL语句 bijian1013 oracle 数据库 plsql
--工资级别表 create table SALGRADE ( GRADE NUMBER(10), LOSAL NUMBER(10,2), HISAL NUMBER(10,2) ) insert into SALGRADE values(1,0,100); insert into SALGRADE values(2,100,200); inser
【Nginx二】Nginx作为静态文件HTTP服务器 bit1129 HTTP服务器
Nginx作为静态文件HTTP服务器在本地系统中创建/data/www目录，存放html文件(包括index.html) 创建/data/images目录，存放imags图片在主配置文件中添加http指令 http { server { listen 80; server_name
kafka获得最新partition offset blackproof kafka partition offset 最新
kafka获得partition下标，需要用到kafka的simpleconsumer import java.util.ArrayList; import java.util.Collections; import java.util.Date; import java.util.HashMap; import java.util.List; import java.
centos 7安装docker两种方式 ronin47
第一种是采用yum 方式 yum install -y docker
java-60-在O(1)时间删除链表结点 bylijinnan java
public class DeleteNode_O1_Time { /** * Q 60 在O(1)时间删除链表结点 * 给定链表的头指针和一个结点指针(!!)，在O(1)时间删除该结点 * * Assume the list is: * head->...->nodeToDelete->mNode->nNode->..
nginx利用proxy_cache来缓存文件 cfyme cache
user zhangy users; worker_processes 10; error_log /var/vlogs/nginx_error.log crit; pid /var/vlogs/nginx.pid; #Specifies the value for ma
[JWFD开源工作流]JWFD嵌入式语法分析器负号的使用问题 comsci 嵌入式
假如我们需要用JWFD的语法分析模块定义一个带负号的方程式，直接在方程式之前添加负号是不正确的，而必须这样做： string str01 = "a=3.14;b=2.71;c=0;c-((a*a)+(b*b))" 定义一个0整数c,然后用这个整数c去
如何集成支付宝官方文档 dai_lm android
官方文档下载地址 https://b.alipay.com/order/productDetail.htm?productId=2012120700377310&tabId=4#ps-tabinfo-hash 集成的必要条件 1. 需要有自己的Server接收支付宝的消息 2. 需要先制作app，然后提交支付宝审核，通过后才能集成调试的时候估计会真的扣款，请注意
应该在什么时候使用Hadoop datamachine hadoop
原帖地址：http://blog.chinaunix.net/uid-301743-id-3925358.html 存档，某些观点与我不谋而合，过度技术化不可取，且hadoop并非万能。 --------------------------------------------万能的分割线-------------------------------- 有人问我，“你在大数据和Hado
在GridView中对于有外键的字段使用关联模型进行搜索和排序 dcj3sjt126com yii
在GridView中使用关联模型进行搜索和排序首先我们有两个模型它们直接有关联: class Author extends CActiveRecord { ... } class Post extends CActiveRecord { ... function relations() { return array( '
使用NSString 的格式化大全 dcj3sjt126com Objective-C
格式定义The format specifiers supported by the NSString formatting methods and CFString formatting functions follow the IEEE printf specification; the specifiers are summarized in Table 1. Note that you c
使用activeX插件对象object滚动有重影蕃薯耀 activeX插件滚动有重影
使用activeX插件对象object滚动有重影 <object style="width:0;" id="abc" classid="CLSID:D3E3970F-2927-9680-BBB4-5D0889909DF6" codebase="activex/OAX339.CAB#
SpringMVC4零配置 hanqunfeng springmvc4
基于Servlet3.0规范和SpringMVC4注解式配置方式，实现零xml配置，弄了个小demo，供交流讨论。项目说明如下： 1.db.sql是项目中用到的表，数据库使用的是oracle11g 2.该项目使用mvn进行管理，私服为自搭建nexus,项目只用到一个第三方 jar，就是oracle的驱动； 3.默认项目为零配置启动，如果需要更改启动方式，请
《开源框架那点事儿16》：缓存相关代码的演变 j2eetop 开源框架
问题引入上次我参与某个大型项目的优化工作，由于系统要求有比较高的TPS，因此就免不了要使用缓冲。该项目中用的缓冲比较多，有MemCache，有Redis，有的还需要提供二级缓冲，也就是说应用服务器这层也可以设置一些缓冲。当然去看相关实现代代码的时候，大致是下面的样子。 [java] view plain copy print ? public vo
AngularJS浅析 kvhur JavaScript
概念 AngularJS is a structural framework for dynamic web apps. 了解更多详情请见原文链接：http://www.gbtags.com/gb/share/5726.htm Directive 扩展html，给html添加声明语句，以便实现自己的需求。对于页面中html元素以ng为前缀的属性名称，ng是angular的命名空间
架构师之jdk的bug排查(一)---------------split的点号陷阱 nannan408 split
1.前言. jdk1.6的lang包的split方法是有bug的,它不能有效识别A.b.c这种类型,导致截取长度始终是0.而对于其他字符,则无此问题.不知道官方有没有修复这个bug. 2.代码 String[] paths = "object.object2.prop11".split("'"); System.ou
如何对10亿数据量级的mongoDB作高效的全表扫描 quentinXXZ mongodb
本文链接: http://quentinXXZ.iteye.com/blog/2149440 一、正常情况下，不应该有这种需求首先，大家应该有个概念，标题中的这个问题，在大多情况下是一个伪命题，不应该被提出来。要知道，对于一般较大数据量的数据库，全表查询，这种操作一般情况下是不应该出现的，在做正常查询的时候，如果是范围查询，你至少应该要加上limit。说一下，
C语言算法之水仙花数 qiufeihu c 算法
/** * 水仙花数 */ #include <stdio.h> #define N 10 int main() { int x,y,z; for(x=1;x<=N;x++) for(y=0;y<=N;y++) for(z=0;z<=N;z++) if(x*100+y*10+z == x*x*x
JSP指令 wyzuomumu jsp
jsp指令的一般语法格式： <%@ 指令名属性 =”值 ” %> 常用的三种指令： page,include,taglib page指令语法形式： <%@ page 属性 1=”值 1” 属性 2=”值 2”%> include指令语法形式： <%@include file=”relative url”%> (jsp可以通过 include