StrongSORT: Make DeepSORT Great Again (Translation)

Paper: https://arxiv.org/abs/2202.13514
Code: https://github.com/dyhBUPT/StrongSORT

Abstract

Existing Multi-Object Tracking (MOT) methods can be roughly classified as tracking-by-detection and joint-detection-association paradigms. Although the latter has elicited more attention and demonstrates comparable performance relative to the former, we claim that the tracking-by-detection paradigm is still the optimal solution in terms of tracking accuracy. In this paper, we revisit the classic tracker DeepSORT and upgrade it from various aspects, i.e., detection, embedding and association. The resulting tracker, called StrongSORT, sets new HOTA and IDF1 records on MOT17 and MOT20. We also present two lightweight and plug-and-play algorithms to further refine the tracking results. Firstly, an appearance-free link model (AFLink) is proposed to associate short tracklets into complete trajectories. To the best of our knowledge, this is the first global link model without appearance information. Secondly, we propose Gaussian-smoothed interpolation (GSI) to compensate for missing detections. Instead of ignoring motion information like linear interpolation, GSI is based on the Gaussian process regression algorithm and can achieve more accurate localizations. Moreover, AFLink and GSI can be plugged into various trackers with a negligible extra computational cost (591.9 and 140.9 Hz, respectively, on MOT17). By integrating StrongSORT with the two algorithms, the final tracker StrongSORT++ ranks first on MOT17 and MOT20 in terms of HOTA and IDF1 metrics and surpasses the second-place one by 1.3 - 2.2. Code will be released soon.

1 Introduction

Multi-Object Tracking (MOT) plays an essential role in video understanding. It aims to detect and track all specific classes of objects frame by frame. In the past few years, the tracking-by-detection paradigm [3, 4, 36, 62, 69] dominated the MOT task. It performs detection per frame and formulates the MOT problem as a data association task. Benefiting from high-performing object detection models, tracking-by-detection methods have gained favor due to their excellent performance.


Fig. 1. IDF1-MOTA-HOTA comparisons of state-of-the-art trackers with our proposed StrongSORT and StrongSORT++ on the MOT17 and MOT20 test sets. The horizontal axis is MOTA, the vertical axis is IDF1, and the radius of each circle is HOTA. "*" represents our reproduced version. Our StrongSORT++ achieves the best IDF1 and HOTA and comparable MOTA performance.

However, these methods generally require multiple computationally expensive components, such as a detector and an embedding model. To solve this problem, several recent methods [1, 60, 74] integrate the detector and embedding model into a unified framework. Moreover, joint detection and embedding training appears to produce better results compared with separate training [47]. Thus, these methods (joint trackers) achieve comparable or even better tracking accuracy compared with tracking-by-detection ones (separate trackers).

The success of joint trackers has motivated researchers to design unified tracking frameworks for various components, e.g., detection, motion, embedding, and association models [30, 32, 38, 57, 59, 65, 68]. However, we argue that two problems exist in these joint frameworks: (1) the competition between different components and (2) limited data for training these components jointly. Although several strategies have been proposed to solve them, these problems still lower the upper bound of tracking accuracy. On the contrary, the potential of separate trackers seems to be underestimated.

In this paper, we revisit the classic separate tracker DeepSORT [62], which is among the earliest methods that apply deep learning models to the MOT task. We claim that DeepSORT underperforms compared with state-of-the-art methods because of its outdated techniques, rather than its tracking paradigm. We show that by simply equipping DeepSORT with advanced components in various aspects, resulting in the proposed StrongSORT, it can achieve new SOTA on the popular benchmarks MOT17 [35] and MOT20 [11].

Two lightweight, plug-and-play, model-independent, appearance-free algorithms are also proposed to refine the tracking results. Firstly, to better exploit global information, several methods propose to associate short tracklets into trajectories by using a global link model [12, 39, 55, 56, 67]. They usually generate accurate but incomplete tracklets first and then associate them with global information in an offline manner. Although these methods improve tracking performance significantly, they all rely on computation-intensive models, especially appearance embeddings. By contrast, we propose an appearance-free link model (AFLink) that only utilizes spatio-temporal information to predict whether two input tracklets belong to the same ID.

Secondly, linear interpolation is widely used to compensate for missing detections [12, 21, 37, 40, 41, 73]. However, it ignores motion information, which limits the accuracy of the interpolated positions. To solve this problem, we propose the Gaussian-smoothed interpolation algorithm (GSI), which enhances the interpolation by using the Gaussian process regression algorithm [61].

Extensive experiments prove that the two proposed algorithms achieve notable improvements on StrongSORT and other state-of-the-art trackers, e.g., CenterTrack [77], TransTrack [50] and FairMOT [74]. Particularly, by applying AFLink and GSI to StrongSORT, we obtain a stronger tracker called StrongSORT++. It achieves 64.4 HOTA, 79.5 IDF1 and 79.6 MOTA (7.1 Hz) on the MOT17 test set and 62.6 HOTA, 77.0 IDF1 and 73.8 MOTA (1.4 Hz) on the MOT20 test set. Figure 1 compares our StrongSORT and StrongSORT++ with state-of-the-art trackers on the MOT17 and MOT20 test sets. Our method achieves the best IDF1 and HOTA and a comparable MOTA performance. Furthermore, AFLink and GSI respectively run at 591.9 and 140.9 Hz on MOT17, and 224.0 and 17.6 Hz on MOT20, resulting in a negligible computational cost.

The contributions of our work are summarized as follows:

  1. We revisit the classic separate tracker DeepSORT and improve it from various aspects, resulting in StrongSORT, which sets new HOTA and IDF1 records on the MOT17 and MOT20 datasets.

  2. We propose two lightweight and appearance-free algorithms, AFLink and GSI, which can be plugged into various trackers to improve their performance by a large margin.

  3. By integrating StrongSORT with AFLink and GSI, our StrongSORT++ ranks first on MOT17 and MOT20 in terms of the widely used HOTA and IDF1 metrics and surpasses the second-place one [73] by 1.3 - 2.2.

2 Related Work

2.1 Separate and Joint Trackers

MOT methods can be classified as separate and joint trackers. Separate trackers [3, 4, 7, 8, 15, 36, 62, 69] follow the tracking-by-detection paradigm: they localize targets first and then associate them using information on appearance, motion, etc. Benefiting from the rapid development of object detection [17, 42, 43, 52, 53, 78], separate trackers have dominated the MOT task for years. Recently, several joint trackers [30, 32, 38, 57, 59, 65, 68] have been proposed to train detection and some other components jointly, e.g., motion, embedding and association models. The main benefit of these trackers is their low computational cost and comparable performance. However, we claim that joint trackers face two major problems: competition between different components and limited data for training the components jointly. The two problems limit the upper bound of tracking accuracy. Therefore, we argue that the tracking-by-detection paradigm is still the optimal solution for tracking performance.

Meanwhile, several recent studies [48, 49, 73] have abandoned appearance information and relied only on high-performance detectors and motion information, achieving high running speed and state-of-the-art performance on the MOTChallenge benchmarks [11, 35]. However, we argue that this is partly due to the general simplicity of motion patterns in these datasets. Abandoning appearance features would lead to poor robustness in more complex scenes. In this paper, we adopt the DeepSORT-like [62] paradigm and equip it with advanced techniques from various aspects to confirm the effectiveness of this classic framework.

2.2 Global Link in MOT

To exploit rich global information, several methods refine the tracking results with a global link model [12, 39, 55, 56, 67]. They tend to first generate accurate but incomplete tracklets by using spatio-temporal and/or appearance information. Then, these tracklets are linked by exploring global information in an offline manner. TNT [56] designs a multi-scale TrackletNet to measure the connectivity between two tracklets. It encodes motion and appearance information in a unified network by using multi-scale convolution kernels. TPM [39] presents a tracklet-plane matching process to push easily confusable tracklets into different tracklet-planes, which helps reduce confusion in the tracklet matching step. ReMOT [67] is improved from ReMOTS [66]. Given any tracking results, ReMOT splits imperfect trajectories into tracklets and then merges them using appearance features. GIAOTracker [12] proposes a complex global link algorithm that encodes tracklet appearance features by using an improved ResNet50-TP model [16] and associates tracklets with additional spatial and temporal costs. Although these methods yield notable improvements, they all rely on appearance features, which bring high computational cost. Differently, we propose the AFLink model, which only exploits motion information to predict the link confidence between two tracklets. By designing an appropriate model framework and training process, AFLink benefits various state-of-the-art trackers at a negligible extra cost. To the best of our knowledge, this is the first appearance-free and lightweight global link model for the MOT task.

2.3 Interpolation in MOT

Linear interpolation is widely used to fill the gaps in recovered trajectories caused by missing detections [12, 21, 37, 40, 41, 73]. Despite its simplicity and effectiveness, linear interpolation ignores motion information, which limits the accuracy of the restored bounding boxes. To solve this problem, several strategies have been proposed to utilize spatio-temporal information effectively. V-IOUTracker [5] extends IOUTracker [4] by falling back to single-object tracking [20, 25] when missing detections occur. MAT [19] smooths the linearly interpolated trajectories nonlinearly by adopting a cyclic pseudo-observation trajectory filling strategy; an extra camera motion compensation (CMC) model [14] and a Kalman filter [26] are needed to predict missing positions. MAATrack [49] simplifies this by applying only the CMC model. All these methods apply extra models, i.e., a single-object tracker, CMC, or a Kalman filter, in exchange for performance gains. Instead, we propose to model nonlinear motion on the basis of the Gaussian process regression (GPR) algorithm [61]. Without additional time-consuming components, our proposed GSI algorithm achieves a good trade-off between accuracy and efficiency.


Fig. 2. Framework and performance comparison between DeepSORT and StrongSORT. Performance is evaluated on the MOT17 validation set based on detections predicted by YOLOX [17].

The work most similar to our GSI is [79], which uses the GPR algorithm to smooth uninterpolated tracklets for accurate velocity predictions. However, it targets the event detection task in surveillance videos. Differently, we study the MOT task and adopt GPR to refine the interpolated localizations. Moreover, we present an adaptive smoothness factor, instead of presetting a hyperparameter like [79].

3 StrongSORT

In this section, we present various approaches to improve the classic tracker DeepSORT [62]. Specifically, we review DeepSORT in Section 3.1 and introduce StrongSORT in Section 3.2. Notably, we do not claim any algorithmic novelty in this section. Instead, our contributions here lie in giving a clear understanding of DeepSORT and equipping it with various advanced techniques to prove the effectiveness of its paradigm.

3.1 Review of DeepSORT

We briefly summarize DeepSORT as a two-branch framework, that is, an appearance branch and a motion branch, as shown in the top half of Figure 2.

In the appearance branch, given detections in each frame, the deep appearance descriptor (a simple CNN), which is pretrained on the person re-identification dataset MARS [75], is applied to extract their appearance features. It utilizes a feature bank mechanism to store the features of the last 100 frames for each tracklet. As new detections come, the smallest cosine distance between the feature bank $R_i$ of the $i$-th tracklet and the feature $f_j$ of the $j$-th detection is computed as
$$d(i,j) = \min\{1 - f_j^{T} f_k^{(i)} \mid f_k^{(i)} \in R_i\} \quad (1)$$
The distance is used as the matching cost during the association procedure.
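
As a minimal sketch (not the authors' code), Eq. (1) amounts to taking the smallest cosine distance between a detection feature and every feature stored in the tracklet's bank:

```python
# Sketch of Eq. (1): appearance cost between tracklet i and detection j.
import numpy as np

def appearance_cost(feature_bank, det_feature):
    """feature_bank: (K, D) L2-normalized features of one tracklet (K <= 100).
    det_feature: (D,) L2-normalized feature of one detection."""
    cosine_sims = feature_bank @ det_feature  # f_k^T f_j for every stored f_k
    return float(np.min(1.0 - cosine_sims))   # d(i, j) in Eq. (1)
```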

In the motion branch, the Kalman filter algorithm [26] accounts for predicting the positions of tracklets in the current frame. Then, the Mahalanobis distance is used to measure the spatio-temporal dissimilarity between tracklets and detections. DeepSORT uses this motion distance as a gate to filter out unlikely associations.
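
A sketch of such a gate is shown below, assuming DeepSORT's convention of comparing the squared Mahalanobis distance against a chi-square quantile; the threshold 9.4877 (the 0.95 quantile for 4 degrees of freedom) follows the public DeepSORT code and is not stated in this paper:

```python
# Sketch of the motion gate between a Kalman-predicted track and a detection.
import numpy as np

GATING_THRESHOLD = 9.4877  # chi2.ppf(0.95, df=4), as in the DeepSORT code

def motion_gate(mean, covariance, detection):
    """mean: (4,) predicted measurement; covariance: (4, 4); detection: (4,)."""
    d = detection - mean
    m2 = d @ np.linalg.solve(covariance, d)  # squared Mahalanobis distance
    return m2 <= GATING_THRESHOLD            # False => association is rejected
```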

Afterwards, the matching cascade algorithm is proposed to solve the association task as a series of subproblems instead of a global assignment problem. The core idea is to give greater matching priority to more frequently seen objects. Each association subproblem is solved using the Hungarian algorithm [29].
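
Each such subproblem reduces to bipartite matching on a cost matrix; a minimal sketch using SciPy's Hungarian solver is shown below (the 0.45 cutoff mirrors the feature distance threshold reported in Section 5.2 and is otherwise an assumption):

```python
# Sketch of one association subproblem solved with the Hungarian algorithm.
from scipy.optimize import linear_sum_assignment

def associate(cost_matrix, max_cost=0.45):
    """cost_matrix[i, j]: matching cost between track i and detection j."""
    rows, cols = linear_sum_assignment(cost_matrix)  # Hungarian algorithm
    return [(i, j) for i, j in zip(rows, cols) if cost_matrix[i, j] <= max_cost]
```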

3.2 Stronger DeepSORT

Our improvements over DeepSORT lie mainly in the two branches, as shown in the bottom half of Figure 2. For the appearance branch, a stronger appearance feature extractor, BoT [34], is applied to replace the original simple CNN. By taking ResNeSt50 [71] as the backbone and pretraining on the DukeMTMC-reID [44] dataset, it can extract much more discriminative features. In addition, we replace the feature bank with the feature updating strategy proposed in [60], which updates the appearance state $e_i^t$ for the $i$-th tracklet at frame $t$ in an exponential moving average (EMA) manner as follows:
$$e_i^t = \alpha e_i^{t-1} + (1 - \alpha) f_i^t \quad (2)$$
where $f_i^t$ is the appearance embedding of the current matched detection and $\alpha = 0.9$ is a momentum term. The EMA updating strategy not only enhances the matching quality, but also reduces the time consumption.
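
A sketch of Eq. (2) follows; each tracklet then keeps a single running embedding instead of a 100-frame bank. The final re-normalization is our assumption for keeping cosine distances well defined, not a detail stated in the paper:

```python
# Sketch of the EMA appearance-state update of Eq. (2).
import numpy as np

def ema_update(state, det_feature, alpha=0.9):
    """state: (D,) e_i^{t-1}; det_feature: (D,) f_i^t of the matched detection."""
    new_state = alpha * state + (1.0 - alpha) * det_feature  # Eq. (2)
    return new_state / np.linalg.norm(new_state)             # re-normalize (assumption)
```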

For the motion branch, similar to [19, 27, 49], we adopt ECC [14] for camera motion compensation. Furthermore, the vanilla Kalman filter is vulnerable to low-quality detections [49] and ignores the information on the scales of detection noise. To solve this problem, we borrow the NSA Kalman algorithm from [12], which proposes a formula to adaptively calculate the noise covariance $\widetilde{R}_k$:
$$\widetilde{R}_k = (1 - c_k) R_k \quad (3)$$

where $R_k$ is the preset constant measurement noise covariance and $c_k$ is the detection confidence score at state $k$. Furthermore, instead of employing only the appearance feature distance during matching, we solve the assignment problem with both appearance and motion information, similar to [60]. The cost matrix $C$ is a weighted sum of the appearance cost $A_a$ and the motion cost $A_m$ as follows:
$$C = \lambda A_a + (1 - \lambda) A_m \quad (4)$$
where the weight factor $\lambda$ is set to 0.98. Another interesting finding is that although the matching cascade algorithm is not trivial in DeepSORT, it limits performance as the tracker becomes more powerful. The reason is that as the tracker becomes stronger, it becomes more robust to confusable associations. Therefore, the additional prior constraints would limit the matching accuracy. We replace matching cascade with vanilla global linear assignment.
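
As a minimal illustration (not the released implementation), Eqs. (3) and (4) reduce to a few lines:

```python
# Sketch of the NSA noise scaling (Eq. 3) and the fused cost matrix (Eq. 4).
import numpy as np

def nsa_noise(R, confidence):
    """R: (4, 4) preset measurement noise covariance; confidence: c_k in [0, 1]."""
    return (1.0 - confidence) * R                # Eq. (3): higher confidence => less noise

def fused_cost(A_app, A_motion, lam=0.98):
    """A_app, A_motion: (num_tracks, num_dets) appearance / motion cost matrices."""
    return lam * A_app + (1.0 - lam) * A_motion  # Eq. (4)
```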

4 StrongSORT++

We presented a strong tracker in Section 3. In this section, we introduce two lightweight, plug-and-play, model-independent, appearance-free algorithms, namely AFLink and GSI, to further refine the tracking results. We call the final method StrongSORT++, which integrates StrongSORT with the two algorithms.

4.1 AFLink

The global link for tracklets is used in several works to pursue highly accurate associations. However, these works generally rely on computationally expensive components and numerous hyperparameters to fine-tune. For example, the link algorithm in GIAOTracker [12] utilizes an improved ResNet50-TP [16] to extract 3D tracklet features and performs association with additional spatial and temporal distances. This means 6 hyperparameters (3 thresholds and 3 weight factors) have to be fine-tuned, which incurs additional tuning experiments and poor robustness. Moreover, we find that over-reliance on appearance features is vulnerable to noise. Motivated by this, we design an appearance-free model, AFLink, to predict the connectivity between two tracklets by relying only on spatio-temporal information.


Fig. 3. Framework of the AFLink model. It adopts the spatio-temporal information of two tracklets as the input and then predicts their connectivity.

Figure 3 shows the two-branch framework of the AFLink model. It adopts two tracklets $T_i$ and $T_j$ as the input, where $T_* = \{f_k, x_k, y_k\}_{k=1}^{N}$ consists of the frame indices $f_k$ and positions $(x_k, y_k)$ of the most recent $N = 30$ frames. Zero padding is used for tracklets shorter than 30 frames. A temporal module is applied to extract features by convolving along the temporal dimension with 7×1 kernels. Then, a fusion module performs 1×3 convolutions to integrate the information from the different feature dimensions, namely $f$, $x$ and $y$. The two resulting feature maps are pooled and squeezed to feature vectors respectively, and then concatenated, which yields rich spatio-temporal information. Finally, an MLP is used to predict a confidence score for association. Note that the temporal module and fusion module of the two branches are not tied (i.e., their weights are not shared). A sketch of this architecture is given below.
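
The following PyTorch sketch follows the details in this paper (7×1 temporal convolutions with {32, 64, 128, 256} channels as given in Section 5.2, a 1×3 fusion convolution, pooling, and a two-layer MLP). The layer ordering, pooling choice and hidden MLP width are assumptions, not the released code:

```python
# Hedged sketch of the AFLink two-branch architecture.
import torch
import torch.nn as nn

class Branch(nn.Module):
    def __init__(self):
        super().__init__()
        layers, in_ch = [], 1
        for out_ch in (32, 64, 128, 256):                 # temporal module
            layers += [nn.Conv2d(in_ch, out_ch, kernel_size=(7, 1)),
                       nn.BatchNorm2d(out_ch), nn.ReLU(inplace=True)]
            in_ch = out_ch
        self.temporal = nn.Sequential(*layers)
        self.fusion = nn.Sequential(                      # fuse the f, x, y dimensions
            nn.Conv2d(256, 256, kernel_size=(1, 3)),
            nn.BatchNorm2d(256), nn.ReLU(inplace=True))
        self.pool = nn.AdaptiveAvgPool2d(1)

    def forward(self, track):                             # track: (B, 1, 30, 3)
        feat = self.fusion(self.temporal(track))
        return self.pool(feat).flatten(1)                 # (B, 256)

class AFLink(nn.Module):
    def __init__(self):
        super().__init__()
        self.branch_i, self.branch_j = Branch(), Branch() # weights not shared
        self.mlp = nn.Sequential(nn.Linear(512, 256), nn.ReLU(inplace=True),
                                 nn.Linear(256, 2))       # connect / not connect

    def forward(self, track_i, track_j):
        z = torch.cat([self.branch_i(track_i), self.branch_j(track_j)], dim=1)
        return self.mlp(z)                                # connectivity logits
```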

During association, we first filter out unreasonable tracklet pairs using spatio-temporal constraints. Then, the global link is solved as a linear assignment task [29] with the predicted connectivity scores.

4.2 GSI

Interpolation is widely used to fill the gaps in trajectories caused by missing detections. Linear interpolation is highly popular due to its simplicity. However, its accuracy is limited because it does not use motion information. Although several strategies have been proposed to solve this problem, they generally introduce additional time-consuming modules, e.g., a single-object tracker, a Kalman filter, or ECC. Differently, we present a lightweight interpolation algorithm that employs Gaussian process regression [61] to model nonlinear motion.

We formulate the GSI model for the $i$-th trajectory as follows:
$$p_t = f^{(i)}(t) + \epsilon \quad (5)$$
where $t \in F$ is the frame, $p_t \in P$ is the position coordinate variate at frame $t$ (i.e., $x, y, w, h$) and $\epsilon \sim N(0, \sigma^2)$ is Gaussian noise. Given a tracked and linearly interpolated trajectory $S^{(i)} = \{t^{(i)}, p_t^{(i)}\}_{t=1}^{L}$ with length $L$, the task of nonlinear motion modeling is solved by fitting the function $f^{(i)}$. We assume that it obeys a Gaussian process $f^{(i)} \in GP(0, k(\cdot,\cdot))$, where $k(x, x') = \exp(-\frac{\|x - x'\|^2}{2\lambda^2})$ is a radial basis function kernel. On the basis of the properties of the Gaussian process, given a new frame set $F^*$, its smoothed positions $P^*$ are predicted by
$$P^* = K(F^*, F)\left(K(F, F) + \sigma^2 I\right)^{-1} P \quad (6)$$

where $K(\cdot,\cdot)$ is a covariance function based on $k(\cdot,\cdot)$. Moreover, the hyperparameter $\lambda$ controls the smoothness of the trajectory and should thus be related to its length. We simply design it as a function adaptive to the length $l$ as follows:
$$\lambda = \tau \cdot \log(\tau^{3} / l) \quad (7)$$
where $\tau$ is set to 10.


Fig. 4. Illustration of the difference between linear interpolation (LI) and the proposed Gaussian-smoothed interpolation (GSI).

Figure 4 illustrates the difference between GSI and linear interpolation (LI). The raw tracked results (in orange) generally include noisy jitter, and LI (in blue) ignores motion information. Our GSI (in red) solves both problems simultaneously by smoothing the entire trajectory with an adaptive smoothness factor.

5 Experiments

5.1 Datasets and Evaluation Metrics

Datasets. We conduct experiments on the MOT17 [35] and MOT20 [11] datasets under the "private detection" protocol. MOT17 is a popular dataset for MOT, which consists of 7 sequences and 5,316 frames for training and 7 sequences and 5,919 frames for testing. MOT20 targets highly crowded challenging scenes, with 4 sequences and 8,931 frames for training and 4 sequences and 4,479 frames for testing. For ablation studies, we take the first half of each sequence in the MOT17 training set for training and the last half for validation, following [73, 77]. We use DukeMTMC [44] to pretrain our appearance feature extractor. We train the detector on the CrowdHuman dataset [46] and the MOT17 half training set for ablation, following [50, 63, 70, 73, 77]. We add Cityperson [72] and ETHZ [13] for testing as in [30, 60, 73, 74].

Metrics. We use the metrics MOTA, IDs, IDF1, HOTA, AssA, DetA and FPS to evaluate tracking performance [2, 33, 44]. MOTA is computed based on FP, FN and IDs, and focuses more on detection performance. By comparison, IDF1 better measures the consistency of ID matching [23]. HOTA is an explicit combination of the detection score DetA and the association score AssA, which balances the effects of performing accurate detection and association in a single unified metric. Moreover, it evaluates at a number of distinct detection similarity values (0.05 to 0.95 in 0.05 intervals) between predicted and GT bounding boxes, instead of setting a single value (i.e., 0.5) like MOTA and IDF1, and thus better takes localization accuracy into account.
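
For reference, the standard MOTA definition from the MOTChallenge literature (not restated in this paper) makes this detection bias explicit:

$$\text{MOTA} = 1 - \frac{\text{FN} + \text{FP} + \text{IDs}}{\text{GT}}$$

where FN, FP, IDs and GT are accumulated over all frames; since FN and FP typically far outnumber identity switches, MOTA is dominated by detection quality.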

5.2 Implementation Details

For detection, we adopt YOLOX-X [17] pretrained on COCO [31] as our detector for an improved time-accuracy trade-off. The training schedule is similar to that in [73]. In inference, a threshold of 0.8 is set for non-maximum suppression (NMS) and a threshold of 0.6 for detection confidence. For StrongSORT, the feature distance threshold is 0.45, the warp mode for ECC is MOTION_EUCLIDEAN, the momentum term $\alpha$ in EMA is 0.9 and the weight factor for the appearance cost $\lambda$ is 0.98. For GSI, the maximum gap allowed for interpolation is 20 frames, and the hyperparameter $\tau$ is 10.
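
A hedged sketch of ECC-based camera motion compensation with OpenCV, using the MOTION_EUCLIDEAN warp mode mentioned above, is shown below; the termination criteria and Gaussian filter size are our assumptions, not the paper's settings:

```python
# Sketch of ECC camera motion compensation between two grayscale frames.
import cv2
import numpy as np

def ecc_warp(prev_gray, curr_gray):
    """Estimate a 2x3 Euclidean warp from the previous frame to the current one."""
    warp = np.eye(2, 3, dtype=np.float32)
    criteria = (cv2.TERM_CRITERIA_EPS | cv2.TERM_CRITERIA_COUNT, 100, 1e-5)
    _, warp = cv2.findTransformECC(prev_gray, curr_gray, warp,
                                   cv2.MOTION_EUCLIDEAN, criteria, None, 5)
    return warp  # applied to tracklet states to compensate camera motion
```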

For AFLink, the temporal module consists of four convolution layers with 7×1 kernels and {32, 64, 128, 256} output channels. Each convolution is followed by a BN layer [24] and a ReLU activation layer [18]. The fusion module includes a 1×3 convolution, a BN layer and a ReLU; it does not change the number of channels. The classifier is an MLP with two fully connected layers and a ReLU layer inserted in between. The training data are generated by cutting annotated trajectories into tracklets with random spatio-temporal noise, at a 1:3 ratio of positive and negative samples. We use Adam as the optimizer [28] and cross-entropy loss as the objective function, and train for 20 epochs with a cosine annealing learning rate schedule. The overall training process takes just over 10 seconds. In inference, a temporal distance threshold of 30 frames and a spatial distance threshold of 75 pixels are used to filter out unreasonable association pairs. Finally, an association is accepted only if its prediction score is larger than 0.95.
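
Putting the inference-time pieces together, a sketch of the global link step follows; the tracklet bookkeeping is simplified and is our assumption, not the released implementation:

```python
# Sketch of AFLink inference: spatio-temporal filtering + linear assignment.
import numpy as np
from scipy.optimize import linear_sum_assignment

def link_tracklets(ends, starts, score_fn, t_max=30, s_max=75, s_min=0.95):
    """ends[i] / starts[j]: (frame, x, y) of tracklet i's tail / tracklet j's head.
    score_fn(i, j): AFLink's predicted connectivity score in [0, 1]."""
    n, m = len(ends), len(starts)
    cost = np.ones((n, m))                     # cost = 1 - score; 1.0 => forbidden
    for i, (fe, xe, ye) in enumerate(ends):
        for j, (fs, xs, ys) in enumerate(starts):
            dt = fs - fe
            ds = np.hypot(xs - xe, ys - ye)
            if 0 < dt < t_max and ds < s_max:  # spatio-temporal filtering
                cost[i, j] = 1.0 - score_fn(i, j)
    rows, cols = linear_sum_assignment(cost)
    return [(i, j) for i, j in zip(rows, cols) if cost[i, j] < 1.0 - s_min]
```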

5.3 Ablation Studies

[Table 1: the path from DeepSORT to StrongSORT]

Ablation study for StrongSORT. Table 1 summarizes the path from DeepSORT to StrongSORT:

  1. BoT: Replacing the original feature extractor with BoT leads to a significant improvement in IDF1, indicating that association quality benefits from more discriminative appearance features.

  2. ECC: The CMC model results in a slight increase in IDF1 and MOTA, implying that it helps extract more precise motion information.

  3. NSA: The NSA Kalman filter improves HOTA but not MOTA and IDF1. This means it enhances positioning accuracy.

  4. EMA: The EMA feature updating mechanism brings not only superior association, but also faster speed.

  5. MC: Matching with both appearance and motion costs aids association.

  6. woC: For the stronger tracker, the matching cascade algorithm with its redundant prior information limits the tracking accuracy. By simply employing a vanilla matching method, IDF1 is improved by a large margin.

Ablation study for AFLink and GSI. We apply AFLink and GSI to six different trackers, i.e., three versions of StrongSORT and three state-of-the-art trackers (CenterTrack [77], TransTrack [50] and FairMOT [74]). Their results are shown in Table 2. The first line of the results for each tracker is the original performance. The application of AFLink (the second line) brings different levels of improvement to the different trackers. Specifically, poorer trackers tend to benefit more from AFLink due to more missing associations; in particular, the IDF1 of CenterTrack is improved by 3.7. The third line of the results for each tracker proves the effectiveness of GSI for both detection and association. Different from AFLink, GSI works better on stronger trackers; it would be confused by the large number of false associations in poor trackers. Table 3 compares our GSI with LI. The results show that GSI yields better performance at a little extra computational cost.

[Table 2: AFLink and GSI applied to six trackers. Table 3: comparison between GSI and linear interpolation]

5.4 MOTChallenge Results

We compare StrongSORT, StrongSORT+ (StrongSORT + AFLink) and StrongSORT++ (StrongSORT + AFLink + GSI) with state-of-the-art trackers on the test sets of MOT17 and MOT20, as shown in Tables 4 and 5, respectively. Notably, comparing FPS with absolute fairness is difficult because the speed claimed by each method depends on the devices where it is implemented, and the time spent on detections is generally excluded for tracking-by-detection trackers.

MOT17. StrongSORT++ ranks first among all published methods on MOT17 for the metrics HOTA, IDF1, AssA and DetA, and ranks second for MOTA and IDs. In particular, it yields accurate associations and outperforms the second-best tracker by a large margin (i.e., +2.2 IDF1 and +2.4 AssA). We use the same hyperparameters as in the ablation study and do not carefully tune them for each sequence like [73]. The steady improvements on the test set prove the robustness of our methods. It is worth noting that our reproduced version of DeepSORT (with a stronger detector and several tuned hyperparameters) also performs well on the benchmark, which demonstrates the effectiveness of the DeepSORT-like tracking paradigm.

[Table 4: results on the MOT17 test set. Table 5: results on the MOT20 test set]

MOT20. MOT20 covers more crowded scenarios. High occlusion means a high risk of missing detections and associations. StrongSORT++ still ranks first for the metrics HOTA, IDF1 and AssA, and it achieves significantly fewer IDs than the other trackers. Note that we use exactly the same hyperparameters as on MOT17, which implies the generalization capability of our method. Its detection performance (MOTA and DetA) is slightly poor compared with that of several trackers. We think this is because we use the same detection score threshold as on MOT17, which results in many missing detections. Specifically, the metric FN (number of false negatives) of our StrongSORT++ is 117,920, whereas that of ByteTrack [73] is only 87,594.

Qualitative Results. Figure 5 visualizes several tracking results of StrongSORT++ on the test sets of MOT17 and MOT20. The results on MOT17-01 show the effectiveness of our method in normal scenarios. From the results on MOT17-08, we can see correct associations after occlusion. The results on MOT17-14 prove that our method works well while the camera is moving. Moreover, the results on MOT20-04 show the excellent performance of StrongSORT++ in scenarios with severe occlusion.

5.5 Limitations

StrongSORT and StrongSORT++ still have several limitations. The main concern is their relatively low running speed compared with joint trackers and several appearance-free separate trackers. Further research on improving computational efficiency is necessary. Moreover, although our method ranks first in the metrics IDF1 and HOTA, it has a slightly lower MOTA, which is mainly caused by the many missing detections resulting from the high detection score threshold. We believe an elaborate threshold strategy or association algorithm would help. As for AFLink, although it performs well in restoring missing associations, it is helpless against false association problems. Specifically, AFLink cannot split ID-mixed-up trajectories into accurate tracklets. Future work is needed to develop stronger and more flexible global link strategies.

6 Conclusion

In this paper, we revisit the classic tracker DeepSORT and improve it in various aspects. The resulting StrongSORT achieves new SOTA on the MOT17 and MOT20 benchmarks and demonstrates the effectiveness of the DeepSORT-like paradigm. We also propose two lightweight and appearance-free algorithms to further refine the tracking results. Experiments show that they can be applied to, and benefit, various state-of-the-art trackers at a negligible extra computational cost. Our final method, StrongSORT++, ranks first on MOT17 and MOT20 in terms of the HOTA and IDF1 metrics and surpasses the second-place one by 1.3 - 2.2. Notably, our method runs relatively slowly compared with joint trackers. In the future, we will investigate further for an improved time-accuracy trade-off.
