【Paper】LSTM-FCN: LSTM Fully Convolutional Networks for Time Series Classification

Paper year: 2017
Citations: 211 (as of 04/26/20)
Original paper: click here
Source code: click here


Contents

  • LSTM Fully Convolutional Networks for Time Series Classification
  • I. INTRODUCTION
  • II. BACKGROUND WORKS
    • A. TEMPORAL CONVOLUTIONS
    • B. RECURRENT NEURAL NETWORKS
    • C. LONG SHORT-TERM MEMORY RNNs
    • D. ATTENTION MECHANISM
  • III. LSTM FULLY CONVOLUTIONAL NETWORK
    • A. NETWORK ARCHITECTURE
    • B. NETWORK INPUT
    • C. REFINEMENT OF MODELS
  • IV. EXPERIMENTS
    • A. EVALUATION METRICS
    • B. RESULTS
  • V. CONCLUSION & FUTURE WORK


LSTM Fully Convolutional Networks for Time Series Classification

Fully convolutional neural networks (FCNs) have been shown to achieve the state-of-the-art performance on the task of classifying time series sequences. We propose the augmentation of fully convolutional networks with long short term memory recurrent neural network (LSTM RNN) sub-modules for time series classification. Our proposed models significantly enhance the performance of fully convolutional networks with a nominal increase in model size and require minimal preprocessing of the data set. The proposed long short term memory fully convolutional network (LSTM-FCN) achieves state-of-the-art performance compared with others. We also explore the usage of the attention mechanism to improve time series classification with the attention long short term memory fully convolutional network (ALSTM-FCN). The attention mechanism allows one to visualize the decision process of the LSTM cell. Furthermore, we propose refinement as a method to enhance the performance of trained models. An overall analysis of the performance of our model is provided and compared with other techniques.

INDEX TERMS Convolutional neural network, long short term memory recurrent neural network, time series classification.




I. INTRODUCTION

Over the past decade, there has been an increased interest in time series classification. Time series data is ubiquitous [1], existing in weather readings [2], financial recordings [3], industrial observations [4], and psychological signals [5], [6]. Several approaches, including feature-based [7], ensembles [8]–[10], and deep learning [11], [12], have been utilized to classify time series. Deep learning has been successfully utilized in various applications that require time series data, especially in control systems [13], [14]. In this paper, two deep learning models to classify time series datasets are proposed, both of which outperform existing state-of-the-art models and do not require heavy preprocessing.


A plethora of research has been done using feature-based approaches or methods to extract a set of features that represent time series patterns. Bag-of-Words (BoW) [15], Bag-of-features (TSBF) [16], Bag-of-SFA-Symbols (BOSS) [17], BOSSVS [18], and Word ExtrAction for time Series cLassification (WEASEL) [19] have obtained promising results in the field. Bag-of-Words quantizes the extracted features and feeds the BoW into a classifier. TSBF extracts multiple subsequences of random local information, which a supervised learner condenses into a codebook used to predict time series labels. BOSS introduces a combination of a distance-based classifier and histograms. The histograms represent substructures of a time series that are created using a symbolic Fourier approximation. BOSSVS extends this method by proposing a vector space model to reduce time complexity while maintaining performance. WEASEL converts time series into feature vectors using a sliding window. Machine learning algorithms utilize these feature vectors to detect and classify the time series. All these classifiers require heavy feature extraction and feature engineering. Using several of these feature-based algorithms together as an ensemble yields better results.


Ensemble algorithms also yield state-of-the-art performance on time series classification problems. Three of the most successful ensemble algorithms that integrate various features of a time series are the Proportional Elastic Ensemble (PROP) [20], a model that integrates 11 time series classifiers using a weighted ensemble method; the shapelet ensemble (SE) [8], a model that applies a heterogeneous ensemble onto transformed shapelets; and a flat collective of transform-based ensembles (COTE) [8], a model that fuses 35 various classifiers into a single classifier.


Recently, deep neural networks have been employed for time series classification tasks. Multi-scale convolutional neural network (MCNN) [12], fully convolutional network (FCN) [11], and residual network (ResNet) [11] are deep learning approaches that take advantage of convolutional neural networks (CNN) for end-to-end classification of univariate time series. MCNN uses down-sampling, skip sampling and sliding windows to preprocess the data. The performance of the MCNN classifier is highly dependent on the preprocessing applied to the dataset and the tuning of a large set of hyperparameters of that model. On the other hand, FCN and ResNet do not require any heavy preprocessing on the data or feature engineering.


In this paper, we improve the performance of FCN by augmenting the FCN module with either a Long Short Term Memory Recurrent Neural Network (LSTM RNN) sub-module, called LSTM-FCN, or an LSTM RNN with attention, called ALSTM-FCN. In addition, the Attention LSTM can also be used to detect regions of the input sequence that contribute to the class label through the context vector of the Attention LSTM cells. Results indicate the new proposed models, LSTM-FCN and ALSTM-FCN, dramatically improve performance on the University of California Riverside (UCR) Benchmark datasets [21]. LSTM-FCN and ALSTM-FCN produce better results than several state-of-the-art algorithms on a majority of the UCR Benchmark datasets.


This paper proposes two deep learning models for end-to-end time series classification. The proposed models do not require heavy preprocessing on the data or feature engineering. Both models are tested on all 85 UCR time series benchmarks and outperform most of the state-of-the-art models. The remainder of the paper is organized as follows. Section II reviews the background work. Section III presents the architecture of the proposed models. Section IV analyzes and discusses the experiments performed. Finally, conclusions are drawn in Section V.



II. BACKGROUND WORKS

A. TEMPORAL CONVOLUTIONS

The input to a Temporal Convolutional Network is generally a time series signal. As stated in Lea et al. [22], let $X_t \in \mathbb{R}^{F_0}$ be the input feature vector of length $F_0$ for time step $t$, for $0 < t \le T$. Note that the time $T$ may vary for each sequence, and we denote the number of time steps in each layer as $T_l$. The true action label for each frame is given by $y_t \in \{1, \dots, C\}$, where $C$ is the number of classes.


Consider $L$ convolutional layers. We apply a set of 1D filters on each of these layers that capture how the input signals evolve over the course of an action. According to Lea et al. [22], the filters for each layer are parameterized by a tensor $W^{(l)} \in \mathbb{R}^{F_l \times d \times F_{l-1}}$ and biases $b^{(l)} \in \mathbb{R}^{F_l}$, where $l \in \{1, \dots, L\}$ is the layer index and $d$ is the filter duration. For the $l$-th layer, the $i$-th component of the (unnormalized) activation $\hat{E}^{(l)}_t \in \mathbb{R}^{F_l}$ is a function of the incoming (normalized) activation matrix $E^{(l-1)} \in \mathbb{R}^{F_{l-1} \times T_{l-1}}$ from the previous layer
$$\hat{E}^{(l)}_{i,t} = f\Bigl(b^{(l)}_i + \sum_{t'=1}^{d} \bigl\langle W^{(l)}_{i,\cdot,t'},\, E^{(l-1)}_{\cdot,\, t+d-t'} \bigr\rangle\Bigr)$$
for each time $t$, where $f(\cdot)$ is a Rectified Linear Unit.


We use Temporal Convolutional Networks as a feature extraction module in a Fully Convolutional Network (FCN) branch. A basic convolution block consists of a convolution layer, followed by batch normalization [23], followed by an activation function, which can be either a Rectified Linear Unit or a Parametric Rectified Linear Unit [24].

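To make this concrete, the following is a minimal Keras sketch of one such convolution block (Conv1D, batch normalization, ReLU activation). The filter count and kernel size here are illustrative assumptions, not values quoted from the paper.

```python
# A minimal sketch of one temporal convolution block: Conv1D -> BatchNorm -> ReLU.
# Filter count and kernel size are illustrative assumptions, not values from the paper.
from tensorflow.keras.layers import Input, Conv1D, BatchNormalization, Activation

def conv_block(x, filters=128, kernel_size=8):
    x = Conv1D(filters, kernel_size, padding='same')(x)
    x = BatchNormalization()(x)
    x = Activation('relu')(x)
    return x

# Example: apply the block to a univariate series of 100 time steps.
inp = Input(shape=(100, 1))   # (time steps, variables)
out = conv_block(inp)
```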

B. RECURRENT NEURAL NETWORKS

Recurrent Neural Networks, often shortened to RNNs, are a class of neural networks which exhibit temporal behaviour due to directed connections between units of an individual layer. As reported by Pascanu et al. [25], recurrent neural networks maintain a hidden vector $h$, which is updated at time step $t$ as follows:

$$h_t = \tanh(W h_{t-1} + I x_t)$$

$\tanh$ is the hyperbolic tangent function, $W$ is the recurrent weight matrix and $I$ is a projection matrix. The hidden state $h$ is used to make a prediction:

$$y_t = \text{softmax}(W h_t)$$

softmax provides a normalized probability distribution over the possible classes, $\sigma$ is the logistic sigmoid function and $W$ is a weight matrix. By using $h$ as the input to another RNN, we can stack RNNs, creating deeper architectures:

$$h^l_t = \tanh\bigl(W^l h^l_{t-1} + I^l h^{l-1}_t\bigr)$$
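As a concrete trace of the recurrence above, here is a small NumPy sketch of the vanilla RNN update and the softmax prediction; all dimensions and weights are arbitrary placeholders.

```python
# NumPy sketch of h_t = tanh(W h_{t-1} + I x_t) and y_t = softmax(W_y h_t).
# Sizes and random weights are placeholders for illustration only.
import numpy as np

hidden, feats, classes = 8, 3, 2
rng = np.random.default_rng(0)
W  = rng.standard_normal((hidden, hidden))   # recurrent weight matrix
I  = rng.standard_normal((hidden, feats))    # projection matrix
Wy = rng.standard_normal((classes, hidden))  # output weight matrix

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

h = np.zeros(hidden)
for x_t in rng.standard_normal((5, feats)):  # a toy sequence of 5 time steps
    h = np.tanh(W @ h + I @ x_t)             # hidden state update
y = softmax(Wy @ h)                          # class probabilities at the final step
```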


C. LONG SHORT-TERM MEMORY RNNs

Long short-term memory recurrent neural networks are an improvement over general recurrent neural networks, which possess a vanishing gradient problem. As stated in Hochreiter and Schmidhuber [26], LSTM RNNs address the vanishing gradient problem commonly found in ordinary recurrent neural networks by incorporating gating functions into their state dynamics. At each time step, an LSTM maintains a hidden vector $h$ and a memory vector $m$ responsible for controlling state updates and outputs. More concretely, Graves et al. [27] define the computation at time step $t$ as follows:

$$\begin{aligned}
g_u &= \sigma(W_u h_{t-1} + I_u x_t) \\
g_f &= \sigma(W_f h_{t-1} + I_f x_t) \\
g_o &= \sigma(W_o h_{t-1} + I_o x_t) \\
g_c &= \tanh(W_c h_{t-1} + I_c x_t) \\
m_t &= g_f \odot m_{t-1} + g_u \odot g_c \\
h_t &= \tanh(g_o \odot m_t)
\end{aligned}$$

where $\sigma$ is the logistic sigmoid function, $\odot$ represents element-wise multiplication, $W_u, W_f, W_o, W_c$ are recurrent weight matrices and $I_u, I_f, I_o, I_c$ are projection matrices.

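The gating equations above can be traced step by step with a small NumPy sketch of a single LSTM step; the matrix sizes and random initial values are illustrative only.

```python
# NumPy sketch of one LSTM step: update/forget/output gates, candidate cell,
# memory vector m_t and hidden vector h_t, following the equations above.
import numpy as np

hidden, feats = 8, 3
rng = np.random.default_rng(0)
W = {g: rng.standard_normal((hidden, hidden)) for g in 'ufoc'}  # recurrent weights W_u..W_c
I = {g: rng.standard_normal((hidden, feats))  for g in 'ufoc'}  # projection matrices I_u..I_c

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(h_prev, m_prev, x_t):
    g_u = sigmoid(W['u'] @ h_prev + I['u'] @ x_t)   # update (input) gate
    g_f = sigmoid(W['f'] @ h_prev + I['f'] @ x_t)   # forget gate
    g_o = sigmoid(W['o'] @ h_prev + I['o'] @ x_t)   # output gate
    g_c = np.tanh(W['c'] @ h_prev + I['c'] @ x_t)   # candidate cell content
    m_t = g_f * m_prev + g_u * g_c                  # memory vector update
    h_t = np.tanh(g_o * m_t)                        # hidden vector
    return h_t, m_t

h = m = np.zeros(hidden)
for x_t in rng.standard_normal((5, feats)):         # a toy 5-step sequence
    h, m = lstm_step(h, m, x_t)
```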

While LSTMs possess the ability to learn temporal dependencies in sequences, they have difficulty with long term dependencies in long sequences. The attention mechanism proposed by Bahdanau et al. [28] can help the LSTM RNN learn these dependencies.


FIGURE 1. The LSTM-FCN architecture. LSTM cells can be replaced by Attention LSTM cells to construct the ALSTM-FCN architecture.


D. ATTENTION MECHANISM

The attention mechanism is a technique often used in neural translation of text, where a context vector $C$ is conditioned on the target sequence $y$. As discussed in Bahdanau et al. [28], the context vector $c_i$ depends on a sequence of annotations $(h_1, \dots, h_{T_x})$ to which an encoder maps the input sequence. Each annotation $h_i$ contains information about the whole input sequence with a strong focus on the parts surrounding the $i$-th word of the input sequence. The context vector $c_i$ is then computed as a weighted sum of these annotations $h_i$:

$$c_i = \sum_{j=1}^{T_x} \alpha_{ij} h_j$$

The weight $\alpha_{ij}$ of each annotation $h_j$ is computed by a softmax over the alignment scores:

$$\alpha_{ij} = \frac{\exp(e_{ij})}{\sum_{k=1}^{T_x} \exp(e_{ik})}$$

where $e_{ij} = a(s_{i-1}, h_j)$ is an alignment model, which scores how well the input around position $j$ and the output at position $i$ match. The score is based on the RNN hidden state $s_{i-1}$ and the $j$-th annotation $h_j$ of the input sentence.


Bahdanau et al. [28] parametrize the alignment model $a$ as a feedforward neural network which is jointly trained with all the other components of the model. The alignment model directly computes a soft alignment, which allows the gradient of the cost function to be backpropagated.

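The following NumPy sketch traces the attention computation: alignment scores $e_{ij}$ are turned into weights $\alpha_{ij}$ by a softmax over $j$, and the context vector $c_i$ is the weighted sum of the annotations. The additive form of the alignment model is an assumption in the spirit of Bahdanau et al. [28]; the sizes are arbitrary.

```python
# NumPy sketch of the attention weights and context vector for one decoding step i.
# The additive alignment model (v_a, W_a, U_a) is an illustrative assumption.
import numpy as np

rng = np.random.default_rng(0)
T_x, ann_dim, dec_dim, att_dim = 6, 8, 8, 10
H = rng.standard_normal((T_x, ann_dim))        # annotations h_1 .. h_Tx
s_prev = rng.standard_normal(dec_dim)          # previous decoder state s_{i-1}
W_a = rng.standard_normal((att_dim, dec_dim))  # alignment model parameters
U_a = rng.standard_normal((att_dim, ann_dim))
v_a = rng.standard_normal(att_dim)

e = np.array([v_a @ np.tanh(W_a @ s_prev + U_a @ h_j) for h_j in H])  # scores e_ij
alpha = np.exp(e - e.max())
alpha /= alpha.sum()                                                   # softmax over j
c_i = alpha @ H                                                        # context vector
```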


III. LSTM FULLY CONVOLUTIONAL NETWORK

FIGURE 2. Visualization of context vector on CBF dataset.


A. NETWORK ARCHITECTURE

Temporal convolutions have proven to be an effective learning model for time series classification problems [11]. Fully Convolutional Networks, comprised of temporal convolutions, are typically used as feature extractors. Global average pooling [29] is used to reduce the number of parameters in the model prior to classification. In the proposed models, the fully convolutional block is augmented by an LSTM block followed by dropout [30], as shown in Figure 1.


The fully convolutional block consists of three stacked temporal convolutional blocks with filter sizes of 128, 256, and 128 respectively. Each convolutional block is identical to the convolution block in the CNN architecture proposed by Wang et al. [11]. Each block consists of a temporal convolutional layer, which is accompanied by batch normalization [23] (momentum of 0.99, epsilon of 0.001) and followed by a ReLU activation function. Finally, global average pooling is applied after the final convolution block.


Simultaneously, the time series input is conveyed into a dimension shuffle layer (explained further in Section III-B). The transformed time series from the dimension shuffle is then passed into the LSTM block. The LSTM block, comprising either a general LSTM layer or an Attention LSTM layer, is followed by dropout. The output of the global pooling layer and the LSTM block is concatenated and passed onto a softmax classification layer.

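Putting the pieces of Figure 1 together, the following is a sketch of the LSTM-FCN in Keras (the library used in Section IV). The three convolution blocks with 128, 256 and 128 filters, the batch-normalization settings, the global average pooling, the dimension shuffle, the LSTM block with dropout, and the concatenation into a softmax layer follow the description above; the kernel sizes and the number of LSTM cells are illustrative assumptions.

```python
# Sketch of the LSTM-FCN architecture described above, assuming the Keras functional API.
# Kernel sizes (8, 5, 3) and the LSTM cell count are illustrative assumptions.
from tensorflow.keras.layers import (Input, Conv1D, BatchNormalization, Activation,
                                     GlobalAveragePooling1D, Permute, LSTM, Dropout,
                                     concatenate, Dense)
from tensorflow.keras.models import Model

def lstm_fcn(n_timesteps, n_classes, n_lstm_cells=64):
    inp = Input(shape=(n_timesteps, 1))            # univariate series with N time steps

    # FCN branch: three temporal convolution blocks, then global average pooling.
    x = inp
    for filters, kernel in zip((128, 256, 128), (8, 5, 3)):
        x = Conv1D(filters, kernel, padding='same')(x)
        x = BatchNormalization(momentum=0.99, epsilon=0.001)(x)
        x = Activation('relu')(x)
    x = GlobalAveragePooling1D()(x)

    # LSTM branch: dimension shuffle turns (N, 1) into (1, N), i.e. a multivariate
    # series with a single time step, followed by an LSTM layer and dropout.
    y = Permute((2, 1))(inp)
    y = LSTM(n_lstm_cells)(y)
    y = Dropout(0.8)(y)

    out = Dense(n_classes, activation='softmax')(concatenate([x, y]))
    return Model(inp, out)

model = lstm_fcn(n_timesteps=100, n_classes=3)
model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])
```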


B. NETWORK INPUT

The fully convolutional block and LSTM block perceive the same time series input in two different views. The fully convolutional block views the time series as a univariate time series with multiple time steps. If there is a time series of length N, the fully convolutional block will receive the data in N time steps.


In contrast, the LSTM block in the proposed architecture receives the input time series as a multivariate time series with a single time step. This is accomplished by the dimension shuffle layer, which transposes the temporal dimension of the time series. A univariate time series of length N, after transformation, will be viewed as a multivariate time series (having N variables) with a single time step. Without the dimension shuffle, the performance of the LSTM block is significantly reduced due to the rapid overfitting of small short-sequence UCR datasets and a failure to learn long term dependencies in the larger long-sequence UCR datasets.


In addition, the dimension shuffle improves the efficiency of this model by requiring an order of magnitude less time to train. When a dataset of N time steps and M variables is fed to an LSTM without dimension shuffling, the LSTM requires N time steps to process a batch of M variables. In contrast, applying the dimension shuffle to the input allows the LSTM model to process a batch of N variables in M time steps. This suggests that as long as the number of variables M is significantly smaller than the number of time steps N, the dimension shuffle will greatly improve the speed of training. As each of the UCR datasets is univariate, the LSTM component of this model requires only 1 time step to process a batch of N variables.

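A quick shape check makes this concrete: for a univariate series the shuffled input has a single time step, so the LSTM recurrence unrolls only once (the length below is illustrative).

```python
# Shape check of the dimension shuffle: (batch, N, 1) -> (batch, 1, N),
# so the LSTM sees one time step with N variables.
from tensorflow.keras.layers import Input, Permute

N = 100
inp = Input(shape=(N, 1))
shuffled = Permute((2, 1))(inp)
print(inp.shape, '->', shuffled.shape)   # (None, 100, 1) -> (None, 1, 100)
```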

To illustrate this, a total of 18 hours is required on a single GTX 1080 Ti to train an LSTM-FCN for each of the 85 UCR datasets, and 19 hours for ALSTM-FCN. Without the dimension shuffle, it would take more than 100 hours to train the respective models on all 85 UCR datasets.



C. REFINEMENT OF MODELS

Transfer learning is a technique wherein the knowledge gained from training a model on a dataset can be reused when training the model on another dataset, such that the domain of the new dataset has some similarity with the prior domain [31]. Similarly, we propose refinement, which can be described as transfer learning on the same dataset.


The training procedure can thus be split into two distinct phases. In the initial phase, the optimal hyperparameters for the model are selected for a given dataset. The model is then trained on the given dataset with these hyperparameter settings. In the second step, we apply refinement to this initial model.


The procedure of transfer learning is iterated over in the refinement phase, using the original dataset. Each repetition is initialized using the model weights of the previous iteration. At each iteration the learning rate is halved. Furthermore, the batch size is halved once every alternate iteration. This continues until the learning rate reaches 1e−4 and the batch size reaches 32. The procedure is repeated K times, where K is an arbitrary constant, generally set as 5.

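A sketch of this refinement schedule for a compiled Keras model is given below; the per-iteration epoch count is an illustrative assumption, and the floors of 1e−4 and 32 follow the description above.

```python
# Sketch of the refinement schedule: each repetition starts from the previous weights,
# the learning rate is halved every iteration and the batch size every other iteration,
# down to the floors of 1e-4 and 32. `epochs` per repetition is an illustrative assumption.
from tensorflow.keras import backend as K_backend

def refine(model, x, y, initial_lr=1e-3, initial_batch=128, repetitions=5, epochs=100):
    lr, batch_size = initial_lr, initial_batch
    for k in range(repetitions):
        lr = max(lr / 2.0, 1e-4)                   # halve the learning rate each iteration
        if k % 2 == 1:
            batch_size = max(batch_size // 2, 32)  # halve the batch size every other iteration
        K_backend.set_value(model.optimizer.learning_rate, lr)
        model.fit(x, y, batch_size=batch_size, epochs=epochs, verbose=0)
    return model
```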

Refinement is a procedure which successively attempts to improve the performance of a pre-trained model. As discussed by Huang et al. [32], multiple local minima lie along the optimization path of a model. Once a model has converged to some local minima during its initial training phase, it can be re-trained using a larger learning rate to escape the previous minima and hopefully land upon a better local minima.


Re-training while simultaneously reducing the learning rate and batch size allows for a more refined search to the best local optima.



IV. EXPERIMENTS

The proposed models have been tested on all 85 UCR time series datasets [21]. The FCN block was kept constant throughout all experiments. The optimal number of LSTM cells was found by a hyperparameter search over a range of 8 cells to 128 cells. The number of training epochs was generally kept constant at 2000 epochs, but was increased for datasets where the algorithm required a longer time to converge. An initial batch size of 128 was used, and halved for each successive iteration of the refinement algorithm. A high dropout rate of 80% was used after the LSTM or Attention LSTM layer to combat overfitting. Class imbalance was handled via a class weighting scheme inspired by King and Zeng [33]. All models were trained using the Keras [34] library with the TensorFlow [35] backend.


All models were trained via the Adam optimizer [36], with an initial learning rate of 1e−3 and a final learning rate of 1e−4. All convolution kernels were initialized with the initialization proposed by He et al. [37]. The learning rate was reduced by a factor of $\frac{1}{\sqrt[3]{2}}$ every 100 epochs of no improvement in the validation score, until the final learning rate was reached. No additional preprocessing was done on the UCR datasets as they have close to zero mean and unit variance. All models were refined. Scores stated in Table 1 refer to the scores obtained by models prior to and after refinement.

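A hedged sketch of this training configuration in Keras is shown below: Adam with the stated initial learning rate, a plateau-based schedule that multiplies the rate by $\frac{1}{\sqrt[3]{2}}$ after 100 epochs without improvement, and per-class weights for imbalanced datasets. The inverse-frequency weighting is a simple stand-in, not necessarily the exact scheme of King and Zeng [33], and monitoring validation loss is an assumption.

```python
# Sketch of the training setup: Adam (1e-3), ReduceLROnPlateau by a factor of 1/2^(1/3)
# after 100 epochs without improvement (floored at 1e-4), and class weights for imbalance.
import numpy as np
from tensorflow.keras.optimizers import Adam
from tensorflow.keras.callbacks import ReduceLROnPlateau

optimizer = Adam(learning_rate=1e-3)
lr_schedule = ReduceLROnPlateau(monitor='val_loss', factor=1.0 / 2 ** (1.0 / 3.0),
                                patience=100, min_lr=1e-4)

def class_weights(labels):
    """Simple inverse-frequency class weights (a stand-in weighting scheme)."""
    counts = np.bincount(labels)
    return {c: len(labels) / (len(counts) * n) for c, n in enumerate(counts)}

# Usage (model and integer-encoded y_train are assumed to exist):
# model.compile(optimizer=optimizer, loss='categorical_crossentropy', metrics=['accuracy'])
# model.fit(x_train, y_train_onehot, epochs=2000, batch_size=128,
#           validation_data=(x_val, y_val_onehot),
#           callbacks=[lr_schedule], class_weight=class_weights(y_train))
```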
TABLE 1. Performance of the proposed models on the 85 UCR datasets, before and after refinement.


A. EVALUATION METRICS

In this paper, the proposed model was evaluated using accuracy, rank-based statistics, and the mean per class error as stated by Wang et al. [11]. The rank-based evaluations used are the arithmetic rank, the geometric rank, and the Wilcoxon signed rank test. The arithmetic rank is the arithmetic mean of the rank of each dataset. The geometric rank is the geometric mean of the rank of each dataset. The Wilcoxon signed rank test is used to compare the median rank of the proposed model and the existing state-of-the-art models. The null hypothesis and alternative hypothesis are as follows:


$$H_0: \text{the median ranks of the two models are equal}$$

$$H_1: \text{the median ranks of the two models are not equal}$$

The mean per class error (MPCE) is defined as the arithmetic mean of the per class error (PCE) over all $K$ datasets,

$$PCE_k = \frac{e_k}{c_k}, \qquad MPCE = \frac{1}{K}\sum_{k} PCE_k,$$

where $e_k$ is the error rate and $c_k$ is the number of classes of dataset $k$.
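A tiny worked example of the MPCE computation, with made-up error rates and class counts:

```python
# Worked example of MPCE: PCE_k = e_k / c_k per dataset, MPCE = mean of the PCE values.
# The error rates and class counts below are made up for illustration.
error_rates = [0.10, 0.20, 0.05]   # e_k for three hypothetical datasets
num_classes = [2, 4, 5]            # c_k for the same datasets

pce = [e / c for e, c in zip(error_rates, num_classes)]
mpce = sum(pce) / len(pce)
print(pce, mpce)   # [0.05, 0.05, 0.01] -> MPCE ≈ 0.0367
```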


B. RESULTS

TABLE 2. Wilcoxon signed rank test comparison of each model.
TABLE 3. Summary of advantages of the proposed models.

Figure 2 is an example of the visual representation of the Attention LSTM cell on the “CBF” dataset. The points in the figure where the sequences are “squeezed” together are points at which all the classes have the same weight. These are the points in the time series at which the Attention LSTM can correctly identify the class. This is further supported by visual inspection of the actual time series. The squeeze points are points where each of the classes can be distinguished from each other, as shown in Figure 2.


The performance of the proposed models on the UCR datasets is summarized in Table 1. The colored cells are cells that outperform the state-of-the-art model for that dataset. Both proposed models, the ALSTM-FCN model and the LSTM-FCN model, in both phases, without refinement (Phase 1) and with refinement (Phase 2), outperform the state-of-the-art models on at least 43 datasets. The average arithmetic rank in Figure 3 indicates the superiority of our proposed models over the existing state-of-the-art models. This is further validated using the Wilcoxon signed rank test, where the p-value of each of the proposed models is less than 0.05 when compared to existing state-of-the-art models (Table 2).


The Wilcoxon signed rank test also provides evidence that refinement maintains or improves the overall accuracy of each of the proposed models. The MPCE of the LSTM-FCN and ALSTM-FCN models was found to reduce by 0.0035 and 0.0007 respectively when refinement was applied. Refinement improves the accuracy of the LSTM-FCN models on a greater number of datasets as compared to the ALSTM-FCN models. We postulate that this discrepancy is due to the fact that the LSTM-FCN model contains fewer total parameters than the ALSTM-FCN model. This indicates a lower rate of overfitting on the UCR datasets. As a consequence, refinement is more effective on the LSTM-FCN models for the UCR datasets.


A significant drawback of refinement is that it requires more training time due to the added computational complexity of re-training the model using smaller batch sizes. The disadvantages of refinement are mitigated when using the ALSTM-FCN within Phase 1. At the end of Phase 1, the ALSTM-FCN model outperforms the Phase 1 LSTM-FCN model. One of the major advantages of using the Attention LSTM cell is that it provides a visual representation of the attention vector. The Attention LSTM also benefits from refinement, but the effect is less significant as compared to the general LSTM model. A summary of the performance of each model type on certain characteristics is provided in Table 3.



V. CONCLUSION & FUTURE WORK

With the proposed models, we achieve a notable improvement on the current state-of-the-art for time series classification using deep neural networks. Our baseline models, with and without refinement, are trainable end-to-end with nominal preprocessing and are able to achieve significantly improved performance. LSTM-FCNs are able to augment FCN models, appreciably increasing their performance with a nominal increase in the number of parameters. ALSTM-FCNs enable one to visually inspect the decision process of the LSTM RNN and provide a strong baseline on their own. Refinement can be applied as a general procedure to a model to further elevate its performance. The strong increase in performance in comparison to the FCN models shows that LSTM RNNs can beneficially supplement the performance of FCN modules for time series classification. An overall analysis of the performance of our model is provided and compared to other techniques.


Due to the generality of the input to this model, it has wide ranging applicability on several sequence modelling tasks such as text analysis, music recognition and voice detection. Furthermore, due to its small size and efficiency, it can be easily deployed to real time systems or embedded systems. Additional research is to be done on understanding why the Attention LSTM cell is unsuccessful in matching the performance of the general LSTM cell on some of the datasets.


