JasonYangQ

Paper翻译：《Apple Leaf Diseases Recognition Based on An Improved Convolutional Neural Network》

论文名称：《Apple Leaf Diseases Recognition Based on An Improved Convolutional Neural Network》
论文作者：Yan Q , Yang B , Wang W , et al.
发表期刊：Sensors, 2020, 20(12):3535.
论文总结：

Research Gap:
改进VGG16（VGG16 + BN + GAP）对苹果叶部病害进行检测
Importance：
VGG16模型的模型参数上可以减少89%
对4类苹果叶片的ACC为99.01%

论文地址： https://www.researchgate.net/publication/342401878_Apple_Leaf_Diseases_Recognition_Based_on_An_Improved_Convolutional_Neural_Network
论文目录
- Abstract
- 1.Introduction
- 2.Methods
- - 2.1 Data
  - 2.2. VGG16 and Transfer Learning
  - - 2.2.1. VGG16
    - 2.2.2. Transfer Learning
  - 2.3. Improved CNNs Based on VGG16
  - - 2.3.1. Global Average Pooling Layer (GAP)
    - 2.3.2. Batch Normalization (BN)
    - 2.3.3. Adaptive Moment Estimation
- 3.Results and Discussion
- - 3.1.Comparison of Model Performance
  - 3.2.Convergence Rate Analysis
  - 3.3.Training Time and Parameters
  - 3.4.Comparison of Optimal Algorithms
  - 3.5.Data Augmentation
- 4.Conclusions

Abstract

摘要

原文译文

Abstract: Scab, frogeye spot, and cedar rust are three common types of apple leaf diseases, and the rapid diagnosis and accurate identification of them play an important role in the development of apple production. In this work, an improved model based on VGG16 is proposed to identify apple leaf diseases, in which the global average poling layer is used to replace the fully connected layer to reduce the parameters and a batch normalization layer is added to improve the convergence speed. A transfer learning strategy is used to avoid a long training time. The experimental results show that the overall accuracy of apple leaf classification based on the proposed model can reach 99.01%. Compared with the classical VGG16, the model parameters are reduced by 89%, the recognition accuracy is improved by 6.3%, and the training time is reduced to 0.56% of that of the original model. Therefore, the deep convolutional neural network model proposed in this work provides a better solution for the identification of apple leaf diseases with higher accuracy and a faster convergence speed. 摘要：黑星病、蛙眼斑病和雪松锈病是苹果常见的三种叶片病害，对其的快速诊断和准确鉴定对苹果生产的发展具有重要意义。在这项工作中，提出了一种基于VGG16的改进模型来识别苹果叶片病害，其中使用全局平均极化层代替全连接层以减少参数，并添加批量归一化层以提高收敛速度。迁移学习策略用于避免长时间的训练。实验结果表明，基于该模型的苹果叶片分类总体准确率可达99.01%。与经典VGG16相比，模型参数降低89%，识别准确率提升6.3%，训练时间减少至原模型的0.56%。因此，本文提出的深度卷积神经网络模型为苹果叶片病害识别提供了更好的解决方案，具有更高的准确率和更快的收敛速度。

Keywords: apple leaf diseases; transfer learning; deep learning; convolutional neural networks 关键词：苹果叶病；迁移学习；深度学习；卷积神经网络

原文	译文
Abstract: Scab, `frogeye spot`, and `cedar rust` are three common types of apple leaf diseases, and the `rapid` diagnosis and accurate identification of them play an important role in the development of apple production. In this work, an improved model based on VGG16 is proposed to identify apple leaf diseases, in which the global average poling layer is used to replace the fully connected layer to reduce the parameters and a batch normalization layer is added to improve the convergence speed. A transfer learning strategy is used to avoid a long training time. The experimental results show that the overall accuracy of apple leaf classification based on the proposed model can reach 99.01%. Compared with the classical VGG16, the model parameters are reduced by 89%, the recognition accuracy is improved by 6.3%, and the training time is reduced to 0.56% of that of the original model. Therefore, the deep convolutional neural network model proposed in this work provides a better solution for the identification of apple leaf diseases with higher accuracy and a faster `convergence` speed.	摘要：黑星病、`蛙眼斑病`和`雪松锈病`是苹果常见的三种叶片病害，对其的`快速`诊断和准确鉴定对苹果生产的发展具有重要意义。在这项工作中，提出了一种基于VGG16的改进模型来识别苹果叶片病害，其中使用全局平均极化层代替全连接层以减少参数，并添加批量归一化层以提高收敛速度。迁移学习策略用于避免长时间的训练。实验结果表明，基于该模型的苹果叶片分类总体准确率可达99.01%。与经典VGG16相比，模型参数降低89%，识别准确率提升6.3%，训练时间减少至原模型的0.56%。因此，本文提出的深度卷积神经网络模型为苹果叶片病害识别提供了更好的解决方案，具有更高的准确率和更快的`收敛`速度。
Keywords: apple leaf diseases; transfer learning; deep learning; convolutional neural networks	关键词：苹果叶病；迁移学习；深度学习；卷积神经网络

1.Introduction

原文	译文
Leaf diseases are one of the main obstacles to apple production. `Among them`, scab, frogeye spot, and cedar rust are three most common types of apple leaf diseases and have a bad impact on apple growing. Therefore, the detection of apple leaf diseases has attracted more and more attention, and the early identification of apple leaf disease is very important for the intervention of treatment. In the past, disease identification methods were generally divided into manual identification and an expert system. However, both of them are highly dependent on fruit growers and experts and are time-consuming and usually poor in generalization.	叶病是苹果生产的主要障碍之一。 `其中`，疮痂病、蛙眼斑病和雪松锈病是最常见的三种苹果叶病害，对苹果的生长影响很大。因此，苹果叶病的检测越来越受到重视，而苹果叶病的早期识别对于干预治疗非常重要。过去，疾病识别方法一般分为人工识别和专家系统。然而，两者都高度依赖水果种植者和专家，并且耗时且普遍性较差。
With the development of machine learning methods, some computational models have been proposed for plant disease diagnosis based on different algorithms. Some studies have found diseased regions by K-means clustering-based segmentation and build disease recognition models using supervised learning methods, including the random forest, support vector machine (SVM), and K-nearest neighbor methods [1–3]. Rothe et al. used an active contour model for image segmentation and extracted Hu’s moments as features for the training of an adaptive neuro-fuzzy inference system, by which a classification accuracy of 85% can be achieved [4]. Gupta et al. proposed an autonomously modified SVM-CS model where a SVM model was trained and optimized using the concept of a `cuckoo` search [5]. However, these classification features are heavily depended on man-made selection and the recognition rates are not satisfactory.	随着机器学习方法的发展，一些基于不同算法的植物病害诊断计算模型被提出。一些研究通过基于 K 均值聚类的分割发现了病变区域，并使用监督学习方法构建了疾病识别模型，包括随机森林、支持向量机 (SVM) 和 K 近邻方法 [1-3]。罗特等人使用主动轮廓模型进行图像分割，并提取 Hu 的矩作为特征用于训练自适应神经模糊推理系统，由此可以实现 85% 的分类准确率 [4]。古普塔等人提出了一种自主修改的 SVM-CS 模型，其中使用`布谷鸟`搜索的概念对 SVM 模型进行训练和优化 [5]。然而，这些分类特征严重依赖于人为选择，识别率并不理想。
In recent years, convolutional neural networks (CNNs) have shown good results in recognition tasks by reducing the need for image preprocessing and improving the identification accuracy [6–13]. Leaf disease recognition based on CNNs has become a new hotspot in the agricultural informatization area [14–16]. Lu et al. proposed a rice disease identification method based on deep CNN techniques and achieved an accuracy of 95.48% on a dataset of 500 natural images of diseased and healthy rice leaves [17]. Zhang et al. proposed the improved GoogLeNet and Cifar10 models and obtained the average identification accuracies of 98.9% and 98.8%, respectively [18]. Liu et al. designed a novel architecture of AlexNet to detect apple leaf diseases, and the experimental results showed that this approach achieved an overall accuracy of 97.62% for disease identification [19]. Although the recognition accuracy of these CNN models is higher than that of traditional machine learning methods, there are still some shortcomings—such as high model `complexity`, much more parameters, and a long training time—which prevent their application in real environments.	近年来，卷积神经网络（CNN）通过减少对图像预处理的需求和提高识别精度[6-13]，在识别任务中显示出良好的效果。基于CNNs的叶片病害识别已成为农业信息化领域的新热点[14-16]。卢等人提出了一种基于深度 CNN 技术的水稻病害识别方法，在 500 张病叶和健康水稻自然图像的数据集上实现了 95.48% 的准确率 [17]。张等人提出了改进的 GoogLeNet 和 Cifar10 模型，分别获得了 98.9% 和 98.8% 的平均识别准确率 [18]。刘等人设计了一种新的 AlexNet 架构来检测苹果叶病，实验结果表明，该方法对病害识别的总体准确率达到了 97.62% [19]。尽管这些 CNN 模型的识别准确率高于传统机器学习方法，但仍存在一些缺点，例如模型`复杂度`高、参数多、训练时间长等，阻碍了它们在实际环境中的应用。
In this work, we propose a method for apple leaf disease identification based on an improved deep convolution neural network architecture which can effectively reduce the model complexity and training time. The network proposed in this work `adopts` the concept of transfer learning to pre-train a VGG16 network and adjusts the network structure by removing three fully connected layers, adding a global average pooling layer, a batch normalization layer, and a fully connected layer. Based on a benchmark dataset, the proposed model, which can reach a 89% reduction in the model parameters of the original VGG16 model, greatly reduced the training time and achieved a higher accuracy rate.	在这项工作中，我们提出了一种基于改进的深度卷积神经网络架构的苹果叶片病害识别方法，可以有效降低模型复杂度和训练时间。这项工作中提出的网络`采用`迁移学习的概念来预训练一个 VGG16 网络，并通过移除三个全连接层、添加一个全局平均池化层、一个批量归一化层和一个全连接层来调整网络结构。基于基准数据集，所提出的模型在原始VGG16模型的模型参数上可以减少89%，大大减少了训练时间并获得了更高的准确率。

2.Methods

2.1 Data

原文译文

The dataset in this work is from the “2008 ’AI Challenger’ Global Challenge” and includes 10 kinds of plants with 27 categories of diseases. This work addresses the automatic identification of apple leaf diseases, therefore only apple leaves are selected from this dataset. There are four categories of apple leaf images within the dataset, and Figure 1 lists some of them. With the exception of healthy leaves, three types of disease images—i.e., scab, frogeye spot, and cedar rust—are collected within the dataset. Typically, the lesions on scab leaves are gray-brown and nearly round or radial, frogeye spot is tan and the shape is flakes or dots, and cedar rust leaves have round orange-yellow lesions with red edges. Some spot and cedar rust lesions are similar in color and shape, which increases the difficulty in recognition by computational methods. 本作品中的数据集来自“2008‘AI Challenger’全球挑战赛”，包括10种植物27类病害。这项工作解决了苹果叶病害的自动识别，因此仅从该数据集中选择了苹果叶。数据集中有四类苹果叶图像，图 1 列出了其中的一些。除了健康的叶子外，数据集中收集了三种类型的疾病图像，即疮痂病、蛙眼斑病和雪松锈病。通常，疮痂病叶上的病斑呈灰褐色，近圆形或放射状，蛙眼斑呈棕褐色，呈片状或点状，雪松锈病叶呈橙黄色圆形病斑，边缘呈红色。一些斑锈病和雪松锈病在颜色和形状上相似，这增加了计算方法识别的难度。

In this work, there are 2446 pictures collected within our dataset, where 1340 of them are healthy, 411 are scab, 487 are frogeye spot, and 208 are cedar rust. In the original dataset, the dataset was divided into two subsets—i.e., 2141 pictures were for model training and the remaining 305 ones for testing. The details about the dataset are shown in Table 1. 在这项工作中，我们的数据集中收集了 2446 张图片，其中 1340 张是健康的，411 张是痂，487 张是蛙眼斑，208 张是雪松锈病。在原始数据集中，数据集被分为两个子集——即 2141 张图片用于模型训练，其余 305 张用于测试。数据集的详细信息如表 1 所示。

原文	译文
The dataset in this work is from the “2008 ’AI Challenger’ Global Challenge” and includes 10 kinds of plants with 27 categories of diseases. This work `addresses` the automatic identification of apple leaf diseases, therefore only apple leaves are selected from this dataset. There are four categories of apple leaf images within the dataset, and Figure 1 lists some of them. With the exception of healthy leaves, three types of disease images—i.e., scab, frogeye spot, and cedar rust—are collected within the dataset. Typically, the lesions on scab leaves are gray-brown and nearly round or radial, frogeye spot is tan and the shape is flakes or dots, and cedar rust leaves have round orange-yellow lesions with red edges. Some spot and cedar rust lesions are similar in color and shape, which increases the difficulty in recognition by computational methods.	本作品中的数据集来自“2008‘AI Challenger’全球挑战赛”，包括10种植物27类病害。这项工作`解决`了苹果叶病害的自动识别，因此仅从该数据集中选择了苹果叶。数据集中有四类苹果叶图像，图 1 列出了其中的一些。除了健康的叶子外，数据集中收集了三种类型的疾病图像，即疮痂病、蛙眼斑病和雪松锈病。通常，疮痂病叶上的病斑呈灰褐色，近圆形或放射状，蛙眼斑呈棕褐色，呈片状或点状，雪松锈病叶呈橙黄色圆形病斑，边缘呈红色。一些斑锈病和雪松锈病在颜色和形状上相似，这增加了计算方法识别的难度。
In this work, there are 2446 pictures collected within our dataset, where 1340 of them are healthy, 411 are scab, 487 are frogeye spot, and 208 are cedar rust. In the original dataset, the dataset was divided into two subsets—i.e., 2141 pictures were for model training and the remaining 305 ones for testing. The details about the dataset are shown in Table 1.	在这项工作中，我们的数据集中收集了 2446 张图片，其中 1340 张是健康的，411 张是痂，487 张是蛙眼斑，208 张是雪松锈病。在原始数据集中，数据集被分为两个子集——即 2141 张图片用于模型训练，其余 305 张用于测试。数据集的详细信息如表 1 所示。

2.2. VGG16 and Transfer Learning

2.2.1. VGG16

原文译文

With the rapid development of deep learning, CNNs had been applied widely in different fields, especially in image classification and recognition and target location and detection [20]. A CNN is a special multi-layer perceptron (MLP) or multilayered feed forward neural network, which generally consists of an input layer, convolution layer, pooling layer, fully connected layer, and output layer. The convolution layer can realize dimensionality reduction and feature extraction by implementing two design concepts: local perception and parameter sharing. The pooling layer can reduce the size of the data, where smart sampling also has the invariance of local linear transformation, which enhances the generalization ability of convolutional neural networks. The fully connected layer acts as a classifier in the whole neural network. It is common for multiple fully connected layers to be used after several rounds of convolution, and the resulting structure of the last convolutional layer is flattened [21,22]. 随着深度学习的飞速发展，CNNs 在不同领域得到了广泛的应用，特别是在图像分类识别和目标定位检测等领域[20]。 CNN 是一种特殊的多层感知器 (MLP) 或多层前馈神经网络，一般由输入层、卷积层、池化层、全连接层和输出层组成。卷积层通过实现局部感知和参数共享两个设计理念，可以实现降维和特征提取。池化层可以减少数据的规模，其中智能采样还具有局部线性变换的不变性，增强了卷积神经网络的泛化能力。全连接层充当整个神经网络中的分类器。在几轮卷积后使用多个全连接层是很常见的，最后一个卷积层的结果结构被展平[21,22]。

The VGG16 contains 16 convolutional layers with very small receptive fields, 3 × 3, and five max‐pooling layers of size 2 × 2 for carrying out spatial pooling, followed by three fully connected layers. A classical VGG16 model involves 144 million parameters, where rectification nonlinearity (ReLU) activation is applied to all hidden space pooling and the softmax function is applied in the final layer [23]. The model also uses dropout regularization in the fully connected layers. A schematic of the VGG16 architecture is shown in Figure 2, where the marked red box shows a classifier consisting of three fully connected layers. VGG16 包含 16 个具有非常小的感受野的卷积层，3 × 3，和五个大小为 2 × 2 的最大池化层，用于执行空间池化，然后是三个全连接层。一个经典的 VGG16 模型涉及 1.44 亿个参数，其中整流非线性 (ReLU) 激活应用于所有隐藏空间池化，而 softmax 函数应用于最后一层 [23]。该模型还在全连接层中使用了 dropout 正则化。 VGG16 架构的示意图如图 2 所示，其中标记的红色框显示了一个由三个全连接层组成的分类器。

原文	译文
With the rapid development of deep learning, CNNs had been applied widely in different fields, especially in image classification and recognition and target location and detection [20]. A CNN is a special multi-layer perceptron (MLP) or multilayered feed forward neural network, which generally consists of an input layer, convolution layer, pooling layer, fully connected layer, and output layer. The convolution layer can realize `dimensionality reduction` and feature extraction by implementing two design concepts: local perception and parameter sharing. The pooling layer can reduce the size of the data, where smart sampling also has the invariance of local linear transformation, which enhances the generalization ability of convolutional neural networks. The fully connected layer acts as a classifier in the whole neural network. It is common for multiple fully connected layers to be used after several rounds of convolution, and the resulting structure of the last convolutional layer is flattened [21,22].	随着深度学习的飞速发展，CNNs 在不同领域得到了广泛的应用，特别是在图像分类识别和目标定位检测等领域[20]。 CNN 是一种特殊的多层感知器 (MLP) 或多层前馈神经网络，一般由输入层、卷积层、池化层、全连接层和输出层组成。卷积层通过实现局部感知和参数共享两个设计理念，可以实现`降维`和特征提取。池化层可以减少数据的规模，其中智能采样还具有局部线性变换的不变性，增强了卷积神经网络的泛化能力。全连接层充当整个神经网络中的分类器。在几轮卷积后使用多个全连接层是很常见的，最后一个卷积层的结果结构被展平[21,22]。
The VGG16 contains 16 convolutional layers with very small receptive fields, 3 × 3, and five max‐pooling layers of size 2 × 2 for carrying out spatial pooling, followed by three fully connected layers. A classical VGG16 model involves 144 million parameters, where rectification nonlinearity (ReLU) activation is applied to all hidden space pooling and the softmax function is applied in the final layer [23]. The model also uses dropout regularization in the fully connected layers. A `schematic` of the VGG16 architecture is shown in Figure 2, where the marked red box shows a classifier consisting of three fully connected layers.	VGG16 包含 16 个具有非常小的感受野的卷积层，3 × 3，和五个大小为 2 × 2 的最大池化层，用于执行空间池化，然后是三个全连接层。一个经典的 VGG16 模型涉及 1.44 亿个参数，其中整流非线性 (ReLU) 激活应用于所有隐藏空间池化，而 softmax 函数应用于最后一层 [23]。该模型还在全连接层中使用了 dropout 正则化。 VGG16 架构的`示意图`如图 2 所示，其中标记的红色框显示了一个由三个全连接层组成的分类器。

2.2.2. Transfer Learning

原文	译文
CNNs typically require a large annotated image dataset to achieve a high predictive accuracy. However, the acquisition of such data is difficult and labeling them is costly in many areas. In light of these challenges, the concept of transfer learning is adopted in many previous studies for solving cross-domain image classification problems and has been shown to be very useful, where the “off-the-shelf” features of `well-established` CNNs, such as VGG16, AlexNet, and GoogLeNet, are pre-trained on large-scale annotated natural image datasets, such as ImageNet, where 15 million images are involved [24–27].	CNN 通常需要一个大的带注释的图像数据集来实现高预测精度。然而，获取此类数据很困难，并且在许多领域标记它们的成本很高。鉴于这些挑战，之前许多研究都采用了迁移学习的概念来解决跨域图像分类问题，并且已被证明非常有用，其中`成熟`的 CNN 的“现成”特征，例如 VGG16、AlexNet 和 GoogLeNet，在大规模带注释的自然图像数据集上进行了预训练，例如 ImageNet，其中涉及 1500 万张图像 [24-27]。
One common strategy of transfer learning is feature transfer, which removes the last layer of the pre-trained network and sends its previous activation values, which can be regarded as feature vectors, into classifiers for training. Another is parameter transfer, which only needs to re-initialize a few layers of the network, such as the last layer, and the other layers directly using the weight parameters of the pre-trained network, while a new dataset is used to finetune the network parameters [28–30].	迁移学习的一种常见策略是特征迁移，即移除预训练网络的最后一层，并将其先前的激活值（可以视为特征向量）发送到分类器中进行训练。另一个是参数传递，只需要重新初始化网络的几层，比如最后一层，其他层直接使用预训练网络的权重参数，同时使用新的数据集对网络进行微调网络参数 [28-30]。
Because of the small amount of data in this work, training a neural network from scratch will take a long time, and the data insufficiency easily causes an over-fitting problem, which will bring the model poor robustness. Therefore, we can use the idea of transfer learning, where a pre‐trained model is built on ImageNet to optimize the classification and recognition of apple leaf diseases. Herein, the VGG16 is fine tuned to fit our own data, which can save a lot of training time.	由于这项工作的数据量较小，从头开始训练一个神经网络需要很长时间，而且数据不足容易造成过拟合问题，从而导致模型鲁棒性较差。因此，我们可以使用迁移学习的思想，在 ImageNet 上建立一个预训练的模型来优化苹果叶病的分类和识别。在这里，VGG16经过微调以适合我们自己的数据，可以节省大量训练时间。

2.3. Improved CNNs Based on VGG16

原文	译文
A classical VGG16 network has a strong ability of image feature extraction and recognition. Its core idea is to use smaller convolution kernels to increase the depth of the network, which was the key to win the runner-up position in positioning and classification tasks in the ILSVRC Challenge in 2014. However, the VGG16 model has a huge amount of parameters, which will cause a slow convergence speed, long training time, and large storage capacity in practical applications.	经典的 VGG16 网络具有很强的图像特征提取和识别能力。它的核心思想是使用更小的卷积核来增加网络的深度，这是在 2014 年 ILSVRC Challenge 中获得定位和分类任务亚军的关键。然而，VGG16 模型有大量的参数，在实际应用中会导致收敛速度慢、训练时间长、存储容量大。
To address these problems, this work improves the VGG16 model by using a global average pooling layer, a batch normalization layer and a fully connected layer to replace the three fully connected layers in the original model. The global average pooling layer is used to replace the fully connected layer to reduce the parameters, and the batch normalization layer is added to improve the convergence speed. In order to avoid a long training time, the weights of the convolution layers are pre-trained by VGG16 on ImageNet. The stochastic gradient descent (SGD) optimizer is replaced by an adaptive moment estimation (Adam) to accelerate the convergence of the network. The network structure is shown in Figure 3, where the improvement of a classifier consisting of a global average pooling layer, a batch normalization layer, and a fully connected layer is shown within the marked green box.	为了解决这些问题，这项工作通过使用全局平均池化层、批量归一化层和全连接层来替换原始模型中的三个全连接层来改进 VGG16 模型。使用全局平均池化层代替全连接层减少参数，加入批量归一化层提高收敛速度。为了避免训练时间过长，卷积层的权重在 ImageNet 上通过 VGG16 进行预训练。随机梯度下降 (SGD) 优化器被自适应矩估计 (Adam) 取代，以加速网络的收敛。网络结构如图 3 所示，其中由全局平均池化层、批量归一化层和全连接层组成的分类器的改进显示在标记的绿色框中。

原文

译文

A classical VGG16 network has a strong ability of image feature extraction and recognition. Its core idea is to use smaller convolution kernels to increase the depth of the network, which was the key to win the runner-up position in positioning and classification tasks in the ILSVRC Challenge in 2014. However, the VGG16 model has a huge amount of parameters, which will cause a slow convergence speed, long training time, and large storage capacity in practical applications.

经典的 VGG16 网络具有很强的图像特征提取和识别能力。它的核心思想是使用更小的卷积核来增加网络的深度，这是在 2014 年 ILSVRC Challenge 中获得定位和分类任务亚军的关键。然而，VGG16 模型有大量的参数，在实际应用中会导致收敛速度慢、训练时间长、存储容量大。

To address these problems, this work improves the VGG16 model by using a global average pooling layer, a batch normalization layer and a fully connected layer to replace the three fully connected layers in the original model. The global average pooling layer is used to replace the fully connected layer to reduce the parameters, and the batch normalization layer is added to improve the convergence speed. In order to avoid a long training time, the weights of the convolution layers are pre-trained by VGG16 on ImageNet. The stochastic gradient descent (SGD) optimizer is replaced by an adaptive moment estimation (Adam) to accelerate the convergence of the network. The network structure is shown in Figure 3, where the improvement of a classifier consisting of a global average pooling layer, a batch normalization layer, and a fully connected layer is shown within the marked green box.

为了解决这些问题，这项工作通过使用全局平均池化层、批量归一化层和全连接层来替换原始模型中的三个全连接层来改进 VGG16 模型。使用全局平均池化层代替全连接层减少参数，加入批量归一化层提高收敛速度。为了避免训练时间过长，卷积层的权重在 ImageNet 上通过 VGG16 进行预训练。随机梯度下降 (SGD) 优化器被自适应矩估计 (Adam) 取代，以加速网络的收敛。网络结构如图 3 所示，其中由全局平均池化层、批量归一化层和全连接层组成的分类器的改进显示在标记的绿色框中。

2.3.1. Global Average Pooling Layer (GAP)

原文	译文
Global average pooling is to regularize the whole network structure to prevent over-fitting and reduce the dimensions from 3D to 1D [31,32]. In this work, the feature maps in the last convolution layer are averaged into a series of 1D outputs which is shown in Figure 4. A GAP can omit the expansion of the feature maps into vectors and full connection processing, and therefore greatly reduces the number of parameters. The advantage of a GAP over a fully connected layer is that it can preserve the convolution structure better by enhancing the correspondence between the feature maps and analogy, making the classification of the feature map credible and well-explained.	全局平均池化是对整个网络结构进行正则化以防止过拟合并将维度从 3D 减少到 1D [31,32]。在这项工作中，最后一个卷积层中的特征图被平均为一系列一维输出，如图 4 所示。 GAP 可以省略将特征图扩展为向量和全连接处理，因此大大减少了数量的参数。 GAP 相对于全连接层的优势在于它可以通过增强特征图和类比之间的对应关系更好地保留卷积结构，使特征图的分类可信且易于解释。

原文

译文

Global average pooling is to regularize the whole network structure to prevent over-fitting and reduce the dimensions from 3D to 1D [31,32]. In this work, the feature maps in the last convolution layer are averaged into a series of 1D outputs which is shown in Figure 4. A GAP can omit the expansion of the feature maps into vectors and full connection processing, and therefore greatly reduces the number of parameters. The advantage of a GAP over a fully connected layer is that it can preserve the convolution structure better by enhancing the correspondence between the feature maps and analogy, making the classification of the feature map credible and well-explained.

全局平均池化是对整个网络结构进行正则化以防止过拟合并将维度从 3D 减少到 1D [31,32]。在这项工作中，最后一个卷积层中的特征图被平均为一系列一维输出，如图 4 所示。 GAP 可以省略将特征图扩展为向量和全连接处理，因此大大减少了数量的参数。 GAP 相对于全连接层的优势在于它可以通过增强特征图和类比之间的对应关系更好地保留卷积结构，使特征图的分类可信且易于解释。

2.3.2. Batch Normalization (BN)

原文	译文
In deep learning, because the number of layers in the network is very large, if the data distribution at a certain layer starts to deviate significantly, this problem will intensify as the network deepens, which will increase the difficulty of the model optimization. Therefore, normalization helps to alleviate this problem. This method of batch normalization divides the data into several groups and updates the parameters according to the groups. The data in one group jointly determines the direction of the gradient and reduces the randomness when declining. On the other hand, because the number of samples in the batch is much smaller than the entire dataset, the amount of calculation has also dropped significantly. The batch normalization layer normalizes the inputs to the layer before the activation function is implemented, which can solve the problems of input data offset and increase [33].	在深度学习中，由于网络的层数非常多，如果某一层的数据分布开始出现明显偏差，这个问题会随着网络的加深而加剧，从而增加模型优化的难度。因此，归一化有助于缓解这个问题。这种批量归一化的方法将数据分成几组，并根据组更新参数。一组中的数据共同决定梯度的方向，减少下降时的随机性。另一方面，由于batch中的样本数量远小于整个数据集，计算量也大幅下降。批量归一化层在激活函数实现之前对层的输入进行归一化，可以解决输入数据偏移和增加的问题[33]。
Based on the BN algorithm, the parameters of the input layer are normalized and the activation function cannot affect the distribution of neurons. The importance of neurons will be weakened and some of them may be removed automatically. Because of the normalization of each epoch, the risk of parameter changes caused by a different data distribution is reduced and the convergence speed is accelerated.	基于BN算法，对输入层的参数进行归一化，激活函数不会影响神经元的分布。神经元的重要性将被削弱，其中一些可能会自动删除。由于每个epoch的归一化，降低了数据分布不同导致参数变化的风险，加快了收敛速度。

原文

译文

In deep learning, because the number of layers in the network is very large, if the data distribution at a certain layer starts to deviate significantly, this problem will intensify as the network deepens, which will increase the difficulty of the model optimization. Therefore, normalization helps to alleviate this problem. This method of batch normalization divides the data into several groups and updates the parameters according to the groups. The data in one group jointly determines the direction of the gradient and reduces the randomness when declining. On the other hand, because the number of samples in the batch is much smaller than the entire dataset, the amount of calculation has also dropped significantly. The batch normalization layer normalizes the inputs to the layer before the activation function is implemented, which can solve the problems of input data offset and increase [33].

在深度学习中，由于网络的层数非常多，如果某一层的数据分布开始出现明显偏差，这个问题会随着网络的加深而加剧，从而增加模型优化的难度。因此，归一化有助于缓解这个问题。这种批量归一化的方法将数据分成几组，并根据组更新参数。一组中的数据共同决定梯度的方向，减少下降时的随机性。另一方面，由于batch中的样本数量远小于整个数据集，计算量也大幅下降。批量归一化层在激活函数实现之前对层的输入进行归一化，可以解决输入数据偏移和增加的问题[33]。

Based on the BN algorithm, the parameters of the input layer are normalized and the activation function cannot affect the distribution of neurons. The importance of neurons will be weakened and some of them may be removed automatically. Because of the normalization of each epoch, the risk of parameter changes caused by a different data distribution is reduced and the convergence speed is accelerated.

基于BN算法，对输入层的参数进行归一化，激活函数不会影响神经元的分布。神经元的重要性将被削弱，其中一些可能会自动删除。由于每个epoch的归一化，降低了数据分布不同导致参数变化的风险，加快了收敛速度。

2.3.3. Adaptive Moment Estimation

2.3.3. 自适应矩估计

原文	译文
Adam is an extension of the stochastic gradient descent algorithm which can iteratively update the neural network weights based on training data [34,35]. This method not only stores the exponential decay mean of the square gradient but also preserves the exponential decay mean of the previously calculated first-order and second-order moment estimation of the gradient. It also designs different adaptive learning rates for different parameters. Optimization algorithms such as SGD maintain a single learning rate during the training process, and Adam can iteratively update the neural network weights based on the training data. When the parameters are backpropagated and updated, the Adam algorithm can better adjust the learning rate. Thus, Adam has a fast convergence speed and effective learning effect. It can also correct the problems existing in other optimization techniques, such as the loss function fluctuation caused by the disappearance of the learning rate, slow convergence, or parameter updating with high variance.	Adam 是随机梯度下降算法的扩展，它可以根据训练数据迭代更新神经网络权重 [34,35]。该方法不仅存储了平方梯度的指数衰减均值，而且还保留了先前计算的梯度一阶和二阶矩估计的指数衰减均值。它还针对不同的参数设计了不同的自适应学习率。 SGD 等优化算法在训练过程中保持单一学习率，Adam 可以根据训练数据迭代更新神经网络权重。当参数进行反向传播和更新时，Adam 算法可以更好地调整学习率。因此，Adam 具有快速的收敛速度和有效的学习效果。它还可以纠正其他优化技术中存在的问题，例如由于学习率消失、收敛速度慢或参数更新方差大而导致的损失函数波动。

原文

译文

Adam is an extension of the stochastic gradient descent algorithm which can iteratively update the neural network weights based on training data [34,35]. This method not only stores the exponential decay mean of the square gradient but also preserves the exponential decay mean of the previously calculated first-order and second-order moment estimation of the gradient. It also designs different adaptive learning rates for different parameters. Optimization algorithms such as SGD maintain a single learning rate during the training process, and Adam can iteratively update the neural network weights based on the training data. When the parameters are backpropagated and updated, the Adam algorithm can better adjust the learning rate. Thus, Adam has a fast convergence speed and effective learning effect. It can also correct the problems existing in other optimization techniques, such as the loss function fluctuation caused by the disappearance of the learning rate, slow convergence, or parameter updating with high variance.

Adam 是随机梯度下降算法的扩展，它可以根据训练数据迭代更新神经网络权重 [34,35]。该方法不仅存储了平方梯度的指数衰减均值，而且还保留了先前计算的梯度一阶和二阶矩估计的指数衰减均值。它还针对不同的参数设计了不同的自适应学习率。 SGD 等优化算法在训练过程中保持单一学习率，Adam 可以根据训练数据迭代更新神经网络权重。当参数进行反向传播和更新时，Adam 算法可以更好地调整学习率。因此，Adam 具有快速的收敛速度和有效的学习效果。它还可以纠正其他优化技术中存在的问题，例如由于学习率消失、收敛速度慢或参数更新方差大而导致的损失函数波动。

3.Results and Discussion

原文	译文
In this work, the proposed model was implemented with the Keras deep learning framework using a Intel® Core™ i7-8750H GPU (LENOVO, Jiangsu, China). The ImageNet pre-trained VGG16 CNN implemented within Keras Applications takes in a default image input size of 227 × 227. Therefore, all the pictures in our dataset were cut to the same size of 227 × 227.	在这项工作中，所提出的模型是通过使用英特尔® 酷睿™ i7-8750H GPU（中国江苏联想）的 Keras 深度学习框架实现的。在 Keras Applications 中实现的 ImageNet 预训练 VGG16 CNN 接受默认图像输入大小为 227 × 227。因此，我们数据集中的所有图片都被切割为相同的 227 × 227 大小。
The proposed CNN is trained on 2141 training pictures and tested on 305 ones, and the confusion is totally accurate, only one healthy picture is misclassified as scab, and only one is misclassified as healthy in both of scab and frogeye spot categories.	所提出的 CNN 在 2141 张训练图片上进行了训练，并在 305 张上进行了测试，混淆矩阵是完全准确的，只有一张健康的图片被错误分类为痂，只有一张在痂和蛙眼斑类别中都被错误分类为健康。

原文	译文
For the three misclassified pictures in the original dataset, Figure 5 lists the original one, its visualization of the last convolution layer and the superposition of the heat map of the original picture. There are some enlightenments can be found from these pictures. In Figure 5b, the strong light and small disease features may lead to the inaccurate extraction of disease features by the model. The frogeye spots in Figure 5c are small in size and light in color, which will leads to prediction errors with comparison to the dark area, for light is strongly learned in the network and therefore has a bigger weight.	对于原始数据集中的三张错误分类的图片，图5列出了原始的一张，它对最后一个卷积层的可视化和原始图片热图的叠加。从这些图片中可以找到一些启示。在图5b中，强光和小的疾病特征可能导致模型对疾病特征的提取不准确。图 5c 中的蛙眼斑点尺寸小，颜色浅，与暗区域相比，这将导致预测错误，因为光在网络中被强烈学习，因此具有更大的权重。

原文

译文

For the three misclassified pictures in the original dataset, Figure 5 lists the original one, its visualization of the last convolution layer and the superposition of the heat map of the original picture. There are some enlightenments can be found from these pictures. In Figure 5b, the strong light and small disease features may lead to the inaccurate extraction of disease features by the model. The frogeye spots in Figure 5c are small in size and light in color, which will leads to prediction errors with comparison to the dark area, for light is strongly learned in the network and therefore has a bigger weight.

对于原始数据集中的三张错误分类的图片，图5列出了原始的一张，它对最后一个卷积层的可视化和原始图片热图的叠加。从这些图片中可以找到一些启示。在图5b中，强光和小的疾病特征可能导致模型对疾病特征的提取不准确。图 5c 中的蛙眼斑点尺寸小，颜色浅，与暗区域相比，这将导致预测错误，因为光在网络中被强烈学习，因此具有更大的权重。

3.1.Comparison of Model Performance

原文	译文
To evaluate the performance of the proposed VGG model, four typical convolutional neural networks—i.e., AlexNet, GoogleNet, Resnet-34, and VGG16—are also implemented. Another apple leaf disease recognition structure presented by Liu et al., where the inception structure was added into the AlexNet framework, has also been compared. The recognition accuracy of the different models is shown in Figure 6.	为了评估所提出的 VGG 模型的性能，还实现了四个典型的卷积神经网络，即 AlexNet、GoogleNet、Resnet-34 和 VGG16。 Liu 等人提出的另一种苹果叶病识别结构，将初始结构添加到 AlexNet 框架中，也进行了比较。不同模型的识别准确率如图6所示。
It can be found that the accuracy of AlexNet and the original VGG16 is 93.11%, ResNet34 is 95.73%, and GoogleNet can reach 97.70%. When the inception structure was combined with AlexNet, the identification accuracy can be increased to 97.05%, which is higher than the original AlexNet. It can be seen that our work achieves the highest accuracy in the identification of apple leaf diseases—i.e, a 99.01% accuracy—which demonstrates the effectiveness of the proposed model. Compared to the other five models, whether in terms of precision, recall, or F1‐score, our model achieved the highest value.	可以发现AlexNet和原始VGG16的准确率为93.11%，ResNet34为95.73%，GoogleNet可以达到97.70%。当inception结构与AlexNet结合时，识别准确率可以提高到97.05%，高于原来的AlexNet。可以看出，我们的工作在识别苹果叶病害方面达到了最高的准确率——即 99.01% 的准确率——这证明了所提出模型的有效性。与其他五个模型相比，无论是在精度、召回率还是 F1 分数方面，我们的模型都取得了最高的值。

原文

译文

To evaluate the performance of the proposed VGG model, four typical convolutional neural networks—i.e., AlexNet, GoogleNet, Resnet-34, and VGG16—are also implemented. Another apple leaf disease recognition structure presented by Liu et al., where the inception structure was added into the AlexNet framework, has also been compared. The recognition accuracy of the different models is shown in Figure 6.

为了评估所提出的 VGG 模型的性能，还实现了四个典型的卷积神经网络，即 AlexNet、GoogleNet、Resnet-34 和 VGG16。 Liu 等人提出的另一种苹果叶病识别结构，将初始结构添加到 AlexNet 框架中，也进行了比较。不同模型的识别准确率如图6所示。

It can be found that the accuracy of AlexNet and the original VGG16 is 93.11%, ResNet34 is 95.73%, and GoogleNet can reach 97.70%. When the inception structure was combined with AlexNet, the identification accuracy can be increased to 97.05%, which is higher than the original AlexNet. It can be seen that our work achieves the highest accuracy in the identification of apple leaf diseases—i.e, a 99.01% accuracy—which demonstrates the effectiveness of the proposed model. Compared to the other five models, whether in terms of precision, recall, or F1‐score, our model achieved the highest value.

可以发现AlexNet和原始VGG16的准确率为93.11%，ResNet34为95.73%，GoogleNet可以达到97.70%。当inception结构与AlexNet结合时，识别准确率可以提高到97.05%，高于原来的AlexNet。可以看出，我们的工作在识别苹果叶病害方面达到了最高的准确率——即 99.01% 的准确率——这证明了所提出模型的有效性。与其他五个模型相比，无论是在精度、召回率还是 F1 分数方面，我们的模型都取得了最高的值。

原文	译文
Table 3 shows the precision, recall, f1‐score, and accuracy of different models achieved for the four categories of apple images. The Table 4 shows that AlexNet does not learn the features of the scab well enough, and the detection effect is poor; the improved Alex + Inception model recognition is better than the original Alex; what is more, the original VGG16 network has the worst learning of each feature. For these four‐leaf types, all the networks have the best recognition rate for healthy and the lowest scab recognition rate. Regardless of the accuracy or the detection index of each leaf type, our model achieved the best results. In general, our model has the best recognition effect.	表 3 显示了针对四类苹果图像实现的不同模型的准确率、召回率、f1-score 和准确率。表4表明AlexNet对scab的特征学习得不够好，检测效果较差；改进后的 Alex + Inception 模型识别比原来的 Alex 更好；更重要的是，原始的 VGG16 网络对每个特征的学习最差。对于这些四叶类型，所有网络的健康识别率最高，痂识别率最低。无论是每种叶子类型的准确率还是检测指标，我们的模型都取得了最好的结果。总的来说，我们的模型识别效果最好。

原文

译文

Table 3 shows the precision, recall, f1‐score, and accuracy of different models achieved for the four categories of apple images. The Table 4 shows that AlexNet does not learn the features of the scab well enough, and the detection effect is poor; the improved Alex + Inception model recognition is better than the original Alex; what is more, the original VGG16 network has the worst learning of each feature. For these four‐leaf types, all the networks have the best recognition rate for healthy and the lowest scab recognition rate. Regardless of the accuracy or the detection index of each leaf type, our model achieved the best results. In general, our model has the best recognition effect.

表 3 显示了针对四类苹果图像实现的不同模型的准确率、召回率、f1-score 和准确率。表4表明AlexNet对scab的特征学习得不够好，检测效果较差；改进后的 Alex + Inception 模型识别比原来的 Alex 更好；更重要的是，原始的 VGG16 网络对每个特征的学习最差。对于这些四叶类型，所有网络的健康识别率最高，痂识别率最低。无论是每种叶子类型的准确率还是检测指标，我们的模型都取得了最好的结果。总的来说，我们的模型识别效果最好。

3.2.Convergence Rate Analysis

原文	译文
The loss values in this work are calculated by cross entropy. Figure 7 shows the accuracy and loss values of the five models during training. The experimental results show that AlexNet, ResNet-34, GoogleNet, Alex + Inception, and our convolutional neural network converge within 60 training epochs, while VGG16 converges slowly. It can be found the proposed network structure converges in 10 training epochs, which is faster than the other five CNN models. The training process of GoogleNet is similar to the process of ResNet-34, and both converge after 20 training epochs, and AlexNet and the Alex + Inception model tend to be stable after 40 epochs.	这项工作中的损失值是通过交叉熵计算的。图 7 显示了训练过程中五个模型的准确率和损失值。实验结果表明，AlexNet、ResNet-34、GoogleNet、Alex + Inception 和我们的卷积神经网络在 60 个训练时期内收敛，而 VGG16 收敛缓慢。可以发现所提出的网络结构在 10 个训练时期内收敛，这比其他五个 CNN 模型更快。 GoogleNet 的训练过程与 ResNet-34 的过程类似，都在 20 个训练 epoch 后收敛，而 AlexNet 和 Alex + Inception 模型在 40 个 epoch 后趋于稳定。

原文

译文

The loss values in this work are calculated by cross entropy. Figure 7 shows the accuracy and loss values of the five models during training. The experimental results show that AlexNet, ResNet-34, GoogleNet, Alex + Inception, and our convolutional neural network converge within 60 training epochs, while VGG16 converges slowly. It can be found the proposed network structure converges in 10 training epochs, which is faster than the other five CNN models. The training process of GoogleNet is similar to the process of ResNet-34, and both converge after 20 training epochs, and AlexNet and the Alex + Inception model tend to be stable after 40 epochs.

这项工作中的损失值是通过交叉熵计算的。图 7 显示了训练过程中五个模型的准确率和损失值。实验结果表明，AlexNet、ResNet-34、GoogleNet、Alex + Inception 和我们的卷积神经网络在 60 个训练时期内收敛，而 VGG16 收敛缓慢。可以发现所提出的网络结构在 10 个训练时期内收敛，这比其他五个 CNN 模型更快。 GoogleNet 的训练过程与 ResNet-34 的过程类似，都在 20 个训练 epoch 后收敛，而 AlexNet 和 Alex + Inception 模型在 40 个 epoch 后趋于稳定。

3.3.Training Time and Parameters

原文	译文
Table 4 shows the number of parameters for each model and training time required when the model becomes stable. It can be found that the classical VGG16 model has the most parameters and the longest training time, the Alex + Inception model has the least training parameters, and AlexNet has the shortest training time. Our improved model can reduce 119,534,592 training parameters in comparison to the original VGG16 model. The convolutional neural network proposed in this work has fewer training parameters than AlexNet, ResNet34, and VGG16. The training time of the proposed model is 692 s, which is similar to that of ResNet34 and GoogleNet.	表 4 显示了每个模型的参数数量和模型稳定时所需的训练时间。可以发现经典的VGG16模型参数最多，训练时间最长，Alex+Inception模型训练参数最少，AlexNet训练时间最短。与原始 VGG16 模型相比，我们改进的模型可以减少 119,534,592 个训练参数。这项工作中提出的卷积神经网络的训练参数比 AlexNet、ResNet34 和 VGG16 少。所提出模型的训练时间为 692 s，与 ResNet34 和 GoogleNet 的训练时间相似。

原文

译文

Table 4 shows the number of parameters for each model and training time required when the model becomes stable. It can be found that the classical VGG16 model has the most parameters and the longest training time, the Alex + Inception model has the least training parameters, and AlexNet has the shortest training time. Our improved model can reduce 119,534,592 training parameters in comparison to the original VGG16 model. The convolutional neural network proposed in this work has fewer training parameters than AlexNet, ResNet34, and VGG16. The training time of the proposed model is 692 s, which is similar to that of ResNet34 and GoogleNet.

表 4 显示了每个模型的参数数量和模型稳定时所需的训练时间。可以发现经典的VGG16模型参数最多，训练时间最长，Alex+Inception模型训练参数最少，AlexNet训练时间最短。与原始 VGG16 模型相比，我们改进的模型可以减少 119,534,592 个训练参数。这项工作中提出的卷积神经网络的训练参数比 AlexNet、ResNet34 和 VGG16 少。所提出模型的训练时间为 692 s，与 ResNet34 和 GoogleNet 的训练时间相似。

3.4.Comparison of Optimal Algorithms

原文	译文
The optimization algorithm is of great importance for the model performance. In this work, the SGD optimization algorithm in the original VGG16 is replaced by the Adam optimization algorithmto improve the converge rate. Figure 8 shows the training process of these two optimization algorithmswith same learning rate of 1 × 10−5. The results show that the model using the Adam algorithm has a faster convergence speed. It can be found that the accuracy of testing is 98.03% when the SGD algorithm is used, while that of the Adam algorithm is 99.01%. From the loss curve in Figure 8, it can be seen that the Adam algorithm can converge quickly and is more stable than SGD.	优化算法对模型性能非常重要。在这项工作中，将原始 VGG16 中的 SGD 优化算法替换为 Adam 优化算法，以提高收敛速度。图 8 显示了这两种优化算法的训练过程，具有相同的 1 × 10−5 学习率。结果表明，采用Adam算法的模型具有更快的收敛速度。可以发现，使用SGD算法测试的准确率为98.03%，而Adam算法的准确率为99.01%。从图8的损失曲线可以看出，Adam算法收敛速度快，比SGD更稳定。

原文

译文

The optimization algorithm is of great importance for the model performance. In this work, the SGD optimization algorithm in the original VGG16 is replaced by the Adam optimization algorithmto improve the converge rate. Figure 8 shows the training process of these two optimization algorithmswith same learning rate of 1 × 10−5. The results show that the model using the Adam algorithm has a faster convergence speed. It can be found that the accuracy of testing is 98.03% when the SGD algorithm is used, while that of the Adam algorithm is 99.01%. From the loss curve in Figure 8, it can be seen that the Adam algorithm can converge quickly and is more stable than SGD.

优化算法对模型性能非常重要。在这项工作中，将原始 VGG16 中的 SGD 优化算法替换为 Adam 优化算法，以提高收敛速度。图 8 显示了这两种优化算法的训练过程，具有相同的 1 × 10−5 学习率。结果表明，采用Adam算法的模型具有更快的收敛速度。可以发现，使用SGD算法测试的准确率为98.03%，而Adam算法的准确率为99.01%。从图8的损失曲线可以看出，Adam算法收敛速度快，比SGD更稳定。

3.5.Data Augmentation

原文	译文
In this work, the dataset used herein includes only 2446 pictures, which is very small in comparison to that with which the VGG16 was pre-trained. In order to evaluate the performance of the proposed method, a data augmentation strategy is adopted to amplify the original dataset and test the classification performance on it. The augmented dataset is generated based on the original dataset by image geometric transformation, color changing, and noise adding, which increase the size of the test dataset from 2141 to 21,410.	在这项工作中，这里使用的数据集仅包含 2446 张图片，与 VGG16 预训练的图片相比非常小。为了评估所提出方法的性能，采用数据增强策略来放大原始数据集并测试其分类性能。增强数据集是在原始数据集的基础上通过图像几何变换、颜色变化和噪声添加生成的，将测试数据集的大小从 2141 增加到 21410。
Image rotation and flipping are two types of image geometric transformations where only the location of each pixel is changed. Rotating the pictures at different angles and flipping can expand the diversity of directions. It is generally difficult to capture each picture from different directions, and therefore to simulate this situation to eliminate the effect of direction on picture recognition, we rotated the original image around the center point by 90, 180, and 270 and when flipped horizontally. As shown in Figure 9, after rotation and flipping, the number of pictures increased by 4 times the original data set.	图像旋转和翻转是两种类型的图像几何变换，其中仅更改每个像素的位置。以不同的角度旋转图片和翻转可以扩展方向的多样性。通常很难从不同方向捕捉每张图片，因此为了模拟这种情况以消除方向对图片识别的影响，我们将原始图像围绕中心点旋转 90、180 和 270，并在水平翻转时进行。如图 9 所示，经过旋转和翻转后，图片数量增加了原始数据集的 4 倍。

原文

译文

In this work, the dataset used herein includes only 2446 pictures, which is very small in comparison to that with which the VGG16 was pre-trained. In order to evaluate the performance of the proposed method, a data augmentation strategy is adopted to amplify the original dataset and test the classification performance on it. The augmented dataset is generated based on the original dataset by image geometric transformation, color changing, and noise adding, which increase the size of the test dataset from 2141 to 21,410.

在这项工作中，这里使用的数据集仅包含 2446 张图片，与 VGG16 预训练的图片相比非常小。为了评估所提出方法的性能，采用数据增强策略来放大原始数据集并测试其分类性能。增强数据集是在原始数据集的基础上通过图像几何变换、颜色变化和噪声添加生成的，将测试数据集的大小从 2141 增加到 21410。

Image rotation and flipping are two types of image geometric transformations where only the location of each pixel is changed. Rotating the pictures at different angles and flipping can expand the diversity of directions. It is generally difficult to capture each picture from different directions, and therefore to simulate this situation to eliminate the effect of direction on picture recognition, we rotated the original image around the center point by 90, 180, and 270 and when flipped horizontally. As shown in Figure 9, after rotation and flipping, the number of pictures increased by 4 times the original data set.

图像旋转和翻转是两种类型的图像几何变换，其中仅更改每个像素的位置。以不同的角度旋转图片和翻转可以扩展方向的多样性。通常很难从不同方向捕捉每张图片，因此为了模拟这种情况以消除方向对图片识别的影响，我们将原始图像围绕中心点旋转 90、180 和 270，并在水平翻转时进行。如图 9 所示，经过旋转和翻转后，图片数量增加了原始数据集的 4 倍。

原文	译文
Adjusting the brightness, contrast, and hue of the image is another common image augmentation method widely used in image processing. During the process of image acquisition, pictures may be affected by different weather and exposed to different intensities of light, which possibly affects the experimental results. In order to simulate image collection under different light backgrounds, we adjusted the brightness and contrast, as shown in Figure 10, and the data was expanded by 4 times.	调整图像的亮度、对比度和色调是图像处理中广泛使用的另一种常见的图像增强方法。在图像采集过程中，图片可能会受到不同天气的影响，暴露在不同强度的光线下，这可能会影响实验结果。为了模拟不同光线背景下的图像采集，我们调整了亮度和对比度，如图10所示，数据扩大了4倍。

原文

译文

Adjusting the brightness, contrast, and hue of the image is another common image augmentation method widely used in image processing. During the process of image acquisition, pictures may be affected by different weather and exposed to different intensities of light, which possibly affects the experimental results. In order to simulate image collection under different light backgrounds, we adjusted the brightness and contrast, as shown in Figure 10, and the data was expanded by 4 times.

调整图像的亮度、对比度和色调是图像处理中广泛使用的另一种常见的图像增强方法。在图像采集过程中，图片可能会受到不同天气的影响，暴露在不同强度的光线下，这可能会影响实验结果。为了模拟不同光线背景下的图像采集，我们调整了亮度和对比度，如图10所示，数据扩大了4倍。

原文	译文
In the same experimental setup, the model we proposed is trained on the augmented 21,410 images and the final classification accuracy can reach 99.34%. When we used the original dataset to train the model, the accuracy rate can also reach 99.01%. Figure 12 shows the recognition accuracy. It can be seen that after the data expansion, all the measures have been slightly improved on the model proposed.	在相同的实验设置中，我们提出的模型在增强的 21,410 张图像上进行训练，最终分类准确率可以达到 99.34%。当我们使用原始数据集训练模型时，准确率也可以达到 99.01%。图 12 显示了识别准确率。可以看出，经过数据扩展后，所有的措施都在所提出的模型上略有改进。

原文

译文

In the same experimental setup, the model we proposed is trained on the augmented 21,410 images and the final classification accuracy can reach 99.34%. When we used the original dataset to train the model, the accuracy rate can also reach 99.01%. Figure 12 shows the recognition accuracy. It can be seen that after the data expansion, all the measures have been slightly improved on the model proposed.

在相同的实验设置中，我们提出的模型在增强的 21,410 张图像上进行训练，最终分类准确率可以达到 99.34%。当我们使用原始数据集训练模型时，准确率也可以达到 99.01%。图 12 显示了识别准确率。可以看出，经过数据扩展后，所有的措施都在所提出的模型上略有改进。

4.Conclusions

原文	译文
An improved convolution neural network model based on VGG16 is proposed in this work. The classifier of classical VGG16 network is modified by adding a batch normalization layer, a global average pooling layer, and a fully connected layer to accelerate convergence and reduce training parameters. The proposed model trains on 2141 apple leaves in the training set to identify apple leaf diseases. The experimental results show that the accuracy of the model test can reach 99.01% after 692 s training. Compared with the classical VGG16 network, the model parameters are reduced by 119,534,592, and the accuracy is improved by 6.3%.	本文提出了一种基于 VGG16 的改进卷积神经网络模型。经典 VGG16 网络的分类器通过添加批量归一化层、全局平均池化层和全连接层进行修改，以加速收敛并减少训练参数。提出的模型在训练集中的 2141 片苹果叶上进行训练，以识别苹果叶病。实验结果表明，经过692 s的训练，模型测试的准确率可以达到99.01%。与经典的VGG16网络相比，模型参数减少了119,534,592，准确率提高了6.3%。
Although the training time is longer than that of AlexNet and ResNet, our model has fewer parameters and a higher accuracy. Compared with GoogleNet and Alex + Inception, some parameters and training time are sacrificed, but our model has the highest accuracy of up to 99.01%. After data expansion, the accuracy of the model can be increased to 99.34%. The convolution neural network proposed in this work can identify apple leaf diseases quickly and accurately and provides a feasible scheme for identifying apple leaf diseases.	虽然训练时间比 AlexNet 和 ResNet 长，但我们的模型参数更少，准确率更高。与 GoogleNet 和 Alex + Inception 相比，牺牲了一些参数和训练时间，但我们的模型准确率最高，高达 99.01%。数据扩展后，模型的准确率可以提高到99.34%。本文提出的卷积神经网络可以快速准确地识别苹果叶片病害，为识别苹果叶片病害提供了可行的方案。
In the future, our work can be improved in the following aspects: (1) collecting more kinds and quantities of apple disease pictures to enrich the datasets to train better models, (2) trying other deep convolution neural networks to improve the accuracy and speed of recognition, (3) trying to run other deep learning methods and apply them to the real‐time detection of apple disease.	未来，我们的工作可以在以下几个方面进行改进：（1）收集更多种类和数量的苹果病害图片以丰富数据集以训练更好的模型，（2）尝试其他深度卷积神经网络以提高准确性和速度 (3) 尝试运行其他深度学习方法并将其应用于苹果病害的实时检测。

你可能感兴趣的:(Paper,#,Paper研读_图像分类,计算机视觉,人工智能,深度学习,神经网络,图像处理)

Python助力自动驾驶：深度学习模型优化全攻略 Echo_Wish Python！实战！python 自动驾驶深度学习
Python助力自动驾驶：深度学习模型优化全攻略说起自动驾驶，大家第一反应往往是“高精地图”“传感器融合”“路径规划”等等，背后真正的“大脑”其实是各式各样的深度学习模型。它们负责感知环境、识别路况、预测行为，甚至实时做出决策。可是，跑在车上的这些模型不仅要精准，还得轻量、实时、稳定，这可不是简单的“丢GPU就能解决”的问题。今天，咱们就从Python开发者的视角，聊聊自动驾驶里深度学习模型的优化
TensorFlow：开启智能时代的引擎科技林总 DeepSeek学AI 人工智能
想象一下，计算机能看懂病历、汽车能自动驾驶、机器能创作艺术——这一切的核心，正是深度学习的力量。而推动这场革命的引擎之一，就是今天的主角：**TensorFlow**。---###**一、背景：为什么需要TensorFlow？1.**深度学习的爆发**-传统编程无法解决图像识别、自然语言处理等复杂问题。-神经网络需要高效工具处理海量数据和计算。2.**Google的答案**-2015年开源Tens
搜索领域知识图谱的知识推理算法研究搜索引擎技术知识图谱算法人工智能 ai
搜索领域知识图谱的知识推理算法研究关键词：知识图谱、知识推理、搜索算法、图神经网络、路径推理、规则推理、表示学习摘要：本文深入探讨搜索领域中知识图谱的知识推理算法。我们将从知识图谱的基本概念出发，分析不同类型的知识推理算法原理，包括基于规则的推理、基于表示的推理和基于路径的推理。通过实际案例和代码实现，展示这些算法如何提升搜索效果，最后讨论该领域的未来发展趋势和挑战。背景介绍目的和范围本文旨在系统
深度剖析AI人工智能在自动驾驶中的系统优化 AI云原生与云计算技术学院人工智能自动驾驶机器学习 ai
深度剖析AI人工智能在自动驾驶中的系统优化关键词：AI人工智能、自动驾驶、系统优化、传感器融合、决策算法摘要：本文深入探讨了AI人工智能在自动驾驶系统中的优化问题。从自动驾驶的背景入手，详细解释了相关核心概念，如传感器、决策算法等。阐述了这些核心概念之间的关系，介绍了核心算法原理和具体操作步骤，还通过数学模型和公式进行了理论支持。给出了项目实战案例，分析了实际应用场景，推荐了相关工具和资源，最后探
AI教父Hinton：别太相信科技领袖们的公开说辞，他们私下对AI的看法会让你不安 | 不摸鱼的独立开发者日报（第36期）不摸鱼_ 不摸鱼的独立开发者日报人工智能科技产品经理 microsoft 个人开发游戏
✍️说明日报相关信息：网站：https://daily.nomoyu.com/RSS：https://daily.nomoyu.com/rss/rss.xml欢迎一起沟通交流AI教父Hinton：别太相信科技领袖们的公开说辞，他们私下对AI的看法会让你不安“人工智能教父”GeoffreyHinton在访谈中表示，他对自己毕生的工作成果表示深切忧虑，并致力于警告世界AI带来的巨大风险，他的主要观点如
22种创新思路！今年必将是特征选择爆发的一年小唯啊小唯人工智能注意力机制特征选择
2025深度学习发论文&模型涨点之——特征选择特征选择是机器学习和数据挖掘领域中一个非常重要的步骤。它指的是从原始特征集合中挑选出对目标变量有较强预测能力的特征子集。在实际的数据集中，往往包含众多特征，但并非所有特征都对模型的性能有正面影响。例如在房价预测任务中，原始特征可能包括房屋的面积、房间数量、所在小区、周边配套设施等众多内容。通过特征选择，可以剔除一些无关的或者冗余的特征，比如可能存在的重
openai-go v1.6.0版本详解：新增功能与优化全面解析福大大架构师每日一题文心一言vschatgpt golang easyui 开发语言
一、前言openai-go作为OpenAI官方提供的Go语言客户端库，一直备受广大Go语言开发者关注和喜爱。随着人工智能技术的飞速发展，openai-go的迭代速度也在不断加快。最近，openai-go发布了v1.6.0版本，该版本带来了多项新功能和优化，进一步提升了API的灵活性和开发者体验。本文将基于官方发布的完整更新日志，深入解析v1.6.0版本的新增功能、改进细节及实际应用，帮助读者全面掌
Deepseek：多轮对话与上下文拼接 chilavert318 熬之滴水穿石 ai
今天的内容，应该很好理解。我们先从场景切入来理解。首先，你回想一下，有没有遇到过这样的情况：和朋友聊天时，聊了一会儿，突然朋友说起之前的某个话题，你却有点反应不过来，得努力回忆之前说了啥。人工智能之所以“智能”，因为它就不可能这么健忘。在和Deepseek聊天，在多轮对话中，Deepseek就像一个记忆力超强的小伙伴，能清楚记得你们聊过的每一个重要细节，让对话一直顺顺畅畅。这背后呀，藏着Deeps
【深度学习|学习笔记】什么是正则化？如何理解正则化？L0、L1、L2正则化的起源、发展、原理、应用和对比详解，附代码。努力毕业的小土博^_^ 深度学习学习笔记深度学习学习笔记人工智能机器学习
【深度学习|学习笔记】什么是正则化？如何理解正则化？L0、L1、L2正则化的起源、发展、原理、应用和对比详解，附代码。【深度学习|学习笔记】什么是正则化？如何理解正则化？L0、L1、L2正则化的起源、发展、原理、应用和对比详解，附代码。文章目录【深度学习|学习笔记】什么是正则化？如何理解正则化？L0、L1、L2正则化的起源、发展、原理、应用和对比详解，附代码。前言一、什么是正则化？为什么需要它？✅
OpenCV实战：图像颜色识别与提取、掩膜制作
前言在计算机视觉和图像处理领域，颜色识别是一项基础而重要的技术。无论是交通标志识别、工业分拣还是美颜滤镜开发，都离不开对特定颜色的处理。本文将带你全面掌握使用OpenCV进行颜色识别的关键技术，包含完整的代码实现和原理讲解。一、颜色空间基础1.1RGB颜色空间在图像处理中，最常见的就是RGB颜色空间。RGB颜色空间是我们接触最多的颜色空间，是一种用于表示和显示彩色图像的一种颜色模型。RGB代表红色
OpenCV图像添加水印
一、前言在数字图像处理中，为图片添加水印是一项常见且重要的技术。无论是版权保护、品牌宣传还是防止未经授权的使用，水印都能发挥重要作用。OpenCV作为一款强大的计算机视觉库，提供了丰富的功能来实现各种水印效果。本教程将详细介绍如何使用OpenCV为图像添加文字水印和图片水印。二、环境准备在开始之前，请确保已安装以下环境：Python3.xOpenCV库（可通过pipinstallopencv-py
MCP 与 AI 任务分解：如何让 AI 高效执行复杂任务？ Echo_Wish Python 进阶人工智能
MCP与AI任务分解：如何让AI高效执行复杂任务？在人工智能应用中，任务分解（TaskDecomposition）是一个绕不开的话题。无论是自动驾驶、智能客服，还是代码生成，AI都需要将复杂问题拆解成可执行的小任务，逐步完成目标。而在AI领域，MCP（Multi-StepCognitiveProcessing，多步认知处理）是一种前沿技术，旨在提升AI的任务分解能力，使其能够更精准、高效地执行复杂
OpenCV图像噪点消除五大滤波方法慕婉0307 opencv基础 opencv 人工智能计算机视觉
在数字图像处理中，噪点消除是提高图像质量的关键步骤。本文将基于OpenCV库，详细讲解五种经典的图像去噪滤波方法：均值滤波、方框滤波、高斯滤波、中值滤波和双边滤波，并通过丰富的代码示例展示它们的实际应用效果。一、图像噪点与滤波基础1.1常见图像噪声类型高斯噪声：符合正态分布的随机噪声椒盐噪声：随机出现的黑白像素点泊松噪声：光子计数噪声量化噪声：模拟信号数字化过程中产生1.2滤波方法分类滤波类型特点
AIGC领域Prompt工程：原理、方法与行业应用 AI天才研究院 ChatGPT 计算 AI大模型应用入门实战与进阶 AIGC prompt ai
AIGC领域Prompt工程：原理、方法与行业应用关键词：Prompt工程、大语言模型（LLM）、提示设计、少样本学习、AIGC应用、思维链（CoT）、提示优化摘要：随着AIGC（人工智能生成内容）技术的爆发式发展，大语言模型（如GPT-4、LLaMA、通义千问）的性能已达到前所未有的高度。然而，模型的强大能力能否被充分释放，很大程度上依赖于"提示（Prompt）"的设计质量。本文系统解析Prom
大语言模型中的思维链提示：解锁高效互动的秘密 t0_54program 大数据与人工智能语言模型人工智能自然语言处理个人开发
在当今的人工智能领域，大语言模型（LLMs）已然成为一颗耀眼的明星，它经过海量训练，能够理解并生成人类语言，在编程等诸多领域助力人们完成日常任务。然而，若想与这些模型实现高效沟通，掌握正确的请求方式至关重要，而思维链提示（Chainofthoughtprompting）便是与LLMs互动时最为高效的技术之一。什么是提示（Prompting）？LLMs基于海量数据集进行训练，以理解并生成类人文本。其
数据标注师学习内容汇总试着数据标注师学习数据标注师
目录文本标注图像标注语音标注文本标注词性标注1词性标注2实体标注关系标注事件标注1事件标注2意图标注关键词标注分类标注问答标注对话标注图像标注拉框标注关键点标注2D标注3D标注线标注目标跟踪标注OCR标注图像分类标注语音标注语音切割转写语音校对标注拼音和停顿标注
人工智能大模型原理与应用实战：大模型在金融风控中的应用 AI天才研究院 LLM大模型落地实战指南大数据人工智能语言模型 AI LLM Java Python 架构设计 Agent RPA
文章目录人工智能大模型原理与应用实战：大模型在金融风控中的应用01.背景介绍1.1金融风控的挑战1.2大模型的优势2.核心概念与联系2.1大模型在金融风控中的应用场景2.2大模型与传统风控技术的结合3.核心算法原理具体操作步骤3.1基于大模型的欺诈检测3.2基于大模型的信用评估4.数学模型和公式详细讲解举例说明4.1逻辑回归模型4.2XGBoost模型5.项目实践：代码实例和详细解释说明5.1基于
庙算兵棋推演AI开发初探（7-神经网络训练与评估概述）超自然祈祷智能决策人工智能神经网络深度学习
前面我们提取了特征做了数据集、设计并实现了处理数据集的神经网络，接下来我们需要训练神经网络了，就是把数据对接好灌进去，训练后查看预测的和实际的结果是否一致——也就是训练与评估。数据解析提取数据编码为数据集设计神经网络-->>神经网络训练与评估神经网络一个重要指标是收敛，就是用可以逼近任意函数的神经网络是否可以逼近你数据集中隐含的模式。再重复一遍【特征工程】与【神经网络】的区别：前者就像人发现了牛顿
浅谈卷积神经网络(CNN) cyc&阿灿 cnn 人工智能神经网络
卷积神经网络(ConvolutionalNeuralNetworks,CNN)作为深度学习领域最具影响力的架构之一，已在计算机视觉、自然语言处理、医学影像分析等领域取得了革命性突破。本文将系统全面地剖析CNN的核心原理、关键组件、经典模型、数学基础、训练技巧以及最新进展，通过理论解析与代码实践相结合的方式，帮助读者深入掌握这一重要技术。一、CNN基础与核心思想1.1传统神经网络的局限性在处理图像等
AlphaStar 星际首秀，人工智能走向星辰大海谷歌开发者
文/王晶，资深工程师，GoogleBrain团队作者王晶，现为GoogleBrain团队的资深工程师，主要致力深度强化学习的研发，和DeepMind团队在强化学习的应用上有许多合作。北京时间1月25日凌晨2点，DeepMind直播了他们的AIAlphaStar和人类顶尖的职业电竞选手对战星际争霸2。根据DeepMind介绍，AlphaStar在2018年12月10日和19日先后以5：0全胜的战绩击
**双生“基尼”**：跨越世纪的术语撞车与学科分野
在学术的宇宙中，“基尼”（Gini）这个名字如同一个奇特的星标，闪耀在两个看似毫不相关的领域：衡量社会贫富差距的经济学与驱动人工智能的机器学习。然而，当人们在这两个领域都遇到“基尼指数”或“基尼系数”时，困惑油然而生——它们为何如此不同？又为何共享同一个名字？这不是某个“傻逼”的随意命名，而是一场跨越学科与世纪的“术语交通事故”，其背后是学术传承与概念抽象的交织。本文由「大千AI助手」原创发布，专
【第二章:机器学习与神经网络概述】03.类算法理论与实践-(3)决策树分类器 IT古董人工智能课程机器学习算法神经网络
第二章:机器学习与神经网络概述第三部分：类算法理论与实践第三节：决策树分类器内容：信息增益、剪枝技术、过拟合与泛化能力。决策树是一种常用于分类和回归的树状结构模型，它通过一系列特征判断进行决策，有良好的可解释性。一、基本概念节点（Node）：表示特征判断条件边（Branch）：表示特征判断的结果路径叶子节点（Leaf）：表示分类结果二、划分准则：信息增益（InformationGain）信息增益衡
第 3 章：神经网络如何学习鱼摆摆拜拜神经网络学习人工智能
第3章：神经网络如何学习在第二章中，我们详细了解了神经网络的静态结构：由神经元组成的层，以及连接它们的权重和偏置。现在，我们将进入整个教程最核心的部分：神经网络是如何从数据中"学习"的？这个学习过程是一个动态的、不断调整自身参数以求更佳预测的过程。我们将通过四个关键概念来揭示这个秘密：前向传播(ForwardPropagation)：数据如何通过网络产生一个预测？损失函数(LossFunction
AI算力综述和资料整理木鱼时刻人工智能
目录总体介绍计算精度传输协议GPU池化资源调度CUDA技术GPU硬件参考链接总体介绍AI算力是人工智能系统的核心基础设施，涵盖了从计算精度、传输协议到硬件架构的完整技术栈。计算精度混合精度训练原生满血版DeepSeek671B是FP8精度。FP16在训练计算力占比有80-90%，FP32占比10%-20%。大模型训练中通常会用到FP16（半精度浮点数），但并不是只使用FP16，而是采用**混合精度
【PyTorch】2024保姆级安装教程-Python-（CPU+GPU详细完整版）金枝玉叶9 程序员知识储备1 程序员知识储备2 程序员知识储备3 python pytorch 人工智能
【PyTorch】2024保姆级安装教程（CPU+GPU详细完整版）PyTorch是当前最受欢迎的深度学习框架之一。本文将详细讲解在Python环境中安装PyTorch，包括CPU和GPU版本的全方位指南。一、前置环境首先确保已安装Python环境，推荐使用Python3.8或以上版本。验证Python安装：python--versionpip--version推荐使用虚拟环境（如conda或ve
LSNet: 基于侧向抑制的神经网络碳酸的唐模型养成与叙述有意思的py库神经网络人工智能深度学习
引言在计算机视觉领域，我们一直在寻找灵感来源以提高图像处理和识别的效果。而人类视觉系统作为经过数百万年进化的精密系统，无疑是最好的参考对象之一。今天，我要向大家介绍一个名为LSNet（LateralSuppressionNetwork，侧向抑制网络）的技术，它模拟了人类视觉系统中的侧向抑制机制，为计算机视觉任务带来了新的可能性。什么是侧向抑制？侧向抑制（LateralSuppression），也被
【学习】《算法图解》第七章学习笔记：树程序员
前言在前面的章节中，我们学习了数组、链表、散列表等基本数据结构，以及一些基础算法。本章将介绍一种非常重要的数据结构——树(Tree)，特别是二叉搜索树(BinarySearchTree)。树结构在计算机科学中应用广泛，从文件系统到数据库再到人工智能，都能看到树的身影。《算法图解》第七章深入浅出地介绍了树的基本概念、实现和应用，帮助读者理解这一关键数据结构。一、树的基本概念（一）什么是树树是一种分层
基于OpenCV图像分割与PyTorch的增强图像分类方案从零开始学习人工智能 opencv pytorch 分类
在图像分类任务中，背景噪声和复杂场景常常会对分类准确率产生负面影响。为了应对这一挑战，本文介绍了一种结合OpenCV图像分割与PyTorch深度学习框架的增强图像分类方案。通过先对图像进行分割提取感兴趣区域（RegionofInterest，ROI），再进行分类，可以有效减少背景干扰，突出关键特征，从而提高分类准确率。该方案在多种复杂场景下表现出色，尤其适用于图像背景复杂或包含多个对象的情况。一、
智能体综述和参考资料整理木鱼时刻大模型人工智能
目录总体介绍核心组件记忆系统工具系统计划与推理开发框架Single-AgentMulti-Agent智能体平台技术实现通信协议角色系统对话记忆MCP协议参考链接总体介绍智能体（AIAgents）是人工智能领域的重要发展方向，它们能够通过传感器感知环境并通过执行器对环境采取行动。根据罗素和诺维格在《人工智能：一种现代方法》（2016年）中的定义，AIAgent是任何可以通过传感器感知其环境并通过执行
主流AI代码编程工具分享 scuter_yu ai ai编程
在当今数字化时代，AI代码编程工具已成为提升开发效率、优化代码质量的重要助手。这些工具利用人工智能技术，为开发者提供从代码生成、补全到调试、优化等一系列功能，极大地简化了编程流程，让编程变得更加高效、便捷和智能。以下将介绍几款热门的AI代码编程工具。通义灵码产品介绍：通义灵码是阿里云出品的基于通义大模型的智能编程辅助工具，提供行级/函数级实时续写、自然语言生成代码、单元测试生成、代码优化、注释生成
eclipse maven IXHONG eclipse
eclipse中使用maven插件的时候，运行run as maven build的时候报错 -Dmaven.multiModuleProjectDirectory system propery is not set. Check $M2_HOME environment variable and mvn script match. 可以设一个环境变量M2_HOME指
timer cancel方法的一个小实例 alleni123 多线程 timer
package com.lj.timer; import java.util.Date; import java.util.Timer; import java.util.TimerTask; public class MyTimer extends TimerTask { private int a; private Timer timer; pub
MySQL数据库在Linux下的安装 ducklsl mysql
1.建好一个专门放置MySQL的目录 /mysql/db数据库目录 /mysql/data数据库数据文件目录 2.配置用户，添加专门的MySQL管理用户 >groupadd mysql ----添加用户组 >useradd -g mysql mysql ----在mysql用户组中添加一个mysql用户 3.配置，生成并安装MySQL >cmake -D
spring------>>cvc-elt.1: Cannot find the declaration of element Array_06 spring bean
将-------- <?xml version="1.0" encoding="UTF-8"?> <beans xmlns="http://www.springframework.org/schema/beans" xmlns:xsi="http://www.w3
maven发布第三方jar的一些问题 cugfy maven
maven中发布第三方jar到nexus仓库使用的是 deploy:deploy-file命令有许多参数，具体可查看 http://maven.apache.org/plugins/maven-deploy-plugin/deploy-file-mojo.html 以下是一个例子： mvn deploy:deploy-file -DgroupId=xpp3
MYSQL下载及安装 357029540 mysql
好久没有去安装过MYSQL，今天自己在安装完MYSQL过后用navicat for mysql去厕测试链接的时候出现了10061的问题，因为的的MYSQL是最新版本为5.6.24，所以下载的文件夹里没有my.ini文件，所以在网上找了很多方法还是没有找到怎么解决问题，最后看到了一篇百度经验里有这个的介绍，按照其步骤也完成了安装，在这里给大家分享下这个链接的地址
ios TableView cell的布局张亚雄 tableview
cell.imageView.image = [UIImage imageNamed:[imageArray objectAtIndex:[indexPath row]]]; CGSize itemSize = CGSizeMake(60, 50); &nbs
Java编码转义 adminjun java 编码转义
import java.io.UnsupportedEncodingException; /** * 转换字符串的编码 */ public class ChangeCharset { /** 7位ASCII字符，也叫作ISO646-US、Unicode字符集的基本拉丁块 */ public static final Strin
Tomcat 配置和spring aijuans spring
简介 Tomcat启动时，先找系统变量CATALINA_BASE，如果没有，则找CATALINA_HOME。然后找这个变量所指的目录下的conf文件夹，从中读取配置文件。最重要的配置文件：server.xml 。要配置tomcat，基本上了解server.xml，context.xml和web.xml。 Server.xml -- tomcat主
Java打印当前目录下的所有子目录和文件 ayaoxinchao 递归 File
其实这个没啥技术含量，大湿们不要操笑哦，只是做一个简单的记录，简单用了一下递归算法。 import java.io.File; /** * @author Perlin * @date 2014-6-30 */ public class PrintDirectory { public static void printDirectory(File f
linux安装mysql出现libs报冲突解决 BigBird2012 linux
linux安装mysql出现libs报冲突解决安装mysql出现 file /usr/share/mysql/ukrainian/errmsg.sys from install of MySQL-server-5.5.33-1.linux2.6.i386 conflicts with file from package mysql-libs-5.1.61-4.el6.i686
jedis连接池使用实例 bijian1013 redis jedis连接池 jedis
实例代码： package com.bijian.study; import java.util.ArrayList; import java.util.List; import redis.clients.jedis.Jedis; import redis.clients.jedis.JedisPool; import redis.clients.jedis.JedisPoo
关于朋友 bingyingao 朋友兴趣爱好维持
成为朋友的必要条件：志相同，道不合，可以成为朋友。譬如马云、周星驰一个是商人，一个是影星，可谓道不同，但都很有梦想，都要在各自领域里做到最好，当他们遇到一起，互相欣赏，可以畅谈两个小时。志不同，道相合，也可以成为朋友。譬如有时候看到两个一个成绩很好每次考试争做第一，一个成绩很差的同学是好朋友。他们志向不相同，但他
【Spark七十九】Spark RDD API一 bit1129 spark
aggregate package spark.examples.rddapi import org.apache.spark.{SparkConf, SparkContext} //测试RDD的aggregate方法 object AggregateTest { def main(args: Array[String]) { val conf = new Spar
ktap 0.1 released bookjovi kernel tracing
Dear, I'm pleased to announce that ktap release v0.1, this is the first official release of ktap project, it is expected that this release is not fully functional or very stable and we welcome bu
能保存Properties文件注释的Properties工具类 BrokenDreams properties
今天遇到一个小需求：由于java.util.Properties读取属性文件时会忽略注释，当写回去的时候，注释都没了。恰好一个项目中的配置文件会在部署后被某个Java程序修改一下，但修改了之后注释全没了，可能会给以后的参数调整带来困难。所以要解决这个问题。 &nb
读《研磨设计模式》-代码笔记-外观模式-Facade bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ /* * 百度百科的定义： * Facade（外观）模式为子系统中的各类（或结构与方法）提供一个简明一致的界面， * 隐藏子系统的复杂性，使子系统更加容易使用。他是为子系统中的一组接口所提供的一个一致的界面 * * 可简单地
After Effects教程收集 cherishLC After Effects
1、中文入门 http://study.163.com/course/courseMain.htm?courseId=730009 2、videocopilot英文入门教程（中文字幕） http://www.youku.com/playlist_show/id_17893193.html 英文原址： http://www.videocopilot.net/basic/ 素
Linux Apache 安装过程 crabdave apache
Linux Apache 安装过程下载新版本： apr-1.4.2.tar.gz（下载网站：http://apr.apache.org/download.cgi） apr-util-1.3.9.tar.gz（下载网站：http://apr.apache.org/download.cgi） httpd-2.2.15.tar.gz（下载网站：http://httpd.apac
Shell学习之变量赋值和引用 daizj shell 变量引用赋值
本文转自：http://www.cnblogs.com/papam/articles/1548679.html Shell编程中，使用变量无需事先声明，同时变量名的命名须遵循如下规则：首个字符必须为字母（a-z，A-Z）中间不能有空格，可以使用下划线（_）不能使用标点符号不能使用bash里的关键字（可用help命令查看保留关键字）需要给变量赋值时，可以这么写：
Java SE 第一讲（Java SE入门、JDK的下载与安装、第一个Java程序、Java程序的编译与执行） dcj3sjt126com java jdk
Java SE 第一讲： Java SE：Java Standard Edition Java ME: Java Mobile Edition Java EE：Java Enterprise Edition Java是由Sun公司推出的（今年初被Oracle公司收购）。收购价格：74亿美金 J2SE、J2ME、J2EE JDK：Java Development
YII给用户登录加上验证码 dcj3sjt126com yii
1、在SiteController中添加如下代码： /** * Declares class-based actions. */ public function actions() { return array( // captcha action renders the CAPTCHA image displ
Lucene使用说明 dyy_gusi Lucene search 分词器
Lucene使用说明 1、lucene简介 1.1、什么是lucene Lucene是一个全文搜索框架，而不是应用产品。因此它并不像baidu或者googleDesktop那种拿来就能用，它只是提供了一种工具让你能实现这些产品和功能。 1.2、lucene能做什么要回答这个问题，先要了解lucene的本质。实际
学习编程并不难,做到以下几点即可! gcq511120594 数据结构编程算法
不论你是想自己设计游戏，还是开发iPhone或安卓手机上的应用，还是仅仅为了娱乐，学习编程语言都是一条必经之路。编程语言种类繁多，用途各异，然而一旦掌握其中之一，其他的也就迎刃而解。作为初学者，你可能要先从Java或HTML开始学，一旦掌握了一门编程语言，你就发挥无穷的想象，开发各种神奇的软件啦。 1、确定目标学习编程语言既充满乐趣，又充满挑战。有些花费多年时间学习一门编程语言的大学生到
Java面试十问之三：Java与C++内存回收机制的差别 HNUlanwei java C++finalize()堆栈内存回收
大家知道， Java 除了那 8 种基本类型以外，其他都是对象类型（又称为引用类型）的数据。 JVM 会把程序创建的对象存放在堆空间中，那什么又是堆空间呢？其实，堆（ Heap）是一个运行时的数据存储区，从它可以分配大小各异的空间。一般，运行时的数据存储区有堆（ Heap）和堆栈（ Stack），所以要先看它们里面可以分配哪些类型的对象实体，然后才知道如何均衡使用这两种存储区。一般来说，栈中存放的
第二章 Nginx+Lua开发入门 jinnianshilongnian nginx lua
Nginx入门本文目的是学习Nginx+Lua开发，对于Nginx基本知识可以参考如下文章： nginx启动、关闭、重启 http://www.cnblogs.com/derekchen/archive/2011/02/17/1957209.html agentzh 的 Nginx 教程 http://openresty.org/download/agentzh-nginx-tutor
MongoDB windows安装基本命令 liyonghui160com
windows安装安装目录： D:\MongoDB\ 新建目录 D:\MongoDB\data\db 4.启动进城： cd D:\MongoDB\bin mongod -dbpath D:\MongoDB\data\db &n
Linux下通过源码编译安装程序 pda158 linux
一、程序的组成部分　　Linux下程序大都是由以下几部分组成：　　二进制文件：也就是可以运行的程序文件　　库文件：就是通常我们见到的lib目录下的文件　　配置文件：这个不必多说，都知道　　帮助文档：通常是我们在linux下用man命令查看的命令的文档　　二、linux下程序的存放目录　　linux程序的存放目录大致有三个地方：　　/etc, /b
WEB开发编程的职业生涯４个阶段 shw3588 编程 Web 工作生活
觉得自己什么都会 2007年从学校毕业，凭借自己原创的ASP毕业设计，以为自己很厉害似的，信心满满去东莞找工作，找面试成功率确实很高，只是工资不高，但依旧无法磨灭那过分的自信，那时候什么考勤系统、什么OA系统、什么ERP，什么都觉得有信心，这样的生涯大概持续了约一年。根本不是自己想的那样 2008年开始接触很多工作相关的东西，发现太多东西自己根本不会，都需要去学，不管是asp还是js，
遭遇jsonp同域下变作post请求的坑 vb2005xu jsonp 同域post
今天迁移一个站点时遇到一个坑爹问题,同一个jsonp接口在跨域时都能调用成功,但是在同域下调用虽然成功,但是数据却有问题. 此处贴出我的后端代码片段 $mi_id = htmlspecialchars(trim($_GET['mi_id '])); $mi_cv = htmlspecialchars(trim($_GET['mi_cv '])); 贴出我前端代码片段: $.aj