1) OS: Ubuntu 16.04
2) GPU: TITAN Xp
3) CUDA: 9.1
4) cuDNN: 7.5
5) RAM: 32 GB
6) PyCharm
7) Anaconda
This article walks through training YOLOv3 on your own dataset with the darknet framework on Linux. Download the darknet files here: https://download.csdn.net/download/qq_41900772/11340101 . The structure of the downloaded files is shown in the figure below:
Training mainly uses the cfg and data folders, plus the dataset built later in this document.
What each folder in the download holds:
1) cfg: the network configuration files needed for training
2) data: the test images used for inference and the class-name files used for training (e.g. voc.names, coco.names)
3) examples: evaluation-related functions and interfaces that may be needed
4) python: the Python bindings for the network
5) Makefile: the most important configuration file for building and training
Create a folder named VOC in the darknet root directory and build the dataset inside it.
Dataset layout:
1. VOCdevkit                # root directory
1.1 VOC2019                 # per-year dataset; the year in the name can be changed
1.1.1 Annotations           # xml files, one per image in JPEGImages, describing the image's contents
1.1.2 ImageSets             # txt files; each line holds one image name, optionally followed by ±1 for positive/negative samples
1.1.2.1 Action              # (not used during training)
1.1.2.2 Layout              # (not used during training)
1.1.2.3 Main
1.1.2.4 Segmentation        # (not used during training)
1.1.3 JPEGImages            # source images
1.1.4 SegmentationClass     # images for semantic segmentation (not used during training)
1.1.5 SegmentationObject    # images for instance segmentation (not used during training)
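As a convenience, the whole tree above can be created in one go from the darknet root directory (a sketch; adjust the folder names to your setup):
mkdir -p VOC/VOCdevkit/VOC2019/{Annotations,JPEGImages,SegmentationClass,SegmentationObject}
mkdir -p VOC/VOCdevkit/VOC2019/ImageSets/{Action,Layout,Main,Segmentation}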
Building the dataset:
1) Create the directory tree above
2) Rename all the images into a consistent sequential numbering
3) Put the training images into the JPEGImages folder
4) Annotate the training images with the labelImg tool, generating one xml file per image
5) Put the generated xml files into the Annotations folder (note: images and xml files must correspond one-to-one)
6) Generate train.txt and test.txt in the Main folder, each listing the names of the training/test images (see the split sketch below)
7) Convert the xml files into txt-format label files
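A minimal sketch for step 6 (the paths and the 70/30 split ratio are assumptions; adjust to your layout):
import os, random

random.seed(0)
img_dir = "VOC/VOCdevkit/VOC2019/JPEGImages"    # assumed path
main_dir = "VOC/VOCdevkit/VOC2019/ImageSets/Main"
names = [os.path.splitext(f)[0] for f in os.listdir(img_dir) if f.endswith(".jpg")]
random.shuffle(names)
split = int(len(names) * 0.7)                   # 70% train, 30% test (arbitrary)
with open(os.path.join(main_dir, "train.txt"), "w") as f:
    f.write("\n".join(names[:split]) + "\n")
with open(os.path.join(main_dir, "test.txt"), "w") as f:
    f.write("\n".join(names[split:]) + "\n")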
From the xml files, generate the corresponding data lists, split into a training set and a test set, with the image names stored in the .txt files under the Main folder for later use.
Then convert each .xml file into a .txt file inside the "labels" folder under VOC2019 (holding the coordinates and center point of each annotated box), and at the same time generate train.txt and test.txt in the folder next to VOCdevkit, holding the paths of the training/test images (this step extracts the information training actually needs). A sketch of this conversion follows.
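Below is a minimal sketch of that conversion in the spirit of darknet's voc_label.py; the class list, the year, and the paths are assumptions to adapt:
import os
import xml.etree.ElementTree as ET

classes = ["class1", "class2"]       # assumed class names, same order as voc.names
root = "VOC/VOCdevkit/VOC2019"       # assumed dataset root

def convert_annotation(name):
    tree = ET.parse(os.path.join(root, "Annotations", name + ".xml"))
    size = tree.find("size")
    w, h = int(size.find("width").text), int(size.find("height").text)
    lines = []
    for obj in tree.findall("object"):
        cls = obj.find("name").text
        if cls not in classes:
            continue
        b = obj.find("bndbox")
        xmin, ymin = float(b.find("xmin").text), float(b.find("ymin").text)
        xmax, ymax = float(b.find("xmax").text), float(b.find("ymax").text)
        # YOLO label format: class x_center y_center width height, all relative to the image size
        x, y = (xmin + xmax) / 2 / w, (ymin + ymax) / 2 / h
        bw, bh = (xmax - xmin) / w, (ymax - ymin) / h
        lines.append("%d %.6f %.6f %.6f %.6f" % (classes.index(cls), x, y, bw, bh))
    os.makedirs(os.path.join(root, "labels"), exist_ok=True)
    with open(os.path.join(root, "labels", name + ".txt"), "w") as f:
        f.write("\n".join(lines) + "\n")

for subset in ("train", "test"):
    with open(os.path.join(root, "ImageSets/Main", subset + ".txt")) as f:
        names = [line.strip() for line in f if line.strip()]
    with open("2019_%s.txt" % subset, "w") as out:   # the path lists consumed by voc.data
        for name in names:
            out.write(os.path.abspath(os.path.join(root, "JPEGImages", name + ".jpg")) + "\n")
            convert_annotation(name)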
Once the VOC-style tree is in place, put the training images, the matching xml files, and the two scripts above in their places, adjust the paths and class names inside the scripts, run them, and check the generated files. If everything checks out, the dataset is done. Next comes configuring the files used during training.
Files that need to be configured:
Makefile (the most important)
voc.data (dataset configuration)
voc.names (the class names used during training, referenced from voc.data)
yolov3.cfg (network configuration)
Configuring the Makefile on Linux matters a great deal. Open it with a text editor such as Mousepad or PyCharm rather than the system default; the opened file looks like the figure below:
For YOLOv3, configure the Makefile to match your own machine; only the first few switches need attention: GPU, CUDA, CUDNN, OPENCV (set to 1 for OpenCV 2/3). Nothing else needs changing. When the Makefile is done, right-click in the root directory to open a terminal, type the command make, and press Enter to compile.
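For reference, the top of the Makefile in pjreddie's darknet looks like this (the exact lines may differ in your copy); set each switch to 1 as your hardware allows:
GPU=1
CUDNN=1
OPENCV=1
OPENMP=0
DEBUG=0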
Open the voc.data file in the cfg folder; it looks like this:
The file contains:
classes: the number of classes
train: the path to the 2019_train.txt file under the VOC dataset folder
valid: the path to the 2019_test.txt file under the VOC dataset folder
names: data/voc.names in the data folder
backup: the backup folder in the root directory (it stores the .weights files (models) produced by training and the .backup file (used to resume training after an interruption))
eval: the VOC evaluation standard
Edit voc.names so it lists the class names of your training data, one per line.
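As a sketch, for a hypothetical 2-class dataset the two files could look like this (all paths are placeholders):
cfg/voc.data:
classes = 2
train   = /home/user/darknet/2019_train.txt
valid   = /home/user/darknet/2019_test.txt
names   = data/voc.names
backup  = backup
eval    = voc
data/voc.names (one class name per line):
class1
class2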
Open the corresponding YOLOv3 model file, as shown:
The file has three kinds of sections: the net layer, convolutional layers, and yolo layers. Only the net and yolo sections need configuring.
1) net section:
For training, comment out the two lines under # Testing (for testing, comment out the two lines under # Training instead), and set batch and subdivisions to suit your needs
(note: if GPU memory is small, increase subdivisions; batch/subdivisions is the number of images fed in at once during training)
Nothing else in the net section needs configuring
2) yolo section:
With the file open, press Ctrl+F and search for "yolo"; it appears 3 times, so the yolo part is edited 3 times (each edit is the same). In the convolutional layer right above each [yolo] layer, change filters to "3*(5+number of classes)"; inside [yolo], change classes to your own class count; random enables multi-scale training, set it to 0 if GPU memory is very small. All 3 yolo locations must be edited; see the cfg sketch after this list.
3) Optionally, use k-means to recompute the anchors values to better fit your own training data
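A sketch of the edited spots for a hypothetical 2-class setup, so filters = 3*(5+2) = 21 (all other lines stay as shipped):
[net]
# Testing
# batch=1
# subdivisions=1
# Training
batch=64
subdivisions=16
...
[convolutional]
filters=21        # 3*(5+classes), here classes=2
...
[yolo]
classes=2
random=1          # set to 0 if GPU memory is tight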
With that, file configuration is complete. Now training can begin.
By default, training saves a model every 100 iterations while the iteration count is below 1000, and every 10000 iterations once it reaches 1000. The saved models go to the backup folder in the root directory, together with a .backup model (used to resume from the last state if training is interrupted). To change how often models are saved, edit examples/detector.c in the darknet root directory, as sketched below.
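For reference, in pjreddie's darknet the saving logic in train_detector (examples/detector.c) looks roughly like the following, abridged and quoted from memory, so check your own copy; changing the 100/1000/10000 constants changes the save interval:
if(i%100==0){
    char buff[256];
    sprintf(buff, "%s/%s.backup", backup_directory, base);
    save_weights(net, buff);
}
if(i%10000==0 || (i < 1000 && i%100 == 0)){
    char buff[256];
    sprintf(buff, "%s/%s_%d.weights", backup_directory, base, i);
    save_weights(net, buff);
}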
After the change, open a terminal in the darknet root directory, type make, and press Enter to recompile.
Pretraining is done on ImageNet as a classification task for 160 epochs using SGD, with an initial learning rate of 0.1, divided by 4 at each drop, stopping at 0.0005. Besides 224x224 images, 448x448 images are also used.
The first training run needs darknet53.conv.74 as the pretrained model; later runs on new data can use an already-trained model as the pretrained model.
Open a terminal in the darknet root directory and enter a training command.
1) Training command without saving a log:
./darknet detector train cfg/voc.data cfg/yolov3-voc.cfg scripts/darknet53.conv.74
Breakdown:
1) once compilation in the darknet root directory succeeds, the darknet training environment is ready and the ./darknet command can be used
2) detector is the training program (the underlying C code)
3) train marks this as a training command (for testing, replace train with test)
4) cfg/voc.data: the configured dataset parameter file (mind the path)
5) cfg/yolov3-voc.cfg: the configured network parameter file (mind the path)
6) scripts/darknet53.conv.74: the pretrained model needed for training (mind the path)
2) Training command that saves a log:
./darknet detector train cfg/voc.data cfg/yolov3-voc.cfg scripts/darknet53.conv.74 | tee train_yolov3-voc.log
Breakdown: items 1) to 6) are the same as above, plus
7) | tee train_yolov3-voc.log: saves the training log (in the darknet root directory)
3) Multi-GPU training command without saving a log:
./darknet detector train cfg/voc.data cfg/yolov3-voc.cfg scripts/darknet53.conv.74 -gpus 0,1,...
Breakdown: items 1) to 6) are the same as above, plus
7) -gpus 0,1: added when training with multiple GPUs; it selects which GPUs to train on, i.e. a machine with n GPUs can use them together to accelerate training.
4) Multi-GPU training command that saves a log:
./darknet detector train cfg/voc.data cfg/yolov3-voc.cfg scripts/darknet53.conv.74 -gpus 0,1,... | tee train_yolov3-voc.log
Breakdown: combines items 7) of the two commands above.
On how models are saved when training with multiple GPUs, see: https://github.com/pjreddie/darknet/issues/664#issuecomment-405448653
1) Region 82 (94/106): the index of the yolo layer in the cfg file; there are three values, 82/94/106
2) Avg IOU: the intersection-over-union between predicted and annotated bounding boxes during training (intersection of the two boxes / their union); the larger the better, target value 1
3) Class: classification accuracy on annotated objects; the larger the better, target value 1
4) Obj: the predicted probability that a target is present; the larger the better, target value 1
5) No Obj: the predicted probability where no target is present; the smaller the better, target value 0
6) .5R: recall with IOU=0.5 as the threshold; recall = detected positives / actual positives
7) .75R: recall with IOU=0.75 as the threshold
8) count: the number of positive samples
In the log line shown:
1) 20277: the current training iteration
2) 0.038915: the total training loss
3) 0.42692 avg: the average loss; the lower the better, and training can generally be stopped once it drops below about 0.060730 avg.
4) 0.000100 rate: the current learning rate, defined in the .cfg file (I did not take a screenshot during my own run, so this figure is borrowed; my learning rate was 0.001)
5) 0.302128 seconds: the total time spent on the current batch (batch/subdivisions)
6) 162216 images: the total number of images seen in training so far
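Put together, the log line being described reads:
20277: 0.038915, 0.42692 avg, 0.000100 rate, 0.302128 seconds, 162216 images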
On Linux, press Ctrl+C in the training terminal to stop training.
To resume from the last run, open a terminal in the darknet root directory and enter:
./darknet detector train cfg/voc.data cfg/yolov3-voc.cfg backup/yolov3-voc.backup
Press Enter, and training continues from where it left off.
Explanation:
This is the same training command as before, except the pretrained weights are replaced by the yolov3-voc.backup file that the previous run saved in the backup folder under the darknet root directory.
After training with a log-saving command, a training log file train_yolov3-voc.log (.log) appears in the darknet root directory. The Python script visualization_train_yolov3-voc_log.py (placed in the same directory as the log file) visualizes the log; running it produces the loss curve and the Avg IOU curve.
1) visualization_train_yolov3-voc_log.py is as follows:
import pandas as pd
import matplotlib.pyplot as plt
import os

# ================== things you may need to modify ==================
g_log_path = "train_yolov3-voc.log"  # change this to your training log file name
# ====================================================================


def extract_log(log_file, new_log_file, key_word):
    '''
    :param log_file: the raw training log
    :param new_log_file: log file holding only the usable lines
    :param key_word: keyword used to pick out the log lines
    :return:
    '''
    with open(log_file, "r") as f:
        with open(new_log_file, "w") as train_log:
            for line in f:
                # skip the multi-GPU sync lines
                if "Syncing" in line:
                    continue
                # skip lines containing nan
                if "nan" in line:
                    continue
                if key_word in line:
                    train_log.write(line)


def drawAvgLoss(loss_log_path):
    '''
    :param loss_log_path: file with the extracted loss lines
    :return: plots the loss curve
    '''
    line_cnt = 0
    for count, line in enumerate(open(loss_log_path, "r")):
        line_cnt += 1
    result = pd.read_csv(loss_log_path,
                         skiprows=[iter_num for iter_num in range(line_cnt) if iter_num < 500],
                         error_bad_lines=False,
                         names=["loss", "avg", "rate", "seconds", "images"])
    result["avg"] = result["avg"].str.split(" ").str.get(1)
    result["avg"] = pd.to_numeric(result["avg"])
    fig = plt.figure(1, figsize=(6, 4))
    ax = fig.add_subplot(1, 1, 1)
    ax.plot(result["avg"].values, label="Avg Loss", color="#ff7043")
    ax.legend(loc="best")
    ax.set_title("Avg Loss Curve")
    ax.set_xlabel("Batches")
    ax.set_ylabel("Avg Loss")


def drawIOU(iou_log_path):
    '''
    :param iou_log_path: file with the extracted IOU lines
    :return: plots the IOU curve
    '''
    line_cnt = 0
    for count, line in enumerate(open(iou_log_path, "r")):
        line_cnt += 1
    result = pd.read_csv(iou_log_path,
                         skiprows=[x for x in range(line_cnt) if (x % 39 != 0) or (x < 5000)],
                         error_bad_lines=False,
                         names=["Region Avg IOU", "Class", "Obj", "No Obj", "Avg Recall", "count"])
    result["Region Avg IOU"] = result["Region Avg IOU"].str.split(": ").str.get(1)
    result["Region Avg IOU"] = pd.to_numeric(result["Region Avg IOU"])
    result_iou = result["Region Avg IOU"].values
    # smooth the IOU curve
    for i in range(len(result_iou) - 1):
        iou = result_iou[i]
        iou_next = result_iou[i + 1]
        if abs(iou - iou_next) > 0.2:
            result_iou[i] = (iou + iou_next) / 2
    fig = plt.figure(2, figsize=(6, 4))
    ax = fig.add_subplot(1, 1, 1)
    ax.plot(result_iou, label="Region Avg IOU", color="#ff7043")
    ax.legend(loc="best")
    ax.set_title("Avg IOU Curve")
    ax.set_xlabel("Batches")
    ax.set_ylabel("Avg IOU")


if __name__ == "__main__":
    loss_log_path = "train_log_loss.txt"
    iou_log_path = "train_log_iou.txt"
    if os.path.exists(g_log_path) is False:
        exit(-1)
    if os.path.exists(loss_log_path) is False:
        extract_log(g_log_path, loss_log_path, "images")
    if os.path.exists(iou_log_path) is False:
        extract_log(g_log_path, iou_log_path, "IOU")
    drawAvgLoss(loss_log_path)
    drawIOU(iou_log_path)
    plt.show()
2) Running the script produces the loss curve and the Avg IOU curve
(note: these figures were generated from my own training log)
The first training run needs the downloaded pretrained model darknet53.conv.74. After training, the trained models appear in the backup folder under the darknet root directory; when training on new data later, use the previously trained model as the pretrained model and fine-tune it.
1) Fine-tuning with -clear
Replace the pretrained darknet53.conv.74 with the .backup or final .weights file produced by the earlier run, and append -clear to the training command; training then starts from the initial state,
./darknet detector train cfg/yolo.data cfg/yolo.cfg backup/model_pre.backup -clear
so the retrained model is the result of fine-tuning the original one; it converges faster, and the iteration count restarts from 0.
2) Fine-tuning without -clear
Without -clear, training does not start from the initial state: loading the old model's .backup or .weights also loads its iteration count and learning rate. For example, if the old model ran 40000 iterations and ended at a learning rate of 0.00001, the new run also starts from iteration 40000 with learning rate 0.00001; in that case max_batches and learning_rate in the cfg file must be adjusted.
For example, my model 1's cfg looks like this, with an initial learning rate of 0.001:
If the new model should iterate another 30000 times starting from the 0.001 learning rate, modify max_batches and steps; a sketch follows.
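A sketch of the relevant [net] lines under that example (the numbers are illustrative: 40000 old iterations plus 30000 new ones):
[net]
learning_rate=0.001
max_batches=70000       # old 40000 + another 30000
policy=steps
steps=60000,65000       # decay points moved past the old 40000
scales=.1,.1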
The training procedure in this article drew on:
https://blog.csdn.net/qq_34806812/article/details/81459982
https://blog.csdn.net/hunterhe/article/details/89923092
In the detector.c file under the examples folder, the function below hard-codes the list path as data/coco_val_5k.list, so to use the recall command directly without modifying the source, you must create the corresponding file under the data folder and put into it the paths of the images whose recall is to be computed.
Enter this command in a Linux terminal window:
./darknet detector recall cfg/voc.data cfg/yolov3-voc.cfg backup/yolov3-voc.weights
Breakdown: ./darknet + detector (train/test program) + recall (the recall computation command) + cfg/voc.data (dataset configuration file) + cfg/yolov3-voc.cfg (network configuration file) + backup/yolov3-voc.weights (model file)
Note: create a file named coco_val_5k.list under the data folder holding the test-image paths (you can copy the path lines from the test-image list under the voc folder into coco_val_5k.list).
Find detector.c in the examples folder, open it, and locate the function shown in the figure below.
Modify the validate_detector_recall function in detector.c:
change its original
list *plist = get_paths("data/coco_val_5k.list");
to
list *plist = get_paths("*/darknet/voc/2019_test");
and that is all.
Then enter on the Linux command line:
./darknet detector recall cfg/voc.data cfg/yolov3-voc.cfg backup/yolov3-voc.weights
The output shown in the figure above is the computed recall, formatted as:
Number Correct Total Rps/Img IOU Recall
where:
1) Number: which image is being processed.
2) Correct: how many bboxes were identified correctly. It is computed like this: an image goes through the network, which predicts many bboxes, each with a confidence; every bbox whose confidence exceeds the threshold is matched by IOU against the ground-truth bboxes (the contents of the txt files under labels); take the bbox with the largest IOU, and if that maximum exceeds the preset IOU threshold, Correct is incremented.
3) IOU: the intersection of the predicted and annotated bboxes divided by their union. The larger this value, the better the prediction.
4) Recall: the number of detected objects divided by the number of annotated objects. From the code it is simply Correct divided by Total.
Average recall is computed as sum(recall)/n, i.e. the per-image recalls added up and averaged; see the short example below.
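For instance (made-up numbers), if three images give per-image recalls of 1.00, 0.75, and 0.50, the average recall is (1.00 + 0.75 + 0.50)/3 = 0.75.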
This part references: https://www.jianshu.com/p/7ae10c8f7d77/
and: https://blog.csdn.net/sihaiyinan/article/details/87903923 (good material)
There are two ways to compute mAP for YOLOv3: the first uses voc_eval.py from Faster R-CNN; the second modifies the YOLOv3 code itself. The first is the simpler of the two.
As the figure above shows, the validate_detector function in detector.c defaults its image list to data/train.list, so before using it, create the corresponding file under the data folder and fill it with the paths of the images whose mAP is to be computed.
Open a terminal in the darknet root directory and enter:
./darknet detector valid data/voc1.data data/yolov3-voc.cfg backup/yolov3-voc.backup
A file named comp4_det_test_<class> is generated for each class in the results folder, as shown below
Open any class's file; its contents look like the figure below
The first column is the image name (without extension), the second is the confidence, and the rest are xmin, ymin, xmax, ymax in order
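A line might therefore read (values are made up):
000012 0.874631 85.2 63.9 297.5 240.1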
After the per-class files are generated, compute the mAP with voc_eval.py.
Computing the mAP needs these files:
1) the per-class txt files of detected boxes
2) the Annotations folder
3) the list of validation image names
4) compute_mAP.py + voc_eval.py
# -*- coding: utf-8 -*-
import os
import numpy as np
from voc_eval import voc_eval  # note: voc_eval.py and compute_mAP.py must sit in the same directory

detpath = 'Path to dets txt'  # path to the per-class txt files
detfiles = os.listdir(detpath)

classes = ('__background__',  # always index 0; the dataset classes
           'class1', 'class2', 'class3', 'class4', 'class5', 'class6')

aps = []    # per-class AP
recs = []   # recall
precs = []  # precision

annopath = 'Path to Annotations' + '{:s}.xml'  # Annotations path; {:s}.xml lets the matching xml be loaded by image name later
imagesetfile = 'Path to VOC2007/ImageSets/Main/test.txt'  # list of image names
cachedir = 'Path to annotations_cache/'

for i, cls in enumerate(classes):
    if cls == '__background__':
        continue
    for f in detfiles:  # find the txt file for class cls
        if f.find(cls) != -1:
            filename = detpath + f
    rec, prec, ap = voc_eval(  # call voc_eval.py to compute recall, precision and AP for cls
        filename, annopath, imagesetfile, cls, cachedir, ovthresh=0,
        use_07_metric=False)
    aps += [ap]
    print('AP for {} = {:.4f}'.format(cls, ap))
    print('recall for {} = {:.4f}'.format(cls, rec[-1]))
    print('precision for {} = {:.4f}'.format(cls, prec[-1]))

print('Mean AP = {:.4f}'.format(np.mean(aps)))
print('~~~~~~~~')
print('Results:')
for ap in aps:
    print('{:.3f}'.format(ap))
print('{:.3f}'.format(np.mean(aps)))
print('~~~~~~~~')
# -*- coding: utf-8 -*-
# --------------------------------------------------------
# Fast/er R-CNN
# Licensed under The MIT License [see LICENSE for details]
# Written by Bharath Hariharan
# --------------------------------------------------------
import xml.etree.ElementTree as ET
import os
import pickle
import numpy as np


def parse_rec(filename):
    """ Parse a PASCAL VOC xml file """
    tree = ET.parse(filename)
    objects = []
    for obj in tree.findall('object'):
        obj_struct = {}
        obj_struct['name'] = obj.find('name').text
        obj_struct['pose'] = obj.find('pose').text
        obj_struct['truncated'] = int(obj.find('truncated').text)
        obj_struct['difficult'] = int(obj.find('difficult').text)
        bbox = obj.find('bndbox')
        obj_struct['bbox'] = [int(bbox.find('xmin').text),
                              int(bbox.find('ymin').text),
                              int(bbox.find('xmax').text),
                              int(bbox.find('ymax').text)]
        objects.append(obj_struct)
    return objects


def voc_ap(rec, prec, use_07_metric=False):  # VOC2007 and VOC2012 compute AP differently; the second way is now the usual one
    """ ap = voc_ap(rec, prec, [use_07_metric])
    Compute VOC AP given precision and recall.
    If use_07_metric is true, uses the
    VOC 07 11 point method (default:False).
    """
    if use_07_metric:
        # 11 point metric
        ap = 0.
        for t in np.arange(0., 1.1, 0.1):
            if np.sum(rec >= t) == 0:
                p = 0
            else:
                p = np.max(prec[rec >= t])
            ap = ap + p / 11.
    else:
        # correct AP calculation
        # first append sentinel values at the end
        mrec = np.concatenate(([0.], rec, [1.]))
        mpre = np.concatenate(([0.], prec, [0.]))
        # compute the precision envelope
        for i in range(mpre.size - 1, 0, -1):
            mpre[i - 1] = np.maximum(mpre[i - 1], mpre[i])
        # to calculate area under PR curve, look for points
        # where X axis (recall) changes value
        i = np.where(mrec[1:] != mrec[:-1])[0]
        # and sum (\Delta recall) * prec
        ap = np.sum((mrec[i + 1] - mrec[i]) * mpre[i + 1])
    return ap


## entry point
def voc_eval(detpath,       # path to the detected-box files, one file per class
             annopath,      # path to Annotations
             imagesetfile,  # list of test image names
             classname,     # class name
             cachedir,      # cache directory
             ovthresh=0.5,  # IoU threshold
             use_07_metric=False):  # which mAP computation to use
    """rec, prec, ap = voc_eval(detpath,
                                annopath,
                                imagesetfile,
                                classname,
                                [ovthresh],
                                [use_07_metric])
    Top level function that does the PASCAL VOC evaluation.
    detpath: Path to detections
        detpath.format(classname) should produce the detection results file.
    annopath: Path to annotations
        annopath.format(imagename) should be the xml annotations file.
    imagesetfile: Text file containing the list of images, one image per line.
    classname: Category name (duh)
    cachedir: Directory for caching the annotations
    [ovthresh]: Overlap threshold (default = 0.5)
    [use_07_metric]: Whether to use VOC07's 11 point AP computation
        (default False)
    """
    # assumes detections are in detpath.format(classname)
    # assumes annotations are in annopath.format(imagename)
    # assumes imagesetfile is a text file with each line an image name
    # cachedir caches the annotations in a pickle file
    # first load gt: the ground-truth boxes
    # on the first run, the xml files under Annotations are parsed to get each image's true boxes,
    # and the result is cached in the annotations_cache folder;
    # later runs read the ground truth straight from the cache
    if not os.path.isdir(cachedir):
        os.mkdir(cachedir)
    cachefile = os.path.join(cachedir, 'annots.pkl')
    # read list of images
    with open(imagesetfile, 'r') as f:
        lines = f.readlines()
    imagenames = [x.strip() for x in lines]
    if not os.path.isfile(cachefile):
        # load annots
        recs = {}
        for i, imagename in enumerate(imagenames):
            recs[imagename] = parse_rec(annopath.format(imagename))
            if i % 100 == 0:
                print('Reading annotation for {:d}/{:d}'.format(
                    i + 1, len(imagenames)))
        # save
        print('Saving cached annotations to {:s}'.format(cachefile))
        with open(cachefile, 'wb') as f:
            pickle.dump(recs, f)
    else:
        # load
        with open(cachefile, 'rb') as f:
            recs = pickle.load(f)
    # extract gt objects for this class
    class_recs = {}
    npos = 0  # how many ground-truth objects this class has in total
    for imagename in imagenames:
        R = [obj for obj in recs[imagename] if obj['name'] == classname]  # the boxes of class classname in image imagename
        bbox = np.array([x['bbox'] for x in R])  # box coordinates
        difficult = np.array([x['difficult'] for x in R]).astype(bool)  # whether the object is hard to recognize
        det = [False] * len(R)  # one det[i] per ground-truth box, marking whether it has already been matched
        npos = npos + sum(~difficult)  # count the non-difficult objects
        class_recs[imagename] = {'bbox': bbox,  # collect each image's ground-truth info in class_recs
                                 'difficult': difficult,
                                 'det': det}
    # read dets
    detfile = detpath.format(classname)  # open the detections file for class classname
    with open(detfile, 'r') as f:
        lines = f.readlines()
    splitlines = [x.strip().split(' ') for x in lines]
    image_ids = [x[0] for x in splitlines]  # image names
    confidence = np.array([float(x[1]) for x in splitlines])  # confidences
    BB = np.array([[float(z) for z in x[2:]] for x in splitlines])  # box coordinates
    # sort by confidence
    sorted_ind = np.argsort(-confidence)
    sorted_scores = np.sort(-confidence)
    BB = BB[sorted_ind, :]
    image_ids = [image_ids[x] for x in sorted_ind]
    # go down dets and mark TPs and FPs
    nd = len(image_ids)  # number of detected boxes
    tp = np.zeros(nd)  # tp array, one slot per detected box
    fp = np.zeros(nd)  # fp array, one slot per detected box
    for d in range(nd):
        R = class_recs[image_ids[d]]  # ground-truth info for image image_ids[d]
        bb = BB[d, :].astype(float)  # detected box coordinates for image image_ids[d]
        ovmax = -np.inf
        BBGT = R['bbox'].astype(float)  # ground-truth box coordinates for image image_ids[d]
        if BBGT.size > 0:
            # compute overlaps (IoU)
            # intersection
            ixmin = np.maximum(BBGT[:, 0], bb[0])
            iymin = np.maximum(BBGT[:, 1], bb[1])
            ixmax = np.minimum(BBGT[:, 2], bb[2])
            iymax = np.minimum(BBGT[:, 3], bb[3])
            iw = np.maximum(ixmax - ixmin + 1., 0.)
            ih = np.maximum(iymax - iymin + 1., 0.)
            inters = iw * ih
            # union
            uni = ((bb[2] - bb[0] + 1.) * (bb[3] - bb[1] + 1.) +
                   (BBGT[:, 2] - BBGT[:, 0] + 1.) *
                   (BBGT[:, 3] - BBGT[:, 1] + 1.) - inters)
            overlaps = inters / uni
            ovmax = np.max(overlaps)  # a detected box may overlap several ground-truth boxes; take the largest overlap
            jmax = np.argmax(overlaps)
        if ovmax > ovthresh:  # is the IoU above the threshold
            if not R['difficult'][jmax]:  # is the ground-truth box difficult
                if not R['det'][jmax]:  # has this ground-truth box been matched already
                    tp[d] = 1.  # set slot d of tp to 1
                    R['det'][jmax] = 1  # mark the ground-truth box as matched
                else:
                    fp[d] = 1.  # otherwise set the fp slot to 1
        else:
            fp[d] = 1.  # otherwise set the fp slot to 1
    # compute precision recall
    fp = np.cumsum(fp)  # cumulative sum; the last entry is the fp count
    tp = np.cumsum(tp)  # cumulative sum; the last entry is the tp count
    rec = tp / float(npos)  # recall
    # avoid divide by zero in case the first detection matches a difficult
    # ground truth
    prec = tp / np.maximum(tp + fp, np.finfo(np.float64).eps)  # precision
    ap = voc_ap(rec, prec, use_07_metric)  # AP
    return rec, prec, ap
Adapt the scripts' contents to your own situation, run compute_mAP.py (remember to put the two files in the same directory), and the results are computed.
The second method adds a mAP computation function to detector.c and wires it in.
The mAP command is then entered on the Linux command line (shown after the code below).
if(0==strcmp(argv[2], "test")) test_detector(datacfg, cfg, weights, filename, thresh, hier_thresh, outfile, fullscreen);
else if(0==strcmp(argv[2], "train")) train_detector(datacfg, cfg, weights, gpus, ngpus, clear);
else if(0==strcmp(argv[2], "valid")) validate_detector(datacfg, cfg, weights, outfile);
else if(0==strcmp(argv[2], "valid2")) validate_detector_flip(datacfg, cfg, weights, outfile);
else if(0==strcmp(argv[2], "recall")) validate_detector_recall(datacfg, cfg, weights);
Add one more line after them:
else if(0==strcmp(argv[2], "map")) validate_detector_map(datacfg, cfg, weights, thresh);
typedef struct {
box b;
float p;
int class_id;
int image_index;
int truth_flag;
int unique_truth_index;
} box_prob;
int detections_comparator(const void *pa, const void *pb)
{
box_prob a = *(box_prob *)pa;
box_prob b = *(box_prob *)pb;
float diff = a.p - b.p;
if (diff < 0) return 1;
else if (diff > 0) return -1;
return 0;
}
void validate_detector_map(char *datacfg, char *cfgfile, char *weightfile, float thresh_calc_avg_iou)
{
list *options = read_data_cfg(datacfg); //get .data file
char *valid_images = option_find_str(options, "valid", "data/train.txt"); //point to the path of valid images
char *difficult_valid_images = option_find_str(options, "difficult", NULL); //get the path to the 'difficult', if it doesn't exist,replace it with NULL
char *name_list = option_find_str(options, "names", "data/names.list"); // find name of each category
char **names = get_labels(name_list);
//char *mapf = option_find_str(options, "map", 0); // get the 'map', what is the map
//int *map = 0;
//if (mapf) map = read_map(mapf);
FILE* reinforcement_fd = NULL;
network *net = load_network(cfgfile, weightfile, 0);
set_batch_network(net, 1);
//fuse_conv_batchnorm(net);
//calculate_binary_weights(net);
srand(time(0));
list *plist = get_paths(valid_images);
char **paths = (char **)list_to_array(plist);
char **paths_dif = NULL;
if (difficult_valid_images) {
list *plist_dif = get_paths(difficult_valid_images);
paths_dif = (char **)list_to_array(plist_dif);
}
layer l = net->layers[net->n - 1];
int classes = l.classes;
int m = plist->size;
int i = 0;
int t;
const float thresh = .005;
const float nms = .45;
const float iou_thresh = 0.5;
int nthreads = 4;
image *val = calloc(nthreads, sizeof(image));
image *val_resized = calloc(nthreads, sizeof(image));
image *buf = calloc(nthreads, sizeof(image));
image *buf_resized = calloc(nthreads, sizeof(image));
pthread_t *thr = calloc(nthreads, sizeof(pthread_t));
load_args args = {0};
args.w = net->w;
args.h = net->h;
//args.type = IMAGE_DATA;
args.type = LETTERBOX_DATA;
//const float thresh_calc_avg_iou = 0.24;
float avg_iou = 0;
int tp_for_thresh = 0;
int fp_for_thresh = 0;
box_prob *detections = calloc(1, sizeof(box_prob));
int detections_count = 0;
int unique_truth_count = 0;
int *truth_classes_count = calloc(classes, sizeof(int));
for (t = 0; t < nthreads; ++t) {
args.path = paths[i + t];
args.im = &buf[t];
args.resized = &buf_resized[t];
thr[t] = load_data_in_thread(args);
}
time_t start = time(0);
for (i = nthreads; i < m + nthreads; i += nthreads) {
fprintf(stderr, "%d\n", i);
for (t = 0; t < nthreads && i + t - nthreads < m; ++t) {
pthread_join(thr[t], 0);
val[t] = buf[t];
val_resized[t] = buf_resized[t];
}
for (t = 0; t < nthreads && i + t < m; ++t) {
args.path = paths[i + t];
args.im = &buf[t];
args.resized = &buf_resized[t];
thr[t] = load_data_in_thread(args);
}
for (t = 0; t < nthreads && i + t - nthreads < m; ++t) {
const int image_index = i + t - nthreads;
char *path = paths[image_index];
char *id = basecfg(path);
float *X = val_resized[t].data;
network_predict(net, X);
int nboxes = 0;
float hier_thresh = 0;
detection *dets;
if (args.type == LETTERBOX_DATA) {
//int letterbox = 1;
dets = get_network_boxes(net, val[t].w, val[t].h, thresh, hier_thresh, 0, 1, &nboxes);
}
else {
//int letterbox = 0;
dets = get_network_boxes(net, 1, 1, thresh, hier_thresh, 0, 0, &nboxes);
}
//detection *dets = get_network_boxes(&net, val[t].w, val[t].h, thresh, hier_thresh, 0, 1, &nboxes, letterbox); // for letterbox=1
if (nms) do_nms_sort(dets, nboxes, l.classes, nms);
char labelpath[4096];
find_replace(path, "images", "labels", labelpath);
find_replace(labelpath, "JPEGImages", "labels", labelpath);
find_replace(labelpath, ".jpg", ".txt", labelpath);
find_replace(labelpath, ".JPEG", ".txt", labelpath);
int num_labels = 0;
box_label *truth = read_boxes(labelpath, &num_labels);
int i, j;
for (j = 0; j < num_labels; ++j) {
truth_classes_count[truth[j].id]++;
}
// difficult
box_label *truth_dif = NULL;
int num_labels_dif = 0;
if (paths_dif)
{
char *path_dif = paths_dif[image_index];
char labelpath_dif[4096];
//replace_image_to_label(path_dif, labelpath_dif);
find_replace(path_dif, "images", "labels", labelpath_dif);
find_replace(labelpath_dif, "JPEGImages", "labels", labelpath_dif);
find_replace(labelpath_dif, ".jpg", ".txt", labelpath_dif);
find_replace(labelpath_dif, ".JPEG", ".txt", labelpath_dif);
truth_dif = read_boxes(labelpath_dif, &num_labels_dif);
}
const int checkpoint_detections_count = detections_count;
for (i = 0; i < nboxes; ++i) {
int class_id;
for (class_id = 0; class_id < classes; ++class_id) {
float prob = dets[i].prob[class_id];
if (prob > 0) {
detections_count++;
detections = realloc(detections, detections_count * sizeof(box_prob));
detections[detections_count - 1].b = dets[i].bbox;
detections[detections_count - 1].p = prob;
detections[detections_count - 1].image_index = image_index;
detections[detections_count - 1].class_id = class_id;
detections[detections_count - 1].truth_flag = 0;
detections[detections_count - 1].unique_truth_index = -1;
int truth_index = -1;
float max_iou = 0;
for (j = 0; j < num_labels; ++j)
{
box t = { truth[j].x, truth[j].y, truth[j].w, truth[j].h };
//printf(" IoU = %f, prob = %f, class_id = %d, truth[j].id = %d \n",
//box_iou(dets[i].bbox, t), prob, class_id, truth[j].id);
float current_iou = box_iou(dets[i].bbox, t);
if (current_iou > iou_thresh && class_id == truth[j].id) {
if (current_iou > max_iou) {
max_iou = current_iou;
truth_index = unique_truth_count + j;
}
}
}
// best IoU
if (truth_index > -1) {
detections[detections_count - 1].truth_flag = 1;
detections[detections_count - 1].unique_truth_index = truth_index;
}
else {
// if object is difficult then remove detection
for (j = 0; j < num_labels_dif; ++j) {
box t = { truth_dif[j].x, truth_dif[j].y, truth_dif[j].w, truth_dif[j].h };
float current_iou = box_iou(dets[i].bbox, t);
if (current_iou > iou_thresh && class_id == truth_dif[j].id) {
--detections_count;
break;
}
}
}
// calc avg IoU, true-positives, false-positives for required Threshold
if (prob > thresh_calc_avg_iou) {
int z, found = 0;
for (z = checkpoint_detections_count; z < detections_count-1; ++z)
if (detections[z].unique_truth_index == truth_index) {
found = 1; break;
}
if(truth_index > -1 && found == 0) {
avg_iou += max_iou;
++tp_for_thresh;
}
else
fp_for_thresh++;
}
}
}
}
unique_truth_count += num_labels;
//static int previous_errors = 0;
//int total_errors = fp_for_thresh + (unique_truth_count - tp_for_thresh);
//int errors_in_this_image = total_errors - previous_errors;
//previous_errors = total_errors;
//if(reinforcement_fd == NULL) reinforcement_fd = fopen("reinforcement.txt", "wb");
//char buff[1000];
//sprintf(buff, "%s\n", path);
//if(errors_in_this_image > 0) fwrite(buff, sizeof(char), strlen(buff), reinforcement_fd);
free_detections(dets, nboxes);
free(id);
free_image(val[t]);
free_image(val_resized[t]);
}
}
if((tp_for_thresh + fp_for_thresh) > 0)
avg_iou = avg_iou / (tp_for_thresh + fp_for_thresh);
// SORT(detections)
qsort(detections, detections_count, sizeof(box_prob), detections_comparator);
typedef struct {
double precision;
double recall;
int tp, fp, fn;
} pr_t;
// for PR-curve
pr_t **pr = calloc(classes, sizeof(pr_t*));
for (i = 0; i < classes; ++i) {
pr[i] = calloc(detections_count, sizeof(pr_t));
}
printf("detections_count = %d, unique_truth_count = %d \n", detections_count, unique_truth_count);
int *truth_flags = calloc(unique_truth_count, sizeof(int));
int rank;
for (rank = 0; rank < detections_count; ++rank) {
if(rank % 100 == 0)
printf(" rank = %d of ranks = %d \r", rank, detections_count);
if (rank > 0) {
int class_id;
for (class_id = 0; class_id < classes; ++class_id) {
pr[class_id][rank].tp = pr[class_id][rank - 1].tp;
pr[class_id][rank].fp = pr[class_id][rank - 1].fp;
}
}
box_prob d = detections[rank];
// if (detected && isn't detected before)
if (d.truth_flag == 1) {
if (truth_flags[d.unique_truth_index] == 0)
{
truth_flags[d.unique_truth_index] = 1;
pr[d.class_id][rank].tp++; // true-positive
}
}
else {
pr[d.class_id][rank].fp++; // false-positive
}
for (i = 0; i < classes; ++i)
{
const int tp = pr[i][rank].tp;
const int fp = pr[i][rank].fp;
const int fn = truth_classes_count[i] - tp; // false-negative = objects - true-positive
pr[i][rank].fn = fn;
if ((tp + fp) > 0) pr[i][rank].precision = (double)tp / (double)(tp + fp);
else pr[i][rank].precision = 0;
if ((tp + fn) > 0) pr[i][rank].recall = (double)tp / (double)(tp + fn);
else pr[i][rank].recall = 0;
}
}
free(truth_flags);
double mean_average_precision = 0;
for (i = 0; i < classes; ++i) {
double avg_precision = 0;
int point;
for (point = 0; point < 11; ++point) {
double cur_recall = point * 0.1;
double cur_precision = 0;
for (rank = 0; rank < detections_count; ++rank)
{
if (pr[i][rank].recall >= cur_recall) { // > or >=
if (pr[i][rank].precision > cur_precision) {
cur_precision = pr[i][rank].precision;
}
}
}
printf("class_id = %d, point = %d, cur_recall = %.4f, cur_precision = %.4f \n", i, point, cur_recall, cur_precision);
avg_precision += cur_precision;
}
avg_precision = avg_precision / 11; // ??
printf("class_id = %d, name = %s, \t ap = %2.2f %% \n", i, names[i], avg_precision*100);
mean_average_precision += avg_precision;
}
printf("---------------------caculate end!!------------------------\n");
const float cur_precision = (float)tp_for_thresh / ((float)tp_for_thresh + (float)fp_for_thresh);
const float cur_recall = (float)tp_for_thresh / ((float)tp_for_thresh + (float)(unique_truth_count - tp_for_thresh));
const float f1_score = 2.F * cur_precision * cur_recall / (cur_precision + cur_recall);
printf(" for thresh = %1.2f, precision = %1.2f, recall = %1.2f, F1-score = %1.2f \n",
thresh_calc_avg_iou, cur_precision, cur_recall, f1_score);
printf(" for thresh = %0.2f, TP = %d, FP = %d, FN = %d, average IoU = %2.2f %% \n",
thresh_calc_avg_iou, tp_for_thresh, fp_for_thresh, unique_truth_count - tp_for_thresh, avg_iou * 100);
mean_average_precision = mean_average_precision / classes;
printf("\n mean average precision (mAP) = %f, or %2.2f %% \n", mean_average_precision, mean_average_precision*100);
for (i = 0; i < classes; ++i) {
free(pr[i]);
}
free(pr);
free(detections);
free(truth_classes_count);
fprintf(stderr, "Total Detection Time: %f Seconds\n", (double)(time(0) - start));
if (reinforcement_fd != NULL) fclose(reinforcement_fd);
}
./darknet detector map data/voc.data cfg/yolov3-voc.cfg yolov3-voc.weights
computes the mAP value.
Notes:
the cur_recall and cur_precision printed for each class's AP are the data needed to plot the P-R curve;
the metrics printed below "calculate end!!" are those the run produces: the AP, recall, avg IoU, precision, and so on.
./darknet detector valid
The results are generated in the results folder.
Change the validate_detector_recall definition and call as follows:
the definition: void validate_detector_recall(char *datacfg, char *cfgfile, char *weightfile)
the call in main: validate_detector_recall(datacfg, cfg, weights);
and change the plist/paths initialization inside validate_detector_recall from:
list *plist = get_paths("data/voc.2007.test");
char **paths = (char **)list_to_array(plist);
to:
list *options = read_data_cfg(datacfg);
char *valid_images = option_find_str(options, "valid", "data/train.list");
list *plist = get_paths(valid_images);
char **paths = (char **)list_to_array(plist);
After these changes, be sure to run make again under darknet before using the recall command:
./darknet detector recall cfg/voc.data cfg/yolo-voc.cfg backup/yolo-voc_final.weights
void test_detector(char *datacfg, char *cfgfile, char *weightfile, char *filename, float thresh, float hier_thresh, char *outfile, int fullscreen)
{
list *options = read_data_cfg(datacfg);
char *name_list = option_find_str(options, "names", "data/names.list");
char **names = get_labels(name_list);
image **alphabet = load_alphabet();
network *net = load_network(cfgfile, weightfile, 0);
set_batch_network(net, 1);
srand(2222222);
double time;
char buff[256];
char *input = buff;
float nms=.45;
int i=0;
while(1){
if(filename){
strncpy(input, filename, 256);
image im = load_image_color(input,0,0);
image sized = letterbox_image(im, net->w, net->h);
//image sized = resize_image(im, net->w, net->h);
//image sized2 = resize_max(im, net->w);
//image sized = crop_image(sized2, -((net->w - sized2.w)/2), -((net->h - sized2.h)/2), net->w, net->h);
//resize_network(net, sized.w, sized.h);
layer l = net->layers[net->n-1];
float *X = sized.data;
time=what_time_is_it_now();
network_predict(net, X);
printf("%s: Predicted in %f seconds.\n", input, what_time_is_it_now()-time);
int nboxes = 0;
detection *dets = get_network_boxes(net, im.w, im.h, thresh, hier_thresh, 0, 1, &nboxes);
//printf("%d\n", nboxes);
//if (nms) do_nms_obj(boxes, probs, l.w*l.h*l.n, l.classes, nms);
if (nms) do_nms_sort(dets, nboxes, l.classes, nms);
draw_detections(im, dets, nboxes, thresh, names, alphabet, l.classes);
free_detections(dets, nboxes);
if(outfile)
{
save_image(im, outfile);
}
else{
save_image(im, "predictions");
#ifdef OPENCV
cvNamedWindow("predictions", CV_WINDOW_NORMAL);
if(fullscreen){
cvSetWindowProperty("predictions", CV_WND_PROP_FULLSCREEN, CV_WINDOW_FULLSCREEN);
}
show_image(im, "predictions");
cvWaitKey(0);
cvDestroyAllWindows();
#endif
}
free_image(im);
free_image(sized);
if (filename) break;
}
else {
printf("Enter Image Path: ");
fflush(stdout);
input = fgets(input, 256, stdin);
if(!input) return;
strtok(input, "\n");
list *plist = get_paths(input);
char **paths = (char **)list_to_array(plist);
printf("Start Testing!\n");
int m = plist->size;
if(access("/home/FENGsl/darknet/data/out",0)==-1)//"/home/FENGsl/darknet/data"修改成自己的路径*************************************************
{
if (mkdir("/home/FENGsl/darknet/data/out",0777))//"/home/FENGsl/darknet/data"修改成自己的路径*************************************************
{
printf("creat file bag failed!!!");
}
}
for(i = 0; i < m; ++i){
char *path = paths[i];
image im = load_image_color(path,0,0);
image sized = letterbox_image(im, net->w, net->h);
//image sized = resize_image(im, net->w, net->h);
//image sized2 = resize_max(im, net->w);
//image sized = crop_image(sized2, -((net->w - sized2.w)/2), -((net->h - sized2.h)/2), net->w, net->h);
//resize_network(net, sized.w, sized.h);
layer l = net->layers[net->n-1];
float *X = sized.data;
time=what_time_is_it_now();
network_predict(net, X);
printf("Try Very Hard:");
printf("%s: Predicted in %f seconds.\n", path, what_time_is_it_now()-time);
int nboxes = 0;
detection *dets = get_network_boxes(net, im.w, im.h, thresh, hier_thresh, 0, 1, &nboxes);
//printf("%d\n", nboxes);
//if (nms) do_nms_obj(boxes, probs, l.w*l.h*l.n, l.classes, nms);
if (nms) do_nms_sort(dets, nboxes, l.classes, nms);
draw_detections(im, dets, nboxes, thresh, names, alphabet, l.classes);
free_detections(dets, nboxes);
if(outfile){
save_image(im, outfile);
}
else{
char b[2048];
sprintf(b,"/home/FENGsl/darknet/data/out/%s",GetFilename(path));//"/home/FENGsl/darknet/data"修改成自己的路径*******************************
save_image(im, b);
printf("save %s successfully!\n",GetFilename(path));
#ifdef OPENCV
cvNamedWindow("predictions", CV_WINDOW_NORMAL);
if(fullscreen){
cvSetWindowProperty("predictions", CV_WND_PROP_FULLSCREEN, CV_WINDOW_FULLSCREEN);
}
show_image(im, "predictions");
cvWaitKey(0);
cvDestroyAllWindows();
#endif
}
free_image(im);
free_image(sized);
if (filename) break;
}
}
}
}
#include "darknet.h"
#include
#include
#include
#include
static int coco_ids[] = {1,2,3,4,5,6,7,8,9,10,11,13,14,15,16,17,18,19,20,21,22,23,24,25,27,28,31,32,33,34,35,36,37,38,39,40,41,42,43,44,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,67,70,72,73,74,75,76,77,78,79,80,81,82,84,85,86,87,88,89,90};
char *GetFilename(char *p)
{
static char name[20]={""};
char *q = strrchr(p,'/') + 1;
strncpy(name,q,6);//note the 6: if the base names (without extension) of your test images have a different length, change it to what you need (the official default length is 6)
return name;
}
Run make again under darknet, then execute the command.
The batch-test command is:
./darknet detector test cfg/voc.data cfg/yolov3-voc.cfg backup/yolov3-voc_final.weights
It prints the layer table (layer filters size input output ... 106 detection), loads the weights (Loading weights from ...Done!), and then prompts Enter Image Path:
After Enter Image Path: give the path of your txt file (all the test-image paths collected in one txt file); copying the path after valid in your voc.data works fine, for example:
classes= 3
train =/home/FENGsl/darknet/data/train.txt
valid = /home/FENGsl/darknet/data/2007_test.txt
names = data/voc.names
backup = backup
The main change is to the draw_detections function in the src/image.c file.
//a char *filename parameter was added, to obtain the current image name.
void draw_detections(image im, int num, float thresh, box *boxes, float **probs, float **masks, char **names, image **alphabet, int classes, char *filename)
{
    printf("num %d\n", num);
    float rawmax[num];
    int i,j;
    int params[3];
    char savePath[100] = "";
    // to pick the top boxes (the original post keeps the top 19), every box is ranked by its
    // maximum class probability; the b and c arrays hold the sorted scores and the indices of
    // the original boxes, so their coordinates can be recovered afterwards.
    for(i = 0; i < num; ++i){
        rawmax[i] = 0;
        for(j = 0; j < classes; ++j){
            if(probs[i][j] > rawmax[i]){
                rawmax[i] = probs[i][j];
            }
        }
    }
    for(i = 0; i < num; ++i){
        if(rawmax[i] > 0)
            printf("rawmax[%d]:%f\n", i, rawmax[i]);
    }
    float b[num];
    int c[num];
    for(i = 0; i < num; ++i){
        c[i] = i;  /* placeholder: the original sorting code that filled b and c was lost when this snippet was copied */
        /* standard darknet conversion from a relative box to pixel coordinates, clamped to the image */
        box bx = boxes[c[i]];
        int left  = (bx.x - bx.w/2.)*im.w;
        int right = (bx.x + bx.w/2.)*im.w;
        int top   = (bx.y - bx.h/2.)*im.h;
        int bot   = (bx.y + bx.h/2.)*im.h;
        if(left < 0) left = 0;
        if(right > im.w-1) right = im.w-1;
        if(top < 0) top = 0;
        if(bot > im.h-1) bot = im.h-1;
        printf("c[%d]=%d %d %d %d %d\n", i, c[i], left, right, top, bot);
        // the crop-and-save below needs OpenCV; add at the top of the file:
        //   #ifdef OPENCV
        //   #include "opencv2/highgui/highgui_c.h"
        //   #include "opencv2/imgproc/imgproc_c.h"
        //   #endif
        CvRect roi_box = cvRect(left, top, right-left, bot-top);
        IplImage* src = cvLoadImage(filename, -1);
        CvSize size = cvSize(right-left, bot-top);
        IplImage* roi = cvCreateImage(size, src->depth, src->nChannels);
        cvSetImageROI(src, roi_box);
        cvCopy(src, roi, NULL);
        char name1[4] = "dog";
        char name2[5] = ".jpg";
        char newname[100];
        sprintf(newname, "%s_%d%s", name1, i, name2);  // saved crop name: dog_i.jpg
        cvSaveImage(newname, roi, 0);
        cvReleaseImage(&src);
        cvReleaseImage(&roi);
    }
}
Because a char *filename parameter was added to the function, the declaration in the corresponding header must be changed as well.
In include/darknet.h, change the declaration to:
void draw_detections(image im, int num, float thresh, box *boxes, float **probs, float **masks, char **names, image **alphabet, int classes, char* filename);
In examples/detector.c, change the call to:
draw_detections(im, l.w*l.h*l.n, thresh, boxes, probs, masks, names, alphabet, l.classes, filename);
A few other spots also need changes because of the new parameter; follow the errors make reports and fix them.
To detect images in batch, only the test_detector() function in examples/detector.c needs modifying:
//add filelist: read test.txt, which holds one image path per line
char **filelist = get_labels("../test.txt");
Change:
while(1){
if(filename){……
……
}
to:
int index = 0;
while(filelist[index] != NULL){
filename = filelist[index];
printf("filename: %s\n", filename);
if(filename){ ……
……
index++;
}
Remove:
if(outfile){
save_image(im, outfile);
}
else{
save_image(im, "predictions");
#ifdef OPENCV
cvNamedWindow("predictions", CV_WINDOW_NORMAL);
if(fullscreen){
cvSetWindowProperty("predictions", CV_WND_PROP_FULLSCREEN, CV_WINDOW_FULLSCREEN);
}
show_image(im, "predictions");
cvWaitKey(0);
cvDestroyAllWindows();
#endif
}
if (filename) break;
The complete modified test_detector() is:
void test_detector(char *datacfg, char *cfgfile, char *weightfile, char *filename, float thresh, float hier_thresh, char *outfile, int fullscreen)
{
list *options = read_data_cfg(datacfg);
char *name_list = option_find_str(options, "names", "data/names.list");
char **names = get_labels(name_list);
image **alphabet = load_alphabet();
network *net = load_network(cfgfile, weightfile, 0);
set_batch_network(net, 1);
srand(2222222);
double time;
char buff[256];
char *input = buff;
int j;
float nms=.3;
char **filelist = get_labels("/home/wc/YOLO/darknet/data/test.txt");
int index = 0;
while(filelist[index] != NULL){
filename = filelist[index];
printf("filename: %s\n", filename);
// while(1){
if(filename){
strncpy(input, filename, 256);
} else {
printf("Enter Image Path: ");
fflush(stdout);
input = fgets(input, 256, stdin);
if(!input) return;
strtok(input, "\n");
}
image im = load_image_color(input,0,0);
image sized = letterbox_image(im, net->w, net->h);
//image sized = resize_image(im, net->w, net->h);
//image sized2 = resize_max(im, net->w);
//image sized = crop_image(sized2, -((net->w - sized2.w)/2), -((net->h - sized2.h)/2), net->w, net->h);
//resize_network(net, sized.w, sized.h);
layer l = net->layers[net->n-1];
box *boxes = calloc(l.w*l.h*l.n, sizeof(box));
float **probs = calloc(l.w*l.h*l.n, sizeof(float *));
for(j = 0; j < l.w*l.h*l.n; ++j) probs[j] = calloc(l.classes + 1, sizeof(float *));
float **masks = 0;
if (l.coords > 4){
masks = calloc(l.w*l.h*l.n, sizeof(float*));
for(j = 0; j < l.w*l.h*l.n; ++j) masks[j] = calloc(l.coords-4, sizeof(float *));
}
float *X = sized.data;
time=what_time_is_it_now();
network_predict(net, X);
printf("%s: Predicted in %f seconds.\n", input, what_time_is_it_now()-time);
//printf(boxes);
get_region_boxes(l, im.w, im.h, net->w, net->h, thresh, probs, boxes, masks, 0, 0, hier_thresh, 1);
if (nms) do_nms_sort(boxes, probs, l.w*l.h*l.n, l.classes, nms);
//else if (nms) do_nms_sort(boxes, probs, l.w*l.h*l.n, l.classes, nms);
//printf("start draw_detection");
draw_detections(im, l.w*l.h*l.n, thresh, boxes, probs, masks, names, alphabet, l.classes, filename);
/*
if(outfile){
save_image(im, outfile);
}
else{
save_image(im, "predictions");
#ifdef OPENCV
cvNamedWindow("predictions", CV_WINDOW_NORMAL);
if(fullscreen){
cvSetWindowProperty("predictions", CV_WND_PROP_FULLSCREEN, CV_WINDOW_FULLSCREEN);
}
show_image(im, "predictions");
cvWaitKey(0);
cvDestroyAllWindows();
#endif
}
*/
free_image(im);
free_image(sized);
free(boxes);
free_ptrs((void **)probs, l.w*l.h*l.n);
// if (filename) break;
index++;
}
}
********************* To be updated