mmdetection常用功能指引

一、自己的数据集转换成coco数据集

参考代码。
coco格式数据集的标签主要分为：
（1）"info"
（2）"license"
（3） "images": height、width、id、file_name
（4） "annotations": id、image_id、category_id、segmentation、bbox、iscrowd、area
（5） "categories": id、name、supercategory
代码本地地址：~/mmdetection/data/fabric2coco
使用注意事项：
1.img_dirs与anno_dirs的顺序必须要一一对应；
2.后续需要改进：所有数据一块读入，然后根据缺陷类别进行随机抽取，分割数据集未训练集为测试集。

二、训练模型

完成数据的准备之后就可以开始选择模型，并开始训练。
参考官方文档。
python tools/train.py ${CONFIG_FILE}
CONFIG_FILE代表的是模型配置文件的路径。
可选的参数：
--validate：代表在训练的过程中插入验证，默认每一个epoch后验证一次（但是单GP不支持这个参数）。
--work_dir：会覆盖在配置文件中指定的work_dir路径。work_dir主要用于保存模型的训练结果（包括权值文件、训练日志等）。
--resume_from：从现有的模型权值文件开始训练。主要可以解决训练到一半因为意外情况终止的问题（停电啥的）。有别于load_from，load_from就是相当于重新开始，当前的权值文件相当于是预训练的模型权值。
实际操作：
（1）选定模型：cascade_rcnn_r50_fpn_1x，
（2）修改配置文件中的相关信息，配置文件地址：./mmdetection/configs/cascade_rcnn_r50_fpn_1x.py
1>修改num_classes（有两处）：数值为类别数+1，到我的数据集上则为21。
2>修改图片尺寸（如果原图尺寸过大的话）img_scale：到我的数据集以及计算力情况，设置为(1223,500)
3>修改相关的路径ann_file：到我的本地路径则为，annotations/instances_train2017.json
4>修改学习率lr：lr = 0.00125*batch_size，到我这则为0.0025
（3）开始训练，切换工作目录到./mmdetection，运行指令python tools/train.py configs/cascade_rcnn_r50_fpn_1x.py --gpus 1

三、测试模型

有了训练好的模型之后就可以开始测试了，参考同上。
python tools/test.py ${CONFIG_FILE} ${CHECKPOINT_FILE} [--out ${RESULT_FILE}] [--eval ${EVAL_METRICS}] [--show]
其中：
（1）CONFIG_FILE指的是配置文件所在的路径
（2）CHECKPOINT_FILE指的是模型文件所在的路径
（3）可选项：
1>--out: 然后带上out文件的路径，为.pkl格式
2>--eval:需要评估的项目，包括：proposal_fast, proposal, bbox, segm, keypoints
3>--show:会以可视化的方式展示结果，也即图像和测试的值：类别、回归框等。

实例：工作目录到./mmdetection。
python tools/test.py config/cascade_rcnn_r50_fpn_1x.py ./checkpoints/cascade_rcnn_r50_fpn_1x/epoch_12.pth --out ./checkpoints/cascade_rcnn_r50_fpn_1x/results.pkl --eval bbox

四、测试某给定的图片

from mmdet.apis import init_detector, inference_detector, show_result
import mmcv

config_file = 'configs/faster_rcnn_r50_fpn_1x.py'
checkpoint_file = 'checkpoints/faster_rcnn_r50_fpn_1x_20181010-3d1b3351.pth'

# build the model from a config file and a checkpoint file
model = init_detector(config_file, checkpoint_file, device='cuda:0')

# test a single image and show the results
img = 'test.jpg'  # or img = mmcv.imread(img), which will only load it once
result = inference_detector(model, img)
# visualize the results in a new window
show_result(img, result, model.CLASSES)
# or save the visualization results to image files
show_result(img, result, model.CLASSES, out_file='result.jpg')

# test a video and show the results
video = mmcv.VideoReader('video.mp4')
for frame in video:
    result = inference_detector(model, frame)
    show_result(frame, result, model.CLASSES, wait_time=1)

五、修改CLASS_NAME

参考这里。
1../mmdetection/mmdet/datasets/coco.py，将CLASSES修改为自己的类名，如下图所示。

image.png

2../mmdetection/mmdet/core/evaluation/class_names.py修改coco_classes数据集类别。如下图所示。