血_影

『计算机视觉』Mask-RCNN_训练网络其一：数据集与Dataset类

代码位置
一、原始数据信息录入
二、数据信息整理
- 类别信息记录
- 图片信息记录
三、获取图片
小结

Github地址：Mask_RCNN
『计算机视觉』Mask-RCNN_论文学习
『计算机视觉』Mask-RCNN_项目文档翻译
『计算机视觉』Mask-RCNN_推断网络其一：总览
『计算机视觉』Mask-RCNN_推断网络其二：基于ReNet101的FPN共享网络
『计算机视觉』Mask-RCNN_推断网络其三：RPN锚框处理和Proposal生成
『计算机视觉』Mask-RCNN_推断网络其四：FPN和ROIAlign的耦合
『计算机视觉』Mask-RCNN_推断网络其五：目标检测结果精炼
『计算机视觉』Mask-RCNN_推断网络其六：Mask生成
『计算机视觉』Mask-RCNN_推断网络终篇：使用detect方法进行推断
『计算机视觉』Mask-RCNN_锚框生成
『计算机视觉』Mask-RCNN_训练网络其一：数据集与Dataset类
『计算机视觉』Mask-RCNN_训练网络其二：train网络结构&损失函数
『计算机视觉』Mask-RCNN_训练网络其三：训练Model

本节介绍的数据集class构建为官方demo，对从零开始构建自己的数据集训练感兴趣的建议了解了本文及本文对应的代码文件后，看一下『计算机视觉』Mask-RCNN_关键点检测分支介绍了由自己的数据构建Mask RCNN可用形式的实践。

代码位置

在脚本train_shapes.ipynb中，作者演示了使用合成图片进行训练Mask_RCNN的小demo，我们将以此为例，从训练数据的角度重新审视Mask_RCNN。

在训练过程中，我们最先要做的根据我们自己的数据集，集成改写基础的数据读取class：util.py中的Dataset class，然后根据数据集调整网络配置文件配置config.py中的Config 类，使得网络形状配适数，然后再去考虑训练的问题。按照逻辑流程，本节我们以train_shapes.ipynb中的数据生成为例，学习Dataset class的运作机理。

在示例程序中，首先创建新的Dataset的子类（这里贴出整个class代码，后面会分节讲解）：

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

class ShapesDataset(utils.Dataset):

"""Generates the shapes synthetic dataset. The dataset consists of simple

shapes (triangles, squares, circles) placed randomly on a blank surface.

The images are generated on the fly. No file access required.

"""

def load_shapes(self, count, height, width):

"""Generate the requested number of synthetic images.

count: number of images to generate.

height, width: the size of the generated images.

"""

# Add classes

self.add_class("shapes", 1, "square")

self.add_class("shapes", 2, "circle")

self.add_class("shapes", 3, "triangle")

# Add images

# Generate random specifications of images (i.e. color and

# list of shapes sizes and locations). This is more compact than

# actual images. Images are generated on the fly in load_image().

for i in range(count):

bg_color, shapes = self.random_image(height, width)

self.add_image("shapes", image_id=i, path=None,

width=width, height=height,

bg_color=bg_color, shapes=shapes)

def load_image(self, image_id):

"""Generate an image from the specs of the given image ID.

Typically this function loads the image from a file, but

in this case it generates the image on the fly from the

specs in image_info.

"""

info = self.image_info[image_id]

bg_color = np.array(info['bg_color']).reshape([1, 1, 3])

image = np.ones([info['height'], info['width'], 3], dtype=np.uint8)

image = image * bg_color.astype(np.uint8)

for shape, color, dims in info['shapes']:

image = self.draw_shape(image, shape, dims, color)

return image

def image_reference(self, image_id):

"""Return the shapes data of the image."""

info = self.image_info[image_id]

if info["source"] == "shapes":

return info["shapes"]

else:

super(self.__class__).image_reference(self, image_id)

def load_mask(self, image_id):

"""Generate instance masks for shapes of the given image ID.

"""

info = self.image_info[image_id]

shapes = info['shapes']

count = len(shapes)

mask = np.zeros([info['height'], info['width'], count], dtype=np.uint8)

for i, (shape, _, dims) in enumerate(info['shapes']):

mask[:, :, i:i+1] = self.draw_shape(mask[:, :, i:i+1].copy(),

shape, dims, 1)

# Handle occlusions

occlusion = np.logical_not(mask[:, :, -1]).astype(np.uint8)

for i in range(count-2, -1, -1):

mask[:, :, i] = mask[:, :, i] * occlusion

occlusion = np.logical_and(occlusion, np.logical_not(mask[:, :, i]))

# Map class names to class IDs.

class_ids = np.array([self.class_names.index(s[0]) for s in shapes])

return mask.astype(np.bool), class_ids.astype(np.int32)

def draw_shape(self, image, shape, dims, color):

"""Draws a shape from the given specs."""

# Get the center x, y and the size s

x, y, s = dims

if shape == 'square':

cv2.rectangle(image, (x-s, y-s), (x+s, y+s), color, -1)

elif shape == "circle":

cv2.circle(image, (x, y), s, color, -1)

elif shape == "triangle":

points = np.array([[(x, y-s),

(x-s/math.sin(math.radians(60)), y+s),

(x+s/math.sin(math.radians(60)), y+s),

]], dtype=np.int32)

cv2.fillPoly(image, points, color)

return image

def random_shape(self, height, width):

"""Generates specifications of a random shape that lies within

the given height and width boundaries.

Returns a tuple of three valus:

* The shape name (square, circle, ...)

* Shape color: a tuple of 3 values, RGB.

* Shape dimensions: A tuple of values that define the shape size

and location. Differs per shape type.

"""

# Shape

shape = random.choice(["square", "circle", "triangle"])

# Color

color = tuple([random.randint(0, 255) for _ in range(3)])

# Center x, y

buffer = 20

y = random.randint(buffer, height - buffer - 1)

x = random.randint(buffer, width - buffer - 1)

# Size

s = random.randint(buffer, height//4)

return shape, color, (x, y, s)

def random_image(self, height, width):

"""Creates random specifications of an image with multiple shapes.

Returns the background color of the image and a list of shape

specifications that can be used to draw the image.

"""

# Pick random background color

bg_color = np.array([random.randint(0, 255) for _ in range(3)])

# Generate a few random shapes and record their

# bounding boxes

shapes = []

boxes = []

N = random.randint(1, 4)

for _ in range(N):

shape, color, dims = self.random_shape(height, width)

shapes.append((shape, color, dims))

x, y, s = dims

boxes.append([y-s, x-s, y+s, x+s])

# Apply non-max suppression wit 0.3 threshold to avoid

# shapes covering each other

keep_ixs = utils.non_max_suppression(np.array(boxes), np.arange(N), 0.3)

shapes = [s for i, s in enumerate(shapes) if i in keep_ixs]

return bg_color, shapes

一、原始数据信息录入

然后调用如下方法（IMAGE_SHAPE=[128 128 3]，介绍config时会提到），准备训练用数据和验证集数据，注意，此时仅仅是在做准备并未真实的生成或读入图片数据，

# Training dataset

dataset_train = ShapesDataset()

dataset_train.load_shapes(500, config.IMAGE_SHAPE[0], config.IMAGE_SHAPE[1])

dataset_train.prepare()

# Validation dataset

dataset_val = ShapesDataset()

dataset_val.load_shapes(50, config.IMAGE_SHAPE[0], config.IMAGE_SHAPE[1])

dataset_val.prepare()

其调用的load_shapes方法如下：

def load_shapes(self, count, height, width):

"""Generate the requested number of synthetic images.

count: number of images to generate.

height, width: the size of the generated images.

"""

# Add classes

self.add_class("shapes", 1, "square")

self.add_class("shapes", 2, "circle")

self.add_class("shapes", 3, "triangle")

# Add images

# Generate random specifications of images (i.e. color and

# list of shapes sizes and locations). This is more compact than

# actual images. Images are generated on the fly in load_image().

for i in range(count):

bg_color, shapes = self.random_image(height, width)

self.add_image("shapes", image_id=i, path=None,

width=width, height=height,

bg_color=bg_color, shapes=shapes)

这里涉及了两个父类继承来的方法self.add_class和self.add_image，我们去util.py中的Dataset class看一看，

class Dataset(object):

"""The base class for dataset classes.

To use it, create a new class that adds functions specific to the dataset

you want to use. For example:

class CatsAndDogsDataset(Dataset):

def load_cats_and_dogs(self):

...

def load_mask(self, image_id):

...

def image_reference(self, image_id):

...

See COCODataset and ShapesDataset as examples.

"""

def __init__(self, class_map=None):

self._image_ids = []

self.image_info = []

# Background is always the first class

self.class_info = [{"source": "", "id": 0, "name": "BG"}]

self.source_class_ids = {}

def add_class(self, source, class_id, class_name):

assert "." not in source, "Source name cannot contain a dot"

# Does the class exist already?

for info in self.class_info:

if info['source'] == source and info["id"] == class_id:

# source.class_id combination already available, skip

return

# Add the class

self.class_info.append({

"source": source,

"id": class_id,

"name": class_name,

})

def add_image(self, source, image_id, path, **kwargs):

image_info = {

"id": image_id,

"source": source,

"path": path,

}

image_info.update(kwargs)

self.image_info.append(image_info)

也就是说，在Dataset中有self.image_info 和 self.class_info 两个list，它们的元素都是固定key的字典，

"source"对应数据集名称，

"id"对应本数据集内当前图片/类别标号

"path"仅image_info含有，对应图像路径，可为None

"name"仅class_info含有，对应类别描述

在后面的prepare方法中我们可以进一步了解，使用source.id作key，可以索引到一个内建的新的internal id，这也像我们解释了为什么文档中说Mask_RCNN支持多个数据集同时训练的由来。

回到load_shapes方法，self.random_image方法为新建方法，这里作者使用算法生成图像做训练，该方法返回生成图像函数所需的随机参数，之后调用add_image时传入path为None，也是因为数据并非从磁盘读取，而是自己生成，并传入了额外的self.random_image方法返回的生成参数（我们不必关系具体参数是什么），作为字典参数解读，添加进self.image_info中，

for i in range(count):

bg_color, shapes = self.random_image(height, width)

self.add_image("shapes", image_id=i, path=None,

width=width, height=height,

bg_color=bg_color, shapes=shapes)

从这里，我们进一步了解了self.image_info的含义，记录每一张图片的id信息（"source"和"id"），记录每一张图片的数据信息（如何获取图像矩阵的线索，包含"path"或者其他的字典索引，只要保证后面能实现函数，根据这个信息获取图片数据即可）

二、数据信息整理

在初始化了 self.image_info 和 self.class_info 两个list之后，Dataset已经记录了原始的类别信息和图像信息，调用prepare方法进行规范化，

def prepare(self, class_map=None):

"""Prepares the Dataset class for use.

TODO: class map is not supported yet. When done, it should handle mapping

classes from different datasets to the same class ID.

"""

def clean_name(name):

"""Returns a shorter version of object names for cleaner display."""

return ",".join(name.split(",")[:1])

# Build (or rebuild) everything else from the info dicts.

self.num_classes = len(self.class_info) # 类别数目

self.class_ids = np.arange(self.num_classes) # internal 类别IDs

self.class_names = [clean_name(c["name"]) for c in self.class_info] # 类别名简洁版

self.num_images = len(self.image_info) # 图片数目

self._image_ids = np.arange(self.num_images) # internal 类别IDs

# Mapping from source class and image IDs to internal IDs

self.class_from_source_map = {"{}.{}".format(info['source'], info['id']): id

for info, id in zip(self.class_info, self.class_ids)}

self.image_from_source_map = {"{}.{}".format(info['source'], info['id']): id

for info, id in zip(self.image_info, self.image_ids)}

# Map sources to class_ids they support

self.sources = list(set([i['source'] for i in self.class_info]))

self.source_class_ids = {} # source对应的internal 类别IDs

# Loop over datasets

for source in self.sources:

self.source_class_ids[source] = []

# Find classes that belong to this dataset

for i, info in enumerate(self.class_info):

# Include BG class in all datasets

if i == 0 or source == info['source']:

self.source_class_ids[source].append(i)

类别信息记录

将"source.id"映射为唯一的internal IDs，并将全部的internal IDs存储在self.class_ids

source_class_ids，记录下每一个"source"对应的internal IDs

class_from_source_map，记录下"source.id"：internal IDs的映射关系

print(dataset_train.class_info) # 每个类别原始信息

print(dataset_train.class_ids) # 记录类别internal IDs

print(dataset_train.source_class_ids) # 每个数据集对应的internal IDs

print(dataset_train.class_from_source_map) # 原始信息和internal ID映射关系

输出如下：

[{'source': '', 'id': 0, 'name': 'BG'}, 
 {'source': 'shapes', 'id': 1, 'name': 'square'}, 
 {'source': 'shapes', 'id': 2, 'name': 'circle'}, 
 {'source': 'shapes', 'id': 3, 'name': 'triangle'}]
[0 1 2 3]
{'': [0], 'shapes': [0, 1, 2, 3]}
{'.0': 0, 'shapes.1': 1, 'shapes.2': 2, 'shapes.3': 3}

有固定的source为空的类别0（id和internal ID都是），标记为背景，会添加进source_class_ids中全部的数据集对应的类别中（上面"shape"数据集我们仅定义了3个类，在映射中多了一个0变成4个类）。

图片信息记录

图片信息不像类别一样麻烦，我们简单输出三张，

# Training dataset

dataset_train = ShapesDataset()

dataset_train.load_shapes(3, config.IMAGE_SHAPE[0], config.IMAGE_SHAPE[1])

dataset_train.prepare()

print(dataset_train.image_info) # 记录图像原始信息

print(dataset_train.image_ids) # 记录图像internal IDs

print(dataset_train.image_from_source_map) # 原始信息和internal ID对应关系

结果如下，

[{'id': 0, 'source': 'shapes', 'path': None, 'width': 128, 'height': 128, 'bg_color': array([163, 143, 173]), 
  'shapes': [('circle', (178, 140, 65), (83, 104, 20)), ('circle', (192, 52, 82), (48, 58, 20))]}, 
 {'id': 1, 'source': 'shapes', 'path': None, 'width': 128, 'height': 128, 'bg_color': array([ 5, 99, 71]), 
  'shapes': [('triangle', (90, 32, 55), (39, 21, 22)), ('circle', (214, 49, 173), (39, 78, 21))]}, 
 {'id': 2, 'source': 'shapes', 'path': None, 'width': 128, 'height': 128, 'bg_color': array([138,  52,  83]), 
  'shapes': [('circle', (180, 74, 150), (105, 45, 27))]}]
[0 1 2]
{'shapes.0': 0, 'shapes.1': 1, 'shapes.2': 2}

【注1】由于这是图像检测任务而非图像分类任务，故每张图片仅仅和归属数据集存在映射，和类别信息没有直接映射。图像上的目标和类别才存在映射关系，不过那不在本部分函数涉及范围内。

【注2】internal IDs实际上就是info的索引数组，使用internal IDs的值可以直接索引对应图片顺序的info信息。

总结，在调用self.prepare之前，通过自己的新建方法调用self.add_class()和self.add_image()，将图片和分类的原始信息以dict的形式添加到class_info与image_info两个list中，即可。

三、获取图片

然后我们获取一些样例图片进行展示，

# Load and display random samples

image_ids = np.random.choice(dataset_train.image_ids, 4)

for image_id in image_ids:

image = dataset_train.load_image(image_id)

mask, class_ids = dataset_train.load_mask(image_id)

visualize.display_top_masks(image, mask, class_ids, dataset_train.class_names)

print(image.shape, mask.shape, class_ids, dataset_train.class_names)

由上面代码我们可以获悉如下信息：

使用self.image.ids即internal IDs进行图片选取

自行实现load_image方法，获取图片internal IDs，索引图片原始信息（info），利用原始信息输出图片

自行实现load_mask方法，获取图片internal IDs，索引图片原始信息（info），利用原始信息输出图片的masks和对应internal类别，注意一张图片可以有多个mask并分别对应自己的类别

上述代码输出如下（仅展示前两张），

下面贴出load_image和load_mask方法（详见train_shapes.ipynb），具体实现不是重点，毕竟我们也不是在研究怎么画2D图，重点在于上面提到的它们的功能，这涉及到我们迁移到自己的数据时如何实现接口。load_image方法返回一张图片，load_mask方法返回（h，w，c）的01掩码以及（c，）的class id，注意，c指的是盖章图片中instance的数目

def load_image(self, image_id):

"""Generate an image from the specs of the given image ID.

Typically this function loads the image from a file, but

in this case it generates the image on the fly from the

specs in image_info.

"""

info = self.image_info[image_id]

bg_color = np.array(info['bg_color']).reshape([1, 1, 3])

image = np.ones([info['height'], info['width'], 3], dtype=np.uint8)

image = image * bg_color.astype(np.uint8)

for shape, color, dims in info['shapes']:

image = self.draw_shape(image, shape, dims, color)

return image

def load_mask(self, image_id):

"""Generate instance masks for shapes of the given image ID.

"""

info = self.image_info[image_id]

shapes = info['shapes']

count = len(shapes)

mask = np.zeros([info['height'], info['width'], count], dtype=np.uint8)

for i, (shape, _, dims) in enumerate(info['shapes']):

mask[:, :, i:i+1] = self.draw_shape(mask[:, :, i:i+1].copy(),

shape, dims, 1)

# Handle occlusions

occlusion = np.logical_not(mask[:, :, -1]).astype(np.uint8)

for i in range(count-2, -1, -1):

mask[:, :, i] = mask[:, :, i] * occlusion

occlusion = np.logical_and(occlusion, np.logical_not(mask[:, :, i]))

# Map class names to class IDs.

class_ids = np.array([self.class_names.index(s[0]) for s in shapes])

return mask.astype(np.bool), class_ids.astype(np.int32)

小结

正如Dataset注释所说，要想运行自己的数据集，我们首先要实现一个方法（load_shapes，根据数据集取名即可）收集原始图像、类别信息，然后实现两个方法（load_image、load_mask）分别实现获取单张图片数据、获取单张图片对应的objs的masks和classes，这样基本完成了数据集类的构建。

The base class for dataset classes.
To use it, create a new class that adds functions specific to the dataset
you want to use. For example:

class CatsAndDogsDataset(Dataset):
    def load_cats_and_dogs(self):
        ...
    def load_mask(self, image_id):
        ...
    def image_reference(self, image_id):
        ...

See COCODataset and ShapesDataset as examples.

重点来了，训练自己的数据集

工程目录如下图所示：

train.py

# # Mask R-CNN - Train on Shapes Dataset
# 
# 
# This notebook shows how to train Mask R-CNN on your own dataset. To keep things simple we use a synthetic
# dataset of shapes (squares, triangles, and circles) which enables fast training. You'd still need a GPU,
# though, because the network backbone is a Resnet101, which would be too slow to train on a CPU. On a GPU,
# you can start to get okay-ish results in a few minutes, and good results in less than an hour.
# The code of the *Shapes* dataset is included below. It generates images on the fly, so it doesn't require
# downloading any data. And it can generate images of any size, so we pick a small image size to train faster.

# In[1]:

import os
import sys
import random
import math
import re
import time
import numpy as np
import cv2
import yaml
from PIL import Image
import matplotlib
import matplotlib.pyplot as plt
import tensorflow as tf

# Root directory of the project
ROOT_DIR = os.path.abspath("../")
sys.path.append(ROOT_DIR)  # To find local version of the library

# Import Mask RCNN
from mrcnn.config import Config
from mrcnn import utils
import mrcnn.model as modellib
from mrcnn import visualize
from mrcnn.model import log

# Directory to save logs and trained model
MODEL_DIR = os.path.join(ROOT_DIR, "logs")

# Local path to trained weights file
COCO_MODEL_PATH = os.path.join(ROOT_DIR, "mask_rcnn_coco.h5")
# Download COCO trained weights from Releases if needed
if not os.path.exists(COCO_MODEL_PATH):
    utils.download_trained_weights(COCO_MODEL_PATH)

# Directory to save logs and model checkpoints, if not provided
# through the command line argument --logs
DEFAULT_LOGS_DIR = os.path.join(ROOT_DIR, "logs")


# ## Configurations

# In[2]:


class ShapesConfig(Config):
    """Configuration for training on the bceasy dataset.
    Derives from the base Config class and overrides values specific
    to the toy shapes dataset.
    """
    # Give the configuration a recognizable name
    NAME = "shapes"

    # Train on 1 GPU and 1 images per GPU. We can put multiple images on each
    # GPU because the images are small. Batch size is 1 (GPUs * images/GPU).
    GPU_COUNT = 1
    IMAGES_PER_GPU = 1

    # Number of classes (including background)
    NUM_CLASSES = 1 + 3  # background + 3 shapes(fg, fg1, fg2)

    # Use small images for faster training. Set the limits of the small side
    # the large side, and that determines the image shape.
    IMAGE_MIN_DIM = 320
    IMAGE_MAX_DIM = 320

    # Use smaller anchors because our image and objects are small
    RPN_ANCHOR_SCALES = (8 * 8, 16 * 8, 32 * 8, 64 * 8, 128 * 8)  # anchor side in pixels

    # Reduce training ROIs per image because the images are small and have
    # few objects. Aim to allow ROI sampling to pick 33% positive ROIs.
    TRAIN_ROIS_PER_IMAGE = 32

    # Use a small epoch since the data is simple
    STEPS_PER_EPOCH = 50

    # use small validation steps since the epoch is small
    VALIDATION_STEPS = 5


config = ShapesConfig()
config.display()


# ## Notebook Preferences

# In[3]:

def get_ax(rows=1, cols=1, size=8):
    """Return a Matplotlib Axes array to be used in
    all visualizations in the notebook. Provide a
    central point to control graph sizes.

    Change the default size attribute to control the size
    of rendered images
    """
    _, ax = plt.subplots(rows, cols, figsize=(size * cols, size * rows))
    return ax


# ## Dataset
#
# Create a synthetic dataset
#
# Extend the Dataset class and add a method to load the shapes dataset, `load_shapes()`, and override the following methods:
#
# * load_image()
# * load_mask()
# * image_reference()

# In[4]:
class ShapesDataset(utils.Dataset):
    """Generates the shapes synthetic dataset. The dataset consists of simple
    shapes (fg, fg1, fg2) placed randomly on a blank surface.
    The images are generated on the fly. No file access required.
    """

    def get_obj_index(self, image):
        '''
        get  instances in image
        :param image:
        :return:
        '''
        n = np.max(image)
        return n

    def from_yaml_get_class(self, image_id):
        '''
        get mask label from yaml
        :param image_id:
        :return: labels expect BG
        '''
        info = self.image_info[image_id]
        with open(info['yaml_path']) as f:
            temp = yaml.load(f.read())
            labels = temp['label_names']
            del labels[0]
        return labels

    # rewrite draw_mask
    def draw_mask(self, num_obj, mask, image, image_id):
        info = self.image_info[image_id]
        for index in range(num_obj):
            for i in range(info['width']):
                for j in range(info['height']):
                    pixel = image.getpixel((i, j))
                    if pixel == index + 1:
                        mask[j, i, index] = 1
        return mask

    def load_shapes(self, count, labelme_json_folder):
        '''
        Generate the requested number of synthetic images.
        :param count:number of images to generate
        :param train_img_folder:train image folder
        :param labelme_json_folder: labelme_json_to_dataset's folder
        :param imgs_list:train imgs
        :return:
        '''
        # Add classes
        self.add_class("shapes", 1, "fg")
        self.add_class("shapes", 2, "fg1")
        self.add_class("shapes", 3, "fg2")
        imgs_list = os.listdir(labelme_json_folder)
        # Add images
        # Generate random specifications of images (i.e. color and
        # list of shapes sizes and locations). This is more compact than
        # actual images. Images are generated on the fly in load_image().
        for i in range(count):
            file_name = imgs_list[i][:-5]
            mask_path = labelme_json_folder + "/" + file_name + "_json/label.png"
            yaml_path = labelme_json_folder + "/" + file_name + "_json/info.yaml"
            train_img_path = labelme_json_folder + "/" + file_name + "_json/img.png"
            train_img = cv2.imread(train_img_path)
            self.add_image("shapes", image_id=i, path=train_img_path,
                           width=train_img.shape[1], height=train_img.shape[0],
                           mask_path=mask_path, yaml_path=yaml_path)

    def image_reference(self, image_id):
        """Return the shapes data of the image."""
        info = self.image_info[image_id]
        if info["source"] == "shapes":
            return info["shapes"]
        else:
            super(self.__class__).image_reference(self, image_id)

    # rewrite load_mask
    def load_mask(self, image_id):
        """Generate instance masks for shapes of the given image ID.
        """
        info = self.image_info[image_id]
        # number of object
        img = Image.open(info['mask_path'])
        num_obj = self.get_obj_index(img)
        mask = np.zeros([info['height'], info['width'], num_obj], dtype=np.uint8)
        mask = self.draw_mask(num_obj, mask, img, image_id)
        occlusion = np.logical_not(mask[:, :, -1]).astype(np.uint8)
        for i in range(num_obj - 2, -1, -1):
            mask[:, :, i] = mask[:, :, i] * occlusion
            occlusion = np.logical_and(occlusion, np.logical_not(mask[:, :, i]))

        labels = []
        labels = self.from_yaml_get_class(image_id)
        labels_form = []
        for i in range(len(labels)):
            if labels[i].find("fg") != -1:
                labels_form.append("fg")
            elif labels[i].find("fg1") != -1:
                labels_form.append("fg1")
            elif labels[i].find("fg2") != -1:
                labels_form.append("fg2")

        class_ids = np.array([self.class_names.index(s) for s in labels_form])
        return mask, class_ids.astype(np.int32)

    def draw_shape(self, image, shape, dims, color):
        """Draws a shape from the given specs."""
        # Get the center x, y and the size s
        x, y, s = dims
        if shape == 'square':
            cv2.rectangle(image, (x - s, y - s), (x + s, y + s), color, -1)
        elif shape == "circle":
            cv2.circle(image, (x, y), s, color, -1)
        elif shape == "triangle":
            points = np.array([[(x, y - s),
                                (x - s / math.sin(math.radians(60)), y + s),
                                (x + s / math.sin(math.radians(60)), y + s),
                                ]], dtype=np.int32)
            cv2.fillPoly(image, points, color)
        return image

    def random_shape(self, height, width):
        """Generates specifications of a random shape that lies within
        the given height and width boundaries.
        Returns a tuple of three valus:
        * The shape name (square, circle, ...)
        * Shape color: a tuple of 3 values, RGB.
        * Shape dimensions: A tuple of values that define the shape size
                            and location. Differs per shape type.
        """
        # Shape
        shape = random.choice(["fg", "fg1", "fg2"])
        # Color
        color = tuple([random.randint(0, 255) for _ in range(3)])
        # Center x, y
        buffer = 20
        y = random.randint(buffer, height - buffer - 1)
        x = random.randint(buffer, width - buffer - 1)
        # Size
        s = random.randint(buffer, height // 4)
        return shape, color, (x, y, s)

    def random_image(self, height, width):
        """Creates random specifications of an image with multiple shapes.
        Returns the background color of the image and a list of shape
        specifications that can be used to draw the image.
        """
        # Pick random background color
        bg_color = np.array([random.randint(0, 255) for _ in range(3)])
        # Generate a few random shapes and record their
        # bounding boxes
        shapes = []
        boxes = []
        N = random.randint(1, 4)
        for _ in range(N):
            shape, color, dims = self.random_shape(height, width)
            shapes.append((shape, color, dims))
            x, y, s = dims
            boxes.append([y - s, x - s, y + s, x + s])
        # Apply non-max suppression wit 0.3 threshold to avoid
        # shapes covering each other
        keep_ixs = utils.non_max_suppression(np.array(boxes), np.arange(N), 0.3)
        shapes = [s for i, s in enumerate(shapes) if i in keep_ixs]
        return bg_color, shapes

# In[5]:
# Training dataset
train_labelme_json_folder = os.path.join(ROOT_DIR, "dataset\\bcdata\\Training_dataset")
print('mask_folder', train_labelme_json_folder)
train_imgs_list = os.listdir(train_labelme_json_folder)
train_imgs_count = len(train_imgs_list)
print('train_imgs_count', train_imgs_count)
dataset_train = ShapesDataset()
dataset_train.load_shapes(train_imgs_count, train_labelme_json_folder)
dataset_train.prepare()

# Validation dataset
val_labelme_json_folder = os.path.join(ROOT_DIR, "dataset\\bcdata\\Validation_dataset")
print('val_labelme_json_folder', val_labelme_json_folder)
val_imgs_list = os.listdir(val_labelme_json_folder)
val_imgs_count = len(val_imgs_list)
print('val_imgs_count', val_imgs_count)
dataset_val = ShapesDataset()
dataset_val.load_shapes(val_imgs_count, val_labelme_json_folder)
dataset_val.prepare()

# ## Create Model

# In[7]:


# Create model in training mode
model = modellib.MaskRCNN(mode="training", config=config,
                          model_dir=MODEL_DIR)

# In[ ]:


# In[8]:


# Which weights to start with?
init_with = "coco"  # imagenet, coco, or last

if init_with == "imagenet":
    model.load_weights(model.get_imagenet_weights(), by_name=True)
elif init_with == "coco":
    # Load weights trained on MS COCO, but skip layers that
    # are different due to the different number of classes
    # See README for instructions to download the COCO weights
    model.load_weights(COCO_MODEL_PATH, by_name=True,
                       exclude=["mrcnn_class_logits", "mrcnn_bbox_fc",
                                "mrcnn_bbox", "mrcnn_mask"])
elif init_with == "last":
    # Load the last model you trained and continue training
    model.load_weights(model.find_last(), by_name=True)

# ## Training
#
# Train in two stages:
# 1. Only the heads. Here we're freezing all the backbone layers and training only the randomly initialized layers (i.e. the ones that we didn't use pre-trained weights from MS COCO). To train only the head layers, pass `layers='heads'` to the `train()` function.
#
# 2. Fine-tune all layers. For this simple example it's not necessary, but we're including it to show the process. Simply pass `layers="all` to train all layers.

# In[ ]:


# Train the head branches
# Passing layers="heads" freezes all layers except the head
# layers. You can also pass a regular expression to select
# which layers to train by name pattern.
model.train(dataset_train, dataset_val,
            learning_rate=config.LEARNING_RATE,
            epochs=20,
            layers='heads')

# In[ ]:


# Fine tune all layers
# Passing layers="all" trains all layers. You can also
# pass a regular expression to select which layers to
# train by name pattern.
model.train(dataset_train, dataset_val,
            learning_rate=config.LEARNING_RATE / 10,
            epochs=20,
            layers="all")

test.py

import os
import sys
import random
import math
import numpy as np
import skimage.io
import matplotlib
import matplotlib.pyplot as plt
import cv2
from PIL import Image
import time
# Root directory of the project
ROOT_DIR = os.path.abspath("../")
# Import Mask RCNN
sys.path.append(ROOT_DIR)  # To find local version of the library
from mrcnn.config import Config
from datetime import datetime 

from mrcnn import utils
import mrcnn.model as modellib
from mrcnn import visualize

# Directory to save logs and trained model
MODEL_DIR = os.path.join(ROOT_DIR, "logs")

# Local path to trained weights file
COCO_MODEL_PATH = os.path.join(ROOT_DIR ,"mask_rcnn_shapes_0034.h5")

# Directory of images to run detection on
IMAGE_DIR = os.path.join(ROOT_DIR, "images")

class ShapesConfig(Config):
    """Configuration for training on the toy shapes dataset.
    Derives from the base Config class and overrides values specific
    to the toy shapes dataset.
    """
    # Give the configuration a recognizable name
    NAME = "shapes"

    # Train on 1 GPU and 8 images per GPU. We can put multiple images on each
    # GPU because the images are small. Batch size is 8 (GPUs * images/GPU).
    GPU_COUNT = 2
    IMAGES_PER_GPU = 1

    # Number of classes (including background)
    NUM_CLASSES = 1 + 3  # background + 3 shapes(fg, fg1, fg2)

    # Use small images for faster training. Set the limits of the small side
    # the large side, and that determines the image shape.
    IMAGE_MIN_DIM = 320
    IMAGE_MAX_DIM = 320

    # Use smaller anchors because our image and objects are small
    RPN_ANCHOR_SCALES = (8 * 8, 16 * 8, 32 * 8, 64 * 8, 128 * 8)  # anchor side in pixels

    # Reduce training ROIs per image because the images are small and have
    # few objects. Aim to allow ROI sampling to pick 33% positive ROIs.
    TRAIN_ROIS_PER_IMAGE = 32

    # Use a small epoch since the data is simple
    STEPS_PER_EPOCH = 100

    # use small validation steps since the epoch is small
    VALIDATION_STEPS = 5

def get_instances_mask(image, boxes, masks, class_ids, class_names, scores=None):
    """
    boxes: [num_instance, (y1, x1, y2, x2, class_id)] in image coordinates.
    masks: [height, width, num_instances]
    class_ids: [num_instances]
    class_names: list of class names of the dataset
    scores: (optional) confidence scores for each box
    """
    # Number of instances
    N = boxes.shape[0]
    if not N:
        print("\n*** No instances to display *** \n")
    else:
        assert boxes.shape[0] == masks.shape[-1] == class_ids.shape[0]

    # Show area outside image boundaries.
    height, width = image.shape[:2]
    img = np.zeros((height, width, 1), np.uint8)
    masked_image = image.astype(np.uint32).copy()
    for i in range(N):
        # Bounding box
        if not np.any(boxes[i]):
            # Skip this instance. Has no bbox. Likely lost in image cropping.
            continue
        y1, x1, y2, x2 = boxes[i]
        # Mask
        mask = masks[:, :, i]
        # Mask Polygon
        # Pad to ensure proper polygons for masks that touch image edges.
        padded_mask = np.zeros((mask.shape[0] + 2, mask.shape[1] + 2), dtype=np.uint8)
        padded_mask[1:-1, 1:-1] = mask
        image, contours, hierarchy = cv2.findContours(padded_mask, cv2.RETR_TREE, cv2.CHAIN_APPROX_SIMPLE)
        img = cv2.drawContours(img, contours, -1, (255), cv2.FILLED)
    return img

def get_instances(image, mask):
    """
    Apply the given mask to the image.
    """
    for c in range(3):
        image[:, :, c] = np.where(mask <= 128, 255, image[:, :, c])
    return image

#import train_tongue
class InferenceConfig(ShapesConfig):
    # Set batch size to 1 since we'll be running inference on
    # one image at a time. Batch size = GPU_COUNT * IMAGES_PER_GPU
    GPU_COUNT = 1
    IMAGES_PER_GPU = 1

config = InferenceConfig()
config.display()

# Create model object in inference mode.
model = modellib.MaskRCNN(mode="inference", model_dir=MODEL_DIR, config=config)

# Load weights trained on MS-COCO
model.load_weights(COCO_MODEL_PATH, by_name=True)

# COCO Class names
# Index of the class in the list is its ID. For example, to get ID of
# the teddy bear class, use: class_names.index('teddy bear')
class_names = ['BG', 'fg', 'fg1', 'fg2']
# Load a test image from the images folder
file_names = os.path.join(IMAGE_DIR, '1168231109623.jpg')
img = cv2.imread(file_names)
img = cv2.resize(img, (720, 720))
img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
img_copy = img.copy()
t1 = time.time()
results = model.detect([img], verbose=1)
t2 = time.time()
print("cost time:", t2 - t1)
r = results[0]

#visualize.display_instances(img, r['rois'], r['masks'], r['class_ids'], class_names, r['scores'])
mask_img = get_instances_mask(img, r['rois'], r['masks'], r['class_ids'], class_names, r['scores'])
cv2.imshow('mask_img', mask_img)
mask_img = cv2.medianBlur(mask_img, 5)
instances = get_instances(img_copy, mask_img)

instances = cv2.cvtColor(instances, cv2.COLOR_RGB2BGR)
cv2.imshow('image', instances)


cv2.waitKey(0)

返利可信吗?大家觉得返利APP靠谱吗? 氧惠好项目
直返平台上的商品是否为正品，这是许多用户关心的问题。我们首先要明确的是，直返平台对所有入驻的商家都有严格的要求，包括但不限于资质审查、产品质量把控等方面。我们秉持着对用户负责的态度，尽我们最大的努力确保平台上销售的商品为正品。氧惠APP（带货领导者）——是与以往完全不同的抖客+淘客app！2023全新模式，我的直推也会放到你下面。主打：带货高补贴，深受各位带货团队长喜爱（每天出单带货几十万单）。注
亚马逊优惠券如何叠加？氧惠好项目
亚马逊的优惠券可以叠加使用，但需要注意以下几点：领购物大额优惠券、赚返利佣金用氧惠~氧惠APP（带货领导者）——是与以往完全不同的抖客+淘客app！2023全新模式，我的直推也会放到你下面。主打：带货高补贴，深受各位带货团队长喜爱（每天出单带货几十万单）氧惠是公认的返利最好用的软件。注册即可享受高补贴+0撸+捡漏等带货新体验。氧惠邀请码888999，送万元推广大礼包，教你如何1年做到百万团队。优惠
你的光芒，我看得见风之子的黄昏
手机拍于晨跑路上我还不能够靠近你从世俗意义上讲请让我保留现实的躯体为了那些尘世的事物而灵魂是属于你的在每一个夜晚每一个沉思的片刻笨重的身体，轻盈的灵魂思想的苦痛受制于现实的一切我依然是理想主义者渴望这个世界多一些温度渴望这个世界多一些纯粹就像那些古典的爱情我们是尘世中的安静者是世俗中的被压制者人们对于美好的光芒总不适应黑暗让他们感到自在亲爱的，请保持安静与沉默请用一生保持安静与沉默就像那深邃的夜空
JS 柯里化 (Currying)：函数参数的偏应用与函数复用
各位程序猿，大家好！我是你们今天下午的JS柯里化专题讲座讲师，叫我老王就行。今天咱们不搞虚的，直接上干货，聊聊JS里一个听起来高大上，用起来贼好使的技术——柯里化（Currying）。开场白：柯里化，你别怕，它真不难！很多人一听到“柯里化”三个字，就感觉像进了什么魔法学院，满眼都是咒语和符文，恨不得直接逃课。淡定！柯里化其实没那么可怕，它只是把一个接受多个参数的函数，变成一系列接受单个参数的函数。
全链路压测：影子库与影子表之争阿里巴巴中间件数据库分布式 java 人工智能大数据
01业界盛传的全链路压测是什么Aliware全链路压测诞生于阿里巴巴双11备战过程，如果说双11大促是阿里业务的“期末考试”，全链路压测就是大考前的“模拟考试”，诞生后被誉为双11稳定性保障的“核武器”。全链路压测通过在生产环境对业务大流量场景进行高仿真模拟，获取最真实的线上实际承载能力、执行精准的容量规划，确保系统可用性。分布式架构和业务快速发展给业务系统带来了不确定性。分布式环境的任意节点都可
拉肚，肚拉 5123212
今天拉肚子了，刚刚，两次了。中午在饭店吃饭，吃了牛肉，水煮虾，雪里蕻，西兰花，蒸鲟鱼，再就是喝了一些鸡汤，再没有什么了。晚上吃了凉拌的豆芽木耳，一个梨。一天喝的都是赤小豆薏米水。应该是食物不卫生或者没有熟透吧。不过，我心里有些窃喜。我好久没有拉过肚子了，猛地一下，感觉十分新鲜。还有一个，我觉得我需要有这样大量的外排。自己的身体里，心里堆积了好多的垃圾，好多的沉重，好多的渴求与孤寂，都排一排何尝不是
高并发场景下的技术压测与问题排查：P7面试官考核并发优化方案淳淳同学 Java面试场景题 Java 高并发压测 GC 性能优化
文章标题：“Java求职者面试：高并发场景下的技术压测与问题排查”Tag：Java,高并发,压测,GC,性能优化场景描述：面试官（张工）：一位严肃且专业的Java面试官，专注于高并发和性能优化领域，对技术细节有着深刻的理解。小兰：一名年轻但略显紧张的Java程序员，面试前虽然做了准备，但在复杂问题上显得有些犹豫和含糊。第一轮提问：基础知识与高并发场景引入张工：小兰，你好，很高兴见到你。我们先从简单
深入理解设计模式之外观模式：简化复杂系统的艺术 vvilkin的学习备忘设计模式设计模式外观模式
为什么需要外观模式？在软件开发中，我们经常会遇到这样的情况：一个功能需要调用多个子系统或复杂的类结构来完成。随着系统规模的扩大，子系统之间的交互变得越来越复杂，客户端代码需要了解每个子系统的细节才能正确使用它们。这不仅增加了代码的复杂度，也使得系统难以维护和扩展。想象一下，你每次开车都需要手动控制发动机的点火时机、燃油喷射量、气门开闭时间等所有细节，而不是简单地转动钥匙或按下启动按钮，这将是多么繁
新商业女性：开启女性商业新格局时光拾贝
新商业女性：开启女性商业新格局新商业女性深知社群是最接近用户的地方，在社群中，新商业女性发现当代女性在社会身份诉求中，拥有多重身份，比如在羁绊关系中，存在亲密关系、身心灵、家庭诉求；在商业关系中，存在职场、创业、资源诉求；在独立与价值实现中，存在公益、女性成长、独立意识诉求。基于这些诉求，新商业女性意识到人类的知识和认知水平存在结构性的差异，并开创性提出女性生活方式平台+社群互联网生态的商业设计，
DeepSeekMath：突破开源语言模型在数学推理中的极限 AI专题精讲强化学习人工智能强化学习 AI技术应用
温馨提示：本篇文章已同步至"AI专题精讲"DeepSeekMath：突破开源语言模型在数学推理中的极限摘要数学推理由于其复杂且结构化的特性，对语言模型构成了重大挑战。本文介绍了DeepSeekMath7B，该模型在DeepSeek-Coder-Base-v1.57B的基础上继续进行了预训练，使用了来自CommonCrawl的120B数学相关token，同时包含自然语言和代码数据。DeepSeekM
卡罗林斯卡学院与华大等团队联合发表人类、猪、小鼠大脑中的蛋白编码基因图谱尐尐呅
美国时间2020年3月5日，由卡罗林斯卡学院、瑞典皇家理工学院和华大等团队共同完成的一项题目为“人类、猪、小鼠大脑中的蛋白编码基因图谱”的研究发表于Science（影响因子41）。该研究基于多种转录组学方法和抗体图谱技术，对大脑不同区域进行了全面、深入的分子解析，并且提供了高质量的蛋白编码基因的分子图谱，为进一步研究提供了有力的武器。该研究成功地构建了哺乳动物大脑的基因图谱，是对现有的若干个大脑图
MySQL 配置性能优化实操指南：分版本5.7和8.0适配方案挑战者666888 mysql 《Java面试精选》adb mysql 性能优化服务器数据库 linux 运维
在MySQL性能优化中，不同版本的特性差异会直接影响优化效果。本文基于MySQL5.7和8.0两个主流版本，通过版本适配的配置代码、场景举例和通俗解释，让优化方案更精准落地。一、硬件与系统配置优化（基础层优化）1.服务器硬件选型实战建议CPU：高并发场景优先选多核CPU（如16核IntelXeon），但避免盲目堆核（MySQL5.7对超32核利用率下降明显，8.0有显著改进）。举例：电商秒杀服务器
告别手动引入！PHP自动加载终极指南，效率提升90% Jay_MIng php android 开发语言 linux nginx java python
在没有自动加载机制的前提下，想要使用不同文件的类时，需要逐个手动将文件引入才行require'classes/MyClass.php';//......$obj=newMyClass();这种情况会导致维护困难，随着项目扩大变得难以管理。因此自动加载是PHP中一种重要的机制。自动加载允许在首次使用类时动态加载类文件，而不需要手动包含每个类文件PHP中推荐使用spl_autoload_registe
分布式定时器：原理设计与技术挑战你一身傲骨怎能输架构设计分布式
文章摘要分布式定时器用于在分布式系统中可靠、准确地触发定时任务，常见实现方案包括：基于数据库/消息队列的定时扫描、分布式任务调度框架（如Quartz集群、xxl-job）、时间轮/延迟队列（如Redis/Kafka）以及Zookeeper/Etcd协调服务。主要技术挑战包括时钟同步、任务幂等、高可用、负载均衡和故障恢复等。核心难点在于保证任务唯一性、调度精度与分布式一致性，技术选型需权衡轻量级（R
应用层流量与缓存累积延迟解析你一身傲骨怎能输计算机网络缓存
文章摘要应用层流量指OSI模型中应用层协议（如HTTP、gRPC）产生的数据交互，常见于Web请求、微服务通信等场景。缓存累积延迟指多级缓存或消息队列机制中，各级延迟叠加导致数据更新滞后，例如数据库更新后，因消息队列、缓存刷新等环节延迟，用户最终看到的数据可能滞后数秒。两者分别描述了网络通信的数据流机制和分布式系统中的延迟问题。1.应用层流量应用层流量，一般指的是在网络通信的OSI七层模型中，**
极限高并发压测：P7架构师与应届生的JVM调优对决搞Java的小码农 Java面试场景题 Java面试高并发性能优化 JVM调优极限场景
文章标题：极限高并发压测：P7架构师与应届生的JVM调优对决场景描述在一个互联网大厂的终面环节，面试官决定通过模拟真实业务场景来考察候选人的技术深度和解决问题的能力。面试官是一位有着丰富经验的P7架构师，而候选人是刚刚毕业的应届生小兰，她擅长手写Tomcat并自认为对JVM有一定了解。面试的背景是一个极端的高并发场景，QPS从2000飙升至10万，同时伴随着内存泄漏问题和GC暂停时间的急剧增加。第
UE5网络联机函数 UE星空 UE蓝图 ue5
FindSessionsCreateSessionJoinSessionDestroySessionSteam是p2p直接联机一、steam提供的测试用AppIdAppId是steam为每一款游戏所设定的独有标识，每一款要上架steam的游戏都会拥有独一无二的AppId。不过为了方便开发者测试，steam提供了游戏名为SpaceWar的AppId480供大家免费使用。二、根据虚幻文档接入Onlin
第三方库xlrd,读取excel中的数据听MM的话
1、安装第三方库=============》xlrdpipinstallxlrd2、代码如下，封装成类的形式，方便调用，提高复用性importxlrdfromxlrdimportxldate_as_tuple'''xlrd中单元格的数据类型数字一律按浮点型输出，日期输出成一串小数，布尔型输出0或1，所以我们必须在程序中做判断处理转换成我们想要的数据类型0empty,1string,2number,
电商垃圾桶（一）选品失败的多维度深度解析及典型案例 potatoshops 电商垃圾桶教育电商大数据
选品作为电商运营的核心环节，其失败往往是多重因素叠加的致命伤。本文从六大关键维度剖析选品失误的根源，并通过细节化案例揭示现实中的决策陷阱：维度一：市场研究与需求错位核心问题：脱离真实消费场景的“伪需求”判断典型表现：忽视地域文化差异（如欧美与亚洲审美差异）误判政策法规风险（如电器安全认证缺失）低估用户决策成本（如高价非必需品）案例深化：平衡车惨案：某跨境卖家见欧美网红推广平衡车，未调研即采购200
2022-7-26晨间日记面前大海
今天是什么日子起床：6:10就寝：22:00天气：晴心情：平静任务清单昨日完成的任务，最重要的三件事：练字、锻炼改进：日更字数及质量。习惯养成：早起、锻炼、练字、节食周目标·完成进度读完《十天塑造孩子学习力》，已读八章。学习·信息·阅读完成书恒朗读训练、超级记忆术看1-3集视频。健康·饮食·锻炼节制饮食人际·家人·朋友正确引导孩子养成良好的学习习惯。工作·思考认真工作，从工作中寻找乐趣。最美好的三
软件测试入门指南：零基础到实战通关手册
一、为什么需要软件测试？行业现状（2024年数据）全球软件缺陷造成的经济损失高达$2.4万亿（来源：NIST报告）优秀测试人员与开发人员配比应达1:5（头部互联网企业实际数据）经典案例迪士尼+上线首日因负载测试不足导致服务器崩溃某银行系统未做金额边界测试，引发超额转账漏洞二、测试工程师的职责全景图（配图：测试工作流程图）阶段核心工作产出物示例需求分析参与评审，提取测试点测试需求跟踪矩阵测试设计编写
3D打印遥控投喂船：用ESP32C3打造低成本水上机器人 iotzgq 机器人
项目缘起：从脚踏船到智能投喂的创新转身在创客圈，灵感往往源于意外的"灵光一闪"。这个3D打印遥控投喂船的项目最初只是想做一艘普通的遥控脚踏船，直到开发者突发奇想：为什么不增加一个自动投喂装置？这个改动让项目瞬间具备了实用价值——不仅能在湖面操控小船畅玩，还能精准投放鱼食或鱼药到人工难以到达的水域。最令人称道的是其无线通信方案：放弃了传统遥控模块，采用ESP-NOW协议实现船与遥控器的通信。这种方案
用ESP8266和MicroPython打造WiFi智能遥控小车：从入门到实战
项目概述：WiFi控制的创新体验在物联网技术飞速发展的今天，传统遥控小车早已无法满足创客们的探索欲望。本文将介绍一个基于ESP8266和MicroPython的WiFi遥控小车项目，通过两个ESP8266模块实现无线通信，让你摆脱传统遥控器的束缚，体验物联网控制的乐趣。核心功能亮点WiFi无线控制：无需传统射频模块，通过WiFi网络实现远程操控双ESP8266架构：一个作为车载接收端，一个作为手持
PyCharm高效入门指南：快速提升Python开发效率 famenzhiling python pycharm ide
1.引言PyCharm简介：JetBrains开发的Python集成开发环境（IDE），适用于专业开发者和初学者。为什么选择PyCharm：高效代码编辑、智能工具集成和强大的调试功能。目标读者：Python新手或有其他IDE经验但想快速上手PyCharm的用户。2.安装与初始配置下载与安装：访问JetBrains官网下载PyCharmCommunity（免费版）或Professional（付费版）
Java大视界：Java大数据在智能医疗电子健康档案数据挖掘与健康服务创新＞ Loving_enjoy 计算机学科论文创新点人工智能深度学习迁移学习经验分享
>本文通过完整代码示例，揭秘如何用Java大数据技术挖掘电子健康档案价值，实现疾病预测、个性化健康管理等创新服务。###一、智能医疗时代的数据金矿电子健康档案（EHR）作为医疗数字化的核心载体，包含海量患者全生命周期健康数据。据统计，全球医疗数据量正以每年**48%的速度增长**，单个三甲医院年数据量可达**PB级**。这些数据蕴藏着疾病规律、治疗效能的宝贵知识，但传统技术难以有效挖掘。**Jav
好省省钱赚钱平台揭秘：赚钱潜力与多种赚钱方式详解！浮沉导师
好省是一款省钱赚钱平台，很多人疑惑好省赚钱是否真的高效。本文将揭秘好省省钱赚钱平台的赚钱潜力，帮助您了解如何在好省平台获得更多收益。一、好省平台简介及优势1.好省简介：好省是一个保护消费者权益的省钱赚钱平台，通过与商家合作提供商品和服务折扣，让用户节省开支并享受返现。2.赚钱优势：好省提供多种赚钱方式，如购物返现、邀请奖励、任务奖励等，让用户有机会通过平台赚取额外的收益。大家好！我是氧惠APP最大
Postman + Newman + Jenkins 接口自动化测试 Thomas Kant 自动化测试 postman newman jenkins allure
亲爱的技术爱好者们，热烈欢迎来到Kant2048的博客！我是ThomasKant，很开心能在CSDN上与你们相遇～本博客的精华专栏：【自动化测试】【测试经验】【人工智能】【Python】Postman
《潜夫论》卷16述赦诗解3性恶之人数赦无悔大奸媚上瑞异戒主琴诗书画
《潜夫论》卷16述赦诗解3性恶之人数赦无悔大奸媚上瑞异戒主题文诗：今也不然,性恶之人,居家不孝,出入不敬,轻薄慢傲,凶悍不变,明以威侮,侵利为行,竟以贼残,酷虐为贤,数陷王法,乃民之贼,下愚极恶,之人者也.虽脱桎梏,而出囹圄,终无改悔,之心自恃,以数赦赎,出狱踧踖,复犯法者,何所不然.洛阳至有,之主谐合,杀人谓之,会任之家,受人十万,谢客数千.重馈部吏,吏与通奸,利入深重,幡党盘牙,请至贵戚,谒于
李清照与赵明诚荷塘恋雨
看《金石录序》我泪流满面。她的丈夫赵明诚，那个懂她、疼她、欣赏她的男人永远的去了。战乱中，她要带着和丈夫一起收集的金石逃难，这样一个弱女子，往往保护不了这些沉重的金石，每每少了一个，那都是血肉模糊的疼，那是他们共同生活的见证，是他们爱的记忆……李清照，当你看着日渐减少的金石箱，你的苦无边，你的痛无底，“寻寻觅觅，凄凄惨惨戚戚。”赵明诚这个名字是因为李清照我才得知的，然而了解李清照的生活后，我才知道
opencv常用函数汇总 Sky.Kevin opencv 计算机视觉
一、色彩空间类型转换1、cv2.cvtColordst=cv2.cvtColor(src,code[,dstCn])式中：dst表示输出图像，与原始输入图像具有同样的数据类型和深度。src表示原始输入图像。可以是8位无符号图像、16位无符号图像，或者单精度浮点数等。code是色彩空间转换码，表4-2展示了其枚举值。dstCn是目标图像的通道数。如果参数为默认的0，则通道数自动通过原始输入图像和co
windows下源码安装golang 616050468 golang安装 golang环境 windows
系统： 64位win7，开发环境：sublime text 2， go版本： 1.4.1 1. 安装前准备(gcc, gdb, git) golang在64位系
redis批量删除带空格的key bylijinnan redis
redis批量删除的通常做法： redis-cli keys "blacklist*" | xargs redis-cli del 上面的命令在key的前后没有空格时是可以的，但有空格就不行了： $redis-cli keys "blacklist*" 1) "blacklist:12: [email protected]
oracle正则表达式的用法 0624chenhong oracle 正则表达式
方括号表达示方括号表达式描述 [[:alnum:]] 字母和数字混合的字符 [[:alpha:]] 字母字符 [[:cntrl:]] 控制字符 [[:digit:]] 数字字符 [[:graph:]] 图像字符 [[:lower:]] 小写字母字符 [[:print:]] 打印字符 [[:punct：]] 标点符号字符 [[:space:]]
2048源码(核心算法有，缺少几个anctionbar，以后补上) 不懂事的小屁孩 2048
2048游戏基本上有四部分组成， 1：主activity，包含游戏块的16个方格，上面统计分数的模块 2：底下的gridview，监听上下左右的滑动，进行事件处理， 3：每一个卡片，里面的内容很简单，只有一个text，记录显示的数字 4：Actionbar，是游戏用重新开始，设置等功能(这个在底下可以下载的代码里面还没有实现) 写代码的流程 1：设计游戏的布局，基本是两块，上面是分
jquery内部链式调用机理换个号韩国红果果 JavaScript jquery
只需要在调用该对象合适(比如下列的setStyles)的方法后让该方法返回该对象（通过this 因为一旦一个函数称为一个对象方法的话那么在这个方法内部this（结合下面的setStyles）指向这个对象） function create(type){ var element=document.createElement(type); //this=element;
你订酒店时的每一次点击背后都是NoSQL和云计算蓝儿唯美 NoSQL
全球最大的在线旅游公司Expedia旗下的酒店预订公司，它运营着89个网站，跨越68个国家，三年前开始实验公有云，以求让客户在预订网站上查询假期酒店时得到更快的信息获取体验。云端本身是用于驱动网站的部分小功能的，如搜索框的自动推荐功能，还能保证处理Hotels.com服务的季节性需求高峰整体储能。 Hotels.com的首席技术官Thierry Bedos上个月在伦敦参加“2015 Clou
java笔记1 a-john java
1，面向对象程序设计（Object-oriented Propramming，OOP）：java就是一种面向对象程序设计。 2，对象：我们将问题空间中的元素及其在解空间中的表示称为“对象”。简单来说，对象是某个类型的实例。比如狗是一个类型，哈士奇可以是狗的一个实例，也就是对象。 3，面向对象程序设计方式的特性： 3.1 万物皆为对象。
C语言 sizeof和strlen之间的那些事 C/C++软件开发求职面试题必备考点（一） aijuans C/C++求职面试必备考点
找工作在即，以后决定每天至少写一个知识点，主要是记录，逼迫自己动手、总结加深印象。当然如果能有一言半语让他人收益，后学幸运之至也。如有错误，还希望大家帮忙指出来。感激不尽。后学保证每个写出来的结果都是自己在电脑上亲自跑过的，咱人笨，以前学的也半吊子。很多时候只能靠运行出来的结果再反过来
程序员写代码时就不要管需求了吗？ asia007 程序员不能一味跟需求走
编程也有2年了，刚开始不懂的什么都跟需求走，需求是怎样就用代码实现就行，也不管这个需求是否合理，是否为较好的用户体验。当然刚开始编程都会这样，但是如果有了2年以上的工作经验的程序员只知道一味写代码，而不在写的过程中思考一下这个需求是否合理，那么，我想这个程序员就只能一辈写敲敲代码了。我的技术不是很好，但是就不代
Activity的四种启动模式百合不是茶 android 栈模式启动 Activity的标准模式启动栈顶模式启动单例模式启动
android界面的操作就是很多个activity之间的切换,启动模式决定启动的activity的生命周期 ; 启动模式xml中配置 <activity android:name=".MainActivity" android:launchMode="standard&quo
Spring中@Autowired标签与@Resource标签的区别 bijian1013 java spring @Resource @Autowired @Qualifier
Spring不但支持自己定义的@Autowired注解，还支持由JSR-250规范定义的几个注解，如：@Resource、 @PostConstruct及@PreDestroy。 1. @Autowired @Autowired是Spring 提供的，需导入 Package:org.springframewo
Changes Between SOAP 1.1 and SOAP 1.2 sunjing Changes Enable SOAP 1.1 SOAP 1.2
JAX-WS SOAP Version 1.2 Part 0: Primer (Second Edition) SOAP Version 1.2 Part 1: Messaging Framework (Second Edition) SOAP Version 1.2 Part 2: Adjuncts (Second Edition) Which style of WSDL
【Hadoop二】Hadoop常用命令 bit1129 hadoop
以Hadoop运行Hadoop自带的wordcount为例， hadoop脚本位于/home/hadoop/hadoop-2.5.2/bin/hadoop，需要说明的是，这些命令的使用必须在Hadoop已经运行的情况下才能执行 Hadoop HDFS相关命令 hadoop fs -ls 列出HDFS文件系统的第一级文件和第一级
java异常处理（初级）白糖_ java DAO spring 虚拟机 Ajax
从学习到现在从事java开发一年多了，个人觉得对java只了解皮毛，很多东西都是用到再去慢慢学习，编程真的是一项艺术，要完成一段好的代码，需要懂得很多。最近项目经理让我负责一个组件开发，框架都由自己搭建，最让我头疼的是异常处理，我看了一些网上的源码，发现他们对异常的处理不是很重视，研究了很久都没有找到很好的解决方案。后来有幸看到一个200W美元的项目部分源码，通过他们对异常处理的解决方案，我终
记录整理-工作问题 braveCS 工作
1）那位同学还是CSV文件默认Excel打开看不到全部结果。以为是没写进去。同学甲说文件应该不分大小。后来log一下原来是有写进去。只是Excel有行数限制。那位同学进步好快啊。 2）今天同学说写文件的时候提示jvm的内存溢出。我马上反应说那就改一下jvm的内存大小。同学说改用分批处理了。果然想问题还是有局限性。改jvm内存大小只能暂时地解决问题，以后要是写更大的文件还是得改内存。想问题要长远啊
org.apache.tools.zip实现文件的压缩和解压，支持中文 bylijinnan apache
刚开始用java.util.Zip，发现不支持中文（网上有修改的方法，但比较麻烦）后改用org.apache.tools.zip org.apache.tools.zip的使用网上有更简单的例子下面的程序根据实际需求，实现了压缩指定目录下指定文件的方法 import java.io.BufferedReader; import java.io.BufferedWrit
读书笔记-4 chengxuyuancsdn 读书笔记
1、JSTL 核心标签库标签 2、避免SQL注入 3、字符串逆转方法 4、字符串比较compareTo 5、字符串替换replace 6、分拆字符串 1、JSTL 核心标签库标签共有13个，学习资料：http://www.cnblogs.com/lihuiyy/archive/2012/02/24/2366806.html 功能上分为4类： (1)表达式控制标签：out
[物理与电子]半导体教材的一个小问题 comsci 问题
各种模拟电子和数字电子教材中都有这个词汇-空穴书中对这个词汇的解释是; 当电子脱离共价键的束缚成为自由电子之后,共价键中就留下一个空位,这个空位叫做空穴我现在回过头翻大学时候的教材,觉得这个
Flashback Database --闪回数据库 daizj oracle 闪回数据库
Flashback 技术是以Undo segment中的内容为基础的，因此受限于UNDO_RETENTON参数。要使用flashback 的特性，必须启用自动撤销管理表空间。在Oracle 10g中， Flash back家族分为以下成员： Flashback Database， Flashback Drop，Flashback Query(分Flashback Query,Flashbac
简单排序:插入排序 dieslrae 插入排序
public void insertSort(int[] array){ int temp; for(int i=1;i<array.length;i++){ temp = array[i]; for(int k=i-1;k>=0;k--)
C语言学习六指针小示例、一维数组名含义，定义一个函数输出数组的内容 dcj3sjt126com c
# include <stdio.h> int main(void) { int * p; //等价于 int *p 也等价于 int* p; int i = 5; char ch = 'A'; //p = 5; //error //p = &ch; //error //p = ch; //error p = &i; //
centos下php redis扩展的安装配置3种方法 dcj3sjt126com redis
方法一 1.下载php redis扩展包代码如下复制代码 #wget http://redis.googlecode.com/files/redis-2.4.4.tar.gz 2 tar -zxvf 解压压缩包，cd /扩展包（进入扩展包然后运行phpize 一下是我环境中phpize的目录，/usr/local/php/bin/phpize (一定要
线程池(Executors) shuizhaosi888 线程池
在java类库中，任务执行的主要抽象不是Thread，而是Executor，将任务的提交过程和执行过程解耦 public interface Executor { void execute(Runnable command); } public class RunMain implements Executor{ @Override pub
openstack 快速安装笔记 haoningabc openstack
前提是要配置好yum源版本icehouse，操作系统redhat6.5 最简化安装，不要cinder和swift 三个节点 172 control节点keystone glance horizon 173 compute节点nova 173 network节点neutron control /etc/sysctl.conf net.ipv4.ip_forward =
从c面向对象的实现理解c++的对象（二） jimmee C++面向对象虚函数
1. 类就可以看作一个struct，类的方法，可以理解为通过函数指针的方式实现的，类对象分配内存时，只分配成员变量的，函数指针并不需要分配额外的内存保存地址。 2. c++中类的构造函数，就是进行内存分配(malloc)，调用构造函数 3. c++中类的析构函数，就时回收内存(free) 4. c++是基于栈和全局数据分配内存的，如果是一个方法内创建的对象，就直接在栈上分配内存了。专门在
如何让那个一个div可以拖动 lingfeng520240 html
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/1999/xhtml
第10章高级事件（中） onestopweb 事件
index.html <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> <html xmlns="http://www.w3.org/
计算两个经纬度之间的距离 roadrunners 计算纬度 LBS 经度距离
要解决这个问题的时候，到网上查了很多方案，最后计算出来的都与百度计算出来的有出入。下面这个公式计算出来的距离和百度计算出来的距离是一致的。 /** * * @param longitudeA * 经度A点 * @param latitudeA * 纬度A点 * @param longitudeB *
最具争议的10个Java话题 tomcat_oracle java
1、Java8已经到来。什么！？ Java8 支持lambda。哇哦，RIP Scala！　　随着Java8 的发布，出现很多关于新发布的Java8是否有潜力干掉Scala的争论，最终的结论是远远没有那么简单。Java8可能已经在Scala的lambda的包围中突围，但Java并非是函数式编程王位的真正觊觎者。　　2、Java 9 即将到来　　 Oracle早在8月份就发布
zoj 3826 Hierarchical Notation(模拟) 阿尔萨斯 rar
题目链接：zoj 3826 Hierarchical Notation 题目大意：给定一些结构体，结构体有value值和key值，Q次询问，输出每个key值对应的value值。解题思路：思路很简单，写个类词法的递归函数，每次将key值映射成一个hash值，用map映射每个key的value起始终止位置，预处理完了查询就很简单了。这题是最后10分钟出的，因为没有考虑value为{}的情

按字母分类： A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 其他

『计算机视觉』Mask-RCNN_训练网络其一：数据集与Dataset类

代码位置

一、原始数据信息录入

二、数据信息整理

类别信息记录

图片信息记录

三、获取图片

小结

重点来了，训练自己的数据集

你可能感兴趣的:(『计算机视觉』Mask-RCNN_训练网络其一：数据集与Dataset类)