
Multi-class semantic segmentation with DeepLabV3+ on the CIHP (Crowd Instance-level Human Parsing) dataset

Author: WangXi2016

Date: 2022.10.27

Abstract: Implements the DeepLabV3+ architecture for multi-class semantic segmentation.

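1. Introduction

Semantic segmentation assigns a class label to every pixel of an image and is a fundamental computer vision task.

This example implements DeepLabV3+ for multi-class semantic segmentation. DeepLabV3+ is a fully convolutional architecture that performs well on semantic segmentation benchmarks.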
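
To make "a class label for every pixel" concrete, the sketch below turns a small array of per-class scores into a label map by taking the argmax over the class axis. The shapes and score values are made up for illustration, and only NumPy is assumed.

```python
import numpy as np

# Hypothetical scores for 3 classes on a 2x2 image, laid out as (classes, height, width).
scores = np.array([
    [[0.1, 0.7], [0.2, 0.1]],   # class 0
    [[0.8, 0.2], [0.3, 0.1]],   # class 1
    [[0.1, 0.1], [0.5, 0.8]],   # class 2
])

# The predicted label of each pixel is the class with the highest score.
label_map = scores.argmax(axis=0)
print(label_map)
```

A segmentation network produces the same kind of score array, only with one channel per class (20 here) and the full image resolution.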

2. Environment setup

Import the commonly used modules and check the installed PaddlePaddle version.

import os
import cv2
import numpy as np
from glob import glob
from scipy.io import loadmat
import matplotlib.pyplot as plt
import io
from PIL import Image as PilImage
import paddle
import paddle.nn as nn
from paddle.nn import functional as F

paddle.__version__

'2.3.2'

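3. Dataset

3.1 Downloading the dataset

URL: https://drive.google.com/uc?id=1B9A9UCJYMwTL4oBEo4RZfbMZMaZhKJaz

We train the model on the Crowd Instance-level Human Parsing (CIHP) dataset.

CIHP contains 38,280 diverse human images.

Each image in CIHP is labeled with pixel-level annotations for 20 categories, together with instance-level identification.

The dataset can be used for the human part segmentation task.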

3.2 Dataset overview

Images: images
Category_ids: semantic part segmentation labels
Categories: visualized semantic part segmentation labels
Human_ids: semantic person segmentation labels
Human: visualized semantic person segmentation labels
Instance_ids: instance-level human parsing labels
Instances: visualized instance-level human parsing labels

Label order of semantic part segmentation:

1. Hat
2. Hair
3. Glove
4. Sunglasses
5. UpperClothes
6. Dress
7. Coat
8. Socks
9. Pants
10. Torso-skin
11. Scarf
12. Skirt
13. Face
14. Left-arm
15. Right-arm
16. Left-leg
17. Right-leg
18. Left-shoe
19. Right-shoe
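
The label order can be turned into a lookup table for reading mask values. This is a minimal sketch: the part names follow the list above, while treating id 0 as the background (19 parts plus background giving the 20 categories) is an assumption, and `class_name` is a hypothetical helper.

```python
# Part names in the label order listed above (ids 1..19).
PART_NAMES = [
    "Hat", "Hair", "Glove", "Sunglasses", "UpperClothes", "Dress", "Coat",
    "Socks", "Pants", "Torso-skin", "Scarf", "Skirt", "Face", "Left-arm",
    "Right-arm", "Left-leg", "Right-leg", "Left-shoe", "Right-shoe",
]

# Assumption: id 0 is the background, which gives 20 classes in total.
CLASS_NAMES = ["Background"] + PART_NAMES


def class_name(class_id):
    """Return the readable name for a class id found in a Category_ids mask."""
    return CLASS_NAMES[class_id]


print(len(CLASS_NAMES), class_name(5))
```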

# Extract the dataset. This command runs as-is only in the AI Studio environment.
!tar -zxvf "data/data129866/instance-level_human_parsing.tar.gz"

DATA_DIR = "./instance-level_human_parsing/Training"
NUM_TRAIN_IMAGES = 1000
NUM_VAL_IMAGES = 50

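"""
The files are scattered across folders, while training needs each image paired with its label.
The first step is therefore to organize the raw data into two lists, images and labels, aligned index by index.
This makes it easy to look up the label that belongs to an image, which the raw folder layout does not allow directly.
A very simple approach is used here: sort both folders by file name.
"""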
train_images = sorted(glob(os.path.join(DATA_DIR, "Images/*")))[:NUM_TRAIN_IMAGES]
train_masks = sorted(glob(os.path.join(DATA_DIR, "Category_ids/*")))[:NUM_TRAIN_IMAGES]
val_images = sorted(glob(os.path.join(DATA_DIR, "Images/*")))[
    NUM_TRAIN_IMAGES : NUM_VAL_IMAGES + NUM_TRAIN_IMAGES
]
val_masks = sorted(glob(os.path.join(DATA_DIR, "Category_ids/*")))[
    NUM_TRAIN_IMAGES : NUM_VAL_IMAGES + NUM_TRAIN_IMAGES
]

# Prepare the dataset and split it into training and validation sets
def _sort_images(image_dir, image_type):
    """
    Return the files of the given type in a folder, sorted by file name
    """
    files = []

    for image_name in os.listdir(image_dir):
        if image_name.endswith('.{}'.format(image_type)) \
                and not image_name.startswith('.'):
            files.append(os.path.join(image_dir, image_name))

    return sorted(files)

def write_file(mode, images, labels):
    with open('./{}.txt'.format(mode), 'w') as f:
        for i in range(len(images)):
            f.write('{}\t{}\n'.format(images[i], labels[i]))

write_file('train', train_images, train_masks)
write_file('val', val_images, val_masks)
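
Pairing by sorted file name only works if every image has a mask with a matching name. The sketch below is one way to check that before training. It assumes an image and its mask share the same file stem and differ only in folder and extension, and `stem` and `find_mismatches` are hypothetical helpers, not part of the original code.

```python
import os


def stem(path):
    """File name without directory or extension, e.g. 'a/b/0001.jpg' -> '0001'."""
    return os.path.splitext(os.path.basename(path))[0]


def find_mismatches(images, masks):
    """Return the (image, mask) pairs whose file stems differ."""
    if len(images) != len(masks):
        raise ValueError("images and masks have different lengths")
    return [(i, m) for i, m in zip(images, masks) if stem(i) != stem(m)]


# Made-up file names used only to exercise the check.
demo_images = ["Images/0001.jpg", "Images/0002.jpg"]
demo_masks = ["Category_ids/0001.png", "Category_ids/0003.png"]
print(find_mismatches(demo_images, demo_masks))
```

Calling `find_mismatches(train_images, train_masks)` should return an empty list when the two sorted lists line up.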

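3.3 Displaying samples from the CIHP dataset

After splitting the dataset, check that it looks as expected: read the image paths from the split files, load the images, and display them with matplotlib. Note that the segmentation label files are single-channel grayscale images, so pass cmap='gray' to imshow.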

with open('./train.txt', 'r') as f:
    for i, line in enumerate(f):
        # Show only the first three image/label pairs
        if i > 2:
            break

        image_path, label_path = line.strip().split('\t')
        image = np.array(PilImage.open(image_path))
        label = np.array(PilImage.open(label_path))

        # Display the image and its label side by side
        plt.figure()

        plt.subplot(1, 2, 1)
        plt.title('Train Image')
        plt.imshow(image.astype('uint8'))
        plt.axis('off')

        plt.subplot(1, 2, 2)
        plt.title('Label')
        plt.imshow(label.astype('uint8'), cmap='gray')
        plt.axis('off')

        plt.show()
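
With a gray colormap, neighboring class ids map to similar gray levels, so individual body parts can be hard to tell apart. One alternative is to map each class id to its own color. The sketch below assumes NumPy only. The random palette and the helper names `make_palette` and `colorize` are illustrative choices, not part of the dataset.

```python
import numpy as np


def make_palette(num_classes, seed=0):
    """Build a (num_classes, 3) uint8 color table; class 0 is kept black."""
    rng = np.random.default_rng(seed)
    palette = rng.integers(0, 256, size=(num_classes, 3), dtype=np.uint8)
    palette[0] = 0
    return palette


def colorize(label, palette):
    """Map an (H, W) array of class ids to an (H, W, 3) RGB image."""
    return palette[label]


# Made-up 2x3 label map standing in for a Category_ids mask.
demo_label = np.array([[0, 1, 1], [5, 5, 19]])
rgb = colorize(demo_label, make_palette(20))
print(rgb.shape, rgb.dtype)
```

The resulting array can be passed to `plt.imshow` without a `cmap` argument.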

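3.4 Defining the dataset class

PaddlePaddle loads data through a combination of Dataset (dataset definition) and DataLoader (multi-process data loading).

Start by defining the dataset: implement a new Dataset class that inherits from paddle.io.Dataset and implements its two abstract methods, __getitem__ and __len__: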

class MyDataset(Dataset):
    def __init__(self):
        ...

    # Return one sample and its label on each access
    def __getitem__(self, idx):
        return x, y

    # Return the total number of samples in the dataset
    def __len__(self):
        return len(samples)
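
The two methods are all a loader needs in order to walk through the data. The sketch below illustrates this with a stand-in class that follows the same protocol but does not depend on Paddle. `ListDataset` and `iterate_batches` are hypothetical names, and the loop is a simplification of what a real DataLoader does (no shuffling, no worker processes).

```python
class ListDataset:
    """A stand-in that follows the Dataset protocol without depending on Paddle."""

    def __init__(self, samples, labels):
        assert len(samples) == len(labels)
        self.samples = samples
        self.labels = labels

    def __getitem__(self, idx):
        # Return one sample and its label.
        return self.samples[idx], self.labels[idx]

    def __len__(self):
        # Return the number of samples.
        return len(self.samples)


def iterate_batches(dataset, batch_size):
    """Yield lists of (sample, label) pairs, the way a simple loader would."""
    for start in range(0, len(dataset), batch_size):
        stop = min(start + batch_size, len(dataset))
        yield [dataset[i] for i in range(start, stop)]


demo = ListDataset(["a", "b", "c"], [0, 1, 2])
batches = list(iterate_batches(demo, batch_size=2))
print(batches)
```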

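Inside the dataset, images can be preprocessed with the image preprocessing APIs (resizing, flipping, format conversion, and so on).

Loaded images do not always match what the model needs. For example, some of the downloaded images are in RGBA format, which does not satisfy the 3-channel requirement and has to be converted. A general-purpose image loading function is therefore implemented here to ensure that every image read meets the requirements.

Images are loaded with shape HWC by default, so check whether this matches what training expects. If a layer's default format differs, see whether the layer has a parameter to change the format. With many layers, however, it is better to change the shape of the source data directly; otherwise every layer needs the parameter set, and missing one causes training errors. In this example the source data is converted from HWC to CHW once, because PaddlePaddle's convolution and related APIs expect channel-first (CHW) input by default, which simplifies model training later.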

import random

from paddle.io import Dataset
from paddle.vision.transforms import transforms as T

BATCH_SIZE = 4
NUM_CLASSES = 20
IMAGE_SIZE = (512, 512)

class CIHPDataset(Dataset):
    """
    CIHP dataset definition
    """
    def __init__(self, mode='train'):
        """
        Constructor: read the image/label path pairs for the given mode
        """
        self.image_size = IMAGE_SIZE
        self.mode = mode.lower()
        
        assert self.mode in ['train', 'val'], \
            "mode should be 'train' or 'val', but got {}".format(self.mode)
        
        self.train_images = []
        self.label_images = []

        with open('./{}.txt'.format(self.mode), 'r') as f:
            for line in f.readlines():
                image, label = line.strip().split('\t')
                self.train_images.append(image)
                self.label_images.append(label)
        
    def _load_img(self, path, color_mode='rgb', interpolation='bilinear',
                  transforms=None):
        """
        Shared image loader that normalizes the image size and channels
        """
        with open(path, 'rb') as f:
            img = PilImage.open(io.BytesIO(f.read()))
            if color_mode == 'grayscale':
                # if image is not already an 8-bit, 16-bit or 32-bit grayscale image
                # convert it to an 8-bit grayscale image.
                if img.mode not in ('L', 'I;16', 'I'):
                    img = img.convert('L')
            elif color_mode == 'rgba':
                if img.mode != 'RGBA':
                    img = img.convert('RGBA')
            elif color_mode == 'rgb':
                if img.mode != 'RGB':
                    img = img.convert('RGB')
            else:
                raise ValueError('color_mode must be "grayscale", "rgb", or "rgba"')

            return T.Compose([
                T.Resize(self.image_size, interpolation=interpolation)
            ] + list(transforms or []))(img)

    def __getitem__(self, idx):
        """
        Return image, label
        """
        train_image = self._load_img(self.train_images[idx],
                                     transforms=[
                                         T.Transpose(),
                                         T.Normalize(mean=127.5, std=127.5)
                                     ]) # load the input image
        # Nearest-neighbor resizing keeps every label value a valid class id;
        # bilinear interpolation would blend neighboring ids into unrelated classes.
        label_image = self._load_img(self.label_images[idx],
                                     color_mode='grayscale',
                                     interpolation='nearest',
                                     transforms=[T.Grayscale()]) # load the label image

        # Return image, label
        train_image = np.array(train_image, dtype='float32')
        label_image = np.array(label_image, dtype='int64')
        return train_image, label_image
        
    def __len__(self):
        """
        Return the total number of samples
        """
        return len(self.train_images)
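
The layout change and the normalization applied to the input image can be checked in isolation. The sketch below reproduces the arithmetic with NumPy on a made-up 2x2 image. It is an illustration of the HWC to CHW transpose and of (x - 127.5) / 127.5, not a call into the Paddle transforms themselves.

```python
import numpy as np

# Made-up 2x2 RGB image in HWC layout with 8-bit values.
hwc = np.array([
    [[0, 127, 255], [255, 255, 255]],
    [[0, 0, 0], [51, 102, 204]],
], dtype=np.uint8)

# Move the channel axis to the front: (H, W, C) -> (C, H, W).
chw = hwc.transpose(2, 0, 1).astype("float32")

# Same arithmetic as mean=127.5, std=127.5: values in [0, 255] map to [-1, 1].
normalized = (chw - 127.5) / 127.5

print(chw.shape, normalized.min(), normalized.max())
```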

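4. Building the model

4.1 Introduction to DeepLabv3+

DeepLabv3+ is the most recent network in the DeepLab semantic segmentation series, following DeepLabv1, DeepLabv2, and DeepLabv3.
In this version, the DeepLab authors fuse multi-scale information through an encoder-decoder structure while keeping the atrous (dilated) convolutions and the ASPP layer of the earlier versions.
The original network uses an Xception backbone, which improves both the robustness and the speed of semantic segmentation, and it reached a new state-of-the-art result of 89.0% mIoU on the PASCAL VOC 2012 dataset.

4.1.1 ASPP (Atrous Spatial Pyramid Pooling)

ASPP captures features at different scales by applying atrous convolutions with different sampling rates. See DeepLabv2 for details.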
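
How much context each atrous rate covers follows from simple arithmetic. The sketch below uses the standard formulas for the span of a dilated kernel and for the convolution output size. The rates (6, 12, 18) and the 64x64 feature map are illustrative assumptions, and the helper names are hypothetical.

```python
def effective_kernel(kernel_size, dilation):
    """Span covered by a dilated kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def conv_output_size(size, kernel_size, dilation, padding, stride=1):
    """Standard convolution output-size formula for one spatial dimension."""
    return (size + 2 * padding - dilation * (kernel_size - 1) - 1) // stride + 1


# A 3x3 kernel at each rate, with padding equal to the rate.
for rate in (6, 12, 18):
    span = effective_kernel(3, rate)
    out = conv_output_size(64, kernel_size=3, dilation=rate, padding=rate)
    print(rate, span, out)
```

Setting the padding equal to the dilation rate keeps the spatial size unchanged for a 3x3 kernel, which is what lets the parallel branches be concatenated.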

4.2 Helper functions and layers

import os

import paddle
import paddle.nn as nn
import paddle.nn.functional as F


def SyncBatchNorm(*args, **kwargs):
    """In cpu environment nn.SyncBatchNorm does not have kernel so use nn.BatchNorm2D instead"""
    if paddle.get_device() == 'cpu' or os.environ.get('PADDLESEG_EXPORT_STAGE'):
        return nn.BatchNorm2D(*args, **kwargs)
    elif paddle.distributed.ParallelEnv().nranks == 1:
        return nn.BatchNorm2D(*args, **kwargs)
    else:
        return nn.SyncBatchNorm(*args, **kwargs)


class ConvBNReLU(nn.Layer):
    def __init__(self,
                 in_channels,
                 out_channels,
                 kernel_size,
                 padding='same',
                 **kwargs):
        super().__init__()

        self._conv = nn.Conv2D(
            in_channels, out_channels, kernel_size, padding=padding, **kwargs)

        if 'data_format' in kwargs:
            data_format = kwargs['data_format']
        else:
            data_format = 'NCHW'
        self._batch_norm = SyncBatchNorm(out_channels, data_format=data_format)
        self._relu = Activation("relu")

    def forward(self, x):
        x = self._conv(x)
        x = self._batch_norm(x)
        x = self._relu(x)
        return x



class ConvBN(nn.Layer):
    def __init__(self,
                 in_channels,
                 out_channels,
                 kernel_size,
                 padding='same',
                 **kwargs):
        super().__init__()
        self._conv = nn.Conv2D(
            in_channels, out_channels, kernel_size, padding=padding, **kwargs)
        if 'data_format' in kwargs:
            data_format = kwargs['data_format']
        else:
            data_format = 'NCHW'
        self._batch_norm = SyncBatchNorm(out_channels, data_format=data_format)

    def forward(self, x):
        x = self._conv(x)
        x = self._batch_norm(x)
        return x



class SeparableConvBNReLU(nn.Layer):
    def __init__(self,
                 in_channels,
                 out_channels,
                 kernel_size,
                 padding='same',
                 pointwise_bias=None,
                 **kwargs):
        super().__init__()
        self.depthwise_conv = ConvBN(
            in_channels,
            out_channels=in_channels,
            kernel_size=kernel_size,
            padding=padding,
            groups=in_channels,
            **kwargs)
        if 'data_format' in kwargs:
            data_format = kwargs['data_format']
        else:
            data_format = 'NCHW'
        self.piontwise_conv = ConvBNReLU(
            in_channels,
            out_channels,
            kernel_size=1,
            groups=1,
            data_format=data_format,
            bias_attr=pointwise_bias)

    def forward(self, x):
        x = self.depthwise_conv(x)
        x = self.piontwise_conv(x)
        return x



class Activation(nn.Layer):
    def __init__(self, act=None):
        super(Activation, self).__init__()

        self._act = act
        upper_act_names = nn.layer.activation.__dict__.keys()
        lower_act_names = [act.lower() for act in upper_act_names]
        act_dict = dict(zip(lower_act_names, upper_act_names))

        if act is not None:
            if act in act_dict.keys():
                act_name = act_dict[act]
                self.act_func = eval("nn.layer.activation.{}()".format(
                    act_name))
            else:
                raise KeyError("{} does not exist in the current {}".format(
                    act, act_dict.keys()))

    def forward(self, x):
        if self._act is not None:
            return self.act_func(x)
        else:
            return x


class ConvBNLayer(nn.Layer):
    def __init__(self,
                 in_channels,
                 out_channels,
                 kernel_size,
                 stride=1,
                 dilation=1,
                 groups=1,
                 is_vd_mode=False,
                 act=None,
                 data_format='NCHW'):
        super(ConvBNLayer, self).__init__()
        if dilation != 1 and kernel_size != 3:
            raise RuntimeError("When the dilation isn't 1," \
                "the kernel_size should be 3.")

        self.is_vd_mode = is_vd_mode
        self._pool2d_avg = nn.AvgPool2D(
            kernel_size=2,
            stride=2,
            padding=0,
            ceil_mode=True,
            data_format=data_format)
        self._conv = nn.Conv2D(
            in_channels=in_channels,
            out_channels=out_channels,
            kernel_size=kernel_size,
            stride=stride,
            padding=(kernel_size - 1) // 2 \
                if dilation == 1 else dilation,
            dilation=dilation,
            groups=groups,
            bias_attr=False,
            data_format=data_format)

        self._batch_norm = SyncBatchNorm(
            out_channels, data_format=data_format)
        self._act_op = Activation(act=act)

    def forward(self, inputs):
        if self.is_vd_mode:
            inputs = self._pool2d_avg(inputs)
        y = self._conv(inputs)
        y = self._batch_norm(y)
        y = self._act_op(y)

        return y
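
The reason for splitting a convolution into a depthwise and a pointwise step, as SeparableConvBNReLU does, shows up in the weight count. The sketch below compares the two for a 3x3 convolution with 256 input and output channels. The channel numbers are chosen for illustration, and biases and batch-norm parameters are ignored.

```python
def standard_conv_weights(in_channels, out_channels, kernel_size):
    """Weights of a regular k x k convolution (bias ignored)."""
    return in_channels * out_channels * kernel_size * kernel_size


def separable_conv_weights(in_channels, out_channels, kernel_size):
    """Weights of a depthwise k x k convolution followed by a 1x1 pointwise one."""
    depthwise = in_channels * kernel_size * kernel_size
    pointwise = in_channels * out_channels
    return depthwise + pointwise


standard = standard_conv_weights(256, 256, 3)
separable = separable_conv_weights(256, 256, 3)
print(standard, separable, round(standard / separable, 1))
```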

4.3 Backbone implementation

class BottleneckBlock(nn.Layer):
    def __init__(self,
                 in_channels,
                 out_channels,
                 stride,
                 shortcut=True,
                 if_first=False,
                 dilation=1,
                 data_format='NCHW'):
        super(BottleneckBlock, self).__init__()

        self.data_format = data_format
        self.conv0 = ConvBNLayer(
            in_channels=in_channels,
            out_channels=out_channels,
            kernel_size=1,
            act='relu',
            data_format=data_format)

        self.dilation = dilation

        self.conv1 = ConvBNLayer(
            in_channels=out_channels,
            out_channels=out_channels,
            kernel_size=3,
            stride=stride,
            act='relu',
            dilation=dilation,
            data_format=data_format)
        self.conv2 = ConvBNLayer(
            in_channels=out_channels,
            out_channels=out_channels * 4,
            kernel_size=1,
            act=None,
            data_format=data_format)

        if not shortcut:
            self.short = ConvBNLayer(
                in_channels=in_channels,
                out_channels=out_channels * 4,
                kernel_size=1,
                stride=1,
                is_vd_mode=False if if_first or stride == 1 else True,
                data_format=data_format)

        self.shortcut = shortcut
        self.relu = Activation(act="relu")

    def forward(self, inputs):
        y = self.conv0(inputs)
        conv1 = self.conv1(y)
        conv2 = self.conv2(conv1)

        if self.shortcut:
            short = inputs
        else:
            short = self.short(inputs)
        y = short + conv2
        y = self.relu(y)
        return y


class BasicBlock(nn.Layer):
    def __init__(self,
                 in_channels,
                 out_channels,
                 stride,
                 dilation=1,
                 shortcut=True,
                 if_first=False,
                 data_format='NCHW'):
        super(BasicBlock, self).__init__()
        self.conv0 = ConvBNLayer(
            in_channels=in_channels,
            out_channels=out_channels,
            kernel_size=3,
            stride=stride,
            dilation=dilation,
            act='relu',
            data_format=data_format)
        self.conv1 = ConvBNLayer(
            in_channels=out_channels,
            out_channels=out_channels,
            kernel_size=3,
            dilation=dilation,
            act=None,
            data_format=data_format)

        if not shortcut:
            self.short = ConvBNLayer(
                in_channels=in_channels,
                out_channels=out_channels,
                kernel_size=1,
                stride=1,
                is_vd_mode=False if if_first or stride == 1 else True,
                data_format=data_format)

        self.shortcut = shortcut
        self.dilation = dilation
        self.data_format = data_format

        self.relu = Activation(act="relu")

    def forward(self, inputs):
        y = self.conv0(inputs)
        conv1 = self.conv1(y)

        if self.shortcut:
            short = inputs
        else:
            short = self.short(inputs)
        y = paddle.add(short, conv1)
        y = self.relu(y)

        return y


class ResNet_vd(nn.Layer):
    """
    The ResNet_vd implementation based on PaddlePaddle.
    The original article refers to
    Tong He, et al. "Bag of Tricks for Image Classification with Convolutional Neural Networks"
    (https://arxiv.org/pdf/1812.01187.pdf).
    Args:
        layers (int, optional): The layers of ResNet_vd. The supported layers are (18, 34, 50, 101, 152, 200). Default: 50.
        output_stride (int, optional): The stride of output features compared to input images. It is 8 or 16. Default: 8.
        multi_grid (tuple|list, optional): The grid of stage4. Default: (1, 1, 1).
        pretrained (str, optional): The path of pretrained model.
    """

    def __init__(self,
                 layers=50,
                 output_stride=8,
                 multi_grid=(1, 1, 1),
                 pretrained=None,
                 data_format='NCHW'):
        super(ResNet_vd, self).__init__()

        self.data_format = data_format
        self.conv1_logit = None  # for gscnn shape stream
        self.layers = layers
        supported_layers = [18, 34, 50, 101, 152, 200]
        assert layers in supported_layers, \
            "supported layers are {} but input layer is {}".format(
                supported_layers, layers)

        if layers == 18:
            depth = [2, 2, 2, 2]
        elif layers == 34 or layers == 50:
            depth = [3, 4, 6, 3]
        elif layers == 101:
            depth = [3, 4, 23, 3]
        elif layers == 152:
            depth = [3, 8, 36, 3]
        elif layers == 200:
            depth = [3, 12, 48, 3]
        num_channels = [64, 256, 512,
                        1024] if layers >= 50 else [64, 64, 128, 256]
        num_filters = [64, 128, 256, 512]

        # for channels of four returned stages
        self.feat_channels = [c * 4 for c in num_filters
                              ] if layers >= 50 else num_filters

        dilation_dict = None
        if output_stride == 8:
            dilation_dict = {2: 2, 3: 4}
        elif output_stride == 16:
            dilation_dict = {3: 2}

        self.conv1_1 = ConvBNLayer(
            in_channels=3,
            out_channels=32,
            kernel_size=3,
            stride=2,
            act='relu',
            data_format=data_format)
        self.conv1_2 = ConvBNLayer(
            in_channels=32,
            out_channels=32,
            kernel_size=3,
            stride=1,
            act='relu',
            data_format=data_format)
        self.conv1_3 = ConvBNLayer(
            in_channels=32,
            out_channels=64,
            kernel_size=3,
            stride=1,
            act='relu',
            data_format=data_format)
        self.pool2d_max = nn.MaxPool2D(
            kernel_size=3, stride=2, padding=1, data_format=data_format)

        # self.block_list = []
        self.stage_list = []
        if layers >= 50:
            for block in range(len(depth)):
                shortcut = False
                block_list = []
                for i in range(depth[block]):
                    if layers in [101, 152] and block == 2:
                        if i == 0:
                            conv_name = "res" + str(block + 2) + "a"
                        else:
                            conv_name = "res" + str(block + 2) + "b" + str(i)
                    else:
                        conv_name = "res" + str(block + 2) + chr(97 + i)

                    ###############################################################################
                    # Add dilation rate for some segmentation tasks, if dilation_dict is not None.
                    dilation_rate = dilation_dict[
                        block] if dilation_dict and block in dilation_dict else 1

                    # Here 'block' is actually the stage index, and 'i' is the block index within the stage.
                    # In stage 4, scale the dilation_rate by multi_grid if given.
                    if block == 3:
                        dilation_rate = dilation_rate * multi_grid[i]
                    ###############################################################################

                    bottleneck_block = self.add_sublayer(
                        'bb_%d_%d' % (block, i),
                        BottleneckBlock(
                            in_channels=num_channels[block]
                            if i == 0 else num_filters[block] * 4,
                            out_channels=num_filters[block],
                            stride=2 if i == 0 and block != 0 and
                            dilation_rate == 1 else 1,
                            shortcut=shortcut,
                            if_first=block == i == 0,
                            dilation=dilation_rate,
                            data_format=data_format))

                    block_list.append(bottleneck_block)
                    shortcut = True
                self.stage_list.append(block_list)
        else:
            for block in range(len(depth)):
                shortcut = False
                block_list = []
                for i in range(depth[block]):
                    dilation_rate = dilation_dict[block] \
                        if dilation_dict and block in dilation_dict else 1
                    if block == 3:
                        dilation_rate = dilation_rate * multi_grid[i]

                    basic_block = self.add_sublayer(
                        'bb_%d_%d' % (block, i),
                        BasicBlock(
                            in_channels=num_channels[block]
                            if i == 0 else num_filters[block],
                            out_channels=num_filters[block],
                            stride=2 if i == 0 and block != 0 \
                                and dilation_rate == 1 else 1,
                            dilation=dilation_rate,
                            shortcut=shortcut,
                            if_first=block == i == 0,
                            data_format=data_format))
                    block_list.append(basic_block)
                    shortcut = True
                self.stage_list.append(block_list)


    def forward(self, inputs):
        y = self.conv1_1(inputs)
        y = self.conv1_2(y)
        y = self.conv1_3(y)
        self.conv1_logit = y.clone()
        y = self.pool2d_max(y)

        # A feature list saves the output feature map of each stage.
        feat_list = []
        for stage in self.stage_list:
            for block in stage:
                y = block(y)
            feat_list.append(y)

        return feat_list


def ResNet18_vd(**args):
    model = ResNet_vd(layers=18, **args)
    return model


def ResNet34_vd(**args):
    model = ResNet_vd(layers=34, **args)
    return model



def ResNet50_vd(**args):
    model = ResNet_vd(layers=50, **args)
    return model



def ResNet101_vd(**args):
    model = ResNet_vd(layers=101, **args)
    return model
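
The effect of output_stride on the backbone can be traced without building the network. The sketch below mirrors the stride and dilation rules of ResNet_vd above (a stride-2 stem convolution, a stride-2 max pooling, and a stride-2 first block in every later stage unless that stage is dilated) and reports the feature-map size of each stage. The helper names are hypothetical, and a square input whose side is divisible by 32 is assumed.

```python
def stage_strides(output_stride=None):
    """Cumulative stride of each of the four stages, following ResNet_vd above."""
    dilation_dict = {8: {2: 2, 3: 4}, 16: {3: 2}}.get(output_stride, {})
    stride = 4  # stem conv (stride 2) followed by max pooling (stride 2)
    strides = []
    for stage in range(4):
        dilated = dilation_dict.get(stage, 1) != 1
        # A stage downsamples only if it is not the first one and is not dilated.
        if stage != 0 and not dilated:
            stride *= 2
        strides.append(stride)
    return strides


def feature_sizes(input_size, output_stride=None):
    """Spatial size of each stage output for a square input."""
    return [input_size // s for s in stage_strides(output_stride)]


print(feature_sizes(512, output_stride=8))
print(feature_sizes(512, output_stride=16))
```

With output_stride=8, the last three stages keep the same resolution and rely on dilation instead of downsampling to enlarge the receptive field.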

4.4 DeepLabV3+ implementation

class ASPPModule(nn.Layer):
    """
    Atrous Spatial Pyramid Pooling.
    Args:
        aspp_ratios (tuple): The dilation rates used in the ASPP module.
        in_channels (int): The number of input channels.
        out_channels (int): The number of output channels.
        align_corners (bool): An argument of F.interpolate. It should be set to False when the output size of feature
            is even, e.g. 1024x512, otherwise it is True, e.g. 769x769.
        use_sep_conv (bool, optional): If using separable conv in ASPP module. Default: False.
        image_pooling (bool, optional): If augmented with image-level features. Default: False
    """

    def __init__(self,
                 aspp_ratios,
                 in_channels,
                 out_channels,
                 align_corners,
                 use_sep_conv=False,
                 image_pooling=False,
                 data_format='NCHW'):
        super().__init__()

        self.align_corners = align_corners
        self.data_format = data_format
        self.aspp_blocks = nn.LayerList()

        for ratio in aspp_ratios:
            if use_sep_conv and ratio > 1:
                conv_func = SeparableConvBNReLU
            else:
                conv_func = ConvBNReLU

            block = conv_func(
                in_channels=in_channels,
                out_channels=out_channels,
                kernel_size=1 if ratio == 1 else 3,
                dilation=ratio,
                padding=0 if ratio == 1 else ratio,
                data_format=data_format)
            self.aspp_blocks.append(block)

        out_size = len(self.aspp_blocks)

        if image_pooling:
            self.global_avg_pool = nn.Sequential(
                nn.AdaptiveAvgPool2D(
                    output_size=(1, 1), data_format=data_format),
                ConvBNReLU(
                    in_channels,
                    out_channels,
                    kernel_size=1,
                    bias_attr=False,
                    data_format=data_format))
            out_size += 1
        self.image_pooling = image_pooling

        self.conv_bn_relu = ConvBNReLU(
            in_channels=out_channels * out_size,
            out_channels=out_channels,
            kernel_size=1,
            data_format=data_format)

        self.dropout = nn.Dropout(p=0.1)  # drop rate

    def forward(self, x):
        outputs = []
        if self.data_format == 'NCHW':
            interpolate_shape = paddle.shape(x)[2:]
            axis = 1
        else:
            interpolate_shape = paddle.shape(x)[1:3]
            axis = -1
        for block in self.aspp_blocks:
            y = block(x)
            outputs.append(y)

        if self.image_pooling:
            img_avg = self.global_avg_pool(x)
            img_avg = F.interpolate(
                img_avg,
                interpolate_shape,
                mode='bilinear',
                align_corners=self.align_corners,
                data_format=self.data_format)
            outputs.append(img_avg)

        x = paddle.concat(outputs, axis=axis)
        x = self.conv_bn_relu(x)
        x = self.dropout(x)

        return x

class DeepLabV3P(nn.Layer):
    """
    The DeepLabV3Plus implementation based on PaddlePaddle.
    The original article refers to
     Liang-Chieh Chen, et, al. "Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation"
     (https://arxiv.org/abs/1802.02611)
    Args:
        num_classes (int): The unique number of target classes.
        backbone (paddle.nn.Layer): Backbone network, currently support Resnet50_vd/Resnet101_vd/Xception65.
        backbone_indices (tuple, optional): Two values in the tuple indicate the indices of output of backbone.
           Default: (0, 3).
        aspp_ratios (tuple, optional): The dilation rates used in the ASPP module.
            If output_stride=16, aspp_ratios should be set as (1, 6, 12, 18).
            If output_stride=8, aspp_ratios is (1, 12, 24, 36).
            Default: (1, 6, 12, 18).
        aspp_out_channels (int, optional): The output channels of ASPP module. Default: 256.
        align_corners (bool, optional): An argument of F.interpolate. It should be set to False when the feature size is even,
            e.g. 1024x512, otherwise it is True, e.g. 769x769. Default: False.
        pretrained (str, optional): The path or url of pretrained model. Default: None.
        data_format(str, optional): Data format that specifies the layout of input. It can be "NCHW" or "NHWC". Default: "NCHW".
    """

    def __init__(self,
                 num_classes,
                 backbone,
                 backbone_indices=(0, 3),
                 aspp_ratios=(1, 6, 12, 18),
                 aspp_out_channels=256,
                 align_corners=False,
                 pretrained=None,
                 data_format="NCHW"):
        super().__init__()

        self.backbone = backbone
        backbone_channels = [
            backbone.feat_channels[i] for i in backbone_indices
        ]

        self.head = DeepLabV3PHead(
            num_classes,
            backbone_indices,
            backbone_channels,
            aspp_ratios,
            aspp_out_channels,
            align_corners,
            data_format=data_format)

        self.align_corners = align_corners
        self.pretrained = pretrained
        self.data_format = data_format


    def forward(self, x):
        feat_list = self.backbone(x)
        logit_list = self.head(feat_list)
        if self.data_format == 'NCHW':
            ori_shape = paddle.shape(x)[2:]
        else:
            ori_shape = paddle.shape(x)[1:3]
        return [
            F.interpolate(
                logit,
                ori_shape,
                mode='bilinear',
                align_corners=self.align_corners,
                data_format=self.data_format) for logit in logit_list
        ]



class DeepLabV3PHead(nn.Layer):
    """
    The DeepLabV3PHead implementation based on PaddlePaddle.
    Args:
        num_classes (int): The unique number of target classes.
        backbone_indices (tuple): Two values in the tuple indicate the indices of output of backbone.
            the first index will be taken as a low-level feature in Decoder component;
            the second one will be taken as input of ASPP component.
            Usually backbone consists of four downsampling stage, and return an output of
            each stage. If we set it as (0, 3), it means taking feature map of the first
            stage in backbone as low-level feature used in Decoder, and feature map of the fourth
            stage as input of ASPP.
        backbone_channels (tuple): The same length with "backbone_indices". It indicates the channels of corresponding index.
        aspp_ratios (tuple): The dilation rates used in the ASPP module.
        aspp_out_channels (int): The output channels of ASPP module.
        align_corners (bool): An argument of F.interpolate. It should be set to False when the output size of feature
            is even, e.g. 1024x512, otherwise it is True, e.g. 769x769.
        data_format(str, optional): Data format that specifies the layout of input. It can be "NCHW" or "NHWC". Default: "NCHW".
    """

    def __init__(self,
                 num_classes,
                 backbone_indices,
                 backbone_channels,
                 aspp_ratios,
                 aspp_out_channels,
                 align_corners,
                 data_format='NCHW'):
        super().__init__()

        self.aspp = ASPPModule(
            aspp_ratios,
            backbone_channels[1],
            aspp_out_channels,
            align_corners,
            use_sep_conv=True,
            image_pooling=True,
            data_format=data_format)
        self.decoder = Decoder(
            num_classes,
            backbone_channels[0],
            align_corners,
            data_format=data_format)
        self.backbone_indices = backbone_indices

    def forward(self, feat_list):
        logit_list = []
        low_level_feat = feat_list[self.backbone_indices[0]]
        x = feat_list[self.backbone_indices[1]]
        x = self.aspp(x)
        logit = self.decoder(x, low_level_feat)
        logit_list.append(logit)

        return logit_list

class Decoder(nn.Layer):
    """
    Decoder module of DeepLabV3P model
    Args:
        num_classes (int): The number of classes.
        in_channels (int): The number of input channels in decoder module.
    """

    def __init__(self,
                 num_classes,
                 in_channels,
                 align_corners,
                 data_format='NCHW'):
        super(Decoder, self).__init__()

        self.data_format = data_format
        self.conv_bn_relu1 = ConvBNReLU(
            in_channels=in_channels,
            out_channels=48,
            kernel_size=1,
            data_format=data_format)

        self.conv_bn_relu2 = SeparableConvBNReLU(
            in_channels=304,
            out_channels=256,
            kernel_size=3,
            padding=1,
            data_format=data_format)
        self.conv_bn_relu3 = SeparableConvBNReLU(
            in_channels=256,
            out_channels=256,
            kernel_size=3,
            padding=1,
            data_format=data_format)
        self.conv = nn.Conv2D(
            in_channels=256,
            out_channels=num_classes,
            kernel_size=1,
            data_format=data_format)

        self.align_corners = align_corners

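    def forward(self, x, low_level_feat):
        # Reduce the low-level features to 48 channels with a 1x1 convolution.
        low_level_feat = self.conv_bn_relu1(low_level_feat)
        # Pick the spatial dimensions and the channel axis for the active layout.
        if self.data_format == 'NCHW':
            low_level_shape = paddle.shape(low_level_feat)[-2:]
            axis = 1
        else:
            low_level_shape = paddle.shape(low_level_feat)[1:3]
            axis = -1
        # Upsample the ASPP output to the resolution of the low-level features.
        x = F.interpolate(
            x,
            low_level_shape,
            mode='bilinear',
            align_corners=self.align_corners,
            data_format=self.data_format)
        # Concatenate along the channel axis, refine, then classify each pixel.
        x = paddle.concat([x, low_level_feat], axis=axis)
        x = self.conv_bn_relu2(x)
        x = self.conv_bn_relu3(x)
        x = self.conv(x)
        return x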
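
The hard-coded in_channels=304 in the decoder is easier to follow when the tensor shapes are traced by hand. The sketch below is plain Python and does not call Paddle; it only mirrors the shape arithmetic of Decoder.forward in NCHW layout. The helper name trace_decoder_shapes and the example input shapes (a 256-channel ASPP output at 64x64, a 256-channel low-level map at 128x128, 20 classes) are assumptions chosen for illustration, not values confirmed by the notebook's configuration.

```python
def trace_decoder_shapes(aspp_shape, low_level_shape, num_classes):
    """Follow (N, C, H, W) shapes through the decoder's forward pass.

    Hypothetical helper: it reproduces the shape bookkeeping only,
    no convolution or interpolation is actually computed.
    """
    n, aspp_c, _, _ = aspp_shape
    _, _, low_h, low_w = low_level_shape

    # conv_bn_relu1: a 1x1 conv reduces the low-level features to 48 channels.
    low = (n, 48, low_h, low_w)
    # F.interpolate: the ASPP output is resized to the low-level spatial size.
    up = (n, aspp_c, low_h, low_w)
    # paddle.concat on the channel axis adds the two channel counts.
    cat = (n, up[1] + low[1], low_h, low_w)
    # conv_bn_relu2 and conv_bn_relu3 output 256 channels; a 3x3 kernel
    # with padding=1 leaves height and width unchanged.
    fused = (n, 256, low_h, low_w)
    # The final 1x1 conv yields one logit map per class.
    logits = (n, num_classes, low_h, low_w)
    return {"low": low, "up": up, "cat": cat, "fused": fused, "logits": logits}


shapes = trace_decoder_shapes(
    aspp_shape=(1, 256, 64, 64),         # assumed ASPP output
    low_level_shape=(1, 256, 128, 128),  # assumed low-level feature map
    num_classes=20,                      # assumed class count
)
for name, shape in shapes.items():
    print(f"{name:>6}: {shape}")
```

Under these assumptions the concatenated tensor has 256 + 48 = 304 channels, which matches the value expected by conv_bn_relu2. The logits come out at the resolution of the low-level features, so they still have to be resized to the input size outside this decoder.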

4.5 Model visualization

Call the summary interface provided by PaddlePaddle on the assembled model to inspect and verify its layer structure and parameter counts.
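
The Param # column in the summary output further below can be checked by hand, which is a quick way to confirm that a layer has the shape you expect. The sketch below is plain Python with two hypothetical helpers, conv2d_params and batchnorm2d_params. It assumes square kernels and bias-free convolutions, and it assumes the summary counts four values per BatchNorm2D channel (scale, shift, running mean, running variance), which is consistent with the rows in the table.

```python
def conv2d_params(in_channels, out_channels, kernel_size, groups=1, bias=False):
    """Parameter count of a 2-D convolution with a square kernel."""
    weights = (in_channels // groups) * out_channels * kernel_size * kernel_size
    return weights + (out_channels if bias else 0)


def batchnorm2d_params(num_channels):
    """Scale, shift, running mean and running variance: 4 values per channel."""
    return 4 * num_channels


# Rows taken from the summary table of the backbone stem and stage 2.
print(conv2d_params(3, 32, 3))      # Conv2D-1:  3 -> 32, 3x3   -> 864
print(batchnorm2d_params(32))       # BatchNorm2D-1, 32 ch      -> 128
print(conv2d_params(32, 32, 3))     # Conv2D-2:  32 -> 32, 3x3  -> 9216
print(conv2d_params(64, 64, 1))     # Conv2D-4:  64 -> 64, 1x1  -> 4096
print(conv2d_params(128, 128, 3))   # Conv2D-15: 128 -> 128, 3x3 -> 147456
```

A count that disagrees with the table usually points to a different kernel size, a grouped (depthwise) convolution, or a bias term that this sketch leaves out by default.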

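# Build the ResNet50_vd backbone and wrap it in DeepLabV3P.
net = ResNet50_vd()
cnn = DeepLabV3P(NUM_CLASSES, net)

# Print a per-layer summary for one 3x512x512 input (batch size 1).
paddle.summary(cnn, (1, 3, 512, 512))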

---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
    Layer (type)                                       Input Shape                                                                   Output Shape                                     Param #    
===================================================================================================================================================================================================
      Conv2D-1                                      [[1, 3, 512, 512]]                                                            [1, 32, 256, 256]                                     864      
    BatchNorm2D-1                                  [[1, 32, 256, 256]]                                                            [1, 32, 256, 256]                                     128      
       ReLU-1                                      [[1, 32, 256, 256]]                                                            [1, 32, 256, 256]                                      0       
    Activation-1                                   [[1, 32, 256, 256]]                                                            [1, 32, 256, 256]                                      0       
    ConvBNLayer-1                                   [[1, 3, 512, 512]]                                                            [1, 32, 256, 256]                                      0       
      Conv2D-2                                     [[1, 32, 256, 256]]                                                            [1, 32, 256, 256]                                    9,216     
    BatchNorm2D-2                                  [[1, 32, 256, 256]]                                                            [1, 32, 256, 256]                                     128      
       ReLU-2                                      [[1, 32, 256, 256]]                                                            [1, 32, 256, 256]                                      0       
    Activation-2                                   [[1, 32, 256, 256]]                                                            [1, 32, 256, 256]                                      0       
    ConvBNLayer-2                                  [[1, 32, 256, 256]]                                                            [1, 32, 256, 256]                                      0       
      Conv2D-3                                     [[1, 32, 256, 256]]                                                            [1, 64, 256, 256]                                   18,432     
    BatchNorm2D-3                                  [[1, 64, 256, 256]]                                                            [1, 64, 256, 256]                                     256      
       ReLU-3                                      [[1, 64, 256, 256]]                                                            [1, 64, 256, 256]                                      0       
    Activation-3                                   [[1, 64, 256, 256]]                                                            [1, 64, 256, 256]                                      0       
    ConvBNLayer-3                                  [[1, 32, 256, 256]]                                                            [1, 64, 256, 256]                                      0       
     MaxPool2D-1                                   [[1, 64, 256, 256]]                                                            [1, 64, 128, 128]                                      0       
      Conv2D-4                                     [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                    4,096     
    BatchNorm2D-4                                  [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                     256      
       ReLU-4                                      [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
    Activation-4                                   [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
    ConvBNLayer-4                                  [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
      Conv2D-5                                     [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                   36,864     
    BatchNorm2D-5                                  [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                     256      
       ReLU-5                                      [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
    Activation-5                                   [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
    ConvBNLayer-5                                  [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
      Conv2D-6                                     [[1, 64, 128, 128]]                                                            [1, 256, 128, 128]                                  16,384     
    BatchNorm2D-6                                  [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                   1,024     
    Activation-6                                   [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
    ConvBNLayer-6                                  [[1, 64, 128, 128]]                                                            [1, 256, 128, 128]                                     0       
      Conv2D-7                                     [[1, 64, 128, 128]]                                                            [1, 256, 128, 128]                                  16,384     
    BatchNorm2D-7                                  [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                   1,024     
    Activation-7                                   [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
    ConvBNLayer-7                                  [[1, 64, 128, 128]]                                                            [1, 256, 128, 128]                                     0       
       ReLU-6                                      [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
    Activation-8                                   [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
  BottleneckBlock-1                                [[1, 64, 128, 128]]                                                            [1, 256, 128, 128]                                     0       
      Conv2D-8                                     [[1, 256, 128, 128]]                                                           [1, 64, 128, 128]                                   16,384     
    BatchNorm2D-8                                  [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                     256      
       ReLU-7                                      [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
    Activation-9                                   [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
    ConvBNLayer-8                                  [[1, 256, 128, 128]]                                                           [1, 64, 128, 128]                                      0       
      Conv2D-9                                     [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                   36,864     
    BatchNorm2D-9                                  [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                     256      
       ReLU-8                                      [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
    Activation-10                                  [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
    ConvBNLayer-9                                  [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
      Conv2D-10                                    [[1, 64, 128, 128]]                                                            [1, 256, 128, 128]                                  16,384     
   BatchNorm2D-10                                  [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                   1,024     
    Activation-11                                  [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
   ConvBNLayer-10                                  [[1, 64, 128, 128]]                                                            [1, 256, 128, 128]                                     0       
       ReLU-9                                      [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
    Activation-12                                  [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
  BottleneckBlock-2                                [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
      Conv2D-11                                    [[1, 256, 128, 128]]                                                           [1, 64, 128, 128]                                   16,384     
   BatchNorm2D-11                                  [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                     256      
       ReLU-10                                     [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
    Activation-13                                  [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
   ConvBNLayer-11                                  [[1, 256, 128, 128]]                                                           [1, 64, 128, 128]                                      0       
      Conv2D-12                                    [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                   36,864     
   BatchNorm2D-12                                  [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                     256      
       ReLU-11                                     [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
    Activation-14                                  [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
   ConvBNLayer-12                                  [[1, 64, 128, 128]]                                                            [1, 64, 128, 128]                                      0       
      Conv2D-13                                    [[1, 64, 128, 128]]                                                            [1, 256, 128, 128]                                  16,384     
   BatchNorm2D-13                                  [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                   1,024     
    Activation-15                                  [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
   ConvBNLayer-13                                  [[1, 64, 128, 128]]                                                            [1, 256, 128, 128]                                     0       
       ReLU-12                                     [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
    Activation-16                                  [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
  BottleneckBlock-3                                [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
      Conv2D-14                                    [[1, 256, 128, 128]]                                                           [1, 128, 128, 128]                                  32,768     
   BatchNorm2D-14                                  [[1, 128, 128, 128]]                                                           [1, 128, 128, 128]                                    512      
       ReLU-13                                     [[1, 128, 128, 128]]                                                           [1, 128, 128, 128]                                     0       
    Activation-17                                  [[1, 128, 128, 128]]                                                           [1, 128, 128, 128]                                     0       
   ConvBNLayer-14                                  [[1, 256, 128, 128]]                                                           [1, 128, 128, 128]                                     0       
      Conv2D-15                                    [[1, 128, 128, 128]]                                                            [1, 128, 64, 64]                                   147,456    
   BatchNorm2D-15                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                     512      
       ReLU-14                                      [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
    Activation-18                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
   ConvBNLayer-15                                  [[1, 128, 128, 128]]                                                            [1, 128, 64, 64]                                      0       
      Conv2D-16                                     [[1, 128, 64, 64]]                                                             [1, 512, 64, 64]                                   65,536     
   BatchNorm2D-16                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                    2,048     
    Activation-19                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
   ConvBNLayer-16                                   [[1, 128, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
    AvgPool2D-17                                   [[1, 256, 128, 128]]                                                            [1, 256, 64, 64]                                      0       
      Conv2D-17                                     [[1, 256, 64, 64]]                                                             [1, 512, 64, 64]                                   131,072    
   BatchNorm2D-17                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                    2,048     
    Activation-20                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
   ConvBNLayer-17                                  [[1, 256, 128, 128]]                                                            [1, 512, 64, 64]                                      0       
       ReLU-15                                      [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
    Activation-21                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
  BottleneckBlock-4                                [[1, 256, 128, 128]]                                                            [1, 512, 64, 64]                                      0       
      Conv2D-18                                     [[1, 512, 64, 64]]                                                             [1, 128, 64, 64]                                   65,536     
   BatchNorm2D-18                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                     512      
       ReLU-16                                      [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
    Activation-22                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
   ConvBNLayer-18                                   [[1, 512, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
      Conv2D-19                                     [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                   147,456    
   BatchNorm2D-19                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                     512      
       ReLU-17                                      [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
    Activation-23                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
   ConvBNLayer-19                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
      Conv2D-20                                     [[1, 128, 64, 64]]                                                             [1, 512, 64, 64]                                   65,536     
   BatchNorm2D-20                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                    2,048     
    Activation-24                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
   ConvBNLayer-20                                   [[1, 128, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
       ReLU-18                                      [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
    Activation-25                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
  BottleneckBlock-5                                 [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
      Conv2D-21                                     [[1, 512, 64, 64]]                                                             [1, 128, 64, 64]                                   65,536     
   BatchNorm2D-21                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                     512      
       ReLU-19                                      [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
    Activation-26                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
   ConvBNLayer-21                                   [[1, 512, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
      Conv2D-22                                     [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                   147,456    
   BatchNorm2D-22                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                     512      
       ReLU-20                                      [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
    Activation-27                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
   ConvBNLayer-22                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
      Conv2D-23                                     [[1, 128, 64, 64]]                                                             [1, 512, 64, 64]                                   65,536     
   BatchNorm2D-23                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                    2,048     
    Activation-28                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
   ConvBNLayer-23                                   [[1, 128, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
       ReLU-21                                      [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
    Activation-29                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
  BottleneckBlock-6                                 [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
      Conv2D-24                                     [[1, 512, 64, 64]]                                                             [1, 128, 64, 64]                                   65,536     
   BatchNorm2D-24                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                     512      
       ReLU-22                                      [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
    Activation-30                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
   ConvBNLayer-24                                   [[1, 512, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
      Conv2D-25                                     [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                   147,456    
   BatchNorm2D-25                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                     512      
       ReLU-23                                      [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
    Activation-31                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
   ConvBNLayer-25                                   [[1, 128, 64, 64]]                                                             [1, 128, 64, 64]                                      0       
      Conv2D-26                                     [[1, 128, 64, 64]]                                                             [1, 512, 64, 64]                                   65,536     
   BatchNorm2D-26                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                    2,048     
    Activation-32                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
   ConvBNLayer-26                                   [[1, 128, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
       ReLU-24                                      [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
    Activation-33                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
  BottleneckBlock-7                                 [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
      Conv2D-27                                     [[1, 512, 64, 64]]                                                             [1, 256, 64, 64]                                   131,072    
   BatchNorm2D-27                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-25                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-34                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
   ConvBNLayer-27                                   [[1, 512, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-28                                     [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                   589,824    
   BatchNorm2D-28                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-26                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-35                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
   ConvBNLayer-28                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-29                                     [[1, 256, 64, 64]]                                                            [1, 1024, 64, 64]                                   262,144    
   BatchNorm2D-29                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                    4,096     
    Activation-36                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
   ConvBNLayer-29                                   [[1, 256, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
      Conv2D-30                                     [[1, 512, 64, 64]]                                                            [1, 1024, 64, 64]                                   524,288    
   BatchNorm2D-30                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                    4,096     
    Activation-37                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
   ConvBNLayer-30                                   [[1, 512, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
       ReLU-27                                     [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
    Activation-38                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
  BottleneckBlock-8                                 [[1, 512, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
      Conv2D-31                                    [[1, 1024, 64, 64]]                                                             [1, 256, 64, 64]                                   262,144    
   BatchNorm2D-31                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-28                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-39                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
   ConvBNLayer-31                                  [[1, 1024, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-32                                     [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                   589,824    
   BatchNorm2D-32                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-29                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-40                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
   ConvBNLayer-32                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-33                                     [[1, 256, 64, 64]]                                                            [1, 1024, 64, 64]                                   262,144    
   BatchNorm2D-33                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                    4,096     
    Activation-41                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
   ConvBNLayer-33                                   [[1, 256, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
       ReLU-30                                     [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
    Activation-42                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
  BottleneckBlock-9                                [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
      Conv2D-34                                    [[1, 1024, 64, 64]]                                                             [1, 256, 64, 64]                                   262,144    
   BatchNorm2D-34                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-31                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-43                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
   ConvBNLayer-34                                  [[1, 1024, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-35                                     [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                   589,824    
   BatchNorm2D-35                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-32                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-44                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
   ConvBNLayer-35                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-36                                     [[1, 256, 64, 64]]                                                            [1, 1024, 64, 64]                                   262,144    
   BatchNorm2D-36                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                    4,096     
    Activation-45                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
   ConvBNLayer-36                                   [[1, 256, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
       ReLU-33                                     [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
    Activation-46                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
 BottleneckBlock-10                                [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
      Conv2D-37                                    [[1, 1024, 64, 64]]                                                             [1, 256, 64, 64]                                   262,144    
   BatchNorm2D-37                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-34                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-47                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
   ConvBNLayer-37                                  [[1, 1024, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-38                                     [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                   589,824    
   BatchNorm2D-38                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-35                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-48                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
   ConvBNLayer-38                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-39                                     [[1, 256, 64, 64]]                                                            [1, 1024, 64, 64]                                   262,144    
   BatchNorm2D-39                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                    4,096     
    Activation-49                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
   ConvBNLayer-39                                   [[1, 256, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
       ReLU-36                                     [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
    Activation-50                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
 BottleneckBlock-11                                [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
      Conv2D-40                                    [[1, 1024, 64, 64]]                                                             [1, 256, 64, 64]                                   262,144    
   BatchNorm2D-40                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-37                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-51                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
   ConvBNLayer-40                                  [[1, 1024, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-41                                     [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                   589,824    
   BatchNorm2D-41                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-38                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-52                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
   ConvBNLayer-41                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-42                                     [[1, 256, 64, 64]]                                                            [1, 1024, 64, 64]                                   262,144    
   BatchNorm2D-42                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                    4,096     
    Activation-53                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
   ConvBNLayer-42                                   [[1, 256, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
       ReLU-39                                     [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
    Activation-54                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
 BottleneckBlock-12                                [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
      Conv2D-43                                    [[1, 1024, 64, 64]]                                                             [1, 256, 64, 64]                                   262,144    
   BatchNorm2D-43                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-40                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-55                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
   ConvBNLayer-43                                  [[1, 1024, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-44                                     [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                   589,824    
   BatchNorm2D-44                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-41                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-56                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
   ConvBNLayer-44                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-45                                     [[1, 256, 64, 64]]                                                            [1, 1024, 64, 64]                                   262,144    
   BatchNorm2D-45                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                    4,096     
    Activation-57                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
   ConvBNLayer-45                                   [[1, 256, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
       ReLU-42                                     [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
    Activation-58                                  [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
 BottleneckBlock-13                                [[1, 1024, 64, 64]]                                                            [1, 1024, 64, 64]                                      0       
      Conv2D-46                                    [[1, 1024, 64, 64]]                                                             [1, 512, 64, 64]                                   524,288    
   BatchNorm2D-46                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                    2,048     
       ReLU-43                                      [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
    Activation-59                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
   ConvBNLayer-46                                  [[1, 1024, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
      Conv2D-47                                     [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                  2,359,296   
   BatchNorm2D-47                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                    2,048     
       ReLU-44                                      [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
    Activation-60                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
   ConvBNLayer-47                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
      Conv2D-48                                     [[1, 512, 64, 64]]                                                            [1, 2048, 64, 64]                                  1,048,576   
   BatchNorm2D-48                                  [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                    8,192     
    Activation-61                                  [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
   ConvBNLayer-48                                   [[1, 512, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
      Conv2D-49                                    [[1, 1024, 64, 64]]                                                            [1, 2048, 64, 64]                                  2,097,152   
   BatchNorm2D-49                                  [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                    8,192     
    Activation-62                                  [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
   ConvBNLayer-49                                  [[1, 1024, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
       ReLU-45                                     [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
    Activation-63                                  [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
 BottleneckBlock-14                                [[1, 1024, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
      Conv2D-50                                    [[1, 2048, 64, 64]]                                                             [1, 512, 64, 64]                                  1,048,576   
   BatchNorm2D-50                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                    2,048     
       ReLU-46                                      [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
    Activation-64                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
   ConvBNLayer-50                                  [[1, 2048, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
      Conv2D-51                                     [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                  2,359,296   
   BatchNorm2D-51                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                    2,048     
       ReLU-47                                      [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
    Activation-65                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
   ConvBNLayer-51                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
      Conv2D-52                                     [[1, 512, 64, 64]]                                                            [1, 2048, 64, 64]                                  1,048,576   
   BatchNorm2D-52                                  [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                    8,192     
    Activation-66                                  [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
   ConvBNLayer-52                                   [[1, 512, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
       ReLU-48                                     [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
    Activation-67                                  [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
 BottleneckBlock-15                                [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
      Conv2D-53                                    [[1, 2048, 64, 64]]                                                             [1, 512, 64, 64]                                  1,048,576   
   BatchNorm2D-53                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                    2,048     
       ReLU-49                                      [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
    Activation-68                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
   ConvBNLayer-53                                  [[1, 2048, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
      Conv2D-54                                     [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                  2,359,296   
   BatchNorm2D-54                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                    2,048     
       ReLU-50                                      [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
    Activation-69                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
   ConvBNLayer-54                                   [[1, 512, 64, 64]]                                                             [1, 512, 64, 64]                                      0       
      Conv2D-55                                     [[1, 512, 64, 64]]                                                            [1, 2048, 64, 64]                                  1,048,576   
   BatchNorm2D-55                                  [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                    8,192     
    Activation-70                                  [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
   ConvBNLayer-55                                   [[1, 512, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
       ReLU-51                                     [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
    Activation-71                                  [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
 BottleneckBlock-16                                [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
     ResNet_vd-1                                    [[1, 3, 512, 512]]                               [[1, 256, 128, 128], [1, 512, 64, 64], [1, 1024, 64, 64], [1, 2048, 64, 64]]        0       
      Conv2D-56                                    [[1, 2048, 64, 64]]                                                             [1, 256, 64, 64]                                   524,544    
   BatchNorm2D-56                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-52                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-72                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    ConvBNReLU-1                                   [[1, 2048, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-57                                    [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                   20,480     
   BatchNorm2D-57                                  [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                    8,192     
      ConvBN-1                                     [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
      Conv2D-58                                    [[1, 2048, 64, 64]]                                                             [1, 256, 64, 64]                                   524,544    
   BatchNorm2D-58                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-53                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-73                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    ConvBNReLU-2                                   [[1, 2048, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
SeparableConvBNReLU-1                              [[1, 2048, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-59                                    [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                   20,480     
   BatchNorm2D-59                                  [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                    8,192     
      ConvBN-2                                     [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
      Conv2D-60                                    [[1, 2048, 64, 64]]                                                             [1, 256, 64, 64]                                   524,544    
   BatchNorm2D-60                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-54                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-74                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    ConvBNReLU-3                                   [[1, 2048, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
SeparableConvBNReLU-2                              [[1, 2048, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-61                                    [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                   20,480     
   BatchNorm2D-61                                  [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                    8,192     
      ConvBN-3                                     [[1, 2048, 64, 64]]                                                            [1, 2048, 64, 64]                                      0       
      Conv2D-62                                    [[1, 2048, 64, 64]]                                                             [1, 256, 64, 64]                                   524,544    
   BatchNorm2D-62                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-55                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-75                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    ConvBNReLU-4                                   [[1, 2048, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
SeparableConvBNReLU-3                              [[1, 2048, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
 AdaptiveAvgPool2D-1                               [[1, 2048, 64, 64]]                                                             [1, 2048, 1, 1]                                       0       
      Conv2D-63                                     [[1, 2048, 1, 1]]                                                               [1, 256, 1, 1]                                    524,288    
   BatchNorm2D-63                                    [[1, 256, 1, 1]]                                                               [1, 256, 1, 1]                                     1,024     
       ReLU-56                                       [[1, 256, 1, 1]]                                                               [1, 256, 1, 1]                                       0       
    Activation-76                                    [[1, 256, 1, 1]]                                                               [1, 256, 1, 1]                                       0       
    ConvBNReLU-5                                    [[1, 2048, 1, 1]]                                                               [1, 256, 1, 1]                                       0       
      Conv2D-64                                    [[1, 1280, 64, 64]]                                                             [1, 256, 64, 64]                                   327,936    
   BatchNorm2D-64                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                    1,024     
       ReLU-57                                      [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    Activation-77                                   [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    ConvBNReLU-6                                   [[1, 1280, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Dropout-1                                     [[1, 256, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
    ASPPModule-1                                   [[1, 2048, 64, 64]]                                                             [1, 256, 64, 64]                                      0       
      Conv2D-65                                    [[1, 256, 128, 128]]                                                           [1, 48, 128, 128]                                   12,336     
   BatchNorm2D-65                                  [[1, 48, 128, 128]]                                                            [1, 48, 128, 128]                                     192      
       ReLU-58                                     [[1, 48, 128, 128]]                                                            [1, 48, 128, 128]                                      0       
    Activation-78                                  [[1, 48, 128, 128]]                                                            [1, 48, 128, 128]                                      0       
    ConvBNReLU-7                                   [[1, 256, 128, 128]]                                                           [1, 48, 128, 128]                                      0       
      Conv2D-66                                    [[1, 304, 128, 128]]                                                           [1, 304, 128, 128]                                   3,040     
   BatchNorm2D-66                                  [[1, 304, 128, 128]]                                                           [1, 304, 128, 128]                                   1,216     
      ConvBN-4                                     [[1, 304, 128, 128]]                                                           [1, 304, 128, 128]                                     0       
      Conv2D-67                                    [[1, 304, 128, 128]]                                                           [1, 256, 128, 128]                                  78,080     
   BatchNorm2D-67                                  [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                   1,024     
       ReLU-59                                     [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
    Activation-79                                  [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
    ConvBNReLU-8                                   [[1, 304, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
SeparableConvBNReLU-4                              [[1, 304, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
      Conv2D-68                                    [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                   2,560     
   BatchNorm2D-68                                  [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                   1,024     
      ConvBN-5                                     [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
      Conv2D-69                                    [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                  65,792     
   BatchNorm2D-69                                  [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                   1,024     
       ReLU-60                                     [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
    Activation-80                                  [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
    ConvBNReLU-9                                   [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
SeparableConvBNReLU-5                              [[1, 256, 128, 128]]                                                           [1, 256, 128, 128]                                     0       
      Conv2D-70                                    [[1, 256, 128, 128]]                                                           [1, 20, 128, 128]                                    5,140     
      Decoder-1                           [[1, 256, 64, 64], [1, 256, 128, 128]]                                                  [1, 20, 128, 128]                                      0       
  DeepLabV3PHead-1    [[[1, 256, 128, 128], [1, 512, 64, 64], [1, 1024, 64, 64], [1, 2048, 64, 64]]]                             [[1, 20, 128, 128]]                                     0       
===================================================================================================================================================================================================
Total params: 26,794,500
Trainable params: 26,652,804
Non-trainable params: 141,696
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
Input size (MB): 3.00
Forward/backward pass size (MB): 7731.53
Params size (MB): 102.21
Estimated Total Size (MB): 7836.74
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------






{'total_params': 26794500, 'trainable_params': 26652804}

5. Model Training

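Mean Pixel Accuracy (MPA) is a simple refinement of Pixel Accuracy (PA): for each class, compute the fraction of that class's pixels that are classified correctly, then average these per-class ratios over all classes.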
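As a minimal sketch of how PA and MPA differ, the NumPy snippet below scores a hypothetical 2x4 label map in which one class is rare. The arrays and the helper name `pixel_accuracies` are illustrative assumptions and are not part of the original project.

```python
import numpy as np

def pixel_accuracies(pred, label):
    """Return (PA, MPA) for integer class maps of identical shape.

    MPA is averaged over the classes that actually occur in `label`.
    """
    correct = pred == label
    pa = correct.mean()
    per_class = [correct[label == c].mean() for c in np.unique(label)]
    return float(pa), float(np.mean(per_class))

# Hypothetical toy data: class 0 dominates, class 1 is rare.
label = np.array([[0, 0, 0, 0],
                  [0, 0, 1, 1]])
pred = np.array([[0, 0, 0, 0],
                 [0, 0, 0, 1]])

pa, mpa = pixel_accuracies(pred, label)
print(f"PA={pa:.3f}  MPA={mpa:.3f}")
```

Seven of the eight pixels are correct, so PA is 0.875. Class 0 scores 1.0 and class 1 scores 0.5, so MPA is 0.75: the rare class weighs as much as the frequent one.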

train_dataset = CIHPDataset(mode='train')  # training dataset
val_dataset = CIHPDataset(mode='val')  # validation dataset
train_loader = paddle.io.DataLoader(train_dataset, batch_size=16, shuffle=True)
val_loader = paddle.io.DataLoader(val_dataset, batch_size = 10, shuffle=False)

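def MeanPixelAccuracy(pred, label):
    """Return (mean pixel accuracy, overall pixel accuracy) for one batch.

    pred holds class scores with the class axis third from last;
    label holds integer class ids.
    """
    if isinstance(pred, (paddle.Tensor, paddle.fluid.core.eager.Tensor)):
        pred = pred.numpy()
    if isinstance(label, (paddle.Tensor, paddle.fluid.core.eager.Tensor)):
        label = label.numpy()
    preindex = np.argmax(pred, axis=-3)  # predicted class per pixel
    correct = preindex == label
    classlabel = np.unique(label)  # only the classes present in this batch
    accs = []
    for c in classlabel:
        clabel = (label == c)
        coun = clabel.sum()  # number of pixels of class c
        right = (clabel & correct).sum()  # correctly predicted pixels of class c
        accs.append(right / coun)
    return np.mean(accs), (correct.sum() / correct.size)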

    
optim = paddle.optimizer.RMSProp(learning_rate=0.001, 
                                 rho=0.9, 
                                 momentum=0.0, 
                                 epsilon=1e-07, 
                                 centered=False,
                                 parameters=cnn.parameters())
criterion = paddle.nn.CrossEntropyLoss(axis=1)
num_epochs = 40
max_accuracy = 0
save_path ="bestmodel"
his_trainloss = []
his_trainacc = []
his_valloss = []
his_valacc = []
his_trainmenacc = []
his_valmeanacc = []
for epoch in range(num_epochs):
    
    traintotal,trainacc,trainmeanacc,trainloss = 0, 0, 0, 0
    cnn.train()
    for i, (img, labels) in enumerate(train_loader):
        predict = cnn(img)[0]
        # predict = paddle.to_tensor(predict)[0]
        loss = criterion(predict, labels)
        
        loss.backward()
        optim.step()
        optim.clear_grad()
        tmeanacc,acc = MeanPixelAccuracy(predict, labels)
        trainmeanacc += tmeanacc
        trainacc += acc
        trainloss += loss.numpy().item()
        if i%10==0:
            print(f"epoch {epoch} iter {i} loss: {loss.numpy().item():0.2f} accuracy {acc:0.2f} mean accuracy {tmeanacc:0.2f}")
        traintotal += 1
    valtotal, valacc, valmeanacc, valloss = 0, 0, 0, 0
    cnn.eval()
    for i, (img, labels) in enumerate(val_loader):
        with paddle.no_grad():
            predict = cnn(img)[0]
            loss = criterion(predict, labels)
        vmeanacc, vacc = MeanPixelAccuracy(predict, labels)
        valacc += vacc
        valmeanacc += vmeanacc
        valloss += loss.numpy().item()
        valtotal += 1
    # Report epoch averages (not the last batch's value) for every metric
    print(f"epoch {epoch} val Loss {valloss/valtotal:.2f} Accuracy {valacc/valtotal:.2f} mean accuracy {valmeanacc/valtotal:.2f}")
    # Save only the best model
    if valacc/valtotal>max_accuracy:
        max_accuracy = valacc/valtotal
        model_state = cnn.state_dict()
        paddle.save(model_state, save_path)
        print("max accuracy {}".format(max_accuracy))

    his_trainloss.append(trainloss/traintotal)
    his_trainacc.append(trainacc/traintotal)
    his_valloss.append(valloss/valtotal)
    his_valacc.append(valacc/valtotal)
    his_trainmenacc.append(trainmeanacc/traintotal)
    his_valmeanacc.append(valmeanacc/valtotal)

fig, ax = plt.subplots(2, 1, figsize=(8, 10))
ax = ax.ravel()


ax[0].plot(his_trainloss)
ax[0].plot(his_valloss)
ax[0].set_title("Model {}".format("Loss"))
ax[0].set_xlabel("epochs")
ax[0].legend(["train", "val"])

ax[1].plot(his_trainacc)
ax[1].plot(his_valacc)
ax[1].set_title("Model {}".format("Accuracy"))
ax[1].set_xlabel("epochs")
# ax[1].set_ylabel(metric)
ax[1].legend(["train", "val"])

6. Model Prediction

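The raw prediction from the model is a tensor with 20 class channels per pixel (shape (N, 512, 512, 20) in channels-last notation), where each channel corresponds to one predicted label; taking the argmax over the channel axis yields the label of each pixel.
To visualize the results, we plot them as an RGB mask in which each pixel takes the unique color of its predicted label. The color for each label can be looked up in the human_colormap.mat file provided with the dataset. We also plot an overlay of the RGB segmentation mask on the input image, which makes it easier to identify the different categories present in the image.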
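The steps described above (argmax over the class channel, color lookup, overlay) can be sketched on toy data with NumPy alone. The 3-class colormap, the 2x2 score tensor, and the function names `colorize` and `blend` are hypothetical. The blend is a plain weighted sum with weights 0.35 and 0.65 and is not claimed to match the rounding of `cv2.addWeighted` exactly.

```python
import numpy as np

def colorize(mask, colormap):
    """Map an (H, W) integer label mask to an (H, W, 3) uint8 RGB image."""
    return colormap[mask]  # integer-array indexing picks one color row per pixel

def blend(image, colored_mask, alpha=0.35):
    """Weighted sum of image and colored mask, clipped to the uint8 range."""
    out = alpha * image.astype(np.float32) + (1.0 - alpha) * colored_mask.astype(np.float32)
    return np.clip(np.rint(out), 0, 255).astype(np.uint8)

# Hypothetical colormap for 3 classes (the dataset's colormap covers 20 labels).
colormap = np.array([[0, 0, 0], [255, 0, 0], [0, 0, 255]], dtype=np.uint8)

# Hypothetical channels-first scores of shape (C=3, H=2, W=2).
scores = np.array([
    [[0.90, 0.1], [0.2, 0.3]],  # class 0
    [[0.05, 0.8], [0.1, 0.3]],  # class 1
    [[0.05, 0.1], [0.7, 0.4]],  # class 2
])

mask = scores.argmax(axis=0)     # (H, W): predicted label per pixel
rgb = colorize(mask, colormap)   # (H, W, 3): one color per label
image = np.full((2, 2, 3), 100, dtype=np.uint8)  # flat gray stand-in image
overlay = blend(image, rgb)
print(mask)
```

Indexing the colormap with the whole mask replaces a per-class loop with a single lookup, which is convenient when the number of classes grows.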

colormap = loadmat(
    "./instance-level_human_parsing/human_colormap.mat"
)["colormap"]
colormap = colormap * 255
colormap = colormap.astype(np.uint8)

def decode_segmentation_masks(mask, colormap, n_classes):
    r = np.zeros_like(mask).astype(np.uint8)
    g = np.zeros_like(mask).astype(np.uint8)
    b = np.zeros_like(mask).astype(np.uint8)
    for l in range(0, n_classes):
        idx = mask == l
        r[idx] = colormap[l, 0]
        g[idx] = colormap[l, 1]
        b[idx] = colormap[l, 2]
    rgb = np.stack([r, g, b], axis=2)
    return rgb

def get_overlay(image, colored_mask):
    overlay = cv2.addWeighted(image, 0.35, colored_mask, 0.65, 0)
    return overlay

def preshow(path):
    plt.figure(figsize=(10, 10))
    i = 0
    mask_idx = 0
    param_state_dict = paddle.load( "bestmodel")
    cnn.set_dict(param_state_dict)
    cnn.eval()  # inference mode
    with open(path, 'r') as f:
        for line in f.readlines():
            if i > 8: 
                break

            image_path, label_path = line.strip().split('\t')
            resize_t = T.Compose([           
                T.Transpose(), 
                T.Normalize(mean=127.5, std=127.5)
            ])
            image = PilImage.open(image_path)
            label = PilImage.open(label_path)
            
            image = np.array(image).astype('uint8')
            label = np.array(label).astype('uint8')

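            cimg = np.expand_dims(resize_t(image), axis=0)
            with paddle.no_grad():  # no gradients are needed for inference
                data = cnn(paddle.to_tensor(cimg))[0][0]
            mask = np.argmax(data.numpy(), axis=0)  # predicted label per pixel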
   
            prediction_colormap = decode_segmentation_masks(mask, colormap, 20)
            overlay = get_overlay(image, prediction_colormap)

            
            plt.subplot(3, 3, i + 1)
            plt.imshow(image)
            plt.axis("off")

            plt.subplot(3, 3, i + 2)
            plt.imshow(overlay)
            plt.axis("off")
                   
            plt.subplot(3, 3, i + 3)
            plt.imshow(prediction_colormap)
            plt.axis("off")
            i += 3
            mask_idx += 1

    plt.show()

Training set results

preshow('./train.txt')

Validation set results

preshow('./val.txt')

This article is a repost of the original project.
