A Hands-On PyTorch Implementation of Inception-v4, Inception-ResNet-v1, and Inception-ResNet-v2
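The import block of the original post is not shown in this excerpt; the snippets below assume roughly the following (a sketch based on what the code actually uses; the torchsummary import used further down is noted where it appears):

import math
import os
import time

import torch
import torch.nn as nn
import torch.nn.functional as F
import torch.optim as optim
import matplotlib.pyplot as plt
from PIL import Image
from torchvision import transforms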
class BasicConv2d(nn.Module):
def __init__(self, in_channels, out_channels, **kwargs):
super(BasicConv2d, self).__init__()
self.conv = nn.Conv2d(in_channels, out_channels, **kwargs)
self.relu = nn.ReLU(inplace=True)
def forward(self, x):
x = self.conv(x)
x = self.relu(x)
return x
Inception
Defines the network structure of the Inception module.
class Inception(nn.Module):
def __init__(self, in_channels, ch1x1, ch3x3red, ch3x3, ch5x5red, ch5x5, pool_proj):
"""
Args:
in_channels: 整个Inception的输入维度
ch1x1: 分支1(1x1卷积核)的out_channels
ch3x3red: 分支2(3x3卷积核)的in_channels
ch3x3: 分支2(3x3卷积核)的out_channels
ch5x5red: 分支3(5x5卷积核)的in_channels
ch5x5: 分支3(5x5卷积核)的out_channels
pool_proj: 分支4(1x1卷积核)的out_channels
"""
super(Inception, self).__init__()
# Branch 1: 1x1 conv
self.branch1 = BasicConv2d(in_channels, ch1x1, kernel_size=1)
# Branch 2: 1x1 conv -> 3x3 conv
self.branch2 = nn.Sequential(
BasicConv2d(in_channels, ch3x3red, kernel_size=1),
BasicConv2d(ch3x3red, ch3x3, kernel_size=3, padding=1) # padding keeps the output size equal to the input size
)
# Branch 3: 1x1 conv -> 5x5 conv
self.branch3 = nn.Sequential(
BasicConv2d(in_channels, ch5x5red, kernel_size=1),
BasicConv2d(ch5x5red, ch5x5, kernel_size=5, padding=2) # padding keeps the output size equal to the input size
)
# Branch 4: 3x3 max pool -> 1x1 conv
self.branch4 = nn.Sequential(
nn.MaxPool2d(kernel_size=3, stride=1, padding=1), # stride 1 with padding keeps the output size equal to the input size
BasicConv2d(in_channels, pool_proj, kernel_size=1) # 1x1 conv from in_channels to pool_proj channels
)
def forward(self, x):
branch1 = self.branch1(x)
branch2 = self.branch2(x)
branch3 = self.branch3(x)
branch4 = self.branch4(x)
outputs = [branch1, branch2, branch3, branch4]
# concatenate along dim 1 (the channel dimension; dim 0 is the batch)
return torch.cat(outputs, 1)
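The output channel count of an Inception block is simply ch1x1 + ch3x3 + ch5x5 + pool_proj. For example, the first block used below, Inception(192, 64, 96, 128, 16, 32, 32), maps a 28x28x192 feature map to 28x28x256. A quick sanity check (a minimal sketch, assuming the classes and imports above):

import torch

block = Inception(192, 64, 96, 128, 16, 32, 32)
x = torch.randn(1, 192, 28, 28)
print(block(x).shape)  # expected: torch.Size([1, 256, 28, 28]), since 64 + 128 + 32 + 32 = 256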
InceptionAux
Defines the auxiliary classifier; it is only used during training.
class InceptionAux(nn.Module):
def __init__(self, in_channels, num_classes):
super(InceptionAux, self).__init__()
self.averagePool = nn.AvgPool2d(kernel_size=5, stride=3)
self.conv = BasicConv2d(in_channels, 128, kernel_size=1) # output[batch, 128, 4, 4]
self.fc1 = nn.Linear(2048, 1024)
self.fc2 = nn.Linear(1024, num_classes)
def forward(self, x):
# aux1: N x 512 x 14 x 14, aux2: N x 528 x 14 x 14
x = self.averagePool(x)
# aux1: N x 512 x 4 x 4, aux2: N x 528 x 4 x 4
x = self.conv(x)
# N x 128 x 4 x 4
x = torch.flatten(x, 1)
x = F.dropout(x, 0.5, training=self.training)
# N x 2048
x = F.relu(self.fc1(x), inplace=True)
x = F.dropout(x, 0.5, training=self.training)
# N x 1024
x = self.fc2(x)
# N x num_classes
return x
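The 2048 input features of fc1 come from flattening the 128x4x4 tensor (128 * 4 * 4 = 2048). A quick shape check for the aux1 input (a sketch, assuming the classes above):

import torch

aux = InceptionAux(512, num_classes=2)
x = torch.randn(1, 512, 14, 14)
print(aux(x).shape)  # expected: torch.Size([1, 2])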
class GoogLeNet(nn.Module):
def __init__(self, num_classes=2, aux_logits=True, init_weights=False):
super(GoogLeNet, self).__init__()
self.aux_logits = aux_logits
# 输入为224x224x3 -> 输出为112x112x64
self.conv1 = BasicConv2d(3, 64, kernel_size=7, stride=2, padding=3)
# 输入为112x112x64 -> 输出为56x56x64
# with ceil_mode=True the output size is rounded up instead of down, so border windows smaller than kernel_size are still pooled
self.maxpool1 = nn.MaxPool2d(3, stride=2, ceil_mode=True)
# 1x1的卷积核(带ReLu),增加非线性
self.conv2 = BasicConv2d(64, 64, kernel_size=1)
# 3x3的卷积核(带ReLu),输入为56x56x64 -> 输出为56x56x192
self.conv3 = BasicConv2d(64, 192, kernel_size=3, padding=1)
# 输入为56x56x192 -> 输出为28x28x192
self.maxpool2 = nn.MaxPool2d(3, stride=2, ceil_mode=True)
# 输入为28x28x192 -> 输出为28x28x(64+128+32+32=256)
self.inception3a = Inception(192, 64, 96, 128, 16, 32, 32)
# 输入为28x28x256 -> 输出为28x28x(128+192+96+64=480)
self.inception3b = Inception(256, 128, 128, 192, 32, 96, 64)
# 输入为28x28x480 -> 输出为14x14x480
self.maxpool3 = nn.MaxPool2d(3, stride=2, ceil_mode=True)
# 输入为14x14x480 -> 输出为14x14x(192+208+48+64=512)
self.inception4a = Inception(480, 192, 96, 208, 16, 48, 64)
# 输入为14x14x512 -> 输出为14x14x(160+224+64+64=512)
self.inception4b = Inception(512, 160, 112, 224, 24, 64, 64)
# 输入为14x14x512 -> 输出为14x14x(128+256+64+64=512)
self.inception4c = Inception(512, 128, 128, 256, 24, 64, 64)
# 输入为14x14x512 -> 输出为14x14x(112+288+64+64=528)
self.inception4d = Inception(512, 112, 144, 288, 32, 64, 64)
# 输入为14x14x528 -> 输出为14x14x(256+320+128+128=832)
self.inception4e = Inception(528, 256, 160, 320, 32, 128, 128)
# 输入为14x14x832 -> 输出为7x7x832
self.maxpool4 = nn.MaxPool2d(3, stride=2, ceil_mode=True)
# 输入为7x7x832 -> 输出为7x7x(256+320+128+128=832)
self.inception5a = Inception(832, 256, 160, 320, 32, 128, 128)
# 输入为7x7x832 -> 输出为7x7x(384+384+128+128=1024)
self.inception5b = Inception(832, 384, 192, 384, 48, 128, 128)
# 是否需要辅助分类器
if self.aux_logits:
self.aux1 = InceptionAux(512, num_classes)
self.aux2 = InceptionAux(528, num_classes)
# 输入为7x7x1024 -> 输出为1x1x1024
self.avgpool = nn.AvgPool2d(kernel_size=7, stride=1)
self.dropout = nn.Dropout(0.4)
self.fc = nn.Linear(1024, num_classes)
# 如果需要初始化参数,则调用函数
if init_weights:
self._initialize_weights()
def forward(self, x):
#------ Input block ------#
# N x 3 x 224 x 224
x = self.conv1(x)
# N x 64 x 112 x 112
x = self.maxpool1(x)
# N x 64 x 56 x 56
x = self.conv2(x)
# N x 64 x 56 x 56
x = self.conv3(x)
# N x 192 x 56 x 56
x = self.maxpool2(x)
#------ Inception 3a/3b/4a ------#
# N x 192 x 28 x 28
x = self.inception3a(x)
# N x 256 x 28 x 28
x = self.inception3b(x)
# N x 480 x 28 x 28
x = self.maxpool3(x)
# N x 480 x 14 x 14
x = self.inception4a(x)
#------ Auxiliary classifier 1 ------#
# N x 512 x 14 x 14
if self.training and self.aux_logits: # the aux head is skipped in eval mode
aux1 = self.aux1(x)
#------ Inception 4b/4c/4d ------#
x = self.inception4b(x)
# N x 512 x 14 x 14
x = self.inception4c(x)
# N x 512 x 14 x 14
x = self.inception4d(x)
#------ Auxiliary classifier 2 ------#
# N x 528 x 14 x 14
if self.training and self.aux_logits: # the aux head is skipped in eval mode
aux2 = self.aux2(x)
#------ Inception 4e/5a/5b ------#
x = self.inception4e(x)
# N x 832 x 14 x 14
x = self.maxpool4(x)
# N x 832 x 7 x 7
x = self.inception5a(x)
# N x 832 x 7 x 7
x = self.inception5b(x)
# N x 1024 x 7 x 7
#------ Output block ------#
x = self.avgpool(x)
# N x 1024 x 1 x 1
x = torch.flatten(x, 1)
# N x 1024
x = self.dropout(x)
x = self.fc(x)
# N x num_classes
if self.training and self.aux_logits: # the auxiliary classifiers are not used at test time
return x, aux2, aux1
return x
# 初始化参数
def _initialize_weights(self):
for m in self.modules():
if isinstance(m, nn.Conv2d):
nn.init.kaiming_normal_(m.weight, mode='fan_out', nonlinearity='relu')
if m.bias is not None:
nn.init.constant_(m.bias, 0)
elif isinstance(m, nn.Linear):
nn.init.normal_(m.weight, 0, 0.01)
nn.init.constant_(m.bias, 0)
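Before training it is worth a quick forward-pass check (a sketch, assuming the GoogLeNet class above) to confirm the three outputs in training mode and the single output in eval mode:

import torch

net = GoogLeNet(num_classes=2, aux_logits=True, init_weights=True)
x = torch.randn(2, 3, 224, 224)
net.train()
logits, aux2, aux1 = net(x)
print(logits.shape, aux2.shape, aux1.shape)  # each: torch.Size([2, 2])
net.eval()
print(net(x).shape)                          # torch.Size([2, 2])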
# create the model and move it to the GPU if available
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = GoogLeNet(num_classes=2, aux_logits=True, init_weights=True)
model.to(device)
# define the loss function and optimizer
criterion = nn.CrossEntropyLoss()
optimizer = optim.Adam(model.parameters(), lr=0.0001)
# number of training epochs
epoch = 10
# lists for plotting
plt_epoch = [] # x axis: the epoch index
Train_Loss = [] # training loss
Train_Accuracy = [] # training accuracy
Test_Loss = [] # test loss
Test_Accuracy = [] # test accuracy
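trainloader and testloader are used below but built elsewhere in the original post. A minimal sketch of what they might look like (the ./data/train and ./data/test paths and the normalisation values are assumptions; the 299x299 models later in the post would use Resize((299, 299)) instead):

from torchvision import datasets, transforms
from torch.utils.data import DataLoader

# hypothetical data pipeline; the original builds its DataLoaders elsewhere
transform = transforms.Compose([
    transforms.Resize((224, 224)),          # GoogLeNet v1 expects 224x224 input
    transforms.ToTensor(),
    transforms.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5)),
])
train_set = datasets.ImageFolder('./data/train', transform=transform)
test_set = datasets.ImageFolder('./data/test', transform=transform)
trainloader = DataLoader(train_set, batch_size=64, shuffle=True)
testloader = DataLoader(test_set, batch_size=64, shuffle=False)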
def train_runner(model, epoch):
# training mode: enables BatchNorm updates and Dropout
model.train()
total = 0 # total number of samples in this epoch
correct = 0.0 # number of correctly classified samples in this epoch
epoch_avg_loss = 0.0 # average loss of this epoch
batch_avg_loss = 0.0 # running loss over the last 100 batches (reset after each printout)
# enumerate over the DataLoader to get the batch index together with the data
for batch_idx, data in enumerate(trainloader, 0):
inputs, labels = data # unpack the batch
inputs, labels = inputs.to(device), labels.to(device) # move the data to the device
optimizer.zero_grad() # clear the accumulated gradients
logits, aux_logits2, aux_logits1 = model(inputs) # forward pass: main output plus two auxiliary outputs
# compute the losses
loss0 = criterion(logits, labels) # main classifier
loss1 = criterion(aux_logits1, labels) # auxiliary classifier 1
loss2 = criterion(aux_logits2, labels) # auxiliary classifier 2
loss = loss0 + loss1 * 0.3 + loss2 * 0.3 # weighted sum: aux losses weighted by 0.3
# argmax over dim=1 returns the column index of the row-wise maximum
predict = logits.argmax(dim=1) # predicted class with the highest score
total += labels.size(0) # accumulate the sample count
correct += (predict == labels).sum().item() # count correctly classified samples
epoch_avg_loss += loss.item() # accumulate the loss for the epoch average
batch_avg_loss += loss.item() # accumulate the loss for the 100-batch average
loss.backward() # backpropagation
optimizer.step() # update the parameters
# print the average loss every 100 batches
if batch_idx % 100 == 99:
print('[epoch:%d, batch_idx:%5d] batch_avg_loss: %.6f' % (epoch, batch_idx+1, batch_avg_loss/100))
batch_avg_loss = 0.0
# the training batch size is 64, so round up to get the number of batches
batch_num = math.ceil(total/64)
# after each epoch, print the average loss and the accuracy
epoch_avg_loss /= batch_num
print("Train Epoch{} \t epoch_avg_loss: {:.6f}, accuracy: {:.6f}%".format(epoch, epoch_avg_loss, 100*(correct/total)))
# record for plotting
Train_Loss.append(epoch_avg_loss)
Train_Accuracy.append(correct/total)
def test_runner(model):
# switch to evaluation mode: otherwise BatchNorm statistics and Dropout
# would keep changing even though we are not training
model.eval()
# counters for accuracy and loss
correct = 0.0
test_loss = 0.0
total = 0
# torch.no_grad disables gradient tracking, so no backpropagation happens
with torch.no_grad():
for data, label in testloader:
data, label = data.to(device), label.to(device)
output = model(data)
test_loss += criterion(output, label).item()
predict = output.argmax(dim=1)
# count correct predictions
total += label.size(0)
correct += (predict == label).sum().item()
# average the accumulated loss over the number of test samples
test_loss /= total
# print loss and accuracy
print("test_average_loss: {:.6f}, accuracy: {:.6f}%".format(test_loss, 100*(correct/total)))
# record for plotting
Test_Loss.append(test_loss)
Test_Accuracy.append(correct/total)
if __name__ == '__main__':
print("start_time",time.strftime('%Y-%m-%d %H:%M:%S',time.localtime(time.time())))
for epoch in range(1, epoch+1):
plt_epoch.append(epoch)
train_runner(model, epoch)
test_runner(model)
print("end_time: ",time.strftime('%Y-%m-%d %H:%M:%S',time.localtime(time.time())),'\n')
print('Finished Training')
plt.subplot(2,2,1), plt.plot(plt_epoch, Train_Loss), plt.title('Train_Loss'), plt.grid()
plt.subplot(2,2,2), plt.plot(plt_epoch, Train_Accuracy), plt.title('Train_Accuracy'), plt.grid()
plt.subplot(2,2,3), plt.plot(plt_epoch, Test_Loss), plt.title('Test_Loss'), plt.grid()
plt.subplot(2,2,4), plt.plot(plt_epoch, Test_Accuracy), plt.title('Test_Accuracy'), plt.grid()
plt.tight_layout()
plt.show()
class BasicConv2d(nn.Module):
def __init__(self, in_channels, out_channels, **kwargs):
super(BasicConv2d, self).__init__()
self.conv = nn.Conv2d(in_channels, out_channels, **kwargs)
self.relu = nn.ReLU6(inplace=True)
def forward(self, x):
x = self.conv(x)
x = self.relu(x)
return x
class ConvBNReLU(nn.Module):
def __init__(self, in_channels, out_channels, **kwargs):
super(ConvBNReLU, self).__init__()
self.conv = nn.Conv2d(in_channels, out_channels, **kwargs)
self.bn = nn.BatchNorm2d(out_channels)
self.relu = nn.ReLU6(inplace=True)
def forward(self, x):
x = self.conv(x)
x = self.bn(x)
x = self.relu(x)
return x
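One small detail worth noting: because a BatchNorm layer immediately follows the convolution, the conv bias is redundant (BatchNorm's own learnable shift absorbs it). The networks below keep the original definition unchanged; the variant here is only an optional tweak, not part of the original code:

import torch.nn as nn

class ConvBNReLUNoBias(nn.Module):
    """Optional variant of ConvBNReLU above with bias=False on the conv,
    since the following BatchNorm layer already has a learnable shift."""
    def __init__(self, in_channels, out_channels, **kwargs):
        super().__init__()
        self.conv = nn.Conv2d(in_channels, out_channels, bias=False, **kwargs)
        self.bn = nn.BatchNorm2d(out_channels)
        self.relu = nn.ReLU6(inplace=True)
    def forward(self, x):
        return self.relu(self.bn(self.conv(x)))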
class InceptionV2_A(nn.Module):
def __init__(self, in_channels, out_channels_1red, out_channels_1, out_channels_2red, out_channels_2, out_channels_3, out_channels_4):
"""
Args:
in_channels: 整个Inception的输入维度
out_channels_1red: 分支1(1x1卷积核)的out_channels
out_channels_1: 分支1(3x3卷积核)的out_channels
out_channels_2red: 分支2(1x1卷积核)的out_channels
out_channels_2: 分支2(3x3卷积核)的out_channels
out_channels_3: 分支3(1x1卷积核)的out_channels
out_channels_4: 分支4(1x1卷积核)的out_channels
"""
super(InceptionV2_A, self).__init__()
# 分支1:(1x1->3x3->3x3)
self.branch1 = nn.Sequential(
ConvBNReLU(in_channels, out_channels_1red, kernel_size=1),
ConvBNReLU(out_channels_1red, out_channels_1, kernel_size=3, padding=1), # 保证输出大小等于输入大小
ConvBNReLU(out_channels_1, out_channels_1, kernel_size=3, padding=1) # 保证输出大小等于输入大小
)
# 分支2:(1x1->3x3)
self.branch2 = nn.Sequential(
ConvBNReLU(in_channels, out_channels_2red, kernel_size=1),
ConvBNReLU(out_channels_2red, out_channels_2, kernel_size=3, padding=1) # 保证输出大小等于输入大小
)
# 分支3:(MaxPool->1x1)
self.branch3 = nn.Sequential(
nn.MaxPool2d(kernel_size=3, stride=1, padding=1),
ConvBNReLU(in_channels, out_channels_3, kernel_size=1) # 保证输出大小等于输入大小
)
# 分支4(1x1)
self.branch4 = ConvBNReLU(in_channels, out_channels_4, kernel_size=1)
def forward(self, x):
branch1 = self.branch1(x)
branch2 = self.branch2(x)
branch3 = self.branch3(x)
branch4 = self.branch4(x)
outputs = [branch1, branch2, branch3, branch4]
return torch.cat(outputs, 1)
class InceptionV2_B(nn.Module):
def __init__(self, in_channels, out_channels_1red, out_channels_1, out_channels_2red, out_channels_2, out_channels_3, out_channels_4):
"""
Args:
in_channels: 整个Inception的输入维度
out_channels_1red: 分支1(1x1卷积核)的out_channels
out_channels_1: 分支1(3x1卷积核)的out_channels
out_channels_2red: 分支2(1x1卷积核)的out_channels
out_channels_2: 分支2(3x1卷积核)的out_channels
out_channels_3: 分支3(1x1卷积核)的out_channels
out_channels_4: 分支4(1x1卷积核)的out_channels
"""
super(InceptionV2_B, self).__init__()
# 分支1:(1x1->1x3->3x1->1x3->3x1)
self.branch1 = nn.Sequential(
ConvBNReLU(in_channels, out_channels_1red, kernel_size=1),
# 使用1x3卷积核时,需要分别设置padding,以保证WxH不发生改变
ConvBNReLU(out_channels_1red, out_channels_1red, kernel_size=[1,3], padding=[0,1]),
ConvBNReLU(out_channels_1red, out_channels_1red, kernel_size=[3,1], padding=[1,0]),
ConvBNReLU(out_channels_1red, out_channels_1red, kernel_size=[1,3], padding=[0,1]),
ConvBNReLU(out_channels_1red, out_channels_1, kernel_size=[3,1], padding=[1,0])
)
# 分支2:(1x1->1x3->3x1)
self.branch2 = nn.Sequential(
ConvBNReLU(in_channels, out_channels_2red, kernel_size=1),
ConvBNReLU(out_channels_2red, out_channels_2red, kernel_size=[1,3], padding=[0,1]),
ConvBNReLU(out_channels_2red, out_channels_2, kernel_size=[3,1], padding=[1,0])
)
# 分支3:(MaxPool->1x1)
self.branch3 = nn.Sequential(
nn.MaxPool2d(kernel_size=3, stride=1, padding=1),
ConvBNReLU(in_channels, out_channels_3, kernel_size=1)
)
# 分支4:(1x1)
self.branch4 = ConvBNReLU(in_channels, out_channels_4, kernel_size=1)
def forward(self, x):
branch1 = self.branch1(x)
branch2 = self.branch2(x)
branch3 = self.branch3(x)
branch4 = self.branch4(x)
outputs = [branch1, branch2, branch3, branch4]
return torch.cat(outputs, 1)
class InceptionV2_C(nn.Module):
def __init__(self, in_channels, out_channels_1red, out_channels_1, out_channels_2red, out_channels_2, out_channels_3, out_channels_4):
"""
Args:
in_channels: 整个Inception的输入维度
out_channels_1red: 分支1(1x1卷积核)的out_channels
out_channels_1: 分支1(1x3与3x1卷积核)的out_channels
out_channels_2red: 分支2(1x1卷积核)的out_channels
out_channels_2: 分支2(1x3与3x1卷积核)的out_channels
out_channels_3: 分支3(1x1卷积核)的out_channels
out_channels_4: 分支4(1x1卷积核)的out_channels
"""
super(InceptionV2_C, self).__init__()
# 分支1:(1x1->3x3->两个分支:①1x3;②3x1)
self.branch1_conv1x1 = ConvBNReLU(in_channels, out_channels_1red, kernel_size=1)
self.branch1_conv3x3 = ConvBNReLU(out_channels_1red, out_channels_1, kernel_size=3, padding=1)
self.branch1_conv1x3 = ConvBNReLU(out_channels_1, out_channels_1, kernel_size=[1,3], padding=[0,1])
self.branch1_conv3x1 = ConvBNReLU(out_channels_1, out_channels_1, kernel_size=[3,1], padding=[1,0])
# 分支2:(1x1->两个分支:①1x3;②3x1)
self.branch2_conv1x1 = ConvBNReLU(in_channels, out_channels_2red, kernel_size=1)
self.branch2_conv1x3 = ConvBNReLU(out_channels_2red, out_channels_2, kernel_size=[1,3],padding=[0,1])
self.branch2_conv3x1 = ConvBNReLU(out_channels_2red, out_channels_2, kernel_size=[3,1],padding=[1,0])
# 分支3:(MaxPool->1x1)
self.branch3 = nn.Sequential(
nn.MaxPool2d(kernel_size=3, stride=1, padding=1),
ConvBNReLU(in_channels, out_channels_3, kernel_size=1)
)
# 分支4:(1x1)
self.branch4 = ConvBNReLU(in_channels, out_channels_4, kernel_size=1)
def forward(self, x):
# 分支1
branch1_tmp = self.branch1_conv1x1(x)
branch1_tmp = self.branch1_conv3x3(branch1_tmp)
branch1 = torch.cat([self.branch1_conv1x3(branch1_tmp), self.branch1_conv3x1(branch1_tmp)], dim=1)
# 分支2
branch2_tmp = self.branch2_conv1x1(x)
branch2 = torch.cat([self.branch2_conv1x3(branch2_tmp), self.branch2_conv3x1(branch2_tmp)], dim=1)
# 分支3
branch3 = self.branch3(x)
# 分支4
branch4 = self.branch4(x)
outputs = [branch1, branch2, branch3, branch4]
return torch.cat(outputs, 1)
class InceptionV2_D(nn.Module):
def __init__(self, in_channels, out_channels_1red, out_channels_1, out_channels_2red, out_channels_2):
super(InceptionV2_D, self).__init__()
# 分支1:(1x1->3x3->3x3)
self.branch1 = nn.Sequential(
ConvBNReLU(in_channels, out_channels_1red, kernel_size=1),
ConvBNReLU(out_channels_1red, out_channels_1, kernel_size=3, stride=1, padding=1),
ConvBNReLU(out_channels_1, out_channels_1, kernel_size=3, stride=2, padding=1)
)
# 分支2:(1x1->3x3)
self.branch2 = nn.Sequential(
ConvBNReLU(in_channels, out_channels_2red, kernel_size=1),
ConvBNReLU(out_channels_2red, out_channels_2, kernel_size=3, stride=2, padding=1)
)
# Branch 3: 3x3 max pool (stride 2)
self.branch3 = nn.MaxPool2d(kernel_size=3, stride=2, padding=1)
def forward(self, x):
branch1 = self.branch1(x)
branch2 = self.branch2(x)
branch3 = self.branch3(x)
outputs = [branch1, branch2, branch3]
return torch.cat(outputs, 1)
class GoogLeNetV2(nn.Module):
def __init__(self, num_classes=2, init_weights=False):
super(GoogLeNetV2, self).__init__()
# 输入299x299x3 -> 输出149x149x32
self.conv1 = ConvBNReLU(3, 32, kernel_size=3, stride=2)
# 输入149x149x32 -> 输出147x147x32
self.conv2 = ConvBNReLU(32, 32, kernel_size=3, stride=1)
# input 147x147x32 -> output 147x147x64
self.conv3 = ConvBNReLU(32, 64, kernel_size=3, stride=1, padding=1)
# 输入147x147x64 -> 输出73x73x64
self.maxpool1 = nn.MaxPool2d(3, stride=2)
# 输入73x73x64 -> 输出71x71x80
self.conv4 = ConvBNReLU(64, 80, kernel_size=3, stride=1)
# 输入71x71x80 -> 输出35x35x192
self.conv5 = ConvBNReLU(80, 192, kernel_size=3, stride=2)
# 输入35x35x192 -> 输出35x35x288
self.conv6 = BasicConv2d(192, 288, kernel_size=1)
# 输入35x35x288 -> 输出17x17x288
self.maxpool2 = nn.MaxPool2d(3, stride=2)
# 输入17x17x288 -> 输出17x17x(128+96+64+64=352)
self.inceptionA1 = InceptionV2_A(288, 96, 128, 96, 96, 64, 64)
# 输入17x17x352 -> 输出17x17x(256+128+96+128=608)
self.inceptionA2 = InceptionV2_A(352, 256, 256, 128, 128, 96, 128)
# 输入17x17x608 -> 输出17x17x(320+224+96+128=768)
self.inceptionA3 = InceptionV2_A(608, 256, 320, 128, 224, 96, 128)
# 输入17x17x768 -> 输出17x17x(384+256+96+128=864)
self.inceptionB1 = InceptionV2_B(768, 128, 384, 128, 256, 96, 128)
# 输入17x17x864 -> 输出17x17x(320+320+96+128=864)
self.inceptionB2 = InceptionV2_B(864, 256, 320, 160, 320, 96, 128)
# 输入17x17x864 -> 输出17x17x(272+368+96+128=864)
self.inceptionB3 = InceptionV2_B(864, 256, 272, 192, 368, 96, 128)
# 输入17x17x864 -> 输出17x17x(368+368+96+128=960)
self.inceptionB4 = InceptionV2_B(864, 256, 368, 192, 368, 96, 128)
# 输入17x17x960 -> 输出17x17x(496+400+128+256=1280)
self.inceptionB5 = InceptionV2_B(960, 320, 496, 256, 400, 128, 256)
# 输入17x17x1280 -> 输出8x8x1280
self.maxpool3 = nn.MaxPool2d(3, stride=2)
# 输入8x8x1280 -> 输出(384x2+256x2+128+256=1664)
self.inceptionC1 = InceptionV2_C(1280, 512, 384, 384, 256, 128, 256)
# 输入8x8x1664 -> 输出8x8x(216x2+192x2+80+128=1024)
self.inceptionC2 = InceptionV2_C(1664, 512, 216, 192, 192, 80, 128)
# 输出部分:平均池化->全连接层
self.avgpool = nn.AvgPool2d(kernel_size=8, stride=1)
self.dropout = nn.Dropout(0.5)
self.fc = nn.Linear(1024, num_classes)
if init_weights:
self._initialize_weights()
def forward(self, x):
x = self.conv1(x)
x = self.conv2(x)
x = self.conv3(x)
x = self.maxpool1(x)
x = self.conv4(x)
x = self.conv5(x)
x = self.conv6(x)
x = self.maxpool2(x)
x = self.inceptionA1(x)
x = self.inceptionA2(x)
x = self.inceptionA3(x)
x = self.inceptionB1(x)
x = self.inceptionB2(x)
x = self.inceptionB3(x)
x = self.inceptionB4(x)
x = self.inceptionB5(x)
x = self.maxpool3(x)
x = self.inceptionC1(x)
x = self.inceptionC2(x)
x = self.avgpool(x)
# N x 1024 x 1 x 1
x = torch.flatten(x, 1)
x = self.dropout(x)
x = self.fc(x)
return x
def _initialize_weights(self):
for m in self.modules():
if isinstance(m, nn.Conv2d):
nn.init.kaiming_normal_(m.weight, mode='fan_out', nonlinearity='relu')
if m.bias is not None:
nn.init.constant_(m.bias, 0)
elif isinstance(m, nn.Linear):
nn.init.normal_(m.weight, 0, 0.01)
nn.init.constant_(m.bias, 0)
# A quick way to check the layer-by-layer dimensions of the model:
# input = torch.ones((1,3,299,299))
# print(input.shape)
# model = GoogLeNetV2(num_classes=2, init_weights=True)
# output = model(input)
# print(output.shape)
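A runnable version of that check (a sketch, assuming the GoogLeNetV2 class above):

import torch

model = GoogLeNetV2(num_classes=2, init_weights=True)
x = torch.randn(1, 3, 299, 299)
with torch.no_grad():
    print(model(x).shape)  # expected: torch.Size([1, 2])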
class BasicConv2d(nn.Module):
def __init__(self, in_channels, out_channels, **kwargs):
super(BasicConv2d, self).__init__()
self.conv = nn.Conv2d(in_channels, out_channels, **kwargs)
self.relu = nn.ReLU6(inplace=True)
def forward(self, x):
x = self.conv(x)
x = self.relu(x)
return x
class ConvBNReLU(nn.Module):
def __init__(self, in_channels, out_channels, **kwargs):
super(ConvBNReLU, self).__init__()
self.conv = nn.Conv2d(in_channels, out_channels, **kwargs)
self.bn = nn.BatchNorm2d(out_channels)
self.relu = nn.ReLU6(inplace=True)
def forward(self, x):
x = self.conv(x)
x = self.bn(x)
x = self.relu(x)
return x
class Stem(nn.Module):
"""
stem block for Inception-v4
"""
def __init__(self):
super(Stem, self).__init__()
# 连续3个3x3的卷积核
self.step1 = nn.Sequential(
# 299x299x3 -> 149x149x32
ConvBNReLU(in_channels=3, out_channels=32, kernel_size=3, stride=2),
# 149x149x32 -> 147x147x32
ConvBNReLU(in_channels=32, out_channels=32, kernel_size=3, stride=1),
# 147x147x32 -> 147x147x64
ConvBNReLU(in_channels=32, out_channels=64, kernel_size=3, stride=1, padding=1),
)
# branch 1: 147x147x64 -> 73x73x64
self.step2_pool = nn.MaxPool2d(kernel_size=3, stride=2)
# branch 2: 147x147x64 -> 73x73x96
self.step2_conv = ConvBNReLU(in_channels=64, out_channels=96, kernel_size=3, stride=2)
# 分支1:1x1+3x3
self.step3_1 = nn.Sequential(
ConvBNReLU(in_channels=160, out_channels=64, kernel_size=1, stride=1),
ConvBNReLU(in_channels=64, out_channels=96, kernel_size=3, stride=1)
)
# 分支2:1x1+7x1+1x7+3x3
self.step3_2 = nn.Sequential(
ConvBNReLU(in_channels=160, out_channels=64, kernel_size=1, stride=1),
ConvBNReLU(in_channels=64, out_channels=64, kernel_size=[7,1], padding=[3,0]),
ConvBNReLU(in_channels=64, out_channels=64, kernel_size=[1,7], padding=[0,3]),
ConvBNReLU(in_channels=64, out_channels=96, kernel_size=3, stride=1)
)
# 分支1:池化
self.step4_pool = nn.MaxPool2d(kernel_size=3, stride=2)
# 分支2:3x3
self.step4_conv = ConvBNReLU(in_channels=192, out_channels=192, kernel_size=3, stride=2)
def forward(self, x):
out = self.step1(x)
tmp1 = self.step2_pool(out)
tmp2 = self.step2_conv(out)
out = torch.cat((tmp1, tmp2), 1)
tmp1 = self.step3_1(out)
tmp2 = self.step3_2(out)
out = torch.cat((tmp1, tmp2), 1)
tmp1 = self.step4_pool(out)
tmp2 = self.step4_conv(out)
outputs = [tmp1, tmp2]
return torch.cat(outputs, 1)
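A quick sanity check that the stem turns a 299x299x3 image into the 35x35x384 feature map expected by the Inception-A blocks (a sketch, assuming the Stem class above):

import torch

stem = Stem()
x = torch.randn(1, 3, 299, 299)
print(stem(x).shape)  # expected: torch.Size([1, 384, 35, 35])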
class InceptionV4_A(nn.Module):
def __init__(self, in_channels, out_channels_1, out_channels_2, out_channels_3red, out_channels_3, out_channels_4red, out_channels_4):
"""
Args:
in_channels: 整个Inception的输入维度
out_channels_1: 分支1(1x1卷积核)的out_channels
out_channels_2: 分支2(1x1卷积核)的out_channels
out_channels_3red: 分支3(1x1卷积核)的out_channels
out_channels_3: 分支3(3x3卷积核)的out_channels
out_channels_4red: 分支4(1x1卷积核)的out_channels
out_channels_4: 分支4(3x3卷积核)的out_channels
"""
super(InceptionV4_A, self).__init__()
# 分支1:avg -> 1x1
self.branch1 = nn.Sequential(
nn.AvgPool2d(kernel_size=3, stride=1, padding=1),
ConvBNReLU(in_channels, out_channels_1, kernel_size=1)
)
# 分支2:1x1
self.branch2 = nn.Sequential(
ConvBNReLU(in_channels, out_channels_2, kernel_size=1)
)
# 分支3:(1x1 -> 3x3)
self.branch3 = nn.Sequential(
ConvBNReLU(in_channels, out_channels_3red, kernel_size=1),
ConvBNReLU(out_channels_3red, out_channels_3, kernel_size=3, stride=1, padding=1)
)
# 分支4:(1x1 -> 3x3 -> 3x3)
self.branch4 = nn.Sequential(
ConvBNReLU(in_channels, out_channels_4red, kernel_size=1),
ConvBNReLU(out_channels_4red, out_channels_4, kernel_size=3, stride=1, padding=1),
ConvBNReLU(out_channels_4, out_channels_4, kernel_size=3, stride=1, padding=1)
)
def forward(self, x):
branch1 = self.branch1(x)
branch2 = self.branch2(x)
branch3 = self.branch3(x)
branch4 = self.branch4(x)
outputs = [branch1, branch2, branch3, branch4]
return torch.cat(outputs, 1)
class InceptionV4_B(nn.Module):
def __init__(self, in_channels, out_channels_1, out_channels_2,
out_channels_3_1x1, out_channels_3_1x7, out_channels_3,
out_channels_4_1x1, out_channels_4_1x7_1, out_channels_4_7x1_1,
out_channels_4_1x7_2, out_channels_4_7x1_2):
super(InceptionV4_B, self).__init__()
# 分支1:(AvgPool->1x1)
self.branch1 = nn.Sequential(
nn.AvgPool2d(kernel_size=3, stride=1, padding=1),
ConvBNReLU(in_channels, out_channels_1, kernel_size=1)
)
# 分支2:(1x1)
self.branch2 = ConvBNReLU(in_channels, out_channels_2, kernel_size=1)
# 分支3:(1x1->1x7->7x1)
self.branch3 = nn.Sequential(
ConvBNReLU(in_channels, out_channels_3_1x1, kernel_size=1),
ConvBNReLU(out_channels_3_1x1, out_channels_3_1x7, kernel_size=[1,7], padding=[0,3]),
ConvBNReLU(out_channels_3_1x7, out_channels_3, kernel_size=[7,1], padding=[3,0])
)
# 分支4:(1x1->1x7->7x1->1x7->7x1)
self.branch4 = nn.Sequential(
ConvBNReLU(in_channels, out_channels_4_1x1, kernel_size=1),
ConvBNReLU(out_channels_4_1x1, out_channels_4_1x7_1, kernel_size=[1,7], padding=[0,3]),
ConvBNReLU(out_channels_4_1x7_1, out_channels_4_7x1_1, kernel_size=[7,1], padding=[3,0]),
ConvBNReLU(out_channels_4_7x1_1, out_channels_4_1x7_2, kernel_size=[1,7], padding=[0,3]),
ConvBNReLU(out_channels_4_1x7_2, out_channels_4_7x1_2, kernel_size=[7,1], padding=[3,0])
)
def forward(self, x):
branch1 = self.branch1(x)
branch2 = self.branch2(x)
branch3 = self.branch3(x)
branch4 = self.branch4(x)
outputs = [branch1, branch2, branch3, branch4]
return torch.cat(outputs, 1)
class InceptionV4_C(nn.Module):
def __init__(self, in_channels, out_channels_1, out_channels_2,
out_channels_3red, out_channels_3,
out_channels_4_1x1, out_channels_4_1x3_1, out_channels_4_3x1_1,
out_channels_4_3x1_2, out_channels_4_1x3_2):
super(InceptionV4_C, self).__init__()
# 分支1:(AvgPool->1x1)
self.branch1 = nn.Sequential(
nn.AvgPool2d(kernel_size=3, stride=1, padding=1),
ConvBNReLU(in_channels, out_channels_1, kernel_size=1)
)
# 分支2:(1x1)
self.branch2 = ConvBNReLU(in_channels, out_channels_2, kernel_size=1)
# 分支3:(1x1->两个分支:①1x3;②3x1)
self.branch3_conv1x1 = ConvBNReLU(in_channels, out_channels_3red, kernel_size=1)
self.branch3_conv1x3 = ConvBNReLU(out_channels_3red, out_channels_3, kernel_size=[1,3],padding=[0,1])
self.branch3_conv3x1 = ConvBNReLU(out_channels_3red, out_channels_3, kernel_size=[3,1],padding=[1,0])
# 分支4:(1x1->1x3->3x1->两个分支:①1x3;②3x1)
self.branch4_step1 = nn.Sequential(
ConvBNReLU(in_channels, out_channels_4_1x1, kernel_size=1),
ConvBNReLU(out_channels_4_1x1, out_channels_4_1x3_1, kernel_size=[1,3],padding=[0,1]),
ConvBNReLU(out_channels_4_1x3_1, out_channels_4_3x1_1, kernel_size=[3,1],padding=[1,0])
)
self.branch4_conv3x1 = ConvBNReLU(out_channels_4_3x1_1, out_channels_4_3x1_2, kernel_size=[3,1],padding=[1,0])
self.branch4_conv1x3 = ConvBNReLU(out_channels_4_3x1_1, out_channels_4_1x3_2, kernel_size=[1,3],padding=[0,1])
def forward(self, x):
# 分支1
branch1 = self.branch1(x)
# 分支2
branch2 = self.branch2(x)
# 分支3
branch3_tmp = self.branch3_conv1x1(x)
branch3 = torch.cat([self.branch3_conv1x3(branch3_tmp), self.branch3_conv3x1(branch3_tmp)], dim=1)
# 分支4
branch4_tmp = self.branch4_step1(x)
branch4 = torch.cat([self.branch4_conv3x1(branch4_tmp), self.branch4_conv1x3(branch4_tmp)], dim=1)
outputs = [branch1, branch2, branch3, branch4]
return torch.cat(outputs, 1)
class Reduction_A(nn.Module):
def __init__(self, in_channels, k, l, m, n):
super(Reduction_A, self).__init__()
# 分支1:MaxPool
self.branch1 = nn.MaxPool2d(kernel_size=3, stride=2)
# 分支2:(3x3)
self.branch2 = ConvBNReLU(in_channels, n, kernel_size=3, stride=2)
# 分支3:(1x1->3x3->3x3)
self.branch3 = nn.Sequential(
ConvBNReLU(in_channels, k, kernel_size=1),
ConvBNReLU(k, l, kernel_size=3, stride=1, padding=1),
ConvBNReLU(l, m, kernel_size=3, stride=2)
)
def forward(self, x):
branch1 = self.branch1(x)
branch2 = self.branch2(x)
branch3 = self.branch3(x)
outputs = [branch1, branch2, branch3]
return torch.cat(outputs, 1)
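Reduction_A keeps the paper's (k, l, m, n) parameterisation: the pooling branch preserves the input channels, so the output has in_channels + n + m channels while the spatial size is halved. With the Inception-v4 values used further down (k=192, l=224, m=256, n=384) this gives 384 + 384 + 256 = 1024 channels; a quick check (a sketch, assuming the classes above):

import torch

red = Reduction_A(384, k=192, l=224, m=256, n=384)
x = torch.randn(1, 384, 35, 35)
print(red(x).shape)  # expected: torch.Size([1, 1024, 17, 17])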
class Reduction_B(nn.Module):
def __init__(self, in_channels, out_channels_2_1x1, out_channels_2_3x3,
out_channels_3_1x1, out_channels_3_1x7, out_channels_3_7x1, out_channels_3_3x3):
super(Reduction_B, self).__init__()
# 分支1:MaxPool
self.branch1 = nn.MaxPool2d(kernel_size=3, stride=2)
# 分支2:(3x3)
self.branch2 = nn.Sequential(
ConvBNReLU(in_channels, out_channels_2_1x1, kernel_size=1),
ConvBNReLU(out_channels_2_1x1, out_channels_2_3x3, kernel_size=3, stride=2)
)
# 分支3:(1x1->1x7->7x1)
self.branch3 = nn.Sequential(
ConvBNReLU(in_channels, out_channels_3_1x1, kernel_size=1),
ConvBNReLU(out_channels_3_1x1, out_channels_3_1x7, kernel_size=[1,7], padding=[0,3]),
ConvBNReLU(out_channels_3_1x7, out_channels_3_7x1, kernel_size=[7,1], padding=[3,0]),
ConvBNReLU(out_channels_3_7x1, out_channels_3_3x3, kernel_size=3, stride=2)
)
def forward(self, x):
branch1 = self.branch1(x)
branch2 = self.branch2(x)
branch3 = self.branch3(x)
outputs = [branch1, branch2, branch3]
return torch.cat(outputs, 1)
class GoogLeNetV4(nn.Module):
"""
implementation of Inception-v4
"""
def __init__(self, num_classes, init_weights=False):
super(GoogLeNetV4, self).__init__()
# 整体主干网络
self.stem = Stem()
self.inception_A = self.__make_inception_A()
self.Reduction_A = self.__make_reduction_A()
self.inception_B = self.__make_inception_B()
self.Reduction_B = self.__make_reduction_B()
self.inception_C = self.__make_inception_C()
# 输出部分:平均池化->全连接层
self.avgpool = nn.AvgPool2d(kernel_size=8, stride=1)
self.dropout = nn.Dropout(0.2)
self.fc = nn.Linear(1536, num_classes)
if init_weights:
self._initialize_weights()
# 制造4层Inception-A
def __make_inception_A(self):
layers = []
for _ in range(4):
layers.append(InceptionV4_A(384, 96, 96, 64, 96, 64, 96)) # 384
return nn.Sequential(*layers)
# 制造1层Reduction-A
def __make_reduction_A(self):
return Reduction_A(384, 192, 224, 256, 384) # 1024
# 制造7层Inception-B
def __make_inception_B(self):
layers = []
for _ in range(7):
layers.append(InceptionV4_B(1024, 128, 384, 192, 224, 256,
192, 192, 224, 224, 256)) # 1024
return nn.Sequential(*layers)
# 制造1层Reduction-B
def __make_reduction_B(self):
return Reduction_B(1024, 192, 192, 256, 256, 320, 320) # 1536
# 制造3层Inception-C
def __make_inception_C(self):
layers = []
for _ in range(3):
layers.append(InceptionV4_C(1536, 256, 256, 384, 256, 384, 448, 512, 256, 256)) # 1536
return nn.Sequential(*layers)
def forward(self, x):
out = self.stem(x)
out = self.inception_A(out)
out = self.Reduction_A(out)
out = self.inception_B(out)
out = self.Reduction_B(out)
out = self.inception_C(out)
out = self.avgpool(out)
out = torch.flatten(out, 1)
out = self.dropout(out)
out = self.fc(out)
return out
def _initialize_weights(self):
for m in self.modules():
if isinstance(m, nn.Conv2d):
nn.init.kaiming_normal_(m.weight, mode='fan_out', nonlinearity='relu')
if m.bias is not None:
nn.init.constant_(m.bias, 0)
elif isinstance(m, nn.Linear):
nn.init.normal_(m.weight, 0, 0.01)
nn.init.constant_(m.bias, 0)
# A quick way to check the layer-by-layer dimensions of the model:
# input = torch.ones((1,3,299,299))
# print(input.shape)
# model = GoogLeNetV4(num_classes=2, init_weights=True)
# output = model(input)
# print(output.shape)
# create the model and move it to the GPU if available
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = GoogLeNetV4(num_classes=2, init_weights=True)
model.to(device)
# A second way to inspect the model: print a layer-by-layer summary
summary(model, (3, 299, 299))
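The summary() call above presumably comes from the torchsummary package; the import is not shown in this excerpt, so this is an assumption:

# assumed import for summary(model, (3, 299, 299)) above
from torchsummary import summary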
# define the loss function and optimizer
criterion = nn.CrossEntropyLoss()
optimizer = optim.Adam(model.parameters(), lr=0.0001)
# 训练次数
epoch = 10
# 绘图所用
plt_epoch = [] # 横坐标,训练次数
Train_Loss = [] # 训练损失
Train_Accuracy = [] # 训练精度
Test_Loss = [] # 测试损失
Test_Accuracy = [] # 测试精度
def train_runner(model, epoch):
# training mode: enables BatchNorm updates and Dropout
model.train()
total = 0 # total number of samples in this epoch
correct = 0.0 # number of correctly classified samples in this epoch
epoch_avg_loss = 0.0 # average loss of this epoch
batch_avg_loss = 0.0 # running loss over the last 100 batches (reset after each printout)
# enumerate over the DataLoader to get the batch index together with the data
for batch_idx, data in enumerate(trainloader, 0):
inputs, labels = data # unpack the batch
inputs, labels = inputs.to(device), labels.to(device) # move the data to the device
optimizer.zero_grad() # 梯度清零
outputs = model(inputs) # 保存训练结果
loss = criterion(outputs, labels) # 计算损失和
#dim=1表示返回每一行的最大值对应的列下标
predict = outputs.argmax(dim=1) #获取最大概率的预测结果
total += labels.size(0) # 总样本数
correct += (predict == labels).sum().item() # 统计正确分类样本个数
epoch_avg_loss += loss.item() # 把每轮epoch的损失累加
batch_avg_loss += loss.item() # 累加每100个batch的损失
loss.backward() # 反向传播
optimizer.step() # 更新参数
# 每100个batch进行一次loss输出
if batch_idx % 100 == 99:
print('[epoch:%d, batch_idx:%5d] batch_avg_loss: %.6f' % (epoch, batch_idx+1, batch_avg_loss/100))
batch_avg_loss = 0.0
# the training batch size is 64, so round up to get the number of batches
batch_num = math.ceil(total/64)
# 每完成一次训练epoch,打印当前平均Loss和精度
epoch_avg_loss /= batch_num
print("Train Epoch{} \t epoch_avg_loss: {:.6f}, accuracy: {:.6f}%".format(epoch, epoch_avg_loss, 100*(correct/total)))
# 加入列表,以便于绘图
Train_Loss.append(epoch_avg_loss)
Train_Accuracy.append(correct/total)
def test_runner(model):
#模型验证, 必须要写, 否则只要有输入数据, 即使不训练, 它也会改变权值
#因为调用eval()将不启用 BatchNormalization 和 Dropout, BatchNormalization和Dropout置为False
model.eval()
#统计模型正确率, 设置初始值
correct = 0.0
test_loss = 0.0
total = 0
#torch.no_grad将不会计算梯度, 也不会进行反向传播
with torch.no_grad():
for data, label in testloader:
data, label = data.to(device), label.to(device)
output = model(data)
test_loss += criterion(output, label).item()
predict = output.argmax(dim=1)
#计算正确数量
total += label.size(0)
correct += (predict == label).sum().item()
# average the accumulated loss over the number of test samples
test_loss /= total
# print loss and accuracy
print("test_average_loss: {:.6f}, accuracy: {:.6f}%".format(test_loss, 100*(correct/total)))
# 加入列表,以便于绘图
Test_Loss.append(test_loss)
Test_Accuracy.append(correct/total)
if __name__ == '__main__':
print("start_time",time.strftime('%Y-%m-%d %H:%M:%S',time.localtime(time.time())))
for epoch in range(1, epoch+1):
plt_epoch.append(epoch)
train_runner(model, epoch)
test_runner(model)
print("end_time: ",time.strftime('%Y-%m-%d %H:%M:%S',time.localtime(time.time())),'\n')
print('Finished Training')
plt.subplot(2,2,1), plt.plot(plt_epoch, Train_Loss), plt.title('Train_Loss'), plt.grid()
plt.subplot(2,2,2), plt.plot(plt_epoch, Train_Accuracy), plt.title('Train_Accuracy'), plt.grid()
plt.subplot(2,2,3), plt.plot(plt_epoch, Test_Loss), plt.title('Test_Loss'), plt.grid()
plt.subplot(2,2,4), plt.plot(plt_epoch, Test_Accuracy), plt.title('Test_Accuracy'), plt.grid()
plt.tight_layout()
plt.show()
print(model)
pathfile = './models/'
save_filename = 'GoogLeNetV4-catvsdog.pth'
model_path = os.path.join(pathfile, save_filename)
torch.save(model, model_path) #保存模型
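torch.save(model, ...) pickles the whole module object, so loading it later requires the exact same class definitions to be importable. An alternative worth considering (only a sketch; the "-state" filename is made up here) is to save just the weights:

# optional: save only the parameters and rebuild the model before loading
torch.save(model.state_dict(), os.path.join(pathfile, 'GoogLeNetV4-catvsdog-state.pth'))
# later:
# model = GoogLeNetV4(num_classes=2)
# model.load_state_dict(torch.load(os.path.join(pathfile, 'GoogLeNetV4-catvsdog-state.pth')))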
if __name__ == '__main__':
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
pathfile = './models/'
save_filename = 'GoogLeNetV4-catvsdog.pth'
model_path = os.path.join(pathfile, save_filename)
model = torch.load(model_path) #加载模型
model = model.to(device)
model.eval() #把模型转为test模式
# load the image to be classified
img = Image.open("./pic/test_cat.jpg") # 读取图像
#img.show()
plt.imshow(img) # 显示图片
plt.axis('off') # 不显示坐标轴
plt.show()
# preprocess the image; after the unsqueeze below it becomes [1, 3, 299, 299]
trans = transforms.Compose(
[
transforms.Resize((299,299)),
transforms.ToTensor(),
transforms.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5))
])
img = trans(img)
img = img.to(device)
img = img.unsqueeze(0) # add a batch dimension: the saved model expects 4D input [batch_size, channels, height, width], while a single image is 3D [channels, height, width]
# prediction
classes = ('cat', 'dog')
output = model(img)
prob = F.softmax(output, dim=1) # probabilities of the two classes
print("probabilities:", prob)
value, predicted = torch.max(output.data, 1)
pred_class = classes[predicted.item()]
print("predicted class:", pred_class)
if __name__ == '__main__':
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
pathfile = './models/'
save_filename = 'GoogLeNetV4-catvsdog.pth'
model_path = os.path.join(pathfile, save_filename)
model = torch.load(model_path) #加载模型
model = model.to(device)
#定义优化器
criterion = nn.CrossEntropyLoss()
#模型验证, 必须要写, 否则只要有输入数据, 即使不训练, 它也会改变权值
#因为调用eval()将不启用 BatchNormalization 和 Dropout, BatchNormalization和Dropout置为False
model.eval()
#统计模型正确率, 设置初始值
correct = 0.0
test_loss = 0.0
total = 0
#torch.no_grad将不会计算梯度, 也不会进行反向传播
with torch.no_grad():
for data, label in testloader:
data, label = data.to(device), label.to(device)
output = model(data)
test_loss += criterion(output, label).item()
predict = output.argmax(dim=1)
#计算正确数量
total += label.size(0)
correct += (predict == label).sum().item()
# average the accumulated loss over the number of test samples
test_loss /= total
# print loss and accuracy
print("test_average_loss: {:.6f}, accuracy: {:.6f}%".format(test_loss, 100*(correct/total)))
class BasicConv2d(nn.Module):
def __init__(self, in_channels, out_channels, **kwargs):
super(BasicConv2d, self).__init__()
self.conv = nn.Conv2d(in_channels, out_channels, **kwargs)
self.relu = nn.ReLU6(inplace=True)
def forward(self, x):
x = self.conv(x)
x = self.relu(x)
return x
class ConvBNReLU(nn.Module):
def __init__(self, in_channels, out_channels, **kwargs):
super(ConvBNReLU, self).__init__()
self.conv = nn.Conv2d(in_channels, out_channels, **kwargs)
self.bn = nn.BatchNorm2d(out_channels)
self.relu = nn.ReLU6(inplace=True)
def forward(self, x):
x = self.conv(x)
x = self.bn(x)
x = self.relu(x)
return x
class Stem(nn.Module):
def __init__(self):
super(Stem, self).__init__()
self.stem = nn.Sequential(
ConvBNReLU(in_channels=3, out_channels=32, kernel_size=3, stride=2),
ConvBNReLU(in_channels=32, out_channels=32, kernel_size=3, stride=1),
ConvBNReLU(in_channels=32, out_channels=64, kernel_size=3, stride=1, padding=1),
nn.MaxPool2d(kernel_size=3, stride=2),
ConvBNReLU(in_channels=64, out_channels=80, kernel_size=1, stride=1),
ConvBNReLU(in_channels=80, out_channels=192, kernel_size=3, stride=1),
ConvBNReLU(in_channels=192, out_channels=256, kernel_size=3, stride=2),
)
def forward(self, x):
return self.stem(x)
class Inception_A(nn.Module):
def __init__(self, in_channels, b1, b2_1x1, b2_3x3, b3_1x1, b3_3x3_1, b3_3x3_2, n1_linear):
super(Inception_A, self).__init__()
# 分支1:
self.branch1 = ConvBNReLU(in_channels, b1, kernel_size=1, stride=1)
# 分支2:1x1 -> 3x3
self.branch2 = nn.Sequential(
ConvBNReLU(in_channels, b2_1x1, kernel_size=1, stride=1),
ConvBNReLU(b2_1x1, b2_3x3, kernel_size=3, stride=1, padding=1)
)
# 分支3:1x1 -> 3x3 -> 3x3
self.branch3 = nn.Sequential(
ConvBNReLU(in_channels, b3_1x1, kernel_size=1, stride=1),
ConvBNReLU(b3_1x1, b3_3x3_1, kernel_size=3, stride=1, padding=1),
ConvBNReLU(b3_3x3_1, b3_3x3_2, kernel_size=3, stride=1, padding=1)
)
# 1x1Conv
self.conv_linear = nn.Conv2d(b1+b2_3x3+b3_3x3_2, n1_linear, 1, 1, 0, bias=True)
"""
因为这里需要将原始输入通过直连边连接到输出部分,所以需要判断in_channels和n1_linear的关系
"""
# 如果in_channels==n1_linear,则不进行short_cut
self.short_cut = nn.Sequential()
# 如果in_channels!=n1_linear,则进行short_cut,把原始输入维度转为n1_linear
if in_channels != n1_linear:
self.short_cut = nn.Sequential(
nn.Conv2d(in_channels, n1_linear, 1, 1, 0, bias=False),
nn.BatchNorm2d(n1_linear)
)
self.relu = nn.ReLU(inplace=True)
def forward(self, x):
out1 = self.branch1(x)
out2 = self.branch2(x)
out3 = self.branch3(x)
out = torch.cat((out1, out2, out3), 1)
out = self.conv_linear(out)
# 残差连接
out += self.short_cut(x)
out = self.relu(out)
return out
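The Inception-ResNet paper scales the residual branch by a small constant (roughly 0.1 to 0.3) before the addition to stabilise training of very wide residual blocks; the code above adds it unscaled, which also works for a small 2-class task. A hypothetical scaled variant (ScaledInceptionA and its scale argument are not part of the original code) could look like this:

import torch

class ScaledInceptionA(Inception_A):
    """Inception_A with residual scaling, as suggested in the Inception-ResNet paper."""
    def __init__(self, *args, scale=0.17, **kwargs):
        super().__init__(*args, **kwargs)
        self.scale = scale
    def forward(self, x):
        out = torch.cat((self.branch1(x), self.branch2(x), self.branch3(x)), 1)
        out = self.conv_linear(out)
        return self.relu(self.short_cut(x) + self.scale * out)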
class Inception_B(nn.Module):
def __init__(self, in_channels, b1, b2_1x1, b2_1x7, b2_7x1, n1_linear):
super(Inception_B, self).__init__()
# 分支1:
self.branch1 = ConvBNReLU(in_channels, b1, kernel_size=1, stride=1)
# 分支2:
self.branch2 = nn.Sequential(
ConvBNReLU(in_channels, b2_1x1, kernel_size=1, stride=1),
ConvBNReLU(b2_1x1, b2_1x7, kernel_size=[1,7], padding=[0,3]),
ConvBNReLU(b2_1x7, b2_7x1, kernel_size=[7,1], padding=[3,0])
)
# 1x1Conv
self.conv_linear = nn.Conv2d(b1 + b2_7x1, n1_linear, 1, 1, 0, bias=False)
self.short_cut = nn.Sequential()
if in_channels != n1_linear:
self.short_cut = nn.Sequential(
nn.Conv2d(in_channels, n1_linear, 1, 1, 0, bias=False),
nn.BatchNorm2d(n1_linear)
)
self.relu = nn.ReLU(inplace=True)
def forward(self, x):
out1 = self.branch1(x)
out2 = self.branch2(x)
out = torch.cat((out1, out2), 1)
out = self.conv_linear(out)
# 残差连接
out += self.short_cut(x)
out = self.relu(out)
return out
class Inception_C(nn.Module):
def __init__(self, in_channels, b1, b2_1x1, b2_1x3, b2_3x1, n1_linear):
super(Inception_C, self).__init__()
# 分支1:
self.branch1 = ConvBNReLU(in_channels, b1, kernel_size=1, stride=1)
# 分支2:
self.branch2 = nn.Sequential(
ConvBNReLU(in_channels, b2_1x1, kernel_size=1, stride=1),
ConvBNReLU(b2_1x1, b2_1x3, kernel_size=[1,3], padding=[0,1]),
ConvBNReLU(b2_1x3, b2_3x1, kernel_size=[3,1], padding=[1,0])
)
# 1x1Conv
self.conv_linear = nn.Conv2d(b1 + b2_3x1, n1_linear, 1, 1, 0, bias=False)
self.short_cut = nn.Sequential()
if in_channels != n1_linear:
self.short_cut = nn.Sequential(
nn.Conv2d(in_channels, n1_linear, 1, 1, 0, bias=False),
nn.BatchNorm2d(n1_linear)
)
self.relu = nn.ReLU(inplace=True)
def forward(self, x):
out1 = self.branch1(x)
out2 = self.branch2(x)
out = torch.cat((out1, out2), 1)
out = self.conv_linear(out)
# 残差连接
out += self.short_cut(x)
out = self.relu(out)
return out
class Reduction_A(nn.Module):
def __init__(self, in_channels, k, l, m, n):
super(Reduction_A, self).__init__()
# 分支1:MaxPool
self.branch1 = nn.MaxPool2d(kernel_size=3, stride=2)
# 分支2:(3x3)
self.branch2 = ConvBNReLU(in_channels, n, kernel_size=3, stride=2)
# 分支3:(1x1->3x3->3x3)
self.branch3 = nn.Sequential(
ConvBNReLU(in_channels, k, kernel_size=1),
ConvBNReLU(k, l, kernel_size=3, stride=1, padding=1),
ConvBNReLU(l, m, kernel_size=3, stride=2)
)
def forward(self, x):
branch1 = self.branch1(x)
branch2 = self.branch2(x)
branch3 = self.branch3(x)
outputs = [branch1, branch2, branch3]
return torch.cat(outputs, 1)
class Reduction_B(nn.Module):
def __init__(self, in_channels, b2_1x1, b2_3x3, b3_1x1, b3_3x3, b4_1x1, b4_3x3_1, b4_3x3_2):
super(Reduction_B, self).__init__()
# 分支1:
self.branch1 = nn.MaxPool2d(kernel_size=3, stride=2)
# 分支2:
self.branch2 = nn.Sequential(
ConvBNReLU(in_channels, b2_1x1, kernel_size=1, stride=1),
ConvBNReLU(b2_1x1, b2_3x3, kernel_size=3, stride=2)
)
# 分支3:
self.branch3 = nn.Sequential(
ConvBNReLU(in_channels, b3_1x1, kernel_size=1, stride=1),
ConvBNReLU(b3_1x1, b3_3x3, kernel_size=3, stride=2)
)
# 分支4:
self.branch4 = nn.Sequential(
ConvBNReLU(in_channels, b4_1x1, kernel_size=1, stride=1),
ConvBNReLU(b4_1x1, b4_3x3_1, kernel_size=3, stride=1, padding=1),
ConvBNReLU(b4_3x3_1, b4_3x3_2, kernel_size=3, stride=2),
)
def forward(self, x):
out1 = self.branch1(x)
out2 = self.branch2(x)
out3 = self.branch3(x)
out4 = self.branch4(x)
return torch.cat((out1, out2, out3, out4), 1)
class GoogLeNet_ResNetV1(nn.Module):
def __init__(self, num_classes, init_weights=False):
super(GoogLeNet_ResNetV1, self).__init__()
# 整体主干网络
self.stem = Stem()
self.inception_A = self.__make_inception_A()
self.Reduction_A = self.__make_reduction_A()
self.inception_B = self.__make_inception_B()
self.Reduction_B = self.__make_reduction_B()
self.inception_C = self.__make_inception_C()
# 输出部分:平均池化->全连接层
self.avgpool = nn.AvgPool2d(kernel_size=8, stride=1)
self.dropout = nn.Dropout(0.2)
self.fc = nn.Linear(1792, num_classes)
if init_weights:
self._initialize_weights()
# 制造5层Inception-A
def __make_inception_A(self):
layers = []
for _ in range(5):
layers.append(Inception_A(256, 32, 32, 32, 32, 32, 32, 256))
return nn.Sequential(*layers)
# 制造1层Reduction-A
def __make_reduction_A(self):
return Reduction_A(256, 192, 192, 256, 384)
# 制造10层Inception-B
def __make_inception_B(self):
layers = []
for _ in range(10):
layers.append(Inception_B(896, 128, 128, 128, 128, 896))
return nn.Sequential(*layers)
# 制造1层Reduction-B
def __make_reduction_B(self):
return Reduction_B(896, 256, 384, 256, 256, 256, 256, 256)
# 制造5层Inception-C
def __make_inception_C(self):
layers = []
for _ in range(5):
layers.append(Inception_C(1792, 192, 192, 192, 192, 1792))
return nn.Sequential(*layers)
def forward(self, x):
out = self.stem(x)
out = self.inception_A(out)
out = self.Reduction_A(out)
out = self.inception_B(out)
out = self.Reduction_B(out)
out = self.inception_C(out)
out = self.avgpool(out)
out = torch.flatten(out, 1)
out = self.dropout(out)
out = self.fc(out)
return out
def _initialize_weights(self):
for m in self.modules():
if isinstance(m, nn.Conv2d):
nn.init.kaiming_normal_(m.weight, mode='fan_out', nonlinearity='relu')
if m.bias is not None:
nn.init.constant_(m.bias, 0)
elif isinstance(m, nn.Linear):
nn.init.normal_(m.weight, 0, 0.01)
nn.init.constant_(m.bias, 0)
# A quick way to check the layer-by-layer dimensions of the model:
# input = torch.ones((1,3,299,299))
# print(input.shape)
# model = GoogLeNet_ResNetV1(num_classes=2, init_weights=True)
# output = model(input)
# print(output.shape)
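A runnable version of that check for Inception-ResNet-v1 (a sketch, assuming the classes defined in this section):

import torch

model = GoogLeNet_ResNetV1(num_classes=2, init_weights=True)
x = torch.randn(1, 3, 299, 299)
with torch.no_grad():
    print(model(x).shape)  # expected: torch.Size([1, 2])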
class BasicConv2d(nn.Module):
def __init__(self, in_channels, out_channels, **kwargs):
super(BasicConv2d, self).__init__()
self.conv = nn.Conv2d(in_channels, out_channels, **kwargs)
self.relu = nn.ReLU6(inplace=True)
def forward(self, x):
x = self.conv(x)
x = self.relu(x)
return x
class ConvBNReLU(nn.Module):
def __init__(self, in_channels, out_channels, **kwargs):
super(ConvBNReLU, self).__init__()
self.conv = nn.Conv2d(in_channels, out_channels, **kwargs)
self.bn = nn.BatchNorm2d(out_channels)
self.relu = nn.ReLU6(inplace=True)
def forward(self, x):
x = self.conv(x)
x = self.bn(x)
x = self.relu(x)
return x
class Stem(nn.Module):
def __init__(self):
super(Stem, self).__init__()
# 连续3个3x3的卷积核
self.step1 = nn.Sequential(
# 299x299x3 -> 149x149x32
ConvBNReLU(in_channels=3, out_channels=32, kernel_size=3, stride=2),
# 149x149x32 -> 147x147x32
ConvBNReLU(in_channels=32, out_channels=32, kernel_size=3, stride=1),
# 147x147x32 -> 147x147x64
ConvBNReLU(in_channels=32, out_channels=64, kernel_size=3, stride=1, padding=1),
)
# branch 1: 147x147x64 -> 73x73x64
self.step2_pool = nn.MaxPool2d(kernel_size=3, stride=2)
# branch 2: 147x147x64 -> 73x73x96
self.step2_conv = ConvBNReLU(in_channels=64, out_channels=96, kernel_size=3, stride=2)
# 分支1:1x1+3x3
self.step3_1 = nn.Sequential(
ConvBNReLU(in_channels=160, out_channels=64, kernel_size=1, stride=1),
ConvBNReLU(in_channels=64, out_channels=96, kernel_size=3, stride=1)
)
# 分支2:1x1+7x1+1x7+3x3
self.step3_2 = nn.Sequential(
ConvBNReLU(in_channels=160, out_channels=64, kernel_size=1, stride=1),
ConvBNReLU(in_channels=64, out_channels=64, kernel_size=[7,1], padding=[3,0]),
ConvBNReLU(in_channels=64, out_channels=64, kernel_size=[1,7], padding=[0,3]),
ConvBNReLU(in_channels=64, out_channels=96, kernel_size=3, stride=1)
)
# 分支1:池化
self.step4_pool = nn.MaxPool2d(kernel_size=3, stride=2)
# 分支2:3x3
self.step4_conv = ConvBNReLU(in_channels=192, out_channels=192, kernel_size=3, stride=2)
def forward(self, x):
out = self.step1(x)
tmp1 = self.step2_pool(out)
tmp2 = self.step2_conv(out)
out = torch.cat((tmp1, tmp2), 1)
tmp1 = self.step3_1(out)
tmp2 = self.step3_2(out)
out = torch.cat((tmp1, tmp2), 1)
tmp1 = self.step4_pool(out)
tmp2 = self.step4_conv(out)
outputs = [tmp1, tmp2]
return torch.cat(outputs, 1)
class Inception_A(nn.Module):
def __init__(self, in_channels, b1, b2_1x1, b2_3x3, b3_1x1, b3_3x3_1, b3_3x3_2, n1_linear):
super(Inception_A, self).__init__()
# 分支1:
self.branch1 = ConvBNReLU(in_channels, b1, kernel_size=1, stride=1)
# 分支2:1x1 -> 3x3
self.branch2 = nn.Sequential(
ConvBNReLU(in_channels, b2_1x1, kernel_size=1, stride=1),
ConvBNReLU(b2_1x1, b2_3x3, kernel_size=3, stride=1, padding=1)
)
# 分支3:1x1 -> 3x3 -> 3x3
self.branch3 = nn.Sequential(
ConvBNReLU(in_channels, b3_1x1, kernel_size=1, stride=1),
ConvBNReLU(b3_1x1, b3_3x3_1, kernel_size=3, stride=1, padding=1),
ConvBNReLU(b3_3x3_1, b3_3x3_2, kernel_size=3, stride=1, padding=1)
)
# 1x1Conv
self.conv_linear = nn.Conv2d(b1+b2_3x3+b3_3x3_2, n1_linear, 1, 1, 0, bias=True)
"""
因为这里需要将原始输入通过直连边连接到输出部分,所以需要判断in_channels和n1_linear的关系
"""
# 如果in_channels==n1_linear,则不进行short_cut
self.short_cut = nn.Sequential()
# 如果in_channels!=n1_linear,则进行short_cut,把原始输入维度转为n1_linear
if in_channels != n1_linear:
self.short_cut = nn.Sequential(
nn.Conv2d(in_channels, n1_linear, 1, 1, 0, bias=False),
nn.BatchNorm2d(n1_linear)
)
self.relu = nn.ReLU(inplace=True)
def forward(self, x):
out1 = self.branch1(x)
out2 = self.branch2(x)
out3 = self.branch3(x)
out = torch.cat((out1, out2, out3), 1)
out = self.conv_linear(out)
# 残差连接
out += self.short_cut(x)
out = self.relu(out)
return out
class Inception_B(nn.Module):
def __init__(self, in_channels, b1, b2_1x1, b2_1x7, b2_7x1, n1_linear):
super(Inception_B, self).__init__()
# 分支1:
self.branch1 = ConvBNReLU(in_channels, b1, kernel_size=1, stride=1)
# 分支2:
self.branch2 = nn.Sequential(
ConvBNReLU(in_channels, b2_1x1, kernel_size=1, stride=1),
ConvBNReLU(b2_1x1, b2_1x7, kernel_size=[1,7], padding=[0,3]),
ConvBNReLU(b2_1x7, b2_7x1, kernel_size=[7,1], padding=[3,0])
)
# 1x1Conv
self.conv_linear = nn.Conv2d(b1 + b2_7x1, n1_linear, 1, 1, 0, bias=False)
self.short_cut = nn.Sequential()
if in_channels != n1_linear:
self.short_cut = nn.Sequential(
nn.Conv2d(in_channels, n1_linear, 1, 1, 0, bias=False),
nn.BatchNorm2d(n1_linear)
)
self.relu = nn.ReLU(inplace=True)
def forward(self, x):
out1 = self.branch1(x)
out2 = self.branch2(x)
out = torch.cat((out1, out2), 1)
out = self.conv_linear(out)
# 残差连接
out += self.short_cut(x)
out = self.relu(out)
return out
class Inception_C(nn.Module):
def __init__(self, in_channels, b1, b2_1x1, b2_1x3, b2_3x1, n1_linear):
super(Inception_C, self).__init__()
# 分支1:
self.branch1 = ConvBNReLU(in_channels, b1, kernel_size=1, stride=1)
# 分支2:
self.branch2 = nn.Sequential(
ConvBNReLU(in_channels, b2_1x1, kernel_size=1, stride=1),
ConvBNReLU(b2_1x1, b2_1x3, kernel_size=[1,3], padding=[0,1]),
ConvBNReLU(b2_1x3, b2_3x1, kernel_size=[3,1], padding=[1,0])
)
# 1x1Conv
self.conv_linear = nn.Conv2d(b1 + b2_3x1, n1_linear, 1, 1, 0, bias=False)
self.short_cut = nn.Sequential()
if in_channels != n1_linear:
self.short_cut = nn.Sequential(
nn.Conv2d(in_channels, n1_linear, 1, 1, 0, bias=False),
nn.BatchNorm2d(n1_linear)
)
self.relu = nn.ReLU(inplace=True)
def forward(self, x):
out1 = self.branch1(x)
out2 = self.branch2(x)
out = torch.cat((out1, out2), 1)
out = self.conv_linear(out)
# 残差连接
out += self.short_cut(x)
out = self.relu(out)
return out
class Reduction_A(nn.Module):
def __init__(self, in_channels, k, l, m, n):
super(Reduction_A, self).__init__()
# 分支1:MaxPool
self.branch1 = nn.MaxPool2d(kernel_size=3, stride=2)
# 分支2:(3x3)
self.branch2 = ConvBNReLU(in_channels, n, kernel_size=3, stride=2)
# 分支3:(1x1->3x3->3x3)
self.branch3 = nn.Sequential(
ConvBNReLU(in_channels, k, kernel_size=1),
ConvBNReLU(k, l, kernel_size=3, stride=1, padding=1),
ConvBNReLU(l, m, kernel_size=3, stride=2)
)
def forward(self, x):
branch1 = self.branch1(x)
branch2 = self.branch2(x)
branch3 = self.branch3(x)
outputs = [branch1, branch2, branch3]
return torch.cat(outputs, 1)
class Reduction_B(nn.Module):
def __init__(self, in_channels, b2_1x1, b2_3x3, b3_1x1, b3_3x3, b4_1x1, b4_3x3_1, b4_3x3_2):
super(Reduction_B, self).__init__()
# 分支1:
self.branch1 = nn.MaxPool2d(kernel_size=3, stride=2)
# 分支2:
self.branch2 = nn.Sequential(
ConvBNReLU(in_channels, b2_1x1, kernel_size=1, stride=1),
ConvBNReLU(b2_1x1, b2_3x3, kernel_size=3, stride=2)
)
# 分支3:
self.branch3 = nn.Sequential(
ConvBNReLU(in_channels, b3_1x1, kernel_size=1, stride=1),
ConvBNReLU(b3_1x1, b3_3x3, kernel_size=3, stride=2)
)
# 分支4:
self.branch4 = nn.Sequential(
ConvBNReLU(in_channels, b4_1x1, kernel_size=1, stride=1),
ConvBNReLU(b4_1x1, b4_3x3_1, kernel_size=3, stride=1, padding=1),
ConvBNReLU(b4_3x3_1, b4_3x3_2, kernel_size=3, stride=2),
)
def forward(self, x):
out1 = self.branch1(x)
out2 = self.branch2(x)
out3 = self.branch3(x)
out4 = self.branch4(x)
return torch.cat((out1, out2, out3, out4), 1)
class GoogLeNet_ResNetV2(nn.Module):
def __init__(self, num_classes, init_weights=False):
super(GoogLeNet_ResNetV2, self).__init__()
# 整体主干网络
self.stem = Stem()
self.inception_A = self.__make_inception_A()
self.Reduction_A = self.__make_reduction_A()
self.inception_B = self.__make_inception_B()
self.Reduction_B = self.__make_reduction_B()
self.inception_C = self.__make_inception_C()
# 输出部分:平均池化->全连接层
self.avgpool = nn.AvgPool2d(kernel_size=8, stride=1)
self.dropout = nn.Dropout(0.2)
self.fc = nn.Linear(2144, num_classes)
if init_weights:
self._initialize_weights()
# 制造5层Inception-A
def __make_inception_A(self):
layers = []
for _ in range(5):
layers.append(Inception_A(384, 32, 32, 32, 32, 48, 64, 384))
return nn.Sequential(*layers)
# 制造1层Reduction-A
def __make_reduction_A(self):
return Reduction_A(384, 256, 256, 384, 384)
# 制造10层Inception-B
def __make_inception_B(self):
layers = []
for _ in range(10):
layers.append(Inception_B(1152, 192, 128, 160, 192, 1152))
return nn.Sequential(*layers)
# 制造1层Reduction-B
def __make_reduction_B(self):
return Reduction_B(1152, 256, 384, 256, 288, 256, 288, 320)
# 制造5层Inception-C
def __make_inception_C(self):
layers = []
for _ in range(5):
layers.append(Inception_C(2144, 192, 192, 224, 256, 2144))
return nn.Sequential(*layers)
def forward(self, x):
out = self.stem(x)
out = self.inception_A(out)
out = self.Reduction_A(out)
out = self.inception_B(out)
out = self.Reduction_B(out)
out = self.inception_C(out)
out = self.avgpool(out)
out = torch.flatten(out, 1)
out = self.dropout(out)
out = self.fc(out)
return out
def _initialize_weights(self):
for m in self.modules():
if isinstance(m, nn.Conv2d):
nn.init.kaiming_normal_(m.weight, mode='fan_out', nonlinearity='relu')
if m.bias is not None:
nn.init.constant_(m.bias, 0)
elif isinstance(m, nn.Linear):
nn.init.normal_(m.weight, 0, 0.01)
nn.init.constant_(m.bias, 0)
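As with the other networks, a quick forward-pass check for Inception-ResNet-v2 (a sketch, assuming the classes defined in this section):

import torch

model = GoogLeNet_ResNetV2(num_classes=2, init_weights=True)
x = torch.randn(1, 3, 299, 299)
with torch.no_grad():
    print(model(x).shape)  # expected: torch.Size([1, 2])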