灵隐寺扫地僧

[CS231n Assignment 2 #05 ] 深度学习框架——Pytorch

作业主页：Assignment 2
官方示例代码： Assignment 2 code
作业源文件 PyTorch.ipynb
作业内容：
这个作业有5个部分。您将在不同的抽象级别上学习PyTorch，这将帮助您更好地理解它，并为最终项目做好准备。
1. Preparation: we will use CIFAR-10 dataset.
2. Barebones PyTorch: we will work directly with the lowest-level PyTorch Tensors.
3. PyTorch Module API: we will use nn.Module to define arbitrary neural network architecture.
4. PyTorch Sequential API: we will use nn.Sequential to define a linear feed-forward network very conveniently.
5.CIFAR-10 open-ended challenge: please implement your own network to get as high accuracy as possible on CIFAR-10. You can experiment with any layer, optimizer, hyperparameters or other advanced features.

API	灵活性	便捷性
Barebone	高	低
nn.Module	高	中
nn.Sequential	低	高

文章目录

0. Pytorch

0.1 简介
0.2 入门

1. 准备
2. 最基础的Pytorch API（Barebones PyTorch）

2.1 PyTorch Tensors: Flatten Function
2.2 Barebones PyTorch: Two-Layer Network
2.3 Barebones PyTorch: Three-Layer ConvNet
2.4 Barebones PyTorch: Initialization
2.5 Barebones PyTorch: Check Accuracy
2.6 BareBones PyTorch: Training Loop
2.7 BareBones PyTorch: Train a Two-Layer Network
2.8 BareBones PyTorch: Training a ConvNet

3.PyTorch Module API

3.1 Module API: Two-Layer Network
3.2 Module API: Three-Layer ConvNet
3.3 Module API: Check Accuracy
3.4 Module API: Training Loop
3.5 Module API: Train a Two-Layer Network
3.6 Module API: Train a Three-Layer ConvNet

4.PyTorch Sequential API

4.1 Sequential API: Two-Layer Network
4.2 Sequential API: Three-Layer ConvNet

5. CIFAR-10 open-ended challenge

5.1 你能尝试的方向
5.2 训练的技巧
5.3 做的更好
5.4 开始您的尝试
5.5 测试集测试——只测试一次

总结

0. Pytorch

0.1 简介

PyTorch是一个在**张量对象(tensor)**上执行动态计算图形的系统，这些张量对象的行为类似于numpy ndarray。它提供了一个强大的自动微分引擎，消除了手动反向传播(back-propagation)的需要。

0.2 入门

Justin Johnson 针对 Pytorch 分享了一个 tutorial ；
也可以在官方文档找到更详尽的内容 API doc；
遇到一些解决不了的问题，你可以去官方社区寻求帮助 PyTorch forum。

1. 准备

首先，我们下载 CIFAR-10 数据集，并利用 Pytorch 的模块来声明数据集，预处理数据以及生成 mini-batch。

import torch
import torch.nn as nn
import torch.optim as optim
from torch.utils.data import DataLoader
from torch.utils.data import sampler

import torchvision.datasets as dset
import torchvision.transforms as T

import numpy as np

NUM_TRAIN = 49000

# The torchvision.transforms package provides tools for preprocessing data
# and for performing data augmentation; here we set up a transform to
# preprocess the data by subtracting the mean RGB value and dividing by the
# standard deviation of each RGB value; we've hardcoded the mean and std.
transform = T.Compose([
                T.ToTensor(),
                T.Normalize((0.4914, 0.4822, 0.4465), (0.2023, 0.1994, 0.2010))
            ])

# We set up a Dataset object for each split (train / val / test); Datasets load
# training examples one at a time, so we wrap each Dataset in a DataLoader which
# iterates through the Dataset and forms minibatches. We divide the CIFAR-10
# training set into train and val sets by passing a Sampler object to the
# DataLoader telling how it should sample from the underlying Dataset.
cifar10_train = dset.CIFAR10('./cs231n/datasets', train=True, download=True,
                             transform=transform)
loader_train = DataLoader(cifar10_train, batch_size=64, 
                          sampler=sampler.SubsetRandomSampler(range(NUM_TRAIN)))

cifar10_val = dset.CIFAR10('./cs231n/datasets', train=True, download=True,
                           transform=transform)
loader_val = DataLoader(cifar10_val, batch_size=64, 
                        sampler=sampler.SubsetRandomSampler(range(NUM_TRAIN, 50000)))
                        
cifar10_test = dset.CIFAR10('./cs231n/datasets', train=False, 	
                             download=True, transform=transform)
loader_test = DataLoader(cifar10_test, batch_size=64)

然后，设置我们项目的全局 数据类型 以及 数据存储的设备， torch.cuda.is_available() 返回我们的 pytorch 是否支持 GPU，然后 dtype和device设置类型和设备。

USE_GPU = True

dtype = torch.float32 # we will be using float throughout this tutorial

if USE_GPU and torch.cuda.is_available():
    device = torch.device('cuda')
else:
    device = torch.device('cpu')

# Constant to control how frequently we print train loss
print_every = 100

print('using device:', device)

2. 最基础的Pytorch API（Barebones PyTorch）

这节我们会先写一个简单的网络用于 CIFAR 数据集的分类，其只包含带ReLu激活的全连接层，而且只有两个隐藏层。（我们会基于 Pytorch Tensor 完成前向传播，并借助 Pytorch 的自动求导机制 autograd 完成反向传播）。

当我们声明 Pytorch Tensor 并带有参数 requires_grad=True 时，表明张量不只是计算值,也将在后台建立一个计算图，使我们可以很容易地通过图形反向传播来计算一些张量相对于下游损失的梯度。
具体来说，如果x是一个x.requires_grad == True的张量，然后反向传播后x.grad是另一个张量，它包含了x关于最后的标量损耗loss的梯度。

2.1 PyTorch Tensors: Flatten Function

PyTorch张量在概念上类似于numpy数组:它是一个n维数字网格.。与numpy一样，它提供了许多函数来有效地操作张量 Tensor。
作为一个简单的例子，我们提供了一个flatten函数，用于在一个全连接的神经网络中对图像数据进行拉伸变型。
回想一下，图像数据通常存储在一个形状为 $N x C x H x W$ 的张量中，因此，我们使用**“flatten”操作将每个表示为 $C x H x W$ 的张量折叠成一个长向量**。

下面的 flatten 函数首先从给定的一批数据中读取N、C、H和W的值，然后返回该数据的view。
view类似于numpy的reshape方法:它将x的维度重塑为 $\times??$ 。其中? ?可以是任何东西，在这里，它是 $\times H \times W$ ，但我们不需要明确地指定它，可以用 -1来自动求出。

def flatten(x):
    N = x.shape[0] # read in N, C, H, W
    return x.view(N, -1)  # "flatten" the C * H * W values into a single vector per image

def test_flatten():
    x = torch.arange(12).view(2, 1, 3, 2)
    print('Before flattening: ', x)
    print('After flattening: ', flatten(x))

test_flatten()

输出：

Before flattening:  tensor([[[[ 0,  1],
          [ 2,  3],
          [ 4,  5]]],


        [[[ 6,  7],
          [ 8,  9],
          [10, 11]]]])
After flattening:  tensor([[ 0,  1,  2,  3,  4,  5],
        [ 6,  7,  8,  9, 10, 11]])

可以看到x的维度从 $2 * 1 * 3 * 2$ 变成了 $2 * 6$ 。

2.2 Barebones PyTorch: Two-Layer Network

在这里，我们定义了一个函数two_layer_fc，它对一批图像数据执行两层全连接ReLU网络的前向传递。在定义了向前传递之后，我们检查它是否崩溃，并通过在网络中输入0来检验输出的形状。

您不必在这里编写任何代码，但是阅读和理解实现是非常重要的

import torch.nn.functional as F  # useful stateless functions

def two_layer_fc(x, params):
    """
    A fully-connected neural networks; the architecture is:
    NN is fully connected -> ReLU -> fully connected layer.
    Note that this function only defines the forward pass; 
    PyTorch will take care of the backward pass for us.
    
    The input to the network will be a minibatch of data, of shape
    (N, d1, ..., dM) where d1 * ... * dM = D. The hidden layer will have H units,
    and the output layer will produce scores for C classes.
    
    Inputs:
    - x: A PyTorch Tensor of shape (N, d1, ..., dM) giving a minibatch of
      input data.
    - params: A list [w1, w2] of PyTorch Tensors giving weights for the network;
      w1 has shape (D, H) and w2 has shape (H, C).
    
    Returns:
    - scores: A PyTorch Tensor of shape (N, C) giving classification scores for
      the input data x.
    """
    # first we flatten the image
    x = flatten(x)  # shape: [batch_size, C x H x W]
    
    w1, w2 = params
    
    # Forward pass: compute predicted y using operations on Tensors. Since w1 and
    # w2 have requires_grad=True, operations involving these Tensors will cause
    # PyTorch to build a computational graph, allowing automatic computation of
    # gradients. Since we are no longer implementing the backward pass by hand we
    # don't need to keep references to intermediate values.  
    # you can also use `.clamp(min=0)`, equivalent to F.relu()
    x = F.relu(x.mm(w1))
    x = x.mm(w2)
    return x
    

def two_layer_fc_test():
    hidden_layer_size = 42
    x = torch.zeros((64, 50), dtype=dtype)  # minibatch size 64, feature dimension 50
    w1 = torch.zeros((50, hidden_layer_size), dtype=dtype)
    w2 = torch.zeros((hidden_layer_size, 10), dtype=dtype)
    scores = two_layer_fc(x, [w1, w2])
    print(scores.size())  # you should see [64, 10]

two_layer_fc_test()

torch.Size([64, 10])

2.3 Barebones PyTorch: Three-Layer ConvNet

在这里，您将完成函数 three_layer_convnet 的实现，该函数将执行一个三层卷积网络的前向传递。像上面一样，我们可以通过在网络中传递0来立即测试我们的实现。该网络应具有以下架构:

一个卷积层(带偏置bias)带channel_1个卷积核，大小为 $KW_1 \times KH_1$ ，且带有2个0填充；
一个ReLU非线性激活层；
一个卷积层(带偏置bias)带channel_2个卷积核，大小为 $KW_2 \times KH_2$ ，且带有1个0填充；
一个ReLu非线性激活层；
带有偏置bias的全连接层，产生C类得分。

torch.nn.functional.conv2d文档
torch.nn.Conv2d( in_channels, out_channels, kernel_size, stride=1, padding=0, dilation=1, groups=1, bias=True, padding_mode='zeros')
torch.nn.functional.conv2d(input, weight, bias=None, stride=1, padding=0, dilation=1, groups=1) → Tensor

def three_layer_convnet(x, params):
    """
    Performs the forward pass of a three-layer convolutional network with the
    architecture defined above.

    Inputs:
    - x: A PyTorch Tensor of shape (N, 3, H, W) giving a minibatch of images
    - params: A list of PyTorch Tensors giving the weights and biases for the
      network; should contain the following:
      - conv_w1: PyTorch Tensor of shape (channel_1, 3, KH1, KW1) giving weights
        for the first convolutional layer
      - conv_b1: PyTorch Tensor of shape (channel_1,) giving biases for the first
        convolutional layer
      - conv_w2: PyTorch Tensor of shape (channel_2, channel_1, KH2, KW2) giving
        weights for the second convolutional layer
      - conv_b2: PyTorch Tensor of shape (channel_2,) giving biases for the second
        convolutional layer
      - fc_w: PyTorch Tensor giving weights for the fully-connected layer. Can you
        figure out what the shape should be?
      - fc_b: PyTorch Tensor giving biases for the fully-connected layer. Can you
        figure out what the shape should be?
    
    Returns:
    - scores: PyTorch Tensor of shape (N, C) giving classification scores for x
    """
    conv_w1, conv_b1, conv_w2, conv_b2, fc_w, fc_b = params
    scores = None
    
    # TODO: Implement the forward pass for the three-layer ConvNet.                
    #torch.nn.functional.conv2d(input, weight, bias=None, stride=1, padding=0, dilation=1, groups=1)#
    x = F.conv2d(x,conv_w1,bias=conv_b1,padding=2)
    x = F.relu(x)
    x = F.conv2d(x,conv_w2,bias=conv_b2,padding=1)
    x = flatten(x)
    # 添加映射层
    scores = x.mm(fc_w) + fc_b
    
    return scores

def three_layer_convnet_test():
    x = torch.zeros((64, 3, 32, 32), dtype=dtype)  # minibatch size 64, image size [3, 32, 32]

    conv_w1 = torch.zeros((6, 3, 5, 5), dtype=dtype)  # [out_channel, in_channel, kernel_H, kernel_W]
    conv_b1 = torch.zeros((6,))  # out_channel
    conv_w2 = torch.zeros((9, 6, 3, 3), dtype=dtype)  # [out_channel, in_channel, kernel_H, kernel_W]
    conv_b2 = torch.zeros((9,))  # out_channel

    # you must calculate the shape of the tensor after two conv layers, before the fully-connected layer
    fc_w = torch.zeros((9 * 32 * 32, 10))
    fc_b = torch.zeros(10)

    scores = three_layer_convnet(x, [conv_w1, conv_b1, conv_w2, conv_b2, fc_w, fc_b])
    print(scores.size())  # you should see [64, 10]
three_layer_convnet_test()

输出:torch.Size([64, 10])

2.4 Barebones PyTorch: Initialization

让我们写几个实用程序方法来初始化我们的模型的权重矩阵

random_weight(shape) initializes a weight tensor with the Kaiming normalization method.
zero_weight(shape) initializes a weight tensor with all zeros. Useful for instantiating bias parameters.

def random_weight(shape):
    """
    Create random Tensors for weights; setting requires_grad=True means that we
    want to compute gradients for these Tensors during the backward pass.
    We use Kaiming normalization: sqrt(2 / fan_in)
    """
    if len(shape) == 2:  # FC weight
        fan_in = shape[0]
    else:
        fan_in = np.prod(shape[1:]) # conv weight [out_channel, in_channel, kH, kW]
    # randn is standard normal distribution generator. 
    w = torch.randn(shape, device=device, dtype=dtype) * np.sqrt(2. / fan_in)
    w.requires_grad = True
    return w

def zero_weight(shape):
    return torch.zeros(shape, device=device, dtype=dtype, requires_grad=True)

# create a weight of shape [3 x 5]
# you should see the type `torch.cuda.FloatTensor` if you use GPU. 
# Otherwise it should be `torch.FloatTensor`
random_weight((3, 5))

输出：

tensor([[-0.3170,  1.1586,  0.2524, -0.0345,  0.0226],
        [ 0.3086,  1.2709,  0.4495, -1.0421, -0.3212],
        [ 0.8470,  1.1458,  0.4931, -0.3018, -0.4302]], device='cuda:0',
       requires_grad=True)

2.5 Barebones PyTorch: Check Accuracy

在对模型进行训练时，我们将使用以下函数来检查我们的模型在训练集或验证集上的准确性。

在检查精度时，我们不需要计算任何梯度;因此，在计算分数时，我们不需要PyTorch为我们构建计算图。为了防止构建图形，我们在torch.no_grad()上下文管理器下确定计算范围。

def check_accuracy_part2(loader, model_fn, params):
    """
    Check the accuracy of a classification model.
    
    Inputs:
    - loader: A DataLoader for the data split we want to check
    - model_fn: A function that performs the forward pass of the model,
      with the signature scores = model_fn(x, params)
    - params: List of PyTorch Tensors giving parameters of the model
    
    Returns: Nothing, but prints the accuracy of the model
    """
    split = 'val' if loader.dataset.train else 'test'
    print('Checking accuracy on the %s set' % split)
    num_correct, num_samples = 0, 0
    with torch.no_grad():
        for x, y in loader:
            x = x.to(device=device, dtype=dtype)  # move to device, e.g. GPU
            y = y.to(device=device, dtype=torch.int64)
            scores = model_fn(x, params)
            _, preds = scores.max(1) # tensor.max()返回一个元组(最大值，最大值对应的索引)
            num_correct += (preds == y).sum()
            num_samples += preds.size(0)
        acc = float(num_correct) / num_samples
        print('Got %d / %d correct (%.2f%%)' % (num_correct, num_samples, 100 * acc))

2.6 BareBones PyTorch: Training Loop

def train_part2(model_fn, params, learning_rate):
    """
    Train a model on CIFAR-10.
    
    Inputs:
    - model_fn: A Python function that performs the forward pass of the model.
      It should have the signature scores = model_fn(x, params) where x is a
      PyTorch Tensor of image data, params is a list of PyTorch Tensors giving
      model weights, and scores is a PyTorch Tensor of shape (N, C) giving
      scores for the elements in x.
    - params: List of PyTorch Tensors giving weights for the model
    - learning_rate: Python scalar giving the learning rate to use for SGD
    
    Returns: Nothing
    """
    for t, (x, y) in enumerate(loader_train):
        # Move the data to the proper device (GPU or CPU)
        x = x.to(device=device, dtype=dtype)
        y = y.to(device=device, dtype=torch.long)

        # Forward pass: compute scores and loss
        scores = model_fn(x, params)
        loss = F.cross_entropy(scores, y)

        # Backward pass: PyTorch figures out which Tensors in the computational
        # graph has requires_grad=True and uses backpropagation to compute the
        # gradient of the loss with respect to these Tensors, and stores the
        # gradients in the .grad attribute of each Tensor.
        loss.backward()

        # Update parameters. We don't want to backpropagate through the
        # parameter updates, so we scope the updates under a torch.no_grad()
        # context manager to prevent a computational graph from being built.
        with torch.no_grad():
            for w in params:
                w -= learning_rate * w.grad

                # Manually zero the gradients after running the backward pass
                w.grad.zero_()

        if t % print_every == 0:
            print('Iteration %d, loss = %.4f' % (t, loss.item()))
            check_accuracy_part2(loader_val, model_fn, params)

2.7 BareBones PyTorch: Train a Two-Layer Network

hidden_layer_size = 4000
learning_rate = 1e-2

w1 = random_weight((3 * 32 * 32, hidden_layer_size))
w2 = random_weight((hidden_layer_size, 10))

train_part2(two_layer_fc, [w1, w2], learning_rate)
# val set 上的准确度大概在 40%

Iteration 0, loss = 3.2187
Checking accuracy on the val set
Got 129 / 1000 correct (12.90%)

Iteration 100, loss = 2.0614
Checking accuracy on the val set
Got 325 / 1000 correct (32.50%)

Iteration 200, loss = 1.6769
Checking accuracy on the val set
Got 371 / 1000 correct (37.10%)

Iteration 300, loss = 1.9845
Checking accuracy on the val set
Got 394 / 1000 correct (39.40%)

Iteration 400, loss = 2.1362
Checking accuracy on the val set
Got 354 / 1000 correct (35.40%)

Iteration 500, loss = 1.7240
Checking accuracy on the val set
Got 442 / 1000 correct (44.20%)

Iteration 600, loss = 2.3258
Checking accuracy on the val set
Got 438 / 1000 correct (43.80%)

Iteration 700, loss = 1.9758
Checking accuracy on the val set
Got 430 / 1000 correct (43.00%)

2.8 BareBones PyTorch: Training a ConvNet

learning_rate = 3e-3

channel_1 = 32
channel_2 = 16

conv_w1 =  None
conv_b1 =  None
conv_w2 =  None
conv_b2 =  None
fc_w =  None
fc_b = None


# TODO: Initialize the parameters of a three-layer ConvNet.                    #
conv_w1 = random_weight((32,3,5,5))
conv_b1 = zero_weight((32,))
conv_w2 = random_weight((16,32,3,3))
conv_b2 = zero_weight((16,))
fc_w = random_weight((16 * 32 * 32,10))
fc_b = zero_weight(10)


params = [conv_w1, conv_b1, conv_w2, conv_b2, fc_w, fc_b]
train_part2(three_layer_convnet, params, learning_rate)

Iteration 0, loss = 4.2925
Checking accuracy on the val set
Got 141 / 1000 correct (14.10%)

Iteration 100, loss = 1.6818
Checking accuracy on the val set
Got 350 / 1000 correct (35.00%)

Iteration 200, loss = 1.9404
Checking accuracy on the val set
Got 450 / 1000 correct (45.00%)

Iteration 300, loss = 1.7681
Checking accuracy on the val set
Got 474 / 1000 correct (47.40%)

Iteration 400, loss = 1.8228
Checking accuracy on the val set
Got 460 / 1000 correct (46.00%)

Iteration 500, loss = 1.4132
Checking accuracy on the val set
Got 483 / 1000 correct (48.30%)

Iteration 600, loss = 1.5273
Checking accuracy on the val set
Got 496 / 1000 correct (49.60%)

Iteration 700, loss = 1.5708
Checking accuracy on the val set
Got 509 / 1000 correct (50.90%)

3.PyTorch Module API

Barebone PyTorch要求我们手动跟踪所有的参数张量。这对于只有几个张量的小型网络来说很好，但是在大型网络中跟踪几十个或几百个张量会非常不方便而且容易出错。
PyTorch提供了nn.Module帮助您定义任意的网络架构，同时为您跟踪每个可学习的参数。在第二部分中，我们自己实现了SGD。PyTorch还提供了torch.optim包，它实现了所有常见的优化器，比如RMSProp、Adagrad和Adam。它甚至支持近似的二阶方法，如L-BFGS。您可以参考文档doc以获得每个优化器的准确规范。

为了使用Module API，我们按照下列步骤：

继承nn.Module。给你的网络类一个直观的名字，比如TwoLayerFC。
在构造函数_init__()中，将需要的所有层定义为类属性。
层对象（Layer objects），比如nn.Linear 和 nn.Conv2d本身就是nn.Module子类，并包含可学习的参数，因此您不必自己实例化原始张量。
nn.Module将为您跟踪这些内部参数。参考文档doc以了解更多关于构建层的信息。
警告: 不要忘记首先调用super(). init__() !
在forward()方法中，定义网络的连接性。你应该使用在_init__中定义的属性作为函数调用，以张量作为输入，输出“变换后的”张量。不要在forward()中创建任何带有可学习参数的新层!，所有这些层都必须在_init_里面声明。

定义了模块子类之后，可以将其实例化为对象并像第2部分中的NN前向函数那样调用它。

3.1 Module API: Two-Layer Network

class TwoLayerFC(nn.Module):
    def __init__(self, input_size, hidden_size, num_classes):
        super().__init__()
        # assign layer objects to class attributes
        self.fc1 = nn.Linear(input_size, hidden_size)
        # nn.init package contains convenient initialization methods
        # http://pytorch.org/docs/master/nn.html#torch-nn-init 
        nn.init.kaiming_normal_(self.fc1.weight)
        self.fc2 = nn.Linear(hidden_size, num_classes)
        nn.init.kaiming_normal_(self.fc2.weight)
    
    def forward(self, x):
        # forward always defines connectivity
        x = flatten(x)
        scores = self.fc2(F.relu(self.fc1(x)))
        return scores

def test_TwoLayerFC():
    input_size = 50
    x = torch.zeros((64, input_size), dtype=dtype)  # minibatch size 64, feature dimension 50
    model = TwoLayerFC(input_size, 42, 10)
    scores = model(x)
    print(scores.size())  # you should see [64, 10]
test_TwoLayerFC()

3.2 Module API: Three-Layer ConvNet

It’s your turn to implement a 3-layer ConvNet followed by a fully connected layer. The network architecture should be the same as in Part II:

Convolutional layer with channel_1 5x5 filters with zero-padding of 2
ReLU
Convolutional layer with channel_2 3x3 filters with zero-padding of 1
ReLU
Fully-connected layer to num_classes classes

You should initialize the weight matrices of the model using the Kaiming normal initialization method.

torch.nn.functional.conv2d文档
torch.nn.Conv2d( in_channels, out_channels, kernel_size, stride=1, padding=0, dilation=1, groups=1, bias=True, padding_mode='zeros')

class ThreeLayerConvNet(nn.Module):
    def __init__(self, in_channel, channel_1, channel_2, num_classes):
        super().__init__()
       
        # TODO: Set up the layers you need for a three-layer ConvNet with the  #
        # architecture defined above.                                          #
        self.conv1 = nn.Conv2d(in_channels=in_channel,out_channels=channel_1,kernel_size=5,padding=2, bias=True)
        nn.init.kaiming_normal_(self.conv1.weight)
        nn.init.constant_(self.conv1.bias,0)
        
        self.relu = F.relu
        
        self.conv2 = nn.Conv2d(in_channels=channel_1,out_channels=channel_2,kernel_size=3,padding=1,bias=True)
        nn.init.kaiming_normal_(self.conv2.weight)
        nn.init.constant_(self.conv2.bias,0)
        
        self.fc = nn.Linear(channel_2 * 32 * 32, num_classes)
     

    def forward(self, x):
        scores = None
        ########################################################################
        # TODO: Implement the forward function for a 3-layer ConvNet. you      #
        # should use the layers you defined in __init__ and specify the        #
        # connectivity of those layers in forward()                            #
        ########################################################################
        x = self.relu(self.conv1(x))
        x = self.relu(self.conv2(x))
        scores = self.fc(flatten(x))
        return scores


def test_ThreeLayerConvNet():
    x = torch.zeros((64, 3, 32, 32), dtype=dtype)  # minibatch size 64, image size [3, 32, 32]
    model = ThreeLayerConvNet(in_channel=3, channel_1=12, channel_2=8, num_classes=10)
    scores = model(x)
    print(scores.size())  # you should see [64, 10]
test_ThreeLayerConvNet()

3.3 Module API: Check Accuracy

给定验证或测试集，我们可以检查神经网络的分类精度。
这个版本与第二部分略有不同。您不再需要手动传递参数。

def check_accuracy_part34(loader, model):
    if loader.dataset.train:
        print('Checking accuracy on validation set')
    else:
        print('Checking accuracy on test set')   
    num_correct = 0
    num_samples = 0
    model.eval()  # set model to evaluation mode
    with torch.no_grad():
        for x, y in loader:
            x = x.to(device=device, dtype=dtype)  # move to device, e.g. GPU
            y = y.to(device=device, dtype=torch.long)
            scores = model(x)
            _, preds = scores.max(1)
            num_correct += (preds == y).sum()
            num_samples += preds.size(0)
        acc = float(num_correct) / num_samples
        print('Got %d / %d correct (%.2f)' % (num_correct, num_samples, 100 * acc))

注意：

设置模型为评测模式model.eval()，此时不会记录batchnorm的训练均值等；
使用with torch.no_grad()不会记录训练梯度；

3.4 Module API: Training Loop

我们还使用了一个稍微不同的训练循环。我们使用来自torch.optim的优化器对象，而不是自己更新权重的值。它抽象了优化算法的概念，并提供了通常用于优化神经网络的大多数算法的实现。

def train_part34(model, optimizer, epochs=1):
    """
    Train a model on CIFAR-10 using the PyTorch Module API.
    
    Inputs:
    - model: A PyTorch Module giving the model to train.
    - optimizer: An Optimizer object we will use to train the model
    - epochs: (Optional) A Python integer giving the number of epochs to train for
    
    Returns: Nothing, but prints model accuracies during training.
    """
    model = model.to(device=device)  # move the model parameters to CPU/GPU
    for e in range(epochs):
        for t, (x, y) in enumerate(loader_train):
            model.train()  # put model to training mode
            x = x.to(device=device, dtype=dtype)  # move to device, e.g. GPU
            y = y.to(device=device, dtype=torch.long)

            scores = model(x)
            loss = F.cross_entropy(scores, y)

            # Zero out all of the gradients for the variables which the optimizer
            # will update.
            optimizer.zero_grad()

            # This is the backwards pass: compute the gradient of the loss with
            # respect to each  parameter of the model.
            loss.backward()

            # Actually update the parameters of the model using the gradients
            # computed by the backwards pass.
            optimizer.step()

            if t % print_every == 0:
                print('Iteration %d, loss = %.4f' % (t, loss.item()))
                check_accuracy_part34(loader_val, model)
                print()

3.5 Module API: Train a Two-Layer Network

现在我们准备运行训练循环。与第二部分不同，我们不再显式地分配参数张量。
只需将输入大小、隐藏层大小和类的数量(即输出大小)传递给TwoLayerFC的构造函数。
您还需要定义一个优化器来跟踪TwoLayerFC中的所有可学习参数。
您不需要调整任何超参数，但是经过一个epoch的训练后，您应该可以看到模型精度超过40%。

hidden_layer_size = 4000
learning_rate = 1e-2
model = TwoLayerFC(3 * 32 * 32, hidden_layer_size, 10)
optimizer = optim.SGD(model.parameters(), lr=learning_rate)

train_part34(model, optimizer)

最终精度在43%~45%左右。

3.6 Module API: Train a Three-Layer ConvNet

现在您应该使用Module API在CIFAR上训练一个三层的ConvNet。这应该看起来很像训练两层网络!您不需要调整任何超参数，但是经过一段时间的训练后，您应该可以达到45%以上。你应该使用没有动量的随机梯度下降来训练模型。

learning_rate = 3e-3
channel_1 = 32
channel_2 = 16

model = None
optimizer = None
# TODO: Instantiate your ThreeLayerConvNet model and a corresponding optimizer #
model = ThreeLayerConvNet(3, channel_1, channel_2, 10)
optimizer = torch.optim.SGD(model.parameters(),lr=learning_rate)

train_part34(model, optimizer)

最终精度在48%左右。

4.PyTorch Sequential API

第三部分介绍了PyTorch Module API，它允许您定义任意可学习的层及其连接性。

对于像前馈层堆叠这样的简单模型，仍然需要经过3步:继承nn.Module类000，在_init__中将网络层声明成类属性，并在forward()方法中逐个调用每一层。

有没有更方便的方法?

幸运的是，PyTorch提供了一个名为nn.Sequentiao的容器模块，它将上述步骤合并为一个。它不像nn.Module`那么灵活。模块，因为您不能指定比前馈堆栈更复杂的拓扑结构，但是它对于许多用例来说已经足够了。

4.1 Sequential API: Two-Layer Network

# We need to wrap `flatten` function in a module in order to stack it
# in nn.Sequential
class Flatten(nn.Module):
    def forward(self, x):
        return flatten(x)

hidden_layer_size = 4000
learning_rate = 1e-2

model = nn.Sequential(
    Flatten(),
    nn.Linear(3 * 32 * 32, hidden_layer_size),
    nn.ReLU(),
    nn.Linear(hidden_layer_size, 10),
)

# you can use Nesterov momentum in optim.SGD
optimizer = torch.optim.SGD(model.parameters(), lr=learning_rate,
                     momentum=0.9, nesterov=True)

train_part34(model, optimizer)

4.2 Sequential API: Three-Layer ConvNet

channel_1 = 32
channel_2 = 16
learning_rate = 1e-2

model = None
optimizer = None


# TODO: Rewrite the 2-layer ConvNet with bias from Part III with the           #
# Sequential API.                                                              #
model = nn.Sequential(
    nn.Conv2d(3,channel_1,kernel_size=5,padding=2),
    nn.ReLU(),
    nn.Conv2d(channel_1,channel_2,kernel_size=3,padding=1),
    nn.ReLU(),
    Flatten(),
    nn.Linear(channel_2*32*32,10)
)
optimizer = torch.optim.SGD(model.parameters(), lr=learning_rate,
                     momentum=0.9, nesterov=True)

train_part34(model, optimizer)

5. CIFAR-10 open-ended challenge

在本节中，您可以在CIFAR-10上试验任何您想要的ConvNet架构。

现在，您的工作是对 体系结构、超参数、损失函数和优化器 进行试验，以训练在10个epoch内在CIFAR-10验证集上获得至少70%的准确率的模型。您可以使用上面的check_accuracy和train函数。你可以使用任何一个nn.Module或nn.Sequential的API。

以下是每个组件的官方API文档。注意:我们在Pytorch中将“spatial batch norm”称为“BatchNorm2D”。

Layers in torch.nn package: http://pytorch.org/docs/stable/nn.html
Activations: http://pytorch.org/docs/stable/nn.html#non-linear-activations
Loss functions:http://pytorch.org/docs/stable/nn.html#loss-functions
Optimizers: http://pytorch.org/docs/stable/optim.html

5.1 你能尝试的方向

卷积核大小(Filter size)：我们之前用的都是 5x5，是否更小的核更有效？
卷积核数目(Number of filters)：上面我们使用了32个过滤器。多做还是少做更好?
池化(Pooling)还是步长卷积（Strided Convolution）：你是使用最大池化还是仅仅使用跨步卷积?
批量归一化(Batch normalization)：尝试在卷积层之后添加空间批处理归一化，在仿射层之后添加普通批处理归一化。你的网络训练得更快吗?
网络结构(Network architecture)：上面的网络有两层可训练的参数。你能用深层网络做得更好吗?好的架构包括:
- [conv-relu-pool]xN -> [affine]xM -> [softmax or SVM]
- [conv-relu-conv-relu-pool]xN -> [affine]xM -> [softmax or SVM]
- [batchnorm-relu-conv]xN -> [affine]xM -> [softmax or SVM]
全局平均池化（Global Average Pooling）：不要先进行拉伸(flatten)，然后再使用多个仿射层，而是执行卷积，直到图像变小(7x7左右)，然后执行平均池操作，得到1x1图像图像(1,1,Filter#)，然后将其重新构造为(Filter#)向量。这个方法被用在了 Google’s Inception Network (See Table 1 for their architecture).

5.2 训练的技巧

对于您尝试的每个网络体系结构，您都应该调整学习率和其他超参数。当你这样做的时候，有几件重要的事情要记住:

如果参数工作得很好，您应该可以在几百次迭代中看到改进；
请记住超参数调优的由粗到细的方法:首先测试大范围的超参数，只需要几个训练迭代，就可以找到有效的参数组合。
一旦您找到了一些似乎可以工作的参数集，就可以更细致地搜索这些参数。你可能需要为更多的时代而训练。
您应该使用验证集来进行超参数搜索，并保存您的测试集，以便根据验证集所选择的最佳参数来评估您的体系结构。

5.3 做的更好

如果您喜欢冒险，您可以实现许多其他特性来尝试改进性能。您不需要实现其中的任何一个，但是如果您有时间，请不要错过其中的乐趣!

优化器(Alternative optimizers): you can try Adam, Adagrad, RMSprop, etc.
激活函数：alternative activation functions such as leaky ReLU, parametric ReLU, ELU, or MaxOut.
模型集成（Model ensembles）
数据增强（Data augmentation）
新的网络结构：
- ResNets where the input from the previous layer is added to the output.
- DenseNets where inputs into previous layers are concatenated together.
- This blog has an in-depth overview

5.4 开始您的尝试

我试了一下修改的ResNet18网络，在第3个epoch开始能得到75.4%的准确度,最高78%的准确度（验证集）。

################################################################################
# TODO:                                                                        #         
# Experiment with any architectures, optimizers, and hyperparameters.          #
# Achieve AT LEAST 70% accuracy on the *validation set* within 10 epochs.      #
#                                                                              #
# Note that you can use the check_accuracy function to evaluate on either      #
# the test set or the validation set, by passing either loader_test or         #
# loader_val as the second argument to check_accuracy. You should not touch    #
# the test set until you have finished your architecture and  hyperparameter   #
# tuning, and only run the test set once at the end to report a final value.   #
################################################################################
from collections import OrderedDict
model = None
optimizer = None

def conv3x3(in_planes,out_planes,stride = 1):
    # "3x3 convolution with padding"
    return nn.Conv2d(
        in_planes, out_planes, 
        kernel_size=3, stride=stride, padding=1, bias=False)
class BasicBlock(nn.Module):
    expansion = 1
    def __init__(self, inplanes, planes, stride=1, downsample=None):
        super(BasicBlock, self).__init__()
        m = OrderedDict()
        m['conv1'] = conv3x3(inplanes, planes, stride)
        m['bn1'] = nn.BatchNorm2d(planes)
        m['relu1'] = nn.ReLU(inplace=True)
        m['conv2'] = conv3x3(planes, planes)
        m['bn2'] = nn.BatchNorm2d(planes)
        self.group1 = nn.Sequential(m)

        self.relu= nn.Sequential(nn.ReLU(inplace=True))
        self.downsample = downsample

    def forward(self, x):
        if self.downsample is not None:
            residual = self.downsample(x)
        else:
            residual = x

        out = self.group1(x) + residual

        out = self.relu(out)

        return out

class ResNet(nn.Module):
    def __init__(self, block, layers, num_classes=10):
        self.inplanes = 64
        super(ResNet, self).__init__()

        m = OrderedDict()
        m['conv1'] = nn.Conv2d(3, 64, kernel_size=3, stride=1, padding=1, bias=False)
        m['bn1'] = nn.BatchNorm2d(64)
        m['relu1'] = nn.ReLU(inplace=True)
        m['maxpool'] = nn.MaxPool2d(kernel_size=2, stride=2)
        self.group1= nn.Sequential(m)

        self.layer1 = self._make_layer(block, 64, layers[0])
        self.layer2 = self._make_layer(block, 128, layers[1], stride=2)
        self.layer3 = self._make_layer(block, 256, layers[2], stride=2)
        self.layer4 = self._make_layer(block, 512, layers[3], stride=2)

        self.avgpool = nn.Sequential(nn.AvgPool2d(2))
        self.attention = nn.Conv1d(1,1,kernel_size=3,padding=1)
        self.group2 = nn.Sequential(
            OrderedDict([
                ('fc', nn.Linear(512 * block.expansion, num_classes))
            ])
        )
        
        for m in self.modules():
            if isinstance(m, nn.Conv2d):
                n = m.kernel_size[0] * m.kernel_size[1] * m.out_channels
                m.weight.data.normal_(0, np.sqrt(2. / n))
            elif isinstance(m, nn.BatchNorm2d):
                m.weight.data.fill_(1)
                m.bias.data.zero_()

    def _make_layer(self, block, planes, blocks, stride=1):
        downsample = None
        if stride != 1 or self.inplanes != planes * block.expansion:
            downsample = nn.Sequential(
                nn.Conv2d(self.inplanes, planes * block.expansion, kernel_size=1, stride=stride, bias=False),
                nn.BatchNorm2d(planes * block.expansion),
            )

        layers = []
        layers.append(block(self.inplanes, planes, stride, downsample))
        self.inplanes = planes * block.expansion
        for i in range(1, blocks):
            layers.append(block(self.inplanes, planes))

        return nn.Sequential(*layers)

    def forward(self, x):
        x = self.group1(x)
        x = self.layer1(x)
        x = self.layer2(x)
        x = self.layer3(x)
        x = self.layer4(x)  #(64,512,2,2)
        x = self.avgpool(x) #(64,512,1,1)
        x = x.view(x.size(0), -1) #(64,512)
        # 添加一层channel attention
        x = x.unsqueeze(1)
        a = self.attention(x)
        x = x.mul(a)
        x = x.squeeze(1)
        x = self.group2(x)

        return x


def resnet18(pretrained=False, model_root=None, **kwargs):
    model = ResNet(BasicBlock, [2, 2, 2, 2], **kwargs)
    if pretrained:
        misc.load_state_dict(model, model_urls['resnet18'], model_root)
    return model
def test_MyCNN():
    model = resnet18()
    x = torch.zeros((64, 3, 32, 32), dtype=dtype)  # minibatch size 64, image size [3, 32, 32]
    scores = model(x)
    print(scores.size())
# You should get at least 70% accuracy
model = resnet18()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-2,
                     momentum=0.9, nesterov=True)
print_every = 1000
train_part34(model, optimizer,epochs=10)

5.5 测试集测试——只测试一次

best_model = model
check_accuracy_part34(loader_test, best_model)

测试集上精度：

Checking accuracy on test set
Got 7664 / 10000 correct (76.64)

总结

在使用 Pytorch 完成我们的深度学习工作的时候，大致遵循下列步骤：

定义你的数据集
你可以使用torchvision.datasets中已经定义好的数据集格式，或者自己定义一个数据集对象object。同时，利用import torchvision.transforms来对数据进行变换。
定义你的加载器
from torch.utils.data import DataLoader可以让你批量加载数据
定义你的模型
使用import torch.nn.functional as F里的函数，以及nn中的分钟带可学习参数的网络层来定义自己的网络结构以及连接nn.Module；或者使用nn.Sequential定义我们的前馈网络。
定义你的测试方法
使用model.eval()转换测试模式，用with torch.no_grad()声明计算图不积累梯度，然后输入测试数据计算测试指标（如损失、准确度等）
定义优化器
torch.optim中定义了各种优化器，我们需要指明参数，并分配给具体的模型可学习参数。
定义训练函数
加载模型->输入训练数据（前向传播 model(x)）->计算损失loss(nn.functioanl中定义了很多损失函数)->原始梯度清零(用optimizer.zero_grad()将以前积累的梯度清零)->损失反向传播(loss.backward())->参数更新(optimizer.step())

你可能感兴趣的:(#,CS231n)

cs231n_深度之眼第二次作业 Jie_Cheney
图像分类数据和label分别是什么？图像分类存在的问题与挑战？图像分类数据包括训练集测试集的数据，在有监督的问题中对于训练集数据来说是有label的，而测试集是等待我们去识别它的类别，不具有label。label就是分类标签，比如cifar10这个数据集，待分类的这10类数据我们可以写成1-10，或者0-9这就叫做label。图像分类存在的问题与挑战：光照，角度，形变，遮挡。使用python加载一
向量，矩阵和张量的导数 | 简单的数学橘子学AI
前段时间看过一些矩阵求导的教程，在看过的资料中，尤其喜欢斯坦福大学CS231n卷积神经网络课程中提到的Erik这篇文章。循着他的思路，可以逐步将复杂的求导过程简化、再简化，直到发现其中有规律的部分。话不多说，一起来看看吧。作者：ErikLearned-Miller翻译：橘子来源：橘子AI笔记（datawitch）本文旨在帮助您学习向量、矩阵和高阶张量（三维或三维以上的数组）的求导方法，以及如何求对
cs231n assignment1——SVM 柠檬山楂荷叶茶 cs231n 支持向量机 python 机器学习
整体思路加载CIFAR-10数据集并展示部分数据数据图像归一化，减去均值（也可以再除以方差）svm_loss_naive和svm_loss_vectorized计算hinge损失，用拉格朗日法列hinge损失函数利用随机梯度下降法优化SVM在训练集和验证集计算准确率，保存最好的模型在测试集进行预测计算准确率加载展示划分数据集加载CIFAR-10数据集#LoadtherawCIFAR-10data.
（2023版）斯坦福CS231n学习笔记：DL与CV教程 (12) | 视觉模型可视化与可解释性（Visualizing and Understanding）女王の专属领地计算机视觉 #计算机视觉 #学习笔记
前言笔记专栏：斯坦福CS231N：面向视觉识别的卷积神经网络（23）课程链接：https://www.bilibili.com/video/BV1xV411R7i5CS231n:深度学习计算机视觉（2017）中文笔记：https://zhuxiaoxia.blog.csdn.net/article/details/801551662023最新课程PPT：https://download.csdn.
2019-02-25~~2019-03-03 第十周周末复盘仰望星空的小狗
一、任务清单1、刷leetcode题目（7道）2、听tensorflow，cs231n和cv课程3、技术文档输出4、恢复早起的作息二、反思1、自从年前工作非常忙，加上遇上一些郁闷的事情，导致年前到现在时间记录中断了很长一段时间。本周开始恢复时间记录，日打卡，周复盘。2、生活中不论谁，肯定会时不时遇上一些令人郁闷的事情，这些郁闷的事情很可能会打乱原本的生活节奏。但是，生活还有很长的路要走，不应该因为
训练神经网络(上)激活函数笔写落去深度学习神经网络人工智能深度学习
本文介绍几种激活函数,只作为个人笔记.观看视频为cs231n文章目录前言一、Sigmoid函数二、tanh函数三、ReLU函数四、LeakyReLU函数五、ELU函数六.在实际应用中寻找激活函数的做法总结前言激活函数是用来加入非线性因素的，提高神经网络对模型的表达能力，解决线性模型所不能解决的问题。一、Sigmoid函数这个函数大家应该熟悉在逻辑回归中曾用到这个sigmoid函数这个函数可以将负无
卷积神经网络 weixin_34283445 人工智能
https://zhuanlan.zhihu.com/p/27642620关于卷积神经网络的讲解，网上有很多精彩文章，且恐怕难以找到比斯坦福的CS231n还要全面的教程。所以这里对卷积神经网络的讲解主要是以不同的思考侧重展开，通过对卷积神经网络的分析，进一步理解神经网络变体中“因素共享”这一概念。注意：该文会跟其他的现有文章有很大的不同。读该文需要有本书前些章节作为预备知识，不然会有理解障碍。没看
CS231n 作业答案 tech0ne
CS231n三次大作业：#第一次作业##原始包下载：作业一完成包地址：作业一JupyterNotebook结果：KNNSVMSoftmaxTwolayernetFeatures第二次作业原始包下载：作业二完成包地址：作业二JupyterNotebook结果：FullyConnectedNetsBatchNormalizationDropoutConvolutionalNetworksTensorf
cs231n作业-assignment1 momentum_ AI python 机器学习 numpy
assignment1(cs231n)文章目录assignment1(cs231n)KNN基础计算distances方法一：双层循环计算distances方法二：单层循环计算distances方法三：无循环根据dists找到每个测试样本的种类KNN模型汇总交叉验证KNN基础计算distances方法一：双层循环dists矩阵是（num_test,num_train）500*5000defcompu
【深度学习理论】(1) 损失函数立Sir 深度学习理论机器学习人工智能神经网络深度学习损失函数
各位同学好，最近学习了CS231N斯坦福计算机视觉公开课，讲的太精彩了，和大家分享一下。已知一张图像属于各个类别的分数，我们希望图像属于正确分类的分数是最大的，那如何定量的去衡量呢，那就是损失函数的作用了。通过比较分数与真实标签的差距，构造损失函数，就可以定量的衡量模型的分类效果，进而进行后续的模型优化和评估。构造损失函数之后，我们的目标就是将损失函数的值最小化，使用梯度下降的方法求得损失函数对于
线性分类器--数据处理骆驼穿针眼计算机视觉与深度学习深度学习
数据集划分通常按照70%，20%，10%来分数据集数据处理斯坦福的线性分类器体验http://vision.stanford.edu/teaching/cs231n-demos/linear-classify/
【CS231n】－学习笔记-1-Intro to Computer Vision, historical context. Alice熹爱学习计算机视觉计算机视觉 CS231n DeepLearning PYTHON
Class:http://cs231n.stanford.eduSchedule:http://cs231n.stanford.edu/syllabus.htmlSlides:http://vision.stanford.edu/teaching/cs231n/slides/winter1516_lecture1.pdfVideo:https://www.youtube.com/watch?v=N
笔记00-杜克大学公开课,图像和视频处理:从火星到好莱坞木木爱吃糖醋鱼
笔记内容介绍》ImageandVideoProcessing:FromMarstoHollywoodwithaStopattheHospital算起来是2017年中的时候，因为要搞深度学习的东西，就自学了斯坦福cs231n的神经网络的课。Youtube上有至少两期的公开课视频。好像从李飞飞离职之后，截止到2017年春季，就没再继续了。现在想想哪门课的内容挺多挺繁杂的。虽然是本科的课，最后好像每个学
向量对向量求导，链式法则构建的乐趣向量对向量求导
这还算不得向量微积分里多么主干的内容，只是一个小技术，但是数学推导很多时候就会用到。http://cs231n.stanford.edu/vecDerivs.pdf这个文献是一个好文献。另优秀翻译：https://zhuanlan.zhihu.com/p/142668996链式法则注意：这里的乘法变成了innerproduct推导过程中比较关键的点：除了利用这文献所讲的分量慢慢推，还有一个要点，首
Win10上关于cs231n（2017）课后作业的环境配置 Diane小山
开始首先，这篇文章是针对那些想完成cs231n作业，但是觉得装linux双系统很麻烦的童鞋。cs231n作业的SetUp官方教程只针对了那些使用Unix(Ubuntu,Macos等)的人，对使用Windows的人十分不友好。安装anaconda百度一篇anaconda的安装教程，照着安装即可。这里需要提醒的有两点：国内的anaconda镜像能用的基本都挂了，所以还是老老实实去官方网站下载吧：）一定
CS231N assignment2 SVM weixin_30363509 数据结构与算法人工智能 python
CS231NAssignment2SupportVectorMachineBegin本文主要介绍CS231N系列课程的第一项作业，写一个SVM无监督学习训练模型。课程主页：网易云课堂CS231N系列课程语言：Python3.61线形分类器以图像为例，一幅图像像素为32*32*3代表长32宽32有3通道的衣服图像，将其变为1*3072的一个向量，即该图像的特征向量。我们如果需要训练1000幅图像，那
【AI】斯坦福CS231n课程练习（1）—— KNN和SVM分类李清焰 CS231n KNN SVM
文章目录一、前言1、CS231n是啥？2、本篇博客任务3、使用的数据集二、知识准备1、KNN是什么？2、SVM是什么？SVM的组成：三、实验——KNN和SVM分类1、KNN图片分类（重要步骤将在目录上体现）（1）在colab上切换目录，加载dataset（2）加载包、设置和外部模块（3）加载、初步处理数据（4）可视化打印一些图片看看我们的数据集长什么样（5）对测试、训练数据进行分组（6）创建KNN
深度学习系列之cs231n assignment1 KNN（二）明曦君深度学习 python 机器学习
写在前面：久经周折，终于能够将KNN系列给大家继续分享了，这次的内容来源于李飞飞教授团队的cs231n深度学习课程的作业1中的KNN研究，我会在全文我遇到困难的地方进行分享，以及一些想法。内容安排深度学习系列依托与cs231n的课程作业，因为只想练习编程，所以不对课程内容进行分享，仅针对编程内容进行分享。那么这一次的分享就是assignment1中K近邻分类器的使用，以及完成其中的四个问题，这四个
cs231n assignment2(3) 没天赋的学琴
assignment2的第三部分，是熟悉深度学习框架pytorch或者tensorflow，这里选择的是使用pytorch框架。该部分主要通过三个层次：Barebones、ModuleAPI、SequentialAPI，来了解pytorch。Barebones在该层次中，需要利用pytorch所提供的一些函数，不仅需要定义神经网络的结构，同时还需编写网络的前向传播以及模型的训练部分；而参数的梯度可
第三十三周学习笔记 luputo 学习笔记
第三十三周学习笔记CS231nDeepLearningSoftwareCPUvsGPUCPU:Fewercores,buteachcoreismuchfasterandmuchmorecapable;greatatsequentialtasksGPU:Morecores,buteachcoreismuchslowerand“dumber”;greatforparalleltasks（matrixm
CNN(卷积神经网络)、RNN(循环神经网络)、DNN，LSTM weixin_34174132 人工智能
http://cs231n.github.io/neural-networks-1https://arxiv.org/pdf/1603.07285.pdfhttps://adeshpande3.github.io/adeshpande3.github.io/A-Beginner's-Guide-To-Understanding-Convolutional-Neural-Networks/Appli
CNN笔记：通俗理解卷积神经网络 I_O_fly 神经网络 cnn 神经网络深度学习
通俗理解卷积神经网络（cs231n与5月dl班课程笔记）1前言2012年我在北京组织过8期machinelearning读书会，那时“机器学习”非常火，很多人都对其抱有巨大的热情。当我2013年再次来到北京时，有一个词似乎比“机器学习”更火，那就是“深度学习”。本博客内写过一些机器学习相关的文章，但上一篇技术文章“LDA主题模型”还是写于2014年11月份，毕竟自2015年开始创业做在线教育后，太
Knn算法与 Svm算法对比一个不知名的码农支持向量机算法机器学习
Knn算法与Svm算法对比这里首先借用一个博主所做的图表，讲的很有理有据(7条消息)[cs231n]KNN与SVM区别_Rookie’Program的博客-CSDN博客_knn和svm的区别这里我们来讲一下我对这两个算法的理解knn看起来就是比较简单的一个数学模型，就是划范围论，精细程度实际上可能没有svm好，并且测试量也不能大，数据一大，处理起来又很麻烦，预测效率也比较低。相反的svm和knn对
斯坦福大学CS520知识图谱系列课程学习笔记：第一讲什么是知识图谱 ngl567
随着知识图谱在人工智能各个领域的广泛使用，知识图谱受到越来越多AI研究人员的关注和学习，已经成为人工智能迈向认知系统的关键技术之一。之前，斯坦福大学的面向计算机视觉的CS231n和面向自然语言处理的CS224n成为了全球非常多AI研究人员的入门经典学习课程。因此，斯坦福大学于今年3月开设了一门专门面向知识图谱的系列课程CS520，官网课程页：https://web.stanford.edu/cla
北京邮电大学计算机视觉与深度学习鲁鹏计算机视觉概述课程手迹 qinyaoze 机器学习 CV手记计算机视觉人工智能深度学习
课程笔记计算机视觉=输入(认知神经科学-理论,运用方法&算法,硬件)+输出(机器人)课程：图像处理-CS131，图像结构-CS231a，图像理论-CS230/CS231nQ-象棋与人工智能的关系？IBM-深蓝，Google-AlphaGo>>机器赢得象棋胜利=强大的搜索算法目标：语义鸿沟，即建立图像像素核语义间的关系发展过程：系统出现-物种大繁荣>>理论研究-猫视觉神经>>积木世界>>MIT图像处
国外AI大牛推荐的10大最有帮助免费在线机器学习课程机器学习与系统
woman_ml.jpg本文编译自twitter用户chipro斯坦福在线自学课程《概率与统计》：该课程涉及概率统计的基本概念，涵盖机器学习4个基本方面：探索性数据分析，产生数据，概率和推理。MIT的《线性代数》：这是我见过的最好的线性代数课程，由传奇教授GilbertStrang（吉尔伯特斯特朗）教授。斯坦福的CS231N：用于视觉识别的卷积神经网络：平衡理论与实践。课堂笔记写得很好，解释了不同
CS231n学习笔记--计算机视觉历史回顾与介绍1 听城
CS231n简介首先我们来看看官方对这门课的介绍：计算机视觉在社会中已经逐渐普及，并广泛运用于搜索检索、图像理解、手机应用、地图导航、医疗制药、无人机和无人驾驶汽车等领域。而这些应用的核心技术就是图像分类、图像定位和图像探测等视觉识别任务。近期神经网络（也就是“深度学习”）方法上的进展极大地提升了这些代表当前发展水平的视觉识别系统的性能。本课程将深入讲解深度学习框架的细节问题，聚焦面向视觉识别任务
计算机视觉实战项目（图像分类+目标检测+目标跟踪+姿态识别+车道线识别+车牌识别）阿利同学计算机视觉分类目标检测
图像分类教程博客_传送门链接:链接在本教程中，您将学习如何使用迁移学习训练卷积神经网络以进行图像分类。您可以在cs231n上阅读有关迁移学习的更多信息。本文主要目的是教会你如何自己搭建分类模型，耐心看完，相信会有很大收获。废话不多说，直切主题…首先们要知道深度学习大都包含了下面几个方面：1.加载（处理）数据2.网络搭建3.损失函数（模型优化）4模型训练和保存把握好这些主要内容和流程，基本上对分类模
cs231n assignment2(2) 没天赋的学琴
assignment2的第二部分的内容，实现一个卷积神经网络。这一部分主要是实现卷积神经网络中的一些所需用到的layer类型：卷积层(convolution)和池化层(这里是实现max-pooling)。这部分的实现是不考虑其运行效率，而在真正的实现应用上，卷积神经网络的运行效率是一个很重要的问题。卷积层卷积层是由一个个过滤器(filter)，每个过滤器的尺寸为:，这里的的大小与输入的图像或act
cs231n作业：Assignment1-Softmax Diane小山
softmax.pydefsoftmax_loss_naive(W,X,y,reg):"""Softmaxlossfunction,naiveimplementation(withloops)InputshavedimensionD,thereareCclasses,andweoperateonminibatchesofNexamples.Inputs:-W:Anumpyarrayofshape(
Java开发中，spring mvc 的线程怎么调用？小麦麦子 spring mvc
今天逛知乎，看到最近很多人都在问spring mvc 的线程http://www.maiziedu.com/course/java/ 的启动问题，觉得挺有意思的，那哥们儿问的也听仔细，下面的回答也很详尽，分享出来，希望遇对遇到类似问题的Java开发程序猿有所帮助。问题：在用spring mvc架构的网站上，设一线程在虚拟机启动时运行，线程里有一全局
maven依赖范围 bitcarter maven
1.test 测试的时候才会依赖，编译和打包不依赖，如junit不被打包 2.compile 只有编译和打包时才会依赖 3.provided 编译和测试的时候依赖，打包不依赖，如：tomcat的一些公用jar包 4.runtime 运行时依赖，编译不依赖 5.默认compile 依赖范围compile是支持传递的，test不支持传递 1.传递的意思是项目A，引用
Jaxb org.xml.sax.saxparseexception : premature end of file darrenzhu xml premature JAXB
如果在使用JAXB把xml文件unmarshal成vo(XSD自动生成的vo)时碰到如下错误： org.xml.sax.saxparseexception : premature end of file 很有可能时你直接读取文件为inputstream，然后将inputstream作为构建unmarshal需要的source参数。InputSource inputSource = new In
CSS Specificity 周凡杨 html 权重 Specificity css
有时候对于页面元素设置了样式，可为什么页面的显示没有匹配上呢？ because specificity CSS 的选择符是有权重的，当不同的选择符的样式设置有冲突时，浏览器会采用权重高的选择符设置的样式。规则： HTML标签的权重是1 Class 的权重是10 Id 的权重是100
java与servlet g21121 servlet
servlet 搞java web开发的人一定不会陌生，而且大家还会时常用到它。下面是java官方网站上对servlet的介绍： java官网对于servlet的解释写道 Java Servlet Technology Overview Servlets are the Java platform technology of choice for extending and enha
eclipse中安装maven插件 510888780 eclipse maven
1.首先去官网下载 Maven： http://www.apache.org/dyn/closer.cgi/maven/binaries/apache-maven-3.2.3-bin.tar.gz 下载完成之后将其解压，我将解压后的文件夹：apache-maven-3.2.3，并将它放在 D:\tools目录下，即 maven 最终的路径是：D:\tools\apache-mave
jpa@OneToOne关联关系布衣凌宇 jpa
Nruser里的pruserid关联到Pruser的主键id，实现对一个表的增删改，另一个表的数据随之增删改。 Nruser实体类 //***************************************************************** @Entity @Table(name="nruser") @DynamicInsert @Dynam
我的spring学习笔记11-Spring中关于声明式事务的配置 aijuans spring 事务配置
这两天学到事务管理这一块，结合到之前的terasoluna框架，觉得书本上讲的还是简单阿。我就把我从书本上学到的再结合实际的项目以及网上看到的一些内容，对声明式事务管理做个整理吧。我看得Spring in Action第二版中只提到了用TransactionProxyFactoryBean和<tx:advice/>,定义注释驱动这三种，我承认后两种的内容很好，很强大。但是实际的项目当中
java 动态代理简单实现 antlove java handler proxy dynamic service
dynamicproxy.service.HelloService package dynamicproxy.service; public interface HelloService { public void sayHello(); } dynamicproxy.service.impl.HelloServiceImpl package dynamicp
JDBC连接数据库百合不是茶 JDBC编程 JAVA操作oracle数据库
如果我们要想连接oracle公司的数据库，就要首先下载oralce公司的驱动程序，将这个驱动程序的jar包导入到我们工程中; JDBC链接数据库的代码和固定写法; 1,加载oracle数据库的驱动; &nb
单例模式中的多线程分析 bijian1013 java thread 多线程 java多线程
谈到单例模式，我们立马会想到饿汉式和懒汉式加载，所谓饿汉式就是在创建类时就创建好了实例，懒汉式在获取实例时才去创建实例，即延迟加载。饿汉式： package com.bijian.study; public class Singleton { private Singleton() { } // 注意这是private 只供内部调用 private static
javascript读取和修改原型特别需要注意原型的读写不具有对等性 bijian1013 JavaScript prototype
对于从原型对象继承而来的成员，其读和写具有内在的不对等性。比如有一个对象A，假设它的原型对象是B，B的原型对象是null。如果我们需要读取A对象的name属性值，那么JS会优先在A中查找，如果找到了name属性那么就返回；如果A中没有name属性，那么就到原型B中查找name，如果找到了就返回；如果原型B中也没有
【持久化框架MyBatis3六】MyBatis3集成第三方DataSource bit1129 dataSource
MyBatis内置了数据源的支持，如： <environments default="development"> <environment id="development"> <transactionManager type="JDBC" /> <data
我程序中用到的urldecode和base64decode,MD5 bitcarter c MD5 base64decode urldecode
这里是base64decode和urldecode，Md5在附件中。因为我是在后台所以需要解码： string Base64Decode(const char* Data,int DataByte,int& OutByte) { //解码表 const char DecodeTable[] = { 0, 0, 0, 0, 0, 0
腾讯资深运维专家周小军：QQ与微信架构的惊天秘密 ronin47
社交领域一直是互联网创业的大热门，从PC到移动端，从OICQ、MSN到QQ。到了移动互联网时代，社交领域应用开始彻底爆发，直奔黄金期。腾讯在过去几年里，社交平台更是火到爆，QQ和微信坐拥几亿的粉丝，QQ空间和朋友圈各种刷屏，写心得，晒照片，秀视频，那么谁来为企鹅保驾护航呢？支撑QQ和微信海量数据背后的架构又有哪些惊天内幕呢？本期大讲堂的内容来自今年2月份ChinaUnix对腾讯社交网络运营服务中心
java-69-旋转数组的最小元素。把一个数组最开始的若干个元素搬到数组的末尾，我们称之为数组的旋转。输入一个排好序的数组的一个旋转，输出旋转数组的最小元素 bylijinnan java
public class MinOfShiftedArray { /** * Q69 旋转数组的最小元素 * 把一个数组最开始的若干个元素搬到数组的末尾，我们称之为数组的旋转。输入一个排好序的数组的一个旋转，输出旋转数组的最小元素。 * 例如数组{3, 4, 5, 1, 2}为{1, 2, 3, 4, 5}的一个旋转，该数组的最小值为1。 */ publ
看博客，应该是有方向的 Cb123456 反省看博客
看博客，应该是有方向的: 我现在就复习以前的，在补补以前不会的，现在还不会的，同时完善完善项目，也看看别人的博客. 我刚突然想到的: 1.应该看计算机组成原理，数据结构，一些算法，还有关于android,java的。 2.对于我，也快大四了，看一些职业规划的，以及一些学习的经验，看看别人的工作总结的. 为什么要写
[开源与商业]做开源项目的人生活上一定要朴素,尽量减少对官方和商业体系的依赖 comsci 开源项目
为什么这样说呢？因为科学和技术的发展有时候需要一个平缓和长期的积累过程，但是行政和商业体系本身充满各种不稳定性和不确定性，如果你希望长期从事某个科研项目，但是却又必须依赖于某种行政和商业体系，那其中的过程必定充满各种风险。。。所以，为避免这种不确定性风险，我
一个 sql优化（[精华] 一个查询优化的分析调整全过程！很值得一看） cwqcwqmax9 sql
见 http://www.itpub.net/forum.php?mod=viewthread&tid=239011 Web翻页优化实例提交时间: 2004-6-18 15:37:49 回复发消息环境： Linux ve
Hibernat and Ibatis dashuaifu Hibernate ibatis
Hibernate VS iBATIS 简介 Hibernate 是当前最流行的O/R mapping框架，当前版本是3.05。它出身于sf.net，现在已经成为Jboss的一部分了 iBATIS 是另外一种优秀的O/R mapping框架，当前版本是2.0。目前属于apache的一个子项目了。相对Hibernate“O/R”而言，iBATIS 是一种“Sql Mappi
备份MYSQL脚本 dcj3sjt126com mysql
#!/bin/sh # this shell to backup mysql #[email protected] (QQ:1413161683 DuChengJiu) _dbDir=/var/lib/mysql/ _today=`date +%w` _bakDir=/usr/backup/$_today [ ! -d $_bakDir ] && mkdir -p
iOS第三方开源库的吐槽和备忘 dcj3sjt126com ios
转自 ibireme的博客做iOS开发总会接触到一些第三方库，这里整理一下，做一些吐槽。目前比较活跃的社区仍旧是Github，除此以外也有一些不错的库散落在Google Code、SourceForge等地方。由于Github社区太过主流，这里主要介绍一下Github里面流行的iOS库。首先整理了一份 Github上排名靠
html wlwmanifest.xml eoems html xml
所谓优化wp_head()就是把从wp_head中移除不需要元素，同时也可以加快速度。步骤：加入到function.php remove_action('wp_head', 'wp_generator'); //wp-generator移除wordpress的版本号，本身blog的版本号没什么意义，但是如果让恶意玩家看到，可能会用官网公布的漏洞攻击blog remov
浅谈Java定时器发展 hacksin java 并发 timer 定时器
java在jdk1.3中推出了定时器类Timer,而后在jdk1.5后由Dou Lea从新开发出了支持多线程的ScheduleThreadPoolExecutor，从后者的表现来看，可以考虑完全替代Timer了。 Timer与ScheduleThreadPoolExecutor对比： 1. Timer始于jdk1.3,其原理是利用一个TimerTask数组当作队列
移动端页面侧边导航滑入效果 ini jquery Web html5 css javascirpt
效果体验：http://hovertree.com/texiao/mobile/2.htm可以使用移动设备浏览器查看效果。效果使用到jquery-2.1.4.min.js，该版本的jQuery库是用于支持HTML5的浏览器上，不再兼容IE8以前的浏览器，现在移动端浏览器一般都支持HTML5，所以使用该jQuery没问题。HTML文件代码： <!DOCTYPE html> <h
AspectJ+Javasist记录日志 kane_xie aspectj javasist
在项目中碰到这样一个需求，对一个服务类的每一个方法，在方法开始和结束的时候分别记录一条日志，内容包括方法名，参数名+参数值以及方法执行的时间。 @Override public String get(String key) { // long start = System.currentTimeMillis(); // System.out.println("Be
redis学习笔记 MJC410621 redis NoSQL
1)nosql数据库主要由以下特点：非关系型的、分布式的、开源的、水平可扩展的。 1，处理超大量的数据 2，运行在便宜的PC服务器集群上， 3，击碎了性能瓶颈。 1)对数据高并发读写。 2)对海量数据的高效率存储和访问。 3)对数据的高扩展性和高可用性。 redis支持的类型： Sring 类型 set name lijie get name lijie set na
使用redis实现分布式锁 qifeifei
在多节点的系统中，如何实现分布式锁机制，其中用redis来实现是很好的方法之一，我们先来看一下jedis包中，有个类名BinaryJedis,它有个方法如下： public Long setnx(final byte[] key, final byte[] value) { checkIsInMulti(); client.setnx(key, value); ret
BI并非万能，中层业务管理报表要另辟蹊径张老师的菜大数据 BI 商业智能信息化
BI是商业智能的缩写，是可以帮助企业做出明智的业务经营决策的工具，其数据来源于各个业务系统，如ERP、CRM、SCM、进销存、HER、OA等。 BI系统不同于传统的管理信息系统，他号称是一个整体应用的解决方案，是融入管理思想的强大系统：有着系统整体的设计思想，支持对所有
安装rvm后出现rvm not a function 或者ruby -v后提示没安装ruby的问题 wudixiaotie function
1.在~/.bashrc最后加入 [[ -s "$HOME/.rvm/scripts/rvm" ]] && source "$HOME/.rvm/scripts/rvm" 2.重新启动terminal输入： rvm use ruby-2.2.1 --default 把当前安装的ruby版本设为默