(Copied over from a Bilibili creator's tutorial, plus some notes of my own; for my own reference only)
Bilibili PyTorch video
Lesson 1
What is PyTorch?
PyTorch is a Python-based scientific computing library with the following features:
Similar to NumPy, but it can run on GPUs
It can be used to define deep learning models and to train and use them flexibly
Tensors
A Tensor is similar to NumPy's ndarray; the main difference is that a Tensor can be accelerated on a GPU.
Torch calls itself the NumPy of the neural network world, because it can put the tensors it creates on a GPU to accelerate computation (provided you have a suitable GPU), just as NumPy computes its arrays on the CPU. So for neural networks, Torch's tensor format is naturally the best fit, much like the tensors in TensorFlow.
Of course, we are still fond of NumPy, simply because we are so used to its interface. Torch knows this, and it is designed to interoperate well with NumPy: for example, you can freely convert between a numpy array and a torch tensor like this (all of the code below is for v1.0.1).
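A minimal preview of that conversion (my own addition; the "Converting between NumPy and Tensor" section below shows it in more detail):
import numpy as np
import torch
np_data = np.arange(6).reshape((2, 3))
torch_data = torch.from_numpy(np_data)   # numpy array -> torch tensor
back_to_np = torch_data.numpy()          # torch tensor -> numpy array
print(np_data, torch_data, back_to_np)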
from __future__ import print_function
import torch
# Construct an uninitialized 5x3 matrix:
x = torch.empty(5, 3)
print(x)
Construct a randomly initialized matrix:
x = torch.rand(5, 3)
print(x)
Construct a matrix of all zeros with dtype long:
x = torch.zeros(5, 3, dtype=torch.long)
print(x)
Construct a tensor directly from data:
x = torch.tensor([5.5, 3])
print(x)
You can also create a tensor based on an existing tensor. These methods reuse properties of the input tensor, e.g. its dtype, unless new values are provided.
x = x.new_ones(5, 3, dtype=torch.double) # new_* methods take in sizes
print(x)
x = torch.randn_like(x, dtype=torch.float) # override dtype!
print(x) # result has the same size
Get the tensor's shape:
print(x.size())
torch.Size([5, 3])
# Note
# ``torch.Size`` is in fact a tuple
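A quick check (my own addition) that torch.Size supports the usual tuple operations:
rows, cols = x.size()   # tuple unpacking works
print(rows, cols)       # 5 3
print(x.size()[0])      # indexing works too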
Operations
There are many kinds of tensor operations. Let's start with addition.
y = torch.rand(5, 3)
print(x + y)
Another syntax for addition:
print(torch.add(x, y))
Addition: providing an output tensor as an argument
result = torch.empty(5, 3)
torch.add(x, y, out=result)
print(result)
In-place addition
#adds x to y
y.add_(x)
print(y)
Note: any operation that mutates a tensor in-place is post-fixed with an _. For example, x.copy_(y) and x.t_() will change x.
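A tiny illustration (my own addition, not from the video) of in-place versus out-of-place operations:
a = torch.ones(2, 3)
b = torch.full((2, 3), 2.)
c = a.add(b)   # out-of-place: a is unchanged, the result goes into c
a.add_(b)      # in-place: a itself is modified
a.t_()         # in-place transpose: a now has shape (3, 2)
print(a, c)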
You can use standard NumPy-like indexing on PyTorch tensors.
print(x[:, 1])
Resizing: if you want to resize/reshape a tensor, you can use torch.view:
x = torch.randn(4, 4)
y = x.view(16)
z = x.view(-1, 8) # the size -1 is inferred from other dimensions
print(x.size(), y.size(), z.size())
torch.Size([4, 4]) torch.Size([16]) torch.Size([2, 8])
If you have a tensor with a single element, use .item() to get its value as a Python number.
x = torch.randn(1)
print(x)
print(x.item())
Further reading
Tensor operations of all kinds, including transposing, indexing, slicing, mathematical operations, linear algebra, and random numbers, are covered in the PyTorch documentation.
Converting between NumPy and Tensor
Converting between a Torch Tensor and a NumPy array is very easy.
The Torch Tensor and the NumPy array share their underlying memory, so changing one will also change the other.
Converting a Torch Tensor to a NumPy array
a = torch.ones(5)
print(a)
b = a.numpy()
print(b)
Change the Torch tensor in place and see how the values in the NumPy array change as well.
a.add_(1)
print(a)
print(b)
Converting a NumPy ndarray to a Torch Tensor
import numpy as np
a = np.ones(5)
b = torch.from_numpy(a)
np.add(a, 1, out=a)
print(a)
print(b)
All Tensors on the CPU support converting to NumPy and back.
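Note that this only applies to CPU tensors; a CUDA tensor has to be moved back to the CPU first. A small sketch (my own addition, assuming a GPU is available):
if torch.cuda.is_available():
    t = torch.ones(3, device="cuda")
    arr = t.cpu().numpy()   # calling .numpy() directly on a CUDA tensor raises an error
    print(arr)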
CUDA Tensors (only useful if you have a GPU to run tensor computation on)
Tensors can be moved onto any device using the .to method.
# let us run this cell only if CUDA is available
# We will use ``torch.device`` objects to move tensors in and out of GPU
if torch.cuda.is_available():
    device = torch.device("cuda")            # a CUDA device object
    y = torch.ones_like(x, device=device)    # directly create a tensor on GPU
    x = x.to(device)                         # or just use strings ``.to("cuda")``
    z = x + y
    print(z)
    print(z.to("cpu", torch.double))         # ``.to`` can also change dtype together!
Warm-up: a two-layer neural network in NumPy
A fully-connected ReLU network with a single hidden layer and no bias, trained to predict y from x using an L2 loss.
This implementation uses NumPy only to compute the forward pass, the loss, and the backward pass.
A NumPy ndarray is a generic n-dimensional array. It knows nothing about deep learning, gradients, or computation graphs; it is just a data structure for numerical computation.
import numpy as np
# N is batch size; D_in is input dimension;
# H is hidden dimension; D_out is output dimension.
N, D_in, H, D_out = 64, 1000, 100, 10
# Create random input and output data
x = np.random.randn(N, D_in)
y = np.random.randn(N, D_out)
# Randomly initialize weights
w1 = np.random.randn(D_in, H)
w2 = np.random.randn(H, D_out)
learning_rate = 1e-6
for t in range(500):
    # Forward pass: compute predicted y
    h = x.dot(w1)
    h_relu = np.maximum(h, 0)
    y_pred = h_relu.dot(w2)
    # Compute and print loss
    loss = np.square(y_pred - y).sum()
    print(t, loss)
    # Backprop to compute gradients of w1 and w2 with respect to loss
    # loss = (y_pred - y) ** 2
    grad_y_pred = 2.0 * (y_pred - y)
    grad_w2 = h_relu.T.dot(grad_y_pred)
    grad_h_relu = grad_y_pred.dot(w2.T)
    grad_h = grad_h_relu.copy()
    grad_h[h < 0] = 0
    grad_w1 = x.T.dot(grad_h)
    # Update weights
    w1 -= learning_rate * grad_w1
    w2 -= learning_rate * grad_w2
PyTorch: Tensors
This time we use PyTorch Tensors to run the forward pass, compute the loss, and do backpropagation.
A PyTorch Tensor is very much like a NumPy ndarray. The biggest difference is that a PyTorch Tensor can run on either the CPU or the GPU; to run on the GPU, you just cast the Tensor to a CUDA type.
import torch
dtype = torch.float
device = torch.device("cpu")
# device = torch.device("cuda:0") # Uncomment this to run on GPU
# N is batch size; D_in is input dimension;
# H is hidden dimension; D_out is output dimension.
N, D_in, H, D_out = 64, 1000, 100, 10
# Create random input and output data
x = torch.randn(N, D_in, device=device, dtype=dtype)
y = torch.randn(N, D_out, device=device, dtype=dtype)
# Randomly initialize weights
w1 = torch.randn(D_in, H, device=device, dtype=dtype)
w2 = torch.randn(H, D_out, device=device, dtype=dtype)
learning_rate = 1e-6
for t in range(500):
    # Forward pass: compute predicted y
    h = x.mm(w1)
    h_relu = h.clamp(min=0)
    y_pred = h_relu.mm(w2)
    # Compute and print loss
    loss = (y_pred - y).pow(2).sum().item()
    print(t, loss)
    # Backprop to compute gradients of w1 and w2 with respect to loss
    grad_y_pred = 2.0 * (y_pred - y)
    grad_w2 = h_relu.t().mm(grad_y_pred)
    grad_h_relu = grad_y_pred.mm(w2.t())
    grad_h = grad_h_relu.clone()
    grad_h[h < 0] = 0
    grad_w1 = x.t().mm(grad_h)
    # Update weights using gradient descent
    w1 -= learning_rate * grad_w1
    w2 -= learning_rate * grad_w2
A simple autograd (backpropagation) example:
# Create tensors.
x = torch.tensor(1., requires_grad=True)
w = torch.tensor(2., requires_grad=True)
b = torch.tensor(3., requires_grad=True)
# Build a computational graph.
y = w * x + b # y = 2 * x + 3
# Compute gradients.
y.backward()
# Print out the gradients.
print(x.grad) # x.grad = 2
print(w.grad) # w.grad = 1
print(b.grad) # b.grad = 1
PyTorch: Tensors and autograd
An important feature of PyTorch is autograd: once you have defined the forward pass and computed the loss, PyTorch can automatically compute the gradients of all model parameters.
A PyTorch Tensor represents a node in a computation graph. If x is a Tensor with x.requires_grad=True, then x.grad is another Tensor holding the gradient of x with respect to some scalar value (usually the loss).
import torch
dtype = torch.float
device = torch.device("cpu")
# device = torch.device("cuda:0") # Uncomment this to run on GPU
# N is batch size; D_in is input dimension;
# H is hidden dimension; D_out is output dimension.
N, D_in, H, D_out = 64, 1000, 100, 10
# Create random Tensors to hold the inputs and outputs.
# Setting requires_grad=False (the default) means we do not need gradients for these Tensors during the backward pass.
x = torch.randn(N, D_in, device=device, dtype=dtype)
y = torch.randn(N, D_out, device=device, dtype=dtype)
# Create random Tensors for the weights.
# Setting requires_grad=True means we want gradients for these Tensors to be computed during the backward pass.
w1 = torch.randn(D_in, H, device=device, dtype=dtype, requires_grad=True)
w2 = torch.randn(H, D_out, device=device, dtype=dtype, requires_grad=True)
learning_rate = 1e-6
for t in range(500):
    # Forward pass: predict y using operations on Tensors. This is exactly the
    # same forward pass as before, but we do not need to keep references to the
    # intermediate values, since we are not computing the backward pass by hand.
    y_pred = x.mm(w1).clamp(min=0).mm(w2)
    # Compute the loss using operations on Tensors.
    # loss is a Tensor of shape (1,);
    # loss.item() returns the loss as a Python scalar.
    loss = (y_pred - y).pow(2).sum()  # (this builds a computation graph)
    print(t, loss.item())
    # PyTorch's autograd handles the backward pass for us. For every Tensor with
    # requires_grad=True, backward() computes the gradient of the loss with
    # respect to it. Afterwards, w1.grad and w2.grad hold the gradients of the
    # loss with respect to w1 and w2.
    loss.backward()
    # We can do gradient descent by hand (an automatic method is introduced later).
    # Wrap the updates in torch.no_grad(): w1 and w2 have requires_grad=True,
    # but we do not want autograd to track the update step itself.
    # An alternative is to operate on weight.data and weight.grad.data, which
    # does not affect the gradients (see the sketch after this loop).
    # tensor.data gives a tensor that shares storage with the original tensor
    # but does not record history in the computation graph.
    with torch.no_grad():
        w1 -= learning_rate * w1.grad
        w2 -= learning_rate * w2.grad
        # Manually zero the gradients after updating the weights.
        # This is required: gradients accumulate, so without it the gradients
        # would keep growing (and w1 and w2 would blow up).
        w1.grad.zero_()
        w2.grad.zero_()
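As mentioned in the comments above, an alternative to torch.no_grad() is to update weight.data directly; a minimal sketch of that variant (the update step only, used inside the training loop in place of the torch.no_grad() block):
w1.data -= learning_rate * w1.grad.data
w2.data -= learning_rate * w2.grad.data
w1.grad.data.zero_()
w2.grad.data.zero_()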
PyTorch: nn
This time we use PyTorch's nn package to build the network. We still use PyTorch autograd to build the computation graph; PyTorch then computes the gradients for us automatically.
import torch
# N is batch size; D_in is input dimension;
# H is hidden dimension; D_out is output dimension.
N, D_in, H, D_out = 64, 1000, 100, 10
# Create random Tensors to hold inputs and outputs
x = torch.randn(N, D_in)
y = torch.randn(N, D_out)
# Use the nn package to define our model as a sequence of layers. nn.Sequential
# is a Module which contains other Modules, and applies them in sequence to
# produce its output. Each Linear Module computes output from input using a
# linear function, and holds internal Tensors for its weight and bias.
**# nn.Sequential is simply an ordered container: the modules passed to its constructor are added to the computation graph and executed in that order (an ordered dict of named modules can also be passed in; see the sketch after this example). Here it takes the place of the hand-written w1, w2, and activation function.**
model = torch.nn.Sequential(
    torch.nn.Linear(D_in, H),
    torch.nn.ReLU(),
    torch.nn.Linear(H, D_out),
)
# The nn package also contains definitions of popular loss functions; in this
# case we will use Mean Squared Error (MSE) as our loss function.
**# nn.MSELoss is a mean-squared-error loss function; with reduction='sum' it replaces the hand-written .pow(2).sum().**
loss_fn = torch.nn.MSELoss(reduction='sum')
learning_rate = 1e-4
for t in range(500):
    # Forward pass: compute predicted y by passing x to the model. Module objects
    # override the __call__ operator so you can call them like functions. When
    # doing so you pass a Tensor of input data to the Module and it produces
    # a Tensor of output data.
    y_pred = model(x)
    # Compute and print loss. We pass Tensors containing the predicted and true
    # values of y, and the loss function returns a Tensor containing the
    # loss.
    loss = loss_fn(y_pred, y)
    print(t, loss.item())
    # Zero the gradients before running the backward pass.
    # As before, this clears the gradients from the previous iteration; it is
    # required and must come before the backward pass.
    model.zero_grad()
    # Backward pass: compute gradient of the loss with respect to all the learnable
    # parameters of the model. Internally, the parameters of each Module are stored
    # in Tensors with requires_grad=True, so this call will compute gradients for
    # all learnable parameters in the model.
    loss.backward()
    # Update the weights using gradient descent. Each parameter is a Tensor with
    # a .grad attribute, so we can access its gradients like we did before.
    with torch.no_grad():
        # model.parameters() yields all the learnable parameters, e.g. the
        # weights and biases of the two Linear layers.
        for param in model.parameters():
            param -= learning_rate * param.grad
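As noted in the comment above, nn.Sequential can also take an OrderedDict of named modules instead of positional arguments; a small sketch (the layer names here are my own choice):
from collections import OrderedDict
model = torch.nn.Sequential(OrderedDict([
    ('linear1', torch.nn.Linear(D_in, H)),
    ('relu', torch.nn.ReLU()),
    ('linear2', torch.nn.Linear(H, D_out)),
]))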
PyTorch: optim, one step simpler again than the nn version; the further down we go, the more concise the code gets (the level of encapsulation keeps rising...)
This time we no longer update the model's weights by hand; instead we use the optim package to update the parameters for us. The optim package provides many different optimization methods, including SGD (stochastic gradient descent) + momentum, RMSProp, Adam, and so on. For a comparison of the different optimizers, see: optimizer comparison.
Every optimizer implements a step() method, which updates all the parameters. It can be used in two ways:
The first is the simplified form supported by most optimizers: once the gradients have been computed, e.g. by backward(), we simply call step().
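The second way (not covered further in these notes) passes a closure that re-evaluates the model and returns the loss; a few optimizers such as LBFGS need this. A rough sketch of both call styles, assuming model, loss_fn, x, and y are defined as in the code below:
# simplified form: call once the gradients have been computed
optimizer.step()
# closure form: the optimizer calls the closure itself (needed by e.g. LBFGS)
def closure():
    optimizer.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()
    return loss
optimizer.step(closure)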
import torch
# N is batch size; D_in is input dimension;
# H is hidden dimension; D_out is output dimension.
N, D_in, H, D_out = 64, 1000, 100, 10
# Create random Tensors to hold inputs and outputs
x = torch.randn(N, D_in)
y = torch.randn(N, D_out)
# Use the nn package to define our model and loss function.
# Define the model; this plays the role of the hand-written w1 and w2.
model = torch.nn.Sequential(
    torch.nn.Linear(D_in, H),
    torch.nn.ReLU(),
    torch.nn.Linear(H, D_out),
)
# If you change the learning rate to 1e-6, adding the following two lines to initialize the weights makes training converge much faster:
#torch.nn.init.normal_(model[0].weight)
#torch.nn.init.normal_(model[2].weight)
# Printing model gives:
# Sequential(
#   (0): Linear(in_features=1000, out_features=100, bias=True)
#   (1): ReLU()
#   (2): Linear(in_features=100, out_features=10, bias=True)
# )
loss_fn = torch.nn.MSELoss(reduction='sum')
# Use the optim package to define an Optimizer that will update the weights of
# the model for us. Here we will use Adam; the optim package contains many other
# optimization algorithms. The first argument to the Adam constructor tells the
# optimizer which Tensors it should update.
learning_rate = 1e-4
optimizer = torch.optim.Adam(model.parameters(), lr=learning_rate)  # the optimizer will update all the parameters
for t in range(500):
    # Forward pass: compute predicted y by passing x to the model.
    y_pred = model(x)
    # Compute and print loss.
    loss = loss_fn(y_pred, y)
    print(t, loss.item())
    # Before the backward pass, use the optimizer object to zero all of the
    # gradients for the variables it will update (which are the learnable
    # weights of the model). This is because by default, gradients are
    # accumulated in buffers (i.e., not overwritten) whenever .backward()
    # is called. Check out the docs of torch.autograd.backward for more details.
    # As before, clear the gradients first; this must come before backward().
    optimizer.zero_grad()
    # Backward pass: compute gradient of the loss with respect to model
    # parameters. The backward pass is what computes the gradients.
    loss.backward()
    # Calling the step function on an Optimizer makes an update to its
    # parameters. After the gradients are computed, take one step to update them.
    optimizer.step()
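To try one of the other optimizers mentioned above, only the constructor line changes; for example (the learning rates here are just illustrative):
optimizer = torch.optim.SGD(model.parameters(), lr=1e-4, momentum=0.9)   # SGD with momentum
optimizer = torch.optim.RMSprop(model.parameters(), lr=1e-4)             # RMSProp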
PyTorch: custom nn Modules
We can define our own model as a class that subclasses nn.Module. Whenever you need a model more complex than a plain Sequential, define it as an nn.Module subclass.
import torch
class TwoLayerNet(torch.nn.Module):
    def __init__(self, D_in, H, D_out):
        """
        In the constructor we instantiate two nn.Linear modules and assign them as
        member variables.
        """
        super(TwoLayerNet, self).__init__()
        self.linear1 = torch.nn.Linear(D_in, H)
        self.linear2 = torch.nn.Linear(H, D_out)
    def forward(self, x):
        """
        In the forward function we accept a Tensor of input data and we must return
        a Tensor of output data. We can use Modules defined in the constructor as
        well as arbitrary operators on Tensors.
        """
        h_relu = self.linear1(x).clamp(min=0)
        y_pred = self.linear2(h_relu)
        return y_pred
# N is batch size; D_in is input dimension;
# H is hidden dimension; D_out is output dimension.
N, D_in, H, D_out = 64, 1000, 100, 10
# Create random Tensors to hold inputs and outputs
x = torch.randn(N, D_in)
y = torch.randn(N, D_out)
# Construct our model by instantiating the class defined above
model = TwoLayerNet(D_in, H, D_out)
# Construct our loss function and an Optimizer. The call to model.parameters()
# in the SGD constructor will contain the learnable parameters of the two
# nn.Linear modules which are members of the model.
criterion = torch.nn.MSELoss(reduction='sum')
optimizer = torch.optim.SGD(model.parameters(), lr=1e-4)
for t in range(500):
    # Forward pass: Compute predicted y by passing x to the model
    y_pred = model(x)  # calling the model invokes forward()
    # Compute and print loss
    loss = criterion(y_pred, y)
    print(t, loss.item())
    # Zero gradients, perform a backward pass, and update the weights.
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()