Reference:
https://pytorch.org/tutorials/beginner/basics/intro.html
This section gives a quick-start example; see the corresponding chapters for details on each class.
torch.utils.data.Dataset
torch.utils.data.DataLoader
DataLoader wraps an iterable around the Dataset.
Converting a Dataset into a DataLoader:
train_dataloader = DataLoader(training_data, batch_size=batch_size)
A DataLoader behaves like an iterator; you can traverse its elements with a for loop:
for X, y in test_dataloader:
print("Shape of X [N, C, H, W]: ", X.shape)
print("Shape of y: ", y.shape, y.dtype)
break
Note:
# Get cpu or gpu device for training.
device = "cuda" if torch.cuda.is_available() else "cpu"
print("Using {} device".format(device))
# Define model
class NeuralNetwork(nn.Module):
def __init__(self):
super(NeuralNetwork, self).__init__()
self.flatten = nn.Flatten()
self.linear_relu_stack = nn.Sequential(
nn.Linear(28*28, 512),
nn.ReLU(),
nn.Linear(512, 512),
nn.ReLU(),
nn.Linear(512, 10),
nn.ReLU()
)
def forward(self, x):
x = self.flatten(x)
logits = self.linear_relu_stack(x)
return logits
model = NeuralNetwork().to(device)
print(model)
loss_fn = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)
Now we can train:
def train(dataloader, model, loss_fn, optimizer):
size = len(dataloader.dataset)
for batch, (X, y) in enumerate(dataloader):
X, y = X.to(device), y.to(device)
# Compute prediction error
pred = model(X)
loss = loss_fn(pred, y)
# Backpropagation
optimizer.zero_grad()
loss.backward()
optimizer.step()
if batch % 100 == 0:
loss, current = loss.item(), batch * len(X)
print(f"loss: {loss:>7f} [{current:>5d}/{size:>5d}]")
def test(dataloader, model):
    size = len(dataloader.dataset)
    num_batches = len(dataloader)
model.eval()
test_loss, correct = 0, 0
with torch.no_grad():
for X, y in dataloader:
X, y = X.to(device), y.to(device)
pred = model(X)
test_loss += loss_fn(pred, y).item()
correct += (pred.argmax(1) == y).type(torch.float).sum().item()
    test_loss /= num_batches
correct /= size
print(f"Test Error: \n Accuracy: {(100*correct):>0.1f}%, Avg loss: {test_loss:>8f} \n")
if __name__ == "__main__":
epochs = 5
for t in range(epochs):
print(f"Epoch {t + 1}\n-------------------------------")
train(train_dataloader, model, loss_fn, optimizer)
test(test_dataloader, model)
print("Done!")
Save
torch.save(model.state_dict(), "model.pth")
print("Saved PyTorch Model State to model.pth")
Load
import torch
from torch import nn
from torch.utils.data import DataLoader
# An example using the torchvision library
from torchvision import datasets
from torchvision.transforms import ToTensor, Lambda, Compose
import matplotlib.pyplot as plt
# 1.1 Load the data
# Download training data from open datasets.
training_data = datasets.FashionMNIST(
root="data",
train=True,
download=True,
transform=ToTensor(),
)
# Download test data from open datasets.
test_data = datasets.FashionMNIST(
root="data",
train=False,
download=True,
transform=ToTensor(),
)
class NeuralNetwork(nn.Module):
def __init__(self):
super(NeuralNetwork, self).__init__()
self.flatten = nn.Flatten()
self.linear_relu_stack = nn.Sequential(
nn.Linear(28*28, 512),
nn.ReLU(),
nn.Linear(512, 512),
nn.ReLU(),
nn.Linear(512, 10),
nn.ReLU()
)
def forward(self, x):
x = self.flatten(x)
logits = self.linear_relu_stack(x)
return logits
model = NeuralNetwork()
model.load_state_dict(torch.load("model.pth"))
classes = [
"T-shirt/top",
"Trouser",
"Pullover",
"Dress",
"Coat",
"Sandal",
"Shirt",
"Sneaker",
"Bag",
"Ankle boot",
]
model.eval()
x, y = test_data[0][0], test_data[0][1]
with torch.no_grad():
pred = model(x)
predicted, actual = classes[pred[0].argmax(0)], classes[y]
print(f'Predicted: "{predicted}", Actual: "{actual}"')
Directly from data (list)
The data type is automatically inferred.
data = [[1, 2],[3, 4]]
x_data = torch.tensor(data)
From a NumPy array
import numpy as np
np_array = np.array(data)
x_np = torch.from_numpy(np_array)
From another tensor
# Create a ones tensor with the same shape as x_data
x_ones = torch.ones_like(x_data) # retains the properties of x_data
print(f"Ones Tensor: \n {x_ones} \n")
# Create a random float tensor with the same shape as x_data.
x_rand = torch.rand_like(x_data, dtype=torch.float) # overrides the datatype of x_data
print(f"Random Tensor: \n {x_rand} \n")
With random or constant values:
shape = (2,3,)
rand_tensor = torch.rand(shape)
ones_tensor = torch.ones(shape)
zeros_tensor = torch.zeros(shape)
print(f"Random Tensor: \n {rand_tensor} \n")
print(f"Ones Tensor: \n {ones_tensor} \n")
print(f"Zeros Tensor: \n {zeros_tensor}")
tensor = torch.rand(3,4)
print(f"Shape of tensor: {tensor.shape}")
print(f"Datatype of tensor: {tensor.dtype}")
print(f"Device tensor is stored on: {tensor.device}")
All Tensor operations:
Standard numpy-like indexing and slicing:
tensor = torch.ones(4, 4)
print('First row: ', tensor[0])
print('First column: ', tensor[:, 0])
print('Last column:', tensor[..., -1])
# Set the second column to zero.
tensor[:,1] = 0
print(tensor)
Joining tensors (concatenation)
t1 = torch.cat([tensor, tensor, tensor], dim=1)
Before:
torch.Size([4, 4])
After:
torch.Size([4, 12]) # concatenated along dim=1
torch.cat copies the data: changing tensor afterwards does not change the values in t1.
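By contrast, torch.stack joins tensors along a new dimension; a quick sketch:
t2 = torch.stack([tensor, tensor, tensor], dim=0)
print(t2.shape) # torch.Size([3, 4, 4]): a new leading dimension is created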
Arithmetic operations
Matrix multiplication:
y1 = tensor @ tensor.T
y2 = tensor.matmul(tensor.T)
# or pre-allocate an output tensor
y3 = torch.rand_like(tensor)
torch.matmul(tensor, tensor.T, out=y3)
Element-wise multiplication, with NumPy-style broadcasting,
e.g. [[1,1],[1,1]] * [1,2] = [[1,2],[1,2]]:
z1 = tensor * tensor
z2 = tensor.mul(tensor)
z3 = torch.rand_like(tensor)
torch.mul(tensor, tensor, out=z3)
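A quick check of the broadcasting claim above:
a = torch.tensor([[1, 1], [1, 1]])
b = torch.tensor([1, 2])
print(a * b) # tensor([[1, 2], [1, 2]]): b is broadcast across the rows of a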
Single-element tensors: use .item() to extract the value as a plain Python number.
.shape: torch.Size([])
.item(): 16.0
For example, tensor.sum() returns the sum of all elements in tensor.
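A concrete check of the values above:
agg = torch.ones(4, 4).sum()
print(agg.shape) # torch.Size([]), a zero-dimensional tensor
print(agg.item()) # 16.0, a Python float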
In-place operations: the method name ends with an underscore (_), e.g.
tensor.add_(5)
adds 5 to every element of tensor in place.
Tensor to NumPy array: t.numpy()
The two share the same underlying memory.
t = torch.ones(5)
n = t.numpy()
n[0] = 100
t.add_(1)
print(f"t: {t}")
print(f"n: {n}")
NumPy array to Tensor: torch.from_numpy()
The two share the same underlying memory.
n = np.ones(5)
t = torch.from_numpy(n)
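As in the other direction, a change to the array shows up in the tensor:
np.add(n, 1, out=n) # modify the NumPy array in place
print(f"t: {t}") # t: tensor([2., 2., 2., 2., 2.], dtype=torch.float64)
print(f"n: {n}") # n: [2. 2. 2. 2. 2.]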
Some built-in Datasets:
Image Datasets
Text Datasets
Audio Datasets
As an example, load the FashionMNIST dataset from torchvision:
import torch
from torch.utils.data import Dataset
from torchvision import datasets
from torchvision.transforms import ToTensor, Lambda
training_data = datasets.FashionMNIST(
root="data", # 保存在当前目录下的data文件夹内
train=True, # 训练集还是测试集
download=True, # 是否当场下载
transform=ToTensor() # specify the feature and label transformations
)
test_data = datasets.FashionMNIST(
root="data",
train=False,
download=True,
transform=ToTensor()
)
X, y = training_data[index]
training_data[index] gives the index-th sample.
labels_map = {
0: "T-Shirt",
1: "Trouser",
2: "Pullover",
3: "Dress",
4: "Coat",
5: "Sandal",
6: "Shirt",
7: "Sneaker",
8: "Bag",
9: "Ankle Boot",
}
figure = plt.figure(figsize=(8, 8))
cols, rows = 3, 3
for i in range(1, cols * rows + 1):
sample_idx = torch.randint(len(training_data), size=(1,)).item()
img, label = training_data[sample_idx]
figure.add_subplot(rows, cols, i)
plt.title(labels_map[label])
plt.axis("off")
plt.imshow(img.squeeze(), cmap="gray")
plt.show()
A custom Dataset class must implement three functions:
__init__: initializes the labels, directory, transform (type conversion applied to X) and target_transform (type conversion applied to y).
__len__: returns the number of samples in our dataset.
__getitem__: loads and returns the sample at the given index.
import os
import pandas as pd
from torchvision.io import read_image
class CustomImageDataset(Dataset):
def __init__(self, annotations_file, img_dir, transform=None, target_transform=None):
self.img_labels = pd.read_csv(annotations_file)
self.img_dir = img_dir
self.transform = transform
self.target_transform = target_transform
def __len__(self):
return len(self.img_labels)
def __getitem__(self, idx):
img_path = os.path.join(self.img_dir, self.img_labels.iloc[idx, 0])
image = read_image(img_path)
label = self.img_labels.iloc[idx, 1]
if self.transform:
image = self.transform(image)
if self.target_transform:
label = self.target_transform(label)
sample = {"image": image, "label": label}
return sample
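A usage sketch, assuming a hypothetical annotations.csv (first column: image filename, second column: label) and that all images share the same size:
from torch.utils.data import DataLoader
dataset = CustomImageDataset("annotations.csv", "images/") # hypothetical paths
loader = DataLoader(dataset, batch_size=4, shuffle=True)
batch = next(iter(loader)) # the default collate function stacks the dict fields
print(batch["image"].shape, batch["label"])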
The key part:
from torch.utils.data import DataLoader
train_dataloader = DataLoader(training_data, batch_size=64, shuffle=True)
for X, y in train_dataloader:
...
from torch.utils.data import DataLoader
train_dataloader = DataLoader(training_data, batch_size=64, shuffle=True)
test_dataloader = DataLoader(test_data, batch_size=64, shuffle=True)
# Display image and label.
train_features, train_labels = next(iter(train_dataloader)) # fetch the next element from the iterator
print(f"Feature batch shape: {train_features.size()}")
print(f"Labels batch shape: {train_labels.size()}")
img = train_features[0].squeeze()
label = train_labels[0]
plt.imshow(img, cmap="gray")
plt.show()
print(f"Label: {label}")
transform: to modify the features.
target_transform: to modify the labels.
from torchvision import datasets
from torchvision.transforms import ToTensor, Lambda
ds = datasets.FashionMNIST(
root="data",
train=True,
download=True,
transform=ToTensor(),
target_transform=Lambda(lambda y: torch.zeros(10, dtype=torch.float).scatter_(0, torch.tensor(y), value=1)) # convert the integer label into a one-hot vector
)
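A quick check of the label transform:
img, label = ds[0]
print(label) # e.g. tensor([0., 0., 0., 0., 0., 0., 0., 0., 0., 1.]) for class 9
print(label.shape) # torch.Size([10])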
Transforms can be chained with Compose (requires from torchvision import transforms):
transform = transforms.Compose(
[transforms.ToTensor(),
transforms.Normalize((0.5,), (0.5,))])
The torch.nn namespace provides all the building blocks you need to build your own neural network; every module subclasses nn.Module.
device = 'cuda' if torch.cuda.is_available() else 'cpu'
print('Using {} device'.format(device))
nn.Flatten()
Keeps the batch dimension: [a, b, c, d] -> [a, b*c*d].
nn.Linear(in_features=28*28, out_features=20)
Applies a linear transformation from in_features to out_features, keeping the batch dimension: [a, 784] -> [a, 20].
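A shape check for these two layers:
x = torch.rand(32, 1, 28, 28) # a batch of 32 grayscale images
flat = nn.Flatten()(x) # the batch dimension is kept
print(flat.shape) # torch.Size([32, 784])
out = nn.Linear(in_features=28*28, out_features=20)(flat)
print(out.shape) # torch.Size([32, 20])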
torch.nn.Conv2d(in_channels, out_channels, kernel_size): in_channels is the number of input channels (1 for a grayscale image).
torch.nn.Conv2d(in_channels, out_channels, kernel_size, stride=1, padding=0, dilation=1, groups=1, bias=True, padding_mode='zeros')
torch.nn.MaxPool2d(kernel_size, stride=None)
torch.nn.MaxPool2d(kernel_size, stride=None, padding=0, dilation=1, return_indices=False, ceil_mode=False)
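A shape sketch for these two layers on a grayscale batch:
x = torch.rand(1, 1, 28, 28) # [N, C, H, W]
conv = torch.nn.Conv2d(in_channels=1, out_channels=6, kernel_size=5)
print(conv(x).shape) # torch.Size([1, 6, 24, 24]), since 28 - 5 + 1 = 24
pool = torch.nn.MaxPool2d(kernel_size=2) # stride defaults to kernel_size
print(pool(conv(x)).shape) # torch.Size([1, 6, 12, 12])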
nn.ReLU(), nn.Sigmoid()
nn.Softmax(dim=1): dim=1 means the softmax is computed along dimension 1 of the output.
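For example, each row sums to 1 after softmax over dim=1:
logits = torch.rand(2, 10)
probs = nn.Softmax(dim=1)(logits)
print(probs.sum(dim=1)) # tensor([1.0000, 1.0000])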
nn.Sequential
flatten = nn.Flatten()
layer1 = nn.Linear(28*28, 20)
seq_modules = nn.Sequential(
flatten,
layer1,
nn.ReLU(),
nn.Linear(20, 10)
)
input_image = torch.rand(3,28,28)
logits = seq_modules(input_image)
# Print the parameters of each layer of the model
print("Model structure: ", model, "\n\n")
# model.named_parameters() yields each layer's name and its parameter tensor.
for name, param in model.named_parameters():
print(f"Layer: {name} | Size: {param.size()} | Values : {param[:2]} \n")
As an example, we build the following computational graph:
import torch
x = torch.ones(5) # [1, 1, 1, 1, 1]; think of it as a (1, 5) row vector
y = torch.zeros(3) # [0, 0, 0]
w = torch.randn(5,3, requires_grad=True)
# The line above is equivalent to:
# w = torch.randn(5,3)
# w.requires_grad_(True)
b = torch.randn(3, requires_grad=True)
z = torch.matmul(x, w) + b
loss = torch.nn.functional.binary_cross_entropy_with_logits(z, y) # cross entropy between sigmoid(z) and y
# Compute the gradients
loss.backward()
print(w.grad)
print(b.grad)
Note: by default the computational graph is freed after .backward(); to backpropagate through the same graph again, pass retain_graph=True:
loss.backward(retain_graph=True)
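A minimal sketch reusing the tensors above:
z = torch.matmul(x, w) + b
loss = torch.nn.functional.binary_cross_entropy_with_logits(z, y)
loss.backward(retain_graph=True) # the graph is kept alive
loss.backward() # second call succeeds; gradients accumulate into w.grad and b.grad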
Method 1: with torch.no_grad():
z = torch.matmul(x, w)+b
print(z.requires_grad)
with torch.no_grad():
z = torch.matmul(x, w)+b
print(z.requires_grad)
Method 2: z_det = z.detach()
z = torch.matmul(x, w)+b
z_det = z.detach()
print(z_det.requires_grad)
Forward pass: autograd does two things at once: it runs the requested operation to compute the resulting tensor, and it records the operation's gradient function in the DAG.
backward pass
DAGs are dynamic in PyTorch (dynamic computational graphs)
An important thing to note is that the graph is recreated from scratch; after each .backward() call, autograd starts populating a new graph. This is exactly what allows you to use control flow statements in your model; you can change the shape, size and operations at every iteration if needed.
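A small sketch of such control flow (a hypothetical module, purely to illustrate):
class DynamicNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(4, 4)
    def forward(self, x):
        # The loop count depends on the data itself; autograd records
        # a fresh graph with that many steps on every forward pass.
        for _ in range(int(x.sum().abs().item()) % 3 + 1):
            x = torch.relu(self.linear(x))
        return x
out = DynamicNet()(torch.rand(2, 4))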
Regression
nn.MSELoss()
Classification
nn.NLLLoss (Negative Log Likelihood)
nn.CrossEntropyLoss: combines nn.LogSoftmax and nn.NLLLoss.
torch.nn.CrossEntropyLoss(weight=None, size_average=None, ignore_index=-100, reduce=None, reduction='mean')
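A quick numerical check that nn.CrossEntropyLoss matches nn.LogSoftmax followed by nn.NLLLoss:
logits = torch.randn(4, 10) # batch of 4 samples, 10 classes
target = torch.tensor([1, 0, 3, 9])
ce = nn.CrossEntropyLoss()(logits, target)
nll = nn.NLLLoss()(nn.LogSoftmax(dim=1)(logits), target)
print(torch.allclose(ce, nll)) # True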
Various Optimizers
An example with SGD; we pass in the model parameters registered earlier as the parameters to train.
optimizer = torch.optim.SGD(model.parameters(), lr=learning_rate)
# Zero the gradients of all parameter tensors
optimizer.zero_grad()
# Backpropagate to compute the gradient on each tensor
loss.backward()
# Update the parameters using the optimizer and the computed gradients
optimizer.step()
import torch
from torch import nn
from torch.utils.data import DataLoader
from torchvision import datasets
from torchvision.transforms import ToTensor, Lambda
training_data = datasets.FashionMNIST(
root="data",
train=True,
download=True,
transform=ToTensor()
)
test_data = datasets.FashionMNIST(
root="data",
train=False,
download=True,
transform=ToTensor()
)
train_dataloader = DataLoader(training_data, batch_size=64)
test_dataloader = DataLoader(test_data, batch_size=64)
class NeuralNetwork(nn.Module):
def __init__(self):
super(NeuralNetwork, self).__init__()
self.flatten = nn.Flatten()
self.linear_relu_stack = nn.Sequential(
nn.Linear(28*28, 512),
nn.ReLU(),
nn.Linear(512, 512),
nn.ReLU(),
nn.Linear(512, 10),
nn.ReLU()
)
def forward(self, x):
x = self.flatten(x)
logits = self.linear_relu_stack(x)
return logits
model = NeuralNetwork()
# Hyperparameters
learning_rate = 1e-3
batch_size = 64
epochs = 5
def train_loop(dataloader, model, loss_fn, optimizer):
size = len(dataloader.dataset)
for batch, (X, y) in enumerate(dataloader):
# Compute prediction and loss
pred = model(X)
loss = loss_fn(pred, y)
# Backpropagation
optimizer.zero_grad()
loss.backward()
optimizer.step()
if batch % 100 == 0:
loss, current = loss.item(), batch * len(X)
print(f"loss: {loss:>7f} [{current:>5d}/{size:>5d}]")
def test_loop(dataloader, model, loss_fn):
    size = len(dataloader.dataset)
    num_batches = len(dataloader)
test_loss, correct = 0, 0
with torch.no_grad():
for X, y in dataloader:
pred = model(X)
test_loss += loss_fn(pred, y).item()
correct += (pred.argmax(1) == y).type(torch.float).sum().item()
    test_loss /= num_batches
correct /= size
print(f"Test Error: \n Accuracy: {(100*correct):>0.1f}%, Avg loss: {test_loss:>8f} \n")
loss_fn = nn.CrossEntropyLoss()
optimizer = torch.optim.SGD(model.parameters(), lr=learning_rate)
epochs = 10
for t in range(epochs):
print(f"Epoch {t+1}\n-------------------------------")
train_loop(train_dataloader, model, loss_fn, optimizer)
test_loop(test_dataloader, model, loss_fn)
print("Done!")
# Save
from torchvision import models
model = models.vgg16(pretrained=True)
torch.save(model.state_dict(), 'model_weights.pth')
# Load
# First recreate the corresponding network structure.
model = models.vgg16() # we do not specify pretrained=True, i.e. do not load default weights
model.load_state_dict(torch.load('model_weights.pth'))
model.eval()
⚠️: always call model.eval() before inference, otherwise the results will be inconsistent with those obtained before saving! (It puts the dropout and batch normalization layers into evaluation mode.)
With the previous approach we had to re-instantiate the model (model = models.vgg16()); with the approach below we no longer need to.
torch.save(model, 'model.pth')
model = torch.load('model.pth')
When loading with this approach, the class definition of the model's network must be importable.
This approach uses Python pickle module when serializing the model, thus it relies on the actual class definition to be available when loading the model.
PyTorch also has native ONNX export support. However, given the dynamic nature of the PyTorch execution graph, the export process must traverse the execution graph to produce a persisted ONNX model. For this reason, a test variable of the appropriate size should be passed to the exporter (in our case, we create a dummy zero tensor of the correct size).
input_image = torch.zeros((1,3,224,224))
torch.onnx.export(model, input_image, 'model.onnx')
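The exported file can then be run with, for example, the onnxruntime package (an assumption on my part; it is not used in this tutorial):
import onnxruntime as ort
session = ort.InferenceSession('model.onnx')
input_name = session.get_inputs()[0].name
outputs = session.run(None, {input_name: input_image.numpy()}) # feed the dummy tensor as a NumPy array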
1. TensorBoard setup
If a missing-module error is reported, install it with pip.
from torch.utils.tensorboard import SummaryWriter
# default `log_dir` is "runs" - we'll be more specific here
# Set the folder where logs are stored
writer = SummaryWriter('runs/fashion_mnist_experiment_1')
2. Writing to TensorBoard
# get some random training images
dataiter = iter(trainloader)
images, labels = next(dataiter) # dataiter.next() was removed in newer PyTorch versions
# create grid of images
img_grid = torchvision.utils.make_grid(images)
# show images
matplotlib_imshow(img_grid, one_channel=True)
# write to tensorboard
writer.add_image('four_fashion_mnist_images', img_grid)
3. Inspect the model using TensorBoard
writer.add_graph(net, images) # pass in a batch of data and the network instance
writer.close()
Then, from the command line, go to the folder containing runs and run:
tensorboard --logdir=runs
Then open the following address in a browser:
http://localhost:6006/
4. Adding a “Projector” to TensorBoard
# helper function
def select_n_random(data, labels, n=100):
'''
Selects n random datapoints and their corresponding labels from a dataset
'''
assert len(data) == len(labels)
perm = torch.randperm(len(data))
return data[perm][:n], labels[perm][:n]
# select random images and their target indices
images, labels = select_n_random(trainset.data, trainset.targets)
# get the class labels for each image
class_labels = [classes[lab] for lab in labels]
# log embeddings
features = images.view(-1, 28 * 28) # [n_samples, feature-vector dimension]
# Maps the high-dimensional space to a low-dimensional one; just feed in the data
writer.add_embedding(features,
metadata=class_labels,
label_img=images.unsqueeze(1))
writer.close()
5. Tracking model training with TensorBoard
# helper functions
def images_to_probs(net, images):
'''
Generates predictions and corresponding probabilities from a trained
network and a list of images
'''
output = net(images) # [SampleSize,10]
# convert output probabilities to predicted class
_, preds_tensor = torch.max(output, 1)
preds = np.squeeze(preds_tensor.numpy())
return preds, [F.softmax(el, dim=0)[i].item() for i, el in zip(preds, output)]
def plot_classes_preds(net, images, labels):
'''
Generates matplotlib Figure using a trained network, along with images
and labels from a batch, that shows the network's top prediction along
with its probability, alongside the actual label, coloring this
information based on whether the prediction was correct or not.
Uses the "images_to_probs" function.
'''
preds, probs = images_to_probs(net, images)
# plot the images in the batch, along with predicted and true labels
fig = plt.figure(figsize=(12, 48))
for idx in np.arange(4):
ax = fig.add_subplot(1, 4, idx+1, xticks=[], yticks=[])
matplotlib_imshow(images[idx], one_channel=True)
ax.set_title("{0}, {1:.1f}%\n(label: {2})".format(
classes[preds[idx]],
probs[idx] * 100.0,
classes[labels[idx]]),
color=("green" if preds[idx]==labels[idx].item() else "red"))
return fig
running_loss = 0.0
for epoch in range(1): # loop over the dataset multiple times
for i, data in enumerate(trainloader, 0):
# The usual training procedure
# get the inputs; data is a list of [inputs, labels]
inputs, labels = data
# zero the parameter gradients
optimizer.zero_grad()
# forward + backward + optimize
outputs = net(inputs)
loss = criterion(outputs, labels)
loss.backward()
optimizer.step()
running_loss += loss.item()
if i % 1000 == 999: # every 1000 mini-batches...
# ...log the running loss, writing it to TensorBoard
writer.add_scalar('training loss',
running_loss / 1000,
epoch * len(trainloader) + i)
# ...log a Matplotlib Figure showing the model's predictions on a
# random mini-batch
writer.add_figure('predictions vs. actuals',
plot_classes_preds(net, inputs, labels),
global_step=epoch * len(trainloader) + i)
running_loss = 0.0
print('Finished Training')
6. Assessing trained models with TensorBoard: adding a Precision/Recall curve
writer.add_pr_curve(classes[class_index],
tensorboard_truth, # list of booleans: True if the sample's true class is this one
tensorboard_probs, # list of probabilities: the predicted probability of this class
global_step=global_step)
# 1. gets the probability predictions in a test_size x num_classes Tensor
# 2. gets the preds in a test_size Tensor
# takes ~10 seconds to run
class_probs = []
class_label = []
with torch.no_grad():
for data in testloader:
images, labels = data
output = net(images)
class_probs_batch = [F.softmax(el, dim=0) for el in output]
class_probs.append(class_probs_batch)
class_label.append(labels)
test_probs = torch.cat([torch.stack(batch) for batch in class_probs])
test_label = torch.cat(class_label)
# helper function
def add_pr_curve_tensorboard(class_index, test_probs, test_label, global_step=0):
'''
Takes in a "class_index" from 0 to 9 and plots the corresponding
precision-recall curve
'''
tensorboard_truth = test_label == class_index # boolean mask: True where the true label equals class_index
tensorboard_probs = test_probs[:, class_index] # each sample's predicted probability for this class
writer.add_pr_curve(classes[class_index],
tensorboard_truth,
tensorboard_probs,
global_step=global_step)
writer.close()
# plot all the pr curves
for i in range(len(classes)):
add_pr_curve_tensorboard(i, test_probs, test_label) # draw a PR curve for every class
My guess at the implementation: for each sample whose true class is the given one, look at the probability with which it is predicted as that class; TP, FP, TN and FN can then be counted at every probability threshold.
Tensor.squeeze()
Removes all dimensions of size 1, e.g. [1,2,3,1,4,1] -> [2,3,4]
Change the value at one position along a dimension:
y.scatter_(dim=0, index=torch.tensor(1), value=2), e.g. [0,0,0] -> [0,2,0]
Permute dimensions:
np.transpose(tmp, (3,2,0,1)), e.g. (3, 2, 1, 1) -> (1, 1, 3, 2)
From an integer label to a one-hot vector:
y = torch.tensor([1]) # torch.Size([1])
torch.zeros(10, dtype=torch.float).scatter_(0, y, value=1)
torch.max()
value, index = torch.max(output, 1) # the maximum of output along dim 1 and the corresponding index
preds = np.squeeze(index.numpy())
In short:
BCEWithLogitsLoss = Sigmoid + BCELoss
A test:
input = torch.randn(3,3)
target = torch.FloatTensor([[0,1,1],
[1,1,0],
[0,1,1]])
m = nn.Sigmoid()
loss = nn.BCELoss()
print(loss(m(input),target)) # tensor(0.8303)
loss = nn.BCEWithLogitsLoss()
print(loss(input,target)) # tensor(0.8303)
Manual verification:
In our example:
input = tensor([[-0.8477, 0.0327, -0.0345],
[ 0.0830, -0.6805, -0.6124],
[ 1.6842, -0.4261, -0.1475]])
m(input) = tensor([[0.2999, 0.5082, 0.4914],
[0.5207, 0.3362, 0.3515],
[0.8435, 0.3951, 0.4632]])
target = tensor([[0., 1., 1.],
[1., 1., 0.],
[0., 1., 1.]])
$BCE = -\frac{1}{n}\sum_n \big(y_n \ln(x_n) + (1-y_n)\ln(1-x_n)\big)$
# tensor(0.8303)
print(-torch.mean(torch.log(1-m(input).view(1,-1))*(1-target.view(1,-1))+torch.log(m(input).view(1,-1))*(target.view(1,-1))))