PyTorch Basics

Table of Contents

  • Tensor Initialization
    • From a Python List
    • From a NumPy Array
    • From a Tensor
    • By Specifying a Shape
    • With torch.arange
  • Tensor Properties
    • Data type
    • Shape
    • Contiguous
    • Unsqueeze and squeeze
    • Device
  • Tensor Indexing
  • Operations
    • Broadcast
    • Statistics
    • cat
  • Autograd
    • An Example
  • Neural Network Module
    • Module Layers
    • Activation Function Layer
    • Putting the Layers Together
    • Custom Modules
  • Optimization
  • Demo: Word Window Classification
    • Data
    • Preprocessing
    • Converting Words to Embeddings
    • Batching Sentences
    • Model
    • Training
    • Prediction


Tensor Initialization

From a Python List

import torch

data = [
        [0,1],
        [2,3],
        [4,5]
        ]
data_tensor = torch.tensor(data)
data_tensor
# tensor([[0, 1],
#         [2, 3],
#         [4, 5]])

data_tensor_float = torch.tensor(data, dtype = torch.float)
data_tensor_float
# tensor([[0., 1.],
#         [2., 3.],
#         [4., 5.]])

data_tensor_bool = torch.tensor(data, dtype = torch.bool)
data_tensor_bool
# tensor([[False,  True],
#         [ True,  True],
#         [ True,  True]])
data_tensor.float()
# tensor([[0., 1.],
#         [2., 3.],
#         [4., 5.]])
# torch.float32

torch.tensor is a factory function; you can also initialize with the tensor class constructors directly:
torch.FloatTensor()
torch.Tensor()  (defaults to float type)
torch.LongTensor()

data_tensor_float = torch.Tensor(data)
data_tensor_float.dtype
# torch.float32
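
The other class constructors fix the dtype the same way; a quick check (a minimal sketch, reusing the data list from above):

data_tensor_long = torch.LongTensor(data)
data_tensor_long.dtype
# torch.int64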

From a NumPy Array

import numpy as np
ndarray = np.array(data)
x_numpy = torch.from_numpy(ndarray)
x_numpy
# tensor([[0, 1],
#         [2, 3],
#         [4, 5]], dtype=torch.int32)

From a Tensor

x = torch.tensor([[1.,2.],[3.,4.]])
x
# tensor([[1., 2.],
#         [3., 4.]])
x_zeros = torch.zeros_like(x)
x_zeros
# tensor([[0., 0.],
#         [0., 0.]])
x_ones = torch.ones_like(x)
x_ones
# tensor([[1., 1.],
#         [1., 1.]])
x_rand = torch.rand_like(x)
x_rand
# tensor([[0.6859, 0.5000],
#         [0.1916, 0.6818]])
x_randn = torch.randn_like(x)
x_randn
# tensor([[ 0.1215,  1.3117],
#         [-1.5105,  0.3146]])

By Specifying a Shape

shape = (3,2,2)
x_zeros = torch.zeros(shape)
x_zeros
# tensor([[[0., 0.],
#          [0., 0.]],

#         [[0., 0.],
#          [0., 0.]],

#         [[0., 0.],
#          [0., 0.]]])

With torch.arange

x = torch.arange(10)
x
# tensor([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
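
torch.arange also accepts start, end, and step arguments, in the spirit of Python's built-in range:

torch.arange(2, 10, 2)
# tensor([2, 4, 6, 8])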

Tensor Properties

Data type

x = torch.ones(2,3)
x.dtype
# torch.float32

Shape

x.shape
# torch.Size([2, 3])
x.size(0)
# 2
x.shape[0]
# 2

Contiguous

Using view:

x = torch.arange(6).reshape(2,3)
# tensor([[0, 1, 2],
#         [3, 4, 5]])
x_view1 = x.view(3,2)
# tensor([[0, 1],
#         [2, 3],
#         [4, 5]])
x_view2 = x.view(-1,2)
# tensor([[0, 1],
#         [2, 3],
#         [4, 5]])

A precondition for view is that the tensor must be contiguous.

In PyTorch, to save memory, operations such as transpose and permute do not allocate new memory: they neither modify nor copy the underlying array. Instead, they create a new set of tensor metadata whose strides are recomputed. The torch.view method promises not to modify the array itself, only to view the data with a new shape. So if we call view after a transpose or permute, PyTorch raises an error:

t = torch.arange(12).reshape(3,4)
t
# tensor([[ 0,  1,  2,  3],
#         [ 4,  5,  6,  7],
#         [ 8,  9, 10, 11]])
t.stride() 
#(4, 1)
# 4 is the distance to jump to the next element along dim 0; 1 is the distance along dim 1
t_T = t.transpose(0,1)
t_T
# tensor([[ 0,  4,  8],
#         [ 1,  5,  9],
#         [ 2,  6, 10],
#         [ 3,  7, 11]])
t_T.stride()
# (1, 4)
# after the transpose, the stride along dim 0 is 1, while the stride along dim 1 is 4
t_T.data_ptr() == t.data_ptr()
# True
# the data is stored at the same location
t_T.is_contiguous(), t.is_contiguous()
# (False, True)
t_T.view(-1)
---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
C:\Users\ADMINI~1\AppData\Local\Temp/ipykernel_7968/2726062531.py in <module>
----> 1 t_T.view(-1)

RuntimeError: view size is not compatible with input tensor's size and stride (at least one dimension spans across two contiguous subspaces). Use .reshape(...) instead.

At this point, calling contiguous allocates a new block of memory and copies the data into it in contiguous order:

t_T_contiguous = t_T.contiguous()
t_T_contiguous
# tensor([[ 0,  4,  8],
#         [ 1,  5,  9],
#         [ 2,  6, 10],
#         [ 3,  7, 11]])
t_T_contiguous.data_ptr() == t.data_ptr()
# False
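
Alternatively, as the error message suggests, reshape handles this case directly: it returns a view when the tensor is contiguous and silently makes a contiguous copy when it is not. A quick check on the same tensor:

t_T.reshape(-1)
# tensor([ 0,  4,  8,  1,  5,  9,  2,  6, 10,  3,  7, 11])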

Unsqueeze and squeeze

x = torch.arange(10).reshape(5,2)
x
# tensor([[0, 1],
#         [2, 3],
#         [4, 5],
#         [6, 7],
#         [8, 9]])
x = x.unsqueeze(1)
x.shape
# torch.Size([5, 1, 2])
x = x.squeeze(1)
x.shape
# torch.Size([5, 2])
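
Calling squeeze with no arguments removes every dimension of size 1 (a quick check):

x.unsqueeze(0).unsqueeze(2).shape
# torch.Size([1, 5, 1, 2])
x.unsqueeze(0).unsqueeze(2).squeeze().shape
# torch.Size([5, 2])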

Device

Device tells where the tensor is stored, which determines whether the GPU or the CPU handles the computations involving it.

x = torch.Tensor([[1,2],[3,4]])
x
# tensor([[1., 2.],
#         [3., 4.]])
x.device
# device(type='cpu')
x = x.to('cuda')
x.device
# device(type='cuda', index=0)
torch.cuda.is_available()
# True
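
A common device-agnostic pattern picks the device once and passes it around (a minimal sketch):

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
x = torch.ones(2, 2, device = device)
x.device
# device(type='cuda', index=0) if a GPU is available, otherwise device(type='cpu')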

Tensor Indexing

x = torch.arange(12).reshape(3,2,2)
x
# tensor([[[ 0,  1],
#          [ 2,  3]],

#         [[ 4,  5],
#          [ 6,  7]],

#         [[ 8,  9],
#          [10, 11]]])
x.shape
# torch.Size([3, 2, 2])
x[0], x[0,:], x[0,:,:], x[0].shape
# (tensor([[0, 1],
#          [2, 3]]),
#  tensor([[0, 1],
#          [2, 3]]),
#  tensor([[0, 1],
#          [2, 3]]),
#  torch.Size([2, 2]))

You can select data along a dimension with an index list:

i = torch.tensor([0,1,0,1])
x[i], x[i].shape
# (tensor([[[0, 1],
#           [2, 3]],
 
#          [[4, 5],
#           [6, 7]],
 
#          [[4, 5],
#           [6, 7]]]),
#  torch.Size([3, 2, 2]))

You can also select with several index lists, one per dimension:

i = torch.tensor([1,2, 1, 2])
j = torch.tensor([1])
k = torch.tensor([1])
x[i,j,k], x[i,j,k].shape
# (tensor([ 7, 11,  7, 11]), torch.Size([4]))

# or
x[[0,0,2],[1],:]
# tensor([[ 2,  3],
#         [ 2,  3],
#         [10, 11]])

x[0:2,[1],:]
# tensor([[[2, 3]],

#         [[6, 7]]])

Operations

Broadcast

Tensors broadcast like NumPy arrays:

a = torch.ones((4,3)) * 6
b = torch.ones(3) * 2
a, b, a/b
# (tensor([[6., 6., 6.],
#          [6., 6., 6.],
#          [6., 6., 6.],
#          [6., 6., 6.]]),
#  tensor([2., 2., 2.]),
#  tensor([[3., 3., 3.],
#          [3., 3., 3.],
#          [3., 3., 3.],
#          [3., 3., 3.]]))

Statistics

You can also compute statistics.
Note that mean and std require a floating-point tensor:

import pprint as pp

m = torch.tensor([
    [1., 1.],
    [2., 2.],
    [3., 3.],
    [4., 4.]
])
print(m.shape)
pp.pprint('Mean: {}'.format(m.mean()))
pp.pprint('Mean in the 0th dimension: {}'.format(m.mean(0)))
pp.pprint('Standard deviation in the 0th dimension: {}'.format(m.std(0)))
# torch.Size([4, 2])
# 'Mean: 2.5'
# 'Mean in the 0th dimension: tensor([2.5000, 2.5000])'
# the dimension you average over disappears
# 'Standard deviation in the 0th dimension: tensor([1.2910, 1.2910])'

cat

The cat operation is also very useful:

a = torch.arange(24).reshape(3,4,2)
a_cat0 = torch.cat([a,a,a], dim = 0)
a_cat1 = torch.cat([a,a,a], dim = 1)

print('initial shape: {}'.format(a.shape))
print('Shape after concatenation in dimension 0 is: {}'.format(a_cat0.shape))
print('Shape after concatenation in dimension 1 is: {}'.format(a_cat1.shape))
# initial shape: torch.Size([3, 4, 2])
# Shape after concatenation in dimension 0 is: torch.Size([9, 4, 2])
# Shape after concatenation in dimension 1 is: torch.Size([3, 12, 2])
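
Concatenating along the last dimension works the same way (a quick check):

a_cat2 = torch.cat([a,a,a], dim = 2)
a_cat2.shape
# torch.Size([3, 4, 6])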

Autograd

Autograd is the essence of PyTorch.

x = torch.tensor([2. ])
print(x.requires_grad)
# False
# requires_grad defaults to False
x = torch.tensor([2. ], requires_grad = True)
print(x.grad)
# None
# before any backward pass, grad is None

An Example

x = torch.tensor([2. ], requires_grad = True)
y = x*x *3
y.backward()
pp.pprint(x.grad)
# tensor([12.])

Now run another computation on x:

z = x*x*3
z.backward()
pp.pprint(x.grad)
# tensor([24.])

Note: always remember to zero the gradients. x.grad is updated to be the sum of all gradients computed so far.

Thus we need to call zero_grad() in every training iteration; otherwise the gradients will keep building up.
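
Continuing the example, zeroing the gradient in place restores the single-pass value (a minimal sketch; x.grad.zero_() plays the role that optimizer.zero_grad() plays in a training loop):

x.grad.zero_()
z = x*x*3
z.backward()
pp.pprint(x.grad)
# tensor([12.])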

BTW, you will find many tutorials about Variable online, but it is no longer needed.

In earlier versions, a tensor had to be wrapped in a Variable before gradients could be computed in the backward pass. Since PyTorch 0.4, Variable and Tensor have been merged: a tensor no longer needs to be wrapped in a Variable to compute gradients, because tensors now carry all the properties Variable used to provide.
As the flag that enables autograd, requires_grad is now an attribute of Tensor, so as soon as any input tensor of an operation has requires_grad = True, autograd automatically tracks history and back-propagates.

Source: https://blog.csdn.net/weixin_44054487/article/details/92844571
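
A small sketch of that propagation rule:

a = torch.ones(2, requires_grad = True)
b = torch.ones(2)   # requires_grad defaults to False
c = (a + b).sum()
c.requires_grad
# True: one input of the addition requires grad, so the result does too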

Neural Network Module

The torch.nn module provides many predefined layers:

import torch.nn as nn
# there are many predefined blocks in torch.nn

Module Layers

An example:

input = torch.ones(2,3,4)
# create a linear layer transforming inputs of shape (N, *, H_in) to (N, *, H_out)
linear = nn.Linear(4, 2)
linear_output = linear(input)
linear_output, linear_output.shape
# (tensor([[[ 0.3793, -0.3961],
#           [ 0.3793, -0.3961],
#           [ 0.3793, -0.3961]],
 
#          [[ 0.3793, -0.3961],
#           [ 0.3793, -0.3961],
#           [ 0.3793, -0.3961]]], grad_fn=<AddBackward0>),
#  torch.Size([2, 3, 2]))

Besides Linear, there are many other layers, for example:
nn.Conv2d, nn.ConvTranspose2d, nn.BatchNorm1d, nn.BatchNorm2d, nn.Upsample, nn.MaxPool2d
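
For example, a convolution layer maps (N, C_in, H, W) inputs to (N, C_out, H_out, W_out); a minimal sketch with made-up sizes:

conv = nn.Conv2d(in_channels = 3, out_channels = 8, kernel_size = 3, padding = 1)
conv(torch.randn(1, 3, 32, 32)).shape
# torch.Size([1, 8, 32, 32])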

Activation Function Layer

nn.ReLU(), nn.Sigmoid(), nn.LeakyReLU()

linear_output
# tensor([[[ 0.3793, -0.3961],
#          [ 0.3793, -0.3961],
#          [ 0.3793, -0.3961]],

#         [[ 0.3793, -0.3961],
#          [ 0.3793, -0.3961],
#          [ 0.3793, -0.3961]]], grad_fn=<AddBackward0>)
sigmoid = nn.Sigmoid()
output = sigmoid(linear_output)
output
# tensor([[[0.5937, 0.4022],
#          [0.5937, 0.4022],
#          [0.5937, 0.4022]],

#         [[0.5937, 0.4022],
#          [0.5937, 0.4022],
#          [0.5937, 0.4022]]], grad_fn=<SigmoidBackward0>)

Note how grad_fn changes here.

Putting the Layers Together

block = nn.Sequential(nn.Linear(4,2), nn.Sigmoid())

input = torch.ones(2,3,4)
output = block(input)
output
# tensor([[[0.5822, 0.5445],
#          [0.5822, 0.5445],
#          [0.5822, 0.5445]],

#         [[0.5822, 0.5445],
#          [0.5822, 0.5445],
#          [0.5822, 0.5445]]], grad_fn=<SigmoidBackward0>)

Custom Modules

class MultilayerPerceptron(nn.Module):
    def __init__(self, input_size, hidden_size):
        super(MultilayerPerceptron, self).__init__()
        self.input_size = input_size
        self.hidden_size = hidden_size
        self.model = nn.Sequential(
            nn.Linear(self.input_size, self.hidden_size),
            nn.ReLU(),
            nn.Linear(self.hidden_size, self.input_size),
            nn.Sigmoid()
        )
        
    def forward(self, x):
        output = self.model(x)
        return output

input = torch.randn(2, 5)
model = MultilayerPerceptron(5,3)
output = model(input)
output, output.shape
# (tensor([[0.6317, 0.5629, 0.4682, 0.6893, 0.4653],
#          [0.6339, 0.5788, 0.4785, 0.6718, 0.4759]], grad_fn=<SigmoidBackward0>),
#  torch.Size([2, 5]))

list(model.named_parameters())
# [('model.0.weight',
#   Parameter containing:
#   tensor([[ 0.3093,  0.4423,  0.2119, -0.2477,  0.1644],
#           [-0.0874,  0.2330, -0.2347, -0.4302, -0.2038],
#           [-0.0728,  0.3057, -0.3874, -0.0650, -0.3754]], requires_grad=True)),
#  ('model.0.bias',
#   Parameter containing:
#   tensor([-0.1116, -0.2941,  0.1612], requires_grad=True)),
#  ('model.2.weight',
#   Parameter containing:
#   tensor([[-0.0755, -0.1328,  0.0387],
#           [ 0.0355, -0.4333, -0.4412],
#           [ 0.1081, -0.3106, -0.2242],
#           [-0.4018,  0.5232,  0.5647],
#           [-0.1715, -0.3474, -0.1894]], requires_grad=True)),
#  ('model.2.bias',
#   Parameter containing:
#   tensor([0.5289, 0.5478, 0.0306, 0.4223, 0.0022], requires_grad=True))]

Optimization

First import the optimizer module:

import torch.optim as optim

Define a dummy input and target:

y = torch.ones(10, 5)
x = y + torch.randn_like(y)

Train using the custom nn.Module defined above:

model = MultilayerPerceptron(5,3)

adam = optim.Adam(model.parameters(), lr = 1e-1)

loss_function = nn.BCELoss()

y_predict = model(x)

loss_function(y_predict, y).item()
# 0.7197282910346985

n_epoch = 100
for epoch in range(n_epoch):
    adam.zero_grad()
    y_pred = model(x)
    loss = loss_function(y_pred, y)
    print(f"Epoch {epoch}: training loss: {loss}")
    loss.backward()
    adam.step()
# Epoch 0: training loss: 0.7025878429412842
# Epoch 1: training loss: 0.6235182285308838
# Epoch 2: training loss: 0.5331209897994995
# Epoch 3: training loss: 0.42057228088378906
# Epoch 4: training loss: 0.30540353059768677
# Epoch 5: training loss: 0.20316310226917267
# ...
# Epoch 95: training loss: 3.5762795391747204e-08
# Epoch 96: training loss: 3.5762795391747204e-08
# Epoch 97: training loss: 3.5762795391747204e-08
# Epoch 98: training loss: 3.5762795391747204e-08
# Epoch 99: training loss: 3.5762795391747204e-08

y_pred = model(x)
y_pred, loss_function(y_pred, y).item()
# (tensor([[1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000]], grad_fn=<SigmoidBackward0>),
#  3.5762795391747204e-08)

x2 = y + torch.randn_like(y)
y_pred = model(x2)
y_pred, loss_function(y_pred, y).item()
# (tensor([[1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [0.9998, 0.9999, 1.0000, 0.9997, 0.9999],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000],
#          [1.0000, 1.0000, 1.0000, 0.9999, 1.0000],
#          [1.0000, 1.0000, 1.0000, 1.0000, 1.0000]], grad_fn=<SigmoidBackward0>),
#  1.5589259419357404e-05)

Demo: Word Window Classification

Data

corpus = [
    'We always come to Paris',
    'The professor, is from Australia',
    'I live in Stanford',
    'He comes from Taiwan',
    'The capital of Turkey is Ankara'
]

Preprocessing

Clean the data and add the labels:

def preprocess_sentence(sentence):
    return sentence.lower().split()

train_sentences = [preprocess_sentence(sent) for sent in corpus]
train_sentences
# [['we', 'always', 'come', 'to', 'paris'],
#  ['the', 'professor,', 'is', 'from', 'australia'],
#  ['i', 'live', 'in', 'stanford'],
#  ['he', 'comes', 'from', 'taiwan'],
#  ['the', 'capital', 'of', 'turkey', 'is', 'ankara']]
location = set(['australia', 'ankara', 'paris', 'stanford', 'taiwan', 'turkey'])
train_labels = [[1 if word in location else 0 for word in sent] for sent in train_sentences]
train_labels
# [[0, 0, 0, 0, 1],
#  [0, 0, 0, 0, 1],
#  [0, 0, 0, 1],
#  [0, 0, 0, 1],
#  [0, 0, 0, 1, 0, 1]]

Converting Words to Embeddings

vocabulary = set(w for s in train_sentences for w in s)
vocabulary.add('<pad>')
vocabulary.add('<unk>')

def pad_window(sentence, window_size, pad_token = '<pad>'):
    window = [pad_token] * window_size
    return window + sentence + window

window_size = 2
pad_window(train_sentences[0], window_size)
# ['<pad>', '<pad>', 'we', 'always', 'come', 'to', 'paris', '<pad>', '<pad>']

ix2word = sorted(list(vocabulary))
word2ix = {word: ix for ix, word in enumerate(ix2word)}
word2ix
# {'<pad>': 0,
#  '<unk>': 1,
#  'always': 2,
#  'ankara': 3,
#  'australia': 4,
#  'capital': 5,
#  'come': 6,
#  'comes': 7,
#  'from': 8,
#  'he': 9,
#  'i': 10,
#  'in': 11,
#  'is': 12,
#  'live': 13,
#  'of': 14,
#  'paris': 15,
#  'professor,': 16,
#  'stanford': 17,
#  'taiwan': 18,
#  'the': 19,
#  'to': 20,
#  'turkey': 21,
#  'we': 22}

def convert_token_to_indices(sentence, word2ix):
    return [word2ix.get(token, word2ix['<unk>']) for token in sentence]

example_sentence = ['we', 'always', 'come', 'to', 'kuwait']
example_indices = convert_token_to_indices(example_sentence, word2ix)
restored_example = [ix2word[ind] for ind in example_indices]

print(example_sentence, example_indices, restored_example)
# ['we', 'always', 'come', 'to', 'kuwait'] 
# [22, 2, 6, 20, 1] 
# ['we', 'always', 'come', 'to', '<unk>']

example_padded_indices = [convert_token_to_indices(s, word2ix) for s in train_sentences]
example_padded_indices
# [[22, 2, 6, 20, 15],
#  [19, 16, 12, 8, 4],
#  [10, 13, 11, 17],
#  [9, 7, 8, 18],
#  [19, 5, 14, 21, 12, 3]]

embedding_dim = 5
embeds = nn.Embedding(len(vocabulary), embedding_dim)

# an example
index_paris = word2ix['paris']
index_ankara = word2ix['ankara']
indices = [index_paris, index_ankara]
indices_tensor = torch.tensor(indices, dtype=torch.long)
embeddings = embeds(indices_tensor)
embeddings
# tensor([[ 0.0622,  0.5321,  0.3486, -0.8931, -1.4741],
#         [ 0.3636, -0.7360,  1.4412,  0.0530, -0.8008]],
#        grad_fn=<EmbeddingBackward0>)

Batching Sentences

First, write a collate_fn function:

def _custom_collate_fn(batch, window_size, word2ix):
    # separate the sentences from the labels
    x, y = zip(*batch)
    x = [pad_window(s, window_size = window_size) for s in x]
    x = [convert_token_to_indices(s, word2ix) for s in x]
    
    # pad every sentence in the batch to the length of the longest one
    pad_token = word2ix['<pad>']
    x = [torch.LongTensor(x_i) for x_i in x]
    x_padded = nn.utils.rnn.pad_sequence(x, batch_first=True, padding_value = pad_token)
    
    # keep the original (unpadded) lengths, then pad the labels too
    lengths = [len(label) for label in y]
    lengths = torch.LongTensor(lengths)
    y = [torch.LongTensor(y_i) for y_i in y]
    y_padded = nn.utils.rnn.pad_sequence(y, batch_first=True, padding_value = 0)
    
    return x_padded, y_padded, lengths

For the differences among pad_packed_sequence, pack_padded_sequence, pack_sequence, and pad_sequence, see the torch.nn.utils.rnn documentation; a small sketch follows below.
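
A minimal sketch of the pad-then-pack round trip (the toy sequences here are made up):

from torch.nn.utils.rnn import pad_sequence, pack_padded_sequence, pad_packed_sequence

seqs = [torch.tensor([1, 2, 3]), torch.tensor([4, 5])]
padded = pad_sequence(seqs, batch_first = True, padding_value = 0)
# tensor([[1, 2, 3],
#         [4, 5, 0]])
packed = pack_padded_sequence(padded, lengths = [3, 2], batch_first = True, enforce_sorted = False)
unpacked, lengths = pad_packed_sequence(packed, batch_first = True)
# unpacked equals padded; lengths is tensor([3, 2])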

What does the loader actually load? Note the use of partial below: it simply returns the function with some of its arguments already fixed.

from torch.utils.data import DataLoader
from functools import partial

data = list(zip(train_sentences, train_labels))
batch_size = 2
shuffle = True
window_size = 2
collate_fn = partial(_custom_collate_fn, window_size = window_size, word2ix = word2ix)
loader = DataLoader(data, batch_size = batch_size, shuffle= shuffle, collate_fn = collate_fn)
counter = 0
for batched_x, batched_y, batched_lengths in loader:
    
    print(f'Iteration {counter}')
    print('Batched Input: ')
    print(batched_x)
    print('Batched Labels: ')
    print(batched_y)
    print('Batched Lengths')
    print(batched_lengths)
    counter+=1
# Iteration 0
# Batched Input: 
# tensor([[ 0,  0, 19, 16, 12,  8,  4,  0,  0,  0],
#         [ 0,  0, 19,  5, 14, 21, 12,  3,  0,  0]])
# Batched Labels: 
# tensor([[0, 0, 0, 0, 1, 0],
#         [0, 0, 0, 1, 0, 1]])
# Batched Lengths
# tensor([5, 6])
# Iteration 1
# Batched Input: 
# tensor([[ 0,  0, 22,  2,  6, 20, 15,  0,  0],
#         [ 0,  0,  9,  7,  8, 18,  0,  0,  0]])
# Batched Labels: 
# tensor([[0, 0, 0, 0, 1],
#         [0, 0, 0, 1, 0]])
# Batched Lengths
# tensor([5, 4])
# Iteration 2
# Batched Input: 
# tensor([[ 0,  0, 10, 13, 11, 17,  0,  0]])
# Batched Labels: 
# tensor([[0, 0, 0, 1]])
# Batched Lengths
# tensor([4])

Next, use the unfold function to obtain the windows. From the PyTorch docs:
Tensor.unfold(dimension, size, step) → Tensor
Returns a view of the original tensor which contains all slices of size size from self tensor in the dimension dimension.

Parameters
dimension (int) – dimension in which unfolding happens
size (int) – the size of each slice that is unfolded
step (int) – the step between each slice

print(f'Original Tensor: ')
print(batched_x)
print("")
chunk = batched_x.unfold(1, window_size*2 + 1, 1)
print(f"Windows: ")
print(chunk)
# Original Tensor: 
# tensor([[ 0,  0, 10, 13, 11, 17,  0,  0]])

# Windows: 
# tensor([[[ 0,  0, 10, 13, 11],
#          [ 0, 10, 13, 11, 17],
#          [10, 13, 11, 17,  0],
#          [13, 11, 17,  0,  0]]])

Model

class WordWindowClassifier(nn.Module):
    def __init__(self, hyperparameters, vocab_size, pad_ix = 0):
        super(WordWindowClassifier, self).__init__()
        
        self.window_size = hyperparameters['window_size']
        self.embed_dim = hyperparameters['embed_dim']
        self.hidden_dim = hyperparameters['hidden_dim']
        self.freeze_embeddings = hyperparameters['freeze_embeddings']
        
        self.embeds = nn.Embedding(vocab_size, self.embed_dim, padding_idx = pad_ix)
        if self.freeze_embeddings:
            self.embeds.weight.requires_grad = False
            
        full_window_size = 2 * self.window_size + 1
        self.hidden_layer = nn.Sequential(
            nn.Linear(full_window_size * self.embed_dim, self.hidden_dim),
            nn.Tanh()
        )
        
        self.output_layer = nn.Linear(self.hidden_dim, 1)
        
        self.probabilities = nn.Sigmoid()
        
    def forward(self, inputs):
        '''
        B: batch_size
        L: window_padded sentence length
        D: self.embed_dim
        S: self.window_size
        H: self.hidden_dim
        '''
        
        B,L = inputs.size()
        token_windows = inputs.unfold(1, 2*self.window_size + 1, 1)
        _, adjusted_length, _ = token_windows.size()
        
        assert token_windows.size() == (B, adjusted_length, 2*self.window_size + 1)
        #(B, L~, S)
        
        embedded_windows = self.embeds(token_windows)
        #(B,L~,S,D)
        
        embedded_windows = embedded_windows.view(B, adjusted_length, -1)
        #(B,L~,S*D)
        
        layer_1 = self.hidden_layer(embedded_windows)
        #(B,L~, H)
        
        output = self.output_layer(layer_1)
        
        output = self.probabilities(output)
        #(B, L~, 1)
        
        output = output.view(B, -1)
        #(B, L~)
        return output

Training

data = list(zip(train_sentences, train_labels))
batch_size = 2
shuffle = True
window_size = 2
collate_fn = partial(_custom_collate_fn, window_size= window_size, word2ix = word2ix)
loader = DataLoader(data, batch_size = batch_size, shuffle = shuffle, collate_fn= collate_fn)

model_hyperparameters = {
    'batch_size': 4,
    'window_size': 2,
    'embed_dim': 25,
    'hidden_dim': 25,
    'freeze_embeddings': False
}

vocab_size = len(word2ix)
model = WordWindowClassifier(model_hyperparameters, vocab_size)

learning_rate = 0.01
optimizer = torch.optim.SGD(model.parameters(), lr = learning_rate)

def loss_function(batch_outputs, batch_labels, batch_lengths):
    bceloss = nn.BCELoss()  # don't forget the parentheses: instantiate the loss module first
    loss = bceloss(batch_outputs, batch_labels.float())
    loss = loss/batch_lengths.sum().float()
    return loss

def train_epoch(loss_function, optimizer, model, loader):
    total_loss = 0
    for batch_inputs, batch_labels, batch_lengths in loader:
        optimizer.zero_grad()
        outputs = model(batch_inputs)
        loss =  loss_function(outputs, batch_labels, batch_lengths)
        loss.backward()
        optimizer.step()
        total_loss += loss.item()
    return total_loss
    
def train(loss_function, optimizer, model, loader, num_epochs = 10000):
    for epoch in range(num_epochs):
        epoch_loss = train_epoch(loss_function, optimizer, model, loader)
        if epoch % 100 == 0: print(epoch_loss)
       
num_epochs = 1000
train(loss_function, optimizer, model, loader, num_epochs = num_epochs)
# 0.2615063562989235
# 0.2215040773153305
# 0.17036793380975723
# 0.15041063725948334
# 0.12963269650936127
# 0.08397780545055866
# 0.07003378123044968
# 0.05545256659388542
# 0.04804721102118492
# 0.03628341108560562

Prediction

Generate the test loader in the same way:

test_corpus = ['She comes from Paris']
test_sentences = [s.lower().split() for s in test_corpus]
test_labels = [(0,0,0,1)]

test_data = list(zip(test_sentences, test_labels))
batch_size = 1
shuffle = False
window_size = 2
collate_fn = partial(_custom_collate_fn, window_size = window_size, word2ix = word2ix)
test_loader = torch.utils.data.DataLoader(test_data, batch_size = batch_size, shuffle = shuffle, collate_fn  = collate_fn)

Check the predictions:

for test_instances, labels, _ in test_loader:
    outputs = model(test_instances)
    print(labels)
    print(outputs)
# tensor([[0, 0, 0, 1]])
# tensor([[0.0859, 0.1915, 0.0471, 0.9349]], grad_fn=<SigmoidBackward0>)
