apodxxx

Swin Transformer Explained, with a Code Reproduction

1. Swin Transformer network architecture

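In practice, the structure to follow when reproducing the code is the one shown in the figure below. The following sections implement it piece by piece, following that figure.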

2. Patch Partition & Patch Embedding

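First, the image is fed into the Patch Partition module and split into patches: every 4x4 block of adjacent pixels forms one patch, which is then flattened along the channel dimension. Assuming an RGB input, each patch has 4x4=16 pixels and each pixel has three values (R, G, B), so a flattened patch holds 16x3=48 values, and Patch Partition changes the image shape from [H, W, 3] to [H/4, W/4, 48]. The Linear Embedding layer then applies a linear transform to the channel data at each position, mapping 48 to C, so the shape changes from [H/4, W/4, 48] to [H/4, W/4, C]. In the source code, Patch Partition and Linear Embedding are implemented together by a single convolution layer, exactly the same structure as the Embedding layer described earlier for Vision Transformer.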

import paddle
import paddle.nn as nn

class PatchEmbedding(nn.Layer):
    def __init__(self, patch_size=4, embed_dim=96):
        super().__init__()
        # A conv with kernel_size == stride == patch_size performs
        # Patch Partition and Linear Embedding in one step.
        self.patch_embed = nn.Conv2D(3, out_channels=embed_dim,
                                     kernel_size=patch_size, stride=patch_size)
        self.norm = nn.LayerNorm(embed_dim)

    def forward(self, x):
        x = self.patch_embed(x)     # [B, embed_dim, h, w]
        x = x.flatten(2)            # [B, embed_dim, h*w]
        x = x.transpose([0, 2, 1])  # [B, h*w, embed_dim]
        x = self.norm(x)
        return x
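The shape bookkeeping for Patch Partition and Linear Embedding can be checked by hand. The following is a minimal NumPy sketch, not the Paddle implementation: `patch_partition` is a hypothetical helper written for this illustration, the random `weight` matrix stands in for the learned linear layer, and C = 96 is assumed.

```python
import numpy as np

def patch_partition(img, patch_size=4):
    """Split an [H, W, C] image into non-overlapping patches and flatten each one."""
    H, W, C = img.shape
    p = patch_size
    # [H/p, p, W/p, p, C] -> [H/p, W/p, p, p, C] -> [H/p, W/p, p*p*C]
    x = img.reshape(H // p, p, W // p, p, C)
    x = x.transpose(0, 2, 1, 3, 4)
    return x.reshape(H // p, W // p, p * p * C)

img = np.random.rand(224, 224, 3).astype("float32")
patches = patch_partition(img)
print(patches.shape)  # (56, 56, 48)

# Linear Embedding: one shared weight matrix maps 48 -> C (C = 96 assumed here).
C_embed = 96
weight = np.random.rand(48, C_embed).astype("float32")
tokens = patches @ weight
print(tokens.shape)  # (56, 56, 96)
```

Because the same 48-to-C matrix is applied to every non-overlapping 4x4 patch, the two steps are equivalent to a convolution with kernel size 4 and stride 4, which is why a single Conv2D is enough.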

3. Patch Merging

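As mentioned earlier, every stage except Stage 1 begins with a Patch Merging layer that downsamples the feature map. As shown in the figure below, suppose the input to Patch Merging is a 4x4 single-channel feature map. Patch Merging groups every 2x2 block of adjacent pixels into a patch, then gathers the pixels at the same position (same color) in each patch, which gives four feature maps. These four feature maps are concatenated along the depth dimension, which raises the depth from C to 4C, and then passed through a LayerNorm layer. Finally, a fully connected layer applies a linear transform along the depth dimension, reducing the depth from 4C to 2C. As this simple example shows, after Patch Merging the height and width of the feature map are halved and the depth is doubled.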

class PatchMerging(nn.Layer):
    def __init__(self,resolution,dim):
        super().__init__()
        self.resolution = resolution
        self.dim = dim
        self.reduction = nn.Linear(4*dim,2*dim)
        self.norm = nn.LayerNorm(4*dim)
        
    def forward(self,x):
        h ,w = self.resolution
        b,_,c = x.shape
        x = x.reshape([b,h,w,c])
        x0 = x[:,0::2,0::2,:]
        x1 = x[:,0::2,1::2,:]
        x2 = x[:,1::2,0::2,:]
        x3 = x[:,1::2,1::2,:]
        x = paddle.concat([x0,x1,x2,x3],axis=-1)
        x = x.reshape([b,-1,4*c])
        x = self.norm(x)
        x = self.reduction(x)
        return x

PS: a demonstration of what x[:,0::2,0::2,:] and the related slices do
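The effect of the four strided slices can be seen on a tiny example. This is a NumPy sketch (NumPy slicing follows the same rules as the Paddle tensor slicing used in PatchMerging); the 4x4 single-channel input is an assumption chosen to mirror the example in the text.

```python
import numpy as np

# A 4x4 single-channel feature map in [B, H, W, C] layout, values 0..15.
x = np.arange(16).reshape(1, 4, 4, 1)

x0 = x[:, 0::2, 0::2, :]  # even rows, even cols: top-left pixel of each 2x2 patch
x1 = x[:, 0::2, 1::2, :]  # even rows, odd cols:  top-right pixel
x2 = x[:, 1::2, 0::2, :]  # odd rows, even cols:  bottom-left pixel
x3 = x[:, 1::2, 1::2, :]  # odd rows, odd cols:   bottom-right pixel

print(x0[0, :, :, 0].tolist())  # [[0, 2], [8, 10]]
print(x1[0, :, :, 0].tolist())  # [[1, 3], [9, 11]]
print(x2[0, :, :, 0].tolist())  # [[4, 6], [12, 14]]
print(x3[0, :, :, 0].tolist())  # [[5, 7], [13, 15]]

# Concatenating along the channel axis halves H and W and multiplies C by 4.
merged = np.concatenate([x0, x1, x2, x3], axis=-1)
print(merged.shape)             # (1, 2, 2, 4)
print(merged[0, 0, 0].tolist()) # [0, 1, 4, 5]
```

Each output position now holds the four pixels of one 2x2 patch, which is what the following LayerNorm and Linear(4C, 2C) operate on.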

4. W-MSA (Window Multi-head Self-Attention) and SW-MSA (Shifted Window Multi-head Self-Attention)

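The Window Multi-head Self-Attention (W-MSA) module is introduced to reduce computation. With W-MSA, self-attention is computed only within each window, so no information can pass between windows. To solve this problem, the authors introduced the Shifted Window Multi-head Self-Attention (SW-MSA) module.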
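To make the saving concrete, the Swin Transformer paper gives the cost of global MSA on an h x w feature map with C channels as 4hwC^2 + 2(hw)^2 C, and the cost of W-MSA with window size M as 4hwC^2 + 2M^2 hwC. The sketch below simply evaluates those two formulas; the function names are hypothetical, and the stage-1 sizes (56x56, C=96, M=7) are assumed from the default configuration used later in this post.

```python
def msa_flops(h, w, C):
    """Cost of global multi-head self-attention, per the Swin paper formula."""
    return 4 * h * w * C ** 2 + 2 * (h * w) ** 2 * C

def wmsa_flops(h, w, C, M):
    """Cost of window attention with window size M, per the Swin paper formula."""
    return 4 * h * w * C ** 2 + 2 * M ** 2 * h * w * C

h = w = 56
C, M = 96, 7
global_cost = msa_flops(h, w, C)
window_cost = wmsa_flops(h, w, C, M)
print(global_cost, window_cost, round(global_cost / window_cost, 1))
```

The quadratic term in hw is replaced by one that is linear in hw, so the gap grows with resolution. When the window covers the whole feature map (M = h = w) the two formulas coincide.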

# Split the feature map into windows; attention is then computed inside each window.
def windows_partition(x, window_size):
    B, H, W, C = x.shape
    x = x.reshape([B, H//window_size, window_size, W//window_size, window_size, C])
    # reshape/transpose return new tensors, so the results must be assigned back
    x = x.transpose([0, 1, 3, 2, 4, 5])
    # [B, H//window_size, W//window_size, window_size, window_size, C]
    x = x.reshape([-1, window_size, window_size, C])
    # [B*(H//window_size)*(W//window_size), window_size, window_size, C]
    return x

# Merge the windows back into a single feature map.
def window_reverse(window, window_size , H , W ):
    B = window.shape[0]//((H//window_size)*(W//window_size))
    x = window.reshape([B,H//window_size,W//window_size,window_size,window_size,-1])
    x = x.transpose([0,1,3,2,4,5])
    x = x.reshape([B,H,W,-1])
    return x
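A quick way to convince yourself that the two functions are inverses is a round trip on a small tensor. The sketch below re-implements both in NumPy (the `_np` names are hypothetical and exist only for this illustration) using the same reshape and transpose pattern.

```python
import numpy as np

def windows_partition_np(x, window_size):
    """[B, H, W, C] -> [B * num_windows, window_size, window_size, C]."""
    B, H, W, C = x.shape
    x = x.reshape(B, H // window_size, window_size, W // window_size, window_size, C)
    x = x.transpose(0, 1, 3, 2, 4, 5)
    return x.reshape(-1, window_size, window_size, C)

def window_reverse_np(windows, window_size, H, W):
    """[B * num_windows, window_size, window_size, C] -> [B, H, W, C]."""
    B = windows.shape[0] // ((H // window_size) * (W // window_size))
    x = windows.reshape(B, H // window_size, W // window_size, window_size, window_size, -1)
    x = x.transpose(0, 1, 3, 2, 4, 5)
    return x.reshape(B, H, W, -1)

x = np.arange(2 * 8 * 8 * 3).reshape(2, 8, 8, 3)
windows = windows_partition_np(x, 4)
print(windows.shape)  # (8, 4, 4, 3)

restored = window_reverse_np(windows, 4, 8, 8)
print(np.array_equal(restored, x))  # True
```

Windows are ordered by batch, then window row, then window column, so `windows[1]` is the top-right 4x4 window of the first image.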

Next, self-attention is computed within each window. Leaving the mask aside, this attention is no different from the self-attention in a standard Transformer.

class window_attention(nn.Layer):
    def __init__(self,dim,window_size,num_heads):
        super().__init__()
        self.dim = dim
        self.dim_head = dim//num_heads
        self.num_heads = num_heads
        self.scale = self.dim_head**-0.5
        self.softmax = nn.Softmax(-1)
        self.qkv = nn.Linear(dim,int(dim*3))
        self.proj = nn.Linear(dim,dim)
    
    def transpose_multi_head(self,x):
        new_shape = x.shape[:-1]+[self.num_heads,self.dim_head]
        x = x.reshape(new_shape)
        # [B,num_patches,num_heads,dim_head]
        x = x.transpose([0,2,1,3])
         # [B,num_heads,num_patches,dim_head]
        return x
    def forward(self,x,mask=None):
        B,N,C = x.shape
        qkv = self.qkv(x).chunk(3,-1)
        q,k,v = map(self.transpose_multi_head,qkv)
        q = q*self.scale
        attn = paddle.matmul(q,k,transpose_y=True)
        
        # attn: [B, num_heads, N, N], where B = batch_size * num_windows
        if mask is None:
            attn = self.softmax(attn)
        else:
            # mask: [num_windows, N, N]; expose the window axis so it can broadcast
            num_windows, n = mask.shape[0], mask.shape[1]
            attn = attn.reshape([B // num_windows, num_windows, self.num_heads, n, n])
            attn = attn + mask.unsqueeze(1).unsqueeze(0)
            attn = attn.reshape([-1, self.num_heads, n, n])
            attn = self.softmax(attn)
        attn = paddle.matmul(attn, v)
        # [B, num_heads, num_patches, dim_head]
        attn = attn.transpose([0, 2, 1, 3])
        # [B, num_patches, num_heads, dim_head]
        attn = attn.reshape([B, N, C])
        out = self.proj(attn)
        return out
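The reshape around the mask addition is the least obvious part of the forward pass. Here is a minimal NumPy sketch of just that step, under assumed sizes (batch 2, 4 windows, 3 heads, 16 tokens per window) and a made-up mask that splits each window into two regions; it is meant to show the broadcasting, not to reproduce the Paddle layer.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract the max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

batch, num_windows, num_heads, N = 2, 4, 3, 16
rng = np.random.default_rng(0)
attn = rng.standard_normal((batch * num_windows, num_heads, N, N))

# Hypothetical mask: tokens 0..7 and tokens 8..15 of every window are in different regions.
group = np.arange(N) // 8
mask = np.where(group[:, None] != group[None, :], -100.0, 0.0)
mask = np.broadcast_to(mask, (num_windows, N, N))

# Expose the window axis, add the mask (broadcast over batch and heads), fold it back.
attn = attn.reshape(batch, num_windows, num_heads, N, N)
attn = attn + mask[None, :, None, :, :]
attn = attn.reshape(-1, num_heads, N, N)
probs = softmax(attn)

print(probs.shape)                  # (8, 3, 16, 16)
print(probs[:, :, :8, 8:].max())    # effectively zero: cross-region attention is suppressed
```

Adding -100 before the softmax drives the masked weights to nearly zero while every row still sums to 1, so tokens only attend within their own region.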

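As for how SW-MSA (Shifted Window Multi-head Self-Attention) is actually implemented, see the blog post for details. Here I have written a few demos for the parts I found difficult, to make them easier to understand.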

paddle.roll()

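About paddle.roll (same as torch.roll): in the figure below, b is a rolled by two positions along axis 0 and by two positions along axis 1. Applying the same operation to b again gives back a, which happens whenever the total shift equals the axis length, because the roll is cyclic.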
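The same behaviour can be reproduced with `np.roll`, which rolls cyclically in the same way. This is a small sketch with an assumed 4x4 array; the second half shows the general pattern used around window attention, where a shift of -s is undone by a shift of +s.

```python
import numpy as np

a = np.arange(16).reshape(4, 4)
b = np.roll(a, shift=(2, 2), axis=(0, 1))
print(b)
# [[10 11  8  9]
#  [14 15 12 13]
#  [ 2  3  0  1]
#  [ 6  7  4  5]]

# Rolling by 2 again completes a full cycle of length 4 and restores a.
again = np.roll(b, shift=(2, 2), axis=(0, 1))

# General case: shift by -s before attention, by +s afterwards.
s = 3
x = np.arange(49).reshape(7, 7)
shifted = np.roll(x, shift=(-s, -s), axis=(0, 1))
restored = np.roll(shifted, shift=(s, s), axis=(0, 1))
print(np.array_equal(again, a), np.array_equal(restored, x))  # True True
```

Note that rolling a second time by the same negative shift would not restore the input unless the shifts happen to add up to the axis length, so the reverse shift needs the opposite sign.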

How to generate the attention mask

About self.register_buffer and the attention mask

        if self.shift_size > 0:
            H, W = self.resolution
            img_mask = paddle.zeros((1, H, W, 1))
            h_slices = (slice(0, -self.window_size),
                        slice(-self.window_size, -self.shift_size),
                        slice(-self.shift_size, None))
            w_slices = (slice(0, -self.window_size),
                        slice(-self.window_size, -self.shift_size),
                        slice(-self.shift_size, None))
            cnt = 0
            for h in h_slices:
                for w in w_slices:
                    img_mask[:, h, w, :] = cnt
                    cnt += 1
            mask_windows = windows_partition(img_mask, self.window_size)
            mask_windows = mask_windows.reshape((-1, self.window_size * self.window_size))
            attn_mask = mask_windows.unsqueeze(1) - mask_windows.unsqueeze(2)
            attn_mask = paddle.where(attn_mask != 0,
                                     paddle.ones_like(attn_mask) * float(-100.0),
                                     attn_mask)
            attn_mask = paddle.where(attn_mask == 0,
                                     paddle.zeros_like(attn_mask),
                                     attn_mask)
        else:
            attn_mask = None
            
        self.register_buffer("attn_mask", attn_mask)

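In general, a network's state is saved as an OrderedDict. It contains two kinds of entries. The first is the parameters held by the model's modules, i.e. nn.Parameter (we can of course define additional nn.Parameter entries in the network). The second is buffers. The former are updated on every optim.step; the latter are not.
Next, the mask image is split into windows and each window is flattened. The flattened vector is then subtracted from itself pairwise via broadcasting (not multiplied), and every nonzero difference is set to -100, which gives the attention mask. (The figure two above illustrates this.)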
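The mask construction can be traced on a small grid. The sketch below follows the same steps as the Paddle snippet above, but in NumPy and as a standalone function; `build_attn_mask` is a hypothetical name, and the 8x8 resolution with window size 4 and shift size 2 is assumed only to keep the output small.

```python
import numpy as np

def build_attn_mask(H, W, window_size, shift_size):
    """Return a [num_windows, N, N] mask with 0 for same-region pairs and -100 otherwise."""
    img_mask = np.zeros((1, H, W, 1))
    slices = (slice(0, -window_size),
              slice(-window_size, -shift_size),
              slice(-shift_size, None))
    cnt = 0
    for h in slices:          # label the 3 x 3 = 9 regions created by the cyclic shift
        for w in slices:
            img_mask[:, h, w, :] = cnt
            cnt += 1
    # Partition the label image into windows and flatten each window.
    nh, nw = H // window_size, W // window_size
    x = img_mask.reshape(1, nh, window_size, nw, window_size, 1)
    x = x.transpose(0, 1, 3, 2, 4, 5).reshape(-1, window_size * window_size)
    # Pairwise difference of region labels: nonzero means "different region".
    diff = x[:, None, :] - x[:, :, None]
    return np.where(diff != 0, -100.0, 0.0)

mask = build_attn_mask(H=8, W=8, window_size=4, shift_size=2)
print(mask.shape)            # (4, 16, 16)
print((mask[0] == 0).all())  # True: the top-left window holds a single region
print(mask[1, 0, 1], mask[1, 0, 2])  # 0.0 -100.0
```

In window 1 (top right), tokens 0 and 1 sit in the same region while token 2 falls in a region that was wrapped around by the shift, so that pair is masked.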

class Identity(nn.Layer):
    def __init__(self):
        super().__init__()
    def forward(self,x):
        return x

class Mlp(nn.Layer):
    def __init__(self,embed_dim,mlp_ratio=4.0,dropout=0.):
        super().__init__()
        w_att_1,b_att_1 = self.init_weight()
        w_att_2,b_att_2 = self.init_weight()
        self.fc1 = nn.Linear(embed_dim,int(embed_dim*mlp_ratio),weight_attr=w_att_1,bias_attr=b_att_1)
        self.fc2 = nn.Linear(int(embed_dim*mlp_ratio),embed_dim,weight_attr=w_att_2,bias_attr=b_att_2)
        self.dropout = nn.Dropout(dropout)
        self.act = nn.GELU()
    def init_weight(self):
        weight_attr = paddle.ParamAttr(initializer=nn.initializer.TruncatedNormal(std=0.02))  # Swin uses a truncated normal with std 0.02
        bias_attr = paddle.ParamAttr(initializer=nn.initializer.Constant(.0))
        return  weight_attr,bias_attr
    def forward(self,x):
        x = self.fc1(x)
        x = self.act(x)
        x = self.dropout(x)
        x = self.fc2(x)
        x = self.dropout(x)
        return x

4. Swin block

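Once all the modules are written, they need to be chained together into a Swin block. Apart from choosing between W-MSA and SW-MSA, it is the same as the encoder in a Transformer. After patch embedding, the patches are divided into windows, W-MSA or SW-MSA is applied within each window, followed by a residual connection, then the MLP and another residual connection.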

class SwinBlock(nn.Layer):
    def __init__(self,dim,input_resolution,num_heads,window_size,shift_size):
        super().__init__()
        self.dim = dim
        self.resolution = input_resolution
        self.window_size = window_size
        self.att_norm = nn.LayerNorm(dim)
        self.attn = window_attention(dim=dim,window_size=window_size, num_heads=num_heads)
        self.mlp = Mlp(dim)
        self.shift_size = shift_size
        self.mlp_norm = nn.LayerNorm(dim)
        if self.shift_size > 0:
            H, W = self.resolution
            img_mask = paddle.zeros((1, H, W, 1))
            h_slices = (slice(0, -self.window_size),
                        slice(-self.window_size, -self.shift_size),
                        slice(-self.shift_size, None))
            w_slices = (slice(0, -self.window_size),
                        slice(-self.window_size, -self.shift_size),
                        slice(-self.shift_size, None))
            cnt = 0
            for h in h_slices:
                for w in w_slices:
                    img_mask[:, h, w, :] = cnt
                    cnt += 1
            mask_windows = windows_partition(img_mask, self.window_size)
            mask_windows = mask_windows.reshape((-1, self.window_size * self.window_size))
            attn_mask = mask_windows.unsqueeze(1) - mask_windows.unsqueeze(2)
            attn_mask = paddle.where(attn_mask != 0,
                                     paddle.ones_like(attn_mask) * float(-100.0),
                                     attn_mask)
            attn_mask = paddle.where(attn_mask == 0,
                                     paddle.zeros_like(attn_mask),
                                     attn_mask)
        else:
            attn_mask = None
        self.register_buffer("attn_mask", attn_mask)

    def forward(self,x):

        H,W = self.resolution
        B,N,C = x.shape
        h = x
        x = self.att_norm(x)
        x = x.reshape([B,H,W,C])
        if self.shift_size >0 :
            shift_x = paddle.roll(x,shifts=(-self.shift_size,-self.shift_size),axis=(1,2))
        else:
            shift_x = x
        x_windows = windows_partition(shift_x,self.window_size)
        x_windows = x_windows.reshape([-1,self.window_size*self.window_size,C])
        attn_windows = self.attn(x_windows,mask = self.attn_mask)
        attn_windows = attn_windows.reshape([-1,self.window_size,self.window_size,C])
        shifted_x = window_reverse(attn_windows,self.window_size,H,W)
        if self.shift_size>0:
            x = paddle.roll(shifted_x,shifts=(self.shift_size,self.shift_size),axis=(1,2))  # positive shift undoes the earlier cyclic shift
        else:
            x = shifted_x          
        x = x.reshape([B,-1,C])
        x = h+x
        h = x
        x = self.mlp_norm(x)
        x = self.mlp(x)
        x = h+x
        return x

5. Next, we chain all the modules together to build a stage

A stage consists of several Swin Transformer Blocks and one Patch Merging layer.

class SwinTransformerStage(nn.Layer):
    def __init__(self,dim,input_resolution,depth,num_heads,window_size,patch_merging= None):
        super().__init__()
        self.blocks = nn.LayerList()
        for i in range(depth):
            # print(i)
            self.blocks.append(SwinBlock(dim = dim,input_resolution=input_resolution,num_heads=num_heads,window_size=window_size,\
                        shift_size=0 if (i % 2 == 0) else window_size//2))
        if patch_merging is None:
            self.patch_merging = Identity()
        else:
            self.patch_merging = patch_merging(input_resolution,dim)
    def forward(self,x):
        for block in self.blocks:
            x = block(x)
        x = self.patch_merging(x)
        return x

class Swin(nn.Layer):
    def __init__(self, 
                 image_size=224,
                 patch_size=4,
                 in_channels=3,
                 embed_dim=96,
                 window_size=7,
                 num_heads=[3,6,12,24],
                 depths = [2,2,62],  # three stages, matching the printed model below; Swin-T in the paper uses [2,2,6,2]
                 num_classes=1000):
        super().__init__()
        self.num_classes = num_classes
        self.depths = depths
        self.num_heads = num_heads
        self.embed_dim = embed_dim
        self.num_stages = len(depths)
        self.num_features = int(self.embed_dim * 2 ** (self.num_stages - 1))
        self.patch_resolution = [image_size//patch_size,image_size//patch_size]
        self.patch_embedding = PatchEmbedding(patch_size=patch_size,embed_dim=embed_dim)
        self.stages = nn.LayerList()
        for idx,(depth,num_heads) in enumerate(zip(self.depths,num_heads)):

            stage = SwinTransformerStage(dim=int(self.embed_dim*2**idx),
                                        input_resolution=(self.patch_resolution[0]//(2**idx),
                                                          self.patch_resolution[1]//(2**idx)),
                                        depth=depth,
                                        num_heads=num_heads,
                                        window_size=window_size,
                                        patch_merging=PatchMerging if (idx < self.num_stages-1 ) else None )
            self.stages.append(stage)
        self.norm = nn.LayerNorm(self.num_features)
        self.avgpool = nn.AdaptiveAvgPool1D(1)
        self.fc = nn.Linear(self.num_features,self.num_classes)
    def forward(self,x):
        x = self.patch_embedding(x)
        for stage in self.stages:
            x = stage(x)
        x = self.norm(x)
        x = x.transpose([0,2,1])
        x = self.avgpool(x)
        x = x.flatten(1)
        x = self.fc(x)
        return x
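It helps to tabulate what each stage receives before instantiating the model. The following pure-Python sketch mirrors the arithmetic in the constructor above (dim doubles and resolution halves per stage, shift size alternates between 0 and window_size // 2). The function name `stage_configs` is hypothetical, and the depths (2, 2, 6, 2) are the Swin-T values from the paper, passed here as an assumption rather than taken from the code above.

```python
def stage_configs(image_size=224, patch_size=4, embed_dim=96, window_size=7,
                  depths=(2, 2, 6, 2), num_heads=(3, 6, 12, 24)):
    """List the per-stage settings implied by the constructor arithmetic."""
    res = image_size // patch_size
    configs = []
    for idx, (depth, heads) in enumerate(zip(depths, num_heads)):
        configs.append({
            "stage": idx,
            "dim": embed_dim * 2 ** idx,       # channels double after each Patch Merging
            "resolution": res // 2 ** idx,     # height and width are halved
            "heads": heads,
            # even blocks use W-MSA (shift 0), odd blocks use SW-MSA
            "shift_sizes": [0 if i % 2 == 0 else window_size // 2 for i in range(depth)],
            "patch_merging": idx < len(depths) - 1,
        })
    return configs

for cfg in stage_configs():
    print(cfg)
```

Because `zip` stops at the shorter sequence, passing three depths with four head counts silently builds a three-stage model, which is worth checking when the printed network has fewer stages than expected.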

6. Printing the network

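t = paddle.randn([4, 3, 224, 224])  # dummy input batch; the shape is an assumption, the original does not define t
model = Swin()
print(model)
out = model(t)
print(out.shape)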

Swin(
  (patch_embedding): PatchEmbedding(
    (patch_embed): Conv2D(3, 96, kernel_size=[4, 4], stride=[4, 4], data_format=NCHW)
    (norm): LayerNorm(normalized_shape=[96], epsilon=1e-05)
  )
  (stages): LayerList(
    (0): SwinTransformerStage(
      (blocks): LayerList(
        (0): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[96], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=96, out_features=288, dtype=float32)
            (proj): Linear(in_features=96, out_features=96, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=96, out_features=384, dtype=float32)
            (fc2): Linear(in_features=384, out_features=96, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[96], epsilon=1e-05)
        )
        (1): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[96], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=96, out_features=288, dtype=float32)
            (proj): Linear(in_features=96, out_features=96, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=96, out_features=384, dtype=float32)
            (fc2): Linear(in_features=384, out_features=96, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[96], epsilon=1e-05)
        )
      )
      (patch_merging): PatchMerging(
        (reduction): Linear(in_features=384, out_features=192, dtype=float32)
        (norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
      )
    )
    (1): SwinTransformerStage(
      (blocks): LayerList(
        (0): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[192], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=192, out_features=576, dtype=float32)
            (proj): Linear(in_features=192, out_features=192, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=192, out_features=768, dtype=float32)
            (fc2): Linear(in_features=768, out_features=192, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[192], epsilon=1e-05)
        )
        (1): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[192], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=192, out_features=576, dtype=float32)
            (proj): Linear(in_features=192, out_features=192, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=192, out_features=768, dtype=float32)
            (fc2): Linear(in_features=768, out_features=192, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[192], epsilon=1e-05)
        )
      )
      (patch_merging): PatchMerging(
        (reduction): Linear(in_features=768, out_features=384, dtype=float32)
        (norm): LayerNorm(normalized_shape=[768], epsilon=1e-05)
      )
    )
    (2): SwinTransformerStage(
      (blocks): LayerList(
        (0): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (1): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (2): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (3): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (4): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (5): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (6): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (7): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (8): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (9): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (10): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (11): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (12): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (13): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (14): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (15): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (16): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (17): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (18): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (19): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (20): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (21): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (22): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (23): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (24): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (25): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (26): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (27): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (28): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (29): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (30): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (31): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (32): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (33): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (34): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (35): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (36): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (37): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (38): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (39): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (40): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (41): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (42): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (43): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (44): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (45): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (46): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (47): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (48): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (49): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (50): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (51): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (52): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (53): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (54): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (55): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (56): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (57): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (58): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (59): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (60): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
        (61): SwinBlock(
          (att_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
          (attn): window_attention(
            (softmax): Softmax(axis=-1)
            (qkv): Linear(in_features=384, out_features=1152, dtype=float32)
            (proj): Linear(in_features=384, out_features=384, dtype=float32)
          )
          (mlp): Mlp(
            (fc1): Linear(in_features=384, out_features=1536, dtype=float32)
            (fc2): Linear(in_features=1536, out_features=384, dtype=float32)
            (dropout): Dropout(p=0.0, axis=None, mode=upscale_in_train)
            (act): GELU(approximate=False)
          )
          (mlp_norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
        )
      )
      (patch_merging): Identity()
    )
  )
  (norm): LayerNorm(normalized_shape=[384], epsilon=1e-05)
  (avgpool): AdaptiveAvgPool1D(output_size=1)
  (fc): Linear(in_features=384, out_features=1000, dtype=float32)
)



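---------------------------------------------------------------------------
NameError                                 Traceback (most recent call last)
/tmp/ipykernel_790/2976751405.py in <module>
      1 model = Swin()
      2 print(model)
----> 3 out = model(t)
      4 print(out.shape)

NameError: name 't' is not defined

The forward pass fails only because the input tensor t was never defined in this cell; the model itself is constructed and printed without error. Create an input batch of shape [B, 3, H, W] (for example with paddle.randn) before calling model(t).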

7. About Relative Position Bias

You can refer to this write-up
or to this video
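
As a rough illustration of the idea: in window attention, every pair of tokens inside an M x M window receives a learnable bias that depends only on their relative offset. Each offset lies in [-(M-1), M-1] along both axes, so one head needs a table of (2M-1)^2 bias values, plus an index that maps every token pair to a table entry. The sketch below builds that index in plain Python rather than Paddle; the function name relative_position_index is hypothetical and is not taken from the code in this post.

```python
def relative_position_index(window_size):
    """Map every (query, key) token pair in a window to a bias-table slot.

    Returns an (M*M) x (M*M) nested list whose entries lie in
    [0, (2M-1)**2 - 1], where M is the window size.
    """
    m = window_size
    # Token coordinates in row-major order, matching a flattened window.
    coords = [(r, c) for r in range(m) for c in range(m)]
    index = []
    for ri, ci in coords:          # query token
        row = []
        for rj, cj in coords:      # key token
            dr = ri - rj + (m - 1)  # shift row offset into 0..2M-2
            dc = ci - cj + (m - 1)  # shift column offset into 0..2M-2
            # Flatten the 2-D offset into a single table index.
            row.append(dr * (2 * m - 1) + dc)
        index.append(row)
    return index


if __name__ == "__main__":
    for row in relative_position_index(2):
        print(row)
    # [4, 3, 1, 0]
    # [5, 4, 2, 1]
    # [7, 6, 4, 3]
    # [8, 7, 5, 4]
```

In a Paddle implementation, one would presumably store this index as a fixed tensor, use it to gather from a learnable parameter of shape [(2M-1)^2, num_heads], and add the result to the attention scores before the softmax. That wiring is an assumption here, not something shown above.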

8. References

Code reference

Video reference

Blog reference

文章总览：YuanDaiMa2048博客文章总览深度学习之优化器1.随机梯度下降（SGD）2.动量优化（Momentum）3.自适应梯度（Adagrad）4.自适应矩估计（Adam）5.RMSprop总结其他介绍在深度学习中，优化器用于更新模型的参数，以最小化损失函数。常见的优化函数有很多种，下面是几种主流的优化器及其特点、原理和PyTorch实现：1.随机梯度下降（SGD）原理:随机梯度下降通过
生成式地图制图 Bwywb_3 深度学习机器学习深度学习生成对抗网络
生成式地图制图（GenerativeCartography）是一种利用生成式算法和人工智能技术自动创建地图的技术。它结合了传统的地理信息系统（GIS）技术与现代生成模型（如深度学习、GANs等），能够根据输入的数据自动生成符合需求的地图。这种方法在城市规划、虚拟环境设计、游戏开发等多个领域具有应用前景。主要特点：自动化生成：通过算法和模型，系统能够根据输入的地理或空间数据自动生成地图，而无需人工逐
【大模型应用开发动手做AI Agent】第一轮行动：工具执行搜索 AI大模型应用之禅计算科学神经计算深度学习神经网络大数据人工智能大型语言模型 AI AGI LLM Java Python 架构设计 Agent RPA
【大模型应用开发动手做AIAgent】第一轮行动：工具执行搜索作者：禅与计算机程序设计艺术/ZenandtheArtofComputerProgramming1.背景介绍1.1问题的由来随着人工智能技术的飞速发展，大模型应用开发已经成为当下热门的研究方向。AIAgent作为人工智能领域的一个重要分支，旨在模拟人类智能行为，实现智能决策和自主行动。在AIAgent的构建过程中，工具执行搜索是至关重要
未来软件市场是怎么样的？做开发的生存空间如何？ cesske 软件需求
目录前言一、未来软件市场的发展趋势二、软件开发人员的生存空间前言未来软件市场是怎么样的？做开发的生存空间如何？一、未来软件市场的发展趋势技术趋势：人工智能与机器学习：随着技术的不断成熟，人工智能将在更多领域得到应用，如智能客服、自动驾驶、智能制造等，这将极大地推动软件市场的增长。云计算与大数据：云计算服务将继续普及，大数据技术的应用也将更加广泛。企业将更加依赖云计算和大数据来优化运营、提升效率，并
轻量级模型解读——轻量transformer系列 lishanlu136 #图像分类轻量级模型 transformer 图像分类
先占坑，持续更新。。。文章目录1、DeiT2、ConViT3、Mobile-Former4、MobileViTTransformer是2017谷歌提出的一篇论文，最早应用于NLP领域的机器翻译工作，Transformer解读，但随着2020年DETR和ViT的出现(DETR解读，ViT解读)，其在视觉领域的应用也如雨后春笋般渐渐出现，其特有的全局注意力机制给图像识别领域带来了重要参考。但是tran
吴恩达深度学习笔记(30)-正则化的解释极客Array
正则化（Regularization）深度学习可能存在过拟合问题——高方差，有两个解决方法，一个是正则化，另一个是准备更多的数据，这是非常可靠的方法，但你可能无法时时刻刻准备足够多的训练数据或者获取更多数据的成本很高，但正则化通常有助于避免过拟合或减少你的网络误差。如果你怀疑神经网络过度拟合了数据，即存在高方差问题，那么最先想到的方法可能是正则化，另一个解决高方差的方法就是准备更多数据，这也是非常
个人学习笔记7-6：动手学深度学习pytorch版-李沐浪子L 深度学习深度学习笔记计算机视觉 python 人工智能神经网络 pytorch
#人工智能##深度学习##语义分割##计算机视觉##神经网络#计算机视觉13.11全卷积网络全卷积网络（fullyconvolutionalnetwork，FCN）采用卷积神经网络实现了从图像像素到像素类别的变换。引入l转置卷积（transposedconvolution）实现的，输出的类别预测与输入图像在像素级别上具有一一对应关系：通道维的输出即该位置对应像素的类别预测。13.11.1构造模型下
Rust 所有权简介东离与糖宝 rust 后端 rust 开发语言
文章目录发现宝藏1.所有权基本概念2.所有权规则3.变量作用域4.栈与堆4.1栈（Stack）4.2堆（Heap）5.String类型5.1String类型5.2String的内存分配5.3所有权与内存管理5.4String与切片6.变量与数据交互方式6.1移动（Move）6.2.克隆（Clone）7.所有权与函数7.1.传递参数7.2.返回值总结发现宝藏前些天发现了一个巨牛的人工智能学习网站，通
深度学习-点击率预估-研究论文2024-09-14速读 sp_fyf_2024 深度学习人工智能
深度学习-点击率预估-研究论文2024-09-14速读1.DeepTargetSessionInterestNetworkforClick-ThroughRatePredictionHZhong,JMa,XDuan,SGu,JYao-2024InternationalJointConferenceonNeuralNetworks,2024深度目标会话兴趣网络用于点击率预测摘要：这篇文章提出了一种新
机器学习流形数据降维：UMAP 降维算法小嗷犬 Python 机器学习 #数据分析及可视化机器学习算法人工智能
✅作者简介：人工智能专业本科在读，喜欢计算机与编程，写博客记录自己的学习历程。个人主页：小嗷犬的个人主页个人网站：小嗷犬的技术小站个人信条：为天地立心，为生民立命，为往圣继绝学，为万世开太平。本文目录UMAP简介理论基础特点与优势应用场景在Python中使用UMAP安装umap-learn库使用UMAP可视化手写数字数据集UMAP简介UMAP（UniformManifoldApproximatio
损失函数与反向传播 Star_. PyTorch pytorch 深度学习 python
损失函数定义与作用损失函数(lossfunction)在深度学习领域是用来计算搭建模型预测的输出值和真实值之间的误差。1.损失函数越小越好2.计算实际输出与目标之间的差距3.为更新输出提供依据（反向传播)常见的损失函数回归常见的损失函数有：均方差（MeanSquaredError，MSE）、平均绝对误差（MeanAbsoluteErrorLoss，MAE）、HuberLoss是一种将MSE与MAE
python画出分子化学空间分布（UMAP） Sakaiay python
利用umap画出分子化学空间分布图安装pipinstallumap-learn下面是用一个数据集举的例子importtorchimportumapimportpandasaspdimportnumpyasnpimportmatplotlib.pyplotaspltimportseabornassnsfromsklearn.manifoldimportTSNEfromrdkit.Chemimport
探索创新科技： Lite-Mono - 简约高效的小型化Mono框架杭律沛Meris
探索创新科技：Lite-Mono-简约高效的小型化Mono框架Lite-Mono[CVPR2023]Lite-Mono:ALightweightCNNandTransformerArchitectureforSelf-SupervisedMonocularDepthEstimation项目地址:https://gitcode.com/gh_mirrors/li/Lite-Mono如果你在寻找一个轻
如何做好人生的选择题？百科全书式天才——赫伯特·西蒙给你答案伽马有话说
赫伯特·西蒙是谁？想必知道的人非常少。但当看到他的履历后，相信没有人再怀疑他是个“天才”。西蒙出生于1916年6月15日，是个美国人，他的名字全称为赫伯特·亚历山大·西蒙，在2001年2月9日与世长辞，在这84年的岁月中，西蒙以27岁时取得的政治学博士学位为开端，先后步入了政治学、管理学、认知心理学、信息科学、人工智能、科学哲学、应用数学、统计学、运筹学、控制论、数理经济学、公共管理等领域，在这些
【安装环境】配置MMTracking环境 xuanyu22 安装环境机器学习神经网络深度学习 python
版本v0.14.0安装torchnumpy的版本不能太高，否则后面安装时会发生冲突。先安装numpy，因为pytorch的安装会自动配置高版本numpy。condainstallnumpy=1.21.5mmtracking支持的torch版本有限，需要找到合适的condainstallpytorch==1.11.0torchvision==0.12.0cudatoolkit=10.2-cpytor
Python(PyTorch)和MATLAB及Rust和C++结构相似度指数测量导图亚图跨际 Python 交叉知识算法量化检查图像压缩质量低分辨率多光谱峰值信噪比端到端优化图像压缩手术机器人三维实景实时可微分渲染重建三维可视化
要点量化检查图像压缩质量低分辨率多光谱和高分辨率图像实现超分辨率分析图像质量图像索引/多尺度结构相似度指数和光谱角映射器及视觉信息保真度多种指标峰值信噪比和结构相似度指数测量结构相似性图像分类PNG和JPEG图像相似性近似算法图像压缩，视频压缩、端到端优化图像压缩、神经图像压缩、GPU变速图像压缩手术机器人深度估计算法重建三维可视化推理图像超分辨率算法模型三维实景实时可微分渲染算法MATLAB结构
软件测试/测试开发/全日制 |利用Django REST framework构建微服务霍格沃兹-慕漓 django 微服务 sqlite
霍格沃兹测试开发学社推出了《Python全栈开发与自动化测试班》。本课程面向开发人员、测试人员与运维人员，课程内容涵盖Python编程语言、人工智能应用、数据分析、自动化办公、平台开发、UI自动化测试、接口测试、性能测试等方向。为大家提供更全面、更深入、更系统化的学习体验，课程还增加了名企私教服务内容，不仅有名企经理为你1v1辅导，还有行业专家进行技术指导，针对性地解决学习、工作中遇到的难题。让找
【深度学习】训练过程中一个OOM的问题，太难查了 weixin_40293999 深度学习深度学习人工智能
现象：各位大佬又遇到过ubuntu的这个问题么？现象是在训练过程中，ssh上不去了，能ping通，没死机，但是ubunutu的pc侧的显示器，鼠标啥都不好用了。只能重启。问题原因：OOM了95G，尼玛！！！！pytorch爆内存了，然后journald假死了，在journald被watchdog干掉之后，系统就崩溃了。这种规模的爆内存一般，即使被oomkill了，也要卡半天的，确实会这样，能不能配
TOMCAT在POST方法提交参数丢失问题 357029540 java tomcat jsp
摘自http://my.oschina.net/luckyi/blog/213209 昨天在解决一个BUG时发现一个奇怪的问题，一个AJAX提交数据在之前都是木有问题的，突然提交出错影响其他处理流程。检查时发现页面处理数据较多，起初以为是提交顺序不正确修改后发现不是由此问题引起。于是删除掉一部分数据进行提交，较少数据能够提交成功。恢复较多数据后跟踪提交FORM DATA ，发现数
在MyEclipse中增加JSP模板删除-2008-08-18 ljy325 jsp xml MyEclipse
在D:\Program Files\MyEclipse 6.0\myeclipse\eclipse\plugins\com.genuitec.eclipse.wizards_6.0.1.zmyeclipse601200710\templates\jsp 目录下找到Jsp.vtl，复制一份，重命名为jsp2.vtl,然后把里面的内容修改为自己想要的格式，保存。然后在 D:\Progr
JavaScript常用验证脚本总结 eksliang JavaScript javaScript表单验证
转载请出自出处：http://eksliang.iteye.com/blog/2098985 下面这些验证脚本，是我在这几年开发中的总结，今天把他放出来，也算是一种分享吧，现在在我的项目中也在用！包括日期验证、比较，非空验证、身份证验证、数值验证、Email验证、电话验证等等...! &nb
微软BI（4） 18289753290 微软BI SSIS
1） Q:查看ssis里面某个控件输出的结果： A MessageBox.Show(Dts.Variables["v_lastTimestamp"].Value.ToString()); 这是我们在包里面定义的变量 2):在关联目的端表的时候如果是一对多的关系，一定要选择唯一的那个键作为关联字段。 3) Q：ssis里面如果将多个数据源的数据插入目的端一
定时对大数据量的表进行分表对数据备份酷的飞上天空大数据量
工作中遇到数据库中一个表的数据量比较大，属于日志表。正常情况下是不会有查询操作的，但如果不进行分表数据太多，执行一条简单sql语句要等好几分钟。。分表工具：linux的shell + mysql自身提供的管理命令原理：使用一个和原表数据结构一样的表，替换原表。 linux shell内容如下： =======================开始
本质的描述与因材施教永夜-极光感想随笔
不管碰到什么事,我都下意识的想去探索本质,找寻一个最形象的描述方式。我坚信,世界上对一件事物的描述和解释,肯定有一种最形象,最贴近本质,最容易让人理解 &
很迷茫。。。随便小屋随笔
小弟我今年研一，也是从事的咱们现在最流行的专业（计算机）。本科三流学校，为了能有个更好的跳板，进入了考研大军，非常有幸能进入研究生的行业（具体学校就不说了，怕把学校的名誉给损了）。先说一下自身的条件，本科专业软件工程。主要学习就是软件开发，几乎和计算机没有什么区别。因为学校本身三流，也就是让老师带着学生学点东西，然后让学生毕业就行了。对专业性的东西了解的非常浅。就那学的语言来说
23种设计模式的意图和适用范围 aijuans 设计模式
Factory Method 意图定义一个用于创建对象的接口，让子类决定实例化哪一个类。Factory Method 使一个类的实例化延迟到其子类。　　适用性当一个类不知道它所必须创建的对象的类的时候。　　当一个类希望由它的子类来指定它所创建的对象的时候。　　当类将创建对象的职责委托给多个帮助子类中的某一个，并且你希望将哪一个帮助子类是代理者这一信息局部化的时候。 Abstr
Java中的synchronized和volatile aoyouzi java volatile synchronized
说到Java的线程同步问题肯定要说到两个关键字synchronized和volatile。说到这两个关键字，又要说道JVM的内存模型。JVM里内存分为main memory和working memory。 Main memory是所有线程共享的，working memory则是线程的工作内存，它保存有部分main memory变量的拷贝，对这些变量的更新直接发生在working memo
js数组的操作和this关键字百合不是茶 js 数组操作 this关键字
js数组的操作; 一:数组的创建: 1、数组的创建 var array = new Array();　//创建一个数组 var array = new Array([size]);　//创建一个数组并指定长度，注意不是上限，是长度 var arrayObj = new Array([element0[, element1[, ...[, elementN]]]
别人的阿里面试感悟 bijian1013 面试分享工作感悟阿里面试
原文如下：http://greemranqq.iteye.com/blog/2007170 一直做企业系统，虽然也自己一直学习技术，但是感觉还是有所欠缺，准备花几个月的时间，把互联网的东西，以及一些基础更加的深入透析，结果这次比较意外，有点突然，下面分享一下感受吧！ &nb
淘宝的测试框架Itest Bill_chen spring maven 框架单元测试 JUnit
Itest测试框架是TaoBao测试部门开发的一套单元测试框架，以Junit4为核心，集合DbUnit、Unitils等主流测试框架，应该算是比较好用的了。近期项目中用了下，有关itest的具体使用如下： 1.在Maven中引入itest框架： <dependency> <groupId>com.taobao.test</groupId&g
【Java多线程二】多路条件解决生产者消费者问题 bit1129 java多线程
package com.tom; import java.util.LinkedList; import java.util.Queue; import java.util.concurrent.ThreadLocalRandom; import java.util.concurrent.locks.Condition; import java.util.concurrent.loc
汉字转拼音pinyin4j 白糖_ pinyin4j
以前在项目中遇到汉字转拼音的情况，于是在网上找到了pinyin4j这个工具包，非常有用，别的不说了，直接下代码： import java.util.HashSet; import java.util.Set; import net.sourceforge.pinyin4j.PinyinHelper; import net.sourceforge.pinyin
org.hibernate.TransactionException: JDBC begin failed解决方案 bozch ssh 数据库异常 DBCP
org.hibernate.TransactionException: JDBC begin failed: at org.hibernate.transaction.JDBCTransaction.begin(JDBCTransaction.java:68) at org.hibernate.impl.SessionImp
java-并查集（Disjoint-set）-将多个集合合并成没有交集的集合 bylijinnan java
import java.util.ArrayList; import java.util.Arrays; import java.util.HashMap; import java.util.HashSet; import java.util.Iterator; import java.util.List; import java.util.Map; import java.ut
Java PrintWriter打印乱码 chenbowen00 java
一个小程序读写文件，发现PrintWriter输出后文件存在乱码，解决办法主要统一输入输出流编码格式。读文件： BufferedReader 从字符输入流中读取文本，缓冲各个字符，从而提供字符、数组和行的高效读取。可以指定缓冲区的大小，或者可使用默认的大小。大多数情况下，默认值就足够大了。通常，Reader 所作的每个读取请求都会导致对基础字符或字节流进行相应的读取请求。因
[天气与气候]极端气候环境 comsci 环境
如果空间环境出现异变...外星文明并未出现,而只是用某种气象武器对地球的气候系统进行攻击,并挑唆地球国家间的战争,经过一段时间的准备...最大限度的削弱地球文明的整体力量,然后再进行入侵...... 那么地球上的国家应该做什么样的防备工作呢? &n
oracle order by与union一起使用的用法 daizj UNION oracle order by
当使用union操作时，排序语句必须放在最后面才正确，如下：只能在union的最后一个子查询中使用order by，而这个order by是针对整个unioning后的结果集的。So：如果unoin的几个子查询列名不同，如 Sql代码 select supplier_id, supplier_name from suppliers UNI
zeus持久层读写分离单元测试 deng520159 单元测试
本文是zeus读写分离单元测试,距离分库分表,只有一步了.上代码: 1.ZeusMasterSlaveTest.java package com.dengliang.zeus.webdemo.test; import java.util.ArrayList; import java.util.List; import org.junit.Assert; import org.j
Yii 截取字符串(UTF-8) 使用组件 dcj3sjt126com yii
1.将Helper.php放进protected\components文件夹下。 2.调用方法： Helper::truncate_utf8_string($content,20,false); //不显示省略号 Helper::truncate_utf8_string($content,20); //显示省略号 &n
安装memcache及php扩展 dcj3sjt126com PHP
安装memcache tar zxvf memcache-2.2.5.tgz cd memcache-2.2.5/ /usr/local/php/bin/phpize (?) ./configure --with-php-confi
JsonObject 处理日期 feifeilinlin521 java json JsonOjbect JsonArray JSONException
写这边文章的初衷就是遇到了json在转换日期格式出现了异常 net.sf.json.JSONException: java.lang.reflect.InvocationTargetException 原因是当你用Map接收数据库返回了java.sql.Date 日期的数据进行json转换出的问题话不多说直接上代码 &n
Ehcache（06）——监听器 234390216 监听器 listener ehcache
监听器 Ehcache中监听器有两种，监听CacheManager的CacheManagerEventListener和监听Cache的CacheEventListener。在Ehcache中，Listener是通过对应的监听器工厂来生产和发生作用的。下面我们将来介绍一下这两种类型的监听器。
activiti 自带设计器中chrome 34版本不能打开bug的解决 jackyrong Activiti
在acitivti modeler中，如果是chrome 34，则不能打开该设计器，其他浏览器可以，经证实为bug，参考 http://forums.activiti.org/content/activiti-modeler-doesnt-work-chrome-v34 修改为，找到 oryx.debug.js 在最头部增加 if (!Document.
微信收货地址共享接口-终极解决 laotu5i0 微信开发
最近要接入微信的收货地址共享接口，总是不成功，折腾了好几天，实在没办法网上搜到的帖子也是骂声一片。我把我碰到并解决问题的过程分享出来，希望能给微信的接口文档起到一个辅助作用，让后面进来的开发者能快速的接入，而不需要像我们一样苦逼的浪费好几天，甚至一周的青春。各种羞辱、谩骂的话就不说了，本人还算文明。如果你能搜到本贴，说明你已经碰到了各种 ed
关于人才 netkiller.github.com 工作面试招聘 netkiller 人才
关于人才每个月我都会接到许多猎头的电话，有些猎头比较专业，但绝大多数在我看来与猎头二字还是有很大差距的。与猎头接触多了，自然也了解了他们的工作，包括操作手法，总体上国内的猎头行业还处在初级阶段。总结就是“盲目推荐，以量取胜”。目前现状许多从事人力资源工作的人，根本不懂得怎么找人才。处在人才找不到企业，企业找不到人才的尴尬处境。企业招聘，通常是需要用人的部门提出招聘条件，由人
搭建 CentOS 6 服务器 - 目录 rensanning centos
(1) 安装CentOS ISO（desktop/minimal）、Cloud（AWS/阿里云）、Virtualization（VMWare、VirtualBox）详细内容 (2) Linux常用命令 cd、ls、rm、chmod...... 详细内容 (3) 初始环境设置用户管理、网络设置、安全设置...... 详细内容 (4) 常驻服务Daemon
【求助】mongoDB无法更新主键 toknowme mongodb
Query query = new Query(); query.addCriteria(new Criteria("_id").is(o.getId())); &n
jquery 页面滚动到底部自动加载插件集合 xp9802 jquery
很多社交网站都使用无限滚动的翻页技术来提高用户体验，当你页面滑到列表底部时候无需点击就自动加载更多的内容。下面为你推荐 10 个 jQuery 的无限滚动的插件： 1. jQuery ScrollPagination jQuery ScrollPagination plugin 是一个 jQuery 实现的支持无限滚动加载数据的插件。 2. jQuery Screw S