u012841335

Stacked Autoencoders

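This post follows the UFLDL page Stacked Autoencoders. A stacked autoencoder (see the page Autoencoder and Sparsity and my earlier post on autoencoders and sparsity) is a multi-layer autoencoder: the hidden-layer output of one autoencoder is used as the input of the next. In effect, the encoder halves of several autoencoders are stacked on top of each other, followed by the corresponding decoder halves, which gives an autoencoder with several hidden layers. This post introduces stacked autoencoders and how to fine-tune them, then uses a stacked autoencoder to recognize MNIST digits.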

1. Overview of stacked autoencoders

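The earlier post Self-Taught Learning to Deep Networks explained that a deep network can be trained with greedy layer-wise training, one hidden layer at a time. Each layer can be trained with supervision (for example, feeding each hidden layer into softmax regression and using the classification error) or without supervision (for example, with a sparse autoencoder). Here we use the unsupervised sparse autoencoder to learn the features of each hidden layer. Because the network consists of several sparse autoencoders trained layer by layer, it is called a stacked autoencoder.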

The encoding steps of a stacked autoencoder network are:
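In the UFLDL tutorial's notation (assumed here: $W^{(k,1)}$ and $b^{(k,1)}$ are the encoder weights and bias of the $k$-th autoencoder, and $f$ is the activation function), encoding runs the encoders in forward order, roughly as:

```latex
a^{(l)} = f\left(z^{(l)}\right), \qquad
z^{(l+1)} = W^{(l,1)} a^{(l)} + b^{(l,1)}
```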

The decoding steps are:
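In the UFLDL tutorial's notation (assumed here: $W^{(k,2)}$ and $b^{(k,2)}$ are the decoder weights and bias of the $k$-th autoencoder, $f$ is the activation function, and $n$ indexes the deepest hidden layer), decoding runs the decoders in reverse order, roughly as:

```latex
a^{(n+l)} = f\left(z^{(n+l)}\right), \qquad
z^{(n+l+1)} = W^{(n-l,2)} a^{(n+l)} + b^{(n-l,2)}
```

Under this convention the activation $a^{(n)}$ of the deepest hidden layer is the highest-order representation of the input.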

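This is similar to the neural network that Hinton built from stacked RBMs (the paper was published in Science in 2006 and is worth a look):

The only difference is that we use sparse autoencoders here instead of RBMs.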

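If we feed the last hidden layer, i.e. the highest-order feature representation of the raw data, into a softmax regression model, we get a classifier. Putting the whole network together gives: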
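As a rough sketch of that combined model, the forward pass chains the two encoders and then applies softmax. The sizes and random weights below are hypothetical toy values, not the ones used in the exercise:

```matlab
% Toy sizes (hypothetical; the exercise itself uses 784-200-200-10).
inputSize = 6; hidden1 = 4; hidden2 = 3; numClasses = 2; m = 5;
sigmoid = @(z) 1 ./ (1 + exp(-z));

% Random stand-ins for the trained encoder and softmax weights.
W1 = 0.1 * randn(hidden1, inputSize);  b1 = zeros(hidden1, 1);
W2 = 0.1 * randn(hidden2, hidden1);    b2 = zeros(hidden2, 1);
softmaxTheta = 0.1 * randn(numClasses, hidden2);

x  = rand(inputSize, m);                     % one example per column
h1 = sigmoid(bsxfun(@plus, W1 * x,  b1));    % first-order features
h2 = sigmoid(bsxfun(@plus, W2 * h1, b2));    % second-order features

scores = softmaxTheta * h2;
scores = bsxfun(@minus, scores, max(scores, [], 1));   % numerical stability
p = bsxfun(@rdivide, exp(scores), sum(exp(scores), 1)); % class probabilities

[~, pred] = max(p, [], 1);                   % predicted class per example
```

Each column of `p` sums to 1, and `pred` holds one label per input column.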

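A stacked autoencoder has greater expressive power and enjoys all the benefits of deep networks. An autoencoder tends to learn a feature representation of its input, so in a stacked autoencoder the first layer tends to learn first-order features, the second layer second-order features, and so on. For images, the first layer may learn edges, the second layer may learn how to combine edges into contours and points, and higher layers may learn more concrete and more meaningful features. Such features make it easier to process images, for example in classification and retrieval.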

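2. Fine-tuning stacked autoencoders

As mentioned before, fine-tuning can improve what a deep network learns: starting from the pre-trained parameters, it slightly adjusts the weights of every layer to fit the data better. How is this done? With backpropagation (see the page Backpropagation Algorithm and my post on neural networks). Fine-tuning with backpropagation proceeds as follows:

Note that in step 2, when differentiating at the output layer, i.e. the layer that feeds into softmax, the loss is not the squared-error loss of standard backpropagation but the softmax loss, differentiated with respect to its input x. Working through the derivation carefully yields that expression.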
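Written out to match the stackedAECost.m listing later in this post, the quantities would be as follows. The symbols are my own shorthand: $\theta$ is the softmax weight matrix, $I$ the one-hot label matrix, $P$ the predicted class probabilities, $a^{(n)}$ the last hidden layer, $W^{(l)}$ the weights from layer $l$ to layer $l+1$, $m$ the number of examples, and $\odot$ the element-wise product.

```latex
\begin{aligned}
J &= -\frac{1}{m}\sum_{i=1}^{m}\sum_{k} I_{ki}\,\log P_{ki}
     + \frac{\lambda}{2}\,\lVert\theta\rVert_F^2 \\
\nabla_{\theta} J &= -\frac{1}{m}\,(I - P)\,\bigl(a^{(n)}\bigr)^{T} + \lambda\,\theta \\
\delta^{(n)} &= -\bigl(\theta^{T}(I - P)\bigr) \odot a^{(n)} \odot \bigl(1 - a^{(n)}\bigr) \\
\delta^{(l)} &= \bigl((W^{(l)})^{T}\,\delta^{(l+1)}\bigr) \odot a^{(l)} \odot \bigl(1 - a^{(l)}\bigr) \\
\nabla_{W^{(l)}} J &= \frac{1}{m}\,\delta^{(l+1)}\bigl(a^{(l)}\bigr)^{T},
\qquad
\nabla_{b^{(l)}} J = \frac{1}{m}\sum_{i=1}^{m}\delta^{(l+1)}_{:,i}
\end{aligned}
```

The third line is where the softmax loss enters: the derivative of the cross-entropy term with respect to $a^{(n)}$ is $-\theta^{T}(I - P)$, and multiplying by the sigmoid derivative $a^{(n)} \odot (1 - a^{(n)})$ gives the error term with respect to $z^{(n)}$.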

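3. Exercise: Implement deep networks for digit classification

This exercise classifies MNIST digits with a stacked autoencoder with two hidden layers followed by softmax.

Steps:

Initialize the parameters;
Train the first autoencoder on the raw data, then compute the L1 features;
Train the second autoencoder on the L1 features, then compute the L2 features;
Train the softmax classifier on the L2 features;
Fine-tune the parameters of the combined stacked autoencoder + softmax model with backpropagation;
Test the model.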

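Results:

Displaying the encoding weights of the first autoencoder with the display_network function gives the following image:

As for the second autoencoder, its input layer has size 200, which cannot be shown as a square image, and I have not yet figured out how to display the second-layer features.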
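One common workaround, which is not from the tutorial and is only a linear approximation because it ignores the sigmoid between the layers, is to project the second-layer weights back into pixel space by multiplying them with the first-layer weights. A minimal sketch, with random matrices standing in for the trained weights (in the exercise code these would be stack{2}.w and stack{1}.w):

```matlab
% Hypothetical stand-ins for the trained encoder weights:
% W1 is hiddenSizeL1 x inputSize, W2 is hiddenSizeL2 x hiddenSizeL1.
inputSize = 28 * 28; hiddenSizeL1 = 200; hiddenSizeL2 = 200;
W1 = 0.01 * randn(hiddenSizeL1, inputSize);
W2 = 0.01 * randn(hiddenSizeL2, hiddenSizeL1);

% Each row of W2 * W1 lives in pixel space (1 x 784).
W21 = W2 * W1;

% Reshape one projected filter into a 28x28 image and rescale to [0, 1].
k = 1;
img = reshape(W21(k, :), 28, 28);
img = (img - min(img(:))) / (max(img(:)) - min(img(:)));

% With trained weights, display_network(W21') from the earlier exercises
% should tile all projected filters, just as display_network(W1') does
% for the first layer.
```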

The final results are:

Before Finetuning Test Accuracy: 92.080%
After Finetuning Test Accuracy: 98.210%

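The tutorial says the cost function of the whole model should not include the weight-decay term for the W of the autoencoder layers, only the one for the softmax layer. I do not fully understand why. With the autoencoder penalty added as well, I got: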

Before Finetuning Test Accuracy: 91.750%
After Finetuning Test Accuracy: 97.430%
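If the autoencoder weights are penalized too, the cost term and the gradients have to change together, and the penalty has to be the sum of squared entries. The sketch below uses a small hypothetical matrix to show the difference between squaring the column sums and summing the squares, and what would be added to each layer's gradient:

```matlab
W = [1 -1; 2 -2];                 % hypothetical layer weights

wrongPenalty = sum(sum(W).^2);    % squares the column sums: 3^2 + (-3)^2 = 18
rightPenalty = sum(sum(W.^2));    % sum of squared entries: 1 + 1 + 4 + 4 = 10

lambda   = 1e-4;
costTerm = lambda / 2 * rightPenalty;   % added to the cost
gradTerm = lambda * W;                  % added to that layer's weight gradient
```

In stackedAECost.m this would mean accumulating `sum(sum(stack{d}.w.^2))` into the penalty and adding `lambda * stack{d}.w` to `stackgrad{d}.w` for every layer d.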

MATLAB code:

stackedAEExercise.m

%% CS294A/CS294W Stacked Autoencoder Exercise

%  Instructions
%  ------------
% 
%  This file contains code that helps you get started on the
%  stacked autoencoder exercise. You will need to complete code in
%  stackedAECost.m
%  You will also need to have implemented sparseAutoencoderCost.m and 
%  softmaxCost.m from previous exercises. You will need the initializeParameters.m
%  loadMNISTImages.m, and loadMNISTLabels.m files from previous exercises.
%  
%  For the purpose of completing the assignment, you do not need to
%  change the code in this file. 
%
%%======================================================================
%% STEP 0: Here we provide the relevant parameters values that will
%  allow your sparse autoencoder to get good filters; you do not need to 
%  change the parameters below.

inputSize = 28 * 28;
numClasses = 10;
hiddenSizeL1 = 200;    % Layer 1 Hidden Size
hiddenSizeL2 = 200;    % Layer 2 Hidden Size
sparsityParam = 0.1;   % desired average activation of the hidden units.
                       % (This was denoted by the Greek alphabet rho, which looks like a lower-case "p",
		               %  in the lecture notes). 
lambda = 3e-3;         % weight decay parameter       
beta = 3;              % weight of sparsity penalty term       

%%======================================================================
%% STEP 1: Load data from the MNIST database
%
%  This loads our training data from the MNIST database files.

% Load MNIST database files
trainData = loadMNISTImages('mnist/train-images-idx3-ubyte');
trainLabels = loadMNISTLabels('mnist/train-labels-idx1-ubyte');

trainLabels(trainLabels == 0) = 10; % Remap 0 to 10 since our labels need to start from 1

% Add the minFunc directory (L-BFGS implementation) to the path
addpath minFunc/
%%======================================================================
%% STEP 2: Train the first sparse autoencoder
%  This trains the first sparse autoencoder on the unlabelled STL training
%  images.
%  If you've correctly implemented sparseAutoencoderCost.m, you don't need
%  to change anything here.


%  Randomly initialize the parameters
sae1Theta = initializeParameters(hiddenSizeL1, inputSize);

%% ---------------------- YOUR CODE HERE  ---------------------------------
%  Instructions: Train the first layer sparse autoencoder, this layer has
%                an hidden size of "hiddenSizeL1"
%                You should store the optimal parameters in sae1OptTheta
% Train the first autoencoder
sae1OptTheta = sae1Theta;
options.Method = 'lbfgs';
options.maxIter = 400;
options.display = 'on';

[sae1OptTheta, cost] = minFunc( @(p) sparseAutoencoderCost(p, ...
                                    inputSize, hiddenSizeL1, ...
                                    lambda, sparsityParam, ...
                                    beta, trainData), ...
                                    sae1Theta, options);

% -------------------------------------------------------------------------
fprintf('First autoencoder trained\n');


%%======================================================================
%% STEP 2: Train the second sparse autoencoder
%  This trains the second sparse autoencoder on the first autoencoder
%  features.
%  If you've correctly implemented sparseAutoencoderCost.m, you don't need
%  to change anything here.
% Use the first autoencoder's encoder to get the first-order representation of the input
[sae1Features] = feedForwardAutoencoder(sae1OptTheta, hiddenSizeL1, ...
                                        inputSize, trainData);

%  Randomly initialize the parameters
sae2Theta = initializeParameters(hiddenSizeL2, hiddenSizeL1);

%% ---------------------- YOUR CODE HERE  ---------------------------------
%  Instructions: Train the second layer sparse autoencoder, this layer has
%                an hidden size of "hiddenSizeL2" and an inputsize of
%                "hiddenSizeL1"
%
%                You should store the optimal parameters in sae2OptTheta
% Train the second autoencoder
sae2OptTheta = sae2Theta;

[sae2OptTheta, cost] = minFunc( @(p) sparseAutoencoderCost(p, ...
                                    hiddenSizeL1, hiddenSizeL2, ...
                                    lambda, sparsityParam, ...
                                    beta, sae1Features), ...
                                    sae2Theta, options);


% -------------------------------------------------------------------------
fprintf('Second autoencoder trained\n');

%%======================================================================
%% STEP 3: Train the softmax classifier
%  This trains the softmax classifier on the second autoencoder features.
%  If you've correctly implemented softmaxCost.m, you don't need
%  to change anything here.
% Use the second autoencoder's encoder to get the second-order representation of the input
[sae2Features] = feedForwardAutoencoder(sae2OptTheta, hiddenSizeL2, ...
                                        hiddenSizeL1, sae1Features);

%  Randomly initialize the parameters
saeSoftmaxTheta = 0.005 * randn(hiddenSizeL2 * numClasses, 1);


%% ---------------------- YOUR CODE HERE  ---------------------------------
%  Instructions: Train the softmax classifier, the classifier takes in
%                input of dimension "hiddenSizeL2" corresponding to the
%                hidden layer size of the 2nd layer.
%
%                You should store the optimal parameters in saeSoftmaxOptTheta 
%
%  NOTE: If you used softmaxTrain to complete this part of the exercise,
%        set saeSoftmaxOptTheta = softmaxModel.optTheta(:);
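% Train the softmax model on the second-order features
options.maxIter = 100;
lambda = 1e-4;  % note: this value is also the lambda passed to stackedAECost below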
softmaxModel = softmaxTrain(hiddenSizeL2, numClasses, lambda, ...
                            sae2Features, trainLabels, options);
saeSoftmaxOptTheta = softmaxModel.optTheta(:);

% -------------------------------------------------------------------------
fprintf('Softmax training finished\n');

%%======================================================================
%% STEP 5: Finetune softmax model
% Fine-tuning: compute the cost and gradient of the whole network
% Implement the stackedAECost to give the combined cost of the whole model
% then run this cell.

% Initialize the stack using the parameters learned
stack = cell(2,1);
stack{1}.w = reshape(sae1OptTheta(1:hiddenSizeL1*inputSize), ...
                     hiddenSizeL1, inputSize);
stack{1}.b = sae1OptTheta(2*hiddenSizeL1*inputSize+1:2*hiddenSizeL1*inputSize+hiddenSizeL1);
stack{2}.w = reshape(sae2OptTheta(1:hiddenSizeL2*hiddenSizeL1), ...
                     hiddenSizeL2, hiddenSizeL1);
stack{2}.b = sae2OptTheta(2*hiddenSizeL2*hiddenSizeL1+1:2*hiddenSizeL2*hiddenSizeL1+hiddenSizeL2);

% Initialize the parameters for the deep model
[stackparams, netconfig] = stack2params(stack);
stackedAETheta = [ saeSoftmaxOptTheta ; stackparams ]; % model parameters before fine-tuning

%% ---------------------- YOUR CODE HERE  ---------------------------------
%  Instructions: Train the deep network, hidden size here refers to the '
%                dimension of the input to the classifier, which corresponds 
%                to "hiddenSizeL2".
%
%
% Fine-tune with backpropagation
[stackedAEOptTheta, cost] = minFunc( @(p) stackedAECost(p, inputSize, hiddenSizeL2, ...
                                            numClasses, netconfig, ...
                                            lambda, trainData, trainLabels), ...
                                            stackedAETheta, options);



% -------------------------------------------------------------------------
fprintf('Fine-tuning of the whole model finished\n');


%%======================================================================
%% STEP 6: Test 
%  Instructions: You will need to complete the code in stackedAEPredict.m
%                before running this part of the code
%

% Get labelled test images
% Note that we apply the same kind of preprocessing as the training set
testData = loadMNISTImages('mnist/t10k-images-idx3-ubyte');
testLabels = loadMNISTLabels('mnist/t10k-labels-idx1-ubyte');

testLabels(testLabels == 0) = 10; % Remap 0 to 10

[pred] = stackedAEPredict(stackedAETheta, inputSize, hiddenSizeL2, ...
                          numClasses, netconfig, testData);

acc = mean(testLabels(:) == pred(:));
fprintf('Before Finetuning Test Accuracy: %0.3f%%\n', acc * 100);

[pred] = stackedAEPredict(stackedAEOptTheta, inputSize, hiddenSizeL2, ...
                          numClasses, netconfig, testData);

acc = mean(testLabels(:) == pred(:));
fprintf('After Finetuning Test Accuracy: %0.3f%%\n', acc * 100);

% Accuracy is the proportion of correctly classified images
% The results for our implementation were:
%
% Before Finetuning Test Accuracy: 87.7%
% After Finetuning Test Accuracy:  97.6%
%
% If your values are too low (accuracy less than 95%), you should check 
% your code for errors, and make sure you are training on the 
% entire data set of 60000 28x28 training images 
% (unless you modified the loading code, this should be the case)

stackedAECost.m

function [ cost, grad ] = stackedAECost(theta, inputSize, hiddenSize, ...
                                              numClasses, netconfig, ...
                                              lambda, data, labels)
                                         
% stackedAECost: Takes a trained softmaxTheta and a training data set with labels,
% and returns cost and gradient using a stacked autoencoder model. Used for
% finetuning.
                                         
% theta: trained weights from the autoencoder
% inputSize:   the number of input units
% hiddenSize:  the number of hidden units *at the 2nd layer*
% numClasses:  the number of categories
% netconfig:   the network configuration of the stack
% lambda:      the weight regularization penalty
% data: Our matrix containing the training data as columns.  So, data(:,i) is the i-th training example. 
% labels: A vector containing labels, where labels(i) is the label for the
% i-th training example


%% Unroll softmaxTheta parameter

% We first extract the part which compute the softmax gradient
softmaxTheta = reshape(theta(1:hiddenSize*numClasses), numClasses, hiddenSize);

% Extract out the "stack"
stack = params2stack(theta(hiddenSize*numClasses+1:end), netconfig);

% You will need to compute the following gradients
softmaxThetaGrad = zeros(size(softmaxTheta));
stackgrad = cell(size(stack));
for d = 1:numel(stack)
    stackgrad{d}.w = zeros(size(stack{d}.w));
    stackgrad{d}.b = zeros(size(stack{d}.b));
end

cost = 0; % You need to compute this

% You might find these variables useful
M = size(data, 2);
groundTruth = full(sparse(labels, 1:M, 1));


%% --------------------------- YOUR CODE HERE -----------------------------
%  Instructions: Compute the cost function and gradient vector for 
%                the stacked autoencoder.
%
%                You are given a stack variable which is a cell-array of
%                the weights and biases for every layer. In particular, you
%                can refer to the weights of Layer d, using stack{d}.w and
%                the biases using stack{d}.b . To get the total number of
%                layers, you can use numel(stack).
%
%                The last layer of the network is connected to the softmax
%                classification layer, softmaxTheta.
%
%                You should compute the gradients for the softmaxTheta,
%                storing that in softmaxThetaGrad. Similarly, you should
%                compute the gradients for each layer in the stack, storing
%                the gradients in stackgrad{d}.w and stackgrad{d}.b
%                Note that the size of the matrices in stackgrad should
%                match exactly that of the size of the matrices in stack.
%
depth = size(stack, 1);
a = cell(depth+1, 1);
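a{1} = data;        % input layer
Jweight = 0;        % weight-decay accumulator
m = size(data, 2);  % number of examples

for i=2:numel(a)
    a{i} = sigmoid(stack{i-1}.w*a{i-1}+repmat(stack{i-1}.b, [1 size(a{i-1}, 2)]));
    % To also penalize the autoencoder weights, accumulate the sum of squares
    % (the gradients below would then need lambda*stack{i-1}.w as well):
    %Jweight = Jweight + sum(sum(stack{i-1}.w.^2));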
end

M = softmaxTheta*a{depth+1};
M = bsxfun(@minus, M, max(M, [], 1));
M = exp(M);
p = bsxfun(@rdivide, M, sum(M));

Jweight = Jweight + sum(sum(softmaxTheta.^2));
% Cross-entropy error term + weight-decay term
cost = -1/m .* groundTruth(:)'*log(p(:)) + lambda/2*Jweight; 

% Gradient of the softmax layer
softmaxThetaGrad = -1/m .* (groundTruth - p)*a{depth+1}' + lambda*softmaxTheta;

% Error terms of the hidden layers (derivatives with respect to z)
delta = cell(depth+1, 1);
% Derivative at the last hidden layer, i.e. the input to softmax; each column of delta{depth+1} corresponds to one example
delta{depth+1} = -softmaxTheta' * (groundTruth - p) .* a{depth+1} .* (1-a{depth+1});

for i=depth:-1:2
    delta{i} = stack{i}.w'*delta{i+1}.*a{i}.*(1-a{i});
end

for i=depth:-1:1
    stackgrad{i}.w = 1/m .* delta{i+1}*a{i}';
    stackgrad{i}.b = 1/m .* sum(delta{i+1}, 2);
end


% -------------------------------------------------------------------------

%% Roll gradient vector
grad = [softmaxThetaGrad(:) ; stack2params(stackgrad)];

end


% You might find this useful
function sigm = sigmoid(x)
    sigm = 1 ./ (1 + exp(-x));
end
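Before fine-tuning on the full data set it is worth checking the analytic gradient numerically. The sketch below is not the tutorial's checking code; it uses a hypothetical one-hidden-layer toy network with random values and compares the delta-based gradient for the first-layer weights, computed the same way as in stackedAECost above, against a central-difference estimate:

```matlab
% Toy problem (all sizes and values are hypothetical).
inputSize = 4; hiddenSize = 3; numClasses = 2; m = 5; lambda = 1e-4;
x      = rand(inputSize, m);
labels = [1 2 1 2 2];
I      = full(sparse(labels, 1:m, 1, numClasses, m));   % one-hot labels
W1     = 0.5 * randn(hiddenSize, inputSize);
b1     = 0.1 * randn(hiddenSize, 1);
theta  = 0.5 * randn(numClasses, hiddenSize);

sigmoid = @(z) 1 ./ (1 + exp(-z));
% Column-wise softmax with max-subtraction for numerical stability.
softmaxProb = @(s) bsxfun(@rdivide, exp(bsxfun(@minus, s, max(s, [], 1))), ...
                          sum(exp(bsxfun(@minus, s, max(s, [], 1))), 1));
hidden = @(W) sigmoid(bsxfun(@plus, W * x, b1));
costFn = @(W) -sum(sum(I .* log(softmaxProb(theta * hidden(W))))) / m ...
              + lambda / 2 * sum(theta(:).^2);

% Analytic gradient with respect to W1 via the softmax-layer error term.
a2     = hidden(W1);
P      = softmaxProb(theta * a2);
delta2 = -(theta' * (I - P)) .* a2 .* (1 - a2);
gradW1 = delta2 * x' / m;

% Central-difference estimate, one weight at a time.
epsilon = 1e-4;
numGrad = zeros(size(W1));
for k = 1:numel(W1)
    Wp = W1; Wp(k) = Wp(k) + epsilon;
    Wm = W1; Wm(k) = Wm(k) - epsilon;
    numGrad(k) = (costFn(Wp) - costFn(Wm)) / (2 * epsilon);
end

relErr = norm(numGrad(:) - gradW1(:)) / norm(numGrad(:) + gradW1(:));
fprintf('Relative difference: %g\n', relErr);
```

A very small relative difference suggests the delta computation is right; a large one usually points to a sign error or a missing 1/m factor.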

stackedAEPredict.m

function [pred] = stackedAEPredict(theta, inputSize, hiddenSize, numClasses, netconfig, data)
                                         
% stackedAEPredict: Takes a trained theta and a test data set,
% and returns the predicted labels for each example.
                                         
% theta: trained weights from the autoencoder
% inputSize:   the number of input units
% hiddenSize:  the number of hidden units *at the 2nd layer*
% numClasses:  the number of categories
% data: Our matrix containing the test data as columns.  So, data(:,i) is the i-th test example. 

% Your code should produce the prediction matrix 
% pred, where pred(i) is argmax_c P(y(c) | x(i)).
 
%% Unroll theta parameter

% We first extract the part which compute the softmax gradient
softmaxTheta = reshape(theta(1:hiddenSize*numClasses), numClasses, hiddenSize);

% Extract out the "stack"
stack = params2stack(theta(hiddenSize*numClasses+1:end), netconfig);

%% ---------- YOUR CODE HERE --------------------------------------
%  Instructions: Compute pred using theta assuming that the labels start 
%                from 1.
depth = numel(stack);
a = cell(depth+1, 1);  % one activation matrix per layer, input included
a{1} = data;
m = size(data, 2);

for i=2:depth+1
    a{i} = sigmoid(stack{i-1}.w*a{i-1}+ repmat(stack{i-1}.b, [1 m]));
end

% softmax is monotonic, so the argmax of the scores is the argmax of the probabilities
[~, pred] = max(softmaxTheta*a{depth+1}, [], 1);




% -----------------------------------------------------------

end


% You might find this useful
function sigm = sigmoid(x)
    sigm = 1 ./ (1 + exp(-x));
end
