进击的吃恩程sy

cs231n assignment1_Q1_KNN Classifier

今天开始将进行学习cs231n课程并完成相关的作业，在此记录。

配置环境
首先在做作业之前，需要配置作业相关的环境才能进行，我的步骤如下：

下载并安装anaconda：https://www.anaconda.com/download/ ，这是一个Python的科学包，它集成了很多机器学习、深度学习要用的相关库，也自带了一个Python。
安装Jupyter notebook，直接可以在命令行安装，pip install jupyter
下载数据集CIFAR10，网址：http://www.cs.toronto.edu/~kriz/cifar.html
在有Pycharm的前提下，在Pycharm上使用Jupyter notebook进行编写。
Jupyter这个工具可以及时在工作界面上看到运行的结果。

做作业

题目：

k-最近邻（kNN）练习

完成并将完成的工作表（包括其输出和工作表之外的任何支持代码）与您的作业提交一起提交。有关详细信息，请参阅课程网站上的作业页面。

kNN分类器包括两个阶段：

在训练期间，分类器获取训练数据并简单地记住它

在测试期间，kNN通过与所有训练图像进行比较并且转移k个最相似训练示例的标签来对每个测试图像进行分类

k的值是交叉验证的

在本练习中，您将实现这些步骤并理解基本的图像分类管道，交叉验证，并获得编写高效矢量化代码的熟练程度。

步骤

到官网下载作业包，网址：https://github.com/cs231n/cs231n.github.io/blob/master/assignments/2017/assignment1.md
然后解压。
进入pycharm，新建项目到之前步骤1解压的目录下，将下载的CIFAR10压缩包解压到cs231n\datasets下。
在pycharm中打开assignment1/knn.ipynb进行编写。

下面贴上代码

knn.ipynb

import random
import numpy as np
from cs231n.data_utils import load_CIFAR10
import matplotlib.pyplot as plt


# This is a bit of magic to make matplotlib figures appear inline in the notebook
# rather than in a new window.
%matplotlib inline 

plt.rcParams['figure.figsize'] = (10.0, 8.0) # set default size of plots
plt.rcParams['image.interpolation'] = 'nearest'
plt.rcParams['image.cmap'] = 'gray'


# Some more magic so that the notebook will reload external python modules;
# see http://stackoverflow.com/questions/1907993/autoreload-of-modules-in-ipython
%load_ext autoreload
%autoreload 2

# Load the raw CIFAR-10 data.
cifar10_dir = 'cs231n/datasets/cifar-10-batches-py'
X_train, y_train, X_test, y_test = load_CIFAR10(cifar10_dir)

# As a sanity check, we print out the size of the training and test data.
print('Training data shape: ', X_train.shape)   
print('Training labels shape: ', y_train.shape)
print('Test data shape: ', X_test.shape)
print('Test labels shape: ', y_test.shape)

Training data shape: (50000, 32, 32, 3)
Training labels shape: (50000,)
Test data shape: (10000, 32, 32, 3)
Test labels shape: (10000,)

# Visualize some examples from the dataset.
# We show a few examples of training images from each class.
classes = ['plane', 'car', 'bird', 'cat', 'deer', 'dog', 'frog', 'horse', 'ship', 'truck']
num_classes = len(classes)
samples_per_class = 7

#############
#
# enumerate函数示例：
# seasons = ['Spring', 'Summer', 'Fall', 'Winter']
# >>> list(enumerate(seasons))
# [(0, 'Spring'), (1, 'Summer'), (2, 'Fall'), (3, 'Winter')]
#
#############

for y, cls in enumerate(classes):  
    idxs = np.flatnonzero(y_train == y) #此方法返回了训练集中有相同标签的索引值
    idxs = np.random.choice(idxs, samples_per_class, replace=False) #产生一个随机采样
    for i, idx in enumerate(idxs):
        plt_idx = i * num_classes + y + 1
        plt.subplot(samples_per_class, num_classes, plt_idx) #行数，列数，每行的第几个图像
        plt.imshow(X_train[idx].astype('uint8'))
        plt.axis('off')
        if i == 0:
            plt.title(cls)
plt.show()

# Subsample the data for more efficient code execution in this exercise
#为了后面减少计算量，随机采样数据集，训练集采样5000张，测试集采样500张
num_training = 5000
mask = list(range(num_training))
X_train = X_train[mask]
y_train = y_train[mask]

num_test = 500
mask = list(range(num_test))
X_test = X_test[mask]
y_test = y_test[mask]

# Reshape the image data into rows
#将32*32*3的图片reshape成一行
X_train = np.reshape(X_train, (X_train.shape[0], -1))  #逗号表达式， -1表示不确定。
X_test = np.reshape(X_test, (X_test.shape[0], -1))
print(X_train.shape, X_test.shape)

from cs231n.classifiers import KNearestNeighbor

# Create a kNN classifier instance. 
# Remember that training a kNN classifier is a noop: 
# the Classifier simply remembers the data and does no further processing 
classifier = KNearestNeighbor() 
classifier.train(X_train, y_train) #仅存储训练样本数据

# Open cs231n/classifiers/k_nearest_neighbor.py and implement
# compute_distances_two_loops.

# Test your implementation:
dists = classifier.compute_distances_two_loops(X_test) #用两层循环计算L2距离
print(dists.shape)

(500, 5000)

# We can visualize the distance matrix: each row is a single test example and
# its distances to training examples
plt.imshow(dists, interpolation='none')
plt.show()

# Now implement the function predict_labels and run the code below:
# We use k = 1 (which is Nearest Neighbor).
y_test_pred = classifier.predict_labels(dists, k=1)

# Compute and print the fraction of correctly predicted examples
#print(y_test_pred.shape,y_test.shape)
num_correct = np.sum(y_test_pred == y_test) #分类正确的次数
accuracy = float(num_correct) / num_test
print('Got %d / %d correct => accuracy: %f' % (num_correct, num_test, accuracy))

Got 137 / 500 correct => accuracy: 0.274000

y_test_pred = classifier.predict_labels(dists, k=5)
num_correct = np.sum(y_test_pred == y_test)
accuracy = float(num_correct) / num_test
print('Got %d / %d correct => accuracy: %f' % (num_correct, num_test, accuracy))

Got 139 / 500 correct => accuracy: 0.278000

# Now lets speed up distance matrix computation by using partial vectorization
# with one loop. Implement the function compute_distances_one_loop and run the
# code below:
dists_one = classifier.compute_distances_one_loop(X_test)

# To ensure that our vectorized implementation is correct, we make sure that it
# agrees with the naive implementation. There are many ways to decide whether
# two matrices are similar; one of the simplest is the Frobenius norm. In case
# you haven't seen it before, the Frobenius norm of two matrices is the square
# root of the squared sum of differences of all elements; in other words, reshape
# the matrices into vectors and compute the Euclidean distance between them.
difference = np.linalg.norm(dists - dists_one, ord='fro')
print('Difference was: %f' % (difference, ))
if difference < 0.001:
    print('Good! The distance matrices are the same')
else:
    print('Uh-oh! The distance matrices are different')

Difference was: 0.000000
Good! The distance matrices are the same

# Now implement the fully vectorized version inside compute_distances_no_loops
# and run the code
dists_two = classifier.compute_distances_no_loops(X_test)

# check that the distance matrix agrees with the one we computed before:
difference = np.linalg.norm(dists - dists_two, ord='fro')
print('Difference was: %f' % (difference, ))
if difference < 0.001:
    print('Good! The distance matrices are the same')
else:
    print('Uh-oh! The distance matrices are different')

Difference was: 0.000000
Good! The distance matrices are the same

# Let's compare how fast the implementations are
def time_function(f, *args):
    """
    Call a function f with args and return the time (in seconds) that it took to execute.
    """
    import time
    tic = time.time()
    f(*args)
    toc = time.time()
    return toc - tic

two_loop_time = time_function(classifier.compute_distances_two_loops, X_test)
print('Two loop version took %f seconds' % two_loop_time)

one_loop_time = time_function(classifier.compute_distances_one_loop, X_test)
print('One loop version took %f seconds' % one_loop_time)

no_loop_time = time_function(classifier.compute_distances_no_loops, X_test)
print('No loop version took %f seconds' % no_loop_time)

# you should see significantly faster performance with the fully vectorized implementation

Two loop version took 55.624931 seconds
One loop version took 103.568292 seconds
No loop version took 0.680607 seconds

num_folds = 5
k_choices = [1, 3, 5, 8, 10, 12, 15, 20, 50, 100]

X_train_folds = []
y_train_folds = []
################################################################################
# TODO:                                                                        #
# Split up the training data into folds. After splitting, X_train_folds and    #
# y_train_folds should each be lists of length num_folds, where                #
# y_train_folds[i] is the label vector for the points in X_train_folds[i].     #
# Hint: Look up the numpy array_split function.                                #
################################################################################

X_train_folds = np.split(X_train,5,axis=0)
y_train_folds = np.split(y_train,5,axis=0)

################################################################################
#                                 END OF YOUR CODE                             #
################################################################################

# A dictionary holding the accuracies for different values of k that we find
# when running cross-validation. After running cross-validation,
# k_to_accuracies[k] should be a list of length num_folds giving the different
# accuracy values that we found when using that value of k.
k_to_accuracies = {}


################################################################################
# TODO:                                                                        #
# Perform k-fold cross validation to find the best value of k. For each        #
# possible value of k, run the k-nearest-neighbor algorithm num_folds times,   #
# where in each case you use all but one of the folds as training data and the #
# last fold as a validation set. Store the accuracies for all fold and all     #
# values of k in the k_to_accuracies dictionary.                               #
################################################################################
for k in k_choices:
    accuracies = []
    for i in range(num_folds):
        X_train_cv = np.vstack(X_train_folds[0:i] + X_train_folds[i+1:])
        y_train_cv = np.hstack(y_train_folds[0:i] + y_train_folds[i+1:])
        X_valid_cv = X_train_folds[i]
        y_valid_cv = y_train_folds[i]
        
        classifier.train(X_train_cv,y_train_cv)
        dists = classifier.compute_distances_no_loops(X_valid_cv)
        y_test_pred = classifier.predict_labels(dists,k)
        num_correct = np.sum(y_test_pred == y_valid_cv)
        accuracy = float(num_correct) / y_valid_cv.shape[0]
        #print (accuracy)
        accuracies.append(accuracy)
    
    k_to_accuracies[k]= accuracies

################################################################################
#                                 END OF YOUR CODE                             #
################################################################################

# Print out the computed accuracies
for k in sorted(k_to_accuracies):
    for accuracy in k_to_accuracies[k]:
        print('k = %d, accuracy = %f' % (k, accuracy))

k = 1, accuracy = 0.263000
k = 1, accuracy = 0.257000
k = 1, accuracy = 0.264000
k = 1, accuracy = 0.278000
k = 1, accuracy = 0.266000
k = 3, accuracy = 0.239000
k = 3, accuracy = 0.249000
k = 3, accuracy = 0.240000
k = 3, accuracy = 0.266000
k = 3, accuracy = 0.254000
k = 5, accuracy = 0.248000
k = 5, accuracy = 0.266000
k = 5, accuracy = 0.280000
k = 5, accuracy = 0.292000
k = 5, accuracy = 0.280000
k = 8, accuracy = 0.262000
k = 8, accuracy = 0.282000
k = 8, accuracy = 0.273000
k = 8, accuracy = 0.290000
k = 8, accuracy = 0.273000
k = 10, accuracy = 0.265000
k = 10, accuracy = 0.296000
k = 10, accuracy = 0.276000
k = 10, accuracy = 0.284000
k = 10, accuracy = 0.280000
k = 12, accuracy = 0.260000
k = 12, accuracy = 0.295000
k = 12, accuracy = 0.279000
k = 12, accuracy = 0.283000
k = 12, accuracy = 0.280000
k = 15, accuracy = 0.252000
k = 15, accuracy = 0.289000
k = 15, accuracy = 0.278000
k = 15, accuracy = 0.282000
k = 15, accuracy = 0.274000
k = 20, accuracy = 0.270000
k = 20, accuracy = 0.279000
k = 20, accuracy = 0.279000
k = 20, accuracy = 0.282000
k = 20, accuracy = 0.285000
k = 50, accuracy = 0.271000
k = 50, accuracy = 0.288000
k = 50, accuracy = 0.278000
k = 50, accuracy = 0.269000
k = 50, accuracy = 0.266000
k = 100, accuracy = 0.256000
k = 100, accuracy = 0.270000
k = 100, accuracy = 0.263000
k = 100, accuracy = 0.256000
k = 100, accuracy = 0.263000

# plot the raw observations
for k in k_choices:
    accuracies = k_to_accuracies[k]
    plt.scatter([k] * len(accuracies), accuracies)

# plot the trend line with error bars that correspond to standard deviation
accuracies_mean = np.array([np.mean(v) for k,v in sorted(k_to_accuracies.items())])
accuracies_std = np.array([np.std(v) for k,v in sorted(k_to_accuracies.items())])
plt.errorbar(k_choices, accuracies_mean, yerr=accuracies_std)
plt.title('Cross-validation on k')
plt.xlabel('k')
plt.ylabel('Cross-validation accuracy')
plt.show()

# Based on the cross-validation results above, choose the best value for k,   
# retrain the classifier using all the training data, and test it on the test
# data. You should be able to get above 28% accuracy on the test data.
best_k = 10

classifier = KNearestNeighbor()
classifier.train(X_train, y_train)
y_test_pred = classifier.predict(X_test, k=best_k)

# Compute and display the accuracy
num_correct = np.sum(y_test_pred == y_test)
accuracy = float(num_correct) / num_test
print('Got %d / %d correct => accuracy: %f' % (num_correct, num_test, accuracy))

Got 141 / 500 correct => accuracy: 0.282000

接下来是K_nearest_neighbor.py的代码

import numpy as np
from past.builtins import xrange


class KNearestNeighbor(object):
  """ a kNN classifier with L2 distance """

  def __init__(self):
    pass

  def train(self, X, y):
 .X_train = X
    self.y_train = y
    
  def predict(self, X, k=1, num_loops=0):
oops == 0:
      dists = self.compute_distances_no_loops(X)
    elif num_loops == 1:
      dists = self.compute_distances_one_loop(X)
    elif num_loops == 2:
      dists = self.compute_distances_two_loops(X)
    else:
      raise ValueError('Invalid value %d for num_loops' % num_loops)

    return self.predict_labels(dists, k=k)

  def compute_distances_two_loops(self, X):
  t = X.shape[0] #dists矩阵行数
    num_train = self.X_train.shape[0] #dists矩阵列数
    dists = np.zeros((num_test, num_train)) #距离初始化为0
    for i in xrange(num_test):
      for j in xrange(num_train):
        #####################################################################
        # TODO:                                                             #
        # Compute the l2 distance between the ith test point and the jth    #
        # training point, and store the result in dists[i, j]. You should   #
        # not use a loop over dimension.                                    #
        #####################################################################

        dists[i][j] = np.sqrt(np.sum(np.square(X[i]-self.X_train[j]))) #两向量之间的L2距离。

        #####################################################################
        #                       END OF YOUR CODE                            #
        #####################################################################
    return dists

  def compute_distances_one_loop(self, X):
    """
    Compute the distance between each test point in X and each training point
    in self.X_train using a single loop over the test data.

    Input / Output: Same as compute_distances_two_loops
    """
    num_test = X.shape[0]
    num_train = self.X_train.shape[0]
    dists = np.zeros((num_test, num_train))
    for i in xrange(num_test):
      #######################################################################
      # TODO:                                                               #
      # Compute the l2 distance between the ith test point and all training #
      # points, and store the result in dists[i, :].                        #
      #######################################################################


      dists[i] = np.sqrt(np.sum(np.square(self.X_train - X[i]), axis=1))#按行来执行

      #######################################################################
      #                         END OF YOUR CODE                            #
      #######################################################################
    return dists

  def compute_distances_no_loops(self, X):
    """
    Compute the distance between each test point in X and each training point
    in self.X_train using no explicit loops.

    Input / Output: Same as compute_distances_two_loops
    """
    num_test = X.shape[0]
    num_train = self.X_train.shape[0]
    dists = np.zeros((num_test, num_train)) 
    #########################################################################
    # TODO:                                                                 #
    # Compute the l2 distance between all test points and all training      #
    # points without using any explicit loops, and store the result in      #
    # dists.                                                                #
    #                                                                       #
    # You should implement this function using only basic array operations; #
    # in particular you should not use functions from scipy.                #
    #                                                                       #
    # HINT: Try to formulate the l2 distance using matrix multiplication    #
    #       and two broadcast sums.                                         #
    #########################################################################

    #就是将距离函数拆开，完成各个单项后再合并（x - y）^2 = x^2 - 2*x*y + y^2
    #当axis为0时,是压缩行,即将每一列的元素相加,将矩阵压缩为一行
    #当axis为1时,是压缩列,即将每一行的元素相加,将矩阵压缩为一列
    #这里的计算要特别注意维度的不同

    ab = np.dot(X, self.X_train.T)  # num_test * num_train 点积
    a2 = np.sum(np.square(X), axis=1).reshape(-1, 1)  # num_test * 1
    b2 = np.sum(np.square(self.X_train.T), axis=0).reshape(1, -1)  # 1 * num_train
    dists = -2 * ab + a2 + b2  # 不同维度计算会自动 broadcast
    dists = np.sqrt(dists)


    #########################################################################
    #                         END OF YOUR CODE                              #
    #########################################################################
    return dists

  def predict_labels(self, dists, k=1):
    """
    Given a matrix of distances between test points and training points,
    predict a label for each test point.

    Inputs:
    - dists: A numpy array of shape (num_test, num_train) where dists[i, j]
      gives the distance betwen the ith test point and the jth training point.

    Returns:
    - y: A numpy array of shape (num_test,) containing predicted labels for the
      test data, where y[i] is the predicted label for the test point X[i].  
    """
    num_test = dists.shape[0] #dists矩阵行数
    #print(dists.shape[0])
    y_pred = np.zeros(num_test)
    for i in xrange(num_test):
      # A list of length k storing the labels of the k nearest neighbors to
      # the ith test point.

      #closest_y = []

      #########################################################################
      # TODO:                                                                 #
      # Use the distance matrix to find the k nearest neighbors of the ith    #
      # testing point, and use self.y_train to find the labels of these       #
      # neighbors. Store these labels in closest_y.                           #
      # Hint: Look up the function numpy.argsort.                             #
      #########################################################################

      #argsort函数返回的是数组值从小到大的索引值。

      closest_y = self.y_train[ np.argsort(dists[i])[0:k] ]

      #########################################################################
      # TODO:                                                                 #
      # Now that you have found the labels of the k nearest neighbors, you    #
      # need to find the most common label in the list closest_y of labels.   #
      # Store this label in y_pred[i]. Break ties by choosing the smaller     #
      # label.                                                                #
      #########################################################################

      #argmax函数沿给定轴返回最大元素的索引
      y_pred[i] = np.argmax(np.bincount(closest_y))  #bincount函数统计结果的“票数”

      #########################################################################
      #                           END OF YOUR CODE                            # 
      #########################################################################

    return y_pred

其中non-loop的算法原理：

本次作业重点就是在对于矩阵的处理上，掌握numpy的基本操作。

【可控图像生成系列论文（四）】IP-Adapter 具体是如何训练的？1公式篇多恩Stone AIGC Diffusion Transformer 计算机视觉深度学习 python AIGC pytorch 机器学习人工智能
系列文章目录【可控图像生成系列论文（一）】简要介绍了MimicBrush的整体流程和方法；【可控图像生成系列论文（二）】就MimicBrush的具体模型结构、训练数据和纹理迁移进行了更详细的介绍。【可控图像生成系列论文（三）】介绍了一篇相对早期（2018年）的可控字体艺术化工作。文章目录系列文章目录前言〇、文生图模型预备知识1.训练目标2.无分类器指导（classifier-freeguidanc
Your Diffusion Model is Secretly a Zero-Shot Classifier论文阅读笔记 Rising_Flashlight 论文阅读笔记计算机视觉
YourDiffusionModelisSecretlyaZero-ShotClassifier论文阅读笔记这篇文章我感觉在智源大会上听到无数个大佬讨论，包括OpenAISora团队负责人，谢赛宁，好像还有杨植麟。虽然这个文章好像似乎被引量不是特别高，但是和AI甚至人类理解很本质的问题很相关，即是不是要通过生成来构建理解的问题，文章的做法也很巧妙，感觉是一些学者灵机一动的产物，好好学习一个！摘要这
Spark MLlib模型训练—分类算法Multilayer Perceptron Classifier 猫猫姐 Spark实战 spark-ml spark 机器学习
SparkMLlib模型训练—分类算法MultilayerPerceptronClassifierMultilayerPerceptronClassifier（多层感知器分类器，简称MLP）是SparkMLlib中用于分类任务的神经网络模型。MLP是一种前馈神经网络（FeedforwardNeuralNetwork），其架构由输入层、隐藏层和输出层组成。MLP通过反向传播算法（Backpropag
Maven仓库介绍单手提煤气罐 JAVA java maven
1何为Maven仓库任何一个依赖、插件或者项目构建的输出都可以称为构件任何一个构件都有一组坐标唯一标识Maven可以在某个位置统一存储所有Maven项目共享的构件，这个统一的位置就是仓库2仓库的布局构件：groupId=org.testng、artifactId=testng、version=5.8、classifier=jdk15、packaging=jar，其对应的路径按如下步骤生成基于构件的
maven异常记录-must be unique angelasp java maven java
maven打包异常记录我们可以看看一个重要的异常：'dependencies.dependency.(groupId:artifactId:type:classifier)'mustbeunique:org.springframework.boot:spring-boot-starter-test经过检查pom文件果然是spring-boot-starter-test引用重复，平时直接运行项目不会
cs231n_深度之眼第二次作业 Jie_Cheney
图像分类数据和label分别是什么？图像分类存在的问题与挑战？图像分类数据包括训练集测试集的数据，在有监督的问题中对于训练集数据来说是有label的，而测试集是等待我们去识别它的类别，不具有label。label就是分类标签，比如cifar10这个数据集，待分类的这10类数据我们可以写成1-10，或者0-9这就叫做label。图像分类存在的问题与挑战：光照，角度，形变，遮挡。使用python加载一
Pipeline是如何运行月疯【NLP】python 开发语言
pipeline的两个重要组件模型（Models类）和分词器（Tokenizers类）的参数以及使用方式。以第一个情感分析pipeline为例，我们运行下面的代码fromtransformersimportpipelineclassifier=pipeline("sentiment-analysis")result=classifier("I'vebeenwaitingforaHuggingFac
向量，矩阵和张量的导数 | 简单的数学橘子学AI
前段时间看过一些矩阵求导的教程，在看过的资料中，尤其喜欢斯坦福大学CS231n卷积神经网络课程中提到的Erik这篇文章。循着他的思路，可以逐步将复杂的求导过程简化、再简化，直到发现其中有规律的部分。话不多说，一起来看看吧。作者：ErikLearned-Miller翻译：橘子来源：橘子AI笔记（datawitch）本文旨在帮助您学习向量、矩阵和高阶张量（三维或三维以上的数组）的求导方法，以及如何求对
linux上迁移python虚拟环境让我康康南墙长什么样子 Linux linux python 服务器
BERT验证由于某些需要，目前需要把BERT在GLUE数据集上实现一下验证，在github上down了一份BERT的代码，里面是直接给好了验证代码的（run_classifier.py）只需要按照requirement.txt安装一下tensorflow1.x就可以开始测试了首先肯定是在本机上运行测试一下代码是否有坑，于是马上上手做，下载model，下载数据集，建立虚拟环境，装包等一系列操作，另外
sklearn之模型评估指标总结归纳 lzw2016 机器学习 Python学习 sklearn 模型评估指标归纳总结
文章目录机器学习模型评估分类模型回归模型聚类模型交叉验证中指定scoring参数网格搜索中应用机器学习模型评估以下方法，sklearn中都在sklearn.metrics类下，务必记住哪些指标适合分类，那些适合回归，不能混着用分类的模型大多是Classifier结尾，回归是Regression分类模型accuracy_score（准确率得分）是模型分类正确的数据除以样本总数【模型的score方法算
eclipse 导入 fabric-sdk-java 报错处理 SlowGO
fabric-sdk-java项目clone下来后，先按照docs/EclipseSetup.md来操作，操作完成后，还会遇到问题。1.POM错误com.google.protobuf:protoc:exe:${os.detected.classifier}问题就出在${os.detected.classifier}，需要我们手动把他替换掉。随便建一个class，main中写：System.out
dependencies.dependency.(groupId:artifactId:type:classifier)‘ must be unique: com.duo:duo-shop sccd2009 开发语言 java
引续'dependencies.dependency.(groupId:artifactId:type:classifier)'mustbeunique:com.duo:duo-shop:jar->version(?)vs3.8.5@line71,column21Itishighlyrecommendedtofixtheseproblemsbecausetheythreatenthestabili
cs231n assignment1——SVM 柠檬山楂荷叶茶 cs231n 支持向量机 python 机器学习
整体思路加载CIFAR-10数据集并展示部分数据数据图像归一化，减去均值（也可以再除以方差）svm_loss_naive和svm_loss_vectorized计算hinge损失，用拉格朗日法列hinge损失函数利用随机梯度下降法优化SVM在训练集和验证集计算准确率，保存最好的模型在测试集进行预测计算准确率加载展示划分数据集加载CIFAR-10数据集#LoadtherawCIFAR-10data.
（2023版）斯坦福CS231n学习笔记：DL与CV教程 (12) | 视觉模型可视化与可解释性（Visualizing and Understanding）女王の专属领地计算机视觉 #计算机视觉 #学习笔记
前言笔记专栏：斯坦福CS231N：面向视觉识别的卷积神经网络（23）课程链接：https://www.bilibili.com/video/BV1xV411R7i5CS231n:深度学习计算机视觉（2017）中文笔记：https://zhuxiaoxia.blog.csdn.net/article/details/801551662023最新课程PPT：https://download.csdn.
2019-02-25~~2019-03-03 第十周周末复盘仰望星空的小狗
一、任务清单1、刷leetcode题目（7道）2、听tensorflow，cs231n和cv课程3、技术文档输出4、恢复早起的作息二、反思1、自从年前工作非常忙，加上遇上一些郁闷的事情，导致年前到现在时间记录中断了很长一段时间。本周开始恢复时间记录，日打卡，周复盘。2、生活中不论谁，肯定会时不时遇上一些令人郁闷的事情，这些郁闷的事情很可能会打乱原本的生活节奏。但是，生活还有很长的路要走，不应该因为
清除Maven缓存使用插件生成grpc代码 qq_22905801 maven 缓存 spring
清除Maven缓存：Maven有时会缓存错误信息。运行一下命令以清除Maven的缓存：mvndependency:purge-local-repository强制更新依赖：当运行Maven命令时，使用-U参数：mvncleancompile-U手动指定分类器：如果自动检测不起作用，可以手动指定分类器。根据您的操作系统手动替换${os.detected.classifier}。例如，如果是在64位L
【论文笔记】End-to-End Diffusion Latent Optimization Improves Classifier Guidance xhyu61 机器学习论文笔记学习笔记论文阅读
AbstractClassifierguidance为图像生成带来了控制，但是需要训练新的噪声感知模型(noise-awaremodels)来获得准确的梯度，或使用最终生成的一步去噪近似，这会导致梯度错位(misalignedgradients)和次优控制(sub-optimalcontrol)。梯度错位(misalignedgradients)：通过噪声感知模型指导生成模型时，两个模型的结构和目
geemap学习笔记 08 geemap 监督分类结果的精度验证案例弈落馨 geemap python 分类学习机器学习
文章目录前言一、分类精度评价二、监督分类结果的精度验证1.混淆矩阵2.总体精度3.Kappa系数4.生产者精度5.用户精度总结前言要评估分类器的准确性，可以使用ConfusionMatrix。**sample()方法从输入数据生成两个随机样本:一个用于训练，另一个用于验证。训练样本用于训练分类器。从classifier.confusionMatrix()**中可以得到训练数据的替换精度。为了获得验
训练神经网络(上)激活函数笔写落去深度学习神经网络人工智能深度学习
本文介绍几种激活函数,只作为个人笔记.观看视频为cs231n文章目录前言一、Sigmoid函数二、tanh函数三、ReLU函数四、LeakyReLU函数五、ELU函数六.在实际应用中寻找激活函数的做法总结前言激活函数是用来加入非线性因素的，提高神经网络对模型的表达能力，解决线性模型所不能解决的问题。一、Sigmoid函数这个函数大家应该熟悉在逻辑回归中曾用到这个sigmoid函数这个函数可以将负无
Halcon DL-Model相关算子夏雪之晶莹《HALCON》学习笔记机器视觉
(1)create_dl_model_detection(::Backbone,NumClasses,DLModelDetectionParam:DLModelHandle)功能：创建一个用于目标检测或实例分割的深度学习网络。控制输入参数1：Backbone：骨干网络(预训练分类器)，Defaultvalue:'pretrained_dl_classifier_compact.hdl'；'pret
第九课：机器学习与人工智能、计算机视觉、自然语言处理 NLP及机器人笛秋白计算机科学人工智能机器学习计算机视觉个人开发计算机历史快速入门
第九课：机器学习与人工智能、计算机视觉、自然语言处理NLP及机器人第三十四章：机器学习与人工智能1、分类Classification2、做分类的算法分类器Classifier3、用于分类的值是特征Feature4、特征值+种类叫做标记数据Labeleddata5、决策边界Decisionboundaries6、混淆矩阵Confusionmatrix7、决策树Decisiontree8、支持向量机S
深入理解感知机月见樽
本文公式较多，由于不支持公式渲染，公式完整版请移步个人博客1.模型感知机的模型如下图所示：linear_classifier_structure.png公式表示如下所示：$$f(x)=sign(w\cdotx+b)\sign(x)=\begin{cases}+1&x\geq0\-1&x<0\end{cases}$$对于该分类器，其假设空间为特征空间的所有线性分类器，从几何学的角度可以理解为是特征空
Weka 分类树输出结果解析 Weighted.avg deer(écho) MachineLearning 分类数据挖掘人工智能
本文是对weka分类树的结果解释，集合了其它的博文我们使用的是weka自带的weather数据库先看左侧，classifier是分类方法，J48是递归分治策略；cross-validation表示交叉验证，使用了10-Foldspercentagesplit表示分割比例，用以分割训练集和测试集（猜的）再看看output，yes(9/3)(5/2)表示训练集里3个no，测试集里2个no(猜的x2)其
rasa框架意图分类embedding算法 233彭于晏
算法模型intent_classifier_tensorflow_embedding点击此处获取算法代码算法框架算法框架算法思想把训练样本和意图编码到同一个向量空间，设计损失函数，使得样本与真实意图更相近，样本与其他意图更相反，意图之间编码更相反，达到意图分类的目的。举个例子说明，假设有两条训练样本“我要充话费”和“我要订机票”，有四个意图“订机票”、“查天气”，“充话费”，“查运势”，意图分类算
卷积神经网络 weixin_34283445 人工智能
https://zhuanlan.zhihu.com/p/27642620关于卷积神经网络的讲解，网上有很多精彩文章，且恐怕难以找到比斯坦福的CS231n还要全面的教程。所以这里对卷积神经网络的讲解主要是以不同的思考侧重展开，通过对卷积神经网络的分析，进一步理解神经网络变体中“因素共享”这一概念。注意：该文会跟其他的现有文章有很大的不同。读该文需要有本书前些章节作为预备知识，不然会有理解障碍。没看
【最优传输二十八】Reusing the Task-specific Classifier as a Discriminator:Discriminator-free Adversarial Dom 羊驼不驼a 最优传输域适应基本论文深度学习机器学习
1.motivation现有的对抗性UDA方法通常采用额外的鉴别器来与特征提取器进行最小-最大博弈。然而，这些方法大多未能有效利用预测的判别信息，从而导致生成器的模式崩溃。为了解决这个问题，本文设计了一个简单而有效的对抗性范式，即无鉴别器的对抗性学习网络（DALN），其中类别分类器被重新用作鉴别器，通过统一的目标实现显式的领域对齐和类别区分，使得DALN能够利用预测的判别信息来进行充分的特征对准。
机器学习的一些有趣的点【异常检测】我就是菜鸡1229 机器学习人工智能
机器能不能知道自己不知道，而不是给出判断中的一种？Classifier（分类）AnomalyDetection（异常检测）机器能不能说出为什么知道？有时候可能是因为数据的问题导致了这种错觉。机器学习是否会有错觉？AdversarialAttack“对抗攻击”。这是指针对机器学习模型或人工智能系统的一种攻击方法，攻击者通过精心设计的输入，试图欺骗模型，使其产生错误的输出或分类。这种攻击是通过对输入数
CS231n 作业答案 tech0ne
CS231n三次大作业：#第一次作业##原始包下载：作业一完成包地址：作业一JupyterNotebook结果：KNNSVMSoftmaxTwolayernetFeatures第二次作业原始包下载：作业二完成包地址：作业二JupyterNotebook结果：FullyConnectedNetsBatchNormalizationDropoutConvolutionalNetworksTensorf
【扩散模型】9、Imagen | 借用语言模型的能力来实现文生图（NIPS2022 Oral）呆呆的猫扩散模型 Imagen 语言模型人工智能
文章目录一、背景二、方法2.1预训练的语言编码器2.2扩散模型和classifier-freeguidance三、效果论文：Imagen:PhotorealisticText-to-ImageDiffusionModelswithDeepLanguageUnderstanding官网：https://www.assemblyai.com/blog/how-imagen-actually-works
GEE机器学习——Classifier.explain()查看训练模型的过程和变量重要性分析此星光明机器学习机器学习人工智能云计算 javascript 重要性变量 gee
变量重要性变量重要性分析是一种用于评估模型中每个特征（变量）对模型性能的影响程度的方法。通过分析每个特征的重要性，可以帮助我们理解模型如何利用不同特征来进行预测，并且可以帮助我们选择最重要的特征，以便更好地解释模型和优化模型性能。在本案例种，使用不同机器学习方法，然后根据该函数对各参与构建模型的变量进行重要性分析，这样最后可以获取各变量的一个数值，最终就可以根据变量重要性来进行模型的优化和变量冗余
解线性方程组 qiuwanchi
package gaodai.matrix; import java.util.ArrayList; import java.util.List; import java.util.Scanner; public class Test { public static void main(String[] args) { Scanner scanner = new Sc
在mysql内部存储代码 annan211 性能 mysql 存储过程触发器
在mysql内部存储代码在mysql内部存储代码，既有优点也有缺点，而且有人倡导有人反对。先看优点： 1 她在服务器内部执行，离数据最近，另外在服务器上执行还可以节省带宽和网络延迟。 2 这是一种代码重用。可以方便的统一业务规则，保证某些行为的一致性，所以也可以提供一定的安全性。 3 可以简化代码的维护和版本更新。 4 可以帮助提升安全，比如提供更细
Android使用Asynchronous Http Client完成登录保存cookie的问题 hotsunshine android
Asynchronous Http Client是android中非常好的异步请求工具除了异步之外还有很多封装比如json的处理，cookie的处理引用 Persistent Cookie Storage with PersistentCookieStore This library also includes a PersistentCookieStore whi
java面试题 Array_06 java 面试
java面试题第一，谈谈final, finally, finalize的区别。 final-修饰符（关键字）如果一个类被声明为final，意味着它不能再派生出新的子类，不能作为父类被继承。因此一个类不能既被声明为 abstract的，又被声明为final的。将变量或方法声明为final，可以保证它们在使用中不被改变。被声明为final的变量必须在声明时给定初值，而在以后的引用中只能
网站加速 oloz 网站加速
前序:本人菜鸟，此文研究总结来源于互联网上的资料，大牛请勿喷！本人虚心学习，多指教. 1、减小网页体积的大小，尽量采用div+css模式，尽量避免复杂的页面结构，能简约就简约。 2、采用Gzip对网页进行压缩； GZIP最早由Jean-loup Gailly和Mark Adler创建，用于UNⅨ系统的文件压缩。我们在Linux中经常会用到后缀为.gz
正确书写单例模式随意而生 java 设计模式单例
　　单例模式算是设计模式中最容易理解，也是最容易手写代码的模式了吧。但是其中的坑却不少，所以也常作为面试题来考。本文主要对几种单例写法的整理，并分析其优缺点。很多都是一些老生常谈的问题，但如果你不知道如何创建一个线程安全的单例，不知道什么是双检锁，那这篇文章可能会帮助到你。　　懒汉式，线程不安全　　当被问到要实现一个单例模式时，很多人的第一反应是写出如下的代码，包括教科书上也是这样
单例模式香水浓 java
懒汉调用getInstance方法时实例化 public class Singleton { private static Singleton instance; private Singleton() {} public static synchronized Singleton getInstance() { if(null == ins
安装Apache问题：系统找不到指定的文件 No installed service named "Apache2" AdyZhang apache http server
安装Apache问题：系统找不到指定的文件 No installed service named "Apache2" 每次到这一步都很小心防它的端口冲突问题，结果，特意留出来的80端口就是不能用，烦。解决方法确保几处： 1、停止IIS启动 2、把端口80改成其它（譬如90，800，，，什么数字都好） 3、防火墙(关掉试试) 在运行处输入 cmd 回车，转到apa
如何在android 文件选择器中选择多个图片或者视频？ aijuans android
我的android app有这样的需求，在进行照片和视频上传的时候，需要一次性的从照片/视频库选择多条进行上传但是android原生态的sdk中，只能一个一个的进行选择和上传。我想知道是否有其他的android上传库可以解决这个问题，提供一个多选的功能，可以使checkbox之类的，一次选择多个处理方法官方的图片选择器(但是不支持所有版本的androi，只支持API Level
mysql中查询生日提醒的日期相关的sql baalwolf mysql
SELECT sysid,user_name,birthday,listid,userhead_50,CONCAT(YEAR(CURDATE()),DATE_FORMAT(birthday,'-%m-%d')),CURDATE(), dayofyear( CONCAT(YEAR(CURDATE()),DATE_FORMAT(birthday,'-%m-%d')))-dayofyear(
MongoDB索引文件破坏后导致查询错误的问题 BigBird2012 mongodb
问题描述： MongoDB在非正常情况下关闭时，可能会导致索引文件破坏，造成数据在更新时没有反映到索引上。解决方案：使用脚本，重建MongoDB所有表的索引。 var names = db.getCollectionNames(); for( var i in names ){ var name = names[i]; print(name);
Javascript Promise bijian1013 JavaScript Promise
Parse JavaScript SDK现在提供了支持大多数异步方法的兼容jquery的Promises模式，那么这意味着什么呢，读完下文你就了解了。一.认识Promises “Promises”代表着在javascript程序里下一个伟大的范式，但是理解他们为什么如此伟大不是件简
[Zookeeper学习笔记九]Zookeeper源代码分析之Zookeeper构造过程 bit1129 zookeeper
Zookeeper重载了几个构造函数，其中构造者可以提供参数最多，可定制性最多的构造函数是 public ZooKeeper(String connectString, int sessionTimeout, Watcher watcher, long sessionId, byte[] sessionPasswd, boolea
【Java命令三】jstack bit1129 jstack
jstack是用于获得当前运行的Java程序所有的线程的运行情况(thread dump），不同于jmap用于获得memory dump [hadoop@hadoop sbin]$ jstack Usage: jstack [-l] <pid> (to connect to running process) jstack -F
jboss 5.1启停脚本　动静分离部署 ronin47
以前启动jboss，往各种xml配置文件，现只要运行一句脚本即可。start nohup sh /**/run.sh -c servicename -b ip -g clustername -u broatcast jboss.messaging.ServerPeerID=int -Djboss.service.binding.set=p
UI之如何打磨设计能力? brotherlamp UI ui教程 ui自学 ui资料 ui视频
在越来越拥挤的初创企业世界里，视觉设计的重要性往往可以与杀手级用户体验比肩。在许多情况下，尤其对于 Web 初创企业而言，这两者都是不可或缺的。前不久我们在《右脑革命：别学编程了，学艺术吧》中也曾发出过重视设计的呼吁。如何才能提高初创企业的设计能力呢?以下是 9 位创始人的体会。 1.找到自己的方式如果你是设计师，要想提高技能可以去设计博客和展示好设计的网站如D-lists或
三色旗算法 bylijinnan java 算法
import java.util.Arrays; /** 问题：假设有一条绳子，上面有红、白、蓝三种颜色的旗子，起初绳子上的旗子颜色并没有顺序，您希望将之分类，并排列为蓝、白、红的顺序，要如何移动次数才会最少，注意您只能在绳子上进行这个动作，而且一次只能调换两个旗子。网上的解法大多类似：在一条绳子上移动，在程式中也就意味只能使用一个阵列，而不使用其它的阵列来
警告:No configuration found for the specified action: \'s chiangfai configuration
1.index.jsp页面form标签未指定namespace属性。  <%@taglib prefix="s" uri="/struts-tags"%> ... <s:form action="submit" method="post"&g
redis -- hash_max_zipmap_entries设置过大有问题 chenchao051 redis hash
使用redis时为了使用hash追求更高的内存使用率，我们一般都用hash结构，并且有时候会把hash_max_zipmap_entries这个值设置的很大，很多资料也推荐设置到1000，默认设置为了512，但是这里有个坑 #define ZIPMAP_BIGLEN 254 #define ZIPMAP_END 255 /* Return th
select into outfile access deny问题 daizj mysql txt 导出数据到文件
本文转自：http://hatemysql.com/2010/06/29/select-into-outfile-access-deny%E9%97%AE%E9%A2%98/ 为应用建立了rnd的帐号，专门为他们查询线上数据库用的，当然，只有他们上了生产网络以后才能连上数据库，安全方面我们还是很注意的，呵呵。授权的语句如下： grant select on armory.* to rn
phpexcel导出excel表简单入门示例 dcj3sjt126com PHP Excel phpexcel
<?php error_reporting(E_ALL); ini_set('display_errors', TRUE); ini_set('display_startup_errors', TRUE); if (PHP_SAPI == 'cli') die('This example should only be run from a Web Brows
美国电影超短200句 dcj3sjt126com 电影
1. I see．我明白了。2. I quit! 我不干了!3. Let go! 放手!4. Me too．我也是。5. My god! 天哪!6. No way! 不行!7. Come on．来吧(赶快)8. Hold on．等一等。9. I agree。我同意。10. Not bad．还不错。11. Not yet．还没。12. See you．再见。13. Shut up!
Java访问远程服务 dyy_gusi httpclient webservice get post
随着webService的崛起，我们开始中会越来越多的使用到访问远程webService服务。当然对于不同的webService框架一般都有自己的client包供使用，但是如果使用webService框架自己的client包，那么必然需要在自己的代码中引入它的包，如果同时调运了多个不同框架的webService，那么就需要同时引入多个不同的clien
Maven的settings.xml配置 geeksun settings.xml
settings.xml是Maven的配置文件，下面解释一下其中的配置含义： settings.xml存在于两个地方： 1.安装的地方：$M2_HOME/conf/settings.xml 2.用户的目录：${user.home}/.m2/settings.xml 前者又被叫做全局配置，后者被称为用户配置。如果两者都存在，它们的内容将被合并，并且用户范围的settings.xml优先。
ubuntu的init与系统服务设置 hongtoushizi ubuntu
转载自： http://iysm.net/?p=178 init Init是位于/sbin/init的一个程序，它是在linux下，在系统启动过程中，初始化所有的设备驱动程序和数据结构等之后，由内核启动的一个用户级程序，并由此init程序进而完成系统的启动过程。 ubuntu与传统的linux略有不同，使用upstart完成系统的启动，但表面上仍维持init程序的形式。运行
跟我学Nginx+Lua开发目录贴 jinnianshilongnian nginx lua
使用Nginx+Lua开发近一年的时间，学习和实践了一些Nginx+Lua开发的架构，为了让更多人使用Nginx+Lua架构开发，利用春节期间总结了一份基本的学习教程，希望对大家有用。也欢迎谈探讨学习一些经验。目录第一章安装Nginx+Lua开发环境第二章 Nginx+Lua开发入门第三章 Redis/SSDB+Twemproxy安装与使用第四章 L
php位运算符注意事项 home198979 位运算 PHP &
$a = $b = $c = 0; $a & $b = 1; $b | $c = 1 问a,b,c最终为多少? 当看到这题时，我犯了一个低级错误，误以为位运算符会改变变量的值。所以得出结果是1 1 0 但是位运算符是不会改变变量的值的，例如： $a=1;$b=2; $a&$b; 这样a,b的值不会有任何改变
Linux shell数组建立和使用技巧 pda158 linux
1.数组定义　　[chengmo@centos5 ~]$ a=(1 2 3 4 5) 　　[chengmo@centos5 ~]$ echo $a 　　1 　　一对括号表示是数组，数组元素用“空格”符号分割开。　　 2.数组读取与赋值　　得到长度：　　[chengmo@centos5 ~]$ echo ${#a[@]} 　　5 　　用${#数组名[@或
hotspot源码(JDK7) ol_beta java HotSpot jvm
源码结构图，方便理解： ├─agent Serviceab
Oracle基本事务和ForAll执行批量DML练习 vipbooks oracle sql
基本事务的使用：从账户一的余额中转100到账户二的余额中去，如果账户二不存在或账户一中的余额不足100则整笔交易回滚 select * from account; -- 创建一张账户表 create table account( -- 账户ID id number(3) not null, -- 账户名称 nam

cs231n assignment1_Q1_KNN Classifier

你可能感兴趣的:(cs231n assignment1_Q1_KNN Classifier)