张泽旺

Handwritten Digit Recognition on MNIST with Logistic Regression

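1. Introduction

Logistic regression (LR) is widely used for classification. It is a probabilistic, linear classifier: a model with just an input layer and an output layer is enough to classify input data effectively. The model has only two sets of parameters, the weights and the biases. This article uses the negative log-likelihood as the loss function, updates the parameters with stochastic gradient descent (SGD), and defines an error function to track how training is progressing.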

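2. Training procedure

Section 3 gives the complete Python code for this article (written for Python 2 and Theano). The data file mnist.pkl.gz that it uses can be downloaded from the web; place it in the same directory as the Python file.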

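First, define a LogisticRegression class that derives from object. Its constructor takes the input, the input dimension, and the output dimension. W is initialized to a zero matrix and b to a zero vector. The classifier that maps x to y is the softmax function, and the predicted value of y is the class with the highest probability.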
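
To make the softmax-then-argmax step concrete, here is a minimal sketch in plain Python rather than Theano. The helper names softmax and predict and the toy numbers are illustrative assumptions, not part of the article's code; W is laid out as (n_in, n_out), the same shape the LogisticRegression class uses.

```python
import math


def softmax(scores):
    """Turn a list of raw class scores into probabilities that sum to 1."""
    # Subtracting the maximum keeps exp() from overflowing; softmax is
    # unchanged by adding a constant to every score.
    shift = max(scores)
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def predict(x, W, b):
    """Return (probabilities, predicted class) for one input vector x.

    W is a list of n_in rows with n_out entries each; b has n_out entries.
    """
    n_out = len(b)
    scores = [sum(x[i] * W[i][k] for i in range(len(x))) + b[k]
              for k in range(n_out)]
    probs = softmax(scores)
    # argmax: index of the largest probability
    y_pred = max(range(n_out), key=lambda k: probs[k])
    return probs, y_pred


# Toy example (hypothetical numbers): 2 input features, 3 classes.
W = [[1.0, 0.0, -1.0],
     [0.0, 2.0, 0.0]]
b = [0.0, 0.0, 0.0]
probs, y_pred = predict([1.0, 1.0], W, b)
```

With W and b all zeros, as in the constructor, every class receives the same probability, so the prediction before any training carries no information.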

Next, define negative_log_likelihood(self, y), which takes the correct labels y and returns the mean negative log-likelihood over the minibatch.
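
As a rough illustration of what this function computes, the sketch below picks, for each example, the log-probability assigned to its correct class and averages the negated values. It is plain Python with made-up probabilities; the standalone function is a stand-in for the Theano expression, not a copy of it.

```python
import math


def negative_log_likelihood(probs, labels):
    """Mean of -log(probs[i][labels[i]]) over the minibatch."""
    picked = [math.log(p[y]) for p, y in zip(probs, labels)]
    return -sum(picked) / len(picked)


# Two examples, three classes (hypothetical probabilities).
probs = [[0.7, 0.2, 0.1],
         [0.1, 0.8, 0.1]]
labels = [0, 1]
nll = negative_log_likelihood(probs, labels)

# With a uniform prediction over 10 classes the cost is log(10),
# which is what a zero-initialized model starts from on MNIST.
uniform_nll = negative_log_likelihood([[0.1] * 10], [3])
```

Using the mean rather than the sum keeps the size of the gradient roughly independent of the batch size, so the same learning rate remains usable when batch_size changes.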

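Next, define errors(self, y), which computes the model's error rate: the fraction of examples in the minibatch whose predicted label differs from the correct one.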
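
The same zero-one loss can be sketched in a few lines of plain Python. The function name error_rate and the sample labels are assumptions chosen for illustration.

```python
def error_rate(y_pred, y_true):
    """Fraction of positions where the prediction differs from the label."""
    if len(y_pred) != len(y_true):
        raise ValueError('y_pred and y_true must have the same length')
    # 1 marks a mistake, 0 marks a correct prediction
    mistakes = [1 if p != t else 0 for p, t in zip(y_pred, y_true)]
    return sum(mistakes) / float(len(mistakes))


# One mistake out of four predictions.
rate = error_rate([3, 1, 4, 1], [3, 1, 4, 2])
```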

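Next, define the data-loading function load_data(dataset). It is fairly simple and mainly deserializes the pickled objects. It returns [(train_set_x, train_set_y), (valid_set_x, valid_set_y), (test_set_x, test_set_y)], that is, the inputs and labels of the training, validation, and test sets.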
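
The sketch below shows the deserialization step on a tiny stand-in file, so it runs without downloading anything. The file name toy.pkl.gz, the helper load_splits, and the values are all hypothetical; only the layout (three (inputs, targets) pairs in a gzip-compressed pickle) follows the description above. It uses the pickle module, which replaces cPickle on Python 3.

```python
import gzip
import os
import pickle
import tempfile


def load_splits(path):
    """Read a gzip-compressed pickle holding (train, valid, test) pairs."""
    with gzip.open(path, 'rb') as f:
        train_set, valid_set, test_set = pickle.load(f)
    return [train_set, valid_set, test_set]


# Build a tiny file with the same layout: each split is (inputs, targets),
# with one input row per target.
fake = (
    ([[0.0, 0.1], [0.2, 0.3]], [5, 0]),   # training set
    ([[0.4, 0.5]], [4]),                  # validation set
    ([[0.6, 0.7]], [1]),                  # test set
)
path = os.path.join(tempfile.mkdtemp(), 'toy.pkl.gz')
with gzip.open(path, 'wb') as f:
    pickle.dump(fake, f)

(train_x, train_y), (valid_x, valid_y), (test_x, test_y) = load_splits(path)
```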

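Next, define sgd_optimization_mnist(learning_rate=0.13, n_epochs=1000, dataset='mnist.pkl.gz', batch_size=600). It splits the training data into minibatches of 600 examples each and instantiates a LogisticRegression object whose output is a vector of length 10 (one entry per digit class, 0 to 9) and whose input is a 28*28-pixel image. It then defines two functions, test_model and validate_model, followed by the parameter update rule, and finally the training function train_model, which updates the parameters once per call.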
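
The update rule can be sketched without Theano by writing the gradient out by hand. For the mean negative log-likelihood of a softmax classifier, the gradient with respect to the score of class k is (p_k - 1[k == y]) / n for each example. The function sgd_step, the toy data, and the batch size of 2 are assumptions for illustration; the article's code instead obtains the gradients from T.grad.

```python
import math


def softmax(scores):
    shift = max(scores)
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sgd_step(W, b, batch_x, batch_y, learning_rate):
    """Update W (n_in x n_out) and b (n_out) in place on one minibatch.

    Returns the mean negative log-likelihood measured before the update.
    """
    n_in, n_out, n = len(W), len(b), len(batch_x)
    g_W = [[0.0] * n_out for _ in range(n_in)]
    g_b = [0.0] * n_out
    cost = 0.0
    for x, y in zip(batch_x, batch_y):
        scores = [sum(x[i] * W[i][k] for i in range(n_in)) + b[k]
                  for k in range(n_out)]
        p = softmax(scores)
        cost -= math.log(p[y]) / n
        for k in range(n_out):
            delta = (p[k] - (1.0 if k == y else 0.0)) / n
            g_b[k] += delta
            for i in range(n_in):
                g_W[i][k] += x[i] * delta
    # parameter <- parameter - learning_rate * gradient
    for k in range(n_out):
        b[k] -= learning_rate * g_b[k]
        for i in range(n_in):
            W[i][k] -= learning_rate * g_W[i][k]
    return cost


# Toy data: 4 examples, 2 features, 2 classes, minibatches of 2.
data_x = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
data_y = [0, 0, 1, 1]
batch_size = 2
n_batches = len(data_x) // batch_size
W = [[0.0, 0.0], [0.0, 0.0]]
b = [0.0, 0.0]
costs = []
for epoch in range(50):
    for index in range(n_batches):
        lo, hi = index * batch_size, (index + 1) * batch_size
        costs.append(sgd_step(W, b, data_x[lo:hi], data_y[lo:hi], 0.13))
```

The slice index * batch_size : (index + 1) * batch_size is the same indexing that the givens dictionaries use to select a minibatch in the full code.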

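The last block of code is the main training loop, which checks the validation error at regular intervals and stops early once it has stopped improving.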
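
The early-stopping bookkeeping in that loop is easy to lose among the print statements, so here is a stripped-down sketch of it. Instead of training a model, it replays a precomputed list of per-epoch validation losses; the function name, the loss values, and the small patience of 30 are hypothetical, while the variable names and the update rule mirror the code in Section 3.

```python
def train_with_patience(losses_per_epoch, n_train_batches, patience=5000,
                        patience_increase=2, improvement_threshold=0.995):
    """Replay the patience logic; return (best_loss, last_epoch_run)."""
    validation_frequency = min(n_train_batches, patience // 2)
    best_loss = float('inf')
    for epoch in range(1, len(losses_per_epoch) + 1):
        for minibatch_index in range(n_train_batches):
            iteration = (epoch - 1) * n_train_batches + minibatch_index
            if (iteration + 1) % validation_frequency == 0:
                loss = losses_per_epoch[epoch - 1]
                if loss < best_loss:
                    # a large enough improvement extends the deadline
                    if loss < best_loss * improvement_threshold:
                        patience = max(patience,
                                       iteration * patience_increase)
                    best_loss = loss
            if patience <= iteration:
                return best_loss, epoch
    return best_loss, len(losses_per_epoch)


# The loss improves for three epochs and then plateaus.
losses = [0.5, 0.4, 0.39, 0.39, 0.39, 0.39, 0.39, 0.39]
result = train_with_patience(losses, n_train_batches=10, patience=30)
# result == (0.39, 6): the last significant improvement at iteration 29
# pushes patience to 58, and the loop stops when iteration reaches 58.
```

Doubling the iteration count on each significant improvement means training always continues for at least as long again as it took to reach the current best model.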

3. Python implementation

r"""
This tutorial introduces logistic regression using Theano and stochastic
gradient descent.

Logistic regression is a probabilistic, linear classifier. It is parametrized
by a weight matrix :math:`W` and a bias vector :math:`b`. Classification is
done by projecting data points onto a set of hyperplanes, the distance to
which is used to determine a class membership probability.

Mathematically, this can be written as:

.. math::
  P(Y=i|x, W,b) &= softmax_i(W x + b) \\
                &= \frac {e^{W_i x + b_i}} {\sum_j e^{W_j x + b_j}}


The output of the model or prediction is then done by taking the argmax of
the vector whose i'th element is P(Y=i|x).

.. math::

  y_{pred} = argmax_i P(Y=i|x,W,b)


This tutorial presents a stochastic gradient descent optimization method
suitable for large datasets.


References:

    - textbooks: "Pattern Recognition and Machine Learning" -
                 Christopher M. Bishop, section 4.3.2

"""
__docformat__ = 'restructuredtext en'

import cPickle
import gzip
import os
import sys
import timeit

import numpy

import theano
import theano.tensor as T


class LogisticRegression(object):
    """Multi-class Logistic Regression Class

    The logistic regression is fully described by a weight matrix :math:`W`
    and bias vector :math:`b`. Classification is done by projecting data
    points onto a set of hyperplanes, the distance to which is used to
    determine a class membership probability.
    """

    def __init__(self, input, n_in, n_out):
        """ Initialize the parameters of the logistic regression

        :type input: theano.tensor.TensorType
        :param input: symbolic variable that describes the input of the
                      architecture (one minibatch)

        :type n_in: int
        :param n_in: number of input units, the dimension of the space in
                     which the datapoints lie

        :type n_out: int
        :param n_out: number of output units, the dimension of the space in
                      which the labels lie

        """
        # start-snippet-1
        # initialize with 0 the weights W as a matrix of shape (n_in, n_out)
        self.W = theano.shared(
            value=numpy.zeros(
                (n_in, n_out),
                dtype=theano.config.floatX
            ),
            name='W',
            borrow=True
        )
        # initialize the biases b as a vector of n_out 0s
        self.b = theano.shared(
            value=numpy.zeros(
                (n_out,),
                dtype=theano.config.floatX
            ),
            name='b',
            borrow=True
        )

        # symbolic expression for computing the matrix of class-membership
        # probabilities
        # Where:
        # W is a matrix where column k represents the separation hyperplane
        # for class k
        # x is a matrix where row j represents input training sample j
        # b is a vector where element k represents the free parameter of
        # hyperplane k
        self.p_y_given_x = T.nnet.softmax(T.dot(input, self.W) + self.b)

        # symbolic description of how to compute prediction as class whose
        # probability is maximal
        self.y_pred = T.argmax(self.p_y_given_x, axis=1)
        # end-snippet-1

        # parameters of the model
        self.params = [self.W, self.b]

        # keep track of model input
        self.input = input

    def negative_log_likelihood(self, y):
        r"""Return the mean of the negative log-likelihood of the prediction
        of this model under a given target distribution.

        .. math::

            \frac{1}{|\mathcal{D}|} \mathcal{L} (\theta=\{W,b\}, \mathcal{D}) =
            \frac{1}{|\mathcal{D}|} \sum_{i=0}^{|\mathcal{D}|-1}
                \log(P(Y=y^{(i)}|x^{(i)}, W,b)) \\
            \ell (\theta=\{W,b\}, \mathcal{D}) =
            -\frac{1}{|\mathcal{D}|} \mathcal{L} (\theta=\{W,b\}, \mathcal{D})

        :type y: theano.tensor.TensorType
        :param y: corresponds to a vector that gives for each example the
                  correct label

        Note: we use the mean instead of the sum so that
              the learning rate is less dependent on the batch size
        """
        # start-snippet-2
        # y.shape[0] is (symbolically) the number of rows in y, i.e. the
        # number of examples (call it n) in the minibatch.
        # T.arange(y.shape[0]) is a symbolic vector containing
        # [0, 1, 2, ..., n-1].
        # T.log(self.p_y_given_x) is a matrix of log-probabilities (call it
        # LP) with one row per example and one column per class.
        # LP[T.arange(y.shape[0]), y] is a vector v containing
        # [LP[0, y[0]], LP[1, y[1]], ..., LP[n-1, y[n-1]]].
        # T.mean(LP[T.arange(y.shape[0]), y]) is the mean (across minibatch
        # examples) of the elements in v, i.e. the mean log-likelihood
        # across the minibatch.
        return -T.mean(T.log(self.p_y_given_x)[T.arange(y.shape[0]), y])
        # end-snippet-2

    def errors(self, y):
        """Return a float representing the number of errors in the minibatch
        over the total number of examples of the minibatch ; zero one
        loss over the size of the minibatch

        :type y: theano.tensor.TensorType
        :param y: corresponds to a vector that gives for each example the
                  correct label
        """

        # check if y has same dimension of y_pred
        if y.ndim != self.y_pred.ndim:
            raise TypeError(
                'y should have the same shape as self.y_pred',
                ('y', y.type, 'y_pred', self.y_pred.type)
            )
        # check if y is of the correct datatype
        if y.dtype.startswith('int'):
            # the T.neq operator returns a vector of 0s and 1s, where 1
            # represents a mistake in prediction
            return T.mean(T.neq(self.y_pred, y))
        else:
            raise NotImplementedError()


def load_data(dataset):
    ''' Loads the dataset

    :type dataset: string
    :param dataset: the path to the dataset (here MNIST)
    '''

    #############
    # LOAD DATA #
    #############

    # Download the MNIST dataset if it is not present
    data_dir, data_file = os.path.split(dataset)
    if data_dir == "" and not os.path.isfile(dataset):
        # Check if dataset is in the data directory.
        new_path = os.path.join(
            os.path.split(__file__)[0],
            "..",
            "data",
            dataset
        )
        if os.path.isfile(new_path) or data_file == 'mnist.pkl.gz':
            dataset = new_path

    if (not os.path.isfile(dataset)) and data_file == 'mnist.pkl.gz':
        import urllib
        origin = (
            'http://www.iro.umontreal.ca/~lisa/deep/data/mnist/mnist.pkl.gz'
        )
        print 'Downloading data from %s' % origin
        urllib.urlretrieve(origin, dataset)

    print '... loading data'

    # Load the dataset
    f = gzip.open(dataset, 'rb')
    train_set, valid_set, test_set = cPickle.load(f)
    f.close()
    # train_set, valid_set, test_set format: tuple(input, target)
    # input is a numpy.ndarray of 2 dimensions (a matrix) in which each
    # row corresponds to an example. target is a numpy.ndarray of
    # 1 dimension (a vector) that has the same length as the number of
    # rows in the input. It gives the target for the example with the
    # same index in the input.

    def shared_dataset(data_xy, borrow=True):
        """ Function that loads the dataset into shared variables

        The reason we store our dataset in shared variables is to allow
        Theano to copy it into the GPU memory (when code is run on GPU).
        Since copying data into the GPU is slow, copying a minibatch every
        time it is needed (the default behaviour if the data is not in a
        shared variable) would lead to a large decrease in performance.
        """
        data_x, data_y = data_xy
        shared_x = theano.shared(numpy.asarray(data_x,
                                               dtype=theano.config.floatX),
                                 borrow=borrow)
        shared_y = theano.shared(numpy.asarray(data_y,
                                               dtype=theano.config.floatX),
                                 borrow=borrow)
        # When storing data on the GPU it has to be stored as floats
        # therefore we will store the labels as ``floatX`` as well
        # (``shared_y`` does exactly that). But during our computations
        # we need them as ints (we use labels as index, and if they are
        # floats it doesn't make sense) therefore instead of returning
        # ``shared_y`` we will have to cast it to int. This little hack
        # lets us get around this issue
        return shared_x, T.cast(shared_y, 'int32')

    test_set_x, test_set_y = shared_dataset(test_set)
    valid_set_x, valid_set_y = shared_dataset(valid_set)
    train_set_x, train_set_y = shared_dataset(train_set)

    rval = [(train_set_x, train_set_y), (valid_set_x, valid_set_y),
            (test_set_x, test_set_y)]
    return rval


def sgd_optimization_mnist(learning_rate=0.13, n_epochs=1000,
                           dataset='mnist.pkl.gz',
                           batch_size=600):
    """
    Demonstrate stochastic gradient descent optimization of a log-linear
    model

    This is demonstrated on MNIST.

    :type learning_rate: float
    :param learning_rate: learning rate used (factor for the stochastic
                          gradient)

    :type n_epochs: int
    :param n_epochs: maximal number of epochs to run the optimizer

    :type dataset: string
    :param dataset: the path of the MNIST dataset file from
                 http://www.iro.umontreal.ca/~lisa/deep/data/mnist/mnist.pkl.gz

    """
    datasets = load_data(dataset)

    train_set_x, train_set_y = datasets[0]
    valid_set_x, valid_set_y = datasets[1]
    test_set_x, test_set_y = datasets[2]

    # compute number of minibatches for training, validation and testing
    # (floor division, so the counts stay integers and can be passed to
    # xrange)
    n_train_batches = train_set_x.get_value(borrow=True).shape[0] // batch_size
    n_valid_batches = valid_set_x.get_value(borrow=True).shape[0] // batch_size
    n_test_batches = test_set_x.get_value(borrow=True).shape[0] // batch_size

    ######################
    # BUILD ACTUAL MODEL #
    ######################
    print '... building the model'

    # allocate symbolic variables for the data
    index = T.lscalar()  # index to a [mini]batch

    # generate symbolic variables for input (x and y represent a
    # minibatch)
    x = T.matrix('x')  # data, presented as rasterized images
    y = T.ivector('y')  # labels, presented as 1D vector of [int] labels

    # construct the logistic regression class
    # Each MNIST image has size 28*28
    classifier = LogisticRegression(input=x, n_in=28 * 28, n_out=10)

    # the cost we minimize during training is the negative log likelihood of
    # the model in symbolic format
    cost = classifier.negative_log_likelihood(y)

    # compiling a Theano function that computes the mistakes that are made by
    # the model on a minibatch
    test_model = theano.function(
        inputs=[index],
        outputs=classifier.errors(y),
        givens={
            x: test_set_x[index * batch_size: (index + 1) * batch_size],
            y: test_set_y[index * batch_size: (index + 1) * batch_size]
        }
    )

    validate_model = theano.function(
        inputs=[index],
        outputs=classifier.errors(y),
        givens={
            x: valid_set_x[index * batch_size: (index + 1) * batch_size],
            y: valid_set_y[index * batch_size: (index + 1) * batch_size]
        }
    )

    # compute the gradient of cost with respect to theta = (W,b)
    g_W = T.grad(cost=cost, wrt=classifier.W)
    g_b = T.grad(cost=cost, wrt=classifier.b)

    # start-snippet-3
    # specify how to update the parameters of the model as a list of
    # (variable, update expression) pairs.
    updates = [(classifier.W, classifier.W - learning_rate * g_W),
               (classifier.b, classifier.b - learning_rate * g_b)]

    # compiling a Theano function `train_model` that returns the cost and,
    # at the same time, updates the parameters of the model based on the
    # rules defined in `updates`
    train_model = theano.function(
        inputs=[index],
        outputs=cost,
        updates=updates,
        givens={
            x: train_set_x[index * batch_size: (index + 1) * batch_size],
            y: train_set_y[index * batch_size: (index + 1) * batch_size]
        }
    )
    # end-snippet-3

    ###############
    # TRAIN MODEL #
    ###############
    print '... training the model'
    # early-stopping parameters
    patience = 5000  # look at this many iterations (minibatches) regardless
    patience_increase = 2  # wait this much longer when a new best is
                           # found
    improvement_threshold = 0.995  # a relative improvement of this much is
                                   # considered significant
    # go through this many minibatches before checking the network on the
    # validation set; in this case we check every epoch
    validation_frequency = min(n_train_batches, patience // 2)

    best_validation_loss = numpy.inf
    test_score = 0.
    start_time = timeit.default_timer()

    done_looping = False
    epoch = 0
    while (epoch < n_epochs) and (not done_looping):
        epoch = epoch + 1
        for minibatch_index in xrange(n_train_batches):

            minibatch_avg_cost = train_model(minibatch_index)
            # iteration number
            iter = (epoch - 1) * n_train_batches + minibatch_index

            if (iter + 1) % validation_frequency == 0:
                # compute zero-one loss on validation set
                validation_losses = [validate_model(i)
                                     for i in xrange(n_valid_batches)]
                this_validation_loss = numpy.mean(validation_losses)

                print(
                    'epoch %i, minibatch %i/%i, validation error %f %%' %
                    (
                        epoch,
                        minibatch_index + 1,
                        n_train_batches,
                        this_validation_loss * 100.
                    )
                )

                # if we got the best validation score until now
                if this_validation_loss < best_validation_loss:
                    #improve patience if loss improvement is good enough
                    if this_validation_loss < best_validation_loss *  \
                       improvement_threshold:
                        patience = max(patience, iter * patience_increase)

                    best_validation_loss = this_validation_loss
                    # test it on the test set

                    test_losses = [test_model(i)
                                   for i in xrange(n_test_batches)]
                    test_score = numpy.mean(test_losses)

                    print(
                        (
                            '     epoch %i, minibatch %i/%i, test error of'
                            ' best model %f %%'
                        ) %
                        (
                            epoch,
                            minibatch_index + 1,
                            n_train_batches,
                            test_score * 100.
                        )
                    )

                    # save the best model
                    with open('best_model.pkl', 'wb') as f:
                        cPickle.dump(classifier, f)

            if patience <= iter:
                done_looping = True
                break

    end_time = timeit.default_timer()
    print(
        (
            'Optimization complete with best validation score of %f %%, '
            'with test performance %f %%'
        )
        % (best_validation_loss * 100., test_score * 100.)
    )
    print 'The code ran for %d epochs, with %f epochs/sec' % (
        epoch, 1. * epoch / (end_time - start_time))
    print >> sys.stderr, ('The code for file ' +
                          os.path.split(__file__)[1] +
                          ' ran for %.1fs' % ((end_time - start_time)))


def predict():
    """
    An example of how to load a trained model and use it
    to predict labels.
    """

    # load the saved model; pickle files should be opened in binary mode
    with open('best_model.pkl', 'rb') as f:
        classifier = cPickle.load(f)

    # compile a predictor function
    predict_model = theano.function(
        inputs=[classifier.input],
        outputs=classifier.y_pred)

    # We can test it on some examples from the test set
    dataset = 'mnist.pkl.gz'
    datasets = load_data(dataset)
    test_set_x, test_set_y = datasets[2]
    test_set_x = test_set_x.get_value()

    predicted_values = predict_model(test_set_x[:10])
    print ("Predicted values for the first 10 examples in test set:")
    print predicted_values


if __name__ == '__main__':
    sgd_optimization_mnist()
