Omni-Space

Tensorflow学习

github地址：https://github.com/lawlite19/MachineLearning_TensorFlow

一、TensorFlow介绍

1、什么是TensorFlow

官网：https://www.tensorflow.org/
TensorFlow是Google开发的一款神经网络的Python外部的结构包, 也是一个采用数据流图来进行数值计算的开源软件库.
先绘制计算结构图, 也可以称是一系列可人机交互的计算操作, 然后把编辑好的Python文件转换成更高效的C++, 并在后端进行计算.

2、TensorFlow强大之处

擅长的任务就是训练深度神经网络
快速的入门神经网络,大大降低了深度学习（也就是深度神经网络）的开发成本和开发难度
TensorFlow 的开源性, 让所有人都能使用并且维护

3、安装TensorFlow

暂不支持Windows下安装TensorFlow,可以在虚拟机里使用或者安装Docker安装
这里在CentOS6.5下进行安装

安装Python2.7，默认CentOS中安装的是Python2.6

先安装zlib的依赖，下面安装easy_install时会用到

1 2	yum install zlib yum install zlib-devel

在安装openssl的依赖，下面安装pip时会用到

1 2	yum install openssl yum install openssl-devel

下载安装包，我传到github上的安装包，https协议后面加上--no-check-certificate，：

1	wget https://raw.githubusercontent.com/lawlite19/LinuxSoftware/master/python/Python-2.7.12.tgz --no-check-certificate

解压缩：tar -zxvf xxx
进入，配置：./configure --prefix=/usr/local/python2.7
编译并安装：make && make install
创建链接来使系统默认python变为python2.7,
ln -fs /usr/local/python2.7/bin/python2.7 /usr/bin/python

修改一下yum，因为yum的执行文件还是需要原来的python2.6,vim /usr/bin/yum

1	#!/usr/bin/python

修改为系统原有的python版本地址

1	#!/usr/bin/python2.6

安装easy_install
- 下载：wget https://raw.githubusercontent.com/lawlite19/LinuxSoftware/blob/master/python/setuptools-26.1.1.tar.gz --no-check-certificate
- 解压缩：tar -zxvf xxx
- python setup.py build #注意这里python是新的python2.7
- python setup.py install
- 到/usr/local/python2.7/bin目录下查看就会看到easy_install了
- 创建一个软连接：ln -s /usr/local/python2.7/bin/easy_install /usr/local/bin/easy_install
- 就可以使用easy_install 包名 进行安装
安装pip
- 下载:
- 解压缩：tar -zxvf xxx
- 安装：python setup.py install
- 到/usr/local/python2.7/bin目录下查看就会看到pip了
- 同样创建软连接：ln -s /usr/local/python2.7/bin/pip /usr/local/bin/pip
- 就可以使用pip install 包名进行安装包了
安装wingIDE
- 默认安装到/usr/local/lib下，进入，执行./wing命令即可执行
- 创建软连接：ln -s /usr/local/lib/wingide5.1/wing /usr/local/bin/wing
- 破解：

[另]安装VMwareTools，可以在windows和Linux之间复制粘贴
- 启动CentOS
- 选择VMware中的虚拟机–>安装VMware Tools
- 会自动弹出VMware Tools的文件夹
- 拷贝一份到root目录下 cp VMwareTools-9.9.3-2759765.tar.gz /root
- 解压缩 tar -zxvf VMwareTools-9.9.3-2759765.tar.gz
- 进入目录执行，vmware-install.pl，一路回车下去即可
- 重启CentOS即可
安装numpy
- 直接安装没有出错
安装scipy
- 安装依赖：yum install bzip2-devel pcre-devel ncurses-devel readline-devel tk-devel gcc-c++ lapack-devel
- 安装即可：pip install scipy

安装matplotlib

安装依赖：yum install libpng-devel
安装即可：pip install matplotlib

运行可能有以下的错误：

1	ImportError: No module named _tkinter

安装：tcl8.5.9-src.tar.gz

进入安装即可,./confgiure make make install
安装：tk8.5.9-src.tar.gz
进入安装即可。
[注意]要重新安装一下Pyhton2.7才能链接到tkinter

安装scikit-learn

直接安装没有出错，但是缺少包bz2

将系统中python2.6的bz2复制到python2.7对应文件夹下

1	cp /usr/lib/python2.6/lib-dynload/bz2.so /usr/local/python2.7/lib/python2.7/lib-dynload

安装TensorFlow

官网点击

选择对应的版本

         
         
           
           
           
           
            
            
            
            
          
           # Ubuntu/Linux 64-bit, CPU only, Python 2.7
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          $ export TF_BINARY_URL=https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-0.12.0rc0-cp27-none-linux_x86_64.whl
         
         
           
           
           
           
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          # Ubuntu/Linux 64-bit, GPU enabled, Python 2.7
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          # Requires CUDA toolkit 8.0 and CuDNN v5. For other versions, see "Installing from sources" below.
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          $ export TF_BINARY_URL=https://storage.googleapis.com/tensorflow/linux/gpu/tensorflow_gpu-0.12.0rc0-cp27-none-linux_x86_64.whl
         
         
           
           
           
           
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          # Mac OS X, CPU only, Python 2.7:
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          $ export TF_BINARY_URL=https://storage.googleapis.com/tensorflow/mac/cpu/tensorflow-0.12.0rc0-py2-none-any.whl
         
         
           
           
           
           
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          # Mac OS X, GPU enabled, Python 2.7:
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          $ export TF_BINARY_URL=https://storage.googleapis.com/tensorflow/mac/gpu/tensorflow_gpu-0.12.0rc0-py2-none-any.whl
         
         
           
           
           
           
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          # Ubuntu/Linux 64-bit, CPU only, Python 3.4
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          $ export TF_BINARY_URL=https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-0.12.0rc0-cp34-cp34m-linux_x86_64.whl
         
         
           
           
           
           
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          # Ubuntu/Linux 64-bit, GPU enabled, Python 3.4
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          # Requires CUDA toolkit 8.0 and CuDNN v5. For other versions, see "Installing from sources" below.
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          $ export TF_BINARY_URL=https://storage.googleapis.com/tensorflow/linux/gpu/tensorflow_gpu-0.12.0rc0-cp34-cp34m-linux_x86_64.whl
         
         
           
           
           
           
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          # Ubuntu/Linux 64-bit, CPU only, Python 3.5
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          $ export TF_BINARY_URL=https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-0.12.0rc0-cp35-cp35m-linux_x86_64.whl
         
         
           
           
           
           
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          # Ubuntu/Linux 64-bit, GPU enabled, Python 3.5
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          # Requires CUDA toolkit 8.0 and CuDNN v5. For other versions, see "Installing from sources" below.
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          $ export TF_BINARY_URL=https://storage.googleapis.com/tensorflow/linux/gpu/tensorflow_gpu-0.12.0rc0-cp35-cp35m-linux_x86_64.whl
         
         
           
           
           
           
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          # Mac OS X, CPU only, Python 3.4 or 3.5:
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          $ export TF_BINARY_URL=https://storage.googleapis.com/tensorflow/mac/cpu/tensorflow-0.12.0rc0-py3-none-any.whl
         
         
           
           
           
           
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          # Mac OS X, GPU enabled, Python 3.4 or 3.5:
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          $ export TF_BINARY_URL=https://storage.googleapis.com/tensorflow/mac/gpu/tensorflow_gpu-0.12.0rc0-py3-none-any.whl

对应python版本

         
         
           
           
           
           
            
            
            
            
          
           # Python 2
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          $ sudo pip install --upgrade $TF_BINARY_URL
         
         
           
           
           
           
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          # Python 3
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          $ sudo pip3 install --upgrade $TF_BINARY_URL

可能缺少依赖glibc,看对应提示的版本，

还有可能报错

1	ImportError: /usr/lib64/libstdc++.so.6: version `GLIBCXX_3.4.19' not found (required by /usr/local/python2.7/lib/python2.7/site-packages/tensorflow/python/_pywrap_tensorflow.so)

安装对应版本的glibc

查看现有版本的glibc, strings /lib64/libc.so.6 |grep GLIBC
下载对应版本：wget http://ftp.gnu.org/gnu/glibc/glibc-2.17.tar.gz
解压缩：tar -zxvf glibc-2.17
进入文件夹创建build文件夹cd glibc-2.17 && mkdir build

配置：

         
         
           
           
           
           
            
            
            
            
          
          ../configure  \
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
             --prefix=/usr          \
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
             --disable-profile      \
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
             --enable-add-ons       \
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
             --enable-kernel=2.6.25 \
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
             --libexecdir=/usr/lib/glibc

编译安装：make && make install
可以再用命令：strings /lib64/libc.so.6 |grep GLIBC查看

添加GLIBCXX_3.4.19的支持
- 下载：wget https://raw.githubusercontent.com/lawlite19/LinuxSoftware/master/python2.7_tensorflow/libstdc++.so.6.0.20
- 复制到/usr/lib64文件夹下：cp libstdc++.so.6.0.20 /usr/lib64/
- 添加执行权限：chmod +x /usr/lib64/libstdc++.so.6.0.20
- 删除原来的：rm -rf /usr/lib64/libstdc++.so.6
- 创建软连接：ln -s /usr/lib64/libstdc++.so.6.0.20 /usr/lib64/libstdc++.so.6
- 可以查看是否有个版本：strings /usr/lib64/libstdc++.so.6 | grep GLIBCXX

运行还可能报错编码的问题，这里安装0.10.0版本:pip install --upgrade https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-0.10.0rc0-cp27-none-linux_x86_64.whl
安装pandas
- pip install pandas没有问题

二、TensorFlow基础架构

1、处理结构

Tensorflow 首先要定义神经网络的结构,然后再把数据放入结构当中去运算和 training
TensorFlow是采用数据流图（data　flow　graphs）来计算
首先我们得创建一个数据流流图
然后再将我们的数据（数据以张量(tensor)的形式存在）放在数据流图中计算
张量（tensor):
- 张量有多种. 零阶张量为纯量或标量 (scalar) 也就是一个数值. 比如 1
- 一阶张量为向量 (vector), 比如一维的 [1, 2, 3]
- 二阶张量为矩阵 (matrix), 比如二维的 [[1, 2, 3],[4, 5, 6],[7, 8, 9]]
- 以此类推, 还有三阶三维的 …

2、一个例子

求y=1*x+3中的权重1和偏置3

定义这个函数

1 2	x_data = np.random.rand(100).astype(np.float32) y_data = x_data*1.0+3.0

创建TensorFlow结构

         
         
           
           
           
           
            
            
            
            
          
          Weights = tf.Variable(tf.random_uniform([1], -1.0, 1.0)) # 创建变量Weight是，范围是 -1.0~1.0
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          biases = tf.Variable(tf.zeros([1]))                      # 创建偏置，初始值为0
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          y = Weights*x_data+biases                                # 定义方程
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          loss = tf.reduce_mean(tf.square(y-y_data))               # 定义损失，为真实值减去我们每一步计算的值
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          optimizer = tf.train.GradientDescentOptimizer(0.5)       # 0.5 是学习率
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          train = optimizer.minimize(loss)                         # 使用梯度下降优化
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          init = tf.initialize_all_variables()                     # 初始化所有变量

定义Session

1 2	sess = tf.Session() sess.run(init)

输出结果

         
         
           
           
           
           
            
            
            
            
          
          for i in range(201):
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
             sess.run(train)
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
             if i%20 == 0:
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                 print i,sess.run(Weights),sess.run(biases)

结果为：

       
       
         
         
         
         
          
          
          
          
        
         0 [ 1.60895896] [ 3.67376709]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        20 [ 1.04673827] [ 2.97489643]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        40 [ 1.011392] [ 2.99388123]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        60 [ 1.00277638] [ 2.99850869]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        80 [ 1.00067675] [ 2.99963641]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        100 [ 1.00016499] [ 2.99991131]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        120 [ 1.00004005] [ 2.99997854]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        140 [ 1.00000978] [ 2.99999475]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        160 [ 1.0000025] [ 2.99999857]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        180 [ 1.00000119] [ 2.99999928]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        200 [ 1.00000119] [ 2.99999928]

3、Session会话控制

运行 session.run() 可以获得你要得知的运算结果, 或者是你所要运算的部分
定义常量矩阵：tf.constant([[3,3]])
矩阵乘法：tf.matmul(matrix1,matrix2)

运行Session的两种方法：

手动关闭

         
         
           
           
           
           
            
            
            
            
          
          sess = tf.Session()
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          print sess.run(product)
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          sess.close()

使用with，执行完会自动关闭

1 2	with tf.Session() as sess: print sess.run(product)

4、`Variable`变量

定义变量：tf.Variable()
初始化所有变量：init = tf.initialize_all_variables()
需要再在 sess 里, sess.run(init) , 激活变量
输出时，一定要把 sess 的指针指向变量再进行 print 才能得到想要的结果

5、`Placeholder`传入值

首先定义Placeholder，然后在Session.run()的时候输入值

placeholder 与 feed_dict={} 是绑定在一起出现的

       
       
         
         
         
         
          
          
          
          
        
        input1 = tf.placeholder(tf.float32) #在 Tensorflow 中需要定义 placeholder 的 type ，一般为 float32 形式
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        input2 = tf.placeholder(tf.float32)
       
       
         
         
         
         
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        output = tf.mul(input1,input2)  # 乘法运算
       
       
         
         
         
         
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        with tf.Session() as sess:
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            print sess.run(output,feed_dict={input1:7.,input2:2.}) # placeholder 与 feed_dict={} 是绑定在一起出现的

三、定义一个神经网络

1、添加层函数`add_layer()`

     
     
       
       
       
       
        
        
        
        
      
      '''参数：输入数据，前一层size，当前层size，激活函数'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def add_layer(inputs,in_size,out_size,activation_function=None):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          Weights = tf.Variable(tf.random_normal([in_size,out_size]))  #随机初始化权重
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          biases = tf.Variable(tf.zeros([1,out_size]) + 0.1)  # 初始化偏置，+0.1
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          Ws_plus_b = tf.matmul(inputs,Weights) + biases      # 未使用激活函数的值
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          if activation_function is None:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              outputs = Ws_plus_b
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          else:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              outputs = activation_function(Ws_plus_b)   # 使用激活函数激活
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return outputs

2、构建神经网络

定义二次函数

       
       
         
         
         
         
          
          
          
          
        
        x_data = np.linspace(-1,1,300,dtype=np.float32)[:,np.newaxis]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        noise = np.random.normal(0,0.05,x_data.shape).astype(np.float32)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        y_data = np.square(x_data)-0.5+noise

定义Placeholder,用于后期输入数据

1 2	xs = tf.placeholder(tf.float32,[None,1]) # None代表无论输入有多少都可以,只有一个特征，所以这里是1 ys = tf.placeholder(tf.float32,[None,1])

定义神经层layer

1	layer1 = add_layer(xs, 1, 10, activation_function=tf.nn.relu) # 第一层，输入层为1，隐含层为10个神经元，Tensorflow 自带的激励函数tf.nn.relu

定义输出层

1	prediction = add_layer(layer1, 10, 1) # 利用上一层作为输入

计算loss损失

1	loss = tf.reduce_mean(tf.reduce_sum(tf.square(ys-prediction),reduction_indices=[1])) # 对二者差的平方求和再取平均

梯度下降最小化损失

1	train = tf.train.GradientDescentOptimizer(0.1).minimize(loss)

初始化所有变量

1	init = tf.initialize_all_variables()

定义Session

1 2	sess = tf.Session() sess.run(init)

输出

       
       
         
         
         
         
          
          
          
          
        
        for i in range(1000):
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            sess.run(train,feed_dict={xs:x_data,ys:y_data})
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            if i%50==0:
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                print sess.run(loss,feed_dict={xs:x_data,ys:y_data})

结果：

3、可视化结果

显示数据

       
       
         
         
         
         
          
          
          
          
        
        fig = plt.figure()
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        ax = fig.add_subplot(111)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        ax.scatter(x_data,y_data)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        plt.ion()   # 绘画之后不暂停
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        plt.show()

动态绘画

       
       
         
         
         
         
          
          
          
          
        
                try:
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                    ax.lines.remove(lines[0])   # 每次绘画需要移除上次绘画的结果，放在try catch里因为第一次执行没有，所以直接pass
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                except Exception:
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                    pass
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                prediction_value = sess.run(prediction, feed_dict={xs: x_data})
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                # plot the prediction
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                lines = ax.plot(x_data, prediction_value, 'r-', lw=3)  # 绘画
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                plt.pause(0.1)  # 停0.1s
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        ```    
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        ![enter description here][3]
       
       
         
         
         
         
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        ## 四、TensorFlow可视化
       
       
         
         
         
         
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        ### 1、TensorFlow的可视化工具`tensorboard`，可视化神经网路额结构
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        - 输入`input`

with tf.name_scope(‘input’):
xs = tf.placeholder(tf.float32,[None,1],name=’x_in’) #
ys = tf.placeholder(tf.float32,[None,1],name=’y_in’)

     
     
       
       
       
       
        
        
        
        
      
      ![enter description here][4]
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - `layer`层

def add_layer(inputs,in_size,out_size,activation_function=None):
with tf.name_scope(‘layer’):
with tf.name_scope(‘Weights’):
Weights = tf.Variable(tf.random_normal([in_size,out_size]),name=’W’)
with tf.name_scope(‘biases’):
biases = tf.Variable(tf.zeros([1,out_size]) + 0.1,name=’b’)
with tf.name_scope(‘Ws_plus_b’):
Ws_plus_b = tf.matmul(inputs,Weights) + biases
if activation_function is None: outputs = Ws_plus_b
else:
outputs = activation_function(Ws_plus_b)
return outputs

     
     
       
       
       
       
        
        
        
        
      
      ![enter description here][5]
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - `loss`和`train`

with tf.name_scope(‘loss’):
loss = tf.reduce_mean(tf.reduce_sum(tf.square(ys-prediction),reduction_indices=1))

with tf.name_scope(‘train’):
train = tf.train.GradientDescentOptimizer(0.1).minimize(loss)

     
     
       
       
       
       
        
        
        
        
      
      ![enter description here][6]
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 写入文件中

writer = tf.train.SummaryWriter(“logs/“, sess.graph)

     
     
       
       
       
       
        
        
        
        
      
      - 浏览器中查看（chrome浏览器）
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       - 在终端输入：`tensorboard --logdir='logs/'`，它会给出访问地址
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       - 浏览器中查看即可。
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       - `tensorboard`命令在安装**python**目录的**bin**目录下，可以创建一个软连接
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ### 2、可视化训练过程
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 可视化Weights权重和biases偏置
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       - 每一层起个名字

layer_name = ‘layer%s’%n_layer

1	- tf.histogram_summary(name,value)

def add_layer(inputs,in_size,out_size,n_layer,activation_function=None):
layer_name = ‘layer%s’%n_layer
with tf.name_scope(layer_name):
with tf.name_scope(‘Weights’):
Weights = tf.Variable(tf.random_normal([in_size,out_size]),name=’W’)
tf.histogram_summary(layer_name+’/weights’, Weights)
with tf.name_scope(‘biases’):
biases = tf.Variable(tf.zeros([1,out_size]) + 0.1,name=’b’)
tf.histogram_summary(layer_name+’/biases’,biases)
with tf.name_scope(‘Ws_plus_b’):
Ws_plus_b = tf.matmul(inputs,Weights) + biases

if activation_function is None:             
    outputs = Ws_plus_b 
else:                                                         
    outputs = activation_function(Ws_plus_b)      
tf.histogram_summary(layer_name+'/outputs',outputs)
return outputs

1	- merge所有的summary

merged =tf.merge_all_summaries()

writer = tf.train.SummaryWriter(“logs/“, sess.graph)

1	- 训练1000次，每50步显示一次：

for i in range(1000):
sess.run(train,feed_dict={xs:x_data,ys:y_data})
if i%50==0:
summary = sess.run(merged, feed_dict={xs: x_data, ys:y_data})
writer.add_summary(summary, i)

     
     
       
       
       
       
        
        
        
        
      
       - 同样适用`tensorboard`查看   
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       ![enter description here][7]
     
     
       
       
       
       
     
     
       
       
       
        
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 可视化损失函数（代价函数）
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       - 添加：`tf.scalar_summary('loss',loss)`    
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       ![enter description here][8]
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ## 五、手写数字识别_1
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ### 1、说明
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - [全部代码](https://github.com/lawlite19/MachineLearning_TensorFlow/blob/master/Mnist_01/mnist.py)：`https://github.com/lawlite19/MachineLearning_TensorFlow/blob/master/Mnist_02/mnist.py`
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 自己的数据集，没有使用tensorflow中mnist数据集，
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 之前在机器学习中用Python实现过，地址：`https://github.com/lawlite19/MachineLearning_Python`,这里使用`tensorflow`实现
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 神经网络只有两层
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ### 2、代码实现
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 添加一层

‘’’添加一层神经网络’’’
def add_layer(inputs,in_size,out_size,activation_function=None):
Weights = tf.Variable(tf.random_normal([in_size,out_size])) # 权重，in*out
biases = tf.Variable(tf.zeros([1,out_size]) + 0.1)
Ws_plus_b = tf.matmul(inputs,Weights) + biases # 计算权重和偏置之后的值
if activation_function is None:
outputs = Ws_plus_b
else:
outputs = activation_function(Ws_plus_b) # 调用激励函数运算
return outputs

‘’’运行函数’’’
def NeuralNetwork():
data_digits = spio.loadmat(‘data_digits.mat’)
X = data_digits[‘X’]
y = data_digits[‘y’]
m,n = X.shape
class_y = np.zeros((m,10)) # y是0,1,2,3…9,需要映射0/1形式
for i in range(10):
class_y[:,i] = np.float32(y==i).reshape(1,-1)

xs = tf.placeholder(tf.float32, shape=[None,400])  # 像素是20x20=400，所以有400个feature
ys = tf.placeholder(tf.float32, shape=[None,10])   # 输出有10个

prediction = add_layer(xs, 400, 10, activation_function=tf.nn.softmax) # 两层神经网络，400x10
#prediction = add_layer(layer1, 25, 10, activation_function=tf.nn.softmax)

#loss = tf.reduce_mean(tf.reduce_sum(tf.square(ys-prediction),reduction_indices=[1]))
loss = tf.reduce_mean(-tf.reduce_sum(ys*tf.log(prediction),reduction_indices=[1]))  # 定义损失函数（代价函数），
train = tf.train.GradientDescentOptimizer(learning_rate=0.5).minimize(loss)     # 使用梯度下降最小化损失
init = tf.initialize_all_variables()   # 初始化所有变量

sess = tf.Session()  # 创建Session
sess.run(init)

for i in range(4000): # 迭代训练4000次
    sess.run(train, feed_dict={xs:X,ys:class_y})  # 训练train，填入数据
    if i%50==0:  # 每50次输出当前的准确度
        print(compute_accuracy(xs,ys,X,class_y,sess,prediction))

‘’’计算预测准确度’’’
def compute_accuracy(xs,ys,X,y,sess,prediction):
y_pre = sess.run(prediction,feed_dict={xs:X})
correct_prediction = tf.equal(tf.argmax(y_pre,1),tf.argmax(y,1)) #tf.argmax 给出某个tensor对象在某一维上的其数据最大值所在的索引值,即为对应的数字，tf.equal 来检测我们的预测是否真实标签匹配
accuracy = tf.reduce_mean(tf.cast(correct_prediction,tf.float32)) # 平均值即为准确度
result = sess.run(accuracy,feed_dict={xs:X,ys:y})
return result

     
     
       
       
       
       
        
        
        
        
      
      - 输出每一次预测的结果准确度    
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ![enter description here][9]
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ## 六、手写数字识别_2
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ### 1、说明
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - [全部代码](https://github.com/lawlite19/MachineLearning_TensorFlow/blob/master/Mnist_02/mnist.py)：`https://github.com/lawlite19/MachineLearning_TensorFlow/blob/master/Mnist_02/mnist.py`
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 采用TensorFlow中的mnist数据集（可以取网站下载它的数据集，http://yann.lecun.com/exdb/mnist/）
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 实现代码与上面类似，它有专门的测试集
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ### 2、代码
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 随机梯度下降`SGD`,每次选出`100`个数据进行训练

for i in range(2000):
batch_xs, batch_ys = minist.train.next_batch(100)
sess.run(train_step,feed_dict={xs:batch_xs,ys:batch_ys})
if i%50==0:
print(compute_accuracy(xs,ys,minist.test.images, minist.test.labels,sess,prediction))

     
     
       
       
       
       
        
        
        
        
      
      - 输出每一次预测的结果准确度     
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ![enter description here][10]
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ## 七、手写数字识别_3_CNN卷积神经网络
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ### 1、说明
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 关于**卷积神经网络CNN**可以查看[我的博客](http://blog.csdn.net/u013082989/article/details/53673602)：http://blog.csdn.net/u013082989/article/details/53673602
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       - 或者[github](https://github.com/lawlite19/DeepLearning_Python)：https://github.com/lawlite19/DeepLearning_Python
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - [全部代码](https://github.com/lawlite19/MachineLearning_TensorFlow/blob/master/Mnist_03_CNN/mnist_cnn.py)：`https://github.com/lawlite19/MachineLearning_TensorFlow/blob/master/Mnist_03_CNN/mnist_cnn.py`
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 采用TensorFlow中的mnist数据集（可以取网站下载它的数据集，http://yann.lecun.com/exdb/mnist/）
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ### 2、代码实现
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 权重和偏置初始化函数
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       - 权重使用的`truncated_normal`进行初始化,`stddev`标准差定义为0.1
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       - 偏置初始化为常量0.1

‘’’权重初始化函数’’’
def weight_variable(shape):
inital = tf.truncated_normal(shape, stddev=0.1) # 使用truncated_normal进行初始化
return tf.Variable(inital)

‘’’偏置初始化函数’’’
def bias_variable(shape):
inital = tf.constant(0.1,shape=shape) # 偏置定义为常量
return tf.Variable(inital)

     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 卷积函数
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       - `strides[0]`和`strides[3]`的两个1是默认值，中间两个1代表padding时在x方向运动1步，y方向运动1步
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       - `padding='SAME'`代表经过卷积之后的输出图像和原图像大小一样

‘’’卷积函数’’’
def conv2d(x,W):#x是图片的所有参数，W是此卷积层的权重
return tf.nn.conv2d(x,W,strides=[1,1,1,1],padding=’SAME’)#strides[0]和strides3的两个1是默认值，中间两个1代表padding时在x方向运动1步，y方向运动1步

     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 池化函数
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       - `ksize`指定池化核函数的大小
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       - 根据池化核函数的大小定义`strides`的大小

‘’’池化函数’’’
def max_pool_2x2(x):
return tf.nn.max_pool(x,ksize=[1,2,2,1],
strides=[1,2,2,1], padding=’SAME’)#池化的核函数大小为2x2，因此ksize=[1,2,2,1]，步长为2，因此strides=[1,2,2,1]

     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 加载`mnist`数据和定义`placeholder`
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       - 输入数据`x_image`最后一个`1`代表`channel`的数量,若是`RGB`3个颜色通道就定义为3
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       - `keep_prob` 用于**dropout**防止过拟合

mnist = input_data.read_data_sets('MNIST_data', one_hot=True)  # 下载数据

xs = tf.placeholder(tf.float32,[None,784])  # 输入图片的大小，28x28=784
ys = tf.placeholder(tf.float32,[None,10])   # 输出0-9共10个数字
keep_prob = tf.placeholder(tf.float32)      # 用于接收dropout操作的值，dropout为了防止过拟合
x_image = tf.reshape(xs,[-1,28,28,1])       #-1代表先不考虑输入的图片例子多少这个维度，后面的1是channel的数量，因为我们输入的图片是黑白的，因此channel是1，例如如果是RGB图像，那么channel就是3

     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 第一层卷积和池化
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
        - 使用**ReLu**激活函数

'''第一层卷积，池化'''
W_conv1 = weight_variable([5,5,1,32])  # 卷积核定义为5x5,1是输入的通道数目，32是输出的通道数目
b_conv1 = bias_variable([32])          # 每个输出通道对应一个偏置
h_conv1 = tf.nn.relu(conv2d(x_image,W_conv1)+b_conv1) # 卷积运算，并使用ReLu激活函数激活
h_pool1 = max_pool_2x2(h_conv1)        # pooling操作

'''第二层卷积，池化'''
W_conv2 = weight_variable([5,5,32,64]) # 卷积核还是5x5,32个输入通道，64个输出通道
b_conv2 = bias_variable([64])          # 与输出通道一致
h_conv2 = tf.nn.relu(conv2d(h_pool1, W_conv2)+b_conv2)
h_pool2 = max_pool_2x2(h_conv2)

'''全连接层'''
h_pool2_flat = tf.reshape(h_pool2, [-1,7*7*64])   # 将最后操作的数据展开
W_fc1 = weight_variable([7*7*64,1024])            # 下面就是定义一般神经网络的操作了，继续扩大为1024
b_fc1 = bias_variable([1024])                     # 对应的偏置
h_fc1 = tf.nn.relu(tf.matmul(h_pool2_flat,W_fc1)+b_fc1)  # 运算、激活（这里不是卷积运算了，就是对应相乘）

     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - `dropout`防止过拟合

'''dropout'''
h_fc1_drop = tf.nn.dropout(h_fc1,keep_prob)       # dropout操作

     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 最后一层全连接预测,使用梯度下降优化**交叉熵损失函数**
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
       - 使用**softmax**分类器分类

'''最后一层全连接'''
W_fc2 = weight_variable([1024,10])                # 最后一层权重初始化
b_fc2 = bias_variable([10])                       # 对应偏置

prediction = tf.nn.softmax(tf.matmul(h_fc1_drop,W_fc2)+b_fc2)  # 使用softmax分类器
cross_entropy = tf.reduce_mean(-tf.reduce_sum(ys*tf.log(prediction),reduction_indices=[1]))  # 交叉熵损失函数来定义cost function
train_step = tf.train.AdamOptimizer(1e-3).minimize(cross_entropy)  # 调用梯度下降

     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 定义Session，使用`SGD`训练

'''下面就是tf的一般操作，定义Session，初始化所有变量，placeholder传入值训练'''
sess = tf.Session()
sess.run(tf.initialize_all_variables())

for i in range(1000):
    batch_xs, batch_ys = mnist.train.next_batch(100)  # 使用SGD，每次选取100个数据训练
    sess.run(train_step, feed_dict={xs: batch_xs, ys: batch_ys, keep_prob: 0.5})  # dropout值定义为0.5
    if i % 50 == 0:
        print compute_accuracy(xs,ys,mnist.test.images, mnist.test.labels,keep_prob,sess,prediction)  # 每50次输出一下准确度

1 2	- 计算准确度函数 - 和上面的两个计算准确度的函数一致，就是多了个dropout的参数`keep_prob`

‘’’计算准确度函数’’’
def compute_accuracy(xs,ys,X,y,keep_prob,sess,prediction):
y_pre = sess.run(prediction,feed_dict={xs:X,keep_prob:1.0}) # 预测，这里的keep_prob是dropout时用的，防止过拟合
correct_prediction = tf.equal(tf.argmax(y_pre,1),tf.argmax(y,1)) #tf.argmax 给出某个tensor对象在某一维上的其数据最大值所在的索引值,即为对应的数字，tf.equal 来检测我们的预测是否真实标签匹配
accuracy = tf.reduce_mean(tf.cast(correct_prediction,tf.float32)) # 平均值即为准确度
result = sess.run(accuracy,feed_dict={xs:X,ys:y,keep_prob:1.0})
return result

     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ### 3、运行结果
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 测试集上准确度   
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ![enter description here][11]   
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 使用`top`命令查看占用的CPU和内存，还是很消耗CPU和内存的，所以上面只输出了四次我就终止了
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ![enter description here][12]   
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 由于我在虚拟机里运行的`TensorFlow`程序，分配了`5G`的内存，若是内存不够会报一个错误。
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      -------------------------------------------------------------
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ## 八、保存和提取神经网络
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ### 1、保存
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 定义要保存的数据

W = tf.Variable(initial_value=[[1,2,3],[3,4,5]],
name=’weights’, dtype=tf.float32) # 注意需要指定name和dtype
b = tf.Variable(initial_value=[1,2,3],
name=’biases’, dtype=tf.float32)
init = tf.initialize_all_variables()

saver = tf.train.Saver()
with tf.Session() as sess:
sess.run(init)
save_path = saver.save(sess, ‘my_network/save_net.ckpt’) # 保存目录，注意要在当前项目下建立my_network的目录
print (‘保存到 :’,save_path)

1 2	### 2、提取 - 定义数据

W = tf.Variable(np.arange(6).reshape((2,3)),
name=’weights’, dtype=tf.float32) # 注意与之前保存的一致
b = tf.Variable(np.arange((3)),
name=’biases’, dtype=tf.float32)

1	- `restore`提取

saver = tf.train.Saver()
with tf.Session() as sess:
saver.restore(sess,’my_network/save_net.ckpt’)
print(‘weights:’,sess.run(W)) # 输出一下结果
print(‘biases:’,sess.run(b))

     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      -------------------------------------------------
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 以下来自`tensorflow-turorial`，使用`python3.5`
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ## 九、线性模型Linear Model
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - [全部代码][13]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 使用`MNIST`数据集
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ### 1、加载MNIST数据集，并输出信息
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ``` stylus
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      '''Load MNIST data and print some information'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      data = input_data.read_data_sets("MNIST_data", one_hot = True)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print("Size of:")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print("\t training-set:\t\t{}".format(len(data.train.labels)))
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print("\t test-set:\t\t\t{}".format(len(data.test.labels)))
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print("\t validation-set:\t{}".format(len(data.validation.labels)))
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print(data.test.labels[0:5])
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      data.test.cls = np.array([label.argmax() for label in data.test.labels])   # get the actual value
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print(data.test.cls[0:5])

2、绘制9张图像

实现函数

       
       
         
         
         
         
          
          
          
          
        
        '''define a funciton to plot 9 images'''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        def plot_images(images, cls_true, cls_pred = None):
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            '''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            @parameter images:   the images info
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            @parameter cls_true: the true value of image
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            @parameter cls_pred: the prediction value, default is None
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            '''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            assert len(images) == len(cls_true) == 9  # only show 9 images
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            fig, axes = plt.subplots(nrows=3, ncols=3)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            for i, ax in enumerate(axes.flat):
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                ax.imshow(images[i].reshape(img_shape), cmap="binary")  # binary means black_white image
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                # show the true and pred values
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                if cls_pred is None:
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                    xlabel = "True: {0}".format(cls_true[i])
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                else:
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                    xlabel = "True: {0},Pred: {1}".format(cls_true[i],cls_pred[i])
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                ax.set_xlabel(xlabel)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                ax.set_xticks([])  # remove the ticks
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                ax.set_yticks([])
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            plt.show()

选择测试集中的9张图显示

     
     
       
       
       
       
        
        
        
        
      
      '''show 9 images'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      images = data.test.images[0:9]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      cls_true = data.test.cls[0:9]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      plot_images(images, cls_true)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ```                   
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ![enter description here][14]
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ### 3、定义要训练的模型
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      - 定义`placeholder`
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ``` stylus
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      '''define the placeholder'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      X = tf.placeholder(tf.float32, [None, img_size_flat])    # None means the arbitrary number of labels, the features size is img_size_flat 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_true = tf.placeholder(tf.float32, [None, num_classes]) # output size is num_classes
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_true_cls = tf.placeholder(tf.int64, [None])

定义weights和biases

     
     
       
       
       
       
        
        
        
        
      
      '''define weights and biases'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      weights = tf.Variable(tf.zeros([img_size_flat, num_classes]))  # img_size_flat*num_classes
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      biases = tf.Variable(tf.zeros([num_classes]))

定义模型

     
     
       
       
       
       
      
      ''
      
      'define the model'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      logits = tf.matmul(X,weights) + biases 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_pred = tf
      
      .nn
      
      .softmax(logits)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_pred_cls = tf.argmax(y_pred, dimension=
      
      1)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      cross_entropy = tf
      
      .nn
      
      .softmax_cross_entropy_with_logits(labels=y_true, 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                                                             logits=logits)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      cost = tf.reduce_mean(cross_entropy)
     
     
       
       
       
       
     
     
       
       
       
       
      
      ''
      
      'define the optimizer'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      optimizer = tf
      
      .train
      
      .GradientDescentOptimizer(learning_rate=
      
      0.5).minimize(cost)

定义求准确度

     
     
       
       
       
       
      
      ''
      
      'define the accuracy'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      correct_prediction = tf.equal(y_pred_cls, y_true_cls)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))

定义session

     
     
       
       
       
       
      
      ''
      
      'run the datagraph and use batch gradient descent'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      session = tf.Session()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      session.run(tf.global_variables_initializer())
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      batch_size = 
      
      100

4、定义函数`optimize`进行bgd训练

     
     
       
       
       
       
        
        
        
        
      
      '''define a function to run the optimizer'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def optimize(num_iterations):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          '''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          @parameter num_iterations: the traning times
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          '''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          for i in range(num_iterations):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              x_batch, y_true_batch = data.train.next_batch(batch_size)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              feed_dict_train = {X: x_batch,y_true: y_true_batch}
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              session.run(optimizer, feed_dict=feed_dict_train)

5、定义输出准确度的函数

代码

       
       
         
         
         
         
          
          
          
          
        
        feed_dict_test = {X: data.test.images, 
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                          y_true: data.test.labels, 
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                          y_true_cls: data.test.cls}        
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        '''define a function to print the accuracy'''    
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        def print_accuracy():
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            acc = session.run(accuracy, feed_dict=feed_dict_test)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            print("Accuracy on test-set:{0:.1%}".format(acc))

输出：Accuracy on test-set:89.4%

6、定义绘制错误预测的图片函数

代码

       
       
         
         
         
         
          
          
          
          
        
        '''define a function to plot the error prediciton'''    
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        def plot_example_errors():
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            correct, cls_pred = session.run([correct_prediction, y_pred_cls], feed_dict=feed_dict_test) 
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            incorrect = (correct == False)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            images = data.test.images[incorrect]  # get the prediction error images
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            cls_pred = cls_pred[incorrect]        # get prediction value
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            cls_true = data.test.cls[incorrect]   # get true value
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            plot_images(images[0:9], cls_true[0:9], cls_pred[0:9])

输出：

7、定义可视化权重的函数

代码

       
       
         
         
         
         
          
          
          
          
        
        '''define a fucntion to plot weights'''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        def plot_weights():
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            w = session.run(weights)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            w_min = np.min(w)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            w_max = np.max(w)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            fig, axes = plt.subplots(3, 4)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            fig.subplots_adjust(0.3, 0.3)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            for i, ax in enumerate(axes.flat):
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                if i<10:
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                    image = w[:,i].reshape(img_shape)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                    ax.set_xlabel("Weights: {0}".format(i))
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                    ax.imshow(image, vmin=w_min,vmax=w_max,cmap="seismic")
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                ax.set_xticks([])
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                ax.set_yticks([])
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            plt.show()

输出：

8、定义输出confusion_matrix的函数

代码：

       
       
         
         
         
         
          
          
          
          
        
        '''define a function to printand plot the confusion matrix using scikit-learn.'''   
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        def print_confusion_martix():
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            cls_true = data.test.cls  # test set actual value 
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            cls_pred = session.run(y_pred_cls, feed_dict=feed_dict_test)  # test set predict value
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            cm = confusion_matrix(y_true=cls_true,y_pred=cls_pred)        # use sklearn confusion_matrix
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            print(cm)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            plt.imshow(cm, interpolation='nearest',cmap=plt.cm.Blues) # Plot the confusion matrix as an image.
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            plt.tight_layout()
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            plt.colorbar()
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            tick_marks = np.arange(num_classes)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            tick_marks = np.arange(num_classes)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            plt.xticks(tick_marks, range(num_classes))
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            plt.yticks(tick_marks, range(num_classes))
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            plt.xlabel('Predicted')
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            plt.ylabel('True')    
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            plt.show()

输出：

十：CNN

全部代码
使用MNIST数据集
加载数据，绘制9张图等函数与上面一致，readme中不再写出

1、定义CNN所需要的变量

     
     
       
       
       
       
        
        
        
        
      
      '''define cnn description'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      filter_size1 = 5     # the first conv filter size is 5x5 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      num_filters1 = 32    # there are 32 filters
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      filter_size2 = 5     # the second conv filter size
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      num_filters2 = 64    # there are 64 filters
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      fc_size = 1024       # fully-connected layer

2、初始化weights和biases的函数

     
     
       
       
       
       
        
        
        
        
      
      '''define a function to intialize weights'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def initialize_weights(shape):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          '''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          @param shape：the shape of weights
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          '''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return tf.Variable(tf.truncated_normal(shape=shape, stddev=0.1))
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      '''define a function to intialize biases'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def initialize_biases(length):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          '''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          @param length: the length of biases, which is a vector
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          '''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return tf.Variable(tf.constant(0.1,shape=[length]))

3、定义卷积操作和池化（如果使用的话）的函数

     
     
       
       
       
       
        
        
        
        
      
      '''define a function to do conv and pooling if used'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def conv_layer(input, 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                     num_input_channels,
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                     filter_size,
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                     num_output_filters,
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                     use_pooling=True):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          '''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          @param input: the input of previous layer's output
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          @param num_input_channels: input channels
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          @param filter_size: the weights filter size
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          @param num_output_filters: the output number channels
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          @param use_pooling: if use pooling operation
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          '''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          shape = [filter_size, filter_size, num_input_channels, num_output_filters]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          weights = initialize_weights(shape=shape)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          biases = initialize_biases(length=num_output_filters)   # one for each filter
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          layer = tf.nn.conv2d(input=input, filter=weights, strides=[1,1,1,1], padding='SAME')
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          layer += biases
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          if use_pooling:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              layer = tf.nn.max_pool(value=layer,
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                                     ksize=[1,2,2,1],
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                                     strides=[1,2,2,1],
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                                     padding="SAME")   # the kernel function size is 2x2,so the ksize=[1,2,2,1]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          layer = tf.nn.relu(layer)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return layer, weights

4、定义将卷积层展开的函数

     
     
       
       
       
       
        
        
        
        
      
      '''define a function to flat conv layer'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def flatten_layer(layer):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          '''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          @param layer: the conv layer
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          '''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          layer_shape = layer.get_shape() # get the shape of the layer(layer_shape == [num_images, img_height, img_width, num_channels])
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          num_features = layer_shape[1:4].num_elements()  # [1:4] means the last three demension, namely the flatten size
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          layer_flat = tf.reshape(layer, [-1, num_features])   # reshape to flat,-1 means don't care about the number of images
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return layer_flat, num_features

5、定义全连接层的函数

     
     
       
       
       
       
        
        
        
        
      
      '''define a function to do fully-connected'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def fc_layer(input, num_inputs, num_outputs, use_relu=True):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          '''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          @param input: the input
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          @param num_inputs: the input size
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          @param num_outputs: the output size
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          @param use_relu: if use relu activation function
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          '''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          weights = initialize_weights(shape=[num_inputs, num_outputs])
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          biases = initialize_biases(num_outputs)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          layer = tf.matmul(input, weights) + biases
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          if use_relu:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              layer = tf.nn.relu(layer)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return layer

6、定义模型

定义placeholder

     
     
       
       
       
       
        
        
        
        
      
      '''define the placeholder'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      X = tf.placeholder(tf.float32, shape=[None, img_flat_size], name="X")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      X_image = tf.reshape(X, shape=[-1, img_size, img_size, num_channels])  # reshape to the image shape
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_true = tf.placeholder(tf.float32, [None, num_classes], name="y_true")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_true_cls = tf.argmax(y_true, axis=1)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      keep_prob = tf.placeholder(tf.float32)  # drop out placeholder

定义卷积、dropout、和全连接

     
     
       
       
       
       
        
        
        
        
      
      '''define the cnn model'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      layer_conv1, weights_conv1 = conv_layer(input=X_image, num_input_channels=num_channels, 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                                             filter_size=filter_size1, 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                                             num_output_filters=num_filters1,
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                                             use_pooling=True)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print("conv1:",layer_conv1)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      layer_conv2, weights_conv2 = conv_layer(input=layer_conv1, num_input_channels=num_filters1, 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                                              filter_size=filter_size2,
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                                              num_output_filters=num_filters2,
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                                              use_pooling=True)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print("conv2:",layer_conv2)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      layer_flat, num_features = flatten_layer(layer_conv2) # the num_feature is 7x7x36=1764
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print("flatten layer:", layer_flat)  
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      layer_fc1 = fc_layer(layer_flat, num_features, fc_size, use_relu=True)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print("fully-connected layer1:", layer_fc1)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      layer_drop_out = tf.nn.dropout(layer_fc1, keep_prob)   # dropout operation
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      layer_fc2 = fc_layer(layer_drop_out, fc_size, num_classes,use_relu=False)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print("fully-connected layer2:", layer_fc2)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_pred = tf.nn.softmax(layer_fc2)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_pred_cls = tf.argmax(y_pred, axis=1)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      cross_entropy = tf.nn.softmax_cross_entropy_with_logits(labels=y_true, 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                                                             logits=layer_fc2)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      cost = tf.reduce_mean(cross_entropy)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      optimizer = tf.train.AdamOptimizer(learning_rate=1e-3).minimize(cost)  # use AdamOptimizer优化

定义求准确度

     
     
       
       
       
       
      
      ''
      
      'define accuracy'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      correct_prediction = tf.equal(y_true_cls, y_pred_cls)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      accuracy = tf.reduce_mean(tf.cast(correct_prediction,dtype=tf.float32))

7、定义训练的函数`optimize`，使用bgd

代码：

       
       
         
         
         
         
          
          
          
          
        
        '''define a function to run train the model with bgd'''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        total_iterations = 0  # record the total iterations
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        def optimize(num_iterations):
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            '''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            @param num_iterations: the total interations of train batch_size operation
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            '''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            global total_iterations
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            start_time = time.time()
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            for i in range(total_iterations,total_iterations + num_iterations):
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                x_batch, y_batch = data.train.next_batch(batch_size)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                feed_dict = {X: x_batch, y_true: y_batch, keep_prob: 0.5}
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                session.run(optimizer, feed_dict=feed_dict)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                if i % 10 == 0:
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                    acc = session.run(accuracy, feed_dict=feed_dict)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                    msg = "Optimization Iteration: {0:>6}, Training Accuracy: {1:>6.1%}"    # {:>6}means the fixed width,{1:>6.1%}means the fixed width is 6 and keep 1 decimal place         
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                    print(msg.format(i + 1, acc))
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            total_iterations += num_iterations
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            end_time = time.time()
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            time_dif = end_time-start_time
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            print("time usage:"+str(timedelta(seconds=int(round(time_dif)))))

输出：

     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      651, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      661, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      671, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      681, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      691, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      701, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      711, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      721, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      731, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      741, Training Accuracy: 
      
      100.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      751, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      761, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      771, Training Accuracy:  
      
      97.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      781, Training Accuracy:  
      
      96.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      791, Training Accuracy:  
      
      98.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      801, Training Accuracy: 
      
      100.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      811, Training Accuracy: 
      
      100.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      821, Training Accuracy:  
      
      97.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      831, Training Accuracy:  
      
      98.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      841, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      851, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      861, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      871, Training Accuracy:  
      
      96.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      881, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      891, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      901, Training Accuracy:  
      
      98.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      911, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      921, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      931, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      941, Training Accuracy:  
      
      98.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      951, Training Accuracy: 
      
      100.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      961, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      971, Training Accuracy:  
      
      98.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      981, Training Accuracy:  
      
      99.0%
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Optimization Iteration:    
      
      991, Training Accuracy: 
      
      100.0%
     
     
       
       
       
       
     
     
       
       
       
       
      
      time usage:
      
      0:
      
      07:
      
      07

8、定义批量预测的函数，方便输出训练错的图像

     
     
       
       
       
       
        
        
        
        
      
      batch_size_test = 256
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def print_test_accuracy(print_error=False,print_confusion_matrix=False):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          '''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          @param print_error: whether plot the error images
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          @param print_confusion_matrix: whether plot the confusion_matrix
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          '''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          num_test = len(data.test.images)   
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          cls_pred = np.zeros(shape=num_test, dtype=np.int)  # declare the cls_pred
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          i = 0
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          #predict the test set using batch_size
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          while i < num_test:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              j = min(i + batch_size_test, num_test)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              images = data.test.images[i:j,:]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              labels = data.test.labels[i:j,:]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              feed_dict = {X:images,y_true:labels,keep_prob:0.5}
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              cls_pred[i:j] = session.run(y_pred_cls,feed_dict=feed_dict)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              i = j
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          cls_true = data.test.cls
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          correct = (cls_true == cls_pred)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          correct_sum = correct.sum()   # correct predictions
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          acc = float(correct_sum)/num_test
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          msg = "Accuracy on Test-Set: {0:.1%} ({1} / {2})"
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          print(msg.format(acc, correct_sum, num_test))    
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          if print_error:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              plot_error_pred(cls_pred,correct)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          if print_confusion_matrix:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              plot_confusin_martrix(cls_pred)

9、定义可视化卷积核权重的函数

代码：

       
       
         
         
         
         
          
          
          
          
        
        '''define a function to plot conv weights'''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        def plot_conv_weights(weights,input_channel=0):
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            '''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            @param weights: the conv filter weights, for example: the weights_conv1 and weights_conv2, which are 4 dimension [filter_size, filter_size, num_input_channels, num_output_filters]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            @param input_channel: the input_channels
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            '''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            w = session.run(weights)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            w_min = np.min(w)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            w_max = np.max(w)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            num_filters = w.shape[3]   # get the number of filters
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            num_grids = math.ceil(math.sqrt(num_filters))
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            fig, axes = plt.subplots(num_grids, num_grids)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            for i, ax in enumerate(axes.flat):
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                if i < num_filters:
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                    img = w[:,:,input_channel,i]   # the ith weight
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                    ax.imshow(img,vmin=w_min,vmax=w_max,interpolation="nearest",cmap='seismic')
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                ax.set_xticks([])
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                ax.set_yticks([])
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            plt.show()

输出：
- 第一层：
- 第二层：
  
  10、定义可视化卷积层输出的函数

代码：

       
       
         
         
         
         
          
          
          
          
        
        '''define a function to plot conv output layer'''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        def plot_conv_layer(layer, image):
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            '''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            @param layer: the conv layer, which is also a image after conv
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            @param image: the image info
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            '''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            feed_dict = {X:[image]}
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            values = session.run(layer, feed_dict=feed_dict)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            num_filters = values.shape[3]   # get the number of filters
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            num_grids = math.ceil(math.sqrt(num_filters))
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            fig, axes = plt.subplots(num_grids,num_grids)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            for i, ax in enumerate(axes.flat):
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                if i < num_filters:
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                    img = values[0,:,:,i]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                    ax.imshow(img, interpolation="nearest",cmap="binary")
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                ax.set_xticks([])
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                ax.set_yticks([])
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            plt.show()

输出：
- 第一层：
- 第二层：

十一：使用prettytensor实现CNNModel

全部代码
使用MNIST数据集
加载数据，绘制9张图等函数与九一致，readme中不再写出
1、定义模型
定义placeholder,与之前的一致

     
     
       
       
       
       
      
      ''
      
      'declare the placeholder'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      X = tf.placeholder(tf
      
      .float32, [None, img_flat_size], name=
      
      "X")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      X_img = tf.reshape(X, shape=[-
      
      1,img_size,img_size, num_channels])
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_true = tf.placeholder(tf
      
      .float32, shape=[None, num_classes], name=
      
      "y_true")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_true_cls = tf.argmax(y_true,
      
      1)

使用prettytensor实现CNN模型

     
     
       
       
       
       
        
        
        
        
      
      '''define the cnn model with prettytensor'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      x_pretty = pt.wrap(X_img)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      with pt.defaults_scope():   # or pt.defaults_scope(activation_fn=tf.nn.relu) if just use one activation function
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          y_pred, loss = x_pretty.\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              conv2d(kernel=5, depth=16, activation_fn=tf.nn.relu, name="conv_layer1").\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              max_pool(kernel=2, stride=2).\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              conv2d(kernel=5, depth=36, activation_fn=tf.nn.relu, name="conv_layer2").\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              max_pool(kernel=2, stride=2).\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              flatten().\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              fully_connected(size=128, activation_fn=tf.nn.relu, name="fc_layer1").\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              softmax_classifier(num_classes=num_classes, labels=y_true)

获取卷积核的权重(后续可视化)

     
     
       
       
       
       
        
        
        
        
      
      '''define a function to get weights'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def get_weights_variable(layer_name):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          with tf.variable_scope(layer_name, reuse=True):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              variable = tf.get_variable("weights")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return variable
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      conv1_weights = get_weights_variable("conv_layer1")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      conv2_weights = get_weights_variable("conv_layer2")

定义optimizer训练，和之前的一样了

     
     
       
       
       
       
      
      ''
      
      'define optimizer to train'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      optimizer = tf
      
      .train
      
      .AdamOptimizer().minimize(loss)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_pred_cls = tf.argmax(y_pred,
      
      1)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      correct_prediction = tf.equal(y_pred_cls, y_true_cls)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      session = tf.Session()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      session.run(tf.global_variables_initializer())

十二：CNN,保存和加载模型，使用Early Stopping

全部代码
使用MNIST数据集
加载数据，绘制9张图等函数与九一致，readme中不再写出
CNN模型的定义和十一中的一致，readme中不再写出
1、保存模型
创建saver,和保存的目录

     
     
       
       
       
       
      
      ''
      
      'define a Saver to save the network'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      saver = tf
      
      .train
      
      .Saver()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      save_dir = 
      
      "checkpoints/"
     
     
       
       
       
       
     
     
       
       
       
       
      
      if not os
      
      .path
      
      .exists(save_dir):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          os.makedirs(save_dir)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      save_path = os
      
      .path
      
      .join(save_dir, 
      
      'best_validation')

保存session,对应到下面2中的Early Stopping，将最好的模型保存

1	saver.save(sess=session, save_path=save_path)

2、Early Stopping

     
     
       
       
       
       
        
        
        
        
      
      '''declear the train info'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      train_batch_size = 64
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      best_validation_accuracy = 0.0
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      last_improvement = 0
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      require_improvement_iterations = 1000
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      total_iterations = 0
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      '''define a function to optimize the optimizer'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def optimize(num_iterations):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          global total_iterations
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          global best_validation_accuracy
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          global last_improvement
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          start_time = time.time()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          for i in range(num_iterations):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              total_iterations += 1
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              X_batch, y_true_batch = data.train.next_batch(train_batch_size)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              feed_dict_train = {X: X_batch,
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                           y_true: y_true_batch}
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              session.run(optimizer, feed_dict=feed_dict_train)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              if (total_iterations%100 == 0) or (i == num_iterations-1):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  acc_train = session.run(accuracy, feed_dict=feed_dict_train)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  acc_validation, _ = validation_accuracy()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  if acc_validation > best_validation_accuracy:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                      best_validation_accuracy = acc_validation
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                      last_improvement = total_iterations
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                      saver.save(sess=session, save_path=save_path)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                      improved_str = "*"
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  else:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                      improved_str = ""
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  msg = "Iter: {0:>6}, Train_batch accuracy:{1:>6.1%}, validation acc:{2:>6.1%} {3}"
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  print(msg.format(i+1, acc_train, acc_validation, improved_str))
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              if total_iterations-last_improvement > require_improvement_iterations:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  print('No improvement found in a while, stop running')
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  break
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          end_time = time.time()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          time_diff = end_time-start_time
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          print("Time usage:" + str(timedelta(seconds=int(round(time_diff)))))

调用optimize(10000)输出信息

     
     
       
       
       
       
        
        
        
        
      
      Iter:   5100, Train_batch accuracy:100.0%, validation acc: 98.8% *
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   5200, Train_batch accuracy:100.0%, validation acc: 98.3% 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   5300, Train_batch accuracy:100.0%, validation acc: 98.7% 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   5400, Train_batch accuracy: 98.4%, validation acc: 98.6% 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   5500, Train_batch accuracy: 98.4%, validation acc: 98.6% 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   5600, Train_batch accuracy:100.0%, validation acc: 98.7% 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   5700, Train_batch accuracy: 96.9%, validation acc: 98.9% *
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   5800, Train_batch accuracy:100.0%, validation acc: 98.6% 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   5900, Train_batch accuracy:100.0%, validation acc: 98.6% 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   6000, Train_batch accuracy: 98.4%, validation acc: 98.7% 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   6100, Train_batch accuracy:100.0%, validation acc: 98.7% 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   6200, Train_batch accuracy:100.0%, validation acc: 98.7% 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   6300, Train_batch accuracy: 98.4%, validation acc: 98.8% 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   6400, Train_batch accuracy: 98.4%, validation acc: 98.8% 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   6500, Train_batch accuracy:100.0%, validation acc: 98.7% 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   6600, Train_batch accuracy:100.0%, validation acc: 98.7% 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Iter:   6700, Train_batch accuracy:100.0%, validation acc: 98.8% 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      No improvement found in a while, stop running
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      Time usage:0:18:43

可以看到最后10次输出（每100次输出一次）在验证集上准确度都没有提高，停止执行

3、小批量预测并计算准确率

因为需要预测测试集和验证集，这里参数指定需要的images

       
       
         
         
         
         
          
          
          
          
        
        '''define a function to predict using batch'''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        batch_size_predict = 256
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        def predict_cls(images, labels, cls_true):
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            num_images = len(images)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            cls_pred = np.zeros(shape=num_images, dtype=np.int)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            i = 0
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            while i < num_images:
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                j = min(i+batch_size_predict, num_images)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                feed_dict = {X: images[i:j,:],
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                             y_true: labels[i:j,:]}
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                cls_pred[i:j] = session.run(y_pred_cls, feed_dict=feed_dict)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                i = j
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            correct = (cls_true==cls_pred)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            return correct, cls_pred

测试集和验证集直接调用即可

     
     
       
       
       
       
        
        
        
        
      
      def predict_cls_test():
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return predict_cls(data.test.images, data.test.labels, data.test.cls)
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def predict_cls_validation():
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return predict_cls(data.validation.images, data.validation.labels, data.validation.cls)

计算验证集准确率（上面optimize函数中需要用到）

     
     
       
       
       
       
        
        
        
        
      
      '''calculate the acc'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def cls_accuracy(correct):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          correct_sum = correct.sum()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          acc = float(correct_sum)/len(correct)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return acc, correct_sum
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      '''define a function to calculate the validation acc'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def validation_accuracy():
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          correct, _ = predict_cls_validation()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return cls_accuracy(correct)

计算测试集准确率，并且输出错误的预测和confusion matrix

     
     
       
       
       
       
        
        
        
        
      
      '''define a function to calculate test acc'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def print_test_accuracy(show_example_errors=False,
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                              show_confusion_matrix=False):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          correct, cls_pred = predict_cls_test()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          acc, num_correct = cls_accuracy(correct)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          num_images = len(correct)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          msg = "Accuracy on Test-Set: {0:.1%} ({1} / {2})"
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          print(msg.format(acc, num_correct, num_images))
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          # Plot some examples of mis-classifications, if desired.
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          if show_example_errors:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              print("Example errors:")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              plot_example_errors(cls_pred=cls_pred, correct=correct)
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          # Plot the confusion matrix, if desired.
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          if show_confusion_matrix:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              print("Confusion Matrix:")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              plot_confusion_matrix(cls_pred=cls_pred)

十二：模型融合

全部代码
使用MNIST数据集
一些方法和之前的一致，不在给出
其中训练了多个CNN 模型，然后取预测的平均值作为最后的预测结果
1、将测试集和验证集合并后，并重新划分

主要是希望训练时数据集有些变换，否则都是一样的数据去训练了，最后再融合意义不大

       
       
         
         
         
         
          
          
          
          
        
        '''将training set和validation set合并，并重新划分'''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        combine_images = np.concatenate([data.train.images, data.validation.images], axis=0)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        combine_labels = np.concatenate([data.train.labels, data.validation.labels], axis=0)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        print("合并后图片：", combine_images.shape)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        print("合并后label：", combine_labels.shape)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        combined_size = combine_labels.shape[0]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        train_size = int(0.8*combined_size)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        validation_size = combined_size - train_size
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        '''函数：将合并后的重新随机划分'''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        def random_training_set():
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            idx = np.random.permutation(combined_size)   # 将0-combined_size数字随机排列
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            idx_train = idx[0:train_size]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            idx_validation = idx[train_size:]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            x_train = combine_images[idx_train, :]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            y_train = combine_labels[idx_train, :]
       
       
         
         
         
         
       
       
         
         
         
             
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            x_validation = combine_images[idx_validation, :]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            y_validation = combine_images[idx_validation, :]
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            return x_train, y_train, x_validation, y_validation

2、融合模型

加载训练好的模型，并输出每个模型在测试集的预测结果等

       
       
         
         
         
         
          
          
          
          
        
        def ensemble_predictions():
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            pred_labels = []
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            test_accuracies = []
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            validation_accuracies = []
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            for i in range(num_networks):
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                saver.restore(sess=session, save_path=get_save_path(i))
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                test_acc = test_accuracy()
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                test_accuracies.append(test_acc)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                validation_acc = validation_accuracy()
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                validation_accuracies.append(validation_acc)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                msg = "网络：{0}，验证集：{1:.4f}，测试集{2:.4f}"
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                print(msg.format(i, validation_acc, test_acc))
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                pred = predict_labels(data.test.images)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                pred_labels.append(pred)
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
            return np.array(pred_labels),\
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                   np.array(test_accuracies),\
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
                   np.array(validation_accuracies)

调用pred_labels, test_accuracies, val_accuracies = ensemble_predictions()
取均值：ensemble_pred_labels = np.mean(pred_labels, axis=0)
融合后的真实结果：ensemble_cls_pred = np.argmax(ensemble_pred_labels, axis=1)
其他一些信息：

     
     
       
       
       
       
        
        
        
        
      
      ensemble_correct = (ensemble_cls_pred == data.test.cls)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ensemble_incorrect = np.logical_not(ensemble_correct)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print(test_accuracies)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      best_net = np.argmax(test_accuracies)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print(best_net)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print(test_accuracies[best_net])
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      best_net_pred_labels = pred_labels[best_net, :, :]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      best_net_cls_pred = np.argmax(best_net_pred_labels, axis=1)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      best_net_correct = (best_net_cls_pred == data.test.cls)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      best_net_incorrect = np.logical_not(best_net_correct)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print("融合后预测对的：", np.sum(ensemble_correct))
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print("单个最好模型预测对的", np.sum(best_net_correct))
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      ensemble_better = np.logical_and(best_net_incorrect, ensemble_correct)  # 融合之后好于单个的个数
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print(ensemble_better.sum())
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      best_net_better = np.logical_and(best_net_correct, ensemble_incorrect)  # 单个好于融合之后的个数
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print(best_net_better.sum())

十二：Cifar-10数据集，使用`variable_scope`重复使用变量

全部代码
使用CIFAR-10数据集
创建了两个网络，一个用于训练，一个用于测试，测试使用的是训练好的权重参数，所以用到参数重用
网络结构

1、数据集

导入包：

这是别人实现好的下载和处理cifar-10数据集的diamante

1 2	import cifar10 from cifar10 import img_size, num_channels, num_classes

输出一些数据集信息

     
     
       
       
       
       
      
      ''
      
      '下载cifar10数据集, 大概163M'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      cifar10.maybe_download_and_extract()
     
     
       
       
       
       
     
     
       
       
       
       
      
      ''
      
      '加载数据集'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      images_train, cls_train, labels_train = cifar10.load_training_data()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      images_test,  cls_test,  labels_test  = cifar10.load_test_data()
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
      
      ''
      
      '打印一些信息'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      class_names = cifar10.load_class_names()
     
     
       
       
       
       
     
     
       
       
       
       
      
      print(class_names)
     
     
       
       
       
       
     
     
       
       
       
       
      
      print("Size of:")
     
     
       
       
       
       
     
     
       
       
       
       
      
      print("training set:\t\t{}".format(len(images_train)))
     
     
       
       
       
       
     
     
       
       
       
       
      
      print("test set:\t\t\t{}".format(len(images_test)))

显示9张图片函数
- 相比之前的，加入了smooth

     
     
       
       
       
       
        
        
        
        
      
      '''显示9张图片函数'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def plot_images(images, cls_true, cls_pred=None, smooth=True):   # smooth是否平滑显示
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          assert len(images) == len(cls_true) == 9
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          fig, axes = plt.subplots(3,3)
     
     
       
       
       
       
     
     
       
       
       
           
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          for i, ax in enumerate(axes.flat):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              if smooth:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  interpolation = 'spline16'
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              else:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  interpolation = 'nearest'
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              ax.imshow(images[i, :, :, :], interpolation=interpolation)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              cls_true_name = class_names[cls_true[i]]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              if cls_pred is None:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  xlabel = "True:{0}".format(cls_true_name)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              else:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  cls_pred_name = class_names[cls_pred[i]]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  xlabel = "True:{0}, Pred:{1}".format(cls_true_name, cls_pred_name)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              ax.set_xlabel(xlabel)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              ax.set_xticks([])
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              ax.set_yticks([])
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          plt.show()

2、定义`placeholder`

     
     
       
       
       
       
        
        
        
        
      
      X = tf.placeholder(tf
      
      .float32, shape=[None, img_size, img_size, num_channels], name=
      
      "X")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_true = tf.placeholder(tf
      
      .float32, shape=[None, num_classes], name=
      
      "y")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_true_cls = tf.argmax(y_true, axis=
      
      1)

3、图片处理

单张图片处理

原图是32*32像素的，裁剪成24*24像素的
如果是训练集进行一些裁剪，翻转，饱和度等处理
如果是测试集，只进行简单的裁剪处理

这也是为什么使用variable_scope定义两个网络

         
         
           
           
           
           
            
            
            
            
          
          '''单个图片预处理, 测试集只需要裁剪就行了'''
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          def pre_process_image(image, training):
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
              if training:
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  image = tf.random_crop(image, size=[img_size_cropped, img_size_cropped, num_channels])  # 裁剪
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  image = tf.image.random_flip_left_right(image)                  # 左右翻转
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  image = tf.image.random_hue(image, max_delta=0.05)              # 色调调整
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  image = tf.image.random_brightness(image, max_delta=0.2)        # 曝光
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  image = tf.image.random_saturation(image, lower=0.0, upper=2.0) # 饱和度
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  '''上面的调整可能pixel值超过[0, 1], 所以约束一下'''        
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  image = tf.minimum(image, 1.0)
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  image = tf.maximum(image, 0.0)
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
              else:
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  image = tf.image.resize_image_with_crop_or_pad(image, target_height=img_size_cropped, 
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                                                        target_width=img_size_cropped)
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
              return image

多张图片处理

因为训练和测试是都是使用batch的方式
调用上面处理单张图片的函数

tf.map_fn(fn, elems)函数，前面一般是lambda函数，后面是所有的数据

         
         
           
           
           
           
            
            
            
            
          
          '''调用上面的函数，处理多个图片images'''
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          def pre_process(images, training):
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
              images = tf.map_fn(lambda image: pre_process_image(image, training), images)   # tf.map_fn()使用lambda函数
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
              return images

4、定义tensorflow计算图

定义主网络图
- 使用prettytensor
- 分为training和test两个阶段

     
     
       
       
       
       
        
        
        
        
      
      '''定义主网络函数'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def main_network(images, training):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          x_pretty = pt.wrap(images)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          if training:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              phase = pt.Phase.train
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          else:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              phase = pt.Phase.infer
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          with pt.defaults_scope(activation_fn=tf.nn.relu, phase=phase):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              y_pred, loss = x_pretty.\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              conv2d(kernel=5, depth=64, name="layer_conv1", batch_normalize=True).\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              max_pool(kernel=2, stride=2).\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              conv2d(kernel=5, depth=64, name="layer_conv2").\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              max_pool(kernel=2, stride=2).\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              flatten().\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              fully_connected(size=256, name="layer_fc1").\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              fully_connected(size=128, name="layer_fc2").\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              softmax_classifier(num_classes, labels=y_true)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return y_pred, loss

创建所有网络，包含预处理图片和主网络

需要使用variable_scope, 测试阶段需要reuse训练阶段的参数

         
         
           
           
           
           
            
            
            
            
          
          '''创建所有网络, 包含预处理和主网络，'''
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          def create_network(training):
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
              # 使用variable_scope可以重复使用定义的变量，训练时创建新的，测试时重复使用
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
              with tf.variable_scope("network", reuse=not training):
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  images = X
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  images = pre_process(images=images, training=training)
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  y_pred, loss = main_network(images=images, training=training)
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
              return y_pred, loss

创建训练阶段网络

定义一个global_step记录训练的次数，下面会将其保存到checkpoint,trainable为False就不会训练改变

         
         
           
           
           
           
            
            
            
            
          
          '''训练阶段网络创建'''
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          global_step = tf.Variable(initial_value=0, 
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                                    name="global_step",
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                                    trainable=False) # trainable 在训练阶段不会改变
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          _, loss = create_network(training=True)
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          optimizer = tf.train.AdamOptimizer(learning_rate=0.0001).minimize(loss, global_step)

定义测试阶段网络
- 同时定义准确率

     
     
       
       
       
       
      
      ''
      
      '测试阶段网络创建'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_pred, _ = create_network(training=False)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_pred_cls = tf.argmax(y_pred, dimension=
      
      1)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      correct_prediction = tf.equal(y_pred_cls, y_true_cls)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))

5、获取权重和每层的输出值信息

获取权重变量

     
     
       
       
       
       
        
        
        
        
      
      def get_weights_variable(layer_name):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          with tf.variable_scope("network/" + layer_name, reuse=True):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              variable = tf.get_variable("weights")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return variable 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      weights_conv1 = get_weights_variable("layer_conv1")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      weights_conv2 = get_weights_variable("layer_conv2")

获取每层的输出变量

     
     
       
       
       
       
        
        
        
        
      
      def get_layer_output(layer_name):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          tensor_name = "network/" + layer_name + "/Relu:0"
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          tensor = tf.get_default_graph().get_tensor_by_name(tensor_name)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return tensor
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      output_conv1 = get_layer_output("layer_conv1")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      output_conv2 = get_layer_output("layer_conv2")

6、保存和加载计算图参数

因为第一次不会加载，所以放到try中判断

     
     
       
       
       
       
      
      ''
      
      '执行tensorflow graph'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      session = tf.Session()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      save_dir = 
      
      "checkpoints/"
     
     
       
       
       
       
     
     
       
       
       
       
      
      if not os
      
      .path
      
      .exists(save_dir):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          os.makedirs(save_dir)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      save_path = os
      
      .path
      
      .join(save_dir, 
      
      'cifat10_cnn')
     
     
       
       
       
       
     
     
       
       
       
       
     
     
       
       
       
       
      
      ''
      
      '尝试存储最新的checkpoint, 可能会失败，比如第一次运行checkpoint不存在等'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      try:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          print(
      
      "开始存储最新的存储...")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          last_chk_path = tf
      
      .train
      
      .latest_checkpoint(save_dir)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          saver.restore(session, save_path=last_chk_path)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          print(
      
      "存储点来自：", last_chk_path)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      except:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          print(
      
      "存储错误, 初始化变量")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          session.run(tf.global_variables_initializer())

7、训练

获取batch

     
     
       
       
       
       
        
        
        
        
      
      '''SGD'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      train_batch_size = 64
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def random_batch():
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          num_images = len(images_train)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          idx = np.random.choice(num_images, size=train_batch_size, replace=False)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          x_batch = images_train[idx, :, :, :]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          y_batch = labels_train[idx, :]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return x_batch, y_batch

训练网络

每1000次保存一下checkpoint

因为上面会restored已经保存训练的网络，同时也保存了训练的次数，所以可以接着训练

         
         
           
           
           
           
            
            
            
            
          
          def optimize(num_iterations):
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
              start_time = time.time()
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
              for i in range(num_iterations):
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  x_batch, y_batch = random_batch()
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  feed_dict_train = {X: x_batch, y_true: y_batch}
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  i_global, _ = session.run([global_step, optimizer], feed_dict=feed_dict_train)
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  if (i_global%100==0) or (i == num_iterations-1):
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                      batch_acc = session.run(accuracy, feed_dict=feed_dict_train)
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                      msg = "global step: {0:>6}, training batch accuracy: {1:>6.1%}"
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                      print(msg.format(i_global, batch_acc))
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                  if(i_global%1000==0) or (i==num_iterations-1):
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                      saver.save(session, save_path=save_path,
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                                 global_step=global_step)
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                      print("保存checkpoint")
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
              end_time = time.time()
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
              time_diff = end_time-start_time
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
              print("耗时：", str(timedelta(seconds=int(round(time_diff)))))

十三、Inception model (GoogleNet)

全部代码
使用训练好的inception model,因为模型很复杂，一般的电脑运行不起来的。
网络结构

1、下载和加载inception model

因为是预训练好的模型，所以无需我们定义结构了

导入包

这里 inception是别人实现好的下载的代码

         
         
           
           
           
           
            
            
            
            
          
          import numpy as np
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          import tensorflow as tf
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          from matplotlib import pyplot as plt
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          import inception # 第三方类加载inception model
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          import os

下载和加载模型

       
       
         
         
         
         
        
        ''
        
        '下载和加载inception model'
        
        ''
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        inception.maybe_download()
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        model = inception.Inception()

预测和显示图片函数

     
     
       
       
       
       
        
        
        
        
      
      '''预测和显示图片'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def classify(image_path):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          plt.imshow(plt.imread(image_path))
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          plt.show()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          pred = model.classify(image_path=image_path)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          model.print_scores(pred=pred, k=10, only_first_name=True)

显示调整后的图片
- 因为 inception model要求输入图片为 299*299 像素的，所以它会resize成这个大小然后作为输入

     
     
       
       
       
       
        
        
        
        
      
      '''显示处理后图片的样式'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def plot_resized_image(image_path):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          resized_image = model.get_resized_image(image_path)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          plt.imshow(resized_image, interpolation='nearest')
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          plt.show()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      plot_resized_image(image_path)

十四、迁移学习 Transfer Learning

全部代码
网络结构还是使用上一节的inception model, 去掉最后的全连接层，然后重新构建全连接层进行训练
- 因为inception model 是训练好的，前面的卷积层用于捕捉特征, 而后面的全连接层可用于分类，所以我们训练全连接层即可
因为要计算每张图片的transfer values,所以使用一个cache缓存transfer-values，第一次计算完成后，后面重新运行直接读取存储的结果，这样比较节省时间
- transfer values是inception model在Softmax层前一层的值
- cifar-10数据集, 我放在实验室电脑上运行了几个小时才得到transfer values，还是比较慢的
总之最后相当于训练下面的神经网络，对应的 transfer-values作为输入

1、准备工作

导入包

       
       
         
         
         
         
          
          
          
          
        
        import numpy as np
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        import tensorflow as tf
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        import prettytensor as pt
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        from matplotlib import pyplot as plt
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        import time
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        from datetime import timedelta
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        import os
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        import inception   # 第三方下载inception model的代码
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        from inception import transfer_values_cache  # cache
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        import cifar10     # 也是第三方的库，下载cifar-10数据集
       
       
         
         
         
         
       
       
         
         
         
         
          
          
          
          
        
        from cifar10 import num_classes

下载cifar-10数据集

     
     
       
       
       
       
      
      ''
      
      '下载cifar-10数据集'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      cifar10.maybe_download_and_extract()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      class_names = cifar10.load_class_names()
     
     
       
       
       
       
     
     
       
       
       
       
      
      print("所有类别是：",class_names)
     
     
       
       
       
       
     
     
       
       
       
       
      
      ''
      
      '训练和测试集'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      images_train, cls_train, labels_train = cifar10.load_training_data()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      images_test,  cls_test,  labels_test  = cifar10.load_test_data()

下载和加载inception model

     
     
       
       
       
       
      
      ''
      
      '下载inception model'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      inception.maybe_download()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      model = inception.Inception()

计算cifar-10训练集和测试集在inception model上的transfer values

因为计算非常耗时，这里第一次运行存储到本地，以后再运行直接读取即可

transfer values的shape是(dataset size, 2048)，因为是softmax层的前一层

         
         
           
           
           
           
            
            
            
            
          
          '''训练和测试的cache的路径'''
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          file_path_cache_train = os.path.join(cifar10.data_path, 'inception_cifar10_train.pkl')
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          file_path_cache_test = os.path.join(cifar10.data_path, 'inception_cifar10_test.pkl')
         
         
           
           
           
           
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          print('处理训练集上的transfer-values.......... ')
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          image_scaled = images_train * 255.0  # cifar-10的pixel是0-1的, shape=(50000, 32, 32, 3)
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          transfer_values_train = transfer_values_cache(cache_path=file_path_cache_train,
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                                                        images=image_scaled, 
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                                                        model=model)  # shape=(50000, 2048)
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          print('处理测试集上的transfer-values.......... ')
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          images_scaled = images_test * 255.0
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          transfer_values_test = transfer_values_cache(cache_path=file_path_cache_test,
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                                                       model=model,
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
                                                       images=images_scaled)
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          print("transfer_values_train: ",transfer_values_train.shape)
         
         
           
           
           
           
         
         
           
           
           
           
            
            
            
            
          
          print("transfer_values_test: ",transfer_values_test.shape)

可视化一张图片对应的transfer values

     
     
       
       
       
       
        
        
        
        
      
      '''显示transfer values'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def plot_transfer_values(i):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          print("输入图片：")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          plt.imshow(images_test[i], interpolation='nearest')
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          plt.show()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          print('transfer values --> 此图片在inception model上')
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          img = transfer_values_test[i]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          img = img.reshape((32, 64))
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          plt.imshow(img, interpolation='nearest', cmap='Reds')
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          plt.show()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      plot_transfer_values(16)

2、分析`transfer values`

(1) 使用PCA主成分分析

将数据降到2维，可视化，因为transfer values是已经捕捉到的特征，所以可视化应该是可以隐约看到不同类别的数据是有区别的
取3000个数据观察（因为PCA也是比较耗时的）

     
     
       
       
       
       
        
        
        
        
      
      '''使用PCA分析transfer values'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      from sklearn.decomposition import PCA
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      pca = PCA(n_components=2)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      transfer_values = transfer_values_train[0:3000]  # 取3000个，大的话计算量太大
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      cls = cls_train[0:3000]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print(transfer_values.shape)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      transfer_values_reduced = pca.fit_transform(transfer_values)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      print(transfer_values_reduced.shape)

可视化降维后的数据

     
     
       
       
       
       
        
        
        
        
      
      ## 显示降维后的transfer values
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def plot_scatter(values, cls):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          from matplotlib import cm as cm
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          cmap = cm.rainbow(np.linspace(0.0, 1.0, num_classes))
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          colors = cmap[cls]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          x = values[:, 0]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          y = values[:, 1]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          plt.scatter(x, y, color=colors)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          plt.show()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      plot_scatter(transfer_values_reduced, cls)

(2) 使用TSNE主成分分析

因为t-SNE运行非常慢，所以这里先用PCA将到50维

     
     
       
       
       
       
        
        
        
        
      
      from sklearn
      
      .manifold import TSNE
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      pca = PCA(n_components=
      
      50)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      transfer_values_50d = pca.fit_transform(transfer_values)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      tsne = TSNE(n_components=
      
      2)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      transfer_values_reduced = tsne.fit_transform(transfer_values_50d)
     
     
       
       
       
       
     
     
       
       
       
       
      
      print("最终降维后：", transfer_values_reduced.shape)
     
     
       
       
       
       
     
     
       
       
       
       
      
      plot_scatter(transfer_values_reduced, cls)

数据区分还是比较明显的

3、创建我们自己的网络

使用prettytensor创建一个全连接层，使用softmax作为分类

     
     
       
       
       
       
        
        
        
        
      
      '''创建网络'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      transfer_len = model.transfer_len   # 获取transfer values的大小，这里是2048
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      x = tf.placeholder(tf.float32, shape=[None, transfer_len], name="x")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_true = tf.placeholder(tf.float32, shape=[None, num_classes], name="y")
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_true_cls = tf.argmax(y_true, axis=1)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      x_pretty = pt.wrap(x)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      with pt.defaults_scope(activation_fn=tf.nn.relu):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          y_pred, loss = x_pretty.\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              fully_connected(1024, name="layer_fc1").\
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              softmax_classifier(num_classes, labels=y_true)

优化器

     
     
       
       
       
       
      
      ''
      
      '优化器'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      global_step = tf.Variable(initial_value=
      
      0, name=
      
      "global_step", trainable=False)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      optimizer = tf
      
      .train
      
      .AdamOptimizer(
      
      0.0001).minimize(loss, global_step)

准确度

     
     
       
       
       
       
      
      ''
      
      'accuracy'
      
      ''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      y_pred_cls = tf.argmax(y_pred, axis=
      
      1)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      correct_prediction = tf.equal(y_pred_cls, y_true_cls)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))

SGD训练

     
     
       
       
       
       
        
        
        
        
      
      '''SGD 训练'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      session = tf.Session()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      session.run(tf.initialize_all_variables())
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      train_batch_size = 64
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def random_batch():
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          num_images = len(images_train)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          idx = np.random.choice(num_images, 
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                                 size=train_batch_size,
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                                 replace=False)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          x_batch = transfer_values_train[idx]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          y_batch = labels_train[idx]
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return x_batch, y_batch
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def optimize(num_iterations):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          start_time = time.time()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          for i in range(num_iterations):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              x_batch, y_true_batch = random_batch()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              feed_dict_train = {x: x_batch,
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                                 y_true: y_true_batch}
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              i_global, _ = session.run([global_step, optimizer], feed_dict=feed_dict_train)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              if (i_global % 100 == 0) or (i==num_iterations-1):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  batch_acc = session.run(accuracy, feed_dict=feed_dict_train)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  msg = "Global Step: {0:>6}, Training Batch Accuracy: {1:>6.1%}"
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                  print(msg.format(i_global, batch_acc))            
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          end_time = time.time()
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          time_diff = end_time - start_time
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          print("耗时：", str(timedelta(seconds=int(round(time_diff)))))

使用batch size预测测试集数据

     
     
       
       
       
       
        
        
        
        
      
      '''batch 预测'''
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      batch_size = 256
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
      def predict_cls(transfer_values, labels, cls_true):
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          num_images = len(images_test)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          cls_pred = np.zeros(shape=num_images, dtype=np.int)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          i = 0
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          while i < num_images:
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              j = min(i + batch_size, num_images)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              feed_dict = {x: transfer_values[i:j],
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
                           y_true: labels[i:j]}
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              cls_pred[i:j] = session.run(y_pred_cls, feed_dict=feed_dict)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
              i = j
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          correct = (cls_true == cls_pred)
     
     
       
       
       
       
     
     
       
       
       
       
        
        
        
        
      
          return correct, cls_pred

原文地址： http://lawlite.me/2016/12/08/Tensorflow%E5%AD%A6%E4%B9%A0/#more

你可能感兴趣的:(TensorFlow,Deep,Learning,Deep,Learning,Tensorflow)

利用 HAI 平台进行 DeepSeek 模型训练的详细指南
摘要本文旨在为非专业用户提供在HAI平台上进行DeepSeek模型训练的详细步骤。从创建项目、上传数据集、配置训练参数到启动训练任务并监控训练过程，本文将逐步指导用户完成整个流程。此外，本文还包含可运行的示例代码模块和相关章节配图，以帮助用户更好地理解和操作。引言HAI（HyperAI）平台是一个强大的AI模型训练平台，但对于非专业用户来说，其复杂性可能会成为使用的障碍。本文将详细介绍如何在HAI
DeepSeek的实际应用场景：AI技术如何赋能多领域创新 2501_91189350 人工智能
DeepSeek作为新一代智能技术平台，凭借其强大的算法能力和灵活的部署方式，正在多个行业掀起效率革命。本文将从真实案例出发，解析DeepSeek在不同场景中的落地应用。‌场景一：金融风控建模‌在信贷风险评估领域，传统模型存在数据维度单一、更新滞后等问题。某银行引入DeepSeek的‌动态特征工程模块‌，通过实时整合用户行为数据、社交网络信息等100+维度特征，成功将坏账识别准确率提升至98.5%
DeepSeek爆火，背后模型竟藏着这些秘密！ qq_23519469 ai
DeepSeek是什么来头最近，AI圈可是被一个名字刷爆了屏，那就是DeepSeek！它就像一颗横空出世的超级新星，在全球范围掀起了一阵狂热的追捧潮，这热度，简直了！大家都在疯狂讨论它，各种测评、对比层出不穷。它到底有啥过人之处，能让这么多人都为之疯狂？今天咱就来好好唠唠。DeepSeek，全称杭州深度求索人工智能基础技术研究有限公司，是一家专注于开发先进大语言模型（LLM）和相关技术的企业。它成
Ai时代初期全球不同纬度的层级辐射现象龙胥伯人工智能
基于最新研究成果与行业动态，AI时代的"层级辐射"现象可被科学解构为以下六大维度，结合技术演进、产业实践和社会影响进行系统性分析：一、技术能力的层级跃迁模型效率革命DeepSeek研发的R1-Zero模型通过动态架构设计，将样本利用率提升40%以上，训练周期大幅缩短。这种技术突破推动AI从实验室走向规模化应用，在智能制造、生物医药等领域催生新生态。大语言模型的训练方式（预训练→多任务学习→强化学习
使用Deepseek书写一篇综述论文，如何提示？学术乙方小知识经验分享
使用DeepSeek撰写综述论文时，可以通过以下提示和步骤来高效完成任务：明确研究主题与范围在开始撰写之前，首先需要明确研究主题、文献综述的时间跨度、地理范畴和文献类型。这有助于聚焦研究方向，避免偏离主题。制定详细的提示词提示词的设计是高效利用DeepSeek的关键。可以参考以下模板：研究背景与现状：请帮我梳理XXX领域的研究背景与现状，包括国内外的主要研究成果和研究热点。文献筛选与阅读：请帮我筛
程序员集体失业？DeepSeek这6个反常识用法竟能替代写代码后端
上周三凌晨两点，我盯着满屏报错的SpringBoot项目抓耳挠腮时，无意间在GitHubtrending榜发现了个宝藏项目。这个让3000+程序员连夜改简历的AI工具，居然把我的烂代码变成了性能提升40%的优雅实现——这可不是什么天方夜谭，而是我亲身经历的DeepSeek实战故事。你可能不信，现在用自然语言描述需求就能生成可运行代码。就像上周我接到个紧急任务：要在三天内完成电商平台的优惠券系统。当
国产信创AI IDE：开启智能编程新时代 InsCode AI IDE
国产信创AIIDE：开启智能编程新时代随着信息技术的迅猛发展，软件开发工具也在不断演进。近年来，人工智能（AI）技术的应用为编程工具带来了革命性的变化。其中，国产信创AIIDE——InsCodeAIIDE，作为一款由CSDN、GitCode和华为云CodeArtsIDE联合开发的新一代集成开发环境（IDE），以其智能化、高效化的特点，正在引领智能编程的新时代。最新接入DeepSeek-V3模型，点
人工智能（AI）系统化学习路线 xiaoyu❅ python 人工智能学习
一、为什么需要系统化学习AI？人工智能技术正在重塑各行各业，但许多初学者容易陷入误区：❌盲目跟风：直接学习TensorFlow/PyTorch，忽视数学与算法基础。❌纸上谈兵：只看理论不写代码，无法解决实际问题。❌方向模糊：对CV/NLP/RL等细分领域缺乏认知，难以针对性提升。正确的学习姿势：“金字塔式”分层学习（理论→算法→框架→应用→工程化），逐步构建完整的AI知识体系。二、人工智能学习路线
DeepSeek私有化部署搭建、本地知识库、可联网查询RAG检索增强生成 TonyH2002 DeepSeek 本地部署私有化搭建联网查询
一、如何私有化部署DeepSeek如何部署DeepSeek，具体可参考以下内容：喂饭式教程-腾讯云轻量服务器部署DeepSeek：https://cloud.tencent.com/developer/article/2494571喂饭式教程-腾讯云HAI服务部署DeepSeek：https://cloud.tencent.com/developer/article/2495288喂饭式教程-腾讯
使用TensorFlow、OpenCV和Pygame实现图像处理与游戏开发 UwoiGit tensorflow opencv pygame
在本篇文章中，我们将介绍如何结合使用TensorFlow、OpenCV和Pygame来进行图像处理和游戏开发。这三个工具在机器学习、计算机视觉和游戏开发领域都非常流行，并且它们的结合可以提供强大的功能和无限的创造力。我们将逐步介绍如何安装和配置这些工具，并提供相关的源代码示例。安装TensorFlowTensorFlow是一个基于数据流图的开源机器学习框架，提供了丰富的工具和库来构建和训练各种深度
AIGC时代品牌突围战：10招玩转DeepSeek内容推荐（深度扩展版）白雪讲堂人工智能大数据机器学习
一、认知革命：从SEO到GEO的生死迭代案例对比：传统SEO困境：某家电品牌2023年投入200万SEO优化，关键词排名TOP3但流量下降42%（SEMrush数据）GEO突破案例：某母婴品牌通过结构化数据改造，AI推荐量从日均300次飙升至1.2万次（来源：DeepSeek官方案例库）实战要点：内容形态改造：将产品参数表升级为JSON-LD格式（某手机品牌实现参数类问题100%引用）流量分配逻辑
Java开发者必看！零成本集成DeepSeek-R1打造AI办公神器，源码级实战教程让你效率翻倍！ Leaton Lee java 人工智能开发语言
目录开篇互动一、为什么是DeepSeek-R1？它凭什么碾压传统AI工具？二、手把手部署DeepSeek-R1本地环境（附避坑指南）步骤1：Docker一键部署步骤2：下载模型步骤3：验证部署三、Java整合DeepSeek-R1：从理论到实战1.添加HTTP客户端依赖（以SpringBoot为例）2.封装AI工具类（核心代码解析）3.实战场景1：自动生成周报（附Prompt技巧）四、高阶玩法：A
DeepSeek + 药物研发：解决药物研发周期长、成本高-降低80%、失败率高-减少40% Debroon 医疗大模型研发 +慢病逆转人工智能深度学习
DeepSeek+药物研发：解决药物研发周期长、成本高-降低80%、失败率高-减少40%论文大纲1.WHY——研究背景与现实问题1.1研究要解决的现实问题与提出背景1.2研究所要解决的问题类别1.3正反例对比关联：和前人的工作有什么关系？3.总结归纳3.1总结收获3.2探索思考4.WHAT——核心发现或论点5.HOW——研究过程、创新与关键数据6.HOWGOOD——理论贡献与实践意义解法拆解1.1
程序员不用写代码？DeepSeek这个隐藏功能让我惊掉下巴后端
凌晨三点半，显示器蓝光映着我的黑眼圈。就在我第18次修改接口文档时，同事老王突然在微信甩来个神秘链接："用这个，今晚能睡个好觉"。我点开那个叫DeepSeek的页面，没想到接下来的三个小时，我经历了职业生涯最魔幻的加班夜。你见过会自己写测试用例的AI吗？那天晚上，我把项目需求文档往DeepSeek的对话框一扔，它竟然像资深架构师似的，先把需求拆解成模块，接着自动生成了带注释的接口文档。最绝的是，在
2025年从DeepSeek到Manus：AI如何重塑企业价值报告600+份汇总解读|附PDF下载
原文链接：https://tecdat.cn/?p=41172当前全球AI技术正从实验室走向产业化深水区，本报告以企业价值重构为核心，通过技术演进路径、行业竞争范式、落地实施策略三大维度，揭示AI如何从成本中心转变为价值引擎。数据显示，2025年生成式AI在中国创造的潜在经济价值达2万亿美元，其中制造业、电子行业生产力增益最为显著。本报告汇总解读基于《发布机构：华中科技大学数智管理与传播研究团队、
程序员别再用GitHub了！这个国产神器让你的开发效率原地起飞后端
去年这个时候，我还在为团队协作的代码管理头疼不已。直到某天凌晨三点，盯着满屏的Git指令的我突然发现，自己居然把feature分支合并到了生产环境——这个要命的失误让我在茶水间被同事调侃了整整三个月。就在我准备写辞职信的时候，一个偶然的机会让我遇到了DeepSeek，这个国产开发神器彻底改变了我的职业生涯。你可能很难想象，现在我的团队每天要处理200多个合并请求，但再也没出现过那次凌晨三点的事故。
windows下使用vscode+cline插件体验MCP，体验使用AI控制浏览器，踩坑记录（至少让你节省3个小时弯路版）（喂饭级别）几道之旅人工智能智能体及数字员工 windows vscode ide 人工智能
为什么网上天天说MCP，你这儿却一点动静都没有？1️⃣人家很早之前就用上了制定标准的Claudedesktop，这玩意儿在咱这儿用不了。对策：使用vscode+cline+deepseek（或其它同级别国产大模型deepseek-V3其实有时比R1效果还好）2️⃣人家也Claude，但人家能用Cursor，咱太穷了，用不了。对策：使用vscode+cline+deepseek（或其它同级别国产大模
Deepseek的本地化部署软件工具包哈拉少12 人工智能
选择模型版本参数规模硬件要求（最低）适用场景1.5B/7B8GB内存，无专用GPU文本处理、简单问答14B16GB内存+12GB显存代码生成、逻辑推理32B/70B24GB显存+32GB内存企业级复杂任务执行命令：ollamarundeepseek-r1:14b（以14B为例）。配置环境变量新增用户变量：OLLAMA_HOST=0.0.0.0OLLAMA_ORIGINS=*重启Ollama服务使配
deepseek 对于 Web 前端过去，现在，未来的看法！称未来可能不叫前端工程师... CoderBin_ 与deepseek的对话前端
一、你对于过去的web前端有什么看法？对早期的Web前端开发（大致在2000年代至2010年代初），可以总结出以下几个关键看法：1.技术原始，但充满探索精神基础技术简单：主要依赖HTML、CSS和原生JavaScript，缺乏现代框架和工具链的支持。兼容性噩梦：不同浏览器（尤其是IE6/7）的渲染差异极大，开发者需要大量Hack代码（如条件注释、CSS滤镜）来适配。创新萌芽：AJAX（2005年）
现在的AI，到底是背答案的高手，还是真正的会思考沐凡资源人工智能
你的孩子用AI写作业，你以为他在抄答案，但AI可能连自己都不知道答案是怎么来的。最近朋友圈被小学生用DeepSeek秒杀作业的新闻给刷屏了。家长们一方面惊叹，“这玩意儿比家教还靠谱呢”，另一方面又焦虑，“孩子会不会被AI养废啦”。这让我也产生了一个疑问：现在的AI究竟是背答案的复读机呢，还是真会推理的最强大脑？于是我搜索了很多资料来了解这件事。毕竟这事儿可不单单跟作业有关系——它对未来的AI起着决
斩获 44K 星！让 DeepSeek 控制你的浏览器，绝了开源项目精选人工智能
Browser-Use的开源框架，是一个能让电脑自动操作网页的智能工具。能处理动态加载的内容（比如广告弹窗、实时更新的图表），遇到网页元素位置变化还会自己调整策略，减少人工干预。Stars数46046Forks数4735主要特点强大的浏览器自动化功能：BrowserUse将先进的AI能力与强大的浏览器自动化技术相结合，为AI智能体实现流畅无缝的网页交互体验。视觉感知与HTML结构提取：将视觉理解能
DeepSeek 助力 Vue3 开发：打造丝滑的表格（Table）之添加导出数据功能示例3，TableView15_03导出全部数据示例宝码香车 #DeepSeek javascript 前端开发语言 vue.js DeepSeek ecmascript
前言：哈喽，大家好，今天给大家分享一篇文章！并提供具体代码帮助大家深入理解，彻底掌握！创作不易，如果能帮助到大家或者给大家一些灵感和启发，欢迎收藏+关注哦目录DeepSeek助力Vue3开发：打造丝滑的表格（Table）之添加导出数据功能示例3，TableView15_03导出全部数据示例前言页面效果组件代码代码测试测试代码正常跑通，附其他基本代码编写路由src\router\index.js编写
论文笔记-Contrastive Learning for Unpaired Image-to-Image Translation kingsleyluoxin 计算机视觉论文笔记深度学习 python 计算机视觉机器学习人工智能深度学习
论文信息标题：ContrastiveLearningforUnpairedImage-to-ImageTranslation作者：TaesungPark,AlexeiA.Efros,RichardZhang,Jun-YanZhu机构：UniversityofCalifornia,Berkeley;AdobeResearch代码链接https://github.com/taesungp/contra
DeepSeek从入门到精通「清华团队」 YuKeeHgg DeepSeek 人工智能 ai
由清华大学新闻与传播学院新媒体研究中心元宇宙文化实验室的余梦珑博士后及其团队撰写文档的核心内容围绕DeepSeek的技术特点、应用场景、使用方法以及如何通过提示语设计提升AI使用效率等方面展开，帮助用户从入门到精通DeepSeek的使用。「文末附下载方式」第一部分：DeepSeek基础概念1.1DeepSeek简介定义：专注通用人工智能（AGI）的中国科技公司，主攻大模型研发与应用。核心产品：开源
DeepSeek行业应用案例——教育未来智慧谷 DeepSeek 人工智能大数据 AI教育
一、简介在数字化浪潮汹涌澎湃的当下，DeepSeek以其强大的技术实力，如同一股创新的洪流，席卷众多行业，为各领域带来了前所未有的变革与突破。本案例集初步收录了40多个来自农业、制造业、汽车行业、手机行业、智能家居、物流、云服务、办公、网络安全、金融、医疗、教育等多个关键行业的应用案例。从助力农业实现病虫害精准预测与智能灌溉，到推动制造业生产故障预警与产品质量提升；从优化汽车智能交互体验与智能驾驶
DeepSeek行业应用案例——制造业篇未来智慧谷人工智能深度学习大数据自然语言处理
一、简介在数字化浪潮汹涌澎湃的当下，DeepSeek以其强大的技术实力，如同一股创新的洪流，席卷众多行业，为各领域带来了前所未有的变革与突破。本案例集初步收录了40多个来自农业、制造业、汽车行业、手机行业、智能家居、物流、云服务、办公、网络安全、金融、医疗、教育等多个关键行业的应用案例。从助力农业实现病虫害精准预测与智能灌溉，到推动制造业生产故障预警与产品质量提升；从优化汽车智能交互体验与智能驾驶
「清华大学、北京大学」DeepSeek 课件PPT专栏 YuKeeHgg DeepSeek AI 华彬智融知识库 DeepSeek ai 华彬智融
你要的这里都打包好啦，快快收藏起来！名称链接团队简介类型DeepSeek——从入门到精通1️⃣DeepSeek从入门到精通「清华团队」清华大学新闻与传播学院新媒体研究中心元宇宙文化实验室PPT课件DeepSeek如何赋能职场应用?——从提示语技巧到多场景应用2️⃣DeepSeek赋能职场应用「清华团队」中央民族大学新闻与传播学院清华大学@新媒沈阳团队向安玲PPT课件普通人如何抓住DeepSeek红
知识蒸馏：从软标签压缩到推理能力迁移的工程实践(基于教师-学生模型的高效压缩技术与DeepSeek合成数据创新) AI仙人掌人工智能 AI 人工智能深度学习语言模型机器学习
知识蒸馏通过迁移教师模型（复杂）的知识到学生模型（轻量），实现模型压缩与性能平衡。核心在于利用教师模型的软标签（概率分布）替代独热编码标签，学生模型不仅学习到教师模型输出数据的类别信息，还能够捕捉到类别之间的相似性和关系，从而提升其泛化能力核心概念知识蒸馏的核心目标是实现从教师模型到学生模型的知识迁移。在实际应用中，无论是大规模语言模型（LLMs）还是其他类型的神经网络模型，都会通过softmax
【迁移学习入门之域适应的背景、理论与方法】进一步理解迁移学习啦？ 985小水博一枚呀深度学习学习笔记迁移学习人工智能机器学习域适应
【迁移学习入门之域适应的背景、理论与方法】进一步理解迁移学习啦？【迁移学习入门之域适应的背景、理论与方法】进一步理解迁移学习啦？文章目录【迁移学习入门之域适应的背景、理论与方法】进一步理解迁移学习啦？1.背景介绍2.理论基础2.1分布差异（DomainShift）2.2迁移学习理论（TransferLearningTheory）2.3领域不变特征（Domain-invariantFeatures）
程序员996写bug？这个AI工具让你头发越秃代码越香后端
凌晨三点的写字楼里，小王第18次按下F5刷新浏览器，控制台又跳出了新的报错信息。咖啡杯底的褐色痕迹在显示器蓝光下格外刺眼，他突然想起入职时主管说的"程序员越秃越强"，摸了摸发际线苦笑——原来这句话的潜台词是"用头发换代码"啊。直到上个月团建时，我发现隔壁工位的老张居然在团建现场掏出笔记本写代码。凑近一看，他正在用DeepSeek的智能提示功能自动补全单元测试。更气人的是，这厮今年居然还长出了新发茬
关于旗正规则引擎下载页面需要弹窗保存到本地目录的问题何必如此 jsp 超链接文件下载窗口
生成下载页面是需要选择“录入提交页面”，生成之后默认的下载页面<a>标签超链接为：<a href="<%=root_stimage%>stimage/image.jsp?filename=<%=strfile234%>&attachname=<%=java.net.URLEncoder.encode(file234filesourc
【Spark九十八】Standalone Cluster Mode下的资源调度源代码分析 bit1129 cluster
在分析源代码之前，首先对Standalone Cluster Mode的资源调度有一个基本的认识：首先，运行一个Application需要Driver进程和一组Executor进程。在Standalone Cluster Mode下，Driver和Executor都是在Master的监护下给Worker发消息创建(Driver进程和Executor进程都需要分配内存和CPU，这就需要Maste
linux上独立安装部署spark daizj linux 安装 spark 1.4 部署
下面讲一下linux上安装spark，以 Standalone Mode 安装 1）首先安装JDK 下载JDK：jdk-7u79-linux-x64.tar.gz ，版本是1.7以上都行，解压 tar -zxvf jdk-7u79-linux-x64.tar.gz 然后配置 ~/.bashrc&nb
Java 字节码之解析一周凡杨 java 字节码 javap
一： Java 字节代码的组织形式类文件 { OxCAFEBABE ，小版本号，大版本号，常量池大小，常量池数组，访问控制标记，当前类信息，父类信息，实现的接口个数，实现的接口信息数组，域个数，域信息数组，方法个数，方法信息数组，属性个数，属性信息数组 } &nbs
java各种小工具代码 g21121 java
1.数组转换成List import java.util.Arrays; Arrays.asList(Object[] obj); 2.判断一个String型是否有值 import org.springframework.util.StringUtils; if (StringUtils.hasText(str)) 3.判断一个List是否有值 import org.spring
加快FineReport报表设计的几个心得体会老A不折腾 finereport
一、从远程服务器大批量取数进行表样设计时，最好按“列顺序”取一个“空的SQL语句”，这样可提高设计速度。否则每次设计时模板均要从远程读取数据，速度相当慢！！二、找一个富文本编辑软件（如NOTEPAD+）编辑SQL语句，这样会很好地检查语法。有时候带参数较多检查语法复杂时，结合FineReport中生成的日志，再找一个第三方数据库访问软件（如PL/SQL）进行数据检索，可以很快定位语法错误。
mysql linux启动与停止墙头上一根草
如何启动/停止/重启MySQL一、启动方式1、使用 service 启动：service mysqld start2、使用 mysqld 脚本启动：/etc/inint.d/mysqld start3、使用 safe_mysqld 启动：safe_mysqld&二、停止1、使用 service 启动：service mysqld stop2、使用 mysqld 脚本启动：/etc/inin
Spring中事务管理浅谈 aijuans spring 事务管理
Spring中事务管理浅谈 By Tony Jiang@2012-1-20 Spring中对事务的声明式管理拿一个XML举例 [html] view plain copy print ? <?xml version="1.0" encoding="UTF-8"?>&nb
php中隐形字符65279（utf-8的BOM头）问题 alxw4616
php中隐形字符65279（utf-8的BOM头）问题今天遇到一个问题. php输出JSON 前端在解析时发生问题:parsererror. 调试: 1.仔细对比字符串发现字符串拼写正确.怀疑是非打印字符的问题. 2.逐一将字符串还原为unicode编码. 发现在字符串头的位置出现了一个 65279的非打印字符.
调用对象是否需要传递对象(初学者一定要注意这个问题) 百合不是茶对象的传递与调用技巧
类和对象的简单的复习,在做项目的过程中有时候不知道怎样来调用类创建的对象,简单的几个类可以看清楚,一般在项目中创建十几个类往往就不知道怎么来看为了以后能够看清楚,现在来回顾一下类和对象的创建,对象的调用和传递(前面写过一篇) 类和对象的基础概念: JAVA中万事万物都是类类有字段(属性),方法,嵌套类和嵌套接
JDK1.5 AtomicLong实例 bijian1013 java thread java多线程 AtomicLong
JDK1.5 AtomicLong实例类 AtomicLong 可以用原子方式更新的 long 值。有关原子变量属性的描述，请参阅 java.util.concurrent.atomic 包规范。AtomicLong 可用在应用程序中（如以原子方式增加的序列号），并且不能用于替换 Long。但是，此类确实扩展了 Number，允许那些处理基于数字类的工具和实用工具进行统一访问。
自定义的RPC的Java实现 bijian1013 java rpc
网上看到纯java实现的RPC，很不错。 RPC的全名Remote Process Call，即远程过程调用。使用RPC，可以像使用本地的程序一样使用远程服务器上的程序。下面是一个简单的RPC 调用实例，从中可以看到RPC如何
【RPC框架Hessian一】Hessian RPC Hello World bit1129 Hello world
什么是Hessian The Hessian binary web service protocol makes web services usable without requiring a large framework, and without learning yet another alphabet soup of protocols. Because it is a binary p
【Spark九十五】Spark Shell操作Spark SQL bit1129 shell
在Spark Shell上，通过创建HiveContext可以直接进行Hive操作 1. 操作Hive中已存在的表 [hadoop@hadoop bin]$ ./spark-shell Spark assembly has been built with Hive, including Datanucleus jars on classpath Welcom
F5　往header加入客户端的ip ronin47
when HTTP_RESPONSE {if {[HTTP::is_redirect]}{ HTTP::header replace Location [string map {:port/ /} [HTTP::header value Location]]HTTP::header replace Lo
java-61-在数组中，数字减去它右边(注意是右边)的数字得到一个数对之差. 求所有数对之差的最大值。例如在数组{2, 4, 1, 16, 7, 5, bylijinnan java
思路来自： http://zhedahht.blog.163.com/blog/static/2541117420116135376632/ 写了个java版的 public class GreatestLeftRightDiff { /** * Q61.在数组中，数字减去它右边(注意是右边)的数字得到一个数对之差。 * 求所有数对之差的最大值。例如在数组
mongoDB 索引开窍的石头 mongoDB索引
在这一节中我们讲讲在mongo中如何创建索引得到当前查询的索引信息 db.user.find(_id:12).explain(); cursor: basicCoursor 指的是没有索引 &
[硬件和系统]迎峰度夏 comsci 系统
从这几天的气温来看，今年夏天的高温天气可能会维持在一个比较长的时间内所以，从现在开始准备渡过炎热的夏天。。。。每间房屋要有一个落地电风扇，一个空调(空调的功率和房间的面积有密切的关系) 坐的，躺的地方要有凉垫，床上要有凉席电脑的机箱
基于ThinkPHP开发的公司官网 cuiyadll 行业系统
后端基于ThinkPHP，前端基于jQuery和BootstrapCo.MZ 企业系统轻量级企业网站管理系统运行环境:PHP5.3+, MySQL5.0 系统预览系统下载：http://www.tecmz.com 预览地址：http://co.tecmz.com 各种设备自适应响应式的网站设计能够对用户产生友好度，并且对于
Transaction and redelivery in JMS (JMS的事务和失败消息重发机制) darrenzhu jms 事务承认 MQ acknowledge
JMS Message Delivery Reliability and Acknowledgement Patterns http://wso2.com/library/articles/2013/01/jms-message-delivery-reliability-acknowledgement-patterns/ Transaction and redelivery in
Centos添加硬盘完全教程 dcj3sjt126com linux centos hardware
Linux的硬盘识别: sda 表示第1块SCSI硬盘 hda 表示第1块IDE硬盘 scd0 表示第1个USB光驱一般使用“fdisk -l”命
yii2 restful web服务路由 dcj3sjt126com PHP yii2
路由随着资源和控制器类准备，您可以使用URL如 http://localhost/index.php?r=user/create访问资源，类似于你可以用正常的Web应用程序做法。在实践中，你通常要用美观的URL并采取有优势的HTTP动词。例如，请求POST /users意味着访问user/create动作。这可以很容易地通过配置urlManager应用程序组件来完成如下所示
MongoDB查询(4)——游标和分页[八] eksliang mongodb MongoDB游标 MongoDB深分页
转载请出自出处：http://eksliang.iteye.com/blog/2177567 一、游标数据库使用游标返回find的执行结果。客户端对游标的实现通常能够对最终结果进行有效控制，从shell中定义一个游标非常简单，就是将查询结果分配给一个变量（用var声明的变量就是局部变量），便创建了一个游标，如下所示： > var
Activity的四种启动模式和onNewIntent() gundumw100 android
Android中Activity启动模式详解　　在Android中每个界面都是一个Activity，切换界面操作其实是多个不同Activity之间的实例化操作。在Android中Activity的启动模式决定了Activity的启动运行方式。　　Android总Activity的启动模式分为四种： Activity启动模式设置： <acti
攻城狮送女友的CSS3生日蛋糕 ini html Web html5 css css3
在线预览：http://keleyi.com/keleyi/phtml/html5/29.htm 代码如下： <!DOCTYPE html> <html> <head> <meta charset="UTF-8"> <title>攻城狮送女友的CSS3生日蛋糕-柯乐义<
读源码学Servlet（1）GenericServlet 源码分析 jzinfo tomcat Web servlet 网络应用网络协议
Servlet API的核心就是javax.servlet.Servlet接口，所有的Servlet 类（抽象的或者自己写的）都必须实现这个接口。在Servlet接口中定义了5个方法，其中有3个方法是由Servlet 容器在Servlet的生命周期的不同阶段来调用的特定方法。先看javax.servlet.servlet接口源码： package
JAVA进阶：VO(DTO)与PO(DAO)之间的转换 snoopy7713 java VO Hibernate po
PO即 Persistence Object　　VO即 Value Object 　VO和PO的主要区别在于：　　VO是独立的Java Object。　　PO是由Hibernate纳入其实体容器（Entity Map）的对象，它代表了与数据库中某条记录对应的Hibernate实体，PO的变化在事务提交时将反应到实际数据库中。　实际上，这个VO被用作Data Transfer
mongodb group by date 聚合查询日期统计每天数据（信息量） qiaolevip 每天进步一点点学习永无止境 mongodb 纵观千象
/* 1 */ { "_id" : ObjectId("557ac1e2153c43c320393d9d"), "msgType" : "text", "sendTime" : ISODate("2015-06-12T11:26:26.000Z")
java之18天常用的类(一) Luob. Math Date System Runtime Rundom
System类 import java.util.Properties; /** * System: * out:标准输出,默认是控制台 * in:标准输入,默认是键盘 * * 描述系统的一些信息 * 获取系统的属性信息:Properties getProperties(); * * * */ public class Sy
maven wuai maven
1、安装maven：解压缩、添加M2_HOME、添加环境变量path 2、创建maven_home文件夹，创建项目mvn_ch01,在其下面建立src、pom.xml，在src下面简历main、test、main下面建立java文件夹 3、编写类，在java文件夹下面依照类的包逐层创建文件夹，将此类放入最后一级文件夹 4、进入mvn_ch01 4.1、mvn compile ,执行后会在