Official site
Translation notes:
Eager: keen, enthusiastic. Many Chinese articles render it as "动态图" (dynamic graph), so Eager execution (immediate execution) is translated here as dynamic-graph mode.
DevOps (a combination of Development and Operations) emphasizes communication and collaboration between software developers and operations staff, using automated processes to make building, testing, and releasing software faster, more frequent, and more reliable.
Eager Execution
Eager execution is a feature that makes TensorFlow execute operations immediately: concrete values are returned, instead of creating a computational graph that is executed later.
A user guide is available: https://www.tensorflow.org/programmers_guide/eager (source file)
We welcome feedback through GitHub issues.
Sample code is available, including benchmarks for some models, under tensorflow/contrib/eager/python/examples.
Eager Execution
TensorFlow's eager execution is an imperative programming environment that evaluates operations immediately, without building graphs: operations return concrete values instead of constructing a computational graph to run later. This makes it easy to get started with TensorFlow and debug models, and it reduces boilerplate as well. To follow along with this guide, run the code samples below in an interactive Python interpreter.
Eager execution is a flexible machine learning platform for research and experimentation.
Eager execution supports most TensorFlow operations and GPU acceleration. For a collection of examples running in eager execution, see: tensorflow/contrib/eager/python/examples.
Note: Some models may experience increased overhead with eager execution enabled. Performance improvements are ongoing, but please file a bug if you find a problem and share your benchmarks.
Setup and basic usage
Upgrade to the latest version of TensorFlow:
$ pip install --upgrade tensorflow
To start eager execution, add tf.enable_eager_execution() to the beginning of the program or console session. Do not add this operation to other modules that the program calls.
from __future__ import absolute_import, division, print_function
import tensorflow as tf
tf.enable_eager_execution()
(Adjust this to your actual TensorFlow version; the call for enabling eager execution differs between versions.)
#!/usr/bin/env python
# -*- coding: utf-8 -*-
# TensorFlow 1.6.0
from __future__ import absolute_import, division, print_function
import tensorflow as tf
import tensorflow.contrib.eager as tfe
tfe.enable_eager_execution()
x = [[2.]]
m = tf.matmul(x, x)
print("hello, {}".format(m)) # => "hello, [[4.]]"
Now you can run TensorFlow operations and the results will return immediately:
tf.executing_eagerly() # => True
x = [[2.]]
m = tf.matmul(x, x)
print("hello, {}".format(m)) # => "hello, [[4.]]"
Enabling eager execution changes how TensorFlow operations behave—now they immediately evaluate and return their values to Python. tf.Tensor objects reference concrete values instead of symbolic handles to nodes in a computational graph. Since there isn't a computational graph to build and run later in a session, it's easy to inspect results using print() or a debugger. Evaluating, printing, and checking tensor values does not break the flow for computing gradients.
Eager execution works nicely with NumPy. NumPy operations accept tf.Tensor arguments. TensorFlow math operations convert Python objects and NumPy arrays to tf.Tensor objects. The tf.Tensor.numpy method returns the object's value as a NumPy ndarray.
a = tf.constant([[1, 2],
[3, 4]])
print(a)
# => tf.Tensor([[1 2]
# [3 4]], shape=(2, 2), dtype=int32)
# Broadcasting support
b = tf.add(a, 1)
print(b)
# => tf.Tensor([[2 3]
# [4 5]], shape=(2, 2), dtype=int32)
# Operator overloading is supported
print(a * b)
# => tf.Tensor([[ 2 6]
# [12 20]], shape=(2, 2), dtype=int32)
# Use NumPy values
import numpy as np
c = np.multiply(a, b)
print(c)
# => [[ 2 6]
# [12 20]]
# Obtain numpy value from a tensor:
print(a.numpy())
# => [[1 2]
# [3 4]]
The tf.contrib.eager module contains symbols available to both eager and graph execution environments and is useful for writing code to work with graphs:
tfe = tf.contrib.eager
Dynamic control flow
A major benefit of eager execution is that all the functionality of the host language is available while your model is executing. So, for example, it is easy to write fizzbuzz:
def fizzbuzz(max_num):
  counter = tf.constant(0)
  max_num = tf.convert_to_tensor(max_num)
  for num in range(max_num.numpy()):
    num = tf.constant(num)
    if int(num % 3) == 0 and int(num % 5) == 0:
      print('FizzBuzz')
    elif int(num % 3) == 0:
      print('Fizz')
    elif int(num % 5) == 0:
      print('Buzz')
    else:
      print(num)
    counter += 1
  return counter
This has conditionals that depend on tensor values and it prints these values at runtime.
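For instance, calling it from the interactive session prints the sequence and returns the loop counter as a tf.Tensor (a quick usage sketch, assuming the eager setup above; the variable name fizzbuzz_counter is just for illustration):
fizzbuzz_counter = fizzbuzz(16)  # prints FizzBuzz/Fizz/Buzz or the tensor value for 0 through 15
print(fizzbuzz_counter)          # => tf.Tensor(16, shape=(), dtype=int32)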
Build a model
Many machine learning models are represented by composing layers. When using TensorFlow with eager execution you can either write your own layers or use a layer provided in the tf.keras.layers package.
While you can use any Python object to represent a layer, TensorFlow has tf.keras.layers.Layer as a convenient base class. Inherit from it to implement your own layer:
class MySimpleLayer(tf.keras.layers.Layer):
  def __init__(self, output_units):
    super(MySimpleLayer, self).__init__()
    self.output_units = output_units

  def build(self, input_shape):
    # The build method gets called the first time your layer is used.
    # Creating variables on build() allows you to make their shape depend
    # on the input shape and hence removes the need for the user to specify
    # full shapes. It is possible to create variables during __init__() if
    # you already know their full shapes.
    self.kernel = self.add_variable(
      "kernel", [input_shape[-1], self.output_units])

  def call(self, input):
    # Override call() instead of __call__ so we can perform some bookkeeping.
    return tf.matmul(input, self.kernel)
Use the tf.keras.layers.Dense layer instead of MySimpleLayer above, as it has a superset of its functionality (it can also add a bias).
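For example, a Dense layer of the same size can be applied directly to a tensor; use_bias and activation are standard Keras arguments (a minimal sketch, not part of the original guide):
layer = tf.keras.layers.Dense(units=10, use_bias=True, activation=tf.nn.relu)
y = layer(tf.zeros([1, 784]))  # the kernel and bias are created on the first call
print(y.shape)                 # => (1, 10)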
When composing layers into models you can use tf.keras.Sequential to represent models which are a linear stack of layers. It is easy to use for basic models:
model = tf.keras.Sequential([
tf.keras.layers.Dense(10, input_shape=(784,)), # must declare input shape
tf.keras.layers.Dense(10)
])
Alternatively, organize models in classes by inheriting from tf.keras.Model. This is a container for layers that is a layer itself, allowing tf.keras.Model objects to contain other tf.keras.Model objects.
class MNISTModel(tf.keras.Model):
  def __init__(self):
    super(MNISTModel, self).__init__()
    self.dense1 = tf.keras.layers.Dense(units=10)
    self.dense2 = tf.keras.layers.Dense(units=10)

  def call(self, input):
    """Run the model."""
    result = self.dense1(input)
    result = self.dense2(result)
    result = self.dense2(result)  # reuse variables from dense2 layer
    return result
model = MNISTModel()
It's not required to set an input shape for the tf.keras.Model class since the parameters are set the first time input is passed to the layer.
tf.keras.layers classes create and contain their own model variables that are tied to the lifetime of their layer objects. To share layer variables, share their objects, as in the sketch below.
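For instance, applying one Dense object in two places makes both calls read and update the same kernel and bias (a minimal illustration, not from the original guide; the names shared_dense, features_a, and features_b are hypothetical):
shared_dense = tf.keras.layers.Dense(units=10)  # one layer object, one set of variables
features_a = shared_dense(tf.zeros([1, 784]))   # creates the variables
features_b = shared_dense(tf.ones([1, 784]))    # reuses the same kernel and bias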
Eager training
Automatic differentiation is useful for implementing machine learning algorithms such as backpropagation for training neural networks. During eager execution, use tf.GradientTape to trace operations for computing gradients later.
tf.GradientTape is an opt-in feature to provide maximal performance when not tracing. Since different operations can occur during each call, all forward-pass operations get recorded to a "tape". To compute the gradient, play the tape backwards and then discard. A particular tf.GradientTape can only compute one gradient; subsequent calls throw a runtime error.
w = tfe.Variable([[1.0]])
with tf.GradientTape() as tape:
  loss = w * w
grad = tape.gradient(loss, w)
print(grad)  # => tf.Tensor([[ 2.]], shape=(1, 1), dtype=float32)
Here's an example of tf.GradientTape that records forward-pass operations to train a simple model:
# A toy dataset of points around 3 * x + 2
NUM_EXAMPLES = 1000
training_inputs = tf.random_normal([NUM_EXAMPLES])
noise = tf.random_normal([NUM_EXAMPLES])
training_outputs = training_inputs * 3 + 2 + noise

def prediction(input, weight, bias):
  return input * weight + bias

# A loss function using mean-squared error
def loss(weights, biases):
  error = prediction(training_inputs, weights, biases) - training_outputs
  return tf.reduce_mean(tf.square(error))

# Return the derivative of loss with respect to weight and bias
def grad(weights, biases):
  with tf.GradientTape() as tape:
    loss_value = loss(weights, biases)
  return tape.gradient(loss_value, [weights, biases])

train_steps = 200
learning_rate = 0.01
# Start with arbitrary values for W and B on the same batch of data
W = tfe.Variable(5.)
B = tfe.Variable(10.)

print("Initial loss: {:.3f}".format(loss(W, B)))

for i in range(train_steps):
  dW, dB = grad(W, B)
  W.assign_sub(dW * learning_rate)
  B.assign_sub(dB * learning_rate)
  if i % 20 == 0:
    print("Loss at step {:03d}: {:.3f}".format(i, loss(W, B)))

print("Final loss: {:.3f}".format(loss(W, B)))
print("W = {}, B = {}".format(W.numpy(), B.numpy()))
Output (exact numbers may vary):
Initial loss: 71.204
Loss at step 000: 68.333
Loss at step 020: 30.222
Loss at step 040: 13.691
Loss at step 060: 6.508
Loss at step 080: 3.382
Loss at step 100: 2.018
Loss at step 120: 1.422
Loss at step 140: 1.161
Loss at step 160: 1.046
Loss at step 180: 0.996
Final loss: 0.974
W = 3.01582956314, B = 2.1191945076
Replay the tf.GradientTape to compute the gradients and apply them in a training loop. This is demonstrated in an excerpt from the mnist_eager.py example:
dataset = tf.data.Dataset.from_tensor_slices((data.train.images,
                                              data.train.labels))
...
for (batch, (images, labels)) in enumerate(dataset):
  ...
  with tf.GradientTape() as tape:
    logits = model(images, training=True)
    loss_value = loss(logits, labels)
  ...
  grads = tape.gradient(loss_value, model.variables)
  optimizer.apply_gradients(zip(grads, model.variables),
                            global_step=tf.train.get_or_create_global_step())
The following example creates a multi-layer model that classifies the standard MNIST handwritten digits. It demonstrates the optimizer and layer APIs to build trainable graphs in an eager execution environment.
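The model used below can be any of the layer-based models defined earlier; for example, reusing the MNISTModel class from the Build a model section (an assumption for this excerpt; the upstream example defines its own architecture):
model = MNISTModel()  # two stacked Dense(10) layers, matching the (1, 1, 10) output shown below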
Train a model
Even without training, call the model and inspect the output in eager execution:
# Create a tensor representing a blank image
batch = tf.zeros([1, 1, 784])
print(batch.shape) # => (1, 1, 784)
result = model(batch)
# => tf.Tensor([[[ 0. 0., ..., 0.]]], shape=(1, 1, 10), dtype=float32)
This example uses the dataset.py module from the TensorFlow MNIST example; download this file to your local directory. Run the following to download the MNIST data files to your working directory and prepare a tf.data.Dataset for training:
import dataset # download dataset.py file
dataset_train = dataset.train('./datasets').shuffle(60000).repeat(4).batch(32)
To train a model, define a loss function to optimize and then calculate gradients. Use an optimizer to update the variables:
def loss(model, x, y):
  prediction = model(x)
  return tf.losses.sparse_softmax_cross_entropy(labels=y, logits=prediction)

def grad(model, inputs, targets):
  with tf.GradientTape() as tape:
    loss_value = loss(model, inputs, targets)
  return tape.gradient(loss_value, model.variables)

optimizer = tf.train.GradientDescentOptimizer(learning_rate=0.001)

x, y = iter(dataset_train).next()
print("Initial loss: {:.3f}".format(loss(model, x, y)))

# Training loop
for (i, (x, y)) in enumerate(dataset_train):
  # Calculate derivatives of the input function with respect to its parameters.
  grads = grad(model, x, y)
  # Apply the gradient to the model
  optimizer.apply_gradients(zip(grads, model.variables),
                            global_step=tf.train.get_or_create_global_step())
  if i % 200 == 0:
    print("Loss at step {:04d}: {:.3f}".format(i, loss(model, x, y)))

print("Final loss: {:.3f}".format(loss(model, x, y)))
Output (exact numbers may vary):
Initial loss: 2.674
Loss at step 0000: 2.593
Loss at step 0200: 2.143
Loss at step 0400: 2.009
Loss at step 0600: 2.103
Loss at step 0800: 1.621
Loss at step 1000: 1.695
...
Loss at step 6600: 0.602
Loss at step 6800: 0.557
Loss at step 7000: 0.499
Loss at step 7200: 0.744
Loss at step 7400: 0.681
Final loss: 0.670
And for faster training, move the computation to a GPU:
with tf.device("/gpu:0"):
for (i, (x, y)) in enumerate(dataset_train):
# minimize() is equivalent to the grad() and apply_gradients() calls.
optimizer.minimize(lambda: loss(model, x, y),
global_step=tf.train.get_or_create_global_step())
Variables and optimizers
tfe.Variable objects store mutable tf.Tensor values accessed during training to make automatic differentiation easier. The parameters of a model can be encapsulated in classes as variables.
Better encapsulate model parameters by using tfe.Variable with tf.GradientTape. For example, the automatic differentiation example above can be rewritten:
class Model(tf.keras.Model):
  def __init__(self):
    super(Model, self).__init__()
    self.W = tfe.Variable(5., name='weight')
    self.B = tfe.Variable(10., name='bias')

  def predict(self, inputs):
    return inputs * self.W + self.B

# A toy dataset of points around 3 * x + 2
NUM_EXAMPLES = 2000
training_inputs = tf.random_normal([NUM_EXAMPLES])
noise = tf.random_normal([NUM_EXAMPLES])
training_outputs = training_inputs * 3 + 2 + noise

# The loss function to be optimized
def loss(model, inputs, targets):
  error = model.predict(inputs) - targets
  return tf.reduce_mean(tf.square(error))

def grad(model, inputs, targets):
  with tf.GradientTape() as tape:
    loss_value = loss(model, inputs, targets)
  return tape.gradient(loss_value, [model.W, model.B])

# Define:
# 1. A model.
# 2. Derivatives of a loss function with respect to model parameters.
# 3. A strategy for updating the variables based on the derivatives.
model = Model()
optimizer = tf.train.GradientDescentOptimizer(learning_rate=0.01)

print("Initial loss: {:.3f}".format(loss(model, training_inputs, training_outputs)))

# Training loop
for i in range(300):
  grads = grad(model, training_inputs, training_outputs)
  optimizer.apply_gradients(zip(grads, [model.W, model.B]),
                            global_step=tf.train.get_or_create_global_step())
  if i % 20 == 0:
    print("Loss at step {:03d}: {:.3f}".format(i, loss(model, training_inputs, training_outputs)))

print("Final loss: {:.3f}".format(loss(model, training_inputs, training_outputs)))
print("W = {}, B = {}".format(model.W.numpy(), model.B.numpy()))
Output (exact numbers may vary):
Initial loss: 69.066
Loss at step 000: 66.368
Loss at step 020: 30.107
Loss at step 040: 13.959
Loss at step 060: 6.769
Loss at step 080: 3.567
Loss at step 100: 2.141
Loss at step 120: 1.506
Loss at step 140: 1.223
Loss at step 160: 1.097
Loss at step 180: 1.041
Loss at step 200: 1.016
Loss at step 220: 1.005
Loss at step 240: 1.000
Loss at step 260: 0.998
Loss at step 280: 0.997
Final loss: 0.996
W = 2.99431324005, B = 2.02129220963
With graph execution, program state (such as the variables) is stored in global collections and their lifetime is managed by the tf.Session object. In contrast, during eager execution the lifetime of state objects is determined by the lifetime of their corresponding Python object.
Variables are objects
During eager execution, variables persist until the last reference to the object is removed, and are then deleted.
with tf.device("gpu:0"):
v = tfe.Variable(tf.random_normal([1000, 1000]))
v = None # v no longer takes up GPU memory
tfe.Checkpoint can save and restore tfe.Variables to and from checkpoints:
x = tfe.Variable(10.)
checkpoint = tfe.Checkpoint(x=x) # save as "x"
x.assign(2.) # Assign a new value to the variables and save.
save_path = checkpoint.save('./ckpt/')
x.assign(11.) # Change the variable after saving.
# Restore values from the checkpoint
checkpoint.restore(save_path)
print(x) # => 2.0
To save and load models, tfe.Checkpoint stores the internal state of objects, without requiring hidden variables. To record the state of a model, an optimizer, and a global step, pass them to a tfe.Checkpoint:
model = MyModel()
optimizer = tf.train.AdamOptimizer(learning_rate=0.001)
checkpoint_dir = '/path/to/model_dir'
checkpoint_prefix = os.path.join(checkpoint_dir, "ckpt")
root = tfe.Checkpoint(optimizer=optimizer,
model=model,
optimizer_step=tf.train.get_or_create_global_step())
root.save(file_prefix=checkpoint_prefix)
# or
root.restore(tf.train.latest_checkpoint(checkpoint_dir))
tfe.metrics are stored as objects. Update a metric by passing the new data to the callable, and retrieve the result using the tfe.metrics.result method, for example:
m = tfe.metrics.Mean("loss")
m(0)
m(5)
m.result() # => 2.5
m([8, 9])
m.result() # => 5.5
TensorBoard is a visualization tool for understanding, debugging and optimizing the model training process. It uses summary events that are written while executing the program.
tf.contrib.summary is compatible with both eager and graph execution environments. Summary operations, such as tf.contrib.summary.scalar, are inserted during model construction. For example, to record summaries once every 100 global steps:
writer = tf.contrib.summary.create_file_writer(logdir)
global_step = tf.train.get_or_create_global_step()  # return global step var

writer.set_as_default()

for _ in range(iterations):
  global_step.assign_add(1)
  # Must include a record_summaries method
  with tf.contrib.summary.record_summaries_every_n_global_steps(100):
    # your model code goes here
    tf.contrib.summary.scalar('loss', loss)
    ...
Dynamic models
tf.GradientTape can also be used in dynamic models. This example of a backtracking line search algorithm looks like normal NumPy code, except there are gradients and it is differentiable, despite the complex control flow:
def line_search_step(fn, init_x, rate=1.0):
  with tf.GradientTape() as tape:
    # Variables are automatically recorded, but manually watch a tensor
    tape.watch(init_x)
    value = fn(init_x)
  grad = tape.gradient(value, init_x)
  grad_norm = tf.reduce_sum(grad * grad)
  init_value = value
  while value > init_value - rate * grad_norm:
    x = init_x - rate * grad
    value = fn(x)
    rate /= 2.0
  return x, value
tf.GradientTape is a powerful interface for computing gradients, but there is another Autograd-style API available for automatic differentiation. These functions are useful if writing math code with only tensors and gradient functions, and without tfe.Variables:
* tfe.gradients_function: returns a function that computes the derivatives of its input function parameter with respect to its arguments. The input function parameter must return a scalar value. When the returned function is invoked, it returns a list of tf.Tensor objects: one element for each argument of the input function. Since anything of interest must be passed as a function parameter, this becomes unwieldy if there's a dependency on many trainable parameters.
* tfe.value_and_gradients_function: similar to tfe.gradients_function, but when the returned function is invoked, it returns the value from the input function in addition to the list of derivatives of the input function with respect to its arguments.

In the following example, tfe.gradients_function takes the square function as an argument and returns a function that computes the partial derivatives of square with respect to its inputs. To calculate the derivative of square at 3, grad(3.0) returns 6.
def square(x):
  return tf.multiply(x, x)

grad = tfe.gradients_function(square)

square(3.)  # => 9.0
grad(3.)    # => [6.0]

# The second-order derivative of square:
gradgrad = tfe.gradients_function(lambda x: grad(x)[0])
gradgrad(3.)  # => [2.0]

# The third-order derivative is None:
gradgradgrad = tfe.gradients_function(lambda x: gradgrad(x)[0])
gradgradgrad(3.)  # => [None]

# With flow control:
def abs(x):
  return x if x > 0. else -x

grad = tfe.gradients_function(abs)

grad(3.)   # => [1.0]
grad(-3.)  # => [-1.0]
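For comparison, a short sketch of tfe.value_and_gradients_function applied to the same square function; the exact return format is assumed from the description above rather than copied from the original guide:
val_and_grad = tfe.value_and_gradients_function(square)
value, grads = val_and_grad(3.)
print(value)  # => 9.0
print(grads)  # => [6.0]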
Custom gradients are an easy way to override gradients in eager and graph execution. Within the forward function, define the gradient with respect to the inputs, outputs, or intermediate results. For example, here's an easy way to clip the norm of the gradients in the backward pass:
@tf.custom_gradient
def clip_gradient_by_norm(x, norm):
  y = tf.identity(x)
  def grad_fn(dresult):
    return [tf.clip_by_norm(dresult, norm), None]
  return y, grad_fn
Custom gradients are commonly used to provide a numerically stable gradient for a sequence of operations:
def log1pexp(x):
  return tf.log(1 + tf.exp(x))
grad_log1pexp = tfe.gradients_function(log1pexp)

# The gradient computation works fine at x = 0.
grad_log1pexp(0.)    # => [0.5]

# However, x = 100 fails because of numerical instability.
grad_log1pexp(100.)  # => [nan]
Here, the log1pexp function can be analytically simplified with a custom gradient. The implementation below reuses the value of tf.exp(x) that is computed during the forward pass, making it more efficient by eliminating redundant calculations:
@tf.custom_gradient
def log1pexp(x):
  e = tf.exp(x)
  def grad(dy):
    return dy * (1 - 1 / (1 + e))
  return tf.log(1 + e), grad
grad_log1pexp = tfe.gradients_function(log1pexp)

# As before, the gradient computation works fine at x = 0.
grad_log1pexp(0.)    # => [0.5]

# And the gradient computation also works at x = 100.
grad_log1pexp(100.)  # => [1.0]
Computation is automatically offloaded to GPUs during eager execution. If you want control over where a computation runs you can enclose it in a tf.device('/gpu:0') block (or the CPU equivalent):
import time

def measure(x, steps):
  # TensorFlow initializes a GPU the first time it's used, exclude from timing.
  tf.matmul(x, x)
  start = time.time()
  for i in range(steps):
    x = tf.matmul(x, x)
    _ = x.numpy()  # Make sure to execute op and not just enqueue it
  end = time.time()
  return end - start

shape = (1000, 1000)
steps = 200
print("Time to multiply a {} matrix by itself {} times:".format(shape, steps))

# Run on CPU:
with tf.device("/cpu:0"):
  print("CPU: {} secs".format(measure(tf.random_normal(shape), steps)))

# Run on GPU, if available:
if tfe.num_gpus() > 0:
  with tf.device("/gpu:0"):
    print("GPU: {} secs".format(measure(tf.random_normal(shape), steps)))
else:
  print("GPU: not found")
Output (exact numbers depend on hardware):
Time to multiply a (1000, 1000) matrix by itself 200 times:
CPU: 4.614904403686523 secs
GPU: 0.5581181049346924 secs
A tf.Tensor object can be copied to a different device to execute its operations:
x = tf.random_normal([10, 10])

x_gpu0 = x.gpu()
x_cpu = x.cpu()

_ = tf.matmul(x_cpu, x_cpu)    # Runs on CPU
_ = tf.matmul(x_gpu0, x_gpu0)  # Runs on GPU:0

if tfe.num_gpus() > 1:
  x_gpu1 = x.gpu(1)
  _ = tf.matmul(x_gpu1, x_gpu1)  # Runs on GPU:1
Benchmarks
For compute-heavy models, such as ResNet50 training on a GPU, eager execution performance is comparable to graph execution. But this gap grows larger for models with less computation and there is work to be done for optimizing hot code paths for models with lots of small operations.
Work with graphs
While eager execution makes development and debugging more interactive, TensorFlow graph execution has advantages for distributed training, performance optimizations, and production deployment. However, writing graph code can feel different than writing regular Python code and more difficult to debug.
For building and training graph-constructed models, the Python program first builds a graph representing the computation, then invokes Session.run to send the graph for execution on the C++-based runtime. This enables the distributed training, performance optimizations, and production deployment described above; the basic build-then-run pattern is sketched below.
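A minimal illustration of that pattern (not taken from the original guide), run in a fresh Python session where eager execution has not been enabled:
import tensorflow as tf  # eager execution is NOT enabled in this session

x = tf.placeholder(tf.float32, shape=[1, 1])  # symbolic input, no value yet
m = tf.matmul(x, x)                           # adds a matmul node to the graph

with tf.Session() as sess:
  print(sess.run(m, feed_dict={x: [[2.0]]}))  # => [[4.]]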
Deploying code written for eager execution is more difficult: either generate a graph from the model, or run the Python runtime and code directly on the server.
Write compatible code
The same code written for eager execution will also build a graph during graph execution. Do this by simply running the same code in a new Python session where eager execution is not enabled.
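As a small sketch of that idea (an illustration under assumptions, not code from the original guide; the helper name double_then_square is hypothetical), the function below uses only ops that behave the same way under both execution modes:
def double_then_square(x):
  # Plain TensorFlow ops: valid under both eager and graph execution.
  return tf.square(tf.multiply(x, 2.0))

result = double_then_square(tf.constant(3.0))
# With eager enabled: result is a concrete tf.Tensor with value 36.0.
# Without eager (graph mode): result is a symbolic tensor; evaluate it with sess.run(result).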
Most TensorFlow operations work during eager execution, but there are some things to keep in mind:
* Use tf.data for input processing instead of queues. It's faster and easier.
* Use object-oriented layer APIs, like tf.keras.layers and tf.keras.Model, since they have explicit storage for variables.
* Once eager execution is enabled with tf.enable_eager_execution, it cannot be turned off. Start a new Python session to return to graph execution.

It's best to write code for both eager execution and graph execution. This gives you eager's interactive experimentation and debuggability with the distributed performance benefits of graph execution.
Write, debug, and iterate in eager execution, then import the model graph for production deployment. Use tfe.Checkpoint to save and restore model variables; this allows movement between eager and graph execution environments. See the examples in: tensorflow/contrib/eager/python/examples.
Use eager execution in a graph environment
Selectively enable eager execution in a TensorFlow graph environment using tfe.py_func. This is used when tf.enable_eager_execution() has not been called.
def my_py_func(x):
  x = tf.matmul(x, x)  # You can use tf ops
  print(x)             # but it's eager!
  return x

with tf.Session() as sess:
  x = tf.placeholder(dtype=tf.float32)
  # Call eager function in graph!
  pf = tfe.py_func(my_py_func, [x], tf.float32)
  sess.run(pf, feed_dict={x: [[2.0]]})  # [[4.0]]
Complete project download
For readers without download points, add QQ 452205574 to access the shared folder.
It includes the code, the dataset (images), the trained model, library installers, and so on.