cherry1307

吴恩达神经网络与深度学习——深度神经网络习题4：构建DNN架构

吴恩达神经网络与深度学习——深度神经网络习题4

构建DNN架构

包
作业大纲

初始化

2层NN
L层NN

前向传播

2层NN

线性部分
激活函数
线性+激活函数

L层NN

代价函数
反向传播

2层NN

线性部分
激活函数+线性部分

L层NN

更新参数

构建DNN架构

a^[l]:第l层的激活函数
w^[l]，b^[l]:第l层的参数
x^(i):第i个训练样本
a^[l]_i：第l层第i个神经元的激活函数

包

numpy
matplotlib
dnn_utils

import numpy as np
import h5py
import matplotlib.pyplot as plt
from testCases_v2 import *
from dnn_utils_v2 import sigmoid, sigmoid_backward, relu, relu_backward

%matplotlib inline
plt.rcParams['figure.figsize'] = (5.0, 4.0) # set default size of plots
plt.rcParams['image.interpolation'] = 'nearest'
plt.rcParams['image.cmap'] = 'gray'

%load_ext autoreload
%autoreload 2

np.random.seed(1)

作业大纲

建一个2层NN和一个L层NN
	1.初始化参数
	2.实现前向传播
		实现一个线性部分(Z^[l])
		实现激活函数(ReLu/sigmoid)
		将前两步结合起来，形成[LINEAR->ACTIVATION]正向传播
		复制L-1次，最后一次使用[LINEAR->sigmoid]正向传播
	3.计算损失函数
	4.实现反向传播
		完成反向传播的线性部分
		计算激活函数的导数(relu_backward/sigmoid_backward) 
		结合前两步，形成 [LINEAR->ACTIVATION] 反向传播
		复制L-1次，最后一次使用[LINEAR->sigmoid]反向传播
	5.更新参数

初始化

2层NN

import numpy as np
def initialize_parameters(n_x, n_h, n_y):
    '''
    input:
        n_x:input layer size
        n_h:hidden layer size
        n_y:outpou layer size
    output:
        parameters:which combine W1,W2,b1,b2
        W1,b1:1th layer parameters
           W1.shape = n_h,n_x
           b1.shape = n_h,1
        W2,b2:2th layer parameters
           W2.shape = n_y,n_h
           b2.shape = n_y,1
    '''
    np.random.seed(1)
    W1 = np.random.randn(n_h, n_x) * 0.01
    W2 = np.random.randn(n_y, n_h) * 0.01
    b1 = np.zeros((n_h, 1))
    b2 = np.zeros((n_y, 1))

    assert (W1.shape == (n_h, n_x))
    assert (W2.shape == (n_y, n_h))
    assert (b1.shape == (n_h, 1))
    assert (b2.shape == (n_y, 1))
    parameters = {
        "W1": W1,
        "W2": W2,
        "b1": b1,
        "b2": b2}
    return parameters
parameters = initialize_parameters(2,2,1)
print("W1 = " + str(parameters["W1"]))
print("b1 = " + str(parameters["b1"]))
print("W2 = " + str(parameters["W2"]))
print("b2 = " + str(parameters["b2"]))

L层NN

import two_layer_nn
import  numpy as np
def initialize_parameters_deep(layer_dims):
    '''
    input:
        layer_dims:python array (list) containing the dimensions of each layer in our network
            example:layer_dims = [2,4,1]means it's a 2 layer NN,input layer has 2 units,hidden layer has 4 units,output layer has 1 unit
    output:
        :parameters: which combines W1,b1,W2,b2,...,WL,bL
            W1.shape = n1,n0   b1.shape = n1,1
            W2.shape = n2,n1   b2.shape = n2,1
            ...
            WL.shape = nL,nL-1,bL.shape = nL,1

    '''
    np.random.seed(3)
    parameters = {}
    L = len(layer_dims)
    for i in range(1,L):
        parameters["W"+str(i)] = np.random.randn(layer_dims[i],layer_dims[i-1])*0.01
        parameters["b"+str(i)] = np.zeros((layer_dims[i],1))
        assert (parameters["W"+str(i)].shape == (layer_dims[i],layer_dims[i-1]))
        assert (parameters["b"+str(i)].shape == (layer_dims[i],1))

    return  parameters
parameters = initialize_parameters_deep([5,4,3])
print("W1 = " + str(parameters["W1"]))
print("b1 = " + str(parameters["b1"]))
print("W2 = " + str(parameters["W2"]))
print("b2 = " + str(parameters["b2"]))

前向传播

2层NN

线性部分

def linear_forward(A, W, b):
    '''
input:
    :param A: input dataset
    A.shape = n_x , m
    :param W: 1th layer parameters
    W.shape = n_h,n_x
    :param b: 1th layer parameters
    b.shape = n_h,1
return:
    :param Z:
    Z.shape = n_h,m
    '''
    Z = np.dot(W,A)+b
    assert (Z.shape == (W.shape[0],A.shape[1]))
    cache = (A,W,b)
    return Z,cache
A, W, b = testCases_v2.linear_forward_test_case()
Z, linear_cache = linear_forward(A, W, b)
print("Z = " + str(Z))

激活函数

import numpy as np

def sigmoid(Z):
    """
    Implements the sigmoid activation in numpy
    
    Arguments:
    Z -- numpy array of any shape
    
    Returns:
    A -- output of sigmoid(z), same shape as Z
    cache -- returns Z as well, useful during backpropagation
    """
    
    A = 1/(1+np.exp(-Z))
    cache = Z
    
    return A, cache

def relu(Z):
    """
    Implement the RELU function.

    Arguments:
    Z -- Output of the linear layer, of any shape

    Returns:
    A -- Post-activation parameter, of the same shape as Z
    cache -- a python dictionary containing "A" ; stored for computing the backward pass efficiently
    """
    
    A = np.maximum(0,Z)
    
    assert(A.shape == Z.shape)
    
    cache = Z 
    return A, cache


def relu_backward(dA, cache):
    """
    Implement the backward propagation for a single RELU unit.

    Arguments:
    dA -- post-activation gradient, of any shape
    cache -- 'Z' where we store for computing backward propagation efficiently

    Returns:
    dZ -- Gradient of the cost with respect to Z
    """
    
    Z = cache
    dZ = np.array(dA, copy=True) # just converting dz to a correct object.
    
    # When z <= 0, you should set dz to 0 as well. 
    dZ[Z <= 0] = 0
    
    assert (dZ.shape == Z.shape)
    
    return dZ

def sigmoid_backward(dA, cache):
    """
    Implement the backward propagation for a single SIGMOID unit.

    Arguments:
    dA -- post-activation gradient, of any shape
    cache -- 'Z' where we store for computing backward propagation efficiently

    Returns:
    dZ -- Gradient of the cost with respect to Z
    """
    
    Z = cache
    
    s = 1/(1+np.exp(-Z))
    dZ = dA * s * (1-s)
    
    assert (dZ.shape == Z.shape)
    
    return dZ

线性+激活函数

def linear_activation_forward(A_prev, W, b, activation):
    '''
    input:
        A_prev -- activations from previous layer (or input data): (size of previous layer, number of examples)
        W -- weights matrix: numpy array of shape (size of current layer, size of previous layer)
        b -- bias vector, numpy array of shape (size of the current layer, 1)
        activation -- the activation to be used in this layer, stored as a text string: "sigmoid" or "relu"

    Returns:
        A -- the output of the activation function, also called the post-activation value
        cache -- a python dictionary containing "linear_cache" and "activation_cache";
             stored for computing the backward pass efficiently
    '''
    if activation == "sigmoid":
        Z,linear_cache = linear_forward(A_prev,W,b)
        A,activation_cache= dnn_utils_v2.sigmoid(Z)
    elif activation == "relu":
        Z,linear_cache = linear_forward(A_prev,W,b)
        A,activation_cache = dnn_utils_v2.relu(Z)
    assert (A.shape == (W.shape[0],A_prev.shape[1]))
    cache = (linear_cache,activation_cache)
    return A,cache
A_prev, W, b = testCases_v2.linear_activation_forward_test_case()
A, linear_activation_cache = linear_activation_forward(A_prev, W, b, activation = "sigmoid")
print("With sigmoid: A = " + str(A))
A, linear_activation_cache = linear_activation_forward(A_prev, W, b, activation = "relu")
print("With ReLU: A = " + str(A))

L层NN

# GRADED FUNCTION: L_model_forward

def L_model_forward(X, parameters):
    """
    Implement forward propagation for the [LINEAR->RELU]*(L-1)->LINEAR->SIGMOID computation
    
    Arguments:
    X -- data, numpy array of shape (input size, number of examples)
    parameters -- output of initialize_parameters_deep()
    
    Returns:
    AL -- last post-activation value
    caches -- list of caches containing:
                every cache of linear_relu_forward() (there are L-1 of them, indexed from 0 to L-2)
                the cache of linear_sigmoid_forward() (there is one, indexed L-1)
    """

    caches = []
    A = X
    L = len(parameters) // 2                  # number of layers in the neural network
    
    # Implement [LINEAR -> RELU]*(L-1). Add "cache" to the "caches" list.
    for l in range(1, L):
        A_prev = A 
        ### START CODE HERE ### (≈ 2 lines of code)
        A, cache = linear_activation_forward(A_prev, 
                                             parameters["W" + str(l)], 
                                             parameters["b" + str(l)], 
                                             activation='relu')
        caches.append(cache)

        ### END CODE HERE ###
    
    # Implement LINEAR -> SIGMOID. Add "cache" to the "caches" list.
    ### START CODE HERE ### (≈ 2 lines of code)
    AL, cache = linear_activation_forward(A, 
                                             parameters["W" + str(L)], 
                                             parameters["b" + str(L)], 
                                             activation='sigmoid')
    caches.append(cache)
    
    ### END CODE HERE ###
    
    assert(AL.shape == (1,X.shape[1]))
            
    return AL, caches
X, parameters = testCases_v2.L_model_forward_test_case()
AL, caches = L_model_forward(X, parameters)
print("AL = " + str(AL))
print("Length of caches list = " + str(len(caches)))

代价函数

def compute_cost(AL, Y):
    """
    Implement the cost function defined by equation (7).

    Arguments:
    AL -- probability vector corresponding to your label predictions, shape (1, number of examples)
    Y -- true "label" vector (for example: containing 0 if non-cat, 1 if cat), shape (1, number of examples)

    Returns:
    cost -- cross-entropy cost
    """
    m = Y.size
    cost = -1/m * np.sum(Y*np.log(AL)+(1-Y)*np.log(1-AL))

    assert (cost.shape == ())
    return cost
Y, AL = testCases_v2.compute_cost_test_case()
print("cost = " + str(compute_cost(AL, Y)))

反向传播

2层NN

线性部分

def linear_backward(dZ, cache):
    """
    Implement the linear portion of backward propagation for a single layer (layer l)

    Arguments:
    dZ -- Gradient of the cost with respect to the linear output (of current layer l)
    cache -- tuple of values (A_prev, W, b) coming from the forward propagation in the current layer

    Returns:
    dA_prev -- Gradient of the cost with respect to the activation (of the previous layer l-1), same shape as A_prev
    dW -- Gradient of the cost with respect to W (current layer l), same shape as W
    db -- Gradient of the cost with respect to b (current layer l), same shape as b
    """
    A_prev ,W , b = cache
    m = dZ.shape[1]
    dW = 1/m * np.dot(dZ,A_prev.T)
    db = 1/m * np.sum(dZ,axis=1,keepdims=True)
    dA_prev = np.dot(W.T,dZ)

    assert (dA_prev.shape == A_prev.shape)
    assert (dW.shape == W.shape)
    assert (db.shape == b.shape)
    return dA_prev,dW,db
dZ, linear_cache = testCases_v2.linear_backward_test_case()

dA_prev, dW, db = linear_backward(dZ, linear_cache)
print ("dA_prev = "+ str(dA_prev))
print ("dW = " + str(dW))
print ("db = " + str(db))

激活函数+线性部分

# GRADED FUNCTION: linear_activation_backward

def linear_activation_backward(dA, cache, activation):
    """
    Implement the backward propagation for the LINEAR->ACTIVATION layer.
    
    Arguments:
    dA -- post-activation gradient for current layer l 
    cache -- tuple of values (linear_cache, activation_cache) we store for computing backward propagation efficiently
    activation -- the activation to be used in this layer, stored as a text string: "sigmoid" or "relu"
    
    Returns:
    dA_prev -- Gradient of the cost with respect to the activation (of the previous layer l-1), same shape as A_prev
    dW -- Gradient of the cost with respect to W (current layer l), same shape as W
    db -- Gradient of the cost with respect to b (current layer l), same shape as b
    """
    linear_cache, activation_cache = cache
    
    if activation == "relu":
        ### START CODE HERE ### (≈ 2 lines of code)
        dZ = relu_backward(dA, activation_cache)
        dA_prev, dW, db = linear_backward(dZ, linear_cache)
        ### END CODE HERE ###
        
    elif activation == "sigmoid":
        dZ = sigmoid_backward(dA, activation_cache)
        dA_prev, dW, db = linear_backward(dZ, linear_cache)
  
    return dA_prev, dW, db
AL, linear_activation_cache = linear_activation_backward_test_case()

dA_prev, dW, db = linear_activation_backward(AL, linear_activation_cache, activation = "sigmoid")
print ("sigmoid:")
print ("dA_prev = "+ str(dA_prev))
print ("dW = " + str(dW))
print ("db = " + str(db) + "\n")

dA_prev, dW, db = linear_activation_backward(AL, linear_activation_cache, activation = "relu")
print ("relu:")
print ("dA_prev = "+ str(dA_prev))
print ("dW = " + str(dW))
print ("db = " + str(db))

L层NN

def L_model_backward(AL, Y, caches):
    """
    Implement the backward propagation for the [LINEAR->RELU] * (L-1) -> LINEAR -> SIGMOID group

    Arguments:
    AL -- probability vector, output of the forward propagation (L_model_forward())
    Y -- true "label" vector (containing 0 if non-cat, 1 if cat)
    caches -- list of caches containing:
                every cache of linear_activation_forward() with "relu" (it's caches[l], for l in range(L-1) i.e l = 0...L-2)
                the cache of linear_activation_forward() with "sigmoid" (it's caches[L-1])

    Returns:
    grads -- A dictionary with the gradients
             grads["dA" + str(l)] = ...
             grads["dW" + str(l)] = ...
             grads["db" + str(l)] = ...
    """
    grads = {}
    L = len(caches)  # the number of layers
    m = AL.shape[1]
    Y = Y.reshape(AL.shape)  # after this line, Y is the same shape as AL

    # Initializing the backpropagation
    ### START CODE HERE ### (1 line of code)
    dAL = -np.divide(Y, AL) + np.divide((1 - Y), (1 - AL))
    ### END CODE HERE ###

    # Lth layer (SIGMOID -> LINEAR) gradients. Inputs: "AL, Y, caches". Outputs: "grads["dAL"], grads["dWL"], grads["dbL"]
    ### START CODE HERE ### (approx. 2 lines)
    current_cache = caches[-1]
    grads["dA" + str(L)], grads["dW" + str(L)], grads["db" + str(L)] = two_layer_nn.linear_activation_backward(dAL, current_cache,
                                                                                                  activation="sigmoid")
    ### END CODE HERE ###

    for l in reversed(range(L - 1)):
        # lth layer: (RELU -> LINEAR) gradients.
        # Inputs: "grads["dA" + str(l + 2)], caches". Outputs: "grads["dA" + str(l + 1)] , grads["dW" + str(l + 1)] , grads["db" + str(l + 1)]
        ### START CODE HERE ### (approx. 5 lines)
        current_cache = caches[l]

        dA_prev_temp, dW_temp, db_temp = two_layer_nn.linear_activation_backward(grads["dA" + str(l + 2)], current_cache,
                                                                    activation="relu")
        grads["dA" + str(l + 1)] = dA_prev_temp
        grads["dW" + str(l + 1)] = dW_temp
        grads["db" + str(l + 1)] = db_temp
        ### END CODE HERE ###

    return grads
AL, Y_assess, caches = testCases_v2.L_model_backward_test_case()
grads = L_model_backward(AL, Y_assess, caches)
print ("dW1 = "+ str(grads["dW1"]))
print ("db1 = "+ str(grads["db1"]))
print ("dA1 = "+ str(grads["dA1"]))

更新参数

def update_parameters(parameters, grads, learning_rate):
    """
    Update parameters using gradient descent

    Arguments:
    parameters -- python dictionary containing your parameters
    grads -- python dictionary containing your gradients, output of L_model_backward

    Returns:
    parameters -- python dictionary containing your updated parameters
                  parameters["W" + str(l)] = ...
                  parameters["b" + str(l)] = ...
    """
    L = len(parameters)//2
    for i in range(1,L):
        parameters["W"+str(i)] = parameters["W"+str(i)]-learning_rate*grads["dW"+str(i)]
        parameters["b"+str(i)] = parameters["b"+str(i)]-learning_rate*grads["db"+str(i)]
        assert (parameters["W"+str(i)].shape ==grads["dW"+str(i)].shape )
        assert (parameters["b" + str(i)].shape == grads["db" + str(i)].shape)
    return parameters
parameters, grads = testCases_v2.update_parameters_test_case()
parameters = update_parameters(parameters, grads, 0.1)

print ("W1 = "+ str(parameters["W1"]))
print ("b1 = "+ str(parameters["b1"]))
print ("W2 = "+ str(parameters["W2"]))
print ("b2 = "+ str(parameters["b2"]))

机器学习与深度学习资料 JasonDing1354 【Machine Learning】
《BriefHistoryofMachineLearning》介绍:这是一篇介绍机器学习历史的文章，介绍很全面，从感知机、神经网络、决策树、SVM、Adaboost到随机森林、DeepLearning.《DeepLearninginNeuralNetworks:AnOverview》介绍:这是瑞士人工智能实验室JurgenSchmidhuber写的最新版本《神经网络与深度学习综述》本综述的特点是以
神经网络与深度学习入门：理解ANN、CNN和RNN shandianfk_com ChatGPT AI 神经网络深度学习 cnn
在现代科技日新月异的今天，人工智能已经成为了我们生活中的重要组成部分。无论是智能手机的语音助手，还是推荐系统，背后都有一项核心技术在支撑，那就是神经网络与深度学习。今天，我们就来聊一聊这个听起来高大上的话题，其实它也没那么难懂！什么是神经网络？首先，我们要了解什么是神经网络。神经网络（ArtificialNeuralNetwork，简称ANN）是模拟人脑神经元连接方式的一种算法。它由一层层的“神经
《神经网络与深度学习》(邱锡鹏) 内容概要【不含数学推导】 code_stream #机器学习神经网络
第1章绪论基本概念：介绍了人工智能的发展历程及不同阶段的特点，如符号主义、连接主义、行为主义等。还阐述了深度学习在人工智能领域的重要地位和发展现状，以及其在图像、语音、自然语言处理等多个领域的成功应用。术语解释人工智能：旨在让机器模拟人类智能的技术和科学。深度学习：一种基于对数据进行表征学习的方法，通过构建具有很多层的神经网络模型，自动从大量数据中学习复杂的模式和特征。第2章机器学习概述基本概念：
# 第一章：认识chatgpt 出门喝奶茶 chatgpt chatgpt
chatgpt发展背景详细介绍一、基础理论背景人工智能和自然语言处理的兴起早期理论:20世纪中期，人工智能（AI）初见端倪，目标是模拟人类智能。自然语言处理作为AI的重要分支，致力于让机器理解和生成人类语言。关键里程碑:1980年代的统计方法和2000年代的神经网络技术，使NLP实现了从规则驱动到数据驱动的转变。神经网络与深度学习2010年代，深度学习的兴起极大推动了NLP的发展。基于大规模语料库
【ShuQiHere】《机器学习的进化史『下』：从神经网络到深度学习的飞跃》 ShuQiHere 机器学习深度学习神经网络
【ShuQiHere】引言：神经网络与深度学习的兴起在上篇文章中，我们回顾了机器学习的起源与传统模型的发展历程，如线性回归、逻辑回归和支持向量机（SVM）。然而，随着数据规模的急剧增长和计算能力的提升，传统模型在处理复杂问题时显得力不从心。在这种背景下，神经网络重新进入了研究者们的视野，并逐步演变为深度学习，成为解决复杂问题的强大工具。今天，我们将进一步探索从神经网络到深度学习的进化历程，揭示这些
神经网络深度学习梯度下降算法优化海棠如醉人工智能深度学习
【神经网络与深度学习】以最通俗易懂的角度解读[梯度下降法及其优化算法]，这一篇就足够（很全很详细）_梯度下降在神经网络中的作用及概念-CSDN博客https://blog.51cto.com/u_15162069/2761936梯度下降数学原理
李宏毅机器学习笔记 2.回归 Simone Zeng 机器学习机器学习
最近在跟着Datawhale组队学习打卡，学习李宏毅的机器学习/深度学习的课程。课程视频：https://www.bilibili.com/video/BV1Ht411g7Ef开源内容：https://github.com/datawhalechina/leeml-notes本篇文章对应视频中的P3。另外，最近我也在学习邱锡鹏教授的《神经网络与深度学习》，会补充书上的一点内容。通过上一次课1.机器
深度学习路线，包括书籍和视频 jjm2002 深度学习深度学习人工智能
深度学习是一个广泛而快速发展的领域，涉及多种技术和应用。以下是一个深度学习学习路线，包括书籍和视频资源。入门阶段：理解基础知识：书籍：《深度学习》（DeepLearning）IanGoodfellow,YoshuaBengio和AaronCourville著。这是深度学习领域的权威书籍，适合初学者。书籍：《神经网络与深度学习》（NeuralNetworksandDeepLearning）Micha
神经网络与深度学习 Neural Networks and Deep Learning 课程笔记第一周林间得鹿吴恩达深度学习系列课程笔记深度学习神经网络笔记
神经网络与深度学习NeuralNetworksandDeepLearning课程笔记第一周文章目录神经网络与深度学习NeuralNetworksandDeepLearning课程笔记第一周深度学习简介什么是神经网络使用神经网络进行监督学习为什么神经网络会兴起本文是吴恩达深度学习系列课程的学习笔记。深度学习简介什么是神经网络深度学习一般是指训练神经网络。那么什么是神经网络？课程以房价预测的例子来说明
小白初探｜神经网络与深度学习神奇的代码在哪里人工智能深度学习神经网络人工智能外接显卡
一、学习背景由于工作的原因，需要开展人工智能相关的研究，虽然不用参与实际研发，但在项目实施过程中发现，人工智能的项目和普通程序开发项目不一样，门槛比较高，没有相关基础没法搞清楚人力、财力如何投入，很难合理管控成本以及时间。为搞清楚情况，老年博主决定一步一个脚印，好好自学。在写本文时，博主已学到一定阶段了，趁有时间，通过博文记录下来，以免遗忘。二、学习准备常年的学习告诉我们，一门学科要快速入门，主流
神经网络与深度学习Pytorch版 Softmax回归笔记砍树＋c＋v 深度学习神经网络 pytorch 人工智能 python 回归笔记
Softmax回归目录Softmax回归1.独热编码2.Softmax回归的网络架构是一个单层的全连接神经网络。3.Softmax回归模型概述及其在多分类问题中的应用4.Softmax运算在多分类问题中的应用及其数学原理5.小批量样本分类的矢量计算表达式6.交叉熵损失函数7.模型预测及评价8.小结Softmax回归，也称为多类逻辑回归，是一种用于解决多分类问题的机器学习算法。它与普通的logist
【吴恩达-神经网络与深度学习】第3周：浅层神经网络倏然希然_ 深度学习与神经网络神经网络深度学习人工智能
目录神经网络概览神经网络表示含有一个隐藏层的神经网络（双层神经网络）计算神经网络的输出多样本的向量化向量化实现的解释激活函数（Activationfunctions）一些选择激活函数的经验法则：为什么需要非线性激活函数？激活函数的导数神经网络的梯度下降法（选修）直观理解反向传播随机初始化神经网络概览右上角方括号[]里面的数字表示神经网络的层数可以把许多sigmoid单元堆叠起来形成一个神经网络：第
2023年度佳作：AIGC、AGI、GhatGPT、人工智能大语言模型的崛起与挑战鸭鸭渗透人工智能 AIGC agi 语言模型自然语言处理
目录前言01《ChatGPT驱动软件开发》内容简介02《ChatGPT原理与实战》内容简介03《神经网络与深度学习》04《AIGC重塑教育》内容简介05《通用人工智能》目录前言2023年是人工智能大语言模型大爆发的一年，一些概念和英文缩写也在这一年里集中出现，很容易混淆，甚至把人搞懵。LLM：LargeLanguageModel，即大语言模型，旨在理解和生成人类语言。LLM的特点是规模庞大，包含成
Pytorch 实现强化学习策略梯度Reinforce算法爱喝咖啡的加菲猫强化学习强化学习神经网络 pytorch
一、公式推导这里参考邱锡鹏大佬的《神经网络与深度学习》第三章进阶模型部分，链接《神经网络与深度学习》。`伪代码：二、核心代码defmain():env=gym.make('CartPole-v0')obs_n=env.observation_space.shape[0]act_n=env.action_space.nlogger.info('obs_n{},act_n{}'.format(obs_
基于图神经网络与深度学习的商品推荐算法谦谦菜鸟深度学习机器学习人工智能
传统做法现阶段局限创新方法结果相关工作目前推荐算法基于矩阵分解的推荐算法基于深度学习的推荐算法基于图神经网络的推荐算法创新点模型设计本文的核心任务是训练出一个模型LGDL模型框架嵌入层ID特征嵌入评论文本特征嵌入前向传播层关联关系提取偏好特征提取评分预测层模型优化传统做法利用深度学习方法从用户ID、评论文本等数据中提取其中所隐藏的用户物品特征，根据该特征预测用户对新物品的打分从而给出推荐是传统推荐
神经网络与深度学习（五）——人工神经网络和卷积神经网络吴丞楚20012100032
姓名：吴丞楚学号：20012100032学院：竹园三号书院【嵌牛导读】简要介绍NN与CNN【嵌牛鼻子】深度学习神经网络【嵌牛提问】NN与CNN的区别有哪些人工神经网络简称神经网络(NN)，是目前各种神经网络的基础，其构造是仿造生物神经网络，将神经元看成一个逻辑单元，其功能是用于对函数进行估计和近似，是一种自适应系统，通俗的讲就是具备学习能力。其作用，目前为止就了解到分类。其目的就是在圈和叉之间画出
学习笔记--神经网络与深度学习之卷积神经网络 qssssss79 深度学习神经网络深度学习学习
目录1.卷积1.1一维卷积1.2卷积的作用1.3卷积扩展1.4二维卷积1.5互相关2.卷积神经网络2.1用卷积代替全连接2.2卷积层2.3汇聚层（池化层）2.4卷积网络结构3.其它卷积种类3.1空洞卷积3.2转置卷积/微步卷积4典型的卷积神经网络4.1LeNet-54.2AlexNet4.3Inception4.4残差网络利用全连接前馈网络处理图像时的问题：（1）参数太多：对于输入的10010
计划1 JLcucumber
1.吴恩达DL2021(强推|双字)2021版吴恩达深度学习课程Deeplearning.ai_哔哩哔哩_bilibiliPart1神经网络与深度学习（6+19+12+8）共45Part2训练、开发、测试集（14+10+11）共35Part3机器学习策略（13+11）共24Part4计算机视觉（11+14+14+(5+6)）共50Part5序列模型（12+10+15）共372.经典网络模型论文ht
[23-24 秋学期] NNDL-作业2 HBU 洛杉矶县牛肉板面深度学习人工智能机器学习深度学习
前言：本文解决《神经网络与深度学习》-邱锡鹏第二章课后题。对于习题2-1，平方损失函数在机器学习课程中学习过，但是惭愧的讲，在完成这篇博客前我对均方误差和平方损失函数的概念还有些混淆。交叉熵损失函数我未曾了解过，只在决策树一节中学习过关于熵entropy的基本概念。借此机会弄清原理，并且尝试着学会应用它。对于习题2-12，考察对混淆矩阵的理解程度和计算。其中宏平均和微平均是我未曾学习过的概念，借此
【22-23 春学期】AI作业5-深度学习基础 HBU_David AI 深度学习人工智能 python
人工智能、机器学习、深度学习之间的关系神经网络与深度学习的关系“深度学习”和“传统浅层学习”的区别和联系神经元、人工神经元MP模型单层感知机SLP异或问题XOR多层感知机MLP前馈神经网络FNN激活函数ActivationFunction为什么要使用激活函数？常用激活函数有哪些？均方误差和交叉熵损失函数，哪个适合于分类？哪个适合于回归？为什么？
神经网络与深度学习day01-基础知识小鬼缠身、深度学习神经网络人工智能 python
今天开始新学期，然后就是每周要在这里发这周的实验报告，CSDN对不起了，你可能不情愿，但是必须要稍微容纳一下我(这个菜比)在这里吹了。第一周的基础知识训练：1、导入numpy库importnumpy2、建立一个一维数组a=[4,5,6]。输出：(1)a的类型；(2)a的各维度的大小；(3)a的第一个元素a=[4,5,6]print(type(a))print(numpy.shape(a))prin
HBU_神经网络与深度学习实验10 卷积神经网络：基于ResNet18网络完成图像分类任务 ZodiAc7 cnn 深度学习 python
目录写在前面的一些内容一、实践：基于ResNet18网络完成图像分类任务1.数据处理(1)数据集介绍(2)数据读取(3)构造Dataset类2.模型构建3.模型训练4.模型评价5.模型预测二、实验Q&A写在前面的一些内容本文为HBU_神经网络与深度学习实验（2022年秋）实验10的实验报告，此文的基本内容参照[1]Github/卷积神经网络-下.ipynb，检索时请按对应序号进行检索。本实验编程语
Python练习题：猜数字游戏 BioVS python 开发语言
#题目来源于MOOC课程《神经网络与深度学习》，程序为自己独立编写题目：随机产生一个1-10之间的整数，并提示用户输入1-10的整数进行猜测，判断是否猜中。每次猜完后，提示“太大了”或者“太小了”，猜对之后提示“恭喜你，猜对了！”，并退出程序。当用户才出数字后，询问是否想要继续下一轮游戏，并记录显示用户已参加轮次。对应python程序：importrandomtimes=1#存放第几轮游戏，用于后
2023年度盘点：AIGC、AGI、GhatGPT、人工智能大模型必读书单家有娇妻张兔兔粉丝送书活动 AIGC agi 人工智能福利送书
2023年度盘点智能大模型必读书单概述好书推荐01《ChatGPT驱动软件开发》02《ChatGPT原理与实战》03《神经网络与深度学习》04《AIGC重塑教育》05《通用人工智能》写在末尾：主页传送门：传送送书系列：送书第一期：考研必备书单送书第二期：CTF那些事儿送书第三期：数据要素安全流通送书第四期：MLOps工程实践：工具、技术与企业级应用送书第五期：Python数据挖掘：入门进阶与实用案
搜索与人工智能码海串游人工智能
前言第一：通过博弈树搜索和启发式搜索的例子了解基于搜索的通用问题求解方法第二：了解人工智能发展的历程和社会影响第三：了解机器学习的基本思想和典型应用第四：了解人工智能应用开发的基本模式内容1.博弈树与剪纸、零和博弈，极大极小策略博弈树与搜索，α与β剪枝以及著名的计算机博弈的例子2.启发式搜索启发式函数，启发式搜索过程，3.人工智能与机器学习人工智能发展历程，专家系统，机器学习，神经网络与深度学习。
2023年度AI盘点 AIGC|AGI|ChatGPT|人工智能大模型 herosunly 优质书籍推荐人工智能 AIGC agi
文章目录0.前言1.《ChatGPT驱动软件开发》2.《ChatGPT原理与实战》3.《神经网络与深度学习》4.《AIGC重塑教育》5.《通用人工智能》0.前言 2023年是人工智能大语言模型大爆发的一年，一些概念和英文缩写也在这一年里集中出现，很容易混淆，甚至把人搞懵。LLM：LargeLanguageModel，即大语言模型，旨在理解和生成人类语言。LLM的特点是规模庞大，包含成百、上千亿的
DL Homework 11 熬夜患者 DL Homework 人工智能深度学习
目录1.被优化函数编辑(代码来源于邱锡鹏老师的神经网络与深度学习的实验）L1.pyop.py（1）SimpleBatchGD（2）Adagrad（3）RMSprop（4）Momentum（5）Adam2.被优化函数编辑3.解释不同轨迹的形成原因，并分析各个算法的优缺点（1）SimpleBatchGD（2）Adagrad（3）RMSprop（4）Momentum（5）Adam总结在展开本次作业之前，
2020-12-07 吴恩达-神经网络与深度学习-第三周编程练习 Vivivivi安
Github地址：https://github.com/Poissons/wuenda-Deep-Learning-And-Neural-Network-third-week-excercise.git
2020-12-03 吴恩达-神经网络与深度学习-第二周编程练习 Vivivivi安
最近听吴恩达老师的课，写课后作业Github地址：https://github.com/Poissons/wuenda-Deep-Learning-And-Neural-Network-second-week-excercise
2023年度AI盘点 AIGC|AGI|ChatGPT|人工智能大模型雪碧有白泡泡粉丝福利活动人工智能 AIGC agi
前言「作者主页」：雪碧有白泡泡「个人网站」：雪碧的个人网站2023年是人工智能大语言模型大爆发的一年，一些概念和英文缩写也在这一年里集中出现，很容易混淆，甚至把人搞懵。文章目录前言01《ChatGPT驱动软件开发》02《ChatGPT原理与实战》03《神经网络与深度学习》《AIGC重塑教育》05《通用人工智能》LLM：LargeLanguageModel，即大语言模型，旨在理解和生成人类语言。LL
二分查找排序算法周凡杨 java 二分查找排序算法折半
一：概念二分查找又称折半查找（折半搜索/ 二分搜索），优点是比较次数少，查找速度快，平均性能好；其缺点是要求待查表为有序表，且插入删除困难。因此，折半查找方法适用于不经常变动而查找频繁的有序列表。首先，假设表中元素是按升序排列，将表中间位置记录的关键字与查找关键字比较，如果两者相等，则查找成功；否则利用中间位置记录将表分成前、后两个子表，如果中间位置记录的关键字大于查找关键字，则进一步
java中的BigDecimal bijian1013 java BigDecimal
在项目开发过程中出现精度丢失问题，查资料用BigDecimal解决，并发现如下这篇BigDecimal的解决问题的思路和方法很值得学习，特转载。原文地址：http://blog.csdn.net/ugg/article/de
Shell echo命令详解 daizj echo shell
Shell echo命令 Shell 的 echo 指令与 PHP 的 echo 指令类似，都是用于字符串的输出。命令格式： echo string 您可以使用echo实现更复杂的输出格式控制。 1.显示普通字符串: echo "It is a test" 这里的双引号完全可以省略，以下命令与上面实例效果一致： echo Itis a test 2.显示转义
Oracle DBA 简单操作周凡杨 oracle dba sql
--执行次数多的SQL select sql_text,executions from ( select sql_text,executions from v$sqlarea order by executions desc ) where rownum<81; &nb
画图重绘朱辉辉33 游戏
我第一次接触重绘是编写五子棋小游戏的时候，因为游戏里的棋盘是用线绘制的，而这些东西并不在系统自带的重绘里，所以在移动窗体时，棋盘并不会重绘出来。所以我们要重写系统的重绘方法。在重写系统重绘方法时，我们要注意一定要调用父类的重绘方法，即加上super.paint(g)，因为如果不调用父类的重绘方式，重写后会把父类的重绘覆盖掉，而父类的重绘方法是绘制画布，这样就导致我们
线程之初体验西蜀石兰线程
一直觉得多线程是学Java的一个分水岭，懂多线程才算入门。之前看《编程思想》的多线程章节，看的云里雾里，知道线程类有哪几个方法，却依旧不知道线程到底是什么？书上都写线程是进程的模块，共享线程的资源，可是这跟多线程编程有毛线的关系，呜呜。。。线程其实也是用户自定义的任务，不要过多的强调线程的属性，而忽略了线程最基本的属性。你可以在线程类的run()方法中定义自己的任务，就跟正常的Ja
linux集群互相免登陆配置林鹤霄 linux
配置ssh免登陆 1、生成秘钥和公钥 ssh-keygen -t rsa 2、提示让你输入，什么都不输，三次回车之后会在~下面的.ssh文件夹中多出两个文件id_rsa 和 id_rsa.pub 其中id_rsa为秘钥，id_rsa.pub为公钥，使用公钥加密的数据只有私钥才能对这些数据解密 c
mysql : Lock wait timeout exceeded; try restarting transaction aigo mysql
原文：http://www.cnblogs.com/freeliver54/archive/2010/09/30/1839042.html 原因是你使用的InnoDB 表类型的时候, 默认参数:innodb_lock_wait_timeout设置锁等待的时间是50s, 因为有的锁等待超过了这个时间,所以抱错. 你可以把这个时间加长,或者优化存储
Socket编程基本的聊天实现。 alleni123 socket
public class Server { //用来存储所有连接上来的客户 private List<ServerThread> clients; public static void main(String[] args) { Server s = new Server(); s.startServer(9988); } publi
多线程监听器事件模式(一个简单的例子) 百合不是茶线程监听模式
多线程的事件监听器模式监听器时间模式经常与多线程使用,在多线程中如何知道我的线程正在执行那什么内容,可以通过时间监听器模式得到创建多线程的事件监听器模式思路: 1, 创建线程并启动,在创建线程的位置设置一个标记 2,创建队
spring InitializingBean接口 bijian1013 java spring
spring的事务的TransactionTemplate，其源码如下： public class TransactionTemplate extends DefaultTransactionDefinition implements TransactionOperations, InitializingBean{ ... } TransactionTemplate继承了DefaultT
Oracle中询表的权限被授予给了哪些用户 bijian1013 oracle 数据库权限
Oracle查询表将权限赋给了哪些用户的SQL，以备查用。 select t.table_name as "表名", t.grantee as "被授权的属组", t.owner as "对象所在的属组"
【Struts2五】Struts2 参数传值 bit1129 struts2
Struts2中参数传值的3种情况 1.请求参数绑定到Action的实例字段上 2.Action将值传递到转发的视图上 3.Action将值传递到重定向的视图上一、请求参数绑定到Action的实例字段上以及Action将值传递到转发的视图上 Struts可以自动将请求URL中的请求参数或者表单提交的参数绑定到Action定义的实例字段上，绑定的规则使用ognl表达式语言
【Kafka十四】关于auto.offset.reset[Q/A] bit1129 kafka
I got serveral questions about auto.offset.reset. This configuration parameter governs how consumer read the message from Kafka when there is no initial offset in ZooKeeper or
nginx gzip压缩配置 ronin47 nginx gzip 压缩范例
nginx gzip压缩配置更多 0 nginx gzip 配置随着nginx的发展，越来越多的网站使用nginx，因此nginx的优化变得越来越重要，今天我们来看看nginx的gzip压缩到底是怎么压缩的呢？ gzip(GNU-ZIP)是一种压缩技术。经过gzip压缩后页面大小可以变为原来的30%甚至更小，这样，用
java-13.输入一个单向链表，输出该链表中倒数第 k 个节点 bylijinnan java
two cursors. Make the first cursor go K steps first. /* * 第 13 题：题目：输入一个单向链表，输出该链表中倒数第 k 个节点 */ public void displayKthItemsBackWard(ListNode head,int k){ ListNode p1=head,p2=head;
Spring源码学习-JdbcTemplate queryForObject bylijinnan java spring
JdbcTemplate中有两个可能会混淆的queryForObject方法： 1. Object queryForObject(String sql, Object[] args, Class requiredType) 2. Object queryForObject(String sql, Object[] args, RowMapper rowMapper) 第1个方法是只查
[冰川时代]在冰川时代,我们需要什么样的技术? comsci 技术
看美国那边的气候情况....我有个感觉...是不是要进入小冰期了? 那么在小冰期里面...我们的户外活动肯定会出现很多问题...在室内呆着的情况会非常多...怎么在室内呆着而不发闷...怎么用最低的电力保证室内的温度.....这都需要技术手段... &nb
js 获取浏览器型号 cuityang js 浏览器
根据浏览器获取iphone和apk的下载地址 <!DOCTYPE html> <html> <head> <meta charset="utf-8" content="text/html"/> <meta name=
C# socks5详解转 dalan_123 socket C#
http://www.cnblogs.com/zhujiechang/archive/2008/10/21/1316308.html 这里主要讲的是用.NET实现基于Socket5下面的代理协议进行客户端的通讯，Socket4的实现是类似的，注意的事，这里不是讲用C#实现一个代理服务器，因为实现一个代理服务器需要实现很多协议，头大，而且现在市面上有很多现成的代理服务器用，性能又好，
运维 Centos问题汇总 dcj3sjt126com 云主机
一、sh 脚本不执行的原因 sh脚本不执行的原因只有2个 1.权限不够 2.sh脚本里路径没写完整。二、解决You have new mail in /var/spool/mail/root 修改/usr/share/logwatch/default.conf/logwatch.conf配置文件 MailTo = MailFrom 三、查询连接数
Yii防注入攻击笔记 dcj3sjt126com sql WEB安全 yii
网站表单有注入漏洞须对所有用户输入的内容进行个过滤和检查，可以使用正则表达式或者直接输入字符判断，大部分是只允许输入字母和数字的，其它字符度不允许；对于内容复杂表单的内容，应该对html和script的符号进行转义替换：尤其是<,>,',"",&这几个符号这里有个转义对照表： http://blog.csdn.net/xinzhu1990/articl
MongoDB简介[一] eksliang mongodb MongoDB简介
MongoDB简介转载请出自出处：http://eksliang.iteye.com/blog/2173288 1.1易于使用 MongoDB是一个面向文档的数据库，而不是关系型数据库。与关系型数据库相比，面向文档的数据库不再有行的概念，取而代之的是更为灵活的“文档”模型。另外，不
zookeeper windows 入门安装和测试 greemranqq zookeeper 安装分布式
一、序言以下是我对zookeeper 的一些理解： zookeeper 作为一个服务注册信息存储的管理工具，好吧，这样说得很抽象，我们举个“栗子”。栗子1号：假设我是一家KTV的老板，我同时拥有5家KTV，我肯定得时刻监视
Spring之使用事务缘由(2-注解实现) ihuning spring
Spring事务注解实现 1. 依赖包： 1.1 spring包： spring-beans-4.0.0.RELEASE.jar spring-context-4.0.0.
iOS App Launch Option 啸笑天 option
iOS 程序启动时总会调用application:didFinishLaunchingWithOptions:，其中第二个参数launchOptions为NSDictionary类型的对象，里面存储有此程序启动的原因。 launchOptions中的可能键值见UIApplication Class Reference的Launch Options Keys节。 1、若用户直接
jdk与jre的区别（_） macroli java jvm jdk
简单的说JDK是面向开发人员使用的SDK，它提供了Java的开发环境和运行环境。SDK是Software Development Kit 一般指软件开发包，可以包括函数库、编译程序等。 JDK就是Java Development Kit JRE是Java Runtime Enviroment是指Java的运行环境，是面向Java程序的使用者，而不是开发者。如果安装了JDK，会发同你
Updates were rejected because the tip of your current branch is behind qiaolevip 学习永无止境每天进步一点点众观千象 git
$ git push joe prod-2295-1 To [email protected]:joe.le/dr-frontend.git ! [rejected] prod-2295-1 -> prod-2295-1 (non-fast-forward) error: failed to push some refs to '[email protected]
[一起学Hive]之十四-Hive的元数据表结构详解 superlxw1234 hive hive元数据结构
关键字：Hive元数据、Hive元数据表结构之前在 “[一起学Hive]之一–Hive概述，Hive是什么”中介绍过，Hive自己维护了一套元数据，用户通过HQL查询时候，Hive首先需要结合元数据，将HQL翻译成MapReduce去执行。本文介绍一下Hive元数据中重要的一些表结构及用途，以Hive0.13为例。文章最后面，会以一个示例来全面了解一下，
Spring 3.2.14，4.1.7，4.2.RC2发布 wiselyman Spring 3
Spring 3.2.14、4.1.7及4.2.RC2于6月30日发布。其中Spring 3.2.1是一个维护版本(维护周期到2016-12-31截止)，后续会继续根据需求和bug发布维护版本。此时，Spring官方强烈建议升级Spring框架至4.1.7 或者将要发布的4.2 。其中Spring 4.1.7主要包含这些更新内容。

吴恩达神经网络与深度学习——深度神经网络习题4：构建DNN架构

吴恩达神经网络与深度学习——深度神经网络习题4

构建DNN架构

包

作业大纲

初始化

2层NN

L层NN

前向传播

2层NN

线性部分

激活函数

线性+激活函数

L层NN

代价函数

反向传播

2层NN

线性部分

激活函数+线性部分

L层NN

更新参数

你可能感兴趣的:(神经网络与深度学习)