Three versions
- The deeplearning.ai paid version, with exercises.
- The Coursera version.
- The NetEase Cloud Classroom (网易云课堂) translated version; you have to find the exercises yourself, but the connection is fast and the videos can be downloaded for offline viewing.
These notes follow the NetEase Cloud Classroom version.
I. Neural Networks and Deep Learning
Week 01: Introduction to Deep Learning
Lecture 001: Welcome to the Deep Learning Engineer specialization
I like the sound of Ng's Mandarin.
"I hope to train thousands and thousands of people to use artificial intelligence to solve real-world problems and to create an AI-powered society."
Lecture 002 : 什么是神经网络?
Using Housing Price Prediction as an example, this lecture explains the three basic components of a neural network:
- input layer : initial features, X
- hidden layers : higher-level features; some can be interpreted, most cannot
- output layer : Y
Lecture 003 : Supervised Learning with Neural Networks
- some common tasks of Supervised Learning using specific NNs
Possibly the single most lucrative application of deep learning today is online advertising. Maybe not the most inspiring, but certainly very lucrative.
- Structured data VS Unstructured data
Lecture 004 : Why is deep learning taking off?
Scale drives deep learning progress
- scale of data, digital labeled data
- scale of NNs, lots of inputs, hidden layers, connections …
Lecture 004: About this Course
Lecture 005: Course Resources
forum, e-mails, etc
Week 02: Basics of Neural Network programming
Lecture 001: Binary Classification
Lecture 002: Logistic Regression
- Logistic Regression: wrap a sigmoid function around linear regression.
Lecture 003: Logistic Regression cost function
Loss (error) function
: defined on a single training example. We don't use MSE because it is non-convex here, making it hard to find the optimum with gradient descent, so we design a convex loss that exploits the properties of $\hat y$.
Cost function
: measures the performance of $(\omega, b)$ on the full training set.
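For reference, the formulas used in the lectures (standard logistic-regression cross-entropy, written in the notation used later in these notes):
$\mathcal L(\hat y, y) = -\big(y\log\hat y + (1-y)\log(1-\hat y)\big)$
$J(w,b) = \frac{1}{m}\sum_{i=1}^{m}\mathcal L(\hat y^{(i)}, y^{(i)})$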
Lecture 004: Gradient Descent
Lecture 005: Derivatives
Lecture 006: More derivative examples
Lecture 007: Computation Graph
- Left to Right : blue lines, forward-propagation
- Right to Left : red lines, back-propagation
Lecture 008: Derivatives with a Computation Graph
- back-propagation: calculate derivatives using the chain rule
Lecture 009: Logistic regression derivatives
- input: $X = (x_1, x_2)$
- initial params: $w = (w_1, w_2),\ b$
- linear unit: $z = w_1x_1 + w_2x_2 + b$
- activation function: $a = \sigma(z) = \frac{1}{1+e^{-z}}$
- loss function: $\mathcal L(a,y) = -(y\ln(a) + (1-y)\ln(1-a))$
Calculation of derivatives
- $da = \frac{d\mathcal L(a,y)}{da} = -\left(\frac{y}{a} - \frac{1-y}{1-a}\right) = -\frac{y}{a} + \frac{1-y}{1-a}$
- $dz = \frac{d\mathcal L}{da}\cdot\frac{da}{dz} = \left(-\frac{y}{a} + \frac{1-y}{1-a}\right) \cdot a(1-a) = -y(1-a) + a(1-y) = a - y$
- $dw_1 = (a-y)x_1$
- $dw_2 = (a-y)x_2$
- $db = a - y$
So, in the next gradient descent step:
- $w_1 := w_1 - \alpha\, dw_1$
- $w_2 := w_2 - \alpha\, dw_2$
- $b := b - \alpha\, db$
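A minimal NumPy sketch of the single-example derivatives above (variable names are my own, not from the course):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# one training example with two features
x = np.array([1.0, 2.0])
y = 1.0
w = np.zeros(2)
b = 0.0
alpha = 0.01           # learning rate

z = np.dot(w, x) + b   # linear unit
a = sigmoid(z)         # activation
dz = a - y             # dL/dz
dw = dz * x            # (dL/dw1, dL/dw2)
db = dz                # dL/db

# one gradient descent step
w = w - alpha * dw
b = b - alpha * db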
Lecture 010: Logistic regression on m examples
The previous lecture covered a single example; with $m$ examples, we compute $dw, db$ by summing over all examples and averaging.
The biggest computational problem here is the two nested for-loops: the outer loop over the $m$ examples and the inner loop over the $n$ weights, for a complexity of $O(mn)$. In DL these numbers are very large, so the next lecture shows how to use vectorization to get rid of the for-loops and speed up computation.
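A sketch of the explicit-loop version being criticized here (my own illustration, assuming the m examples are stored column-wise in X):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def logistic_grads_loops(w, b, X, Y):
    # X: (n, m) with one example per column, Y: (m,)
    n, m = X.shape
    dw = np.zeros(n)
    db = 0.0
    for i in range(m):                      # outer loop over the m examples
        z = b
        for j in range(n):                  # inner loop over the n weights
            z += w[j] * X[j, i]
        a = sigmoid(z)
        dz = a - Y[i]
        for j in range(n):                  # another loop over the n weights
            dw[j] += dz * X[j, i]
        db += dz
    return dw / m, db / m                   # average over the m examples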
Lecture 011: Vectorization
Vectorization is used to replace explicit for-loops and speed up computation.
- For 1M data points, np.dot() takes about 1.5 ms while an explicit for-loop takes about 500 ms.
Vectorized operations like np.dot() exploit SIMD (single instruction, multiple data) hardware, so they work well on both CPU and GPU; GPU is better.
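A quick way to reproduce the comparison from the lecture (exact timings will vary by machine):

import time
import numpy as np

a = np.random.rand(1000000)
b = np.random.rand(1000000)

tic = time.time()
c = np.dot(a, b)                  # vectorized dot product
toc = time.time()
print("np.dot:  ", 1000 * (toc - tic), "ms")

tic = time.time()
c = 0.0
for i in range(1000000):          # explicit for-loop
    c += a[i] * b[i]
toc = time.time()
print("for-loop:", 1000 * (toc - tic), "ms")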
Lecture 012: More vectorization examples
- NN programming guideline: whenever possible, avoid explicit for-loops.
- Whenever you are tempted to write a for-loop, look for an alternative vectorized solution in numpy.
Lecture 013: Vectorizing Logistic Regression
I'm super excited about this technique; when we talk about neural networks later, we'll implement them without using even a single explicit for-loop.
- Z = np.dot(w.T, X) + b, where $b$ is a scalar broadcast to a $1 \times m$ row vector $[b, \dots, b]$
Lecture 014: Vectorizing Logistic Regression Gradient Descent
- Both forward propagation and back-propagation are now fully vectorized.
- The outermost loop over gradient-descent iterations still needs an explicit for-loop; that one cannot be removed (see the sketch below).
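Putting the last few lectures together, a minimal vectorized implementation of logistic-regression training (my own sketch, matching the notation above):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_logistic(X, Y, alpha=0.01, num_iters=1000):
    # X: (n, m), Y: (1, m)
    n, m = X.shape
    w = np.zeros((n, 1))
    b = 0.0
    for i in range(num_iters):             # the one explicit loop that remains
        Z = np.dot(w.T, X) + b             # (1, m), b is broadcast
        A = sigmoid(Z)                     # (1, m)
        dZ = A - Y                         # (1, m)
        dw = np.dot(X, dZ.T) / m           # (n, 1)
        db = np.sum(dZ) / m                # scalar
        w = w - alpha * dw
        b = b - alpha * db
    return w, b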
Lecture 015: Broadcasting in Python
An example:
import numpy as np
A = np.array([ [1,  2,  3,  4],
               [5,  6,  7,  8],
               [9, 10, 11, 12] ])           # shape (3, 4)
sum_cols = A.sum(axis = 0)                  # column sums, shape (4,)
percentage = A / sum_cols.reshape(1,4)      # (3,4) / (1,4): broadcasting
- A.sum() by default sums over all elements and returns a scalar. With axis = 0 it sums each column and returns a 1x4 vector; with axis = 1 it sums each row and returns a vector with one value per row.
- The reshape(1,4) on sum_cols is strictly redundant, but it makes the intended shape explicit and helps prevent bugs. reshape has $O(1)$ time complexity, so there is no performance penalty.
Broadcasting duplicates sum_cols across 3 rows so it has the same shape as A, then applies the element-wise operation.
Lecture 016: A note on Python/Numpy vectors
a = np.random.randn(5) # a.shape = (5,), a bizarre `rank 1 array`, not a vector
a = np.random.randn(1,5) # a.shape = (1,5), good
a = np.random.randn(5,1) # a.shape = (5,1), good
assert(a.shape == (1,5)) # check whether its shape suits what we expected
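The fix suggested in the lecture for an accidental rank-1 array is an explicit reshape:

a = np.random.randn(5)
a = a.reshape((5, 1))        # turn the rank-1 array into a proper column vector
assert(a.shape == (5, 1))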
Lecture 017: Quick tour of Jupyter/ipython notebooks
Lecture 018: Explanation of logistic regression cost function(optional)
- The Loss function
- $\hat y = \sigma(w^Tx+b)$
- we interpret $\hat y$ as $\hat y = p(y=1 \mid x)$
- so, if $y=1$, $p(y \mid x) = \hat y$; if $y=0$, $p(y \mid x) = 1-\hat y$
- combine the two into one: $p(y \mid x) = \hat y^{\,y}(1-\hat y)^{1-y}$
- because log is a strictly monotonically increasing function, maximizing $p(y \mid x)$ is equivalent to maximizing $\log p(y \mid x) = \log\big(\hat y^{\,y}(1-\hat y)^{1-y}\big) = y\log \hat y + (1-y)\log(1-\hat y)$
- minimizing the loss $\mathcal L = -\log p(y \mid x)$ therefore corresponds to maximizing the log of the probability
Week 03: One hidden layer Neural Networks
Lecture 001: Neural Networks Overview
- multi-layer; $a^{[k]}_i$ denotes the $i$-th node of the $k$-th layer
- each unit is a combination of a linear unit + an activation function
Lecture 002: Neural Network Representation
- input layer, hidden layer, output layer
- But this is called a 2-layer NN: the input layer is not counted, just by convention.
Lecture 003: Computing a NN’s Output
What you see is that this is like logistic regression, but repeated many times.
Lecture 004: Vectorizing across multiple examples
- each column represents an example
- each row represents a hidden unit
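A sketch of the vectorized forward pass for a 2-layer network under this convention (columns = examples; layer sizes n_x, n_h are assumptions for illustration):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# X: (n_x, m), W1: (n_h, n_x), b1: (n_h, 1), W2: (1, n_h), b2: (1, 1)
def forward(X, W1, b1, W2, b2):
    Z1 = np.dot(W1, X) + b1     # (n_h, m): each column is one example
    A1 = np.tanh(Z1)            # hidden-layer activations
    Z2 = np.dot(W2, A1) + b2    # (1, m)
    A2 = sigmoid(Z2)            # output probabilities
    return A1, A2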
Lecture 005: Explanation for vectorized implementation
Lecture 006: Activation Functions
- sigmoid: $\sigma(z) = \frac{1}{1+e^{-z}}$, never use this except for the output layer (output in $(0,1)$)
- tanh: $\tanh(z) = \frac{e^z - e^{-z}}{e^z + e^{-z}}$, almost always better than sigmoid but worse than ReLU
- ReLU: $\mathrm{ReLU}(z) = \max(0, z)$, best, fast, most used
- leaky ReLU: $a = \max(0.01z, z)$
Lecture 007: Why do you need non-linear activation function?
It turns out that if you use a linear activation function, or alternatively if you don't have an activation function, then no matter how many layers your neural network has, all it is doing is computing a single linear function, so you might as well not have any hidden layers.
Because the composition of two linear functions is itself a linear function, unless you throw a non-linearity in there, you're not computing more interesting functions, even as you go deeper in the network.
Lecture 008: Derivatives of activation functions
- sigmoid: $g(z) = \frac{1}{1+e^{-z}} \to g'(z) = g(z)(1-g(z))$
- tanh: $\tanh(z) = \frac{e^z - e^{-z}}{e^z + e^{-z}} \to \tanh'(z) = 1 - (\tanh(z))^2$
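A NumPy sketch of these activations and their derivatives (the leaky-ReLU slope 0.01 follows the lecture; the rest is my own illustration):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def d_sigmoid(z):
    s = sigmoid(z)
    return s * (1 - s)

def d_tanh(z):
    return 1 - np.tanh(z) ** 2

def relu(z):
    return np.maximum(0, z)

def d_relu(z):
    return (z > 0).astype(float)

def leaky_relu(z):
    return np.maximum(0.01 * z, z)

def d_leaky_relu(z):
    return np.where(z > 0, 1.0, 0.01)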
Lecture 009: Gradient descent for neural networks
- forward propagation & back propagation
Lecture 010: Back-propagation intuition (Optional)
Lecture 011: Random Initialization
- What happens if you initialize weights to zero?
Because all the hidden units start off computing the same function and have the same influence on the output unit, no matter how long you train your neural network, all the hidden units are still computing exactly the same function. So in this case there is really no point in having more than one hidden unit.
The solution is to initialize your parameters randomly.
- We initialize the weights to very small random numbers to speed up gradient descent (large values push sigmoid/tanh into their flat, slow-learning regions).
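A minimal sketch of the random initialization described here, for one hidden layer (the layer sizes are placeholders):

import numpy as np

n_x, n_h, n_y = 2, 4, 1                       # example layer sizes

W1 = np.random.randn(n_h, n_x) * 0.01         # small random values break symmetry
b1 = np.zeros((n_h, 1))                       # biases can safely start at zero
W2 = np.random.randn(n_y, n_h) * 0.01
b2 = np.zeros((n_y, 1))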
Week 04: Deep Neural Networks
Lecture 001. Deep L-layer Neural Network
Lecture 002: Forward Propagation in a Deep Network
It turns out that when we implement a deep neural network, one of the ways to increase your odds of having a bug-free implementation is to think very systematically and carefully about the matrix dimensions you're working with. So when I'm trying to debug my own code, I'll often pull out a piece of paper and just think carefully through the dimensions of the matrices I'm working with.
Lecture 003: Getting your matrix dimensions right
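The dimension rules from this lecture, for reference ($m$ examples, layer sizes $n^{[l]}$):
- $W^{[l]}, dW^{[l]}$ : $(n^{[l]}, n^{[l-1]})$
- $b^{[l]}, db^{[l]}$ : $(n^{[l]}, 1)$
- $Z^{[l]}, A^{[l]}$ (and $dZ^{[l]}, dA^{[l]}$) : $(n^{[l]}, m)$ in the vectorized case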
Lecture 004: Why deep representations?
Some people like to make an analogy between deep neural networks and the human brain, where neuroscientists believe that the human brain also starts off detecting simple things like edges in what your eyes see and builds those up to detect more complex things, like the faces that you see. I think analogies between deep learning and the human brain are sometimes a little bit dangerous, but there is a lot of truth to this being how we think the human brain works: it probably detects simple things like edges first and puts them together to form more and more complex objects. And so that has served as a loose form of inspiration for some deep learning as well.
Lecture 005: Building blocks of deep neural networks
- implementation trick: cache $z^{[l]}$, along with $w^{[l]}, b^{[l]}$
Lecture 006: Forward and Backward Propagation
- Initial doubt: should it be $a^{[l]} = g^{[l]}(z^{[l]}) \to dz^{[l]} = da^{[l]} \div g^{[l]\prime}(z^{[l]})$ ???
- That doubt is wrong, because in this notation $dz^{[l]}$ means the derivative of the cost taken from the top: $\frac{dJ}{dz^{[l]}} = \frac{dJ}{da^{[l]}} \cdot \frac{da^{[l]}}{dz^{[l]}} = da^{[l]} \cdot g^{[l]\prime}(z^{[l]})$ (element-wise), as in the sketch below.
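A sketch of the backward step for one layer using the lecture's formulas (the cache layout is my own assumption; element-wise products use `*`):

import numpy as np

def backward_layer(dA, cache, activation_grad):
    # cache holds (A_prev, W, b, Z) saved during forward prop for this layer
    A_prev, W, b, Z = cache
    m = A_prev.shape[1]
    dZ = dA * activation_grad(Z)            # dZ[l] = dA[l] * g'[l](Z[l])
    dW = np.dot(dZ, A_prev.T) / m           # dW[l] = (1/m) dZ[l] A[l-1].T
    db = np.sum(dZ, axis=1, keepdims=True) / m
    dA_prev = np.dot(W.T, dZ)               # dA[l-1] = W[l].T dZ[l]
    return dA_prev, dW, db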
Lecture 007: Parameters VS Hyperparameters
- Parameters: $w, b$
- Hyperparameters
  - learning rate $\alpha$
  - #iterations
  - #hidden layers $L$
  - #hidden units $n^{[1]}, n^{[2]}, ..., n^{[L]}$
  - choice of activation function
  - momentum
  - mini-batch size
  - various forms of regularization parameters
- Why are they called hyper-parameters? Because they determine the real parameters.
- It is an empirical process: try, try, try until you find what works.
Lecture 008: What does this have to do with the brain?
II. Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization
Week 01: Setting up your ML application
Lecture 001: Train/dev/test sets
In this week, you’ll learn the practical aspects of how to make your neural network work well, ranging from things like hyper-parameter tuning to how to set up your data to how to make sure your optimization algorithm runs quickly.
And what I've seen is that intuitions from one domain or from one application area often do not transfer to other application areas, and the best choices may depend on the amount of data you have.
Cycle of Idea -> Code -> Experiment -> Idea -> Code -> Experiment
Train / Development / Test sets
- Train: for training
- Dev: for hold-out cross-validation
- Test: for an unbiased final estimate
- In the classical ML era: 70%/30%, or 60%/20%/20%
- In the modern big-data era, with a million examples in total, 10k is enough for dev/test: 98%/1%/1%
- It's OK to have no test set; the only drawback is that the reported result is a biased estimate.
Lecture 002: Bias / Variance
I've noticed that almost all the really good machine learning practitioners tend to be very sophisticated in their understanding of bias and variance. Bias and variance is one of those concepts that's easy to learn but difficult to master. Even if you think you've seen the basic concepts of bias and variance, there's often more nuance to it than you'd expect.
In the DL era, there is less discussion of the bias-variance trade-off.
The two key numbers to look at to understand bias and variance are the train set error and the dev set error.
Lecture 003: Basic “Recipe” for ML
Lecture 004: Regularization
If you use L1 regularization, the w will end up being sparse, which means that the w vector will have lots of zeros in it. And some people say that this can help with compressing the model, although, I find that, in practice, L1 regularization to make your model sparse helps only a little bit. So I don’t think it’s used that much, at least not for the purpose of compressing your model.
L2 regularization is just used much much more often.
- lambda is a keyword in Python, so when writing Python code use lambd for $\lambda$.
- L2-norm regularization is also called weight decay: $dw^{[l]} = (\text{from backprop}) + \frac{\lambda}{m} w^{[l]}$, then $w^{[l]} := w^{[l]} - \alpha\, dw^{[l]}$.
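A sketch of how L2 regularization changes the cost and the gradient (my own illustration of the formulas above):

import numpy as np

def l2_cost_term(weights, lambd, m):
    # weights: list of W matrices for all layers; adds (lambda / 2m) * sum of squared weights
    return (lambd / (2 * m)) * sum(np.sum(np.square(W)) for W in weights)

def l2_grad(dW_from_backprop, W, lambd, m):
    # "weight decay": the extra (lambda/m) * W term shrinks W a little every update
    return dW_from_backprop + (lambd / m) * W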
Lecture 005: Why regularization reduces overfitting?
- L2 regularization makes some weights decay towards zero, pushing the model from overfitting towards underfitting; somewhere in between there is a value of $\lambda$ that makes the result just right.
- With L2 regularization, $z$ is constrained to small values near $0$, which fall into the roughly linear part of the activation function, so the NN cannot become overly sophisticated.
Lecture 006: Dropout regularization
# illustrate with layer l = 3
# keep_prob: the probability a unit is kept; (1 - keep_prob): the probability it is dropped
keep_prob = 0.8
# d3: boolean keep/drop mask for layer 3
d3 = np.random.rand(a3.shape[0], a3.shape[1]) < keep_prob
# the dropped-out units are set to 0
a3 = np.multiply(a3, d3)
# "inverted dropout": scale up so the expected value of a3 matches the original full NN
a3 /= keep_prob
Lecture 007: Understanding dropout
- Intuition: Can’t rely on any one feature, so have to spread out weights.
- Dropout is mostly used in computer vision, almost by default, because there is usually not enough image data and CV models tend to overfit.
- With dropout you lose the well-defined cost function J, which becomes hard to compute. So what I usually do is turn off dropout, run my code and make sure J is monotonically decreasing, and then turn dropout back on and hope that I didn't introduce bugs into my code with dropout.
Lecture 008: Other regularization methods
- Data augmentation
- flip horizontally
- random crops of the image
- Early stopping
I personally prefer L2 regularization with several different values of $\lambda$.
Lecture 009: Normalizing inputs
- Normalization steps
  - subtract the mean: $\mu = \frac{1}{m}\sum x^{(i)},\ x := x - \mu$
  - normalize the variance: $\sigma^2 = \frac{1}{m}\sum (x^{(i)})^2,\ x := x / \sigma^2$
  - use the same $\mu, \sigma^2$ to normalize both your train set and your test set
- Why normalize inputs? The cost function becomes better conditioned (more symmetric contours), so gradient descent converges faster.
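A sketch of these normalization steps, following the lecture's formulas as written (note that the training statistics are reused for the test set):

import numpy as np

def normalize(X_train, X_test):
    # X_*: (n_features, m_examples)
    mu = np.mean(X_train, axis=1, keepdims=True)
    X_train = X_train - mu
    sigma2 = np.mean(X_train ** 2, axis=1, keepdims=True)
    X_train = X_train / sigma2
    # apply the *training* statistics to the test set
    X_test = (X_test - mu) / sigma2
    return X_train, X_test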
010: Vanishing/Exploding gradients
- gradients can explode or vanish exponentially with network depth
- with ReLU, initialize the weights with variance $\frac{2}{n^{[l-1]}}$, i.e. scale by np.sqrt(2 / n_prev) (He initialization)
- with tanh, use Xavier initialization: variance $\frac{1}{n^{[l-1]}}$, i.e. scale by np.sqrt(1 / n_prev)
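A sketch of these initialization scalings (the layer sizes are placeholders):

import numpy as np

n_prev, n_l = 64, 32                                    # example layer sizes

# ReLU layers: variance 2 / n^[l-1]  (He initialization)
W_relu = np.random.randn(n_l, n_prev) * np.sqrt(2.0 / n_prev)

# tanh layers: variance 1 / n^[l-1]  (Xavier initialization)
W_tanh = np.random.randn(n_l, n_prev) * np.sqrt(1.0 / n_prev)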
011: Weight initialization for DL
Hopefully that makes your weights not explode too quickly and not decay to zero too quickly, so you can train a reasonably deep network without the weights or the gradients exploding or vanishing too much.
012: Numerical approximation of gradients
- Two-sided difference formula is much more accurate than one-sided difference
$f'(\theta) = \lim_{\epsilon \to 0} \frac{f(\theta + \epsilon) - f(\theta - \epsilon)}{2\epsilon}$
013: Gradient Checking
Gradient checking is a technique that’s helped me save tons of time, and helped me find bugs in my implementations of back propagation many times.
014: Gradient Checking implementation notes
- Don't use in training, only to debug
- If algo fails grad check, look at components to try to identify bug
- Remember regularization
- Doesn’t work with dropout
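A sketch of gradient checking using the two-sided difference above (J, theta and the thresholds follow the lecture's rule of thumb; the function layout is my own):

import numpy as np

def grad_check(J, theta, dtheta_backprop, epsilon=1e-7):
    # theta, dtheta_backprop: 1-D parameter vector and its backprop gradient
    dtheta_approx = np.zeros(theta.shape)
    for i in range(theta.size):
        plus, minus = theta.copy(), theta.copy()
        plus[i] += epsilon
        minus[i] -= epsilon
        dtheta_approx[i] = (J(plus) - J(minus)) / (2 * epsilon)   # two-sided difference
    # relative difference: ~1e-7 is great, ~1e-3 is a red flag (lecture's rule of thumb)
    diff = (np.linalg.norm(dtheta_approx - dtheta_backprop)
            / (np.linalg.norm(dtheta_approx) + np.linalg.norm(dtheta_backprop)))
    return diff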
Week 02: Optimization Algorithms
Lecture 001: Mini-batch gradient descent
Lecture 002: Understanding mini-batch gradient descent
Lecture 003: Exponentially weighted averages
Lecture 004: Understanding Exponentially weighted average
- Why is it called an exponentially weighted average?
Lecture 005: Bias correction in exponentially weighted average
Bias correction makes the computation of these averages more accurate during the warm-up phase.
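A sketch of the exponentially weighted average with bias correction (following the lecture's $v_t = \beta v_{t-1} + (1-\beta)\theta_t$, corrected by $1-\beta^t$):

import numpy as np

def ewa(data, beta=0.9, bias_correction=True):
    v = 0.0
    out = []
    for t, theta in enumerate(data, start=1):
        v = beta * v + (1 - beta) * theta            # exponentially weighted average
        out.append(v / (1 - beta ** t) if bias_correction else v)
    return np.array(out)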
Lecture 006: Gradient descent with momentum
Gradient descent with momentum almost always works faster than the standard gradient descent algorithm. In one sentence, the basic idea is to compute an exponentially weighted average of your gradients, and then use that gradient to update your weights instead.
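A sketch of the momentum update for a single parameter matrix (the function layout is my own; beta = 0.9 is the usual default):

import numpy as np

def momentum_step(W, dW, v_dW, alpha=0.01, beta=0.9):
    # v_dW: exponentially weighted average of past gradients
    v_dW = beta * v_dW + (1 - beta) * dW
    W = W - alpha * v_dW          # update with the averaged gradient
    return W, v_dW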
Lecture 007: RMSprop
- RMSprop, root mean square prop
Lecture 008: Adam optimization algorithm
- Adam: Adaptive Moment Estimation
- Hyperparameter default values: $\beta_1 = 0.9$, $\beta_2 = 0.999$, $\epsilon = 10^{-8}$; the learning rate $\alpha$ still needs to be tuned.
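A sketch of the Adam update for one parameter matrix, using the default hyperparameters above (bias-corrected first and second moments; function layout is my own):

import numpy as np

def adam_step(W, dW, v, s, t, alpha=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    # v: momentum term (1st moment), s: RMSprop term (2nd moment), t: iteration count (1-based)
    v = beta1 * v + (1 - beta1) * dW
    s = beta2 * s + (1 - beta2) * (dW ** 2)
    v_corr = v / (1 - beta1 ** t)              # bias correction
    s_corr = s / (1 - beta2 ** t)
    W = W - alpha * v_corr / (np.sqrt(s_corr) + eps)
    return W, v, s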
Lecture 009: Learning rate decay
Lecture 010: The problem of local optima
Week 03: Hyperparameter tuning & Batch Norm & Programming frameworks
Lecture 008: Softmax regression
- C = #classes = 4 (0, 1, 2, ..., C-1)
- $n^{[L]} = 4 = C$: we want the probability of each class
- $\hat y$ has size $(4, 1)$ and contains 4 probabilities that sum to 1
- how to calculate it? $t = e^{z^{[L]}}$ (element-wise), then $\hat y_i = \frac{t_i}{\sum_j t_j}$
- a softmax classifier with no hidden layers just produces linear decision boundaries
Lecture 009: Training a softmax classifier
- The name softmax contrasts with hardmax.
- Softmax regression generalizes logistic regression to C classes.
- If C = 2, softmax regression reduces to logistic regression (only compute $C_0$; the other class is $1 - C_0$).
- single-example loss function: $L(\hat y, y) = -\sum^{C}_{j=1} y_j \log \hat{y_j}$
- total cost function: $J(w, b) = \frac{1}{m}\sum^{m}_{i=1} L_i$
- the initial gradient for backprop: $dz^{[L]} = \hat y - y$, a vector of size $C \times 1$
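A NumPy sketch of softmax plus its loss, illustrating the formulas above (the max-subtraction is a standard numerical-stability trick, not from the lecture):

import numpy as np

def softmax(z):
    # z: (C, m) raw scores
    t = np.exp(z - np.max(z, axis=0, keepdims=True))
    return t / np.sum(t, axis=0, keepdims=True)

def softmax_cost(Y_hat, Y):
    # Y, Y_hat: (C, m); Y is one-hot
    m = Y.shape[1]
    return -np.sum(Y * np.log(Y_hat)) / m

def dZ_output(Y_hat, Y):
    # convenient gradient at the output layer: dZ[L] = Y_hat - Y
    return Y_hat - Y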
Lecture 010: DL frameworks
Lecture 011: TensorFlow
import numpy as np
import tensorflow as tf
w = tf.Variable(0, dtype = tf.float32)
cost = tf.add(tf.add(w**2, tf.multiply(-10., w)), 25)
# same as
# cost = w**2 - 10*w + 25
train = tf.train.GradientDescentOptimizer(0.01).minimize(cost)
init = tf.global_variables_initializer()
session = tf.Session()
session.run(init)
# before gradient descent optimization
print(session.run(w)) # 0.0
# run one optimization step
session.run(train)
print(session.run(w)) # 0.1
# run 1000 iterations
for i in range(1000):
    session.run(train)
print(session.run(w)) # 4.99999
- Feeding training data into the computation graph via a placeholder:
import numpy as np
import tensorflow as tf
coefficients = np.array([ [1], [-20], [25] ])
w = tf.Variable([0], dtype=tf.float32)
x = tf.placeholder(tf.float32, [3,1])
cost = x[0][0]*w**2 + x[1][0]*w + x[2][0]
train = tf.train.GradientDescentOptimizer(0.01).minimize(cost)
init = tf.global_variables_initializer()
with tf.Session() as session:
    session.run(init)
    for i in range(1000):
        session.run(train, feed_dict = {x: coefficients})
    print(session.run(w))  # w converges towards 10, the minimum of w**2 - 20*w + 25
III. Structuring Machine Learning Projects
IV. Convolutional Neural Networks (CNN)
Week 01: CNN
Lecture 001: Computer Vision
- Image classification: A cat or not?
- Object detection: Find the cars and locate them
- Neural Style Transfer: CNN artist
- If we used the raw image directly, the input would be too large: an RGB image of size 1000 x 1000 has 3M features. With 1k hidden units in layer 1, we would have 3 billion parameters to fit. Impossible! -> That's why we introduce the convolution operation.
Lecture 002: Edge detection example
Lecture 003: More edge detections
So the idea that you can treat these 9 numbers as parameters to be learned has been one of the most powerful ideas in computer vision.
Lecture 004: Padding
Let's say we have an image of size $n \times n$ and a filter of size $f \times f$; the size of the output is:
- No padding: slide the filter from left to right, applying it at every column, but the last $f-1$ columns have no room for a full filter, so the output size is $(n-(f-1)) \times (n-(f-1)) = (n-f+1) \times (n-f+1)$.
  - The original image shrinks,
  - and edge/corner pixels are visited far less often than central pixels, so a lot of edge information is lost.
- Padding to preserve the original size: the missing size in each direction is $(f-1)$, and we pad both sides of that direction, so the padding per side is $p = \frac{f-1}{2}$. The image is padded to $(n+2p) \times (n+2p)$; given $n, p, f$, the output size is $(n+2p-f+1) \times (n+2p-f+1)$. In particular, with $p = \frac{f-1}{2}$ the output size is $n \times n$.
- Terminology: "valid" or "same" convolutions
  - valid: no padding, every pixel used is a real value; $n \times n \ast f \times f \to (n-f+1) \times (n-f+1)$
  - same: pad so that the output size is the same as the input size, $p = \frac{f-1}{2}$
Lecture 005: Strided convolutions
- original image size: $n \times n$
- filter size: $f \times f$
- padding: $p$
- stride: $s$
- output size: $\left(\left\lfloor \frac{n+2p-f}{s} \right\rfloor + 1\right) \times \left(\left\lfloor \frac{n+2p-f}{s} \right\rfloor + 1\right)$
Technical note: what CV calls "convolution" is cross-correlation in mathematics; a strict convolution would be implemented with a vertically and horizontally flipped kernel.
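A small helper reflecting the output-size formula above (my own sketch):

def conv_output_size(n, f, p=0, s=1):
    # floor((n + 2p - f) / s) + 1
    return (n + 2 * p - f) // s + 1

# examples: 'valid' 6x6 * 3x3 -> 4x4; 'same' with p=1 keeps 6x6; 7x7 with f=3, s=2 -> 3x3
assert conv_output_size(6, 3) == 4
assert conv_output_size(6, 3, p=1) == 6
assert conv_output_size(7, 3, p=0, s=2) == 3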
Lecture 006: Convolutions over volumes (RGB images)
Lecture 006: One layer of a convolutional network
- No matter how large the image is, there are only a fixed (3x3x3+1)x10 = 280 parameters to learn. This is one property of CNNs that makes them less prone to over-fitting.
Lecture 007: One layer of a convolutional network
Lecture 008: A simple convolution network example
Lecture 009: Pooling layers
Other than convolutional layers, ConvNets often also use pooling layers to reduce the size of their representation to speed up computation, as well as to make some of the features they detect a bit more robust.
- max pooling: most used, works great. No one knows exactly why; perhaps it picks out the strongest feature.
- average pooling: much less used
- pooling applies to each of your channels independently
- there are no parameters to learn in a pooling layer; it is a fixed operation with only hyperparameters to set.
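A minimal sketch of max pooling on a single channel (pooling is applied to each channel independently; the loop-based layout is my own illustration, not an efficient implementation):

import numpy as np

def max_pool_single_channel(A, f=2, s=2):
    # A: (H, W), one channel
    H, W = A.shape
    out_h, out_w = (H - f) // s + 1, (W - f) // s + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = np.max(A[i*s:i*s+f, j*s:j*s+f])   # no parameters to learn
    return out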
Lecture 010: Convolutional neural network example
- A CNN example, inspired by LeNet-5
- notation: only layers with parameters are counted as layers, so a pooling layer is grouped together with the preceding conv layer.
- a max pooling of size $f = 2, s = 2$ shrinks the original size $(W, H)$ to $(W/2, H/2)$
- And I know this seems like there are a lot of hyperparameters, we’ll give you some more specific suggestions later for how to choose these types of hyperparameters. Maybe one common guideline is to actually not try to invent your own settings of hyperparameters, But to look in the literature to see what hyperparameters that work well for others. And just choose an architecture that has worked well for someone else, and there’s a chance that will work for your application as well.
- As you go deeper in the NN, usually $n_H, n_W$ decrease while the number of channels $n_c$ increases.
- most CNNs follow patterns similar to the one above
- A CNN, the CONV layer, the POOLING layer, and the fully connected layer. A lot of CV research has gone into figuring out how to put together these basic building blocks to build effective neural networks, and putting these things together actually requires quite a bit of insight. I think that one of the best ways for you to gain intuition about how to put these things together is to see a number of concrete examples of how others have done it.
Lecture 011: Why convolutions?
- Compare the input, filter and output sizes: with a fully connected layer we would have $4704 \times 3072 + 4704 \approx 14M$ parameters, but with a conv filter the number of parameters shrinks to $5 \times 5 \times 6 + 6 = 156$. A giant improvement.
- Why is convolution so efficient? There are two main reasons:
  - Parameter sharing: a feature detector (a low-level edge detector, or a high-level eye detector) that's useful in one part of the image is probably useful in another part of the image.
  - Sparsity of connections: in each layer, each output value depends only on a small number of inputs. The response of a convolution depends only on the region covered by the filter and is independent of the rest of the image.
- Putting it together
Week 02: Case Studies
Lecture 001: Why look at case studies?
I think that a good way to get an intuition of how to build components is to read or to see other examples of effective components. And it turns out that a NN architecture that works well on one CV task often works well on other tasks as well.
- classic networks: build the foundation
- ResNet (152 layers deep; good experience of training such a deep model)
- Inception
Lecture 002: Classic networks
DL was starting to gain traction in speech recognition and a few other areas, but it was really this paper that convinced a lot of the CV community to take a serious look at DL, convinced them that DL really does work in CV, and then it grew on to have a huge impact.
- VGG-16
- simple repetitive architecture: CONV = 3x3 filter, s=1, same; MAX-POOL = 2x2, s=2
- goes very deep
- ~138M parameters
Lecture 003: Residual Networks(ResNets)
ResNet is designed to train very deep networks by adding short-cuts (skip connections) to plain networks.
- The huge advantage:
  - in theory, the deeper the network, the smaller the training error
  - but in practice, for plain networks, the curve is U-shaped
  - ResNet actually achieves the theoretical behaviour
Lecture 004: Why ResNets work so well?
- Very good at learning the identity function; the skip connection lets extra layers do no harm (see the sketch below).
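A minimal NumPy sketch of a residual block to illustrate the identity argument (dimensions are assumed to match so the shortcut can be added directly):

import numpy as np

def relu(z):
    return np.maximum(0, z)

def residual_block(a_prev, W1, b1, W2, b2):
    # main path: two linear + ReLU steps
    z1 = np.dot(W1, a_prev) + b1
    a1 = relu(z1)
    z2 = np.dot(W2, a1) + b2
    # shortcut: add the input back before the final non-linearity
    # if W2, b2 are near zero, the block simply outputs relu(a_prev), i.e. the identity is easy to learn
    return relu(z2 + a_prev)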
Lecture 005: Network in Network and 1x1 convolutions
- could be used to shrink/keep/increase the #channels of your volume
Lecture 006: Inception network motivation
If you're building a layer of a NN and you don't want to have to decide between a 1x1, 3x3 or 5x5 filter, or a pooling layer, the inception module says: let's do them all, and concatenate the results.
A huge amount of computation can be cut to roughly 1/10 by using a well-designed bottleneck layer built from 1x1 convolutions.
Lecture 007: Inception Network
- Inception module
- GoogLeNet
Lecture 008: Using open-source implementations
Lecture 009: Transfer Learning
- Transfer someone else's pretrained model and fine-tune it on my own small dataset.
Lecture 010:
V. Sequence Models
Ref
- 吴恩达deeplearning.ai学完总结 (a summary written after finishing Andrew Ng's deeplearning.ai courses): very well done; it greatly inspired my study plan and the structure of these notes.