tyxr5

Neural+machine+translation+with+attention+-+v3

Neural Machine Translation

Welcome to your first programming assignment for this week!

You will build a Neural Machine Translation (NMT) model to translate human readable dates (“25th of June, 2009”) into machine readable dates (“2009-06-25”). You will do this using an attention model, one of the most sophisticated sequence to sequence models.

This notebook was produced together with NVIDIA’s Deep Learning Institute.

Let’s load all the packages you will need for this assignment.

from keras.layers import Bidirectional, Concatenate, Permute, Dot, Input, LSTM, Multiply
from keras.layers import RepeatVector, Dense, Activation, Lambda
from keras.optimizers import Adam
from keras.utils import to_categorical
from keras.models import load_model, Model
import keras.backend as K
import numpy as np

from faker import Faker
import random
from tqdm import tqdm
from babel.dates import format_date
from nmt_utils import *
import matplotlib.pyplot as plt
%matplotlib inline

Using TensorFlow backend.

1 - Translating human readable dates into machine readable dates

The model you will build here could be used to translate from one language to another, such as translating from English to Hindi. However, language translation requires massive datasets and usually takes days of training on GPUs. To give you a place to experiment with these models even without using massive datasets, we will instead use a simpler “date translation” task.

The network will input a date written in a variety of possible formats (e.g. “the 29th of August 1958”, “03/30/1968”, “24 JUNE 1987”) and translate them into standardized, machine readable dates (e.g. “1958-08-29”, “1968-03-30”, “1987-06-24”). We will have the network learn to output dates in the common machine-readable format YYYY-MM-DD.

1.1 - Dataset

We will train the model on a dataset of 10000 human readable dates and their equivalent, standardized, machine readable dates. Let’s run the following cells to load the dataset and print some examples.

m = 10000
dataset, human_vocab, machine_vocab, inv_machine_vocab = load_dataset(m)

100%|██████████| 10000/10000 [00:01<00:00, 8431.37it/s]

dataset[:10]

[('9 may 1998', '1998-05-09'),
 ('10.09.70', '1970-09-10'),
 ('4/28/90', '1990-04-28'),
 ('thursday january 26 1995', '1995-01-26'),
 ('monday march 7 1983', '1983-03-07'),
 ('sunday may 22 1988', '1988-05-22'),
 ('tuesday july 8 2008', '2008-07-08'),
 ('08 sep 1999', '1999-09-08'),
 ('1 jan 1981', '1981-01-01'),
 ('monday may 22 1995', '1995-05-22')]

You’ve loaded:
- dataset: a list of tuples of (human readable date, machine readable date)
- human_vocab: a python dictionary mapping all characters used in the human readable dates to an integer-valued index
- machine_vocab: a python dictionary mapping all characters used in machine readable dates to an integer-valued index. These indices are not necessarily consistent with human_vocab.
- inv_machine_vocab: the inverse dictionary of machine_vocab, mapping from indices back to characters.

Let’s preprocess the data and map the raw text data into the index values. We will also use Tx=30 (which we assume is the maximum length of the human readable date; if we get a longer input, we would have to truncate it) and Ty=10 (since “YYYY-MM-DD” is 10 characters long).

Tx = 30
Ty = 10
X, Y, Xoh, Yoh = preprocess_data(dataset, human_vocab, machine_vocab, Tx, Ty)

print("X.shape:", X.shape)
print("Y.shape:", Y.shape)
print("Xoh.shape:", Xoh.shape)
print("Yoh.shape:", Yoh.shape)

X.shape: (10000, 30)
Y.shape: (10000, 10)
Xoh.shape: (10000, 30, 37)
Yoh.shape: (10000, 10, 11)

You now have:
- X: a processed version of the human readable dates in the training set, where each character is replaced by an index mapped to the character via human_vocab. Each date is further padded to Tx values with a special character (< pad >). X.shape = (m, Tx)
- Y: a processed version of the machine readable dates in the training set, where each character is replaced by the index it is mapped to in machine_vocab. You should have Y.shape = (m, Ty).
- Xoh: one-hot version of X, the “1” entry’s index is mapped to the character thanks to human_vocab. Xoh.shape = (m, Tx, len(human_vocab))
- Yoh: one-hot version of Y, the “1” entry’s index is mapped to the character thanks to machine_vocab. Yoh.shape = (m, Tx, len(machine_vocab)). Here, len(machine_vocab) = 11 since there are 11 characters (‘-’ as well as 0-9).

Lets also look at some examples of preprocessed training examples. Feel free to play with index in the cell below to navigate the dataset and see how source/target dates are preprocessed.

index = 0
print("Source date:", dataset[index][0])
print("Target date:", dataset[index][1])
print()
print("Source after preprocessing (indices):", X[index])
print("Target after preprocessing (indices):", Y[index])
print()
print("Source after preprocessing (one-hot):", Xoh[index])
print("Target after preprocessing (one-hot):", Yoh[index])

Source date: 9 may 1998
Target date: 1998-05-09

Source after preprocessing (indices): [12  0 24 13 34  0  4 12 12 11 36 36 36 36 36 36 36 36 36 36 36 36 36 36 36
 36 36 36 36 36]
Target after preprocessing (indices): [ 2 10 10  9  0  1  6  0  1 10]

Source after preprocessing (one-hot): [[ 0.  0.  0. ...,  0.  0.  0.]
 [ 1.  0.  0. ...,  0.  0.  0.]
 [ 0.  0.  0. ...,  0.  0.  0.]
 ..., 
 [ 0.  0.  0. ...,  0.  0.  1.]
 [ 0.  0.  0. ...,  0.  0.  1.]
 [ 0.  0.  0. ...,  0.  0.  1.]]
Target after preprocessing (one-hot): [[ 0.  0.  1.  0.  0.  0.  0.  0.  0.  0.  0.]
 [ 0.  0.  0.  0.  0.  0.  0.  0.  0.  0.  1.]
 [ 0.  0.  0.  0.  0.  0.  0.  0.  0.  0.  1.]
 [ 0.  0.  0.  0.  0.  0.  0.  0.  0.  1.  0.]
 [ 1.  0.  0.  0.  0.  0.  0.  0.  0.  0.  0.]
 [ 0.  1.  0.  0.  0.  0.  0.  0.  0.  0.  0.]
 [ 0.  0.  0.  0.  0.  0.  1.  0.  0.  0.  0.]
 [ 1.  0.  0.  0.  0.  0.  0.  0.  0.  0.  0.]
 [ 0.  1.  0.  0.  0.  0.  0.  0.  0.  0.  0.]
 [ 0.  0.  0.  0.  0.  0.  0.  0.  0.  0.  1.]]

2 - Neural machine translation with attention

If you had to translate a book’s paragraph from French to English, you would not read the whole paragraph, then close the book and translate. Even during the translation process, you would read/re-read and focus on the parts of the French paragraph corresponding to the parts of the English you are writing down.

The attention mechanism tells a Neural Machine Translation model where it should pay attention to at any step.

2.1 - Attention mechanism

In this part, you will implement the attention mechanism presented in the lecture videos. Here is a figure to remind you how the model works. The diagram on the left shows the attention model. The diagram on the right shows what one “Attention” step does to calculate the attention variables α⟨t,t′⟩, which are used to compute the context variable context⟨t⟩ for each timestep in the output (t=1,…,Ty).

Figure 1: Neural machine translation with attention

Here are some properties of the model that you may notice:

There are two separate LSTMs in this model (see diagram on the left). Because the one at the bottom of the picture is a Bi-directional LSTM and comes before the attention mechanism, we will call it pre-attention Bi-LSTM. The LSTM at the top of the diagram comes after the attention mechanism, so we will call it the post-attention LSTM. The pre-attention Bi-LSTM goes through Tx time steps; the post-attention LSTM goes through Ty time steps.
The post-attention LSTM passes s⟨t⟩,c⟨t⟩ from one time step to the next. In the lecture videos, we were using only a basic RNN for the post-activation sequence model, so the state captured by the RNN output activations s⟨t⟩. But since we are using an LSTM here, the LSTM has both the output activation s⟨t⟩ and the hidden cell state c⟨t⟩. However, unlike previous text generation examples (such as Dinosaurus in week 1), in this model the post-activation LSTM at time t does will not take the specific generated y⟨t−1⟩ as input; it only takes s⟨t⟩ and c⟨t⟩ as input. We have designed the model this way, because (unlike language generation where adjacent characters are highly correlated) there isn’t as strong a dependency between the previous character and the next character in a YYYY-MM-DD date.
We use a⟨t⟩=[a→⟨t⟩;a←⟨t⟩] to represent the concatenation of the activations of both the forward-direction and backward-directions of the pre-attention Bi-LSTM.
The diagram on the right uses a RepeatVector node to copy s⟨t−1⟩’s value Tx times, and then Concatenation to concatenate s⟨t−1⟩ and a⟨t⟩ to compute e⟨t,t′, which is then passed through a softmax to compute α⟨t,t′⟩. We’ll explain how to use RepeatVector and Concatenation in Keras below.

Lets implement this model. You will start by implementing two functions: one_step_attention() and model().

1) one_step_attention(): At step t, given all the hidden states of the Bi-LSTM ([a<1>,a<2>,...,a<Tx>]) and the previous hidden state of the second LSTM (s<t−1>), one_step_attention() will compute the attention weights ([α<t,1>,α<t,2>,...,α<t,Tx>]) and output the context vector (see Figure 1 (right) for details):

c o n t e x t < t > = \sum t' = 0 T x α < t, t' > a < t' > (1)

Note that we are denoting the attention in this notebook context⟨t⟩. In the lecture videos, the context was denoted c⟨t⟩, but here we are calling it context⟨t⟩ to avoid confusion with the (post-attention) LSTM’s internal memory cell variable, which is sometimes also denoted c⟨t⟩.

2) model(): Implements the entire model. It first runs the input through a Bi-LSTM to get back [a<1>,a<2>,...,a<Tx>]. Then, it calls one_step_attention() Ty times (for loop). At each iteration of this loop, it gives the computed context vector c<t> to the second LSTM, and runs the output of the LSTM through a dense layer with softmax activation to generate a prediction y^<t>.

Exercise: Implement one_step_attention(). The function model() will call the layers in one_step_attention() Ty using a for-loop, and it is important that all Ty copies have the same weights. I.e., it should not re-initiaiize the weights every time. In other words, all Ty steps should have shared weights. Here’s how you can implement layers with shareable weights in Keras:
1. Define the layer objects (as global variables for examples).
2. Call these objects when propagating the input.

We have defined the layers you need as global variables. Please run the following cells to create them. Please check the Keras documentation to make sure you understand what these layers are: RepeatVector(), Concatenate(), Dense(), Activation(), Dot().

# Defined shared layers as global variables
repeator = RepeatVector(Tx,name='rep')
concatenator = Concatenate(axis=-1,name='con')
densor1 = Dense(10, activation = "tanh",name='den1')
densor2 = Dense(1, activation = "relu",name='den2')
activator = Activation(softmax, name='attention_weights') # We are using a custom softmax(axis = 1) loaded in this notebook
dotor = Dot(axes = 1,name='dot')

Now you can use these layers to implement one_step_attention(). In order to propagate a Keras tensor object X through one of these layers, use layer(X) (or layer([X,Y]) if it requires multiple inputs.), e.g. densor(X) will propagate X through the Dense(1) layer defined above.


# GRADED FUNCTION: one_step_attention

def one_step_attention(a, s_prev):
    """
    Performs one step of attention: Outputs a context vector computed as a dot product of the attention weights
    "alphas" and the hidden states "a" of the Bi-LSTM.

    Arguments:
    a -- hidden state output of the Bi-LSTM, numpy-array of shape (m, Tx, 2*n_a)
    s_prev -- previous hidden state of the (post-attention) LSTM, numpy-array of shape (m, n_s)

    Returns:
    context -- context vector, input of the next (post-attetion) LSTM cell
    """

    ### START CODE HERE ###
    # Use repeator to repeat s_prev to be of shape (m, Tx, n_s) so that you can concatenate it with all hidden states "a" (≈ 1 line)
    s_prev = repeator(s_prev)
    # Use concatenator to concatenate a and s_prev on the last axis (≈ 1 line)
    concat = concatenator([a,s_prev])
    # Use densor1 to propagate concat through a small fully-connected neural network to compute the "intermediate energies" variable e. (≈1 lines)
    e = densor1(concat)
    # Use densor2 to propagate e through a small fully-connected neural network to compute the "energies" variable energies. (≈1 lines)
    energies = densor2(e)
    # Use "activator" on "energies" to compute the attention weights "alphas" (≈ 1 line)
    alphas = activator(energies)
    # Use dotor together with "alphas" and "a" to compute the context vector to be given to the next (post-attention) LSTM-cell (≈ 1 line)
    context = dotor([alphas,a])
    ### END CODE HERE ###

    return context

You will be able to check the expected output of one_step_attention() after you’ve coded the model() function.

Exercise: Implement model() as explained in figure 2 and the text above. Again, we have defined global layers that will share weights to be used in model().

n_a = 32
n_s = 64
post_activation_LSTM_cell = LSTM(n_s, return_state = True)
output_layer = Dense(len(machine_vocab), activation=softmax)

Now you can use these layers Ty times in a for loop to generate the outputs, and their parameters will not be reinitialized. You will have to carry out the following steps:

Propagate the input into a Bidirectional LSTM
Iterate for t=0,…,Ty−1:
1. Call one_step_attention() on [α<t,1>,α<t,2>,...,α<t,Tx>] and s<t−1> to get the context vector context<t>.
2. Give context<t> to the post-attention LSTM cell. Remember pass in the previous hidden-state s⟨t−1⟩ and cell-states c⟨t−1⟩ of this LSTM using initial_state= [previous hidden state, previous cell state]. Get back the new hidden state s<t> and the new cell state c<t>.
3. Apply a softmax layer to s<t>, get the output.
4. Save the output by adding it to the list of outputs.
Create your Keras model instance, it should have three inputs (“inputs”, s<0> and c<0>) and output the list of “outputs”.

# GRADED FUNCTION: model

def model(Tx, Ty, n_a, n_s, human_vocab_size, machine_vocab_size):
    """
    Arguments:
    Tx -- length of the input sequence
    Ty -- length of the output sequence
    n_a -- hidden state size of the Bi-LSTM
    n_s -- hidden state size of the post-attention LSTM
    human_vocab_size -- size of the python dictionary "human_vocab"
    machine_vocab_size -- size of the python dictionary "machine_vocab"

    Returns:
    model -- Keras model instance
    """

    # Define the inputs of your model with a shape (Tx,)
    # Define s0 and c0, initial hidden state for the decoder LSTM of shape (n_s,)
    X = Input(shape=(Tx, human_vocab_size))
    s0 = Input(shape=(n_s,), name='s0')
    c0 = Input(shape=(n_s,), name='c0')
    s = s0
    c = c0

    # Initialize empty list of outputs
    outputs = []

    ### START CODE HERE ###

    # Step 1: Define your pre-attention Bi-LSTM. Remember to use return_sequences=True. (≈ 1 line)
    a = Bidirectional(LSTM(n_a, return_sequences=True),input_shape=(m, Tx, n_a*2))(X)
    print(a.shape)
    print(Ty)
    # Step 2: Iterate for Ty steps
    for t in range(Ty):

        # Step 2.A: Perform one step of the attention mechanism to get back the context vector at step t (≈ 1 line)
        context =  one_step_attention(a, s)

        # Step 2.B: Apply the post-attention LSTM cell to the "context" vector.
        # Don't forget to pass: initial_state = [hidden state, cell state] (≈ 1 line)
        s, _, c =  post_activation_LSTM_cell(context,initial_state = [s, c] )

        # Step 2.C: Apply Dense layer to the hidden state output of the post-attention LSTM (≈ 1 line)
        out = output_layer(s)

        # Step 2.D: Append "out" to the "outputs" list (≈ 1 line)
        outputs.append(out)

    # Step 3: Create model instance taking three inputs and returning the list of outputs. (≈ 1 line)
    model = Model(inputs=[X,s0,c0],outputs=outputs)

    ### END CODE HERE ###

    return model

Run the following cell to create your model.

model = model(Tx, Ty, n_a, n_s, len(human_vocab), len(machine_vocab))

(?, ?, 64)
10

Let’s get a summary of the model to check if it matches the expected output.

model.summary()

____________________________________________________________________________________________________
Layer (type)                     Output Shape          Param #     Connected to                     
====================================================================================================
input_20 (InputLayer)            (None, 30, 37)        0                                            
____________________________________________________________________________________________________
s0 (InputLayer)                  (None, 64)            0                                            
____________________________________________________________________________________________________
bidirectional_19 (Bidirectional) (None, 30, 64)        17920       input_20[0][0]                   
____________________________________________________________________________________________________
rep (RepeatVector)               (None, 30, 64)        0           s0[0][0]                         
                                                                   lstm_24[10][0]                   
                                                                   lstm_24[11][0]                   
                                                                   lstm_24[12][0]                   
                                                                   lstm_24[13][0]                   
                                                                   lstm_24[14][0]                   
                                                                   lstm_24[15][0]                   
                                                                   lstm_24[16][0]                   
                                                                   lstm_24[17][0]                   
                                                                   lstm_24[18][0]                   
____________________________________________________________________________________________________
con (Concatenate)                (None, 30, 128)       0           bidirectional_19[0][0]           
                                                                   rep[10][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[11][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[12][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[13][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[14][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[15][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[16][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[17][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[18][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[19][0]                       
____________________________________________________________________________________________________
den1 (Dense)                     (None, 30, 10)        1290        con[10][0]                       
                                                                   con[11][0]                       
                                                                   con[12][0]                       
                                                                   con[13][0]                       
                                                                   con[14][0]                       
                                                                   con[15][0]                       
                                                                   con[16][0]                       
                                                                   con[17][0]                       
                                                                   con[18][0]                       
                                                                   con[19][0]                       
____________________________________________________________________________________________________
den2 (Dense)                     (None, 30, 1)         11          den1[10][0]                      
                                                                   den1[11][0]                      
                                                                   den1[12][0]                      
                                                                   den1[13][0]                      
                                                                   den1[14][0]                      
                                                                   den1[15][0]                      
                                                                   den1[16][0]                      
                                                                   den1[17][0]                      
                                                                   den1[18][0]                      
                                                                   den1[19][0]                      
____________________________________________________________________________________________________
attention_weights (Activation)   (None, 30, 1)         0           den2[10][0]                      
                                                                   den2[11][0]                      
                                                                   den2[12][0]                      
                                                                   den2[13][0]                      
                                                                   den2[14][0]                      
                                                                   den2[15][0]                      
                                                                   den2[16][0]                      
                                                                   den2[17][0]                      
                                                                   den2[18][0]                      
                                                                   den2[19][0]                      
____________________________________________________________________________________________________
dot (Dot)                        (None, 1, 64)         0           attention_weights[10][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[11][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[12][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[13][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[14][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[15][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[16][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[17][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[18][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[19][0]         
                                                                   bidirectional_19[0][0]           
____________________________________________________________________________________________________
c0 (InputLayer)                  (None, 64)            0                                            
____________________________________________________________________________________________________
lstm_24 (LSTM)                   [(None, 64), (None, 6 33024       dot[10][0]                       
                                                                   s0[0][0]                         
                                                                   c0[0][0]                         
                                                                   dot[11][0]                       
                                                                   lstm_24[10][0]                   
                                                                   lstm_24[10][2]                   
                                                                   dot[12][0]                       
                                                                   lstm_24[11][0]                   
                                                                   lstm_24[11][2]                   
                                                                   dot[13][0]                       
                                                                   lstm_24[12][0]                   
                                                                   lstm_24[12][2]                   
                                                                   dot[14][0]                       
                                                                   lstm_24[13][0]                   
                                                                   lstm_24[13][2]                   
                                                                   dot[15][0]                       
                                                                   lstm_24[14][0]                   
                                                                   lstm_24[14][2]                   
                                                                   dot[16][0]                       
                                                                   lstm_24[15][0]                   
                                                                   lstm_24[15][2]                   
                                                                   dot[17][0]                       
                                                                   lstm_24[16][0]                   
                                                                   lstm_24[16][2]                   
                                                                   dot[18][0]                       
                                                                   lstm_24[17][0]                   
                                                                   lstm_24[17][2]                   
                                                                   dot[19][0]                       
                                                                   lstm_24[18][0]                   
                                                                   lstm_24[18][2]                   
____________________________________________________________________________________________________
dense_11 (Dense)                 (None, 11)            715         lstm_24[10][0]                   
                                                                   lstm_24[11][0]                   
                                                                   lstm_24[12][0]                   
                                                                   lstm_24[13][0]                   
                                                                   lstm_24[14][0]                   
                                                                   lstm_24[15][0]                   
                                                                   lstm_24[16][0]                   
                                                                   lstm_24[17][0]                   
                                                                   lstm_24[18][0]                   
                                                                   lstm_24[19][0]                   
====================================================================================================
Total params: 52,960
Trainable params: 52,960
Non-trainable params: 0
____________________________________________________________________________________________________

Expected Output:

Here is the summary you should see

Total params:	185,484
Trainable params:	185,484
Non-trainable params:	0
bidirectional_1’s output shape	(None, 30, 128)
repeat_vector_1’s output shape	(None, 30, 128)
concatenate_1’s output shape	(None, 30, 256)
attention_weights’s output shape	(None, 30, 1)
dot_1’s output shape	(None, 1, 128)
dense_2’s output shape	(None, 11)

As usual, after creating your model in Keras, you need to compile it and define what loss, optimizer and metrics your are want to use. Compile your model using categorical_crossentropy loss, a custom Adam optimizer (learning rate = 0.005, β1=0.9, β2=0.999, decay = 0.01) and ['accuracy'] metrics:

### START CODE HERE ### (≈2 lines)
opt = Adam(lr=0.005, beta_1=0.9, beta_2=0.999,decay=0.01)
model.compile(loss='categorical_crossentropy', optimizer=opt,metrics=['accuracy'])
### END CODE HERE ###

The last step is to define all your inputs and outputs to fit the model:
- You already have X of shape (m=10000,Tx=30) containing the training examples.
- You need to create s0 and c0 to initialize your post_activation_LSTM_cell with 0s.
- Given the model() you coded, you need the “outputs” to be a list of 11 elements of shape (m, T_y). So that: outputs[i][0], ..., outputs[i][Ty] represent the true labels (characters) corresponding to the ith training example (X[i]). More generally, outputs[i][j] is the true label of the jth character in the ith training example.

s0 = np.zeros((m, n_s))
c0 = np.zeros((m, n_s))
outputs = list(Yoh.swapaxes(0,1))

Let’s now fit the model and run it for one epoch.

model.fit([Xoh, s0, c0], outputs, epochs=1, batch_size=100)

Epoch 1/1
10000/10000 [==============================] - 38s - loss: 17.0479 - dense_11_loss_1: 1.2586 - dense_11_loss_2: 1.0343 - dense_11_loss_3: 1.7487 - dense_11_loss_4: 2.7333 - dense_11_loss_5: 0.8554 - dense_11_loss_6: 1.3875 - dense_11_loss_7: 2.7841 - dense_11_loss_8: 1.0104 - dense_11_loss_9: 1.6879 - dense_11_loss_10: 2.5477 - dense_11_acc_1: 0.4415 - dense_11_acc_2: 0.6585 - dense_11_acc_3: 0.3054 - dense_11_acc_4: 0.0765 - dense_11_acc_5: 0.9349 - dense_11_acc_6: 0.2551 - dense_11_acc_7: 0.0418 - dense_11_acc_8: 0.9262 - dense_11_acc_9: 0.3003 - dense_11_acc_10: 0.1159

While training you can see the loss as well as the accuracy on each of the 10 positions of the output. The table below gives you an example of what the accuracies could be if the batch had 2 examples:

Thus, dense_2_acc_8: 0.89 means that you are predicting the 7th character of the output correctly 89% of the time in the current batch of data.

We have run this model for longer, and saved the weights. Run the next cell to load our weights. (By training a model for several minutes, you should be able to obtain a model of similar accuracy, but loading our model will save you time.)

model.load_weights('models/model.h5')

You can now see the results on new examples.

EXAMPLES = ['3 May 1979', '5 April 09', '21th of August 2016', 'Tue 10 Jul 2007', 'Saturday May 9 2018', 'March 3 2001', 'March 3rd 2001', '1 March 2001']
for example in EXAMPLES:

    source = string_to_int(example, Tx, human_vocab)
    source = np.array(list(map(lambda x: to_categorical(x, num_classes=len(human_vocab)), source))).swapaxes(0,1)
    prediction = model.predict([source, s0, c0])
    prediction = np.argmax(prediction, axis = -1)
    output = [inv_machine_vocab[int(i)] for i in prediction]

    print("source:", example)
    print("output:", ''.join(output))

source: 3 May 1979
output: 1979-05-03
source: 5 April 09
output: 2009-05-05
source: 21th of August 2016
output: 2016-08-21
source: Tue 10 Jul 2007
output: 2007-07-10
source: Saturday May 9 2018
output: 2018-05-09
source: March 3 2001
output: 2001-03-03
source: March 3rd 2001
output: 2001-03-03
source: 1 March 2001
output: 2001-03-01

You can also change these examples to test with your own examples. The next part will give you a better sense on what the attention mechanism is doing–i.e., what part of the input the network is paying attention to when generating a particular output character.

3 - Visualizing Attention (Optional / Ungraded)

Since the problem has a fixed output length of 10, it is also possible to carry out this task using 10 different softmax units to generate the 10 characters of the output. But one advantage of the attention model is that each part of the output (say the month) knows it needs to depend only on a small part of the input (the characters in the input giving the month). We can visualize what part of the output is looking at what part of the input.

Consider the task of translating “Saturday 9 May 2018” to “2018-05-09”. If we visualize the computed α⟨t,t′⟩ we get this:

Figure 8: Full Attention Map

Notice how the output ignores the “Saturday” portion of the input. None of the output timesteps are paying much attention to that portion of the input. We see also that 9 has been translated as 09 and May has been correctly translated into 05, with the output paying attention to the parts of the input it needs to to make the translation. The year mostly requires it to pay attention to the input’s “18” in order to generate “2018.”

3.1 - Getting the activations from the network

Lets now visualize the attention values in your network. We’ll propagate an example through the network, then visualize the values of α⟨t,t′⟩.

To figure out where the attention values are located, let’s start by printing a summary of the model .

model.summary()

____________________________________________________________________________________________________
Layer (type)                     Output Shape          Param #     Connected to                     
====================================================================================================
input_20 (InputLayer)            (None, 30, 37)        0                                            
____________________________________________________________________________________________________
s0 (InputLayer)                  (None, 64)            0                                            
____________________________________________________________________________________________________
bidirectional_19 (Bidirectional) (None, 30, 64)        17920       input_20[0][0]                   
____________________________________________________________________________________________________
rep (RepeatVector)               (None, 30, 64)        0           s0[0][0]                         
                                                                   lstm_24[10][0]                   
                                                                   lstm_24[11][0]                   
                                                                   lstm_24[12][0]                   
                                                                   lstm_24[13][0]                   
                                                                   lstm_24[14][0]                   
                                                                   lstm_24[15][0]                   
                                                                   lstm_24[16][0]                   
                                                                   lstm_24[17][0]                   
                                                                   lstm_24[18][0]                   
____________________________________________________________________________________________________
con (Concatenate)                (None, 30, 128)       0           bidirectional_19[0][0]           
                                                                   rep[10][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[11][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[12][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[13][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[14][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[15][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[16][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[17][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[18][0]                       
                                                                   bidirectional_19[0][0]           
                                                                   rep[19][0]                       
____________________________________________________________________________________________________
den1 (Dense)                     (None, 30, 10)        1290        con[10][0]                       
                                                                   con[11][0]                       
                                                                   con[12][0]                       
                                                                   con[13][0]                       
                                                                   con[14][0]                       
                                                                   con[15][0]                       
                                                                   con[16][0]                       
                                                                   con[17][0]                       
                                                                   con[18][0]                       
                                                                   con[19][0]                       
____________________________________________________________________________________________________
den2 (Dense)                     (None, 30, 1)         11          den1[10][0]                      
                                                                   den1[11][0]                      
                                                                   den1[12][0]                      
                                                                   den1[13][0]                      
                                                                   den1[14][0]                      
                                                                   den1[15][0]                      
                                                                   den1[16][0]                      
                                                                   den1[17][0]                      
                                                                   den1[18][0]                      
                                                                   den1[19][0]                      
____________________________________________________________________________________________________
attention_weights (Activation)   (None, 30, 1)         0           den2[10][0]                      
                                                                   den2[11][0]                      
                                                                   den2[12][0]                      
                                                                   den2[13][0]                      
                                                                   den2[14][0]                      
                                                                   den2[15][0]                      
                                                                   den2[16][0]                      
                                                                   den2[17][0]                      
                                                                   den2[18][0]                      
                                                                   den2[19][0]                      
____________________________________________________________________________________________________
dot (Dot)                        (None, 1, 64)         0           attention_weights[10][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[11][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[12][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[13][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[14][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[15][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[16][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[17][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[18][0]         
                                                                   bidirectional_19[0][0]           
                                                                   attention_weights[19][0]         
                                                                   bidirectional_19[0][0]           
____________________________________________________________________________________________________
c0 (InputLayer)                  (None, 64)            0                                            
____________________________________________________________________________________________________
lstm_24 (LSTM)                   [(None, 64), (None, 6 33024       dot[10][0]                       
                                                                   s0[0][0]                         
                                                                   c0[0][0]                         
                                                                   dot[11][0]                       
                                                                   lstm_24[10][0]                   
                                                                   lstm_24[10][2]                   
                                                                   dot[12][0]                       
                                                                   lstm_24[11][0]                   
                                                                   lstm_24[11][2]                   
                                                                   dot[13][0]                       
                                                                   lstm_24[12][0]                   
                                                                   lstm_24[12][2]                   
                                                                   dot[14][0]                       
                                                                   lstm_24[13][0]                   
                                                                   lstm_24[13][2]                   
                                                                   dot[15][0]                       
                                                                   lstm_24[14][0]                   
                                                                   lstm_24[14][2]                   
                                                                   dot[16][0]                       
                                                                   lstm_24[15][0]                   
                                                                   lstm_24[15][2]                   
                                                                   dot[17][0]                       
                                                                   lstm_24[16][0]                   
                                                                   lstm_24[16][2]                   
                                                                   dot[18][0]                       
                                                                   lstm_24[17][0]                   
                                                                   lstm_24[17][2]                   
                                                                   dot[19][0]                       
                                                                   lstm_24[18][0]                   
                                                                   lstm_24[18][2]                   
____________________________________________________________________________________________________
dense_11 (Dense)                 (None, 11)            715         lstm_24[10][0]                   
                                                                   lstm_24[11][0]                   
                                                                   lstm_24[12][0]                   
                                                                   lstm_24[13][0]                   
                                                                   lstm_24[14][0]                   
                                                                   lstm_24[15][0]                   
                                                                   lstm_24[16][0]                   
                                                                   lstm_24[17][0]                   
                                                                   lstm_24[18][0]                   
                                                                   lstm_24[19][0]                   
====================================================================================================
Total params: 52,960
Trainable params: 52,960
Non-trainable params: 0
____________________________________________________________________________________________________

Navigate through the output of model.summary() above. You can see that the layer named attention_weights outputs the alphas of shape (m, 30, 1) before dot_2 computes the context vector for every time step t=0,…,Ty−1. Lets get the activations from this layer.

The function attention_map() pulls out the attention values from your model and plots them.

attention_map = plot_attention_map(model, human_vocab, inv_machine_vocab, "Tuesday 09 Oct 1993", num = 7, n_s = 64)

On the generated plot you can observe the values of the attention weights for each character of the predicted output. Examine this plot and check that where the network is paying attention makes sense to you.

In the date translation application, you will observe that most of the time attention helps predict the year, and hasn’t much impact on predicting the day/month.

Congratulations!

You have come to the end of this assignment

Here’s what you should remember from this notebook:

Machine translation models can be used to map from one sequence to another. They are useful not just for translating human languages (like French->English) but also for tasks like date format translation.
An attention mechanism allows a network to focus on the most relevant parts of the input when producing a specific part of the output.
A network using an attention mechanism can translate from inputs of length Tx to outputs of length Ty, where Tx and Ty can be different.
You can visualize attention weights α⟨t,t′⟩ to see what the network is paying attention to while generating each output.

Congratulations on finishing this assignment! You are now able to implement an attention model and use it to learn complex mappings from one sequence to another.

你可能感兴趣的:(吴恩达深度学习作业收集)

Redis Copy-on-Write机制： SHENKEM redis 数据库缓存
Copy-on-Write机制：父子进程共享内存页当父进程修改数据时，内核会复制被修改的页这可能导致内存使用量暂时增加通俗的话描述一下可以用一个生活中的例子来通俗解释Copy-on-Write（写时复制）机制：比喻：父子共用一本作业本假设有一对父子（父进程和子进程）要完成以下任务：初始状态：父亲有一本写满数据的作业本（Redis内存数据），现在孩子需要做一份完全相同的作业（RDB持久化）。传统方式
让Ealge管理设计素材与灵感库 UI设计达人
收集「灵感」和「设计素材」几乎是UI设计师日常必不可少的工作。偶尔还会收藏大神的经验文章，而这些东西都收藏在各大网站上，当过一段时间后，你就会发现收集资料都不知道放哪里或是收藏在哪个网站上，所以我们需要一款工具来管理这所有，而今天介绍的设计工具Eagle即可实现这一切。Eagle可以解决大量图片素材「收藏、整理、查找」的各种困扰，让你可以轻松管理各种图片，提升工作效率，同时支持Mac与Window
DolphinScheduler 如何高效调度 AnalyticDB on Spark 作业？ DolphinScheduler社区 spark 大数据分布式
DolphinScheduler是一个分布式易扩展的可视化DAG工作流任务调度开源系统，能高效地执行和管理大数据流程。用户可以在DolphinSchedulerWeb界面轻松创建、编辑和调度云原生数据仓库AnalyticDBMySQL版的Spark作业。前提条件AnalyticDBforMySQL集群的产品系列为企业版、基础版或湖仓版。AnalyticDBforMySQL集群中已创建Job型资源组
python作业陈小铃子 python 开发语言
基础练习练习目标函数01.计算车费题目描述小红打车，起步价8元(3公里),每公里收费2元，她打车行驶了n公里，通过函数封装并计算车费输入描述输入一个公里数输出描述输出应付车费示例输入：5输出：12defcalculate_fare(distance):base_price=8#起步价per_km_cost=2#每公里费用min_distance=3#最小计费距离ifdistance0:sum_nu
《海源寺猜想》龍茶館
图片发自App龍•茶馆（一）众仙来朝执云子龙门奕棋笻竹紫衣提蓝里可是一尾金鱼有五百市井众生在等待挂印封禅菩提都住在一座山上临海的大钟在收集风声绿尽的百花红透了满山又被百花悄悄覆盖来迟的永远是时间季节都重叠在春天一千年太短远远的春天醒来还是从未走远的春天五百里春秋纳霞降瑞法海正源金丝银线锦绣慰依黑山大风的领前古滇王国英雄儿女已经老去的熠熠生辉的文字在浩渺烟波里跌宕出大悲古风这伏藏在法脉里的文脉啊这文
亲子日记25 我死行了吧
2018年10月13日.星期六.晴今天我和两个孩子都休息，我们三睡到9点多才起床，起来吃了点东西，姐姐开始写作业，先把周四晚上的作业补上，告诉她要写拼音，她说老师说从第一页开始写，我说，那你写吧，一会的功夫说写完了，从a写到了ui……，孩，你倒是省事，怎么简单怎么来……图片发自App我告诉她老师不是布置这个作业，她立马垮了脸，可怜兮兮的说，我已经写了很多了啊，怎么还有那么多！？我说，是啊，你为了偷
我的这一年老普洱
走走停停，加入007这一年，到底收获了什么？01一份自律加入007之后发现，有太多的比你厉害还比你努力的人。02一份真实人只要一虚假，智慧就会被屏蔽；一真实，智慧就显现。这一年，看过了太多的励志成功学，已经自动免疫。每次应付作业，心里总会些许不安。每次提前交作业，心里总有些许得意。03一点感悟成年人的学习，没有作业，没有考试，这容易导致一个结果，就是所谓“学了”其实跟没学一样。不要幻觉自己在“学习
车辆云端威胁情报共享系统的多维解析与发展路径百态老人大数据人工智能
第一部分：内容本质提取原始内容描述了一个闭环网络安全体系：“车辆实时上传异常行为日志至安全运营中心（VSOC），云端通过机器学习分析攻击模式并下发全局防御策略”。其核心架构包含：数据采集层：车辆端持续收集异常行为日志数据，包含CAN总线通信模式、网络流量特征及驾驶行为数据传输层：通过V2X通信协议和OTA更新通道实现车云双向通信分析层：安全运营中心(VSOC)采用CNN-BiSRU等深度学习模型进
UE 编译项目时遇到的各种问题（收集中） xx-xzh UE ue5
UE编译项目时遇到的各种问题（收集中）问题1：0>Microsoft.MakeFile.Targets(44,5):ErrorMSB3073解决办法：关闭已经打开的UE编译器，然后重新编译即可问题2：Nobuildactionfilesorversionfilesspecified解决办法：启动项目选错了，切换启动项目为你的项目问题3：GENERATED_BODY()报错解决办法：确保这两行代码在
读书收获安心1978
中原焦点团队中20李倩，坚持分享第567天。2021年8月10日。读心理咨询师三级教程，学到了很多，在咨询过程中应该加以注意或要使用到的地方。比如今天读的第五单元了解，求助者的既往史寻找有价值的资料。在这一块儿看到了在咨询过程中，对来访者收集曾经有过的咨询或治疗经历的一些资料。包括何时为何去做过何种咨询或治疗，当时的诊断是什么？怎么治疗用了什么药？用了什么方法？效果如何？现在是什么样的情况？比如求
我读《史记·刺客列传》奋笔疾书的待业妈妈
这个月的陪伴营文学史作业是汉的文学史讲义。汉的文学相对于先秦来说，少了不少。而且这个月看到友友们写的先秦文学讲义，每一份都太棒啦！有对整个先秦文学史进行梳理的，也有对某一部作品进行整理概括的，也有对个别文章进行赏析的。在学习友友们的先秦文学讲义时，也大概定下自己的这个月目标——《史记》。《史记》的内容非常多，从三皇五帝到汉，上千年的历史被融入一本书中，司马迁着实是个怪才。司马迁为了完成心中的《史记
补作业小华_ab01
补，有三个意思:1.把残破的东西加上材料修理完整；2.把缺少的东西充实起来或添上；3.益处。衣服烂了得赶紧补衣服，羊丢了得赶紧补羊圈，职位空缺也得抓紧落实……补，很有必要。补作业，也是如此?！可能本周末孩子们过得极其快乐，就像我一样过了一个任性的周末吧，以致今天检查作业状况连出:这个孩子说作业本找不到了，那个孩子说作业忘家了，还有孩子干脆拿着空白日记本滥竽充数交了上来，一会儿问的我火腾腾往上冒。给
第⑥期儿童阅读指导师•中级班，第④周线上读书会，由我主持开展第⑦天！英妈恋上思维导图
今天我必将全力以赴！尽自己所能，组织好本次的线上读书会。白天上班，晚上陪两娃！而这周又多了主持工作，除了给群里100+老师们点赞和回评作业外，还得拟定今日研讨会要分享的主题，以及自己制作PPT。因为本期儿童阅读指导师的学习对象，大多是体制内的语文老师，本身就是顶着巨大的“压力”，硬着头皮接受了本次任务，在老师们面前，绝对不敢怠慢，连续⑦天都坚持“认真学习，好好工作”。今日可是最后一天，也是最关键的
致家长朋友们的一封信韩洼小学董翠玲
亲爱的家长朋友们：你们好！我是一二班的语文老师董老师，很高兴我们可以通过这样的方式进行沟通、交流。在这里，我想说的是，希望通过这一封信，可以让我们的心灵靠的更近，让我们的家校合作更紧密。我们都知道今年是“双减政策”落地的第一年，孩子们不准再上辅导班，一二年级也不准再布置书面作业，至于其他作业也是在延时课上就完成了。延时课的开展，让孩子们与家长共同相处的时间更少了。以前还有作业可以辅导辅导，通过作业
亲子日记545篇2019.3.27 明懿妈妈
今晚下班回家，大宝已经和爸爸在下象棋，第一句话就是骄傲的告诉我，作业已经完成，而且在学校等校车时已经写完。一定是昨天跟他二姨家哥哥聊天时受苦的影响。他的表哥今年四年级，作业从来不带回家，在学校全部完成，英语也很棒。有这样的榜样，大宝加油哦！晚上吃过饭已经八点多，洗漱完，大宝拿着书到我们卧室，和小宝一起看书（小宝只是不停的翻着图片问这这、那那的让我回答），读了一篇后就回房间睡觉了。
2023-06-21 王兰芳
每日一省妞咋了？昨天下午朋友打电话聊她家妞儿。主诉：六年级，1米7多点，学习好，最近学校组织研学三天，回来后发现用圆规刺自己的异常行为，妈妈问她疼吗？答不疼，问啥原因？不说，只是说最开心的就是这三天，没有作业，有压力好几个月了，哪方面的压力不说。问我孩子是咋了？为啥这样？我：你俩的夫妻关系咋样？主诉：可以呀？俺两没啥事儿？俺两都是各忙各的没啥矛盾。我：那就好，平时谁照顾妞呢？你和爸爸跟妞的关系咋样
中原焦点团队焦点初级32期孙晓娟2021年11月29日坚持分享第️12天 85b9745cfed8
周一恐惧证还是没能消失，周未的作业完不成，去了也会被罚到教室外面补作业，这是孩子的理由，今日他没有照常上学。由于老公昨天休息的早，我没有来及和他沟通。早上，老公隐忍很长时间的怒气一下子爆发了，老公说周未有那么多时间，可他写作业了吗？你一次次给他机会，抱着希望，可结果又如何呢。老公的意思我明白，他认为我不应该太信任孩子，没有必要再为这样的孩子付出太多。就在我们不愉快的争执中，孩子起床了，老公起身离开
网络安全第三次作业搭建前端页面并解析
我制作的是一个简单的登录页面网源代码1.CSS中box-sizing:border-box：使元素宽度包含边框和内边距，避免布局因padding变化错位。2.min-height:100vh：让body高度至少等于屏幕高度，确保登录框始终居中，不受内容高度影响。3..login-container的max-width:400px：限制登录框最大宽度，在大屏设备上不无限拉伸，保持美观。4.input
2510-香纱-第13天作业-#裂变增长实验室# 香纱
1、每天做一个问题+专业话术库SOP（未完成）2、每天做一个引流话术库SOP(未完成）3、做一个朋友圈文案素材库，每天收集5条朋友圈文案，截图展示打卡（三条）今天爬楼才看到老师关于两个SOP话术库的解释，按照这个解释的话，目前制作的话题库是偏题的，就不发上来了。朋友圈收集了三个方案：其中两条是类似的模式，只是话术上面会有不同的方式。今天在朋友圈看到裂变群的小伙伴朋友圈有几个人都是这样的模式，看样子
1月29星期一晴倔犟的张博闻
明天孩子期末考试，放学回家说作业很少，做了一张数学试卷，说没作业了。晚饭后自己把明天考试用的纸和笔，检查了一下收拾好书包迎接明天的考试，之后把明天考的科目复习了一遍。希望明天孩子不要急燥，认真面对考试。加油!我相信你是最棒的。
每周复盘 2019年 2.4.---2.10 简书时间煮雨
感悟:再难也要坚持，慢慢找思路，写着写着就顺了！学习:1.死磕！终于完成第二次作业上交，难度四个字一一吭吭哧哧！2.听有书共读《行为设计学一一零成本改变》。3.手勤，眼要勤。及时记录稍纵即逝的灵感，抓住它，更文2篇。不管好坏，在写得过程中锻炼自己。工作:过年待班两天，也没有发生年前担心的那么多事。所以说，焦虑和恐惧只是因为自己的内心还不够强大。休闲与放松:图片发自App1.观影两场:《飞驰人生》和
gpt面试题任小栗 #面试题 gpt vue.js 前端
vue面试题一、响应式系统相关❓1.Vue3的响应式系统是如何实现的？和Vue2有何本质区别？答案：Vue3使用Proxy实现响应式（位于@vue/reactivity模块），替代Vue2的Object.defineProperty。核心机制如下：使用targetMap:WeakMap存储依赖关系利用track()和trigger()方法实现依赖收集与派发更新effect()包装副作用函数，自动收
2023-09-24 植萱
现在已经是凌晨1点多了，家里还有个五年级的娃娃在补作业，补啥作业呢？补的是她爸比惩罚她给她布置的作业，因为她学校里的练习没有仔细做，检查过了还有错，很多错，哎……今天我们都出去，留她一个人在家3，4个小时，结果就补了网课，没有做她爸比布置的，所以弄这么晚……她爸比的脾气有点……也实在不敢留到天亮再做……所以，熬夜熬夜吧~
微信投票如何快速涨票数,网上投票怎样才能弄到更多的票巨体5个细节！桃朵APP
微信投票如何快速涨票数,网上投票怎样才能弄到更多的票巨体5个细节！专业团队投票微信205956123(长按微信号可复制粘贴)纯人工快速涨票利用社交媒体传播：在微信朋友圈、QQ空间、微博等社交平台上发布投票信息和呼吁亲友支持，并通过加入相关微信群组或论坛积极参与讨论，以扩大投票的影响力和覆盖范围。1个人号码库：收集亲友的手机号码并添加至通讯录，直接通过微信发送投票链接，这样可以迅速扩大票数。有奖互动
深入解析 Spark：关键问题与答案汇总 ※尘 sql hive spark
在大数据处理领域，Spark凭借其高效的计算能力和丰富的功能，成为了众多开发者和企业的首选框架。然而，在使用Spark的过程中，我们会遇到各种各样的问题，从性能优化到算子使用等。本文将围绕Spark的一些核心问题进行详细解答，帮助大家更好地理解和运用Spark。Spark性能优化策略Spark性能优化是提升作业执行效率的关键，主要可以从以下几个方面入手：首先，资源配置优化至关重要。合理设置Exec
全面指南：如何监控Kafka Topic的生产者客户端码农阿豪@新空间包罗万象 kafka 分布式
个人名片作者简介：java领域优质创作者个人主页：码农阿豪工作室：新空间代码工作室（提供各种软件服务)个人邮箱：[[email protected]]个人微信：15279484656个人导航网站：www.forff.top座右铭：总有人要赢。为什么不能是我呢？专栏导航：码农阿豪系列专栏导航面试专栏：收集了java相关高频面试题，面试实战总结️Spring5系列专栏：整理了Spring5重要知识点与
智能体性能优化：延迟、吞吐量与成本控制 .摘星. AI人工智能性能优化人工智能系统架构成本控制智能体
智能体性能优化：延迟、吞吐量与成本控制Hello，我是摘星！在彩虹般绚烂的技术栈中，我是那个永不停歇的色彩收集者。每一个优化都是我培育的花朵，每一个特性都是我放飞的蝴蝶。每一次代码审查都是我的显微镜观察，每一次重构都是我的化学实验。在编程的交响乐中，我既是指挥家也是演奏者。让我们一起，在技术的音乐厅里，奏响属于程序员的华美乐章。目录智能体性能优化：延迟、吞吐量与成本控制摘要1.性能瓶颈识别与分析1
好习惯，除了坚持，还是坚持。青青夏草小花老师
习惯，是指积久养成的生活方式，是决定一个孩子品行的重要基础。很多家有小学生的宝爸宝妈都没有时间给孩子做早餐，且不说起床穿衣是最消耗时间的部分，还有小女孩的扎辫子，早晨要出门才发现忘记签字、忘记带书带作业......这么多的事情曾经让多少妈妈或爸爸崩溃，哪里还有时间吃早餐呀。“早餐要吃好”这吃早餐的习惯，该是有多少孩子没有养成哦。水晶班里有个女孩子，一年级上学期经常迟到，到了上午十点，准点儿肚子疼不
4.10感恩日记4 LISA莹_11ce
（实操作业）1、去拍下今天生活和工作中的照片，或自拍照。2、继续写不少于三条感恩日记，把照片做为感恩日记的配图。可以直接上传朋友圈截图或链接感恩我的工作，给我带来了力量，活力和希望感恩美好的早餐，安抚我的小肚肚感恩我的小宝贝，出门总祝我工作开心，早点回家最重要的还是要感谢自己，感谢自己的努力和不断的成长突然发现生活中多了很多心想事成图片发自App
意外的惊喜吉祥_0486
亲子日记第401天星期三晴新年新气象，2020年的第一天天气如此暖和，想必今年是一个丰收年。原本以为今天我会和孩子们一起放假，想在家好好的陪她们一天，可是事与愿违，等来的确是正常上班，还是和往常一样，二宝跟着我去上班，大宝在家写作业，就在中午下班回家的时候，领导突然叫住我们，说给我们一个惊喜，一人一个汉堡一张贺卡，这是园里给我们的新年礼物，谢谢领导能记得我们幕后工作者。真的礼物不在大小，能把我们和
VMware Workstation 11 或者 VMware Player 7安装MAC OS X 10.10 Yosemite iwindyforest vmware mac os 10.10 workstation player
最近尝试了下VMware下安装MacOS 系统，安装过程中发现网上可供参考的文章都是VMware Workstation 10以下， MacOS X 10.9以下的文章，只能提供大概的思路，但是实际安装起来由于版本问题，走了不少弯路，所以我尝试写以下总结，希望能给有兴趣安装OSX的人提供一点帮助。写在前面的话：其实安装好后发现，由于我的th
关于《基于模型驱动的B/S在线开发平台》源代码开源的疑虑？ deathwknight JavaScript java 框架
本人从学习Java开发到现在已有10年整，从一个要自学 java买成javascript的小菜鸟，成长为只会java和javascript语言的老菜鸟（个人邮箱：[email protected]）一路走来，跌跌撞撞。用自己的三年多业余时间，瞎搞一个小东西（基于模型驱动的B/S在线开发平台，非MVC框架、非代码生成）。希望与大家一起分享，同时有许些疑虑，希望有人可以交流下平台
如何把maven项目转成web项目 Kai_Ge maven MyEclipse
创建Web工程，使用eclipse ee创建maven web工程 1.右键项目,选择Project Facets,点击Convert to faceted from 2.更改Dynamic Web Module的Version为2.5.(3.0为Java7的,Tomcat6不支持). 如果提示错误,可能需要在Java Compiler设置Compiler compl
主管？？？ Array_06 工作
转载：http://www.blogjava.net/fastzch/archive/2010/11/25/339054.html 很久以前跟同事参加的培训，同事整理得很详细，必须得转！前段时间，公司有组织中高阶主管及其培养干部进行了为期三天的管理训练培训。三天的课程下来，虽然内容较多，因对老师三天来的课程内容深有感触，故借着整理学习心得的机会，将三天来的培训课程做了一个
python内置函数大全 2002wmj python
最近一直在看python的document，打算在基础方面重点看一下python的keyword、Build-in Function、Build-in Constants、Build-in Types、Build-in Exception这四个方面，其实在看的时候发现整个《The Python Standard Library》章节都是很不错的，其中描述了很多不错的主题。先把Build-in Fu
JSP页面通过JQUERY合并行 357029540 JavaScript jquery
在写程序的过程中我们难免会遇到在页面上合并单元行的情况，如图所示如果对于会的同学可能很简单，但是对没有思路的同学来说还是比较麻烦的，提供一下用JQUERY实现的参考代码 function mergeCell(){ var trs = $("#table tr"); &nb
Java基础冰天百华 java基础
学习函数式编程 package base; import java.text.DecimalFormat; public class Main { public static void main(String[] args) { // Integer a = 4; // Double aa = (double)a / 100000; // Decimal
unix时间戳相互转换 adminjun 转换 unix 时间戳
如何在不同编程语言中获取现在的Unix时间戳(Unix timestamp)？ Java time JavaScript Math.round(new Date().getTime()/1000) getTime()返回数值的单位是毫秒 Microsoft .NET / C# epoch = (DateTime.Now.ToUniversalTime().Ticks - 62135
作为一个合格程序员该做的事 aijuans 程序员
作为一个合格程序员每天该做的事 1、总结自己一天任务的完成情况最好的方式是写工作日志，把自己今天完成了什么事情，遇见了什么问题都记录下来，日后翻看好处多多 2、考虑自己明天应该做的主要工作把明天要做的事情列出来，并按照优先级排列，第二天应该把自己效率最高的时间分配给最重要的工作 3、考虑自己一天工作中失误的地方，并想出避免下一次再犯的方法出错不要紧，最重
由html5视频播放引发的总结 ayaoxinchao html5 视频 video
前言项目中存在视频播放的功能，前期设计是以flash播放器播放视频的。但是现在由于需要兼容苹果的设备，必须采用html5的方式来播放视频。我就出于兴趣对html5播放视频做了简单的了解，不了解不知道，水真是很深。本文所记录的知识一些浅尝辄止的知识，说起来很惭愧。视频结构本该直接介绍html5的<video>的，但鉴于本人对视频
解决httpclient访问自签名https报javax.net.ssl.SSLHandshakeException: sun.security.validat bewithme httpclient
如果你构建了一个https协议的站点，而此站点的安全证书并不是合法的第三方证书颁发机构所签发，那么你用httpclient去访问此站点会报如下错误 javax.net.ssl.SSLHandshakeException: sun.security.validator.ValidatorException: PKIX path bu
Jedis连接池的入门级使用 bijian1013 redis redis数据库 jedis
Jedis连接池操作步骤如下： a.获取Jedis实例需要从JedisPool中获取； b.用完Jedis实例需要返还给JedisPool； c.如果Jedis在使用过程中出错，则也需要还给JedisPool； packag
变与不变 bingyingao 不变变亲情永恒
变与不变周末骑车转到了五年前租住的小区，曾经最爱吃的西北面馆、江西水饺、手工拉面早已不在，各种店铺都换了好几茬，这些是变的。三年前还很流行的一款手机在今天看起来已经落后的不像样子。三年前还运行的好好的一家公司，今天也已经不复存在。一座座高楼拔地而起，
【Scala十】Scala核心四：集合框架之List bit1129 scala
Spark的RDD作为一个分布式不可变的数据集合，它提供的转换操作，很多是借鉴于Scala的集合框架提供的一些函数，因此，有必要对Scala的集合进行详细的了解 1. 泛型集合都是协变的，对于List而言，如果B是A的子类，那么List[B]也是List[A]的子类，即可以把List[B]的实例赋值给List[A]变量 2. 给变量赋值(注意val关键字，a，b
Nested Functions in C bookjovi c closure
Nested Functions 又称closure，属于functional language中的概念，一直以为C中是不支持closure的，现在看来我错了，不过C标准中是不支持的，而GCC支持。既然GCC支持了closure，那么 lexical scoping自然也支持了，同时在C中label也是可以在nested functions中自由跳转的
Java-Collections Framework学习与总结-WeakHashMap BrokenDreams Collections
总结这个类之前，首先看一下Java引用的相关知识。Java的引用分为四种：强引用、软引用、弱引用和虚引用。强引用：就是常见的代码中的引用，如Object o = new Object();存在强引用的对象不会被垃圾收集
读《研磨设计模式》-代码笔记-解释器模式-Interpret bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ package design.pattern; /* * 解释器（Interpreter）模式的意图是可以按照自己定义的组合规则集合来组合可执行对象 * * 代码示例实现XML里面1.读取单个元素的值 2.读取单个属性的值 * 多
After Effects操作&快捷键 cherishLC After Effects
1、快捷键官方文档中文版：https://helpx.adobe.com/cn/after-effects/using/keyboard-shortcuts-reference.html 英文版：https://helpx.adobe.com/after-effects/using/keyboard-shortcuts-reference.html 2、常用快捷键
Maven 常用命令 crabdave maven
Maven 常用命令 mvn archetype:generate mvn install mvn clean mvn clean complie mvn clean test mvn clean install mvn clean package mvn test mvn package mvn site mvn dependency:res
shell bad substitution daizj shell 脚本
#!/bin/sh /data/script/common/run_cmd.exp 192.168.13.168 "impala-shell -islave4 -q 'insert OVERWRITE table imeis.${tableName} select ${selectFields}, ds, fnv_hash(concat(cast(ds as string), im
Java SE 第二讲（原生数据类型 Primitive Data Type） dcj3sjt126com java
Java SE 第二讲： 1. Windows: notepad, editplus, ultraedit, gvim Linux: vi, vim, gedit 2. Java 中的数据类型分为两大类： 1）原生数据类型（Primitive Data Type） 2）引用类型（对象类型）（R
CGridView中实现批量删除 dcj3sjt126com PHP yii
1，CGridView中的columns添加 array( 'selectableRows' => 2, 'footer' => '<button type="button" onclick="GetCheckbox();" style=&
Java中泛型的各种使用 dyy_gusi java 泛型
Java中的泛型的使用：1.普通的泛型使用在使用类的时候后面的<>中的类型就是我们确定的类型。 public class MyClass1<T> {//此处定义的泛型是T private T var; public T getVar() { return var; } public void setVa
Web开发技术十年发展历程 gcq511120594 Web 浏览器数据挖掘
回顾web开发技术这十年发展历程： Ajax 03年的时候我上六年级，那时候网吧刚在小县城的角落萌生。传奇，大话西游第一代网游一时风靡。我抱着试一试的心态给了网吧老板两块钱想申请个号玩玩，然后接下来的一个小时我一直在，注，册，账，号。彼时网吧用的512k的带宽，注册的时候，填了一堆信息，提交，页面跳转，嘣，”您填写的信息有误，请重填”。然后跳转回注册页面，以此循环。我现在时常想，如果当时a
openSession()与getCurrentSession()区别： hetongfei java DAO Hibernate
来自 http://blog.csdn.net/dy511/article/details/6166134 1.getCurrentSession创建的session会和绑定到当前线程,而openSession不会。 2. getCurrentSession创建的线程会在事务回滚或事物提交后自动关闭,而openSession必须手动关闭。这里getCurrentSession本地事务(本地
第一章安装Nginx+Lua开发环境 jinnianshilongnian nginx lua openresty
首先我们选择使用OpenResty，其是由Nginx核心加很多第三方模块组成，其最大的亮点是默认集成了Lua开发环境，使得Nginx可以作为一个Web Server使用。借助于Nginx的事件驱动模型和非阻塞IO，可以实现高性能的Web应用程序。而且OpenResty提供了大量组件如Mysql、Redis、Memcached等等，使在Nginx上开发Web应用更方便更简单。目前在京东如实时价格、秒
HSQLDB In-Process方式访问内存数据库 liyonghui160com
HSQLDB一大特色就是能够在内存中建立数据库，当然它也能将这些内存数据库保存到文件中以便实现真正的持久化。先睹为快！下面是一个In-Process方式访问内存数据库的代码示例：下面代码需要引入hsqldb.jar包（hsqldb-2.2.8） import java.s
Java线程的5个使用技巧 pda158 java 数据结构
Java线程有哪些不太为人所知的技巧与用法？　　萝卜白菜各有所爱。像我就喜欢Java。学无止境，这也是我喜欢它的一个原因。日常工作中你所用到的工具，通常都有些你从来没有了解过的东西，比方说某个方法或者是一些有趣的用法。比如说线程。没错，就是线程。或者确切说是Thread这个类。当我们在构建高可扩展性系统的时候，通常会面临各种各样的并发编程的问题，不过我们现在所要讲的可能会略有不同。
开发资源大整合：编程语言篇——JavaScript（1） shoothao JavaScript
概述：本系列的资源整合来自于github中各个领域的大牛，来收藏你感兴趣的东西吧。程序包管理器管理javascript库并提供对这些库的快速使用与打包的服务。 Bower - 用于web的程序包管理。 component - 用于客户端的程序包管理，构建更好的web应用程序。 spm - 全新的静态的文件包管
避免使用终结函数 vahoa.ma java jvm C++
终结函数（finalizer）通常是不可预测的，常常也是很危险的，一般情况下不是必要的。使用终结函数会导致不稳定的行为、更差的性能，以及带来移植性问题。不要把终结函数当做C++中的析构函数（destructors）的对应物。我自己总结了一下这一条的综合性结论是这样的： 1）在涉及使用资源，使用完毕后要释放资源的情形下，首先要用一个显示的方

Total params:	185,484
Trainable params:	185,484
Non-trainable params:	0
bidirectional_1’s output shape	(None, 30, 128)
repeat_vector_1’s output shape	(None, 30, 128)
concatenate_1’s output shape	(None, 30, 256)
attention_weights’s output shape	(None, 30, 1)
dot_1’s output shape	(None, 1, 128)
dense_2’s output shape	(None, 11)