xxX.888

Andrew Ng Deep Learning Exercises, Course 5, Week 2, Programming Assignment 2: Emojify

Packages
1 - Baseline Model: Emojifier-V1
- 1.1 - Dataset EMOJISET
- 1.2 - Overview of the Emojifier-V1
- 1.3 - Implementing Emojifier-V1
  - Exercise 1 - sentence_to_avg
- 1.4 - Implement the Model
  - Exercise 2 - model
- 1.5 - Examining Test Set Performance
2 - Emojifier-V2: Using LSTMs in Keras
- 2.1 - Model Overview
- 2.2 - Keras and Mini-batching
- 2.3 - The Embedding Layer
  - Exercise 3 - sentences_to_indices
  - Exercise 4 - pretrained_embedding_layer
- 2.4 - Building the Emojifier-V2
  - Exercise 5 - Emojify_V2
- 2.5 - Train the Model
3 - Acknowledgments

Packages

Let's get started! Run the following cell to load the packages you're going to use.

In [1]:

import numpy as np
from emo_utils import *
import emoji
import matplotlib.pyplot as plt
from test_utils import *

%matplotlib inline

1 - Baseline Model: Emojifier-V1

1.1 - Dataset EMOJISET

Let's start by building a simple baseline classifier.

You have a tiny dataset (X, Y) where:

X contains 127 sentences (strings).
Y contains an integer label between 0 and 4 corresponding to an emoji for each sentence.

Figure 1: EMOJISET - a classification problem with 5 classes. A few examples of sentences are given here.

Load the dataset using the code below. The dataset is split between training (127 examples) and testing (56 examples).

In [2]:

X_train, Y_train = read_csv('data/train_emoji.csv')
X_test, Y_test = read_csv('data/tesss.csv')

In [3]:

maxLen = len(max(X_train, key=len).split())

Run the following cell to print sentences from X_train and corresponding labels from Y_train.

Change idx to see different examples.
Note that due to the font used by iPython notebook, the heart emoji may be colored black rather than red.

In [4]:

for idx in range(10):
    print(X_train[idx], label_to_emoji(Y_train[idx]))

never talk to me again 
I am proud of your achievements 
It is the worst day in my life 
Miss you so much ❤️
food is life 
I love you mum ❤️
Stop saying bullshit 
congratulations on your acceptance 
The assignment is too long  
I want to go play ⚾

1.2 - Overview of the Emojifier-V1

In this section, you'll implement a baseline model called "Emojifier-v1".

Figure 2: Baseline model (Emojifier-V1).

Inputs and Outputs

The input of the model is a string corresponding to a sentence (e.g. "I love you").
The output will be a probability vector of shape (1,5), (indicating that there are 5 emojis to choose from).
The (1,5) probability vector is passed to an argmax layer, which extracts the index of the emoji with the highest probability.

One-hot Encoding

To get your labels into a format suitable for training a softmax classifier, convert $Y$ from its current shape $(m, 1)$ into a "one-hot representation" of shape $(m, 5)$.
- Each row is a one-hot vector giving the label of one example.
- Here, Y_oh stands for "Y-one-hot" in the variable names Y_oh_train and Y_oh_test:

In [5]:

Y_oh_train = convert_to_one_hot(Y_train, C = 5)
Y_oh_test = convert_to_one_hot(Y_test, C = 5)

Now, see what convert_to_one_hot() did. Feel free to change idx to print out different values.

In [6]:

idx = 50
print(f"Sentence '{X_train[idx]}' has label index {Y_train[idx]}, which is emoji {label_to_emoji(Y_train[idx])}", )
print(f"Label index {Y_train[idx]} in one-hot encoding format is {Y_oh_train[idx]}")

Sentence 'I missed you' has label index 0, which is emoji ❤️
Label index 0 in one-hot encoding format is [1. 0. 0. 0. 0.]
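
convert_to_one_hot() comes from emo_utils, and its source is not shown here. The following is a minimal sketch of how such a helper could work, assuming the labels are integers in the range 0 to C - 1. The name one_hot_sketch is hypothetical and is not part of the assignment.

```python
import numpy as np

def one_hot_sketch(Y, C):
    """Hypothetical stand-in for convert_to_one_hot: map integer labels to one-hot rows."""
    Y = np.asarray(Y, dtype=int).reshape(-1)   # flatten (m, 1) or (m,) to (m,)
    return np.eye(C)[Y]                        # row k of the identity matrix is the one-hot vector for label k

labels = np.array([0, 3, 4])
encoded = one_hot_sketch(labels, C=5)
print(encoded)
```

Taking the argmax of each row recovers the original integer label, which is the same operation the model applies to its output probabilities.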

All the data is now ready to be fed into the Emojifier-V1 model. You're ready to implement the model!

1.3 - Implementing Emojifier-V1

As shown in Figure 2 (above), the first step is to:

Convert each word in the input sentence into its word vector representation.
Take an average of the word vectors.

Similar to this week's previous assignment, you'll use pre-trained 50-dimensional GloVe embeddings.

Run the following cell to load the word_to_vec_map, which contains all the vector representations.

In [7]:

word_to_index, index_to_word, word_to_vec_map = read_glove_vecs('data/glove.6B.50d.txt')

You've loaded:

word_to_index: dictionary mapping from words to their indices in the vocabulary
- (400,001 words, with the valid indices ranging from 0 to 400,000)
index_to_word: dictionary mapping from indices to their corresponding words in the vocabulary
word_to_vec_map: dictionary mapping words to their GloVe vector representation.
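
read_glove_vecs() is provided by emo_utils and its source is not shown here. To make the three dictionaries concrete, here is a rough sketch that builds the same kind of structures from a few in-memory lines in GloVe's text layout (a word followed by its numbers). The function name, the toy lines, and the choice to start indexing at 1 are all assumptions made for this illustration.

```python
import numpy as np

def read_vectors_sketch(lines):
    """Hypothetical parser for GloVe-style text: one word followed by its floats per line."""
    word_to_vec_map = {}
    for line in lines:
        parts = line.strip().split()
        word_to_vec_map[parts[0]] = np.array(parts[1:], dtype=np.float64)

    # Assumption of this sketch: indices follow sorted word order and start at 1,
    # which keeps 0 free to act as a padding value later on.
    word_to_index = {w: i for i, w in enumerate(sorted(word_to_vec_map), start=1)}
    index_to_word = {i: w for w, i in word_to_index.items()}
    return word_to_index, index_to_word, word_to_vec_map

toy_lines = ["love 0.1 0.2 0.3", "ball -0.4 0.5 0.0"]   # made-up values, not real GloVe vectors
w2i, i2w, w2v = read_vectors_sketch(toy_lines)
print(w2i)            # {'ball': 1, 'love': 2}
print(w2v["love"])
```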

Run the following cell to check if it works:

In [8]:

word = "cucumber"
idx = 289846
print("the index of", word, "in the vocabulary is", word_to_index[word])
print("the", str(idx) + "th word in the vocabulary is", index_to_word[idx])

the index of cucumber in the vocabulary is 113317
the 289846th word in the vocabulary is potatos

Exercise 1 - sentence_to_avg

Implement sentence_to_avg()

You'll need to carry out two steps:

Convert every sentence to lower-case, then split the sentence into a list of words.
- X.lower() and X.split() might be useful.
For each word in the sentence, access its GloVe representation.
- Then take the average of all of these word vectors.
- You might use numpy.zeros(), which is described in the NumPy documentation.

Additional Hints

When creating the avg array of zeros, you'll want it to be a vector of the same shape as the other word vectors in the word_to_vec_map.
- You can choose a word that exists in the word_to_vec_map and access its .shape field.
- Be careful not to hard-code the word that you access. In other words, don't assume that if you see the word 'the' in the word_to_vec_map within this notebook, that this word will be in the word_to_vec_map when the function is being called by the automatic grader.

Hint: you can use any one of the word vectors that you retrieved from the input sentence to find the shape of a word vector.

In [9]:

# UNQ_C1 (UNIQUE CELL IDENTIFIER, DO NOT EDIT)
# GRADED FUNCTION: sentence_to_avg

def sentence_to_avg(sentence, word_to_vec_map):
    """
 Converts a sentence (string) into a list of words (strings). Extracts the GloVe representation of each word
 and averages its value into a single vector encoding the meaning of the sentence.
    
 Arguments:
 sentence -- string, one training example from X
 word_to_vec_map -- dictionary mapping every word in a vocabulary into its 50-dimensional vector representation
    
 Returns:
 avg -- average vector encoding information about the sentence, numpy-array of shape (J,), where J can be any number
 """
    # Get a valid word contained in the word_to_vec_map. 
    any_word = list(word_to_vec_map.keys())[0]
    
    ### START CODE HERE ###
    # Step 1: Split sentence into list of lower case words (≈ 1 line)
    words = sentence.lower().split()

    # Initialize the average word vector, should have the same shape as your word vectors.
    avg = np.zeros(word_to_vec_map[any_word].shape)
    
    # Initialize count to 0
    count = 0
    
    # Step 2: average the word vectors. You can loop over the words in the list "words".
    for w in words:
        # Check that word exists in word_to_vec_map
        if w in word_to_vec_map:
            avg += word_to_vec_map[w]
            # Increment count
            count +=1
          
    if count > 0:
        # Get the average. But only if count > 0
        avg = avg / count
    
    ### END CODE HERE ###
    
    return avg

In [10]:

# BEGIN UNIT TEST
avg = sentence_to_avg("Morrocan couscous is my favorite dish", word_to_vec_map)
print("avg = \n", avg)

def sentence_to_avg_test(target):
    # Create a controlled word to vec map
    word_to_vec_map = {'a': [3, 3], 'synonym_of_a': [3, 3], 'a_nw': [2, 4], 'a_s': [3, 2], 
                       'c': [-2, 1], 'c_n': [-2, 2],'c_ne': [-1, 2], 'c_e': [-1, 1], 'c_se': [-1, 0], 
                       'c_s': [-2, 0], 'c_sw': [-3, 0], 'c_w': [-3, 1], 'c_nw': [-3, 2]
                      }
    # Convert lists to np.arrays
    for key in word_to_vec_map.keys():
        word_to_vec_map[key] = np.array(word_to_vec_map[key])
        
    avg = target("a a_nw c_w a_s", word_to_vec_map)
    assert tuple(avg.shape) == tuple(word_to_vec_map['a'].shape),  "Check the shape of your avg array"  
    assert np.allclose(avg, [1.25, 2.5]),  "Check that you are finding the 4 words"
    avg = target("love a a_nw c_w a_s", word_to_vec_map)
    assert np.allclose(avg, [1.25, 2.5]), "Divide by count, not len(words)"
    avg = target("love", word_to_vec_map)
    assert np.allclose(avg, [0, 0]), "Average of no words must give an array of zeros"
    avg = target("c_se foo a a_nw c_w a_s deeplearning c_nw", word_to_vec_map)
    assert np.allclose(avg, [0.1666667, 2.0]), "Debug the last example"
    
    print("\033[92mAll tests passed!")
    
sentence_to_avg_test(sentence_to_avg)

# END UNIT TEST

avg = 
 [-0.008005    0.56370833 -0.50427333  0.258865    0.55131103  0.03104983
 -0.21013718  0.16893933 -0.09590267  0.141784   -0.15708967  0.18525867
  0.6495785   0.38371117  0.21102167  0.11301667  0.02613967  0.26037767
  0.05820667 -0.01578167 -0.12078833 -0.02471267  0.4128455   0.5152061
  0.38756167 -0.898661   -0.535145    0.33501167  0.68806933 -0.2156265
  1.797155    0.10476933 -0.36775333  0.750785    0.10282583  0.348925
 -0.27262833  0.66768    -0.10706167 -0.283635    0.59580117  0.28747333
 -0.3366635   0.23393817  0.34349183  0.178405    0.1166155  -0.076433
  0.1445417   0.09808667]
All tests passed!

1.4 - Implement the Model

You now have all the pieces to finish implementing the model() function! After using sentence_to_avg() you need to:

Pass the average through forward propagation
Compute the cost
Backpropagate to update the softmax parameters

Exercise 2 - model

Implement the model() function described in Figure (2).

The equations you need to implement in the forward pass and to compute the cross-entropy cost are below:
The variable $Y_{oh}$ ("Y one hot") is the one-hot encoding of the output labels.

$$ z^{(i)} = W \cdot avg^{(i)} + b $$

$$ a^{(i)} = softmax(z^{(i)}) $$

$$ \mathcal{L}^{(i)} = -\sum_{k = 0}^{n_y - 1} Y_{oh,k}^{(i)} * \log(a^{(i)}_k) $$

Note: It is possible to come up with a more efficient vectorized implementation. For now, just use nested for loops to better understand the algorithm, and for easier debugging.

The function softmax() is provided, and has already been imported.
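
Before filling in the graded cell, it can help to trace the three equations on a single toy example. The sketch below uses made-up numbers (3 classes, a 2-dimensional average vector) and a local softmax_sketch() that stands in for the provided softmax(); the provided version may be implemented differently.

```python
import numpy as np

def softmax_sketch(z):
    """Numerically stable softmax for a 1-D score vector (stand-in for the provided softmax())."""
    e = np.exp(z - np.max(z))
    return e / e.sum()

# Toy values: 3 classes, 2-dimensional average vector (all numbers are made up).
W = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]])
b = np.zeros(3)
avg = np.array([2.0, 1.0])
y_oh = np.array([1.0, 0.0, 0.0])       # true class is 0

z = W @ avg + b                         # z = W avg + b
a = softmax_sketch(z)                   # a = softmax(z)
loss = -np.sum(y_oh * np.log(a))        # cross-entropy for this one example
print(a, loss)
```

Because y_oh is one-hot, the sum keeps only the log-probability of the true class, so the loss equals -log(a[0]) here.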

In [11]:

# UNQ_C2 (UNIQUE CELL IDENTIFIER, DO NOT EDIT)
# GRADED FUNCTION: model

def model(X, Y, word_to_vec_map, learning_rate = 0.01, num_iterations = 400):
    """
 Model to train word vector representations in numpy.
    
 Arguments:
 X -- input data, numpy array of sentences as strings, of shape (m, 1)
 Y -- labels, numpy array of integer class indices between 0 and n_y - 1 (0 to 4 for EMOJISET), of shape (m, 1)
 word_to_vec_map -- dictionary mapping every word in a vocabulary into its 50-dimensional vector representation
 learning_rate -- learning_rate for the stochastic gradient descent algorithm
 num_iterations -- number of iterations
    
 Returns:
 pred -- vector of predictions, numpy-array of shape (m, 1)
 W -- weight matrix of the softmax layer, of shape (n_y, n_h)
 b -- bias of the softmax layer, of shape (n_y,)
 """
    
    # Get a valid word contained in the word_to_vec_map 
    any_word = list(word_to_vec_map.keys())[0]
        
    # Define number of training examples
    m = Y.shape[0]                             # number of training examples
    n_y = len(np.unique(Y))                    # number of classes 
    n_h = word_to_vec_map[any_word].shape[0]   # dimensions of the GloVe vectors 
    
    # Initialize parameters using Xavier initialization
    W = np.random.randn(n_y, n_h) / np.sqrt(n_h)
    b = np.zeros((n_y,))
    
    # Convert Y to Y_onehot with n_y classes
    Y_oh = convert_to_one_hot(Y, C = n_y) 
    
    # Optimization loop
    for t in range(num_iterations): # Loop over the number of iterations
        
        cost = 0
        dW = 0
        db = 0
        
        for i in range(m):          # Loop over the training examples
            
            ### START CODE HERE ### (≈ 4 lines of code)
            # Average the word vectors of the words from the i'th training example
            avg = sentence_to_avg(X[i], word_to_vec_map)

            # Forward propagate the avg through the softmax layer. 
            # You can use np.dot() to perform the multiplication.
            z = np.dot(W, avg) + b
            a = softmax(z)

            # Add the cost using the i'th training label's one hot representation and "A" (the output of the softmax)
            cost += -np.sum(Y_oh[i] * np.log(a))
            ### END CODE HERE ###
            
            # Compute gradients 
            dz = a - Y_oh[i]
            dW += np.dot(dz.reshape(n_y,1), avg.reshape(1, n_h))
            db += dz

            # Update parameters with Stochastic Gradient Descent
            W = W - learning_rate * dW
            b = b - learning_rate * db
        
        if t % 100 == 0:
            print("Epoch: " + str(t) + " --- cost = " + str(cost))
            pred = predict(X, Y, W, b, word_to_vec_map) #predict is defined in emo_utils.py

    return pred, W, b
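
The note in Exercise 2 mentions that a more efficient vectorized implementation is possible. One way this might look is sketched below, assuming the sentence averages have already been stacked into a matrix AVG of shape (m, n_h). The sketch performs full-batch gradient descent, which is a different update schedule from the per-example loop in model(), so its numbers will not match the outputs in this notebook. The names softmax_rows and batch_step are hypothetical.

```python
import numpy as np

def softmax_rows(Z):
    """Row-wise softmax, stabilized by subtracting each row's maximum."""
    E = np.exp(Z - Z.max(axis=1, keepdims=True))
    return E / E.sum(axis=1, keepdims=True)

def batch_step(AVG, Y_oh, W, b, learning_rate=0.01):
    """One full-batch gradient step. AVG: (m, n_h), Y_oh: (m, n_y), W: (n_y, n_h), b: (n_y,)."""
    A = softmax_rows(AVG @ W.T + b)            # (m, n_y) class probabilities
    cost = -np.sum(Y_oh * np.log(A))           # summed cross-entropy over the batch
    dZ = A - Y_oh                              # per-example gradient, same formula as dz in model()
    dW = dZ.T @ AVG                            # (n_y, n_h)
    db = dZ.sum(axis=0)                        # (n_y,)
    return W - learning_rate * dW, b - learning_rate * db, cost

rng = np.random.default_rng(0)
AVG = rng.normal(size=(6, 4))                  # toy stand-in for stacked sentence averages
Y_oh = np.eye(3)[[0, 1, 2, 0, 1, 2]]           # toy one-hot labels
W, b = np.zeros((3, 4)), np.zeros(3)
costs = []
for _ in range(50):
    W, b, cost = batch_step(AVG, Y_oh, W, b, learning_rate=0.05)
    costs.append(cost)
print(costs[0], costs[-1])
```

With zero-initialized parameters every class starts with probability 1/3, so the first cost is 6 * log(3); later costs should be lower as the parameters fit the toy data.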

In [12]:

# UNIT TEST
def model_test(target):
    # Create a controlled word to vec map
    word_to_vec_map = {'a': [3, 3], 'synonym_of_a': [3, 3], 'a_nw': [2, 4], 'a_s': [3, 2], 'a_n': [3, 4], 
                       'c': [-2, 1], 'c_n': [-2, 2],'c_ne': [-1, 2], 'c_e': [-1, 1], 'c_se': [-1, 0], 
                       'c_s': [-2, 0], 'c_sw': [-3, 0], 'c_w': [-3, 1], 'c_nw': [-3, 2]
                      }
    # Convert lists to np.arrays
    for key in word_to_vec_map.keys():
        word_to_vec_map[key] = np.array(word_to_vec_map[key])
        
    # Training set. Sentences composed of a_* words will be of class 0 and sentences composed of c_* words will be of class 1
    X = np.asarray(['a a_s synonym_of_a a_n c_sw', 'a a_s a_n c_sw', 'a_s  a a_n', 'synonym_of_a a a_s a_n c_sw', " a_s a_n",
                    " a a_s a_n c ", " a_n  a c c c_e",
                   'c c_nw c_n c c_ne', 'c_e c c_se c_s', 'c_nw c a_s c_e c_e', 'c_e a_nw c_sw', 'c_sw c c_ne c_ne'])
    
    Y = np.asarray([0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1])
    
    np.random.seed(10)
    pred, W, b = model(X, Y, word_to_vec_map, 0.0025, 110)
    
    assert W.shape == (2, 2), "W must be of shape 2 x 2"
    assert np.allclose(pred.transpose(), Y), "Model must give a perfect accuracy"
    assert np.allclose(b[0], -1 * b[1]), "b should be symmetric in this example"
    
    print("\033[92mAll tests passed!")
    
model_test(model)

Epoch: 0 --- cost = 2.603378473480253
Accuracy: 0.9166666666666666
Epoch: 100 --- cost = 0.4732825238878884
Accuracy: 1.0
All tests passed!

Run the next cell to train your model and learn the softmax parameters (W, b). The training process will take about 5 minutes.

In [13]:

np.random.seed(1)
pred, W, b = model(X_train, Y_train, word_to_vec_map)
print(pred)

Epoch: 0 --- cost = 410.4336578831472
Accuracy: 0.5454545454545454
Epoch: 100 --- cost = 63.612639746961435
Accuracy: 0.9318181818181818
Epoch: 200 --- cost = 0.7391301193275178
Accuracy: 1.0
Epoch: 300 --- cost = 0.3104825413333956
Accuracy: 1.0
[[3.]
 [2.]
 [3.]
 [0.]
 [4.]
 [0.]
 [3.]
 [2.]
 [3.]
 [1.]
 [3.]
 [3.]
 [1.]
 [3.]
 [2.]
 [3.]
 [2.]
 [3.]
 [1.]
 [2.]
 [3.]
 [0.]
 [2.]
 [2.]
 [2.]
 [1.]
 [4.]
 [2.]
 [2.]
 [4.]
 [0.]
 [3.]
 [4.]
 [2.]
 [0.]
 [3.]
 [2.]
 [2.]
 [3.]
 [4.]
 [2.]
 [2.]
 [0.]
 [2.]
 [3.]
 [0.]
 [3.]
 [2.]
 [4.]
 [3.]
 [0.]
 [3.]
 [3.]
 [3.]
 [4.]
 [2.]
 [1.]
 [1.]
 [1.]
 [2.]
 [3.]
 [1.]
 [0.]
 [0.]
 [0.]
 [3.]
 [4.]
 [4.]
 [2.]
 [2.]
 [1.]
 [2.]
 [0.]
 [3.]
 [2.]
 [2.]
 [0.]
 [0.]
 [3.]
 [1.]
 [2.]
 [1.]
 [2.]
 [2.]
 [4.]
 [3.]
 [3.]
 [2.]
 [4.]
 [0.]
 [0.]
 [0.]
 [3.]
 [3.]
 [3.]
 [2.]
 [0.]
 [1.]
 [2.]
 [3.]
 [0.]
 [2.]
 [2.]
 [2.]
 [3.]
 [2.]
 [2.]
 [2.]
 [4.]
 [1.]
 [1.]
 [3.]
 [3.]
 [4.]
 [1.]
 [2.]
 [1.]
 [1.]
 [3.]
 [1.]
 [0.]
 [4.]
 [0.]
 [3.]
 [3.]
 [4.]
 [4.]
 [1.]
 [4.]
 [3.]
 [0.]
 [2.]]

Great! Your model has pretty high accuracy on the training set. Now see how it does on the test set:

1.5 - Examining Test Set Performance

Note that the predict function used here is defined in emo_utils.py.

In [14]:

print("Training set:")
pred_train = predict(X_train, Y_train, W, b, word_to_vec_map)
print('Test set:')
pred_test = predict(X_test, Y_test, W, b, word_to_vec_map)

Training set:
Accuracy: 1.0
Test set:
Accuracy: 0.9107142857142857

Note:

Random guessing would have had 20% accuracy, given that there are 5 classes. (1/5 = 20%).
This is pretty good performance after training on only 127 examples.

The Model Matches Emojis to Relevant Words

In the training set, the algorithm saw the sentence

"I love you."

with the label ❤️.

You can check that the word "cherish" does not appear in the training set.
Nonetheless, let's see what happens if you write "I cherish you."

In [15]:

X_my_sentences = np.array(["i cherish you", "i love you", "funny lol", "lets play with a ball", "food is ready", "not feeling happy"])
Y_my_labels = np.array([[0], [0], [2], [1], [4],[3]])

pred = predict(X_my_sentences, Y_my_labels , W, b, word_to_vec_map)
print_predictions(X_my_sentences, pred)

Accuracy: 0.8333333333333334

i cherish you 
i love you ❤️
funny lol 
lets play with a ball ⚾
food is ready 
not feeling happy

Amazing!

Because cherish has an embedding similar to that of love, the algorithm has generalized correctly even to a word it has never seen before.
Words such as heart, dear, beloved or adore have embedding vectors similar to love.
- Feel free to modify the inputs above and try out a variety of input sentences.
- How well does it work?
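
One common way to quantify how "similar" two embeddings are is cosine similarity. The sketch below uses made-up 3-dimensional vectors rather than real GloVe values, so only the pattern matters: related words point in nearly the same direction, and unrelated words do not.

```python
import numpy as np

def cosine_similarity(u, v):
    """Cosine of the angle between u and v: 1 means same direction, 0 means orthogonal."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

# Made-up 3-dimensional vectors, not real GloVe values.
toy_vectors = {
    "love":    np.array([0.9, 0.8, 0.1]),
    "cherish": np.array([0.8, 0.9, 0.2]),
    "ball":    np.array([-0.7, 0.1, 0.9]),
}
sim_related = cosine_similarity(toy_vectors["love"], toy_vectors["cherish"])
sim_unrelated = cosine_similarity(toy_vectors["love"], toy_vectors["ball"])
print(sim_related, sim_unrelated)
```

To try this on the real embeddings, you could pass entries of word_to_vec_map to the same function.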

Word Ordering isn't Considered in this Model

Note that the model doesn't get the following sentence correct:

"not feeling happy"
This algorithm ignores word ordering, so it is not good at understanding phrases like "not happy."
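
A short sketch makes the limitation concrete: averaging is order-invariant, so any reordering of the same words yields the same input to the softmax layer. The vectors below are made up for illustration, and average_vector is a simplified stand-in for sentence_to_avg that assumes at least one word is known.

```python
import numpy as np

def average_vector(sentence, vectors):
    """Average the vectors of the known words in a sentence (assumes at least one known word)."""
    known = [vectors[w] for w in sentence.lower().split() if w in vectors]
    return np.mean(known, axis=0)

# Made-up 2-dimensional vectors, for illustration only.
toy_vectors = {
    "not":     np.array([-1.0, 0.0]),
    "feeling": np.array([0.0, 0.5]),
    "happy":   np.array([1.0, 1.0]),
}
a = average_vector("not feeling happy", toy_vectors)
b = average_vector("happy feeling not", toy_vectors)
print(a, b, np.allclose(a, b))
```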

Confusion Matrix

Printing the confusion matrix can also help understand which classes are more difficult for your model.
A confusion matrix shows how often an example whose label is one class ("actual" class) is mislabeled by the algorithm with a different class ("predicted" class).

Print the confusion matrix below:

In [16]:

# START SKIP FOR GRADING
print(Y_test.shape)
print('           '+ label_to_emoji(0)+ '    ' + label_to_emoji(1) + '    ' +  label_to_emoji(2)+ '    ' + label_to_emoji(3)+'   ' + label_to_emoji(4))
print(pd.crosstab(Y_test, pred_test.reshape(56,), rownames=['Actual'], colnames=['Predicted'], margins=True))
plot_confusion_matrix(Y_test, pred_test)
# END SKIP FOR GRADING

(56,)
           ❤️    ⚾           
Predicted  0.0  1.0  2.0  3.0  4.0  All
Actual                                 
0            6    0    0    1    0    7
1            0    8    0    0    0    8
2            1    0   17    0    0   18
3            1    0    2   13    0   16
4            0    0    0    0    7    7
All          8    8   19   14    7   56
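
plot_confusion_matrix() comes from emo_utils and pd.crosstab from pandas. As a rough illustration of what the table contains, the sketch below counts (actual, predicted) pairs with plain NumPy and then reads accuracy and per-class recall off the counts printed above. The helper name confusion_matrix_sketch is hypothetical.

```python
import numpy as np

def confusion_matrix_sketch(actual, predicted, n_classes):
    """Count how often each actual class (row) receives each predicted class (column)."""
    cm = np.zeros((n_classes, n_classes), dtype=int)
    for a, p in zip(actual, predicted):
        cm[int(a), int(p)] += 1
    return cm

# Counts copied from the table printed above (rows: actual, columns: predicted).
cm = np.array([
    [6, 0,  0,  1, 0],
    [0, 8,  0,  0, 0],
    [1, 0, 17,  0, 0],
    [1, 0,  2, 13, 0],
    [0, 0,  0,  0, 7],
])
accuracy = np.trace(cm) / cm.sum()          # correct predictions lie on the diagonal
recall = np.diag(cm) / cm.sum(axis=1)       # fraction of each actual class predicted correctly
print(accuracy)
print(recall)
```

The diagonal sums to 51 out of 56 examples, which agrees with the test accuracy of about 0.9107 reported in Section 1.5.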

What you should remember:

Even with a mere 127 training examples, you can get a reasonably good model for Emojifying.
- This is due to the generalization power word vectors give you.
Emojifier-V1 will perform poorly on sentences such as "This movie is not good and not enjoyable."
- It doesn't understand combinations of words.
- It just averages all the words' embedding vectors together, without considering the ordering of words.

Not to worry! You will build a better algorithm in the next section!

2 - Emojifier-V2: Using LSTMs in Keras

You're going to build an LSTM model that takes word sequences as input! This model will be able to account for word ordering.

Emojifier-V2 will continue to use pre-trained word embeddings to represent words. You'll feed word embeddings into an LSTM, and the LSTM will learn to predict the most appropriate emoji.

Packages

Run the following cell to load the Keras packages you'll need:

In [17]:

import numpy as np
import tensorflow
np.random.seed(0)
from tensorflow.keras.models import Model
from tensorflow.keras.layers import Dense, Input, Dropout, LSTM, Activation
from tensorflow.keras.layers import Embedding
from tensorflow.keras.preprocessing import sequence
from tensorflow.keras.initializers import glorot_uniform
np.random.seed(1)

2.1 - Model Overview

Here is the Emojifier-v2 you will implement:

Figure 3: Emojifier-V2. A 2-layer LSTM sequence classifier.

2.2 - Keras and Mini-batching

In this exercise, you want to train Keras using mini-batches. However, most deep learning frameworks require that all sequences in the same mini-batch have the same length.

This is what allows vectorization to work: If you had a 3-word sentence and a 4-word sentence, then the computations needed for them are different (one takes 3 steps of an LSTM, one takes 4 steps) so it's just not possible to do them both at the same time.

Padding Handles Sequences of Varying Length

The common solution to handling sequences of different length is to use padding. Specifically:
- Set a maximum sequence length
- Pad all sequences to have the same length.

Example of Padding:

Given a maximum sequence length of 20, you could pad every sentence with "0"s so that each input sentence is of length 20.
Thus, the sentence "I love you" would be represented as $(e_{I}, e_{love}, e_{you}, \vec{0}, \vec{0}, \ldots, \vec{0})$.
In this example, any sentences longer than 20 words would have to be truncated.
One way to choose the maximum sequence length is to just pick the length of the longest sentence in the training set.
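
In practice, padding is usually applied to the word indices before the embedding lookup. Here is a minimal sketch of padding and truncating lists of indices to a fixed length; the function name pad_indices and the index values are made up for illustration.

```python
import numpy as np

def pad_indices(index_lists, max_len):
    """Right-pad each list of word indices with 0 up to max_len, truncating longer lists."""
    padded = np.zeros((len(index_lists), max_len), dtype=int)
    for i, indices in enumerate(index_lists):
        kept = indices[:max_len]               # drop anything beyond max_len
        padded[i, :len(kept)] = kept           # remaining positions stay 0
    return padded

# Hypothetical word indices for a 3-word and a 6-word sentence.
batch = pad_indices([[7, 12, 3], [5, 9, 2, 8, 1, 4]], max_len=5)
print(batch)
```

The first row gains two trailing zeros, and the second row is cut to its first five indices, so both fit in one rectangular array.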

2.3 - The Embedding Layer

In Keras, the embedding matrix is represented as a "layer."

The embedding matrix maps word indices to embedding vectors.
- The word indices are positive integers.
- The embedding vectors are dense vectors of fixed size.
- A "dense" vector is the opposite of a sparse vector. It means that most of its values are non-zero. As a counter-example, a one-hot encoded vector is not "dense."
The embedding matrix can be derived in two ways:
- Training a model to derive the embeddings from scratch.
- Using a pretrained embedding.

Using and Updating Pre-trained Embeddings

In this section, you'll create an Embedding() layer in Keras.

You will initialize the Embedding layer with GloVe 50-dimensional vectors.
In the code below, you'll observe how Keras allows you to either train or leave this layer fixed.
- Because your training set is quite small, you'll leave the GloVe embeddings fixed instead of updating them.

Inputs and Outputs to the Embedding Layer

The Embedding() layer's input is an integer matrix of size (batch size, max input length).
- This input corresponds to sentences converted into lists of indices (integers).
- The largest integer (the highest word index) in the input should be no larger than the vocabulary size.
The embedding layer outputs an array of shape (batch size, max input length, dimension of word vectors).
The figure shows the propagation of two example sentences through the embedding layer.
- Both examples have been zero-padded to a length of max_len=5.
- The word embeddings are 50 units in length.
- The final dimension of the representation is (2,max_len,50).

Figure 4: Embedding layer
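
Conceptually, the embedding layer is a table lookup. The NumPy sketch below is not the Keras implementation; it only reproduces the shapes described above, using a made-up embedding matrix and the assumption that row 0 is reserved for padding.

```python
import numpy as np

vocab_size, emb_dim, max_len = 10, 50, 5

rng = np.random.default_rng(0)
emb_matrix = rng.normal(size=(vocab_size, emb_dim))   # made-up embeddings, one row per index
emb_matrix[0] = 0.0                                    # assumption: row 0 is the padding row

# Two zero-padded sentences as index rows, shape (batch size, max_len).
X_indices = np.array([[4, 2, 7, 0, 0],
                      [1, 9, 3, 5, 0]])

embedded = emb_matrix[X_indices]                       # integer-array indexing does the lookup
print(embedded.shape)                                  # (2, 5, 50)
```

Each index is replaced by its row of the matrix, so padded positions become zero vectors here.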

Prepare the Input Sentences

Exercise 3 - sentences_to_indices

Implement sentences_to_indices

This function processes an array of sentences X and returns inputs to the embedding layer:

Convert each training sentence into a list of indices (the indices correspond to each word in the sentence)
Zero-pad all these lists so that their length is the length of the longest sentence.

Additional Hints:

Note that you may have considered using the enumerate() function in the for loop, but for the purposes of passing the autograder, please follow the starter code by initializing and incrementing j explicitly.

In [18]:

for idx, val in enumerate(["I", "like", "learning"]):
    print(idx, val)

0 I
1 like
2 learning

In [19]:

# UNQ_C3 (UNIQUE CELL IDENTIFIER, DO NOT EDIT)
# GRADED FUNCTION: sentences_to_indices

def sentences_to_indices(X, word_to_index, max_len):
    """
    Converts an array of sentences (strings) into an array of indices corresponding to words in the sentences.
    The output shape should be such that it can be given to `Embedding()` (described in Figure 4).

    Arguments:
    X -- array of sentences (strings), of shape (m, 1)
    word_to_index -- a dictionary containing each word mapped to its index
    max_len -- maximum number of words in a sentence. You can assume every sentence in X is no longer than this.

    Returns:
    X_indices -- array of indices corresponding to words in the sentences from X, of shape (m, max_len)
    """

    m = X.shape[0]                                   # number of training examples

    ### START CODE HERE ###
    # Initialize X_indices as a numpy matrix of zeros and the correct shape (≈ 1 line)
    X_indices = np.zeros((m, max_len))

    for i in range(m):                               # loop over training examples

        # Convert the ith training sentence to lower case and split it into words. You should get a list of words.
        sentence_words = X[i].lower().split()

        # Initialize j to 0
        j = 0

        # Loop over the words of sentence_words
        for w in sentence_words:
            # if w exists in the word_to_index dictionary
            if w in word_to_index:
                # Set the (i,j)th entry of X_indices to the index of the correct word.
                X_indices[i, j] = word_to_index[w]
                # Increment j to j + 1
                j = j + 1

    ### END CODE HERE ###

    return X_indices

In [20]:

# UNIT TEST
def sentences_to_indices_test(target):

    # Create a word_to_index dictionary
    word_to_index = {}
    for idx, val in enumerate(["i", "like", "learning", "deep", "machine", "love", "smile", '´0.=']):
        word_to_index[val] = idx + 1

    max_len = 4
    sentences = np.array(["I like deep learning", "deep ´0.= love machine", "machine learning smile"])
    indexes = target(sentences, word_to_index, max_len)
    print(indexes)

    assert type(indexes) == np.ndarray, "Wrong type. Use np arrays in the function"
    assert indexes.shape == (sentences.shape[0], max_len), "Wrong shape of output matrix"
    assert np.allclose(indexes, [[1, 2, 4, 3],
                                 [4, 8, 6, 5],
                                 [5, 3, 7, 0]]), "Wrong values. Debug with the given examples"

    print("\033[92mAll tests passed!")

sentences_to_indices_test(sentences_to_indices)

[[1. 2. 4. 3.]
 [4. 8. 6. 5.]
 [5. 3. 7. 0.]]
All tests passed!

Expected value

[[1, 2, 4, 3], [4, 8, 6, 5], [5, 3, 7, 0]]

Run the following cell to check what sentences_to_indices() does, and take a look at your results.

In [21]:

X1 = np.array(["funny lol", "lets play baseball", "food is ready for you"])
X1_indices = sentences_to_indices(X1, word_to_index, max_len=5)
print("X1 =", X1)
print("X1_indices =\n", X1_indices)

X1 = ['funny lol' 'lets play baseball' 'food is ready for you']
X1_indices =
 [[155345. 225122.      0.      0.      0.]
 [220930. 286375.  69714.      0.      0.]
 [151204. 192973. 302254. 151349. 394475.]]

Build Embedding Layer

Now you'll build the Embedding() layer in Keras, using pre-trained word vectors.

The embedding layer takes as input a list of word indices.
- sentences_to_indices() creates these word indices.
The embedding layer will return the word embeddings for a sentence.

Exercise 4 - pretrained_embedding_layer

Implement pretrained_embedding_layer() with these steps:

Initialize the embedding matrix as a numpy array of zeros.
- The embedding matrix has a row for each unique word in the vocabulary.
  - There is one additional row to handle "unknown" words.
  - So vocab_size is the number of unique words plus one.
- Each row will store the vector representation of one word.
  - For example, one row may be 50 positions long if using GloVe word vectors.
- In the code below, emb_dim represents the length of a word embedding.
Fill in each row of the embedding matrix with the vector representation of a word
- Each word in word_to_index is a string.
- word_to_vec_map is a dictionary where the keys are strings and the values are the word vectors.
Define the Keras embedding layer.
- Use Embedding().
- The input dimension is equal to the vocabulary length (number of unique words plus one).
- The output dimension is equal to the number of positions in a word embedding.
- Make this layer's embeddings fixed.
  - If you were to set trainable = True, then it will allow the optimization algorithm to modify the values of the word embeddings.
  - In this case, you don't want the model to modify the word embeddings.
Set the embedding weights to be equal to the embedding matrix.
- Note that this part of the code is already completed for you and does not need to be modified!

In [22]:

# UNQ_C4 (UNIQUE CELL IDENTIFIER, DO NOT EDIT)
# GRADED FUNCTION: pretrained_embedding_layer

def pretrained_embedding_layer(word_to_vec_map, word_to_index):
    """
    Creates a Keras Embedding() layer and loads in pre-trained GloVe 50-dimensional vectors.

    Arguments:
    word_to_vec_map -- dictionary mapping words to their GloVe vector representation.
    word_to_index -- dictionary mapping from words to their indices in the vocabulary (400,001 words)

    Returns:
    embedding_layer -- pretrained layer Keras instance
    """

    vocab_size = len(word_to_index) + 1              # adding 1 to fit Keras embedding (requirement)
    any_word = list(word_to_vec_map.keys())[0]
    emb_dim = word_to_vec_map[any_word].shape[0]     # define dimensionality of your GloVe word vectors (= 50)

    ### START CODE HERE ###
    # Step 1
    # Initialize the embedding matrix as a numpy array of zeros.
    # See instructions above to choose the correct shape.
    emb_matrix = np.zeros((vocab_size, emb_dim))

    # Step 2
    # Set each row "idx" of the embedding matrix to be
    # the word vector representation of the idx'th word of the vocabulary
    for word, idx in word_to_index.items():
        emb_matrix[idx, :] = word_to_vec_map[word]

    # Step 3
    # Define Keras embedding layer with the correct input and output sizes
    # Make it non-trainable.
    embedding_layer = Embedding(vocab_size, emb_dim, trainable=False)
    ### END CODE HERE ###

    # Step 4 (already done for you; please do not modify)
    # Build the embedding layer, it is required before setting the weights of the embedding layer.
    embedding_layer.build((None,)) # Do not modify the "None". This line of code is complete as-is.

    # Set the weights of the embedding layer to the embedding matrix. Your layer is now pretrained.
    embedding_layer.set_weights([emb_matrix])

    return embedding_layer

In [23]:

# UNIT TEST
def pretrained_embedding_layer_test(target):
    # Create a controlled word to vec map
    word_to_vec_map = {'a': [3, 3], 'synonym_of_a': [3, 3], 'a_nw': [2, 4], 'a_s': [3, 2], 'a_n': [3, 4],
                       'c': [-2, 1], 'c_n': [-2, 2], 'c_ne': [-1, 2], 'c_e': [-1, 1], 'c_se': [-1, 0],
                       'c_s': [-2, 0], 'c_sw': [-3, 0], 'c_w': [-3, 1], 'c_nw': [-3, 2]
                      }
    # Convert lists to np.arrays
    for key in word_to_vec_map.keys():
        word_to_vec_map[key] = np.array(word_to_vec_map[key])

    # Create a word_to_index dictionary
    word_to_index = {}
    for idx, val in enumerate(list(word_to_vec_map.keys())):
        word_to_index[val] = idx

    np.random.seed(1)
    embedding_layer = target(word_to_vec_map, word_to_index)

    assert type(embedding_layer) == Embedding, "Wrong type"
    assert embedding_layer.input_dim == len(list(word_to_vec_map.keys())) + 1, "Wrong input shape"
    assert embedding_layer.output_dim == len(word_to_vec_map['a']), "Wrong output shape"
    assert np.allclose(embedding_layer.get_weights(),
                       [[[ 3, 3], [ 3, 3], [ 2, 4], [ 3, 2], [ 3, 4],
                         [-2, 1], [-2, 2], [-1, 2], [-1, 1], [-1, 0],
                         [-2, 0], [-3, 0], [-3, 1], [-3, 2], [ 0, 0]]]), "Wrong values"

    print("\033[92mAll tests passed!")

pretrained_embedding_layer_test(pretrained_embedding_layer)

All tests passed!

In [24]:

embedding_layer = pretrained_embedding_layer(word_to_vec_map, word_to_index)
print("weights[0][1][1] =", embedding_layer.get_weights()[0][1][1])
print("Input_dim", embedding_layer.input_dim)
print("Output_dim",embedding_layer.output_dim)

weights[0][1][1] = 0.39031
Input_dim 400001
Output_dim 50

2.4 - Building the Emojifier-V2

Now you're ready to build the Emojifier-V2 model, in which you feed the embedding layer's output to an LSTM network!

Figure 3: Emojifier-v2. A 2-layer LSTM sequence classifier.

Exercise 5 - Emojify_V2

Implement Emojify_V2()

This function builds a Keras graph of the architecture shown in Figure (3).

The model takes as input an array of sentences of shape (m, max_len, ) defined by input_shape.
The model outputs a softmax probability vector of shape (m, C = 5).
You may need to use the following Keras layers:
- Input()
  - Set the shape and dtype parameters.
  - The inputs are integers, so you can specify the data type as a string, 'int32'.
- LSTM()
  - Set the units and return_sequences parameters.
- Dropout()
  - Set the rate parameter.
- Dense()
  - Set the units parameter.
  - Note that Dense() has an activation parameter. For the purposes of passing the autograder, please do not set the activation within Dense(). Use the separate Activation layer to do so.
- Activation()
  - You can pass in the activation of your choice as a lowercase string.
- Model()
  - Set inputs and outputs.

Additional Hints

Remember that these Keras layers return an object, and you will feed in the outputs of the previous layer as the input arguments to that object. The returned object can be created and called in the same line.

# How to use Keras layers in two lines of code
dense_object = Dense(units = ...)
X = dense_object(inputs)

# How to use Keras layers in one line of code
X = Dense(units = ...)(inputs)

The embedding_layer that is returned by pretrained_embedding_layer is a layer object that can be called as a function, passing in a single argument (sentence indices).

Here is some sample code in case you're stuck:

raw_inputs = Input(shape=(maxLen,), dtype='int32')
preprocessed_inputs = ... # some pre-processing
X = LSTM(units = ..., return_sequences= ...)(preprocessed_inputs)
X = Dropout(rate = ..., )(X)
...
X = Dense(units = ...)(X)
X = Activation(...)(X)
model = Model(inputs=..., outputs=...)
...

In [25]:

# UNQ_C5 (UNIQUE CELL IDENTIFIER, DO NOT EDIT)
# GRADED FUNCTION: Emojify_V2

def Emojify_V2(input_shape, word_to_vec_map, word_to_index):
    """
    Function creating the Emojify-v2 model's graph.

    Arguments:
    input_shape -- shape of the input, usually (max_len,)
    word_to_vec_map -- dictionary mapping every word in a vocabulary into its 50-dimensional vector representation
    word_to_index -- dictionary mapping from words to their indices in the vocabulary (400,001 words)

    Returns:
    model -- a model instance in Keras
    """

    ### START CODE HERE ###
    # Define sentence_indices as the input of the graph.
    # It should be of shape input_shape and dtype 'int32' (as it contains indices, which are integers).
    sentence_indices = Input(input_shape, dtype='int32')

    # Create the embedding layer pretrained with GloVe Vectors (≈1 line)
    embedding_layer = pretrained_embedding_layer(word_to_vec_map, word_to_index)

    # Propagate sentence_indices through your embedding layer
    # (See additional hints in the instructions).
    embeddings = embedding_layer(sentence_indices)

    # Propagate the embeddings through an LSTM layer with 128-dimensional hidden state
    # The returned output should be a batch of sequences.
    X = LSTM(128, return_sequences=True)(embeddings)
    # Add dropout with a probability of 0.5
    X = Dropout(0.5)(X)
    # Propagate X through another LSTM layer with 128-dimensional hidden state
    # The returned output should be a single hidden state, not a batch of sequences.
    X = LSTM(128, return_sequences=False)(X)
    # Add dropout with a probability of 0.5
    X = Dropout(0.5)(X)
    # Propagate X through a Dense layer with 5 units
    X = Dense(5)(X)
    # Add a softmax activation
    X = Activation('softmax')(X)

    # Create Model instance which converts sentence_indices into X.
    model = Model(sentence_indices, X)
    ### END CODE HERE ###

    return model

In [26]:

# UNIT TEST
def Emojify_V2_test(target):
    # Create a controlled word to vec map
    word_to_vec_map = {'a': [3, 3], 'synonym_of_a': [3, 3], 'a_nw': [2, 4], 'a_s': [3, 2], 'a_n': [3, 4],
                       'c': [-2, 1], 'c_n': [-2, 2], 'c_ne': [-1, 2], 'c_e': [-1, 1], 'c_se': [-1, 0],
                       'c_s': [-2, 0], 'c_sw': [-3, 0], 'c_w': [-3, 1], 'c_nw': [-3, 2]
                      }
    # Convert lists to np.arrays
    for key in word_to_vec_map.keys():
        word_to_vec_map[key] = np.array(word_to_vec_map[key])

    # Create a word_to_index dictionary
    word_to_index = {}
    for idx, val in enumerate(list(word_to_vec_map.keys())):
        word_to_index[val] = idx

    maxLen = 4
    model = target((maxLen,), word_to_vec_map, word_to_index)

    expectedModel = [['InputLayer', [(None, 4)], 0],
                     ['Embedding', (None, 4, 2), 30],
                     ['LSTM', (None, 4, 128), 67072, (None, 4, 2), 'tanh', True],
                     ['Dropout', (None, 4, 128), 0, 0.5],
                     ['LSTM', (None, 128), 131584, (None, 4, 128), 'tanh', False],
                     ['Dropout', (None, 128), 0, 0.5],
                     ['Dense', (None, 5), 645, 'linear'],
                     ['Activation', (None, 5), 0]]

    comparator(summary(model), expectedModel)


Emojify_V2_test(Emojify_V2)

All tests passed!
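The two LSTM layers in Emojify_V2 differ only in return_sequences, which is why the unit test expects output shapes (None, 4, 128) and (None, 128). The sketch below illustrates that difference with a toy tanh recurrence written in NumPy. It is deliberately not an LSTM, and the function name simple_rnn, its random weights and its seed argument are hypothetical choices made only to show the shapes.

```python
import numpy as np

def simple_rnn(x, units, return_sequences, seed=0):
    """Toy tanh recurrence (not an LSTM), used only to illustrate output shapes."""
    rng = np.random.default_rng(seed)
    m, T, n_x = x.shape
    Wx = rng.normal(scale=0.1, size=(n_x, units))    # input-to-hidden weights
    Wh = rng.normal(scale=0.1, size=(units, units))  # hidden-to-hidden weights
    h = np.zeros((m, units))
    states = []
    for t in range(T):
        h = np.tanh(x[:, t, :] @ Wx + h @ Wh)        # one hidden state per time step
        states.append(h)
    all_states = np.stack(states, axis=1)            # shape (m, T, units)
    # return_sequences=True keeps every step; False keeps only the last one.
    return all_states if return_sequences else all_states[:, -1, :]

x = np.ones((3, 4, 2))   # 3 sentences, 4 time steps, 2-dimensional embeddings
seq = simple_rnn(x, units=128, return_sequences=True)
last = simple_rnn(x, units=128, return_sequences=False)
print(seq.shape, last.shape)  # (3, 4, 128) (3, 128)
```

The first LSTM has to return the full sequence because the second LSTM consumes one vector per time step; the second returns only its final state because the Dense layer classifies the sentence as a whole.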

Run the following cell to create your model and check its summary.

Because no sentence in the dataset is longer than 10 words, max_len = 10 was chosen.
You should see that your architecture uses 20,223,927 parameters, of which 20,000,050 (the word embeddings) are non-trainable, with the remaining 223,877 being trainable.
The embedding matrix has 400,001 rows (valid indices from 0 to 400,000) and 50 columns, which gives 400,001*50 = 20,000,050 non-trainable parameters.

In [27]:

model = Emojify_V2((maxLen,), word_to_vec_map, word_to_index)
model.summary()

Model: "functional_3"
_________________________________________________________________
Layer (type)                 Output Shape              Param #
=================================================================
input_2 (InputLayer)         [(None, 10)]              0
_________________________________________________________________
embedding_3 (Embedding)      (None, 10, 50)            20000050
_________________________________________________________________
lstm_2 (LSTM)                (None, 10, 128)           91648
_________________________________________________________________
dropout_2 (Dropout)          (None, 10, 128)           0
_________________________________________________________________
lstm_3 (LSTM)                (None, 128)               131584
_________________________________________________________________
dropout_3 (Dropout)          (None, 128)               0
_________________________________________________________________
dense_1 (Dense)              (None, 5)                 645
_________________________________________________________________
activation_1 (Activation)    (None, 5)                 0
=================================================================
Total params: 20,223,927
Trainable params: 223,877
Non-trainable params: 20,000,050
_________________________________________________________________
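The parameter counts in the summary can be reproduced by hand. The short sketch below uses the standard count for an LSTM layer (four weight blocks, each with input weights, recurrent weights and a bias) and for a Dense layer; the helper names lstm_params and dense_params are ours, introduced only for this check.

```python
def lstm_params(input_dim, units):
    # Four blocks (three gates plus the candidate cell state), each with
    # input weights, recurrent weights and a bias vector.
    return 4 * (units * (input_dim + units) + units)

def dense_params(input_dim, units):
    # Weight matrix plus bias vector.
    return input_dim * units + units

embedding = 400_001 * 50          # frozen GloVe matrix
lstm_1 = lstm_params(50, 128)     # first LSTM reads 50-dimensional embeddings
lstm_2 = lstm_params(128, 128)    # second LSTM reads 128-dimensional hidden states
dense = dense_params(128, 5)
trainable = lstm_1 + lstm_2 + dense

print(embedding, lstm_1, lstm_2, dense)   # 20000050 91648 131584 645
print(trainable, embedding + trainable)   # 223877 20223927
```

The same formula with a 2-dimensional input gives 67,072 for the first LSTM, which matches the value expected by the unit test.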

Compile the Model

As usual, after creating your model in Keras, you need to compile it and define what loss, optimizer and metrics you want to use. Compile your model using categorical_crossentropy loss, adam optimizer and ['accuracy'] metrics:

In [28]:

model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=['accuracy'])
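For readers who want to see what the categorical_crossentropy loss measures, here is a minimal NumPy sketch of softmax followed by cross-entropy against one-hot labels. It is an illustration of the formula under our own simplifications (the clipping constant eps is an assumption), not a copy of the Keras implementation.

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)   # subtract the row max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def categorical_crossentropy(y_onehot, probs, eps=1e-7):
    # Mean over examples of -sum_k y_k * log(p_k); eps avoids log(0).
    probs = np.clip(probs, eps, 1.0)
    return float(-np.mean(np.sum(y_onehot * np.log(probs), axis=1)))

logits = np.zeros((2, 5))                  # a model that is indifferent between the 5 classes
y = np.eye(5)[[0, 3]]                      # one-hot labels for classes 0 and 3
loss = categorical_crossentropy(y, softmax(logits))
print(round(loss, 4))                      # 1.6094, i.e. ln(5)
```

A loss near ln(5) ≈ 1.61 is therefore what a 5-class model that has not learned anything yet would be expected to show.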

2.5 - Train the Model

It's time to train your model! Your Emojifier-V2 model takes as input an array of shape (m, max_len) and outputs probability vectors of shape (m, number of classes). Thus, you have to convert X_train (array of sentences as strings) to X_train_indices (array of sentences as list of word indices), and Y_train (labels as indices) to Y_train_oh (labels as one-hot vectors).

In [29]:

X_train_indices = sentences_to_indices(X_train, word_to_index, maxLen)
Y_train_oh = convert_to_one_hot(Y_train, C = 5)
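The label conversion can be sketched in one line of NumPy. The helper below, to_one_hot, is a hypothetical stand-in: convert_to_one_hot comes from emo_utils and its source is not shown here, so treat this only as a plausible equivalent.

```python
import numpy as np

def to_one_hot(labels, num_classes):
    """Hypothetical helper: row i is the one-hot vector for labels[i]."""
    return np.eye(num_classes)[np.asarray(labels).reshape(-1)]

Y = np.array([0, 2, 4])
Y_oh = to_one_hot(Y, num_classes=5)
print(Y_oh.shape)   # (3, 5)
print(Y_oh[1])      # [0. 0. 1. 0. 0.]
```

Indexing the identity matrix with the label array picks out one row per example, which gives the (m, 5) target shape that the softmax output is compared against.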

Fit the Keras model on X_train_indices and Y_train_oh, using epochs = 50 and batch_size = 32.

In [30]:

model.fit(X_train_indices, Y_train_oh, epochs = 50, batch_size = 32, shuffle=True)

Epoch 1/50
5/5 [==============================] - 0s 28ms/step - loss: 1.5794 - accuracy: 0.2803
Epoch 2/50
5/5 [==============================] - 0s 38ms/step - loss: 1.5043 - accuracy: 0.3333
Epoch 3/50
5/5 [==============================] - 0s 25ms/step - loss: 1.4934 - accuracy: 0.3409
Epoch 4/50
5/5 [==============================] - 0s 35ms/step - loss: 1.3900 - accuracy: 0.5152
Epoch 5/50
5/5 [==============================] - 0s 35ms/step - loss: 1.3060 - accuracy: 0.5455
Epoch 6/50
5/5 [==============================] - 0s 23ms/step - loss: 1.2157 - accuracy: 0.5758
Epoch 7/50
5/5 [==============================] - 0s 24ms/step - loss: 1.1345 - accuracy: 0.5985
Epoch 8/50
5/5 [==============================] - 0s 24ms/step - loss: 0.9905 - accuracy: 0.6742
Epoch 9/50
5/5 [==============================] - 0s 35ms/step - loss: 0.8956 - accuracy: 0.6970
Epoch 10/50
5/5 [==============================] - 0s 22ms/step - loss: 0.8450 - accuracy: 0.6970
Epoch 11/50
5/5 [==============================] - 0s 23ms/step - loss: 0.7811 - accuracy: 0.6894
Epoch 12/50
5/5 [==============================] - 0s 23ms/step - loss: 0.6928 - accuracy: 0.7576
Epoch 13/50
5/5 [==============================] - 0s 35ms/step - loss: 0.5783 - accuracy: 0.8030
Epoch 14/50
5/5 [==============================] - 0s 23ms/step - loss: 0.4974 - accuracy: 0.8561
Epoch 15/50
5/5 [==============================] - 0s 25ms/step - loss: 0.4102 - accuracy: 0.8712
Epoch 16/50
5/5 [==============================] - 0s 23ms/step - loss: 0.4159 - accuracy: 0.8485
Epoch 17/50
5/5 [==============================] - 0s 35ms/step - loss: 0.5112 - accuracy: 0.7879
Epoch 18/50
5/5 [==============================] - 0s 22ms/step - loss: 0.3243 - accuracy: 0.8939
Epoch 19/50
5/5 [==============================] - 0s 24ms/step - loss: 0.3595 - accuracy: 0.8788
Epoch 20/50
5/5 [==============================] - 0s 35ms/step - loss: 0.3827 - accuracy: 0.8712
Epoch 21/50
5/5 [==============================] - 0s 35ms/step - loss: 0.3565 - accuracy: 0.8788
Epoch 22/50
5/5 [==============================] - 0s 22ms/step - loss: 0.3054 - accuracy: 0.9015
Epoch 23/50
5/5 [==============================] - 0s 23ms/step - loss: 0.3084 - accuracy: 0.8636
Epoch 24/50
5/5 [==============================] - 0s 34ms/step - loss: 0.2696 - accuracy: 0.9091
Epoch 25/50
5/5 [==============================] - 0s 23ms/step - loss: 0.2094 - accuracy: 0.9394
Epoch 26/50
5/5 [==============================] - 0s 23ms/step - loss: 0.2124 - accuracy: 0.9318
Epoch 27/50
5/5 [==============================] - 0s 23ms/step - loss: 0.2007 - accuracy: 0.9242
Epoch 28/50
5/5 [==============================] - 0s 34ms/step - loss: 0.1550 - accuracy: 0.9470
Epoch 29/50
5/5 [==============================] - 0s 22ms/step - loss: 0.1396 - accuracy: 0.9545
Epoch 30/50
5/5 [==============================] - 0s 23ms/step - loss: 0.1508 - accuracy: 0.9394
Epoch 31/50
5/5 [==============================] - 0s 34ms/step - loss: 0.0971 - accuracy: 0.9773
Epoch 32/50
5/5 [==============================] - 0s 22ms/step - loss: 0.0560 - accuracy: 0.9924
Epoch 33/50
5/5 [==============================] - 0s 23ms/step - loss: 0.0528 - accuracy: 0.9848
Epoch 34/50
5/5 [==============================] - 0s 35ms/step - loss: 0.0722 - accuracy: 0.9773
Epoch 35/50
5/5 [==============================] - 0s 22ms/step - loss: 0.1071 - accuracy: 0.9621
Epoch 36/50
5/5 [==============================] - 0s 23ms/step - loss: 0.0903 - accuracy: 0.9773
Epoch 37/50
5/5 [==============================] - 0s 23ms/step - loss: 0.0616 - accuracy: 0.9773
Epoch 38/50
5/5 [==============================] - 0s 34ms/step - loss: 0.0537 - accuracy: 0.9848
Epoch 39/50
5/5 [==============================] - 0s 23ms/step - loss: 0.0815 - accuracy: 0.9773
Epoch 40/50
5/5 [==============================] - 0s 24ms/step - loss: 0.0539 - accuracy: 0.9924
Epoch 41/50
5/5 [==============================] - 0s 23ms/step - loss: 0.0712 - accuracy: 0.9773
Epoch 42/50
5/5 [==============================] - 0s 35ms/step - loss: 0.4644 - accuracy: 0.9091
Epoch 43/50
5/5 [==============================] - 0s 23ms/step - loss: 0.1897 - accuracy: 0.9394
Epoch 44/50
5/5 [==============================] - 0s 24ms/step - loss: 0.2394 - accuracy: 0.9242
Epoch 45/50
5/5 [==============================] - 0s 24ms/step - loss: 0.4693 - accuracy: 0.7955
Epoch 46/50
5/5 [==============================] - 0s 35ms/step - loss: 0.3220 - accuracy: 0.8636
Epoch 47/50
5/5 [==============================] - 0s 23ms/step - loss: 0.2777 - accuracy: 0.8939
Epoch 48/50
5/5 [==============================] - 0s 23ms/step - loss: 0.2876 - accuracy: 0.8864
Epoch 49/50
5/5 [==============================] - 0s 24ms/step - loss: 0.1491 - accuracy: 0.9621
Epoch 50/50
5/5 [==============================] - 0s 35ms/step - loss: 0.1420 - accuracy: 0.9697


Your model should reach roughly 90% to 100% accuracy on the training set. The exact accuracy may vary from run to run.

Run the following cell to evaluate your model on the test set:

In [31]:

X_test_indices = sentences_to_indices(X_test, word_to_index, max_len = maxLen)
Y_test_oh = convert_to_one_hot(Y_test, C = 5)
loss, acc = model.evaluate(X_test_indices, Y_test_oh)
print()
print("Test accuracy = ", acc)

2/2 [==============================] - 0s 3ms/step - loss: 0.2834 - accuracy: 0.8750

Test accuracy = 0.875

You should get a test accuracy between 80% and 95%. Run the cell below to see the mislabelled examples:

In [32]:

# This code allows you to see the mislabelled examples
C = 5
y_test_oh = np.eye(C)[Y_test.reshape(-1)]
X_test_indices = sentences_to_indices(X_test, word_to_index, maxLen)
pred = model.predict(X_test_indices)
for i in range(len(X_test)):
    x = X_test_indices
    num = np.argmax(pred[i])
    if(num != Y_test[i]):
        print('Expected emoji:'+ label_to_emoji(Y_test[i]) + ' prediction: '+ X_test[i] + label_to_emoji(num).strip())

Expected emoji: prediction: she got me a nice present ❤️
Expected emoji: prediction: This girl is messing with me ❤️
Expected emoji: prediction: you brighten my day ❤️
Expected emoji:⚾ prediction: enjoy your game
Expected emoji: prediction: will you be my valentine ❤️
Expected emoji: prediction: go away ⚾
Expected emoji:❤️ prediction: family is all I have
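The loop above boils down to taking the argmax of each probability row and comparing it with the true label. Here is a vectorised sketch of the same idea on made-up numbers; the pred and labels arrays are hypothetical and do not come from the model.

```python
import numpy as np

# Hypothetical softmax outputs for 3 sentences and 5 classes.
pred = np.array([[0.70, 0.10, 0.10, 0.05, 0.05],
                 [0.10, 0.20, 0.60, 0.05, 0.05],
                 [0.20, 0.50, 0.10, 0.10, 0.10]])
labels = np.array([0, 2, 3])

predicted = pred.argmax(axis=1)                 # most probable class per row
wrong = np.flatnonzero(predicted != labels)     # indices of the mislabelled examples
accuracy = float(np.mean(predicted == labels))

print(wrong, round(accuracy, 3))                # [2] 0.667
```

The same arithmetic ties the two outputs of this run together: seven mislabelled sentences out of 56 test examples leave 49 correct, and 49/56 = 0.875.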

Now you can try it on your own example! Write your own sentence below:

In [33]:

# Change the sentence below to see your prediction. Make sure all the words are in the Glove embeddings.
x_test = np.array(['I cannot play'])
X_test_indices = sentences_to_indices(x_test, word_to_index, maxLen)
print(x_test[0] +' '+ label_to_emoji(np.argmax(model.predict(X_test_indices))))

I cannot play ⚾

LSTM Version Accounts for Word Order

The Emojifier-V1 model did not label "not feeling happy" correctly, but your implementation of Emojifier-V2 got it right!
- If it didn't, keep in mind that Keras results vary slightly from run to run (weights are initialized randomly and the training data is shuffled), which is the most likely reason.
The current model still isn't very robust at understanding negation (such as "not happy").
- This is because the training set is small and doesn't have a lot of examples of negation.
- If the training set were larger, the LSTM model would be much better than the Emojify-V1 model at understanding more complex sentences.
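One way to see why the averaging baseline cannot account for word order: the mean of a set of word vectors is the same for every ordering of the words. The sketch below uses made-up 2-dimensional vectors and a hypothetical helper, sentence_to_avg_toy, that mirrors the idea of Exercise 1 without being the graded function.

```python
import numpy as np

# Made-up 2-dimensional word vectors (not GloVe values).
vectors = {
    "not": np.array([-1.0, 0.5]),
    "feeling": np.array([0.2, 0.1]),
    "happy": np.array([0.9, 0.8]),
}

def sentence_to_avg_toy(sentence):
    """Hypothetical helper: average the vectors of the words in the sentence."""
    return np.mean([vectors[w] for w in sentence.lower().split()], axis=0)

a = sentence_to_avg_toy("not feeling happy")
b = sentence_to_avg_toy("happy feeling not")
print(np.allclose(a, b))  # True
```

Any classifier built on top of that average receives identical input for both sentences, whereas an LSTM processes the words one step at a time and can, given enough training examples, react to where "not" appears.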

Congratulations!

You've completed this notebook, and harnessed the power of LSTMs to make your words more emotive! ❤️❤️❤️

By now, you've:

Created an embedding matrix
Observed how negative sampling learns word vectors more efficiently than other methods
Experienced the advantages and disadvantages of the GloVe algorithm
And built a sentiment classifier using word embeddings!

Cool! (or Emojified: )

What you should remember:

If you have an NLP task where the training set is small, using word embeddings can help your algorithm significantly.
Word embeddings allow your model to work on words in the test set that may not even appear in the training set.
Training sequence models in Keras (and in most other deep learning frameworks) requires a few important details:
- To use mini-batches, the sequences need to be padded so that all the examples in a mini-batch have the same length.
- An Embedding() layer can be initialized with pretrained values.
  - These values can be either fixed or trained further on your dataset.
  - If however your labeled dataset is small, it's usually not worth trying to train a large pre-trained set of embeddings.
- LSTM() has a flag called return_sequences that decides whether the layer returns every hidden state or only the last one.
- You can use Dropout() right after LSTM() to regularize your network.
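The padding point can be illustrated with a few lines of NumPy. The helper pad_to_indices and the tiny vocabulary below are hypothetical; the sketch follows the zero-padding idea behind sentences_to_indices, and the truncation of over-long sentences is an extra assumption of ours.

```python
import numpy as np

def pad_to_indices(sentences, word_to_index, max_len):
    """Hypothetical helper: map words to indices and right-pad each row with zeros."""
    out = np.zeros((len(sentences), max_len), dtype=np.int32)
    for i, sentence in enumerate(sentences):
        # Assumption: sentences longer than max_len are truncated.
        for j, word in enumerate(sentence.lower().split()[:max_len]):
            out[i, j] = word_to_index[word]
    return out

# Toy vocabulary; index 0 is reserved for padding.
word_to_index = {"i": 1, "love": 2, "you": 3, "food": 4, "is": 5, "life": 6}
batch = pad_to_indices(["I love you", "food is life", "love"], word_to_index, max_len=5)
print(batch)
# [[1 2 3 0 0]
#  [4 5 6 0 0]
#  [2 0 0 0 0]]
```

Because every row now has the same length, the batch is a regular (m, max_len) integer array that can be fed to the Embedding layer in one call.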

Input sentences:

"Congratulations on finishing this assignment and building an Emojifier." "We hope you're happy with what you've accomplished in this notebook!"

Output emojis:

☁ ☁☁

 ✨ BYE-BYE!

☁ ✨

 ✨ ☁ ✨ ✨

✨
