Natural Language Processing with RNNs
In this notebook we will use recurrent neural networks for two tasks:

Sentiment Analysis
Character Generation
RNNs are complex and come in many different forms, so in this tutorial we will focus on how
they work and the kinds of problems they are best suited for.
Sequence Data
In the previous tutorials we focused on data that we could represent as one static data point,
where the notion of time or step was irrelevant. Take, for example, our image data: it was simply a
tensor of shape (width, height, channels). That data doesn't change and doesn't depend on any
notion of time.
In this tutorial we will look at sequences of text and learn how we can encode them in a
meaningful way. Unlike images, sequence data such as long chains of text, weather patterns,
videos and really anything where the notion of a step or time is relevant needs to be processed
and handled in a special way.
But what do I mean by sequences, and why is text data a sequence? Well, that's a good question.
Since textual data contains many words that follow in a very specific and meaningful order, we
need to be able to keep track of each word and when it occurs in the data. Simply encoding, say,
an entire paragraph of text into one data point wouldn't give us a very meaningful picture of the
data and would be very difficult to do anything with. This is why we treat text as a sequence and
process one word at a time. We will keep track of where each of these words appears and use
that information to try to understand the meaning of pieces of text.
Encoding Text
As we know machine learning models and neural networks don't take raw text data as an input.
This means we must somehow encode our textual data to numeric values that our models can
understand. There are many different ways of doing this and we will look at a few examples
below.
Before we get into the different encoding/preprocessing methods let's understand the
information we can get from textual data by looking at the following two movie reviews.
I thought the movie was going to be bad, but it was actually amazing!
I thought the movie was going to be amazing, but it was actually bad!
Although these two sentences are very similar, we know that they have very different meanings.
This is because of the ordering of the words, a very important property of textual data.
Now keep that in mind while we consider some different ways of encoding our textual data.
Bag of Words
The first and simplest way to encode our data is to use something called bag of words. This is a
pretty easy technique where each word in a sentence is encoded with an integer and thrown into
a collection that does not maintain the order of the words but does keep track of the frequency.
Have a look at the Python function below that encodes a string of text into a bag of words.
vocab = {}  # maps word to integer representing it
word_encoding = 1

def bag_of_words(text):
    global word_encoding
    words = text.lower().split(" ")  # create a list of all of the words in the text
    bag = {}  # stores all of the encodings and their frequency
    for word in words:
        if word in vocab:
            encoding = vocab[word]  # this word has been seen before, reuse its encoding
        else:
            vocab[word] = word_encoding  # assign the next free integer to this new word
            encoding = word_encoding
            word_encoding += 1
        if encoding in bag:
            bag[encoding] += 1
        else:
            bag[encoding] = 1
    return bag
text = "this is a test to see if this test will work is is test a a"
bag = bag_of_words(text)
print(bag)
print(vocab)
This isn't really the way we would do this in practice, but I hope it gives you an idea of how bag
of words works. Notice that we've lost the order in which words appear. In fact, let's look at how
this encoding works for the two sentences we showed above.
positive_review = "I thought the movie was going to be bad but it was actually amazing"
negative_review = "I thought the movie was going to be amazing but it was actually bad"
pos_bag = bag_of_words(positive_review)
neg_bag = bag_of_words(negative_review)
print("Positive:", pos_bag)
print("Negative:", neg_bag)
We can see that even though these sentences have very different meanings, they are encoded in
exactly the same way. Obviously, this isn't going to fly. Let's look at some other methods.
Integer Encoding
The next technique we will look at is called integer encoding. This involves representing each
word or character in a sentence as a unique integer and maintaining the order of these words.
This should hopefully fix the problem we saw before, where we lost the order of words.
vocab = {}
word_encoding = 1

def one_hot_encoding(text):  # despite the name, this performs integer encoding: each word gets its own integer
    global word_encoding
    words = text.lower().split(" ")
    encoding = []
    for word in words:
        if word in vocab:
            encoding.append(vocab[word])
        else:
            vocab[word] = word_encoding
            encoding.append(word_encoding)
            word_encoding += 1
    return encoding

text = "this is a test to see if this test will work is is test a a"
encoding = one_hot_encoding(text)
print(encoding)
print(vocab)
And now let's have a look at how this encoding handles our movie reviews.
positive_review = "I thought the movie was going to be bad but it was actually amazing"
negative_review = "I thought the movie was going to be amazing but it was actually bad"
pos_encode = one_hot_encoding(positive_review)
neg_encode = one_hot_encoding(negative_review)
print("Positive:", pos_encode)
print("Negative:", neg_encode)
Much better! Now we are keeping track of the order of words and we can tell where each occurs.
But this still has a few issues. Ideally when we encode words, we would like similar words to
have similar labels and different words to have very different labels. For example, the words
happy and joyful should probably have very similar labels so we can determine that they are
similar, while words like horrible and amazing should have very different labels. The method we
looked at above won't do something like this for us. This could mean that the model will have a
very difficult time determining whether two words are similar, which could result in some pretty
drastic performance impacts.
Word Embeddings
Luckily there is a third method that is far superior: word embeddings. This method keeps the
order of words intact and also encodes similar words with very similar labels. It attempts to
encode not only the frequency and order of words but also the meaning of those words in the
sentence. It encodes each word as a dense vector that represents its context in the sentence.
Unlike the previous techniques, word embeddings are learned by looking at many different
training examples. You can add what's called an embedding layer to the beginning of your model,
and while your model trains, the embedding layer will learn the correct embeddings for words.
You can also use pretrained embedding layers.
This is the technique we will use for our examples, and its implementation will be shown later
on.
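As a quick preview (the sizes here are illustrative, not the values used later), an embedding layer in Keras is essentially a lookup table that maps integer word ids to dense vectors:

import tensorflow as tf

# a toy embedding layer: maps each integer word id (0..999) to a dense 8-dimensional vector
embedding = tf.keras.layers.Embedding(input_dim=1000, output_dim=8)

# a "sentence" of 5 word ids; the output has shape (1, 5, 8): one 8-d vector per word
word_ids = tf.constant([[4, 20, 7, 4, 9]])
vectors = embedding(word_ids)
print(vectors.shape)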
A recurrent neural network (RNN) is simply a network that contains a loop: it processes its input
one step at a time while keeping an internal memory of what it has already seen. This is why we
are treating our text data as a sequence, so that we can pass one word at a time to the RNN.
(Diagram of an unrolled recurrent layer. Source: https://colah.github.io/posts/2015-08-Understanding-LSTMs/)
Let's define what these variables stand for before we get into the explanation:
h_t: the output at time t
x_t: the input at time t
What this diagram illustrates is that a recurrent layer processes the input one word (one step) at
a time, in combination with the output from the previous iteration. So, as we progress further into
the input sequence, we build up a more complex understanding of the text as a whole.
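To make that concrete, here is a minimal NumPy sketch (not the actual Keras implementation; the tanh activation and the weight names are just illustrative) of what a simple recurrent layer computes at each step:

import numpy as np

# the new output h_t depends on the current input x_t and the previous output h_prev
def simple_rnn_step(x_t, h_prev, W_x, W_h, b):
    return np.tanh(x_t @ W_x + h_prev @ W_h + b)

# process a toy sequence of 4 inputs (each a vector of size 3) with an output of size 5
rng = np.random.default_rng(0)
W_x, W_h, b = rng.normal(size=(3, 5)), rng.normal(size=(5, 5)), np.zeros(5)
h = np.zeros(5)  # initial output h_0
for x in rng.normal(size=(4, 3)):  # one step per element of the sequence
    h = simple_rnn_step(x, h, W_x, W_h, b)
print(h)  # the final output summarizes the whole sequence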
What we've just looked at is called a simple RNN layer. It can be effective at processing shorter
sequences of text for simple problems, but it has many downfalls. One of them is the fact that,
as text sequences get longer, it gets increasingly difficult for the network to understand the text
properly.
LSTM
The layer we discussed in depth above is called a SimpleRNN. However, there exist other
recurrent layers (layers that contain a loop) that work much better than a simple RNN layer.
The one we will talk about here is the LSTM (Long Short-Term Memory) layer. This layer works
very similarly to the SimpleRNN layer but adds a way to access inputs from any timestep in the
past. Whereas in our simple RNN layer input from previous timesteps gradually disappeared as
we got further through the input, with an LSTM we have a long-term memory data structure
storing all the previously seen inputs as well as when we saw them. This allows us to access any
previous value we want at any point in time. This adds to the complexity of our network and
allows it to discover more useful relationships between inputs and when they appear.
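In Keras, swapping a SimpleRNN layer for an LSTM layer is a one-line change; here is a quick illustrative comparison (the layer and input sizes are arbitrary):

import tensorflow as tf

# both layers take a batch of sequences of feature vectors, shape (batch, timesteps, features),
# and by default return only their final output, shape (batch, units)
simple_layer = tf.keras.layers.SimpleRNN(32)
lstm_layer = tf.keras.layers.LSTM(32)

x = tf.random.normal((4, 10, 8))  # 4 sequences, 10 timesteps, 8 features each
print(simple_layer(x).shape)  # (4, 32)
print(lstm_layer(x).shape)    # (4, 32)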
For the purpose of this course we will refrain from going any further into the math or details
behind how these layers work.
Sentiment Analysis
And now it's time to see a recurrent neural network in action. For this example, we are going to
do something called sentiment analysis:
the process of computationally identifying and categorizing opinions expressed in a piece of text,
especially in order to determine whether the writer's attitude towards a particular topic, product,
etc. is positive, negative, or neutral.
The example we'll use here is classifying movie reviews as either positive, negative, or neutral.
%tensorflow_version 2.x # this line is not required unless you are in a notebook
from keras.datasets import imdb
from keras.preprocessing import sequence
import keras
import tensorflow as tf
import os
import numpy as np
VOCAB_SIZE = 88584
MAXLEN = 250
BATCH_SIZE = 64
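The cell that actually loads the reviews isn't visible in this extract; presumably it uses the IMDB movie review dataset that ships with Keras (imported above), something like:

# each review comes already encoded as a list of integer word ids
(train_data, train_labels), (test_data, test_labels) = imdb.load_data(num_words=VOCAB_SIZE)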
More Preprocessing
If we have a look at some of our loaded in reviews, we'll notice that they are different lengths.
This is an issue. We cannot pass different length data into our neural network. Therefore, we
must make each review the same length. To do this we will follow the procedure below:
if the review is greater than 250 words then trim off the extra words
if the review is less than 250 words add the necessary amount of 0's to make it equal to
250.
Luckily for us, Keras has a function that can do this for us:
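That function is sequence.pad_sequences (imported above); the call that belongs here presumably looks like this:

# pad (or trim) every review to exactly MAXLEN integers
train_data = sequence.pad_sequences(train_data, MAXLEN)
test_data = sequence.pad_sequences(test_data, MAXLEN)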
32 stands for the output dimension of the vectors generated by the embedding layer. We can
change this value if we'd like!
model = tf.keras.Sequential([
    tf.keras.layers.Embedding(VOCAB_SIZE, 32),
    tf.keras.layers.LSTM(32),
    tf.keras.layers.Dense(1, activation="sigmoid")
])

model.summary()
Training
Now it's time to compile and train the model.
model.compile(loss="binary_crossentropy", optimizer="rmsprop", metrics=['acc'])
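The training call itself isn't shown in this extract; a minimal sketch (the epoch count and validation split are illustrative choices):

history = model.fit(train_data, train_labels, epochs=10, validation_split=0.2)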
And we'll evaluate the model on the test data to see how well it performs on reviews it hasn't seen.
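The evaluation cell isn't shown either; it would be something like:

results = model.evaluate(test_data, test_labels)
print(results)  # [loss, accuracy]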
So we're scoring somewhere in the mid-high 80's. Not bad for a simple recurrent network.
Making Predictions
Now let's use our network to make predictions on our own reviews.
Since our reviews are encoded, we'll need to convert any review that we write into that form so
the network can understand it. To do that, we'll load the encodings from the dataset and use
them to encode our own data.
word_index = imdb.get_word_index()
def encode_text(text):
    tokens = keras.preprocessing.text.text_to_word_sequence(text)
    tokens = [word_index[word] if word in word_index else 0 for word in tokens]
    return sequence.pad_sequences([tokens], MAXLEN)[0]
# build a reverse lookup so we can turn integers back into words
reverse_word_index = {value: key for (key, value) in word_index.items()}

def decode_integers(integers):
    PAD = 0
    text = ""
    for num in integers:
        if num != PAD:
            text += reverse_word_index[num] + " "
    return text[:-1]

encoded = encode_text("that movie was just amazing, so amazing")  # example text to round-trip; any short review works here
print(decode_integers(encoded))
def predict(text):
    encoded_text = encode_text(text)
    pred = np.zeros((1, 250))
    pred[0] = encoded_text
    result = model.predict(pred)
    print(result[0])
# the ends of these review strings were cut off in this extract; the endings used here are illustrative
positive_review = "That movie was! really loved it and would great watch it again because it was amazingly great"
predict(positive_review)

negative_review = "that movie really sucked. I hated it and wouldn't watch it again. Was one of the worst things I've ever watched"
predict(negative_review)
Character Generation
Now it's time for one of the coolest examples we've seen so far. We are going to use an RNN to
generate a play. We will simply show the RNN an example of something we want it to recreate
and it will learn how to write a version of it on its own. We'll do this using a character-predictive
model that takes a variable-length sequence as input and predicts the next character. We can
then call the model many times in a row, with the output from the last prediction as the input for
the next call, to generate a sequence.
%tensorflow_version 2.x # this line is not required unless you are in a notebook
from keras.preprocessing import sequence
import keras
import tensorflow as tf
import os
import numpy as np
Dataset
For this example, we only need one piece of training data. In fact, we can write our own poem or
play and pass that to the network for training if we'd like. However, to make things easy we'll use
an extract from a Shakespeare play.
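The download cell isn't shown in this extract; a minimal sketch using the Shakespeare file from the TensorFlow text-generation tutorial cited in the sources:

path_to_file = tf.keras.utils.get_file(
    'shakespeare.txt',
    'https://storage.googleapis.com/download.tensorflow.org/data/shakespeare.txt')

# read the file and decode it into one long string
text = open(path_to_file, 'rb').read().decode(encoding='utf-8')
print('Length of text: {} characters'.format(len(text)))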
Encoding
Since this text isn't encoded yet, we'll need to do that ourselves. We are going to encode each
unique character as a different integer.
vocab = sorted(set(text))
# Creating a mapping from unique characters to indices
char2idx = {u:i for i, u in enumerate(vocab)}
idx2char = np.array(vocab)
def text_to_int(text):
return np.array([char2idx[c] for c in text])
text_as_int = text_to_int(text)
And here we will make a function that can convert our numeric values to text.
def int_to_text(ints):
try:
ints = ints.numpy()
except:
pass
return ''.join(idx2char[ints])
print(int_to_text(text_as_int[:13]))
The training examples we will prepare will use a seq_length sequence as input and a seq_length
sequence as the output, where the output sequence is the original sequence shifted one letter to
the right. For example:
input: Hell | output: ello
Our first step will be to create a stream of characters from our text data.
char_dataset = tf.data.Dataset.from_tensor_slices(text_as_int)
Next we can use the batch method to turn this stream of characters into batches of desired
length.
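The cell that does this isn't visible here; assuming a sequence length of 100 (consistent with the length-101 batches mentioned below), it would look like:

seq_length = 100  # length of each training example
# batches of 101 characters: 100 for the input plus 1 extra so we can build the shifted output
sequences = char_dataset.batch(seq_length + 1, drop_remainder=True)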
Now we need to use these sequences of length 101 and split them into input and output.
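The splitting cell also isn't shown; a minimal sketch that produces the dataset used in the loop below:

def split_input_target(chunk):  # e.g. "hello" -> ("hell", "ello")
    input_text = chunk[:-1]   # everything except the last character
    target_text = chunk[1:]   # everything except the first character
    return input_text, target_text

dataset = sequences.map(split_input_target)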
for x, y in dataset.take(2):
    print("\n\nEXAMPLE\n")
    print("INPUT")
    print(int_to_text(x))
    print("\nOUTPUT")
    print(int_to_text(y))
BATCH_SIZE = 64
VOCAB_SIZE = len(vocab)  # the number of unique characters in the text
EMBEDDING_DIM = 256
RNN_UNITS = 1024
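The cells that shuffle the data into training batches and build the model aren't visible in this extract. The sketch below follows the architecture implied by the constants above and by the TensorFlow text-generation tutorial in the sources (an Embedding layer feeding an LSTM feeding a Dense layer over the vocabulary); the BUFFER_SIZE value and the build_model helper are assumptions.

BUFFER_SIZE = 10000
data = dataset.shuffle(BUFFER_SIZE).batch(BATCH_SIZE, drop_remainder=True)

def build_model(vocab_size, embedding_dim, rnn_units, batch_size):
    # wrapping this in a function lets us rebuild the same model later with batch_size=1
    return tf.keras.Sequential([
        tf.keras.layers.Embedding(vocab_size, embedding_dim,
                                  batch_input_shape=[batch_size, None]),
        tf.keras.layers.LSTM(rnn_units,
                             return_sequences=True,
                             stateful=True,
                             recurrent_initializer='glorot_uniform'),
        tf.keras.layers.Dense(vocab_size)
    ])

model = build_model(VOCAB_SIZE, EMBEDDING_DIM, RNN_UNITS, BATCH_SIZE)
model.summary()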
However, before we train it, let's have a look at a sample input and the output from our untrained
model. This is so we can understand what the model is giving us.
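The cell that produces example_batch_predictions isn't shown; assuming the batched dataset from the sketch above is called data, it would be something like:

# run one batch of 64 input sequences through the untrained model
for input_example_batch, target_example_batch in data.take(1):
    example_batch_predictions = model(input_example_batch)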
# we can see that the prediction is an array of 64 arrays, one for each entry in the batch
print(len(example_batch_predictions))
print(example_batch_predictions)
# if we want to determine the predicted character we need to sample the output distribution
pred = example_batch_predictions[0]  # the prediction for the first sequence in the batch
sampled_indices = tf.random.categorical(pred, num_samples=1)

# now we can reshape that array and convert all the integers to characters to see the actual predicted text
sampled_indices = np.reshape(sampled_indices, (1, -1))[0]
predicted_chars = int_to_text(sampled_indices)
predicted_chars # and this is what the model predicted for training sequence 1
So now we need to create a loss function that can compare that output to the expected output
and give us some numeric value representing how close the two were.
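The definition of that loss function isn't shown in this extract, but model.compile below refers to a function named loss; since the model outputs raw logits over the vocabulary at every character position, it is presumably sparse categorical cross-entropy computed from logits:

def loss(labels, logits):
    return tf.keras.losses.sparse_categorical_crossentropy(labels, logits, from_logits=True)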
model.compile(optimizer='adam', loss=loss)
Creating Checkpoints
Now we are going to set up and configure our model to save checkpoints as it trains. This will
allow us to load our model from a checkpoint and continue training it.
# the checkpoint directory matches the path used below when we reload specific checkpoints
checkpoint_dir = './training_checkpoints'
checkpoint_prefix = os.path.join(checkpoint_dir, "ckpt_{epoch}")  # one checkpoint file per epoch

checkpoint_callback = tf.keras.callbacks.ModelCheckpoint(
    filepath=checkpoint_prefix,
    save_weights_only=True)
Training
Finally, we will start training the model.
If this is taking a while go to Runtime > Change Runtime Type and choose "GPU" under
hardware accelerator.
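The training cell itself isn't shown; a minimal sketch, where data is the batched dataset from the earlier sketch and the epoch count is an illustrative choice:

history = model.fit(data, epochs=40, callbacks=[checkpoint_callback])  # more epochs generally means better generated text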
Once the model is finished training, we can find the latest checkpoint that stores the model's
weights using the following lines.
# rebuild the model with a batch size of 1 (using the build_model helper sketched earlier)
# so we can feed it one sequence at a time, then load the trained weights into it
model = build_model(VOCAB_SIZE, EMBEDDING_DIM, RNN_UNITS, batch_size=1)
model.load_weights(tf.train.latest_checkpoint(checkpoint_dir))
model.build(tf.TensorShape([1, None]))

We can also load any specific checkpoint we want by pointing at its file prefix directly:

checkpoint_num = 10
model.load_weights("./training_checkpoints/ckpt_" + str(checkpoint_num))
model.build(tf.TensorShape([1, None]))
Generating Text
Now we can use a function based on the one provided by TensorFlow to generate some text
using any starting string we'd like. It works exactly as described at the start of this section:
encode the starting string, get the model's prediction for the next character, sample from that
distribution, and feed the sampled character back in as the next input.
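Only a few lines of that cell survived in this extract (the tf.squeeze, tf.expand_dims and text_generated.append calls), so here is a minimal sketch of the full loop, following the TensorFlow text-generation tutorial cited in the sources; num_generate and temperature are illustrative values.

def generate_text(model, start_string, num_generate=800, temperature=1.0):
    # vectorize the starting string
    input_eval = [char2idx[s] for s in start_string]
    input_eval = tf.expand_dims(input_eval, 0)

    text_generated = []
    model.reset_states()  # clear the stateful LSTM before generating

    for _ in range(num_generate):
        predictions = model(input_eval)
        predictions = tf.squeeze(predictions, 0)  # remove the batch dimension

        # scale by temperature (lower = more predictable) and sample the next character id
        predictions = predictions / temperature
        predicted_id = tf.random.categorical(predictions, num_samples=1)[-1, 0].numpy()

        # feed the predicted character back in as the next input
        input_eval = tf.expand_dims([predicted_id], 0)
        text_generated.append(idx2char[predicted_id])

    return start_string + ''.join(text_generated)

inp = input("Type a starting string: ")
print(generate_text(model, inp))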
And that's pretty much it for this module! I highly recommend messing with the model we just
created and seeing what you can get it to do!
Sources
1. Chollet, François. Deep Learning with Python. Manning Publications Co., 2018.
2. "Text Classification with an RNN: TensorFlow Core." TensorFlow,
www.tensorflow.org/tutorials/text/text_classification_rnn.
3. "Text Generation with an RNN: TensorFlow Core." TensorFlow,
www.tensorflow.org/tutorials/text/text_generation.
4. "Understanding LSTM Networks." Understanding LSTM Networks -- Colah's Blog,
https://colah.github.io/posts/2015-08-Understanding-LSTMs/.