MNIST Dataset
In traditional programming, the programmer is able to articulate rules and conditions in their code that
their program can then use to act in the correct way. This approach continues to work exceptionally well
for a huge variety of problems.
Image classification, which asks a program to correctly classify an image it has never seen before into
its correct class, is nearly impossible to solve with traditional programming techniques. How could a
programmer possibly define the rules and conditions needed to correctly classify a huge variety of
images, especially images that they have never seen?
Deep learning excels at pattern recognition by trial and error. By training a deep neural network with
sufficient data, and giving the network feedback on its performance during training, the network can
identify, through a huge amount of iteration, its own set of conditions by which it can act in the correct
way.
In the history of deep learning, the accurate image classification of the MNIST dataset
(http://yann.lecun.com/exdb/mnist/), a collection of 70,000 grayscale images of handwritten digits from 0
to 9, was a major development. While today the problem is considered trivial, doing image classification
with MNIST has become a kind of "Hello World" for deep learning.
When working with images for deep learning, we need both the images themselves, usually denoted as
X , and the correct labels (https://developers.google.com/machine-learning/glossary#label) for these
images, usually denoted as Y . Furthermore, we need X and Y values both for training the model,
and then a separate set of X and Y values for validating the performance of the model after it has
been trained. Therefore, we need 4 segments of data for the MNIST dataset:
1. x_train: images used for training the neural network
2. y_train: correct labels for the x_train images, used to evaluate the model's predictions during training
3. x_valid: images set aside for validating the performance of the model after it has been trained
4. y_valid: correct labels for the x_valid images, used to evaluate the model's predictions after training
The process of preparing data for analysis is called Data Engineering (https://medium.com/@rchang/a-
beginners-guide-to-data-engineering-part-i-4227c5c457d7). To learn more about the differences
between training data and validation data (as well as test data), check out this article
(https://machinelearningmastery.com/difference-test-validation-datasets/) by Jason Brownlee.
Tensors are mathematical objects from linear algebra that are used to represent multidimensional
data. They support the same arithmetic operations that are already familiar from vectors and matrices,
for example.
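As a quick illustration (not part of this notebook's own code), a minimal sketch of tensor arithmetic with TensorFlow might look like this:

import tensorflow as tf

# Two rank-2 tensors (matrices) of the same shape
a = tf.constant([[1., 2.], [3., 4.]])
b = tf.constant([[5., 6.], [7., 8.]])

print(a + b)            # element-wise addition, just like matrices
print(a * 2.0)          # scalar multiplication
print(tf.matmul(a, b))  # matrix multiplication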
Among the many helpful features that Keras provides are modules containing helper methods for
many common datasets (https://www.tensorflow.org/api_docs/python/tf/keras/datasets), including
MNIST.
With the mnist module, we can easily load the MNIST data, already partitioned into images and labels
for both training and validation:
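A minimal sketch of this loading step (naming the validation split x_valid and y_valid, which is an assumption about the variable names used in the rest of this notebook) might be:

from tensorflow.keras.datasets import mnist

# Load MNIST, already split into training and validation (test) sets
(x_train, y_train), (x_valid, y_valid) = mnist.load_data()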
We stated above that the MNIST dataset contained 70,000 grayscale images of handwritten digits. By
executing the following cells, we can see that Keras has partitioned 60,000 of these images for training,
and 10,000 for validation (after training), and also, that each image itself is a 2D array with the
dimensions 28x28:
In [5]: x_train.shape
Out[5]: (60000, 28, 28)
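Assuming the validation images were loaded as x_valid (as in the sketch above), we can check their shape the same way:

x_valid.shape   # expected to be (10000, 28, 28)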
Furthermore, we can see that these 28x28 images are represented as collections of unsigned 8-bit
integer values between 0 and 255, each value corresponding to a pixel's grayscale intensity, where 0 is
black, 255 is white, and all other values fall in between:
In [7]: x_train.dtype
Out[7]: dtype('uint8')
In [8]: x_train.min()
Out[8]: 0
In [9]: x_train.max()
Out[9]: 255
In [10]: x_train[0]
Out[10]: array([[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 3,
18, 18, 18, 126, 136, 175, 26, 166, 255, 247, 127, 0, 0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 30, 36, 94, 154, 170,
253, 253, 253, 253, 253, 225, 172, 253, 242, 195, 64, 0, 0,
... (output truncated; remaining rows of the 28x28 array not shown)
Using Matplotlib (https://matplotlib.org/), we can render one of these grayscale images in our dataset:
In [11]: import matplotlib.pyplot as plt
         image = x_train[4]
         plt.imshow(image, cmap='gray')
In this way we can now see that this is a 28x28 pixel image of a 9. Or is it a 4? The answer is in the
y_train data, which contains correct labels for the data. Let's take a look:
In [20]: y_train[4]
Out[20]: 9
In deep learning, it is common that data needs to be transformed to be in the ideal state for training. For
this particular image classification problem, there are 3 tasks we should perform with the data in
preparation for training:
1. Flatten the image data, to simplify the image input into the model
2. Normalize the image data, to make the image input values easier to work with for the model
3. Categorize the labels, to make the label values easier to work with for the model
Though it's possible for a deep learning model to accept a 2-dimensional image (in our case 28x28
pixels), we're going to simplify things to start and reshape
(https://www.tensorflow.org/api_docs/python/tf/reshape) each image into a single array of 784
continuous pixels (note: 28x28 = 784). This is also called flattening the image.
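A minimal sketch of this reshaping step, assuming the x_train and x_valid arrays from the loading sketch earlier, might be:

# Flatten each 28x28 image into a single row of 784 pixel values
x_train = x_train.reshape(60000, 784)
x_valid = x_valid.reshape(10000, 784)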
We can confirm that the image data has been reshaped and is now a collection of 1D arrays containing
784 pixel values each:
In [22]: x_train.shape
Out[22]: (60000, 784)
In [23]: x_train[0]
Out[23]: array([ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 3, 18, 18, 18,
126, 136, 175, 26, 166, 255, 247, 127, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 30, 36, 94, 154, 170, 253,
253, 253, 253, 253, 225, 172, 253, 242, 195, 64, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 49, 238, 253, 253, 253,
253, 253, 253, 253, 253, 251, 93, 82, 82, 56, 39, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 18, 219, 253,
253, 253, 253, 253, 198, 182, 247, 241, 0, 0, 0, 0, 0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
... (output truncated; remaining values of the 784-element array not shown)
Normalizing the Image Data
Deep learning models are better at dealing with floating point numbers between 0 and 1. Converting
the integer pixel values into floating point values between 0 and 1 is called normalization
(https://developers.google.com/machine-learning/glossary#normalization), and the simple approach we
will take here to normalize the data is to divide all of the pixel values (which, if you recall, are between
0 and 255) by 255:
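A minimal sketch of this normalization step, assuming the flattened x_train and x_valid arrays from above, might be:

# Dividing the uint8 values by 255 yields float64 values between 0.0 and 1.0
x_train = x_train / 255
x_valid = x_valid / 255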
We can now see that the values are all floating point values between 0.0 and 1.0 :
In [25]: x_train.dtype
Out[25]: dtype('float64')
In [26]: x_train.min()
Out[26]: 0.0
In [27]: x_train.max()
Out[27]: 1.0
Categorical Encoding
Consider for a moment: if we were to ask what 7 - 2 is, stating that the answer is 4 is closer than
stating that the answer is 9. However, for this image classification problem, we don't want the neural
network to learn this kind of reasoning: we just want it to select the correct category, and to understand
that if we have an image of the number 5, guessing 4 is just as bad as guessing 9.
As it stands, the labels for the images are integers between 0 and 9. Because these values represent a
numerical range, the model might try to draw some conclusions about its performance based on how
close to the correct numerical category it guesses.
Therefore, we will do something to our data called categorical encoding. This kind of transformation
modifies the data so that each value becomes a collection spanning all possible categories, with the
actual category for that particular value set to true.
As a simple example, consider if we had 3 categories: red, blue, and green. For a given color, 2 of these
categories would be false, and the other would be true:
Actual Color    Is Red?    Is Blue?    Is Green?
Red             True       False       False
Green           False      False       True
Blue            False      True        False
Green           False      False       True
Rather than use "True" or "False", we could represent the same using binary, either 0 or 1:
Actual Color    Is Red?    Is Blue?    Is Green?
Red             1          0           0
Green           0          0           1
Blue            0          1           0
Green           0          0           1
This is what categorical encoding is: transforming values which are intended to be understood as
categorical labels into a representation that makes their categorical nature explicit to the model. Thus, if
we were using these values for training, we would convert...
values = ['red', 'green', 'blue', 'green']
... which a neural network would have a very difficult time making sense of, instead to:
values = [
[1, 0, 0],
[0, 0, 1],
[0, 1, 0],
[0, 0, 1]
]
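Keras provides a utility for this kind of encoding. A minimal sketch of the step that encodes the labels, assuming 10 categories for the digits 0 through 9 and the y_train and y_valid names from earlier (the variable name num_categories is illustrative), might be:

import tensorflow.keras as keras

num_categories = 10

# One-hot encode the integer labels into vectors of length 10
y_train = keras.utils.to_categorical(y_train, num_categories)
y_valid = keras.utils.to_categorical(y_valid, num_categories)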
Here are the first 10 values of the training labels, which you can see have now been categorically
encoded:
In [30]: y_train[0:10]
Out[30]: array([[0., 0., 0., 0., 0., 1., 0., 0., 0., 0.],
[1., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
[0., 0., 0., 0., 1., 0., 0., 0., 0., 0.],
[0., 1., 0., 0., 0., 0., 0., 0., 0., 0.],
[0., 0., 0., 0., 0., 0., 0., 0., 0., 1.],
[0., 0., 1., 0., 0., 0., 0., 0., 0., 0.],
[0., 1., 0., 0., 0., 0., 0., 0., 0., 0.],
[0., 0., 0., 1., 0., 0., 0., 0., 0., 0.],
[0., 1., 0., 0., 0., 0., 0., 0., 0., 0.],
[0., 0., 0., 0., 1., 0., 0., 0., 0., 0.]], dtype=float32)
Creating the Model
With the data prepared for training, it is now time to create the model that we will train with the data.
This first basic model will be made up of several layers and will be composed of 3 main parts:
1. An input layer, which will receive the data in some expected format
2. Several hidden layers, each made up of many neurons whose weights will be adjusted during training and contribute to the network's guesses
3. An output layer, which will depict the network's guess for a given image
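A minimal sketch of creating the model with Keras's Sequential class, which lets us add layers to it one after another, might be:

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense

# Start with an empty sequential model; layers will be added in order below
model = Sequential()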
Next, we will add the input layer. This layer will be densely connected, meaning that each neuron in it,
and its weights, will affect every neuron in the next layer. To do this with Keras, we use Keras's Dense
(https://www.tensorflow.org/api_docs/python/tf/keras/layers/Dense) layer class.
The units argument specifies the number of neurons in the layer. We are going to use 512 (chosen
from experimentation). Choosing the correct number of neurons is what puts the "science" in "data
science" as it is a matter of capturing the statistical complexity of the dataset. Try playing around with
this value later to see how it affects training and to start developing a sense for what this number
means.
We will learn more about activation functions later, but for now, we will use the relu activation
function, which in short, will help our network to learn how to make more sophisticated guesses about
data than if it were required to make guesses based on some strictly linear function.
The input_shape value specifies the shape of the incoming data which in our situation is a 1D array
of 784 values:
In [34]: model.add(Dense(units=512, activation='relu', input_shape=(784,)))
Now we will add an additional densely connected layer. These layers give the network more parameters
to contribute towards its guesses, and therefore, more subtle opportunities for accurate learning:
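A minimal sketch of this hidden layer, again using 512 neurons and assuming the model and Dense import from above, might be:

# A second densely connected layer; no input_shape is needed because
# Keras infers it from the previous layer
model.add(Dense(units=512, activation='relu'))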
Finally, we will add an output layer. This layer uses the softmax activation function, which will result in
each of the layer's values being a probability between 0 and 1, with all of the layer's outputs adding up
to 1. In this case, since the network is making a guess about a single image belonging to 1 of 10
possible categories, there will be 10 outputs. Each output gives the model's guess (a probability)
that the image belongs to that specific class:
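A minimal sketch of the output layer, with one unit per digit class, might be:

# 10 outputs, one probability per digit class (0 through 9)
model.add(Dense(units=10, activation='softmax'))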
In [37]: model.summary()
Model: "sequential"
_________________________________________________________________
 Layer (type)                Output Shape              Param #
=================================================================
 dense (Dense)               (None, 512)               401920
 dense_1 (Dense)             (None, 512)               262656
 dense_2 (Dense)             (None, 10)                5130
=================================================================
Total params: 669706 (2.55 MB)
Trainable params: 669706 (2.55 MB)
Non-trainable params: 0 (0.00 Byte)
_________________________________________________________________
Note the number of trainable parameters. Each of these can be adjusted during training and will
contribute towards the trained model's guesses.
The final step we need to do before we can actually train our model with data is to compile
(https://www.tensorflow.org/api_docs/python/tf/keras/Sequential#compile) it. Here we specify a loss
function (https://developers.google.com/machine-learning/glossary#loss), which will be used for the
model to understand how well it is performing during training. We also specify that we would like to track
accuracy while the model trains:
In [39]: model.compile(loss='categorical_crossentropy', metrics=['accuracy'])
Now that we have prepared training and validation data, and a model, it's time to train our model with
our training data, and verify it with its validation data.
"Training a model with data" is often also called "fitting a model to data." Put this latter way, it highlights
that the shape of the model changes over time to more accurately understand the data that it is being
given.
When fitting (training) a model with Keras, we use the model's fit
(https://www.tensorflow.org/api_docs/python/tf/keras/Model#fit) method. It expects the following
arguments:
1. The training data
2. The labels for the training data
3. The number of epochs, that is, how many times the model should train on the entire training dataset
4. The validation data and labels, used to evaluate the model after each epoch
Run the cell below to train the model. We will discuss its output after the training completes:
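A minimal sketch of this training cell, assuming the prepared variables from the earlier sketches and 5 epochs to match the output below, might be:

history = model.fit(
    x_train, y_train,                    # training images and their one-hot labels
    epochs=5,                            # train over the full training set 5 times
    verbose=1,                           # print per-epoch progress, including accuracy
    validation_data=(x_valid, y_valid)   # held-out data used to report val_accuracy
)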
Epoch 1/5
For each of the 5 epochs, notice the accuracy and val_accuracy scores. accuracy states how
well the model did for the epoch on all the training data. val_accuracy states how well the model did
on the validation data, which if you recall, was not used at all for training the model.
The model did quite well! The accuracy quickly reached close to 100%, as did the validation accuracy.
We now have a model that can be used to accurately classify images of handwritten digits.
The next step would be to use this model to classify new not-yet-seen handwritten images. This is
called inference (https://blogs.nvidia.com/blog/2016/08/22/difference-deep-learning-training-inference-
ai/).
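As a rough sketch only (this notebook does not perform inference itself), prediction with a trained Keras model might look like the following, assuming a flattened, normalized image such as one from the x_valid array used above:

import numpy as np

# Predict class probabilities for a single image (note the batch dimension)
probabilities = model.predict(x_valid[0:1])

# The predicted digit is the class with the highest probability
predicted_digit = np.argmax(probabilities)
print(predicted_digit)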
Clear the Memory
Before moving on, please execute the following cell to clear up the GPU memory. This is required to
move on to the next notebook.
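A minimal sketch of how this is commonly done, by shutting down the IPython kernel (an assumption about what the cell in question contains), might be:

import IPython

# Shutting down the kernel releases the GPU memory held by this notebook;
# passing True requests a restart so the kernel can be used again afterwards
app = IPython.Application.instance()
app.kernel.do_shutdown(True)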