NumPy Slides
Data Representation
The NumPy package is the workhorse of data analysis, machine learning, and scientific
computing in the Python ecosystem. That is because it vastly simplifies
manipulating and crunching vectors and matrices. Some of Python's leading packages rely
on NumPy as a fundamental piece of their infrastructure.
Examples:
Scikit-learn
Scipy
Pandas
TensorFlow
Beyond the ability to slice and dice numeric data, mastering NumPy will give you an edge
when dealing with and debugging advanced use cases in these libraries.
Here we are going to look at some of the main ways to use NumPy and how it can
represent different types of data (tables, images, text, etc.) before we use it for more
complex things such as machine learning models.
import numpy as np
Creating Arrays
We can create a NumPy array by passing a Python list to np.array() (see the sketch
after this paragraph).
There are often cases when we want NumPy to initialize the values of the array for us.
NumPy provides methods like ones(), zeros(), and random.random() for these
cases. We just pass them the number of elements we want them to generate:
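A quick sketch of these constructors (values are illustrative; random.random() returns different numbers on each run):

import numpy as np

data = np.array([1, 2, 3])   # array([1, 2, 3])

np.ones(3)           # array([1., 1., 1.])
np.zeros(3)          # array([0., 0., 0.])
np.random.random(3)  # e.g. array([0.56, 0.31, 0.84]) -- values vary per run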
Once we’ve created our arrays, we can start to manipulate them in interesting ways.
Array Arithmetic
Let’s create two NumPy arrays to showcase their usefulness. We’ll call them data and ones:
Adding them up position-wise (i.e. adding the values of each row) is as simple as typing
data + ones:
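In code, with made-up values:

import numpy as np

data = np.array([1, 2])
ones = np.ones(2)

data + ones   # array([2., 3.])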
Personal note: When I started learning such tools, I found it refreshing that an
abstraction like this spares me from programming such a calculation with loops. It’s a
wonderful abstraction that allows you to think about problems at a higher level, but at the
same time it made me suspect that abstractions like this must have something interesting
going on behind them.
It’s not only addition that we can do this way (we can do almost all arithmetic like this):
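For example:

import numpy as np

data = np.array([1, 2])
ones = np.ones(2)

data - ones   # array([0., 1.])
data * data   # array([1, 4])
data / data   # array([1., 1.])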
There are often cases when we want to carry out an operation between an array and a single
number (we can also call this an operation between a vector and a scalar). Say, for example,
our array represents distance in miles and we want to convert it to kilometers. We simply
say data * 1.6:
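A sketch, assuming a small made-up distance array:

import numpy as np

data = np.array([1, 2])   # distances in miles (illustrative values)
data * 1.6                # array([1.6, 3.2]) -- distances in kilometers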
As we can see, NumPy understood that operation to mean that the multiplication should
happen with each cell. That concept is called broadcasting, and it’s very useful.
Indexing
We can index and slice NumPy arrays in all the ways we can slice Python lists:
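For example:

import numpy as np

data = np.array([1, 2, 3])

data[0]     # 1
data[1]     # 2
data[0:2]   # array([1, 2])
data[1:]    # array([2, 3])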
Aggregation
Additional benefits NumPy gives us are aggregation functions:
In addition to min, max, and sum, you get all the greats like mean to get the average,
prod to get the result of multiplying all the elements together, std to get the standard
deviation, and plenty of others.
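A sketch of the common aggregations:

import numpy as np

data = np.array([1, 2, 3])

data.max()    # 3
data.min()    # 1
data.sum()    # 6
data.mean()   # 2.0
data.prod()   # 6
data.std()    # 0.816... (population standard deviation)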
In more dimensions
All the examples we’ve looked at deal with vectors in one dimension. A key part of the
beauty of NumPy is its ability to apply everything we’ve looked at so far to any number of
dimensions.
Creating Matrices
We can pass Python lists of lists in the following shape to have NumPy create a matrix to
represent them:
np.array([[1, 2], [3, 4]])
We can also use the same methods we mentioned above (ones(), zeros(), and
random.random()) as long as we give them a tuple describing the dimensions of the
matrix we are creating:
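For example:

import numpy as np

np.ones((3, 2))           # a 3x2 matrix of ones
np.zeros((3, 2))          # a 3x2 matrix of zeros
np.random.random((3, 2))  # a 3x2 matrix of random values in [0, 1)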
Matrix Arithmetic
We can add and multiply matrices using arithmetic operators (+, -, *, /) if the two matrices
are the same size. NumPy handles those as position-wise operations:
We can get away with doing these arithmetic operations on matrices of different sizes only
if the different dimension is one (e.g. the matrix has only one column or one row), in which
case NumPy uses its broadcast rules for that operation.
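A sketch of both cases, with made-up values:

import numpy as np

data = np.array([[1, 2], [3, 4]])
ones = np.ones((2, 2))

data + ones       # same size: position-wise -> [[2., 3.], [4., 5.]]

ones_row = np.ones((1, 2))
data + ones_row   # broadcasting: the single row is added to every row
                  # -> [[2., 3.], [4., 5.]]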
Dot Product
A key distinction to make with arithmetic is the case of matrix multiplication using the dot
product. NumPy gives every matrix a dot() method we can use to carry out dot product
operations with other matrices:
We’ve added matrix dimensions at the bottom of this figure to stress that the two matrices
have to have the same dimension on the side they face each other with. You can visualize
this operation as looking like this:
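In code, a sketch where a 1x3 matrix faces a 3x2 matrix:

import numpy as np

data = np.array([[1, 2, 3]])                 # shape (1, 3)
powers_of_ten = np.array([[1, 10],
                          [100, 1000],
                          [10000, 100000]])  # shape (3, 2)

data.dot(powers_of_ten)   # shape (1, 2) -> array([[ 30201, 302010]])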
Matrix Indexing
Indexing and slicing operations become even more useful when we’re manipulating
matrices:
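For example, on a small 3x2 matrix:

import numpy as np

data = np.array([[1, 2], [3, 4], [5, 6]])

data[0, 1]    # 2, the value at row 0, column 1
data[1:3]     # rows 1 and 2 -> array([[3, 4], [5, 6]])
data[0:2, 0]  # column 0 of rows 0 and 1 -> array([1, 3])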
Matrix Aggregation
We can aggregate matrices the same way we aggregated vectors:
Not only can we aggregate all the values in a matrix, but we can also aggregate across the
rows or columns by using the axis parameter:
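A sketch, with made-up values:

import numpy as np

data = np.array([[1, 2], [5, 3], [4, 6]])

data.max()        # 6, the maximum of the whole matrix
data.max(axis=0)  # array([5, 6]), the maximum of each column
data.max(axis=1)  # array([2, 5, 6]), the maximum of each row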
Transposing and Reshaping
A common need when dealing with matrices is the need to rotate them. This is often the
case when we need to take the dot product of two matrices and need to align the
dimension they share. NumPy arrays have a convenient property called T to get the
transpose of a matrix:
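For example:

import numpy as np

data = np.array([[1, 2], [3, 4], [5, 6]])   # shape (3, 2)

data.T   # shape (2, 3) -> array([[1, 3, 5],
         #                        [2, 4, 6]])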
In more advanced use cases, you may find yourself needing to switch the dimensions of a
certain matrix. This is often the case in machine learning applications where a certain
model expects a certain shape for the inputs that is different from your dataset. NumPy’s
reshape() method is useful in these cases. You just pass it the new dimensions you want for
the matrix. You can pass -1 for a dimension and NumPy can infer the correct dimension
based on your matrix:
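A sketch:

import numpy as np

data = np.array([1, 2, 3, 4, 5, 6])

data.reshape(2, 3)    # array([[1, 2, 3], [4, 5, 6]])
data.reshape(3, 2)    # array([[1, 2], [3, 4], [5, 6]])
data.reshape(-1, 2)   # -1 lets NumPy infer the first dimension (here, 3)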
Yet More Dimensions
NumPy can do everything we’ve mentioned in any number of dimensions. Its central data
structure is called ndarray (N-Dimensional Array) for a reason.
In a lot of ways, dealing with a new dimension is just adding a comma to the parameters of
a NumPy function:
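For example, creating a three-dimensional array just means passing one more dimension:

import numpy as np

np.array([[[1, 2], [3, 4]],
          [[5, 6], [7, 8]]])   # shape (2, 2, 2)

np.ones((4, 3, 2))             # a 4x3x2 array of ones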
Practical Usage
And now for the payoff. Here are some examples of the useful things NumPy will help you
through.
Formulas
Implementing mathematical formulas that work on matrices and vectors is a key use case
to consider NumPy for. This is the main reason why NumPy is the darling of the scientific
Python community. For example, consider the mean square error formula that is central to
supervised machine learning models tackling regression problems:
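Written out, the formula is:

\[ \mathrm{error} = \frac{1}{n} \sum_{i=1}^{n} \left( \mathrm{prediction}_i - y_i \right)^2 \]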
Implementing this is quite simple in NumPy:
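A minimal sketch of that one line, with hypothetical prediction and label values:

import numpy as np

predictions = np.array([1, 1, 1])   # hypothetical model outputs
labels = np.array([1, 2, 3])        # hypothetical true values
n = len(predictions)

error = (1 / n) * np.sum(np.square(predictions - labels))
# error == 1.666... for these made-up values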
The beauty of this is that NumPy does not care whether predictions and labels contain one
or a thousand values (as long as they’re both the same size). We can walk through an
example, stepping sequentially through the four operations in that line of code:
Both the predictions and labels vectors contain three values, which means n has a value of
three. After we carry out the subtraction, we end up with the values looking like this:
Then we can square the values in the vector:
Now we sum these values:
Which results in the error value for that prediction and a score for the quality of the model.
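Stepping through in code, using the same hypothetical values as above:

import numpy as np

predictions = np.array([1, 1, 1])
labels = np.array([1, 2, 3])

diff = predictions - labels   # array([ 0, -1, -2])
squared = np.square(diff)     # array([0, 1, 4])
total = np.sum(squared)       # 5
error = (1 / 3) * total       # 1.666...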
Data Representation
Think of all the data types you’ll need to crunch and build models around (spreadsheets,
images, audio, etc.). So many of them are perfectly suited for representation in an n-
dimensional array:
Tables and Spreadsheets
A spreadsheet or a table of values is a two-dimensional matrix. The same goes for
time-series data (for example, the price of a stock over time).
Images
An image is a matrix of pixels of size (height x width). For a grayscale image, each cell is
a single value; for a color image, a third dimension holds the color channels.
Language
A model needs to look at a large amount of text before it can numerically represent the
anxious words of this warrior poet. We can proceed to have it process a small dataset and
use it to build a vocabulary (of 71,290 words):
The sentence can then be broken into an array of tokens (words or parts of words based on
common rules):
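A toy sketch of that step; the sentence, tokenizer rule, and id mapping here are all hypothetical:

# Hypothetical whitespace tokenization and made-up vocabulary ids,
# not the actual tokenizer or vocabulary from the dataset above.
sentence = "Have mercy on my soul"
tokens = sentence.lower().split()   # ['have', 'mercy', 'on', 'my', 'soul']

vocabulary = {'have': 0, 'mercy': 1, 'on': 2, 'my': 3, 'soul': 4}
ids = [vocabulary[token] for token in tokens]   # [0, 1, 2, 3, 4]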
These ids still don’t provide much information value to a model. So before feeding a
sequence of words to a model, the tokens/words need to be replaced with their
embeddings (50-dimensional word2vec embeddings in this case):
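A sketch of the embedding lookup; the embedding matrix here is a random stand-in for trained word2vec vectors:

import numpy as np

ids = [0, 1, 2, 3, 4]   # token ids from the step above

# Hypothetical stand-in: 71,290 vocabulary entries x 50 dimensions
embedding_matrix = np.random.random((71290, 50))

embeddings = embedding_matrix[ids]   # shape (5, 50): one 50-d vector per token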
You can see that this NumPy array has the dimensions [embedding_dimension x
sequence_length]. In practice these would be the other way around, but I’m presenting it
this way for visual consistency. For performance reasons, deep learning models tend to
preserve the first dimension for batch size (because the model can be trained faster if
multiple examples are trained in parallel). This is a clear case where reshape() becomes
super useful. A model like BERT (http://ruder.io/nlp-imagenet/), for example, would expect
its inputs in the shape: [batch_size, sequence_length, embedding_size].
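A sketch of reshaping a single example into that shape (sizes are illustrative):

import numpy as np

sequence_length, embedding_size = 5, 50
embeddings = np.random.random((sequence_length, embedding_size))  # one example

# Add a leading batch dimension of 1; the remaining sizes stay unchanged
batch = embeddings.reshape(1, sequence_length, embedding_size)
batch.shape   # (1, 5, 50) == [batch_size, sequence_length, embedding_size]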