0% found this document useful (0 votes)
39 views54 pages

Class X Artificial Intelligence: Computer Vision

Chapter 5 of the Class X Artificial Intelligence curriculum focuses on Computer Vision, a domain that enables machines to interpret and analyze visual data similarly to humans. It discusses various applications, particularly in retail, and outlines key tasks such as image classification, object detection, and instance segmentation. The chapter also covers the basics of images and pixels, including resolution and color representation in RGB and grayscale formats.

Uploaded by

Laksin VJ
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
39 views54 pages

Class X Artificial Intelligence: Computer Vision

Chapter 5 of the Class X Artificial Intelligence curriculum focuses on Computer Vision, a domain that enables machines to interpret and analyze visual data similarly to humans. It discusses various applications, particularly in retail, and outlines key tasks such as image classification, object detection, and instance segmentation. The chapter also covers the basics of images and pixels, including resolution and color representation in RGB and grayscale formats.

Uploaded by

Laksin VJ
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 54

CLASS X

ARTIFICIAL
INTELLIGENCE

CHAPTER 5
COMPUTER VISION
•As we all know, artificial
intelligence is a technique
that enables computers to
mimic human intelligence.
• As humans we can see
things, analyse it and then do
the required action on the
basis of what we see. But can
machines do the same?
•Can machines have the eyes
that humans have? If you
answered Yes, then you are
absolutely right.
•The Computer Vision
domain of Artificial
Intelligence,enables
machines to see
through images or
visual data, process
and analyse them on
the basis of algorithms
and methods in order to
analyse actual
phenomena with
images.
https://
emojiscavengerhunt.withgoogle.com/
•Applications of Computer
Vision The concept of computer
vision was first introduced in
the 1970s.
•All these new applications of
computer vision excited
everyone. Having said that, the
computer vision technology
advanced enough to make
these applications available to
everyone at ease today.
Computer Vision in Retail*:
The retail field has been one of the fastest growing
field and at the same time is using Computer Vision for
making the user experience more fruitful.

Retailers can use Computer Vision techniques to track


customers’ movements through stores, analyse
navigational routes and detect walking patterns.
•Computer Vision in Retail*:

•The retail field has been one of


the fastest growing field and at
the same time is using Computer
Vision for making the user
experience more fruitful.

• Retailers can use Computer


Vision techniques to track
customers’ movements through
stores, analyse navigational
routes and detect walking
patterns.
•Computer Vision:
•Getting Started

•Computer Vision is a domain of


Artificial Intelligence, that deals
with the images. It involves the
concepts of image processing
and machine learning models to
build a Computer Vision based
application.
•Computer Vision Tasks
•The various applications of
Computer Vision are based on a
certain number of tasks which are
performed to get certain
information from the input image
which can be directly used for
prediction or forms the base for
further analysis.
The tasks used
in a computer
vision
application are :
•Classification Image
•Classification problem is the task
of assigning an input image one
label from a fixed set of
categories.
•This is one of the core problems
in CV that, despite its simplicity,
has a large variety of practical
applications.
•Classification + Localisation .
This is the task which involves both
processes of identifying what object is
present in the image and at the same
time identifying at what location that
object is present in that image.

It is used only for single objects.


•Object Detection
•Object detection is the process of finding
instances of real-world objects such as
faces, bicycles, and buildings in images or
videos. Object detection algorithms
typically use extracted features and
learning algorithms to recognize instances
of an object category.

•It is commonly used in applications such


as image retrieval and automated vehicle
parking systems.
•Instance Segmentation
•Instance Segmentation is the
process of detecting instances of the
objects, giving them a category and
then giving each pixel a label on the
basis of that.
• A segmentation algorithm takes an
image as input and outputs a
collection of regions (or segments).
•Basics of Images
•Basics of Pixels
•The word “pixel” means a
picture element. Every
photograph, in digital form, is
made up of pixels.
•They are the smallest unit of
information that make up a
picture. Usually round or square,
they are typically arranged in a
2-dimensional grid.
In the image below, one portion has been magnified many
times over so that you can see its individual composition in
pixels.

As you can see, the pixels approximate the actual image.


The more pixels you have, the more closely the image
resembles the original.
Resolution
•The number of pixels in an image is
sometimes called the resolution.
•When the term is used to describe
pixel count, one convention is to
express resolution as the width by
the height, for example a monitor
resolution of 1280×1024.
•This means there are 1280 pixels
from one side to the other, and 1024
from top to bottom.
•Another convention is to express the
number of pixels as a single number, like a 5
mega pixel camera (a megapixel is a million
pixels).
•This means the pixels along the width
multiplied by the pixels along the height of
the image taken by the camera equals 5
million pixels.
•In the case of our 1280×1024 monitors, it
could also be expressed as 1280 x 1024 =
1,310,720, or 1.31 megapixels.
Pixel value

Each of the pixels that represents an image stored inside a


computer has a pixel value which describes how bright
that pixel is, and/or what colour it should be.

The most common pixel format is the byte image, where


this number is stored as an 8-bit integer giving a range of
possible values from 0 to 255.

Typically, zero is to be taken as no colour or black and 255


is taken to be full colour or white.
Why do we have a value of 255 ?

In the computer systems, computer data is in the


form of ones and zeros, which we call the binary
system.

Each bit in a computer system can have either a


zero or a one.

Since each pixel uses 1 byte of an image, which is


equivalent to 8 bits of data.

Since each bit can have two possible values which


tells us that the 8 bit can have 255 possibilities of
Grayscale Images
•Grayscale images are images which have a
range of shades of gray without apparent
colour.
•The darkest possible shade is black, which is
the total absence of colour or zero value of
pixel.
•The lightest possible shade is white, which is
the total presence of colour or 255 value of a
pixel .
•Intermediate shades of gray are represented
by equal brightness levels of the three primary
colours.
RGB Images

All the images that we see around are coloured images.

These images are made up of three primary colours Red, Green


and Blue.

All the colours that are present can be made by combining


different intensities of red, green and blue.
Let us experience!
Go to this online link

https://www.w3schools.com/colors/colors_rgb.asp.

On the basis of this online tool, try and answer all


the below mentioned questions.
How do computers store RGB images?
• Every RGB image is stored in the form of
three different channels called the R channel,
G channel and the B channel.
• Each plane separately has a number of pixels
with each pixel value varying from 0 to 255.
• All the three planes when combined together
form a colour image.
• This means that in a RGB image, each pixel
has a set of three different values which
together give colour to that particular pixel.
In the above given image, if we split the image into three different
channels, namely Red (R), Green (G) and Blue (B), the individual
layers will have the following intensity of colours of the individual
pixels.

These individual layers when stored in the memory looks like the
image on the extreme right.

The images look in the grayscale image because each pixel has a
value intensity of 0 to 255 and as studied earlier, 0 is considered
as black or no presence of colour and 255 means white or full
presence of colour.

These three individual RGB values when combined together form


the colour of each pixel.
Task : Go to the following link

www.piskelapp.com

and create your own pixel art.

Try and make a GIF using the online app


for your own pixel art.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy