0% found this document useful (0 votes)

52 views7 pages

Image Recognition in Self-Driving Cars Using CNN

Uploaded by

you are the best guy ever

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

52 views7 pages

Image Recognition in Self-Driving Cars Using CNN

Uploaded by

you are the best guy ever

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Image recognition in self-driving cars using CNN

Shreya Muppidi *

Department of Mechanical Engineering, VNR Vignana Jyothi Institute of Engineering and Technology, Hyderabad, India.

International Journal of Science and Research Archive, 2023, 09(02), 342–348

Publication history: Received on 11 June 2023; revised on 24 July 2023; accepted on 27 July 2023

Article DOI: https://doi.org/10.30574/ijsra.2023.9.2.0574

Abstract
The concept of neural networks has existed for over decades but was never considerably acknowledged as much as of
today. The main reason happens to be “data.” To analyze a problem statement using neural networks, large data is
required in its various forms and therefore it has not been instigated back in the day. But now, with today’s vast
technology, neural networks have begun to take over some of the numerous machine learning applications with the
help of huge datasets. In this research paper, a certain deep learning approach namely convolutional neural network
(CNN) has been discussed which plays a major role in classifying and recognizing objects i.e., obstacles on the road.
Earlier, computer-based algorithms have been followed for image processing in vehicles which seemed to be applicable
to a certain extent. So much so, now with deep learning approaches, simpler yet faster networks can be implemented
for a safe drive. Automatic vehicles such as Tesla which is examined to be “fully self-driving” nevertheless needs a driver
to watch over the road at some particular point. This proves that there is not yet a fully controlled self-driving car
created which can drive itself without a spectator. This appeal can be solved by means of image detection mechanisms
using neural networks along with a programming language to deploy machine learning models at ease. The main
objective is to develop a simple and accurate algorithm to make image recognition more precise for a better self-driving
car.

Keywords: Image recognition; Self-driving cars; Deep learning; Neural networks; CNN

1. Introduction
Arthur Lee Samuel defines machine learning as the field of study that gives computers the ability to learn and perform
tasks without being explicitly programmed. Of course, a software can be programmed to achieve a function but a new
approach called machine learning followed by deep learning is radically changing how we create software to solve these
problems. There are many different approaches to machine learning but all these different types of learning
use statistical algorithms and data. Deep learning, on the other hand, is just a type of machine learning, inspired by the
structure of a human brain. Deep learning algorithms attempt to draw similar conclusions as humans would, by
simultaneously analysing data with a given radical structure. To achieve the following, deep learning uses a multi-layered
structure of algorithms called neural networks.

For a system to inherit computer vision, it is required to program and train a computer by breaking the task down into
smaller and easier details. The task is as terrifying as it sounds nevertheless, simpler when broken down into a
hierarchical. This is where machine learning and artificial intelligence comes to use. Generally, image recognition is the
ability of a system to identify people, places, objects, and actions in images. It makes the use of technologies involving
machine vision and algorithms with artificial intelligence to recognize and distinguish images through a camera system.

As it is known that a computer sees an image or video in the form of 0s and 1s. It stores the image in the form of
combination of pixels which is the smallest unit in an image. Each pixel contains a various number of channels. A

* Corresponding author: Shreya Muppidi

Copyright © 2023 Author(s) retain the copyright of this article. This article is published under the terms of the Creative Commons Attribution Liscense 4.0.
International Journal of Science and Research Archive, 2023, 09(02), 342–348

grayscale image has only one pixel, whereas a coloured image contains three channels namely red, green, and blue. In a
digital-coloured image, each channel of each pixel has a value between 0 and 255. Each of these values when represented
in binary can be understood by the computer. But the problem is just being able to read the image is of no use if it cannot
understand what it means. This is where machine learning comes into the picture.

1.1. Use case of image recognition in today’s technology

Several different use cases of image recognition are in production and are already deployed on a large scale in various
sectors for example self-driving cars. The gaming field has started using image recognition technology combined with
augmented reality to their advantage, as it helps in providing gamers with a realistic experience. Some of the popular
games using this feature are Pokémon Go, The Walking Dead, Harry Potter: Wizards Unite and many more.

Social networks also use image data which can be analyzed and visualized to comprehend customer preferences, further
this data can be used for customized marketing. A powerful commercial use can be seen in the field of stock photography
and video as well where stock websites provide platforms so photographers and video makers can sell their content.

Google photos which consist of a huge visual dataset uses image recognition along with its subfield deep learning, to
sort millions of images on the internet in order to classify them more accurately. Pinterest uses algorithms to identify
the patterns in a picture that have been pinned so that similar images are displayed when you search for them which
works as an image recommender Similarly, there are many other websites and companies that use this technology
to develop and improvise their services and marketing.

2. Material and methods

Image recognition is the creation of a neural network that processes all the pixels that build to be a complete image. All
image recognition models start with encoders which are made of blocks of layers that learn specific patterns of the
pixels of images that correspond to their labels. These encoders then give outputs of confidence scores for every input
label provided which varies depending on the type of class of the image recognition system. Every machine learning
model requires one common element in order to process the whole operation which is data in the form of datasets.
Popular image recognition benchmark datasets include CIFAR, ImageNet, COCO, Open Images, and many other datasets.
With the help of these datasets the model can be trained and tested to its best accuracy.

2.1. Convolutional Neural Networks (CNNs)

Convolutional Neural Networks (CNNs) are one of the most popular deep learning neural network methods. As deep
learning deals with large amounts of data when compared to any other approach it makes CNN even more applicable in
many real-life applications. Any deep learning model requires a large amount of data to train which also requires a lot
of computing resources. This was a major drawback for CNNs during that time and hence they were only limited to the
postal sectors and it failed to enter the world of machine learning. But now with the growing technology and increasing
problem statements, more data is collected and stored into datasets which will thereby be used for making machine
learning models.

These neural networks are composed of multiple layers of artificial neurons. The first layer usually extracts basic
features such as horizontal or diagonal edges. The corresponding output is passed on to the next layer which detects
more complex features such as corners or combinational edges. As we move deeper into the neural network, it can
identify even more complex features such as objects, faces, etc. Alongside the input and output, the CNN consists of
hidden layers which typically consist of a series of convolutional layers. ReLU is the typical activation function which is
thereby followed by additional operations such as pooling layers, fully connected layers, and normalization layers.
Backpropagation is a method used for error distribution and weight adjustment. The efficient use of convolutional
neural networks depends on more layers and larger networks i.e., larger the dataset, higher the efficiency of CNNs.

343
International Journal of Science and Research Archive, 2023, 09(02), 342–348

Figure 1 Structure of a convolutional neural network

2.2. Tools used in making an image recognition model:

Other than programming languages like Python, R and Java and basic Python libraries like NumPy, Matplotlib, Pandas,
SciPy, etc. there are certainly other libraries and software which help in building an image recognition model in a much
more accurate and faster way. Some of them are listed below:

 OpenCV is a large open-source, cross platform library for computer vision, machine learning, and image
processing. Currently now it plays a major role in real-time operation which is very important in today’s
technology. One can process images and videos to identify objects, faces, or even handwriting of a human by
using this library. The identification of image patterns and its other features is achieved by using vector space
and performing mathematical operations. Using OpenCV we can also analyse videos and estimate the motion
in it, subtract the background, and finally track objects in the video.
 PyTorch is an open-source framework that eases one’s way to develop machine learning models and deploy
them to production. PyTorch provides dynamic computation graphs and libraries for distributed training,
which are tuned for high performance on AWS. Thus, it is one of the most popular deep learning libraries
competing with Keras and TensorFlow. Using PyTorch, one can process images and videos to develop a highly
accurate and precise computer vision model.
 YOLO is an acronym for "you only look once". It basically is a real time object detection system. Instead of using
classifiers to perform detection of objects, we frame object detection as a regression problem to spatially
separated bounding boxes and associated class probabilities. A single neural network predicts bounding boxes
and class probabilities directly. The biggest advantage of YOLO is its incredible speed. It is very fast and can
process 45 frames per second. It has been used in various applications to detect traffic signals, people, parking
meters, animals, and other objects.

2.3. Procedural Method

In order to build an image recognition model, we can use conventional statistical approaches such as decision trees or
support vector machines but the disadvantage is that it is very time consuming and inaccurate. In today’s time, the state-
of-the-art method used is convolutional neural networks (CNNs). The larger the dataset, the higher is the performance
of the model especially for deep neural networks. Based on the recent challenges, a CNN model made predictions on
millions of pictures with 1000 classes which is close to the performance of an actual human being.

To build an image recognition model using CNN, we follow the listed steps:

 Import the required libraries

 Load and normalise the train and test datasets
 Define the convolutional neural network
 Define the loss function and optimizer
 Train the model on the train data
 Test the model on the test data
The dataset used in building this image recognition model is the CIFAR 10 dataset which consists of 10 classes as it says
in the name.

344
International Journal of Science and Research Archive, 2023, 09(02), 342–348

2.4. Image Classifier model using CNN

Figure 2 Importing and installing libraries

Figure 3 Loading data

345
International Journal of Science and Research Archive, 2023, 09(02), 342–348

Figure 4 Normalizing the test and training datasets

Figure 5 Defining the CNN class

346
International Journal of Science and Research Archive, 2023, 09(02), 342–348

Figure 6 Defining the optimizer

Figure 7 Running the loss factor and training the model

Figure 8 Testing the model

3. Results and Discussion

By the means of PyCharm, an image classifier has been built by using one of the deep learning approaches i.e.,
convolutional neural networks. In the above model, a dataset called CIFAR 10 has been imported and split into training
and test data which thereby was trained using the CNN algorithm to correctly detect and recognize the corresponding
object viewed. The accuracy of the network on the 10000 images can be approximated using the below code which was
about 75%, relevant enough for a first trial image classifier.

347
International Journal of Science and Research Archive, 2023, 09(02), 342–348

Figure 9 Determining the accuracy of the model

4. Conclusion
This method of detecting images through computer vision in self-driving cars has proven to be much more accurate
than regular machine learning outlooks. The deep learning algorithm would perform a classification of the images by the
extraction of the features. Whereas manually, a programmer must explicitly meddle in the action for the model to come
to a conclusive statement. Although convolutional neural network models seem to showcase great performance, they all
have their own consequences to address. For example, during computer vision, when an object viewed from a certain
angle slightly different from the way the model has been trained, it may cause false predictions. This may lead the vehicle
to end up in taking unnecessary actions during the drive. Alongside the position of the image of the object, ideal lighting
as well as colour contrast can also affect the model to flaw. However, this can be solved by adding different variations
to the image during the training process which is otherwise known as data augmentation. Nonetheless, this all brings
us back to collecting and analysing proper data before feeding it to the training set.

Although the self-driving cars created in today’s day to day life have plenty of features, they still have not been developed
to its highest extreme. Soon driverless cars will be developed where people can pretty much take a small nap while
taking a ride to their destination without worrying about the road.

References
[1] Hornigold, Thomas Building a Moral Machine: Who Decides the Ethics of Self Driving Cars? . Singularity Hub, (31
October 2018).
[2] Dr. Sebastian Raschka Chapter 1: Introduction to Machine Learning and Deep Learning .5 August 2020.
Retrieved 28 October 2020.
[3] Phantom Auto will tour city . The Milwaukee Sentinel. 8 December 1926. Retrieved 23 July 2013.
[4] S. Mittal and S. Vaishay A Survey of Techniques for Optimizing Deep Learning on GPUs Archived 2021-05-09 at
the Wayback Machine , Journal of Systems Architecture, 2019.
[5] Mesnil, Gregoire; Deng, Li; Gao, Jianfeng; He, Xiaodong; Shen, Yelong Learning Semantic Representations Using
Convolutional Neural Networks for Web Search – Microsoft Research , Microsoft Research (April 2014).
[6] Takeo Kanade Three-Dimensional Machine Vision. Springer Science & Business Media. ISBN 978-1-4613-1981-
8 (6 December 2012).
[7] Margaret Ann Boden Mind as Machine: A History of Cognitive Science. Clarendon Press. p. 781. ISBN 978-0-19-
954316-8 (2006).
[8] Berton cello, Ten ways autonomous driving could redefine the automotive world . McKinsey & Company.

348

Cat and Dog Classification Using CNN Fin
No ratings yet
Cat and Dog Classification Using CNN Fin
34 pages
Visual Image Understanding
No ratings yet
Visual Image Understanding
7 pages
Dip 7
No ratings yet
Dip 7
4 pages
Brand Logo Detection Using Convolutional Neural Network IJERTCONV6IS13121
No ratings yet
Brand Logo Detection Using Convolutional Neural Network IJERTCONV6IS13121
4 pages
Report About Neural Network For Image Classification
No ratings yet
Report About Neural Network For Image Classification
51 pages
Unit I
No ratings yet
Unit I
10 pages
Image Classification Using Resnet
No ratings yet
Image Classification Using Resnet
28 pages
Deep Learning: An Overview of Convolutional Neural Network (CNN)
No ratings yet
Deep Learning: An Overview of Convolutional Neural Network (CNN)
54 pages
Admin,+4554 Article+Text 17736 2 10 20210928
No ratings yet
Admin,+4554 Article+Text 17736 2 10 20210928
13 pages
Computation 11 00052
No ratings yet
Computation 11 00052
24 pages
249 254Tesma601IJEAST
No ratings yet
249 254Tesma601IJEAST
7 pages
Image Classification Using Small Convolutional Neural Network
No ratings yet
Image Classification Using Small Convolutional Neural Network
5 pages
4 100593163merged
No ratings yet
4 100593163merged
11 pages
A Brief Survey and An Application of Sem
No ratings yet
A Brief Survey and An Application of Sem
38 pages
IJCRT2210371
No ratings yet
IJCRT2210371
4 pages
CH 8
No ratings yet
CH 8
42 pages
Image Recognition Using Machine Learning Research Paper
No ratings yet
Image Recognition Using Machine Learning Research Paper
5 pages
2 Deep Learning in Image Classification A Survey Report
No ratings yet
2 Deep Learning in Image Classification A Survey Report
4 pages
Theories, Detection Methods, and Opportunities of Fake News Detection
No ratings yet
Theories, Detection Methods, and Opportunities of Fake News Detection
4 pages
Object Detection Using Convolutional Neural Network Transfer Learning
No ratings yet
Object Detection Using Convolutional Neural Network Transfer Learning
11 pages
Computer Vision Is A Field of Artificial Intelligence
No ratings yet
Computer Vision Is A Field of Artificial Intelligence
2 pages
Project Report
No ratings yet
Project Report
16 pages
Deep Learning Applications and Image Processing
No ratings yet
Deep Learning Applications and Image Processing
5 pages
A Survey On Computer Vision Algorithms
No ratings yet
A Survey On Computer Vision Algorithms
16 pages
AI Training2024Haile
No ratings yet
AI Training2024Haile
37 pages
Deep Learning
100% (3)
Deep Learning
32 pages
Artificial Intelligence (AI)
No ratings yet
Artificial Intelligence (AI)
6 pages
Handwritten Digit Recognition Roadmap
No ratings yet
Handwritten Digit Recognition Roadmap
17 pages
(IJCST-V10I5P12) :mrs J Sarada, P Priya Bharathi
No ratings yet
(IJCST-V10I5P12) :mrs J Sarada, P Priya Bharathi
6 pages
A Review of Advances in Image Recognition Models F
No ratings yet
A Review of Advances in Image Recognition Models F
5 pages
Sagar Paper
No ratings yet
Sagar Paper
4 pages
Feature Extraction Using Convolution Neural Networks (CNN) and Deep Learning
No ratings yet
Feature Extraction Using Convolution Neural Networks (CNN) and Deep Learning
5 pages
Topic Ai: Submitted by Sheharbano
No ratings yet
Topic Ai: Submitted by Sheharbano
7 pages
Ijet 10892
No ratings yet
Ijet 10892
5 pages
Workspace
No ratings yet
Workspace
19 pages
ML Unit 4
No ratings yet
ML Unit 4
16 pages
Paper 12
No ratings yet
Paper 12
3 pages
Reading+10+ +Introduction+to+Deep+Learning
No ratings yet
Reading+10+ +Introduction+to+Deep+Learning
21 pages
JNTUK R20 UNIT-IV DEEP LEARNING TECHNIQUES-www - Jntumaterials.co - in
No ratings yet
JNTUK R20 UNIT-IV DEEP LEARNING TECHNIQUES-www - Jntumaterials.co - in
26 pages
The Quiet Revolution in Machine Vision
No ratings yet
The Quiet Revolution in Machine Vision
19 pages
Bundled
No ratings yet
Bundled
12 pages
DL Unit3 1
No ratings yet
DL Unit3 1
67 pages
Whitepaper AI Machine Learning Impacting DI Investigations AUG 2020
No ratings yet
Whitepaper AI Machine Learning Impacting DI Investigations AUG 2020
11 pages
Study Material BTech IT VIII Sem Subject Deep Learning Deep Learning Btech IT VIII Sem
No ratings yet
Study Material BTech IT VIII Sem Subject Deep Learning Deep Learning Btech IT VIII Sem
30 pages
Image Classification Using Convolutional Neural Networks
No ratings yet
Image Classification Using Convolutional Neural Networks
8 pages
11.theoretical Understanding of Convolutional Neural Network Concepts, Architectures, Mohammad Mustafa Taye, 2023
No ratings yet
11.theoretical Understanding of Convolutional Neural Network Concepts, Architectures, Mohammad Mustafa Taye, 2023
23 pages
Research On Application of Deep Learning Algorithm in Image Classification (2021)
No ratings yet
Research On Application of Deep Learning Algorithm in Image Classification (2021)
4 pages
Deep Learning File
No ratings yet
Deep Learning File
60 pages
Building CNN Model - Formatted Paper
No ratings yet
Building CNN Model - Formatted Paper
7 pages
Max78000 Article Series Part 1
No ratings yet
Max78000 Article Series Part 1
4 pages
Animal Classification pAPER
No ratings yet
Animal Classification pAPER
7 pages
Full Document - Fake News Detection
No ratings yet
Full Document - Fake News Detection
69 pages
Intro To AI
No ratings yet
Intro To AI
44 pages
Unit 5
No ratings yet
Unit 5
136 pages
Unit-5 DL
No ratings yet
Unit-5 DL
35 pages
Unit 3
No ratings yet
Unit 3
105 pages
Machine Learning Unit - 1
No ratings yet
Machine Learning Unit - 1
7 pages
Deep Convolutional Neural Networks: Structure, Feature Extraction and Training
No ratings yet
Deep Convolutional Neural Networks: Structure, Feature Extraction and Training
8 pages
UNIT-2 DL
No ratings yet
UNIT-2 DL
51 pages
Xchapter 4 Mapping Research Methods
No ratings yet
Xchapter 4 Mapping Research Methods
4 pages
Energies 16 07015 v2
No ratings yet
Energies 16 07015 v2
31 pages
Blackbook Finalversion
No ratings yet
Blackbook Finalversion
39 pages
Cross-Cultural Expectations From Self-Driving Cars
No ratings yet
Cross-Cultural Expectations From Self-Driving Cars
11 pages
Theexternalprimaryfocuswithincardrivingsole Lyencompassesthemovementsofthecar
No ratings yet
Theexternalprimaryfocuswithincardrivingsole Lyencompassesthemovementsofthecar
8 pages
AUTOMATEDCAR
No ratings yet
AUTOMATEDCAR
13 pages
Practice and Exploration of Music Solfeggio Teachi
No ratings yet
Practice and Exploration of Music Solfeggio Teachi
9 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Image Recognition in Self-Driving Cars Using CNN

Uploaded by

Image Recognition in Self-Driving Cars Using CNN

Uploaded by

Image recognition in self-driving cars using CNN

International Journal of Science and Research Archive, 2023, 09(02), 342–348

Article DOI: https://doi.org/10.30574/ijsra.2023.9.2.0574

* Corresponding author: Shreya Muppidi

1.1. Use case of image recognition in today’s technology

2. Material and methods

2.1. Convolutional Neural Networks (CNNs)

Figure 1 Structure of a convolutional neural network

2.2. Tools used in making an image recognition model:

2.3. Procedural Method

 Import the required libraries

2.4. Image Classifier model using CNN

Figure 2 Importing and installing libraries

Figure 3 Loading data

Figure 4 Normalizing the test and training datasets

Figure 5 Defining the CNN class

Figure 6 Defining the optimizer

Figure 7 Running the loss factor and training the model

Figure 8 Testing the model

3. Results and Discussion

Figure 9 Determining the accuracy of the model

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.