CV Expl 21070126001

This document discusses image segmentation using the Cityscapes dataset. It provides context on image segmentation and its applications. It then explains the project, which involves implementing segmentation methods like clustering algorithms, U-Net, and Mask R-CNN on Cityscapes data. A literature review covers papers on segmentation techniques. Metrics to evaluate the methods are also discussed.

Uploaded by

aditya.pande.btech2021

Image Segmentation of Cityscapes Data with U-Net in PyTorch
Aditya Pande
21070126001
AIML A1
Computer Vision Experiential Learning
Introduction to Image Segmentation
• Image segmentation is a fundamental task in computer vision, playing a pivotal role in extracting
meaningful information from images by dividing them into semantically coherent regions. Unlike
object detection, which identifies and localizes objects within an image, segmentation goes a step
further by precisely outlining the boundaries of individual objects or regions. This process is critical
for various applications, ranging from medical imaging and autonomous vehicles to augmented
reality and content-based image retrieval.
Significance in Computer Vision
Applications:
1. Object Recognition and Tracking:
- Image segmentation gives each object in a scene a precise outline, which supports recognizing objects and tracking
them across frames in real-time video streams.

2. Medical Imaging:
- In medical fields, segmentation aids in the accurate delineation of structures and organs, assisting in diagnosis,
treatment planning, and monitoring of diseases.

3. Autonomous Vehicles:
- For autonomous vehicles, accurate segmentation is crucial for understanding the surrounding environment, identifying
road lanes, pedestrians, and other vehicles.

4. Augmented Reality:
- In augmented reality applications, segmentation helps distinguish between the foreground and background, allowing
virtual elements to seamlessly interact with the real world.
Explanation of the Project
• In our project, we focus on image segmentation using the Cityscapes dataset, which contains labeled urban scenes
captured from vehicles in Germany. The dataset provides a challenging yet realistic environment for testing and
evaluating segmentation techniques. Our project involves implementing various image segmentation methods,
covering traditional techniques such as thresholding and clustering algorithms, as well as state-of-the-art deep
learning models like U-Net and Mask R-CNN.

• One aspect of our project involves the application of clustering algorithms such as K-means and DBSCAN to
segment images. These algorithms group pixels based on similarities in color, allowing us to explore their
effectiveness in extracting meaningful regions from the dataset. We will compare the results of clustering algorithms
with traditional and deep learning methods to understand their respective advantages and limitations.

• Our evaluation will not only focus on visual comparisons but will also include quantitative assessments using metrics
such as Intersection over Union (IoU) and Dice Coefficient. These metrics provide insights into the accuracy and
precision of the segmentation methods, aiding in a comprehensive analysis of their performance.

• Additionally, our project aims to explore the trade-offs between traditional and deep learning approaches, taking into
consideration factors such as computational efficiency, robustness to variations, and interpretability. By conducting
this analysis, we seek to contribute insights into the effectiveness of different segmentation techniques, offering a
holistic understanding of the challenges associated with image segmentation in complex urban environments.
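The color-based clustering idea described above can be illustrated with a minimal K-means sketch. It is written in plain Python so that it runs without extra libraries and is not taken from the project code; the function name `kmeans_colors`, the toy pixel list, and the parameter defaults are assumptions made for illustration. DBSCAN is not covered here.

```python
import random

def kmeans_colors(pixels, k, iters=10, seed=0):
    """Cluster RGB pixels into k groups by color; returns (centers, labels)."""
    rng = random.Random(seed)
    centers = rng.sample(pixels, k)  # start from k pixels drawn from the image
    labels = [0] * len(pixels)
    for _ in range(iters):
        # Assignment step: each pixel joins its nearest center (squared Euclidean distance).
        for i, p in enumerate(pixels):
            labels[i] = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])),
            )
        # Update step: move each center to the mean color of its members.
        for c in range(k):
            members = [p for p, lab in zip(pixels, labels) if lab == c]
            if members:  # keep the old center if a cluster ends up empty
                centers[c] = tuple(sum(ch) / len(members) for ch in zip(*members))
    return centers, labels

# Toy "image" (hypothetical data): four dark pixels followed by four bright pixels.
pixels = [(10, 10, 10), (12, 9, 11), (8, 11, 10), (11, 12, 9),
          (240, 240, 240), (238, 241, 239), (242, 239, 240), (239, 240, 242)]
centers, labels = kmeans_colors(pixels, k=2)
```

On this toy input the dark pixels end up in one cluster and the bright pixels in the other. Because the grouping uses color alone, two objects of similar color would share a cluster, which is one reason such methods can struggle on complex street scenes.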
Literature Review on Image Segmentation:

Author: Olaf Ronneberger, Philipp Fischer, Thomas Brox
Title: U-Net: Convolutional Networks for Biomedical Image Segmentation
Result: In this paper, we present a network and training strategy that relies on the strong use of data
augmentation to use the available annotated samples more efficiently.

Author: Vijay Badrinarayanan, Alex Kendall, Roberto Cipolla
Title: SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Result: The novelty of SegNet lies in the manner in which the decoder upsamples its lower resolution
input feature map(s). Specifically, the decoder uses pooling indices computed in the max-pooling step
of the corresponding encoder to perform non-linear upsampling.

Author: Fausto Milletari, Nassir Navab, Seyed-Ahmad Ahmadi
Title: V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation
Result: In this work we propose an approach to 3D image segmentation based on a volumetric, fully
convolutional, neural network.

Author: S. Prabu; J.M. Gnanasekar
Title: A Study on Image Segmentation Method for Image Processing
Result: This paper reviews and analyzes different segmentation algorithms and lists a comparison of all of them.
The comparison study is useful for increasing the accuracy and performance of segmentation methods in various
image processing domains.

Author: Refik Samet; Şahin Emrah Amrahov; Ali Hikmet Ziroğlu
Title: Fuzzy Rule-Based Image Segmentation technique for rock thin section images
Result: In this paper, we propose Fuzzy Rule-Based Image Segmentation technique to segment rock thin
section images.

Author: Ashwani Kumar Yadav; Ratnadeep Roy; Rajkumar; Vaishali; Devendra Somwanshi
Title: Thresholding and morphological based segmentation techniques for medical images
Result: The main objective of this work is to segment the medical image under various conditions and
different backgrounds.

Author: Sharifah Lailee Syed Abdullah; Hamirul'Aini Hambali; Nursuriati Jamil
Title: An accurate thresholding-based segmentation technique for natural images
Result: The traditional thresholding and clustering segmentation techniques that were widely used are Otsu
and K-means.

Author: Annegreet van Opbroek; M. Arfan Ikram; Meike W. Vernooij; Marleen de Bruijne
Title: Transfer Learning Improves Supervised Image Segmentation Across Imaging Protocols
Result: The variation between images obtained with different scanners or different imaging protocols
presents a major challenge in automatic segmentation of biomedical images.
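The SegNet entry above describes a decoder that upsamples using the pooling indices saved by the encoder. The following is a minimal sketch of that idea on a single small feature map, written in plain Python rather than with a deep learning library. The function names `max_pool_2x2` and `max_unpool_2x2` are hypothetical, and the sketch assumes even height and width.

```python
def max_pool_2x2(fmap):
    """2x2 max pooling with stride 2; also records the (row, col) of each maximum."""
    pooled, indices = [], []
    for r in range(0, len(fmap), 2):
        pooled_row, index_row = [], []
        for c in range(0, len(fmap[0]), 2):
            # Collect the four values in this window together with their positions.
            window = [(fmap[r + dr][c + dc], (r + dr, c + dc))
                      for dr in (0, 1) for dc in (0, 1)]
            value, position = max(window, key=lambda item: item[0])
            pooled_row.append(value)
            index_row.append(position)
        pooled.append(pooled_row)
        indices.append(index_row)
    return pooled, indices

def max_unpool_2x2(pooled, indices, height, width):
    """Put each pooled value back at its recorded position; all other cells stay zero."""
    out = [[0] * width for _ in range(height)]
    for pooled_row, index_row in zip(pooled, indices):
        for value, (r, c) in zip(pooled_row, index_row):
            out[r][c] = value
    return out

# Toy 4x4 feature map (hypothetical values).
fmap = [[1, 3, 2, 0],
        [4, 2, 1, 5],
        [0, 1, 7, 2],
        [6, 2, 3, 1]]
pooled, indices = max_pool_2x2(fmap)
restored = max_unpool_2x2(pooled, indices, 4, 4)
```

The restored map is sparse: only the positions that held a window maximum are non-zero. In SegNet, such sparse maps are then passed through trainable convolution layers to produce dense feature maps.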
Acknowledgements

• This dataset is the same as what is available here from the Berkeley AI Research group.

About the License

The Cityscapes data available from cityscapes-dataset.com has the following license:
• This dataset is made freely available to academic and non-academic entities for non-commercial purposes
such as academic research, teaching, scientific publications, or personal experimentation. Permission is
granted to use the data given that you agree:

• That the dataset comes "AS IS", without express or implied warranty. Although every effort has been made
to ensure accuracy, we (Daimler AG, MPI Informatics, TU Darmstadt) do not accept any responsibility for
errors or omissions.

• That you include a reference to the Cityscapes Dataset in any work that makes use of the dataset. For
research papers, cite our preferred publication as listed on our website; for other media cite our
preferred publication as listed on our website or link to the Cityscapes website.

• That you do not distribute this dataset or modified versions. It is permissible to distribute derivative
works in as far as they are abstract representations of this dataset (such as models trained on it or
additional annotations that do not directly include any of our data) and do not allow to recover the
dataset or something similar in character.

• That you may not use the dataset or any derivative work for commercial purposes as, for example,
licensing or selling the data, or using the data with a purpose to procure a commercial gain.

• That all rights not expressly granted to you are reserved by (Daimler AG, MPI Informatics, TU Darmstadt).

Dataset

Context:
Cityscapes data (dataset home page) contains labelled videos taken from vehicles driven in Germany. This
version is a processed subsample created as part of the Pix2Pix paper. The dataset has still images from the
original videos, and the semantic segmentation labels are shown in images alongside the original image.
This is one of the best datasets around for semantic segmentation tasks.

Content:
This dataset has 2975 training image files and 500 validation image files. Each image file is 256x512 pixels,
and each file is a composite with the original photo on the left half of the image, alongside the labeled
image (output of semantic segmentation) on the right half.
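Because each dataset file is a composite, with the original photo on the left half and the labeled image on the right half, a loader has to split every file before training. The sketch below shows that split on a stand-in image stored as a list of pixel rows. It assumes that "256x512" means 256 rows by 512 columns, and the function name `split_composite` is hypothetical.

```python
def split_composite(image):
    """Split a composite image (a list of pixel rows) into (photo, label) halves."""
    half = len(image[0]) // 2
    photo = [row[:half] for row in image]  # left half: original photo
    label = [row[half:] for row in image]  # right half: labeled image
    return photo, label

# Stand-in for one file: 256 rows x 512 columns, each pixel tagged by the side it came from.
composite = [["photo" if col < 256 else "label" for col in range(512)]
             for _ in range(256)]
photo, label = split_composite(composite)
```

Both halves come out as 256x256. With an array library the same split is typically a single column slice. Note also that the right half is a color-coded picture rather than a grid of class ids, so a further step that maps colors to classes is usually needed before training.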
Overall Findings and Observations
• In our exploration of image segmentation techniques using the Cityscapes dataset, we observed distinct
strengths and limitations across traditional and deep learning approaches. Traditional methods, such as
thresholding and clustering algorithms like K-means and DBSCAN, showcased computational efficiency but
struggled with precision, particularly in handling complex scenes.
• Deep learning models, including U-Net and Mask R-CNN, exhibited superior precision, but at the expense of
increased computational demands. Evaluation metrics such as Intersection over Union (IoU) and Dice
Coefficient provided a quantitative perspective, revealing nuanced performance differences among the methods.
• We identified a trade-off between computational efficiency and segmentation precision, emphasizing the need
for a balanced approach tailored to specific application requirements. The robustness of traditional methods to
variations and their interpretability were highlighted, while deep learning models demonstrated superior
generalization.
• Key takeaways include the potential for a hybrid approach integrating the strengths of both methods and the
importance of further research to optimize deep learning models for efficiency without compromising precision.
Additionally, domain-specific adaptations may enhance segmentation performance in diverse urban
environments.
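The two evaluation metrics named above have simple definitions for a pair of binary masks: IoU is the overlap divided by the union, and the Dice Coefficient is twice the overlap divided by the sum of the two mask areas. A minimal sketch in plain Python follows. The function name `iou_and_dice`, the flat 0/1 mask format, and the handling of two empty masks are assumptions for illustration, not details from the project code.

```python
def iou_and_dice(pred, target):
    """IoU and Dice Coefficient for two binary masks given as flat sequences of 0/1."""
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    pred_area = sum(1 for p in pred if p)
    target_area = sum(1 for t in target if t)
    union = pred_area + target_area - intersection
    if union == 0:
        # Both masks are empty; treating this as a perfect match is a convention.
        return 1.0, 1.0
    iou = intersection / union
    dice = 2 * intersection / (pred_area + target_area)
    return iou, dice

# Toy masks (hypothetical): 2 overlapping pixels, 3 predicted, 3 in the ground truth.
pred = [1, 1, 1, 0, 0, 0]
target = [0, 1, 1, 1, 0, 0]
iou, dice = iou_and_dice(pred, target)
```

For these masks the intersection is 2 and the union is 4, so IoU is 2/4 and Dice is 4/6. For a multi-class task such as Cityscapes, a common choice is to compute the metric per class and then average over classes.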
https://github.com/adityapande403/CV_segmentation_UNET_EXPL/tree/main
