Engineering Synopsis of
Image Captioning using Deep Learning
Introduction:
People communicate through language, whether written or spoken, and they often use this language to describe the visual world around them. Images and signs are another means of communication and understanding, particularly for physically challenged people. Automatically generating a description of an image in well-formed sentences is a difficult and challenging task, but it can have a great impact on visually impaired people by helping them better understand the images found on the web.
To make this happen, we will combine image and text processing to build a useful Deep Learning application: Image Captioning. Image Captioning refers to the process of generating a textual description of an image, based on the objects and actions in the image.
The objective of this project is to create a system that detects what is happening in an image without being explicitly told what is happening. This can be applied in social media systems, where the machine automatically suggests what the user might write based on an image, or it can be used to explain to blind people what an image is about. This project will be combined with a Flask-based web application.
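The following is a minimal Python sketch of how such a Flask front end could be wired up. The route, the upload folder, the upload.html template, and the generate_caption() placeholder are assumptions made for this synopsis, not the final implementation.

# Minimal Flask sketch of the web front end (illustrative only).
# The generate_caption() helper and the upload.html template are
# placeholders -- the trained captioning model is wired in later.
import os
from flask import Flask, request, render_template

app = Flask(__name__)
UPLOAD_DIR = "uploads"
os.makedirs(UPLOAD_DIR, exist_ok=True)

def generate_caption(image_path):
    # Placeholder: the trained CNN-RNN model would be invoked here.
    return "a caption describing " + os.path.basename(image_path)

@app.route("/", methods=["GET", "POST"])
def index():
    caption = None
    if request.method == "POST":
        image = request.files["image"]
        path = os.path.join(UPLOAD_DIR, image.filename)
        image.save(path)
        caption = generate_caption(path)
    return render_template("upload.html", caption=caption)

if __name__ == "__main__":
    app.run(debug=True)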
Feasibility Study: -
Image captioning is a popular research area of Artificial Intelligence (AI) that deals with understanding an image and generating a language description for it. Image understanding requires detecting and recognizing objects, as well as understanding the scene type or location, object properties, and their interactions. Generating well-formed sentences requires both syntactic and semantic understanding of the language.
In traditional machine learning, hand-crafted features such as Local Binary Patterns (LBP), Scale-Invariant Feature Transform (SIFT), Histogram of Oriented Gradients (HOG), and combinations of such features are widely used. In these techniques, features are extracted from the input data and then passed to a classifier such as a Support Vector Machine (SVM) in order to classify an object. Since hand-crafted features are task specific, extracting features from a large and diverse set of data is not feasible. Moreover, real-world data such as images and videos are complex and have different semantic interpretations.
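For reference, a minimal Python sketch of such a traditional pipeline is shown below. It assumes scikit-image for HOG extraction and scikit-learn for the SVM; the image size and HOG parameters are illustrative only.

# Sketch of the traditional pipeline described above: hand-crafted
# HOG features fed to an SVM classifier (illustrative assumptions).
import numpy as np
from skimage.color import rgb2gray
from skimage.feature import hog
from skimage.transform import resize
from sklearn.svm import SVC

def extract_hog(image):
    # Resize to a fixed shape so every image yields a feature vector
    # of the same length, then compute HOG descriptors on grayscale.
    gray = resize(rgb2gray(image), (128, 128))
    return hog(gray, orientations=9, pixels_per_cell=(8, 8),
               cells_per_block=(2, 2))

def train_svm(images, labels):
    # images: list of RGB arrays; labels: list of class labels.
    features = np.array([extract_hog(img) for img in images])
    clf = SVC(kernel="linear")
    clf.fit(features, labels)
    return clf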
On the other hand, in deep learning based techniques, features are learned automatically from the training data, and these techniques can handle large and diverse sets of images and videos. For example, Convolutional Neural Networks (CNNs) are widely used for feature learning, and a classifier such as Softmax is used for classification. The CNN is generally followed by a Recurrent Neural Network (RNN) in order to generate captions.
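A minimal Keras sketch of this CNN-plus-RNN arrangement is given below. The feature dimension, vocabulary size, and caption length are illustrative assumptions, and the image features are assumed to be extracted beforehand with a pre-trained CNN such as InceptionV3.

# Minimal Keras sketch of the CNN + RNN captioning architecture
# described above (encoder-decoder "merge" style). All sizes are
# illustrative assumptions, not final design choices.
from tensorflow.keras.layers import Input, Dense, Embedding, LSTM, Dropout, add
from tensorflow.keras.models import Model

VOCAB_SIZE = 5000      # assumed vocabulary size
MAX_LEN = 34           # assumed maximum caption length
FEATURE_DIM = 2048     # e.g. InceptionV3 penultimate-layer features

# Image branch: pre-extracted CNN features projected to 256 dims.
img_input = Input(shape=(FEATURE_DIM,))
img_dense = Dense(256, activation="relu")(Dropout(0.5)(img_input))

# Text branch: the partial caption encoded by an embedding + LSTM.
txt_input = Input(shape=(MAX_LEN,))
txt_embed = Embedding(VOCAB_SIZE, 256, mask_zero=True)(txt_input)
txt_lstm = LSTM(256)(Dropout(0.5)(txt_embed))

# Merge both branches and predict the next word with a softmax.
decoder = Dense(256, activation="relu")(add([img_dense, txt_lstm]))
output = Dense(VOCAB_SIZE, activation="softmax")(decoder)

model = Model(inputs=[img_input, txt_input], outputs=output)
model.compile(loss="categorical_crossentropy", optimizer="adam")

At inference time the decoder is called word by word, feeding each predicted word back into the text branch until an end token or the maximum caption length is reached.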
The software we are using to implement our image captioning system is Spyder, which is simple, fun, and productive.
The approach we will be using for this deep learning project is as follows:
We start with requirement gathering, followed by the feasibility study. Coding then begins and continues for 3 weeks.
1st Member: -
(Raunak Jalan): Designing the Module, Coding and Testing, API design
2nd Member: -
(Bhuvaneshwar Choudhary): Requirement Gathering & Analysis, Coding/Testing
Innovation in Project:
The first challenge stems from the compositional nature of natural language and visual
scenes. While the training dataset contains co-occurrences of some objects in their context, a
captioning system should be able to generalize by composing objects in other contexts.
Traditional captioning systems suffer from a lack of compositionality and naturalness, as they often generate captions in a sequential manner, i.e., the next generated word depends on both the previous word and the image feature. This can frequently lead to syntactically correct but semantically irrelevant language structures, as well as to a lack of diversity in the generated captions. We propose to address the compositionality issue with a context-aware attention captioning model, which allows the captioner to compose sentences based on fragments of the observed visual scenes. Specifically, we use a recurrent language model with gated recurrent visual attention that, at every generation step, gives the choice of attending to either visual or textual cues from the last generation step.
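The following Python sketch illustrates one possible form of the gated visual attention step, written with TensorFlow/Keras. The additive scoring function, the sigmoid gate, and all dimensions are assumptions used for illustration rather than the final design.

# Illustrative sketch of a gated visual attention step: additive
# attention over spatial CNN features, with a gate deciding how much
# to rely on visual versus textual context at this generation step.
import tensorflow as tf
from tensorflow.keras import layers

class GatedVisualAttention(layers.Layer):
    def __init__(self, units):
        super().__init__()
        self.w_feat = layers.Dense(units)    # projects image regions
        self.w_hidden = layers.Dense(units)  # projects decoder state
        self.v = layers.Dense(1)             # attention scores
        self.gate = layers.Dense(1, activation="sigmoid")

    def call(self, features, hidden):
        # features: (batch, regions, feat_dim); hidden: (batch, hid_dim)
        hidden_exp = tf.expand_dims(hidden, 1)
        scores = self.v(tf.nn.tanh(self.w_feat(features) +
                                   self.w_hidden(hidden_exp)))
        weights = tf.nn.softmax(scores, axis=1)
        context = tf.reduce_sum(weights * features, axis=1)
        # Gate in [0, 1]: near 1 the decoder attends to the visual
        # context; near 0 it falls back on the textual cues carried
        # in the recurrent state from the last generation step.
        beta = self.gate(hidden)
        return beta * context, weights

The gated context vector is then concatenated with the word embedding of the previously generated word and fed to the recurrent language model at each step.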
System Requirements:
Python 3.7.2
Software Requirements:
Spyder IDE
Python
Hardware Requirements:
CPU: Intel Pentium 4, 2.53 GHz or equivalent
OS: Microsoft Windows 7, 8.1, 10 / MacOS Mojave (version 10.14)
RAM: 2 GB
Storage: 1.4 GB of free disk space
Bibliography: -
https://www.analyticsvidhya.com/blog/2018/04/solving-an-image-captioning-task-using-deep-learning/
https://www.researchgate.net/publication/329037107_Image_Captioning_Based_on_Deep_Neural_Networks
https://medium.com/swlh/image-captioning-in-python-with-keras-870f976e0f18