Project LiteratureSurvey Presentation
Project LiteratureSurvey Presentation
PROJECT GUIDE :
We hope to use this project to help the visually impaired and the elderly to
better understand what’s in front of them. This is also a huge leap for research
areas and robot training.
Semantic consistency are attribute values for the same logical concepts expressed and defined the
same way throughout the system/context.
● Analyzed the connections between simple object detection and a knowledge graph network.
● Claimed that semantic consistency can be applied to ‘any’ object detection model and increase
performance using transfer learning.
● The claim can not be reproduced and confirmed for the knowledge graph approaches.
“[Re] Object Detection Mee COCO, VGG -16 Multiple Object detection with Failed to showcase or put out
ts Knowledge Graphs - Sup bounding boxes and labels any code implementation of the
porting Datasets” same.
“Few-Shot Object R-CNN, Few shot learning - Generation of scene graphs. -Failed to showcase or put out
Detection Method Based on -Combined visual any code implementation of the
same.
Knowledge Reasoning “ information and knowledge
reasoning methods. - Limited to general semantic
structures of knowledge graphs.
“Object detection meets MSCOCO 15 - Quantify semantic -Only with the re-optimization
knowledge graphs “ PASCAL 07 consistency from knowledge using KG-CNet it identifies all
objects correctly otherwise it
graphs that can generalize to
identifies the objects partially.
new images with unobserved
contexts.
● Graph Search Neural Network (GSNN) is used which focuses on the features from
the image and selects relevant subsets of the input graph and predicts the output on
the various nodes.
● To propagate the beliefs from the initial nodes to the adjacent nodes, a method called
a per-node scoring function that tells how important the node is.
● The nodes of the graph could have represented anything from human relationships
to a computer program.
● Visual Genome is used in which each image is labeled with objects, attributes, and
relationships between objects entered by human annotators as well as the COCO
dataset.
The More You Know: Faster R-CNN, -Graph Search Neural -The GSNN and the framework
COCO, Visual Network (GSNN) is used that is used for vision problems
Using Knowledge Graphs Genome based on input and only is completely general
for Image Classification
chooses to expand nodes
that are useful for the
final output.
-All models are trained
with Stochastic Gradient
Descent, except GSNN
which is trained using
ADAM.
● KVCD,this model combines external knowledge and visual features to determine the
global contextual information in a scene and thus generates a more balanced scene
graph.
● Feature extraction module uses object detection to determine the visual features of an
image.
● Relational reasoning module uses the visual features extracted by the previous
module and applies a novel approach to combine external knowledge with these visual
features.
● Visual dependency constraint module applies two losses to balance the model’s
reliance on the two types of knowledge applied in the relational reasoning module.
● The objectives are to detect objects in an image, to identify the relationships between
object pairs, and to use graph structures to visualize these objects and their
relationships. DEPARTMENT OF COMPUTER SCIENCE, SJEC
6. Knowledge-Based Scene Graph Generation with Visual Contextual
Dependency
Dataset/ tools used Approaches Limitations
Knowledge-Based Scene Visual Genome, -given an image, detect -the computational complexity
R-CNN object of method is high.
Graph Generation with the object categories and
detector network
Visual Contextual with a their relationships. -due to significant categories
Dependency ResNeXt-101-FPN -The object features and imbalance in the dataset, the
backbone, bounding boxes in the model over-fits large sample
VSPNET. relationships & may ignore
input images are extracted
by the feature extraction small-sample relationships.
module.
-The visual relationship
is introduced to learn and
adjust the visual
dependencies in the
model
Artificial intelligence for inter-object statistical - Related to the research areas of - Failed to reduce the
Medical Imaging information to initialize object detection architectures computational complexity
the edge weights in the - Especially their classification
graph structure. losses and the usage of -Failed to obtain unbiased
commonsense knowledge in features
computer vision.
Contrastive Object Using a systematic -Reasoning over knowledge - The KBX-systems that
Detection Using literature review graphs can be performed by analyzed exploit knowledge
Knowledge Graph Methodology, COCO means of standard graphs as silos of facts.
Embeddings dataset was used knowledge representation. - From which relevant triples
- formalisms allowing to are aggregated to support or
describe and label entities and explain a given observation,
without following any
the relationships between them.
particular semantic structure
As humans we are able to understand the scene because of the general common sense
and the reasoning ability that is present within us.
It is difficult for a machine to do so.
We need a system that can derive conclusions about the scene by depending on visual
context and knowledge graphs.
Tools used for object detection R-CNN, VGG-16 , COCO , YOLO datasets
2.Wang, Jianwei, and Deyun Chen. "Few-Shot Object Detection Method Based on Knowledge Reasoning." Electronics 11, no. 9 (2022): 1327.
3.Fang, Yuan, Kingsley Kuan, Jie Lin, Cheston Tan, and Vijay Chandrasekhar. "Object detection meets knowledge graphs." International Joint Conferences on
Artificial Intelligence, 2017.
4.Kinasih, Fabiola Maria Teresa Retno, Carmadi Machbub, Lenni Yulianti, and Arief Syaichu Rohman. "Centroid-Tracking-Aided Robust Object Detection for
Hospital Objects." In 2020 6th International Conference on Interactive Digital Media (ICIDM), pp. 1-5. IEEE, 2020.
5.Marino, Kenneth, Ruslan Salakhutdinov, and Abhinav Gupta. "The more you know: Using knowledge graphs for image classification." arXiv preprint
arXiv:1612.04844 (2016).
6.Zhang, Lizong, Haojun Yin, Bei Hui, Sijuan Liu, and Wei Zhang. "Knowledge-Based Scene Graph Generation with Visual Contextual Dependency."
Mathematics 10, no. 14 (2022): 2525.
7.Tran, Khanhvi, Johan Peter Bøtker, Arash Aframian, and Kaveh Memarzadeh. "Artificial intelligence for medical imaging." In Artificial Intelligence in
Healthcare, pp. 143-162. Academic Press, 2020.
8.Lang, Christopher, Alexander Braun, and Abhinav Valada. "Contrastive object detection using knowledge graph embeddings." arXiv preprint arXiv:2112.11366
(2021).