Computer Vision Exam Questions English
Computer Vision Exam Questions English
Based on Lectures 6, 7, 8, 9
I. Essay Questions
3. Describe the concept of Residual Blocks in ResNet, and how Skip connections
help solve the Vanishing Gradient problem. (4 marks)
2. Describe the concept of Region Proposals in object detection, and how this
technique evolved from Selective Search to Region Proposal Networks (RPN).
(4 marks)
1. Explain the evolution from R-CNN to Fast R-CNN to Faster R-CNN, highlighting
the main improvements in each version and their impact on performance. (5
marks)
3. Explain the architecture of the SSD (Single Shot Detector) network and how it
differs from the Faster R-CNN architecture, highlighting the advantages and
disadvantages of each. (4 marks)
1. Convolutional layers in CNNs are used for feature extraction while fully connected
layers are used for classification. (True)
2. Dropout layers are more effective when used in input layers compared to hidden
layers. (False)
3. Global Average Pooling calculates the average of all pixel values in the feature map,
which significantly reduces the network dimensions. (True)
4. When using Backpropagation in training convolutional neural networks, the
weights of convolutional layers are updated before the weights of fully connected
layers. (False)
1. AlexNet uses ReLU activation function instead of the tanh function used in LeNet-5,
resulting in faster network training. (True)
2. VGGNet is characterized by using large filters (7×7 and 5×5) in the first
convolutional layers to extract more complex features. (False)
4. Skip connections in ResNet improve information flow through the network, helping
to solve the Vanishing Gradient problem. (True)
5. When using Transfer Learning, all layers of the pretrained network must always be
retrained to achieve the best results. (False)
1. Object Detection includes two main tasks: Object Localization and Object
Classification. (True)
2. Selective Search algorithm is faster and more accurate than Region Proposal
Network in generating regions of interest. (False)
4. Intersection Over Union (IOU) is more important than Mean Average Precision
(MAP) in evaluating the performance of object detection models. (False)
5. R-CNN networks use SVM classifier for classification while Fast R-CNN networks use
softmax layer. (True)
Lecture 9: Object Detection (continued)
1. Fast R-CNN applies the CNN feature extractor to each region of interest (RoI)
separately, which improves detection speed compared to R-CNN. (False)
3. One-stage Detectors like SSD are slower but more accurate than Two-stage
Detectors like Faster R-CNN. (False)
4. SSD uses multi-scale feature layers to detect objects of different sizes. (True)
5. All modern object detection models are trained from scratch without relying on
pretrained networks for feature extraction. (False)
5. d) Depth
11. What problem does the Dropout Layer address in neural networks?
12. a) Underfitting
13. b) Vanishing Gradient
14. c) Overfitting
6. What is the main advantage of using 3×3 filters in VGGNet instead of larger
filters?
11. Which main component in ResNet helps solve the Vanishing Gradient problem?
16. Which of the following Transfer Learning approaches involves freezing some
layers of the pretrained network and training both the non-frozen layers and
the newly added classifier layers?
8. What is the first step in the R-CNN algorithm for object detection?
1. What was the main improvement introduced by Fast R-CNN compared to the
original R-CNN?
3. What is the main advantage of One-stage Detectors like SSD compared to Two-
stage Detectors like Faster R-CNN?
3. Pooling layers help reduce the _ of the network by reducing the number of _
passed to the next layer, and there are two main types of pooling layers: _ and _.
4. _ occurs when the model fails to fit the training data, while _ occurs when the
model memorizes the training data instead of learning the actual features.
1. _ network was developed by Lecun and his team in 1998, and it uses _
activation function instead of the ReLU function used currently.
5. Transfer Learning means transferring what a neural network has learned from
being trained on a specific _ to another related problem, and there are three
major transfer learning approaches: _, _, and _.
1. Object Detection is a computer vision task that involves two main tasks: _ and _ of
objects within an image.
2. Object detection methods are divided into two types: object detection using _ and
object detection using _.
2. Faster R-CNN uses _ network instead of Selective Search algorithm, and this
network is known as an _ network because it guides the network's attention to
interesting regions in the image.
3. Detection approaches are divided into two types: _ detectors where detection
happens in two stages, and _ detectors that skip the region proposal stage and
run detection directly.
4. The architecture of the SSD model is composed of three main parts: _ to extract
feature maps, _ which is a series of convolution filters, and ____ used to eliminate
overlapping boxes and keep only one box for each object detected.