Model Questions DWT COMPLETE SOLUTIONS
A type of neural network consisting of multiple layers: an input layer, one or more
hidden layers, and an output layer.
Each layer consists of neurons that apply activation functions to the weighted sum of
inputs.
Architecture: MLP has multiple layers, while a single layer perceptron has only one layer
of output nodes.
Complexity: MLP can model complex relationships due to its depth, whereas a single layer
perceptron can only solve linearly separable problems.
Purpose: MLPs are used for complex tasks such as image or speech recognition, while a
single layer perceptron is limited to simple, linearly separable classification.
3. Explain the concept of a neural network and the role of neurons, weights, and biases?
Neural Network:
A computational model made up of interconnected layers of neurons that learns a mapping
from inputs to outputs.
Components:
Neurons: Basic units that receive inputs, process them, and produce an output.
Weights: Parameters that adjust the strength of the input signals to the neurons.
Biases: Additional parameters that allow the model to fit the data better by shifting the
activation function.
4. What is the cost function? State different cost functions used in Regression and classification.
A measure of how well the model's predictions match the actual data.
Regression: Mean Squared Error (MSE), Mean Absolute Error (MAE).
Classification: Binary Cross-Entropy, Categorical Cross-Entropy.
Gradient Descent:
An optimization algorithm used to minimize the cost function by iteratively adjusting the
weights.
Steps:
1. Compute the gradient of the cost function with respect to the weights.
2. Update the weights in the direction opposite to the gradient.
3. Repeat until convergence.
Types: Batch Gradient Descent, Stochastic Gradient Descent (SGD), Mini-batch Gradient
Descent. A minimal numerical sketch is given below.
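As a minimal sketch of these steps, the loop below applies plain gradient descent to an illustrative quadratic cost J(w) = (w - 3)^2; the cost function and learning rate are assumptions chosen only to make the update rule concrete.

```python
# Hedged sketch: gradient descent on the illustrative cost J(w) = (w - 3)^2.
def gradient(w):
    return 2.0 * (w - 3.0)     # dJ/dw for J(w) = (w - 3)^2

w, lr = 0.0, 0.1
for _ in range(100):           # repeat until (approximate) convergence
    w = w - lr * gradient(w)   # step opposite to the gradient
print(round(w, 4))             # ~3.0, the minimizer of J
```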
MLPs can model complex, non-linear relationships that single-layer perceptrons cannot.
Example:
Classifying images of handwritten digits requires recognizing patterns that are not
linearly separable, which MLP can achieve through multiple layers.
8. Consider a neural network with one input layer, one hidden layer with 2 neurons and one output
layer with one neuron. Assume the neurons have a sigmoid activation function, actual output=1,
learning rate=0.9. The network parameters for the neural network are as follows: inputs x1=0.35,
x2=0.9. Weights and bias: input to hidden layer: w11=0.1, w12=0.3, w21=0.3, w22=0.4. Hidden to
output layer: wh1=0.45, wh2=0.65.
(i) Draw the architecture of the neural network with the given data.
Architecture:
Input Layer: x1, x2
Hidden Layer: h1, h2 (sigmoid)
Output Layer: y (sigmoid)
(ii) Calculate the output of the network in the forward propagation.
Hidden layer: each hidden neuron applies the sigmoid to its weighted sum of x1 and x2.
Output layer: y_pred = sigmoid(wh1*h1 + wh2*h2).
(iii) Calculate the error at the output layer for the actual output Y=0.5.
Error: E = (1/2)(Y - y_pred)^2.
(iv) Calculate the gradients of the weights for the hidden to output layer in the backward
propagation.
Compute the gradient for each weight (wh1) and (wh2) using the chain rule:
dE/dwh_i = (y_pred - Y) * y_pred * (1 - y_pred) * h_i.
(v) Calculate the gradients of the weights for the input to hidden layer in the backward
propagation.
Compute the gradients for weights (w11, w12, w21, w22) using the chain rule and the error
propagated back from the output layer. A worked numerical sketch is given below.
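A minimal numerical sketch for parts (ii)-(v), assuming w11/w21 feed h1, w12/w22 feed h2, no bias terms, and the squared-error loss with target Y = 0.5 (the weight indexing is not stated unambiguously in the question):

```python
# Hedged sketch for Q8: forward pass, output error, and gradients for the 2-2-1 network.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

x1, x2 = 0.35, 0.9
w11, w12, w21, w22 = 0.1, 0.3, 0.3, 0.4      # input -> hidden (assumed indexing)
wh1, wh2 = 0.45, 0.65                        # hidden -> output
Y = 0.5                                      # target used in part (iii)
lr = 0.9

# (ii) Forward propagation
h1 = sigmoid(w11 * x1 + w21 * x2)            # ~0.576
h2 = sigmoid(w12 * x1 + w22 * x2)            # ~0.614
y_pred = sigmoid(wh1 * h1 + wh2 * h2)        # ~0.659

# (iii) Error at the output layer
E = 0.5 * (Y - y_pred) ** 2

# (iv) Gradients for hidden -> output weights (chain rule)
delta_out = (y_pred - Y) * y_pred * (1 - y_pred)
dE_dwh1 = delta_out * h1
dE_dwh2 = delta_out * h2

# (v) Gradients for input -> hidden weights
delta_h1 = delta_out * wh1 * h1 * (1 - h1)
delta_h2 = delta_out * wh2 * h2 * (1 - h2)
dE_dw11, dE_dw21 = delta_h1 * x1, delta_h1 * x2
dE_dw12, dE_dw22 = delta_h2 * x1, delta_h2 * x2

# Weight update rule: w_new = w_old - lr * gradient
print(y_pred, E, dE_dwh1, dE_dwh2, dE_dw11, dE_dw12, dE_dw21, dE_dw22)
```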
Machine Learning:
Involves algorithms that learn from data and make predictions or decisions based on
that data.
Deep Learning:
A subset of machine learning that uses neural networks with many layers (deep
architectures).
Automatically learns features from raw data, reducing the need for manual feature
extraction.
10. Write the significance of validation set in training a deep neural network.
Provides an unbiased evaluation of the model during training, helping to ensure that the
model generalizes well to unseen data.
11. Discuss the methods to avoid overfitting in deep neural network?
Regularization: Techniques like L1 and L2 regularization add a penalty for large weights.
Early Stopping: Monitors validation loss and stops training when it starts to increase.
Data Augmentation: Increases the diversity of the training set by applying
transformations.
Dropout: Randomly deactivates a fraction of neurons during training (see the sketch below).
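A minimal Keras sketch combining these ideas (L2 regularization, dropout, and early stopping); the layer sizes, penalty strength, dropout rate, and random data are illustrative assumptions:

```python
# Hedged sketch: L2 regularization + dropout + early stopping in tf.keras.
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers, regularizers, callbacks

model = tf.keras.Sequential([
    tf.keras.Input(shape=(20,)),
    layers.Dense(64, activation="relu",
                 kernel_regularizer=regularizers.l2(1e-4)),   # L2 penalty on weights
    layers.Dropout(0.5),                                      # randomly drop half the units
    layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

# Early stopping watches the validation loss and halts when it stops improving.
stopper = callbacks.EarlyStopping(monitor="val_loss", patience=5, restore_best_weights=True)

x = np.random.rand(1000, 20).astype("float32")      # dummy data for illustration
y = np.random.randint(0, 2, size=(1000, 1))
model.fit(x, y, validation_split=0.2, epochs=50, callbacks=[stopper], verbose=0)
```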
Proof:
Each neuron applies a non-linear activation function to the weighted sum of its inputs.
Advantages of MLP:
Can model non-linear relationships due to multiple layers and non-linear activation
functions.
Capable of solving complex problems that are not linearly separable, unlike single
perceptrons.
Advantages:
Computationally efficient and helps mitigate the vanishing gradient problem.
Occurs when neurons output zero for all inputs, effectively becoming inactive.
Example: If a neuron has a weight that causes it to always output negative values, it will
never activate.
16. State how Leaky ReLU overcomes the dying ReLU problem.
Leaky ReLU:
Allows a small, non-zero gradient when the unit is not active, preventing neurons from
dying.
Convolution Layers:
Extract features from the input image by applying filters (kernels) that slide over the
input.
Pooling Layers:
Reduce the spatial dimensions of the feature maps, lowering computation and making the
learned features more robust to small translations.
18. Discuss the significance of using padding technique in convolutional layer with suitable example.
Significance of Padding:
Preserves the spatial size of the input, lets the filter process pixels at the borders, and
prevents feature maps from shrinking too quickly in deep networks.
Example: with a 3x3 filter on a 5x5 input, adding 1 pixel of zero padding keeps the output 5x5.
19. Discuss types of padding techniques used in CNN with suitable example.
Types of Padding:
Valid Padding: No padding is applied, resulting in a smaller output size. Example: For a
5x5 filter on a 7x7 input, the output will be 3x3.
Same Padding: Padding is added to ensure the output size matches the input size.
Example: For a 5x5 filter on a 7x7 input, 2 pixels of padding are added, resulting in a 7x7
output.
Full Padding: Adds enough padding to ensure that the filter can slide over every pixel of
the input. Example: For a 5x5 filter on a 7x7 input, 4 pixels of padding are added,
resulting in an 11x11 output.
20. Write the difference between valid padding, same padding and full padding.
Valid Padding:
No padding is applied.
Same Padding:
Padding is added to maintain the same output size as the input size.
Full Padding:
Output size is larger than input size, allowing the filter to cover all input pixels.
21. State and discuss types of pooling in CNN. Which pooling technique is widely used?
Types of Pooling:
Max Pooling: Takes the maximum value from a patch of the feature map. Widely used
for its ability to retain important features.
Average Pooling: Takes the average value from a patch of the feature map. Less common
as it may lose important features.
Global Average Pooling: Averages the entire feature map, often used before the final
classification layer.
Widely Used Technique: Max pooling is the most commonly used pooling technique due to its
effectiveness in retaining dominant features.
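A small NumPy sketch contrasting 2x2 max pooling and average pooling on an illustrative 4x4 feature map:

```python
# Hedged sketch: 2x2 max pooling vs. average pooling (stride 2) on a toy feature map.
import numpy as np

def pool2d(x, size=2, mode="max"):
    """Non-overlapping pooling over a 2-D feature map (H and W divisible by size)."""
    h, w = x.shape
    x = x.reshape(h // size, size, w // size, size)
    return x.max(axis=(1, 3)) if mode == "max" else x.mean(axis=(1, 3))

fmap = np.array([[1, 3, 2, 4],
                 [5, 6, 1, 2],
                 [7, 2, 8, 0],
                 [1, 4, 3, 9]], dtype=float)

print(pool2d(fmap, mode="max"))   # [[6. 4.] [7. 9.]]
print(pool2d(fmap, mode="avg"))   # [[3.75 2.25] [3.5  5.  ]]
```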
Early Stopping:
Training is halted when the validation loss begins to increase, indicating that the model
is starting to overfit the training data.
23. What is Recurrent Neural Network (RNN)? What is the use of it?
A type of neural network designed for sequential data, where connections between
nodes can create cycles.
It maintains a hidden state that captures information about previous inputs, making it
suitable for tasks involving time series or sequences.
Uses:
Natural language processing, speech recognition, and time series prediction.
24. State the limitations of RNN model. How LSTM overcomes the limitations of RNN?
Limitations of RNN: vanishing and exploding gradients during backpropagation through time,
which make it difficult to learn long-range dependencies.
LSTM:
A specialized type of RNN that includes memory cells and gates to control the flow of
information.
Capable of retaining information over long periods, effectively addressing the limitations
of standard RNNs.
25. Differentiate between feed forward neural network and Recurrent Neural Network?
Feed Forward Neural Network: Information flows in one direction, from input to output,
with no cycles and no memory of previous inputs.
Recurrent Neural Network: Connections form cycles, so the hidden state feeds back into the
network and carries information across time steps.
LSTM Network:
It consists of memory cells, input gates, output gates, and forget gates.
How it Works:
Input Gate: Decides which information to keep from the current input.
Forget Gate: Decides which information to discard from the cell state.
Output Gate: Determines what the next hidden state should be based on the cell state.
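A minimal NumPy sketch of a single LSTM cell step, showing how the three gates above combine the input, the previous hidden state, and the cell state; the layer sizes and random weights are illustrative assumptions:

```python
# Hedged sketch: one LSTM cell time step in NumPy (illustrative shapes and weights).
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, U, b):
    """One time step. W: input weights (4h x d), U: recurrent weights (4h x h), b: (4h,)."""
    hidden = h_prev.shape[0]
    z = W @ x_t + U @ h_prev + b
    i = sigmoid(z[0*hidden:1*hidden])   # input gate: what new info to write
    f = sigmoid(z[1*hidden:2*hidden])   # forget gate: what to erase from the cell state
    o = sigmoid(z[2*hidden:3*hidden])   # output gate: what to expose as hidden state
    g = np.tanh(z[3*hidden:4*hidden])   # candidate cell content
    c_t = f * c_prev + i * g            # updated cell state
    h_t = o * np.tanh(c_t)              # new hidden state
    return h_t, c_t

d, h = 3, 4                              # illustrative input and hidden sizes
rng = np.random.default_rng(0)
W, U, b = rng.normal(size=(4*h, d)), rng.normal(size=(4*h, h)), np.zeros(4*h)
h_t, c_t = lstm_step(rng.normal(size=d), np.zeros(h), np.zeros(h), W, U, b)
print(h_t.shape, c_t.shape)              # (4,) (4,)
```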
Sigmoid: Range: (0, 1)
Tanh: Range: (-1, 1)
ReLU: Range: [0, ∞)
Leaky ReLU: Allows a small gradient when inactive, addressing the dying ReLU problem.
Softmax: Converts logits into probabilities for multi-class classification.
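A short NumPy sketch of the activation functions listed above, with their output ranges noted in the comments:

```python
# Hedged sketch: NumPy versions of the common activation functions.
import numpy as np

def sigmoid(x):                  # range (0, 1)
    return 1.0 / (1.0 + np.exp(-x))

def tanh(x):                     # range (-1, 1)
    return np.tanh(x)

def relu(x):                     # range [0, inf)
    return np.maximum(0.0, x)

def leaky_relu(x, alpha=0.01):   # small negative slope avoids "dying" units
    return np.where(x > 0, x, alpha * x)

def softmax(z):                  # logits -> probabilities that sum to 1
    e = np.exp(z - np.max(z))    # subtract max for numerical stability
    return e / e.sum()

x = np.array([-2.0, 0.0, 3.0])
print(sigmoid(x), tanh(x), relu(x), leaky_relu(x), softmax(x))
```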
30. Find the optimal weights of the perceptron which act as an OR gate for the given data keeping bias
(b=0) as fixed. w1=0.6, w2=0.6 and Learning rate(η)=0.5. Draw the resultant perceptron which acts as
an OR gate with the optimal weights calculated.
Resultant Perceptron:
Bias: b=0
31. Find the optimal weights of the perceptron which act as an AND gate for the given data keeping
bias (b=0) as fixed. w1=1.2, w2=0.6 and Learning rate(η)=0.5. Draw the resultant perceptron which
acts as an AND gate with the optimal weights calculated.
Bias: b=0
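A hedged training sketch for Questions 30 and 31 using the perceptron learning rule with the given initial weights, learning rate 0.5, and bias fixed at 0; the activation threshold (output 1 if the weighted sum is at least 1) is an assumption, since the questions do not state it:

```python
# Hedged sketch: perceptron learning rule for the OR and AND gates, bias fixed at b=0.
import numpy as np

def train_perceptron(X, t, w, lr=0.5, threshold=1.0, epochs=20):
    """Update w with the perceptron rule until all patterns are classified correctly."""
    for _ in range(epochs):
        errors = 0
        for x, target in zip(X, t):
            y = 1 if np.dot(w, x) >= threshold else 0   # assumed step activation
            if y != target:
                w = w + lr * (target - y) * x           # perceptron update (bias kept at 0)
                errors += 1
        if errors == 0:
            break
    return w

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
or_targets  = np.array([0, 1, 1, 1])
and_targets = np.array([0, 0, 0, 1])

print("OR  weights:", train_perceptron(X, or_targets,  np.array([0.6, 0.6])))
print("AND weights:", train_perceptron(X, and_targets, np.array([1.2, 0.6])))
```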
Types of RNN:
Vanilla RNN: Basic form of RNN with simple recurrent connections. Example: Simple
sequence prediction tasks.
GRU (Gated Recurrent Unit): A simplified version of LSTM with fewer parameters.
Example: Time series forecasting.
Advantages of LSTM:
Long-Term Memory: LSTMs can remember information for long periods, effectively addressing
the vanishing gradient problem that affects standard RNNs.
Gating Mechanisms: LSTMs use input, output, and forget gates to control the flow of
information, allowing them to learn which information to keep or discard.
Better Performance: LSTMs generally outperform traditional RNNs on tasks involving long
sequences, such as language translation and speech recognition.
Architecture of an Autoencoder:
Encoder: Compresses the input into a lower-dimensional latent representation.
Decoder: Reconstructs the original input from the latent representation.
Loss Function: Measures the difference between the input and the reconstructed
output, typically using Mean Squared Error.
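A minimal Keras sketch of this encoder/decoder/loss structure; the 784-dimensional input (e.g., flattened 28x28 images), the 32-dimensional latent size, and the dummy data are illustrative assumptions:

```python
# Hedged sketch: a minimal dense autoencoder in tf.keras.
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers

inputs = tf.keras.Input(shape=(784,))
latent = layers.Dense(32, activation="relu")(inputs)        # encoder -> latent representation
outputs = layers.Dense(784, activation="sigmoid")(latent)   # decoder -> reconstruction

autoencoder = tf.keras.Model(inputs, outputs)
autoencoder.compile(optimizer="adam", loss="mse")           # reconstruction loss (MSE)

# Train to reproduce the input itself (targets == inputs); dummy data for illustration.
x = np.random.rand(256, 784).astype("float32")
autoencoder.fit(x, x, epochs=2, batch_size=32, verbose=0)
```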
35. What are the key differences between Convolutional Neural Networks (CNNs) and Recurrent
Neural Networks (RNNs)?
Data Type:
CNNs work on grid-like data such as images; RNNs work on sequential data such as text or
time series.
Architecture:
CNNs use convolutional layers to extract features from spatial hierarchies.
RNNs use recurrent connections that maintain a hidden state across time steps.
Processing:
CNNs process inputs in parallel, making them faster for image data.
RNNs process inputs sequentially, which can lead to longer training times.
Dropout:
A regularization technique that randomly sets a fraction of the neurons to zero during
training.
Prevents co-adaptation of neurons, forcing the network to learn more robust features.
Reduces overfitting by ensuring that the model does not rely on any specific set of
neurons.
Regularization:
Techniques that add a penalty to the loss function to discourage complex models.
L1 Regularization: Adds the absolute value of weights to the loss function, promoting
sparsity.
L2 Regularization: Adds the squared value of weights to the loss function, discouraging
large weights.
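Written out, with ( L_0 ) denoting the unregularized loss and ( \lambda ) the regularization strength (symbols introduced here for clarity):
[ L_{L1} = L_0 + \lambda \sum_i |w_i| ]
[ L_{L2} = L_0 + \lambda \sum_i w_i^2 ]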
38. State the difference between validation set and test set. Discuss how validation sets are used in
early stopping the ANN model to combat overfitting.
Validation Set:
A subset of the training data used to tune hyperparameters and monitor model
performance during training.
Test Set:
A separate dataset used to evaluate the final model's performance after training and
validation.
Early Stopping:
Training is halted when the validation loss starts to increase, indicating potential
overfitting.
This ensures that the model retains its ability to generalize to unseen data.
Variants of ReLU:
Leaky ReLU: ( f(x) = x ) if ( x > 0 ) else ( \alpha x ) (where ( \alpha ) is a small constant,
e.g., 0.01)
Occurs when gradients become very small during backpropagation, especially in deep
networks.
Leads to slow or stalled learning, as weights are not updated effectively.
Makes it difficult for the network to learn long-range dependencies, particularly in RNNs.
A situation where neurons output zero for all inputs, effectively becoming inactive.
Example: If a neuron has a weight that causes it to always output negative values, it will
never activate, leading to a loss of information and reduced model capacity.
42. State the mathematical formulas for both tanh and sigmoid functions and describe their range of
outputs.
Tanh: ( \tanh(x) = \frac{e^{x} - e^{-x}}{e^{x} + e^{-x}} ); Range: (-1, 1)
Sigmoid: ( \sigma(x) = \frac{1}{1 + e^{-x}} ); Range: (0, 1)
43. Given a CNN output of Z = [2.1, 5.5, -4.3], calculate the Softmax probabilities for each class.
Let ( S = e^{2.1} + e^{5.5} + e^{-4.3} ). Then:
( P(y_1) = \frac{e^{2.1}}{S} \approx 0.0323 )
( P(y_2) = \frac{e^{5.5}}{S} \approx 0.9676 )
( P(y_3) = \frac{e^{-4.3}}{S} \approx 5.4 \times 10^{-5} )
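The same computation as a short NumPy sketch:

```python
# Hedged sketch: softmax probabilities for Z = [2.1, 5.5, -4.3].
import numpy as np

z = np.array([2.1, 5.5, -4.3])
e = np.exp(z - z.max())        # subtracting the max improves numerical stability
p = e / e.sum()
print(p)                       # ~[0.032, 0.968, 0.00005]
```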
45. Design a CNN for image classification task with 10 classes. The CNN is having CONV1 layer with 8
filters, filter size is 5x5, stride=1, padding=0. CONV1 is followed by a maxpooling layer with filter 2x2.
Conv2 layer is having 16 filters followed by a maxpooling layer.
Input Layer -> CONV1 (8 filters, 5x5, stride=1, padding=0) -> Max Pooling (2x2) ->
CONV2 (16 filters, 5x5, stride=1, padding=0) -> Max Pooling (2x2) ->
Flatten -> Fully Connected Layer -> Output (10 classes, Softmax)
CONV1:
Parameters = (5 * 5 * 3 + 1) * 8 = 608 (assuming a 3-channel input)
CONV2:
Parameters = (5 * 5 * 8 + 1) * 16 = 3216
Fully Connected Layer: Depends on the output size from the last pooling layer.
c) Find the total number of learnable parameters in the above CNN:
Total = 608 + 3216 + (N * 10) (where ( N ) is the number of outputs from the
last pooling layer).
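A short Python sketch of these calculations, assuming a 32x32x3 input image (the input size is not given in the question):

```python
# Hedged sketch for Q45: output shapes and parameter counts under an assumed 32x32x3 input.
def conv_out(size, f=5, stride=1, pad=0):
    return (size - f + 2 * pad) // stride + 1

h = conv_out(32)            # CONV1: 28x28x8
h = h // 2                  # 2x2 max pooling: 14x14x8
h = conv_out(h)             # CONV2: 10x10x16
h = h // 2                  # 2x2 max pooling: 5x5x16

conv1_params = (5 * 5 * 3 + 1) * 8       # 608
conv2_params = (5 * 5 * 8 + 1) * 16      # 3216
n_flat = h * h * 16                      # N = 400 under the assumed input size
fc_params = n_flat * 10                  # 4000 (add 10 more if the FC layer uses biases)

print(conv1_params + conv2_params + fc_params)   # 7824 under these assumptions
```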
Perceptron:
A type of linear classifier that makes its predictions based on a linear predictor function
combining a set of weights with the feature vector.
Introduction of Nonlinearity:
Common activation functions include ReLU, Sigmoid, and Tanh, which allow the network
to learn complex patterns and relationships in the data by transforming the linear
combinations of inputs into non-linear outputs. This enables the CNN to capture
intricate features in the input images, enhancing its ability to perform tasks such as
image classification and object detection.
48. How weights are initialized in neural networks?
Random Initialization: Weights are initialized randomly, often using a uniform or normal
distribution. This helps break symmetry.
Xavier Initialization: Designed for layers with sigmoid or tanh activation functions, it sets
weights to values drawn from a distribution with a mean of 0 and a variance of
( \frac{2}{n_{in} + n_{out}} ), where ( n_{in} ) and ( n_{out} ) are the number of input and
output units, respectively.
He Initialization: Suitable for layers with ReLU activation functions, it initializes weights
from a distribution with a mean of 0 and a variance of ( \frac{2}{n_{in}} ).
Zero Initialization: All weights are initialized to zero, but this is generally avoided as it
leads to symmetry and prevents learning.
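A small NumPy sketch of the Xavier and He schemes following the variance formulas above; the layer sizes are illustrative:

```python
# Hedged sketch: Xavier (Glorot) and He weight initialization in NumPy.
import numpy as np

rng = np.random.default_rng(0)

def xavier_init(n_in, n_out):
    # variance 2 / (n_in + n_out), suited to sigmoid/tanh layers
    return rng.normal(0.0, np.sqrt(2.0 / (n_in + n_out)), size=(n_in, n_out))

def he_init(n_in, n_out):
    # variance 2 / n_in, suited to ReLU layers
    return rng.normal(0.0, np.sqrt(2.0 / n_in), size=(n_in, n_out))

W1 = xavier_init(784, 256)
W2 = he_init(256, 10)
print(W1.std(), W2.std())   # close to the target standard deviations
```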
49. Write the formula for finding the output shape of the convolutional layer with given input size,
filter size, stride, and padding in CNN model.
For a convolutional layer, the output shape can be calculated using the formula:
[ \text{Output Height} = \left\lfloor \frac{\text{Input Height} - \text{Filter Height} + 2 \times \text{Padding}}{\text{Stride}} \right\rfloor + 1 ]
[ \text{Output Width} = \left\lfloor \frac{\text{Input Width} - \text{Filter Width} + 2 \times \text{Padding}}{\text{Stride}} \right\rfloor + 1 ]
The output depth is equal to the number of filters used in the convolutional layer.
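The same formula as a small helper function, checked against the valid-padding example from Question 19:

```python
# Hedged sketch: convolutional output shape from the formula above.
def conv_output_shape(in_h, in_w, f_h, f_w, stride=1, padding=0, n_filters=1):
    out_h = (in_h - f_h + 2 * padding) // stride + 1
    out_w = (in_w - f_w + 2 * padding) // stride + 1
    return out_h, out_w, n_filters     # depth equals the number of filters

print(conv_output_shape(7, 7, 5, 5, stride=1, padding=0, n_filters=8))   # (3, 3, 8)
```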
Batch Normalization:
A technique to improve the training of deep neural networks by normalizing the inputs
to each layer. It standardizes the inputs to have a mean of zero and a variance of one,
which helps in stabilizing and accelerating the training process. Batch normalization can
also act as a form of regularization.
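A minimal NumPy sketch of the training-time batch-normalization computation described above; the scale (gamma) and shift (beta) parameters are shown with illustrative defaults rather than learned values:

```python
# Hedged sketch: core batch-normalization step for one mini-batch (training-time statistics).
import numpy as np

def batch_norm(x, gamma=1.0, beta=0.0, eps=1e-5):
    """x: (batch, features). Normalize each feature to zero mean, unit variance."""
    mean = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mean) / np.sqrt(var + eps)
    return gamma * x_hat + beta        # learned scale and shift in a real layer

x = np.random.default_rng(0).normal(5.0, 3.0, size=(64, 10))
y = batch_norm(x)
print(y.mean(axis=0).round(3), y.std(axis=0).round(3))   # ~0 and ~1 per feature
```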