Assignment 4 Solution
1. C. $\begin{bmatrix} 2 & 4 \\ 4 & 2 \end{bmatrix}$
2. D. $\left[\tfrac{6}{\sqrt{2}}, \tfrac{6}{\sqrt{2}}\right]$
Explanation: The rate of change of a vector-valued function at a point in the direction of a vector is given by the directional derivative. The directional derivative of a vector-valued function $f(x, y) = [x^2 + y^2, \, 2xy]$ at a point $(x_0, y_0)$ in the direction of a unit vector $u$ is given by $D_u f(x_0, y_0) = f_x(x_0, y_0)\, u_1 + f_y(x_0, y_0)\, u_2$, where $f_x$ and $f_y$ are the partial derivatives of $f$ with respect to $x$ and $y$ respectively, and $u_1$ and $u_2$ are the components of the unit vector $u$.
In this case, $f(x, y) = [x^2 + y^2, \, 2xy]$, so $f_x(x, y) = [2x, 2y]$ and $f_y(x, y) = [2y, 2x]$. At the point $(1, 2)$, $f_x(1, 2) = [2, 4]$ and $f_y(1, 2) = [4, 2]$. The vector $(1, 1)$ is not a unit vector: its magnitude is $\sqrt{1^2 + 1^2} = \sqrt{2}$, so the unit vector in the direction of $(1, 1)$ is $\left(\tfrac{1}{\sqrt{2}}, \tfrac{1}{\sqrt{2}}\right)$. Therefore, the directional derivative of $f$ at $(1, 2)$ in the direction of $(1, 1)$ is
$$D_u f(1, 2) = f_x(1, 2) \cdot \tfrac{1}{\sqrt{2}} + f_y(1, 2) \cdot \tfrac{1}{\sqrt{2}} = \left[\tfrac{6}{\sqrt{2}}, \tfrac{6}{\sqrt{2}}\right].$$
So the rate of change of the vector-valued function $f(x, y)$ at the point $(1, 2)$ in the direction of the vector $(1, 1)$ is $\left[\tfrac{6}{\sqrt{2}}, \tfrac{6}{\sqrt{2}}\right]$.
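As a quick numerical sanity check (not part of the original solution), the sketch below approximates the directional derivative with a finite difference and compares it to the closed-form value $\left[\tfrac{6}{\sqrt{2}}, \tfrac{6}{\sqrt{2}}\right]$. The names `f`, `p`, `u`, and the step size `h` are ad hoc choices.

```python
import numpy as np

def f(x, y):
    # Vector-valued function from question 2.
    return np.array([x**2 + y**2, 2 * x * y])

# Point and (unnormalised) direction from the question.
p = np.array([1.0, 2.0])
v = np.array([1.0, 1.0])
u = v / np.linalg.norm(v)          # unit vector (1/sqrt(2), 1/sqrt(2))

# Finite-difference approximation: D_u f(p) ~ (f(p + h*u) - f(p)) / h for small h.
h = 1e-6
numeric = (f(*(p + h * u)) - f(*p)) / h

analytic = np.array([6.0, 6.0]) / np.sqrt(2)   # closed-form answer [6/sqrt(2), 6/sqrt(2)]
print(numeric, analytic)                       # both approximately [4.2426, 4.2426]
```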
3. D. $\left[\tfrac{9}{2}, \tfrac{9}{2}\right]$
Explanation: To find the minimum value of the vector-valued function $f(x, y) = [x^2 + y^2, \, 2xy]$ subject to the constraint $x + y = 3$, one of the methods we can use is the method of Lagrange multipliers. Let $g(x, y) = x + y - 3$. Then we need to solve the system of equations given by $\nabla f(x, y) = \lambda \nabla g(x, y)$ and $g(x, y) = 0$ for $x$, $y$, and $\lambda$.
The gradient of $f$ is given by $\nabla f(x, y) = [2x, 2y]$. The gradient of $g$ is given by $\nabla g(x, y) = [1, 1]$. So we need to solve the system of equations given by
$$2x = \lambda, \qquad 2y = \lambda, \qquad x + y = 3.$$
From the first two equations, we see that $2x = 2y$, so $x = y$. Substituting this into the third equation gives us $x + x = 3$, so $x = \tfrac{3}{2}$. Since $x = y$, we also have $y = \tfrac{3}{2}$.
Therefore, the minimum value of the vector-valued function $f(x, y)$ subject to the constraint $x + y = 3$ is
$$f\left(\tfrac{3}{2}, \tfrac{3}{2}\right) = \left[\left(\tfrac{3}{2}\right)^2 + \left(\tfrac{3}{2}\right)^2, \; 2 \cdot \tfrac{3}{2} \cdot \tfrac{3}{2}\right] = \left[\tfrac{9}{2}, \tfrac{9}{2}\right].$$
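To double-check the stationary point found with the Lagrange conditions, the sketch below (an illustrative check, not part of the original solution) substitutes the constraint $y = 3 - x$ into the component $x^2 + y^2$ and scans $x$ on a grid; the grid bounds and resolution are arbitrary.

```python
import numpy as np

# Substitute the constraint y = 3 - x and scan x to locate the minimum of
# the component x^2 + y^2 used in the Lagrange conditions above.
xs = np.linspace(-5.0, 8.0, 130_001)
ys = 3.0 - xs
first = xs**2 + ys**2

i = np.argmin(first)
x_star, y_star = xs[i], ys[i]
print(x_star, y_star)                                  # ~1.5, 1.5 (i.e. 3/2, 3/2)
print([x_star**2 + y_star**2, 2 * x_star * y_star])    # ~[4.5, 4.5] = [9/2, 9/2]
```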
and
In this case, the convex combination property holds. However, if we take x = −2, y = 1, and t = 0.5,
we get:
$f(tx + (1-t)y) = f(0.5 \cdot (-2) + 0.5 \cdot 1) = f(-0.5) = (-0.5)^3 = -0.125$
and
$t f(x) + (1-t) f(y) = 0.5 \cdot (-2)^3 + 0.5 \cdot 1^3 = -4 + 0.5 = -3.5$
In this case, −0.125 is not less than or equal to −3.5, so the convex combination property does not hold.
Therefore, the function $f(x) = x^3$ is not convex, even though its domain $D$ is convex. This demonstrates that the convexity of the domain is a necessary but not sufficient condition for a function to be convex.
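The counterexample above is easy to verify numerically. The snippet below (an illustrative check, not part of the original solution) evaluates both sides of the convex-combination inequality for $f(x) = x^3$ at the values used above.

```python
# Convexity requires f(t*x + (1-t)*y) <= t*f(x) + (1-t)*f(y) for all t in [0, 1].
f = lambda x: x**3

x, y, t = -2.0, 1.0, 0.5
lhs = f(t * x + (1 - t) * y)       # f(-0.5) = -0.125
rhs = t * f(x) + (1 - t) * f(y)    # 0.5*(-8) + 0.5*1 = -3.5

print(lhs, rhs, lhs <= rhs)        # -0.125, -3.5, False -> f(x) = x^3 is not convex
```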
6. A. In a neural network, the primary purpose of a weight is indeed to increase or reduce the importance
of a certain input feature. This is how a neural network learns to prioritise certain features over others
during the training process.
7. B. To introduce non-linearity in the network
8. A. During the forward pass in a neural network, the network makes a prediction based on the input
data (option A). The weights and biases are not updated during the forward pass (this happens during
the backward pass), the gradient of the loss function is not calculated (this also happens during the
backward pass), and the activation function is determined before the forward pass begins, not during it.
9. B. The given function is not convex as neither component of the vector function is convex.
1. The first component $f_1(x_1, x_2) = x_1^2 - x_2^2$ is not convex. This is because its Hessian matrix is $\begin{bmatrix} 2 & 0 \\ 0 & -2 \end{bmatrix}$, which is not positive semi-definite (the second eigenvalue is negative).
2. The second component $f_2(x_1, x_2) = 2 x_1 x_2$ is also not convex. Its Hessian matrix is $\begin{bmatrix} 0 & 2 \\ 2 & 0 \end{bmatrix}$, which is not positive semi-definite (the first eigenvalue is negative).
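A quick way to confirm the eigenvalue claims is to compute them with NumPy; the sketch below is an illustrative check, not part of the original solution.

```python
import numpy as np

# A twice-differentiable function is convex only if its Hessian is
# positive semi-definite, i.e. all eigenvalues are >= 0.
H1 = np.array([[2.0, 0.0],
               [0.0, -2.0]])   # Hessian of f1(x1, x2) = x1^2 - x2^2
H2 = np.array([[0.0, 2.0],
               [2.0, 0.0]])    # Hessian of f2(x1, x2) = 2*x1*x2

print(np.linalg.eigvalsh(H1))  # [-2.  2.] -> not PSD, so f1 is not convex
print(np.linalg.eigvalsh(H2))  # [-2.  2.] -> not PSD, so f2 is not convex
```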
10. D.
Proof: Remember that $\|x - u\| = \sqrt{\sum_{i=1}^{n}(x_i - u_i)^2}$. Now, consider $\frac{\partial}{\partial x_i} \frac{1}{\|x - u\|}$; we have the following:
$$\begin{aligned}
\frac{\partial}{\partial x_i} \frac{1}{\|x - u\|} &= \frac{\partial}{\partial x_i} \frac{1}{\sqrt{\sum_{i=1}^{n}(x_i - u_i)^2}} \\
&= -\frac{1}{2\left(\sum_{i=1}^{n}(x_i - u_i)^2\right)^{3/2}} \cdot \frac{\partial}{\partial x_i} \sum_{i=1}^{n}(x_i - u_i)^2 \\
&= -\frac{2(x_i - u_i)}{2\left(\sum_{i=1}^{n}(x_i - u_i)^2\right)^{3/2}} \\
&= -\frac{x_i - u_i}{\left(\sqrt{\sum_{i=1}^{n}(x_i - u_i)^2}\right)^{3}} = -\frac{x_i - u_i}{\|x - u\|^3}
\end{aligned}$$
Following this logic, we get the gradient $\nabla f(x) = -\dfrac{x - u}{\|x - u\|^3}$.
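The closed form can be verified numerically with a central finite difference; the sketch below uses arbitrary test vectors for $x$ and $u$ (not taken from the question).

```python
import numpy as np

# Finite-difference check of grad f for f(x) = 1 / ||x - u|| against the
# closed form -(x - u) / ||x - u||^3 derived above.
rng = np.random.default_rng(0)
x = rng.normal(size=5)
u = rng.normal(size=5)

f = lambda v: 1.0 / np.linalg.norm(v - u)

h = 1e-6
numeric = np.array([
    (f(x + h * e) - f(x - h * e)) / (2 * h)      # central difference per coordinate
    for e in np.eye(len(x))
])
analytic = -(x - u) / np.linalg.norm(x - u) ** 3

print(np.max(np.abs(numeric - analytic)))        # tiny (~1e-10): the two agree
```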
11. A. Q must be positive definite
12. A. $\dfrac{\partial J}{\partial w_1} = (\hat{y} - y) \cdot \hat{y}(1 - \hat{y}) \cdot x_1$
Explanation: Let's say the two inputs to the neural network are $x_1$ and $x_2$, the two weights are $w_1$ and $w_2$, and the bias is $b$. The output of the neural network before applying the activation function is $z = w_1 x_1 + w_2 x_2 + b$. After applying the sigmoid activation function, the predicted output is $\hat{y} = \sigma(z) = \frac{1}{1 + e^{-z}}$. Let's say the true output is $y$. The objective function, which is the mean squared error between the predicted output and the true output, is $J = \frac{1}{2}(\hat{y} - y)^2$.
∂J
The gradient of the objective function with respect to the first weight w1 is ∂w1 . We can use the chain
rule to compute this gradient as follows:
∂J ∂J ∂ ŷ ∂z
∂w1 = ∂ ŷ · ∂z · ∂w1
∂J
∂ ŷ = (ŷ − y)
∂ ŷ ′
∂z = σ (z) = σ(z)(1 − σ(z)) = ŷ(1 − ŷ)
∂z
∂w1 = x1
∂J
Substituting these values back into the expression for ∂w1 , we get:
∂J
∂w1 = (ŷ − y) · ŷ(1 − ŷ) · x1
13. B. $\dfrac{\partial J}{\partial b} = (\hat{y} - y) \cdot \hat{y}(1 - \hat{y})$
Explanation: The gradient of the objective function with respect to the bias is given by $\frac{\partial J}{\partial b} = (\hat{y} - y) \cdot \hat{y}(1 - \hat{y})$. This is because the bias term is added to the weighted sum of inputs before being passed through the activation function. The derivative of the objective function with respect to the bias is similar to the derivative with respect to a weight, except that the input term $x_i$ is replaced by 1, since the bias has no corresponding input.
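The expressions from questions 12 and 13 can be confirmed with a finite-difference check on a single sigmoid neuron; the concrete input, weight, and target values below are arbitrary choices for illustration, not part of the original solution.

```python
import numpy as np

# Single sigmoid neuron with squared-error loss J = 0.5 * (yhat - y)^2.
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

x1, x2, y = 0.7, -1.3, 1.0          # arbitrary inputs and target
w1, w2, b = 0.2, -0.4, 0.1          # arbitrary weights and bias

def loss(w1, w2, b):
    z = w1 * x1 + w2 * x2 + b
    return 0.5 * (sigmoid(z) - y) ** 2

yhat = sigmoid(w1 * x1 + w2 * x2 + b)
grad_w1 = (yhat - y) * yhat * (1 - yhat) * x1   # expression from question 12
grad_b = (yhat - y) * yhat * (1 - yhat)         # expression from question 13

# Central finite differences for comparison.
h = 1e-6
num_w1 = (loss(w1 + h, w2, b) - loss(w1 - h, w2, b)) / (2 * h)
num_b = (loss(w1, w2, b + h) - loss(w1, w2, b - h)) / (2 * h)

print(grad_w1, num_w1)   # the analytic and numeric values match closely
print(grad_b, num_b)
```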
14. A. $\dfrac{\partial J}{\partial w_1} = (\hat{y} - y) \cdot (1 - \hat{y}^2) \cdot x_1$.