0% found this document useful (0 votes)

115 views4 pages

NLP Midsem Paper Jan 2024 Regular Exam

The document outlines the mid-semester test details for the M.Tech. in AIML program at Birla Institute of Technology & Science, Pilani, including the course title, exam nature, weightage, duration, and instructions for students. It consists of five questions covering topics such as ambiguity in sentences, neural network architecture for NLP, TF-IDF calculations, skip-gram negative sampling, and the Viterbi algorithm for part-of-speech tagging. Each question has specific tasks and marks assigned, focusing on practical applications of Natural Language Processing concepts.

Uploaded by

Sudip Dey

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

115 views4 pages

NLP Midsem Paper Jan 2024 Regular Exam

Uploaded by

Sudip Dey

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

Birla Institute of Technology & Science, Pilani

Work Integrated Learning Programmes Division

First Semester 2023-2024
M.Tech. in AIML

Mid-Semester Test
(EC-2 Regular Paper)

Course No. : AIMLCZG530

Course Title : Natural Language Processing
Nature of Exam : Closed Book
Weightage : 30% No. of Pages =3
Duration : 2 Hours No. of Questions = 5
Date of Exam : 21-01-2024_FN
Note to Students:
1. Please follow all the Instructions to Candidates given on the cover page of the answer book.
2. All parts of a question should be answered consecutively. Each answer should start from a fresh page.
3. Assumptions made if any, should be stated clearly at the beginning of your answer.

Question 1. [3+2+2=7 Marks]

A. Find the ambiguity of the below sentences and justify your answer [3 marks]
a) The tank is full of water. I saw a military tank.
b) Before the professor left the stage, the play begins
c) She is looking for a match

B.
i. Given is the following toy corpus. Calculate all the bigram probabilities. [2 marks]
<s> I like apples </s>
<s> Apple is good for health</s>
<s> Apple is in red colour </s>

ii. For above training data in (i), Calculate the probability of below sentence using raw bigram
probabilities and using Laplace smoothing, <s> I am eating apples</s> [2marks]

Question 2. [4+3 = 7 Marks]

A. Study the below neural network designed for learning word embedding along with other NLP
application stated below and answer the following questions.

If the input layer ‘X’ denote the one hot encoding of the vocabulary, “e” is the embedding layer,
“h1”,”h2” are hidden layers and “Y” is the output layer emitting continuous valued output, identify
no more than 3 issues/error in the architecture and suggest modification to suit the below use case
requirement. If there are no corrections required, then mention “No Error” explicitly.
Use Case: Neural network in language modelling for sentence completion
Given a training corpus with below vocabulary each vectorized with four dimensions, and following
test sentence phrase, the neural network, should have predictive ability to identify next best word
to fill in the blank of the test sentence, by analyzing context window with five tokens.
Vocabulary: {he, she, bat, tree, wooden, park, playing, saw, was, a, on, the, with, in, morning,
evening} Test Sentence: “on a morning he was playing in the ________”

B. The number of times each word appears in different documents is given in the table below.
Calculate the TF-IDF value for each term in D1. [1.5 mark]
Find the word embedding for each term using TF-IDF value. Find which words are closest using TF-
IDF word embedding. [0.5 mark]
Which documents are more similar to each other? [0.5 mark]
What is the disadvantage of using the TF-IDF values for the word embedding’s? [0.5 mark]

D1 D2 D3
NLP 10 0 0
is 50 66 89
extremely 20 22 12
interesting 30 32 11
course 20 0 0

Find the word embedding for each term using TF-IDF value.
NLP [0.496,0,0]
is [0,0,0]
extremely [0,0,0]
interesting[0,0,0]
course [0.63,0,0]

Find which words are closest using TF-IDF word embedding. [0.5 mark]
Which documents are more similar to each other? [0.5 mark] D2 and D3
What is the disadvantage of using the TF-IDF values for the word embedding’s? [0.5 mark] -
Sparsity

Question 3. [1.5+3+1.5=6 Marks]

Given a training corpus: “played in the morning”, use the skip-gram negative sampling method and answer
the following: The initial embedding matrix and initial context matrix has dimensions |v| x 3 and is given as
follows:
Note: No need to update or show any weights other than necessary for below questions. Follow only the
approach as discussed in class. i.e., Simplified Skip gram negative sampling with binary classification model.
Round all the calculations to exactly two decimal places.

a) Generate the training dataset for an input target word “played” and context window of 1 next
word and hyper parameter value k=2 for the negative sampling task. Use the information
available in the question.
b) Calculate the error for the above dataset for only the first iteration of skip-gram training, with only
one hidden layer.
c) Explain in no more than 40 words, why skipgram algorithm training was modified from multiclass
to binary classification task.

Question 4. [5 Marks]
Find the appropriate POS tag using statistical model with bigram assumption, for the word “cook” in the
sentence,
“He will cook the food”

Question 5. [5 Marks]
By using Viterbi Algorithm, fill the Viterbi table for the sentence, “He will fight”. The tag transition
probabilities and the word likelihood for this corpus are as follows:

Tag transition probabilities MD NN VB PRP

MD 0.000008 0.31 0.46 0.0056
NN 0.000096 0.209 0.658 0.00068
VB 0.001 0.05 0 0.008
PRP 0.08 0.02 0.001 0.00001
START 0.008 0.000934 0.05677 0.08

h
Word likelihood probabilities e will fight
MD 0 0.8 0
NN 0 0.2 0.4
VB 0 0 0.6
PRP 1 0 0

H FIGH
Viterbi Table E WILL T
NN
VB
MD
PRP
Note:
PRP: PERSONAL PRONOUN
MD:MODAL
VB:VERB BASE FORM
NN:NOUN, SINGULAR OR
MASS

Word Embedding
No ratings yet
Word Embedding
35 pages
Maths Class Ix Chapter 04 05 and 06 Practice Paper 02
100% (1)
Maths Class Ix Chapter 04 05 and 06 Practice Paper 02
4 pages
Module 5 Part2new
No ratings yet
Module 5 Part2new
71 pages
NLP m3
No ratings yet
NLP m3
111 pages
Schopenhauer, Arthur Singh, R. Raj Death, Contemplation and Schopenhauer PDF
100% (1)
Schopenhauer, Arthur Singh, R. Raj Death, Contemplation and Schopenhauer PDF
141 pages
cs224n 2025 Lecture02 Wordvecs2
No ratings yet
cs224n 2025 Lecture02 Wordvecs2
46 pages
Module 5: Purposes and Functions of Language Assessment & Test
100% (1)
Module 5: Purposes and Functions of Language Assessment & Test
15 pages
American Journal of Philology 1880 1000226048
No ratings yet
American Journal of Philology 1880 1000226048
517 pages
Word Vectors I
No ratings yet
Word Vectors I
23 pages
WordRepresentation
No ratings yet
WordRepresentation
26 pages
07 Word Embeddings Notes
No ratings yet
07 Word Embeddings Notes
23 pages
Week9 Discussion - Deep Learning
No ratings yet
Week9 Discussion - Deep Learning
22 pages
En6G-Iig-7.3.1 En6G-Iig-7.3.2: Test - Id 32317&title Prepositional Phrases
100% (1)
En6G-Iig-7.3.1 En6G-Iig-7.3.2: Test - Id 32317&title Prepositional Phrases
15 pages
NLP Midsem Paper August 2024 Regular Solution
No ratings yet
NLP Midsem Paper August 2024 Regular Solution
10 pages
The Value of Narrativity in The Represe
No ratings yet
The Value of Narrativity in The Represe
54 pages
English Literature
No ratings yet
English Literature
6 pages
Sem Endsems
No ratings yet
Sem Endsems
9 pages
Cs224n 2024 Lecture02 Wordvecs2
No ratings yet
Cs224n 2024 Lecture02 Wordvecs2
45 pages
Systems Engineering Management Plan
No ratings yet
Systems Engineering Management Plan
27 pages
Generative AI
No ratings yet
Generative AI
16 pages
Android Controlled Arduino Robot Car: Jie Hou
No ratings yet
Android Controlled Arduino Robot Car: Jie Hou
27 pages
Association For Computational Linguistics
No ratings yet
Association For Computational Linguistics
308 pages
Vin AI
No ratings yet
Vin AI
55 pages
Principles and Procedures of Materials Development For Language Learning
No ratings yet
Principles and Procedures of Materials Development For Language Learning
2 pages
CCS369 Two Marks
No ratings yet
CCS369 Two Marks
9 pages
Bla Power Pvt. LTD: Woodward 505 Governor Valve / Actuator Calibration &test
No ratings yet
Bla Power Pvt. LTD: Woodward 505 Governor Valve / Actuator Calibration &test
23 pages
Lesson Plan - Dental Health
No ratings yet
Lesson Plan - Dental Health
2 pages
Cs224n Midterm 2018 Solution
No ratings yet
Cs224n Midterm 2018 Solution
17 pages
Big Data Science in Finance
From Everand
Big Data Science in Finance
Irene Aldridge
No ratings yet
Piety in Print The Vaishnava Periodicals
No ratings yet
Piety in Print The Vaishnava Periodicals
24 pages
CS 224n Assignment #2: Word2Vec and Dependency Parsing
No ratings yet
CS 224n Assignment #2: Word2Vec and Dependency Parsing
10 pages
HIS-Print culture-MCQs-QUES
No ratings yet
HIS-Print culture-MCQs-QUES
8 pages
Exercises en Text Models 2
No ratings yet
Exercises en Text Models 2
5 pages
Unit 5 Turing Machine
No ratings yet
Unit 5 Turing Machine
53 pages
sp19 Midterm Solutions
No ratings yet
sp19 Midterm Solutions
11 pages
Cambridge IGCSE™: English As A Second Language 0510/13 October/November 2021
No ratings yet
Cambridge IGCSE™: English As A Second Language 0510/13 October/November 2021
9 pages
Question Bank NLP SOLUTIONS
No ratings yet
Question Bank NLP SOLUTIONS
21 pages
Week 9
No ratings yet
Week 9
6 pages
Applied NLP
50% (2)
Applied NLP
8 pages
sp20 Midterm Solutions
No ratings yet
sp20 Midterm Solutions
12 pages
Spark Handbook, Module 5
No ratings yet
Spark Handbook, Module 5
26 pages
42. BÀI TẬP ÔN KIỂM TRA 1 TIẾT LẦN 1- ANH 10
No ratings yet
42. BÀI TẬP ÔN KIỂM TRA 1 TIẾT LẦN 1- ANH 10
9 pages
CSC401 2017 Exam Shared w23
No ratings yet
CSC401 2017 Exam Shared w23
8 pages
NLP Final
No ratings yet
NLP Final
11 pages
cs224n Practice Midterm 3 Sol
No ratings yet
cs224n Practice Midterm 3 Sol
14 pages
Nov Dec 2023
No ratings yet
Nov Dec 2023
2 pages
AI-900-SampleQuestions March 2022
No ratings yet
AI-900-SampleQuestions March 2022
19 pages
PCS224 MST 23
No ratings yet
PCS224 MST 23
3 pages
String Handling
No ratings yet
String Handling
5 pages
An Astrologer's Day
No ratings yet
An Astrologer's Day
7 pages
English 5 - DLP - Week 1 - Day 1 - August 5, 2024
No ratings yet
English 5 - DLP - Week 1 - Day 1 - August 5, 2024
4 pages
CS6314
No ratings yet
CS6314
2 pages
AI-900: Microsoft Azure AI Fundamentals Sample Questions: User Guide
No ratings yet
AI-900: Microsoft Azure AI Fundamentals Sample Questions: User Guide
19 pages
MTE Practice Set
No ratings yet
MTE Practice Set
4 pages
Assignment 2 - 20240709
No ratings yet
Assignment 2 - 20240709
13 pages
NLP Nov
No ratings yet
NLP Nov
2 pages
Kami Export - Assignment - 2 - 20240709
No ratings yet
Kami Export - Assignment - 2 - 20240709
13 pages
NLP Endsem Paper Regular Paper SOLUTION April 2024
No ratings yet
NLP Endsem Paper Regular Paper SOLUTION April 2024
10 pages
CT-1 Sem6
No ratings yet
CT-1 Sem6
6 pages
Faisal Khan: Objective
No ratings yet
Faisal Khan: Objective
3 pages
CS671A/CS671: Introduction To Natural Language Processing Mid-Semester Exam
No ratings yet
CS671A/CS671: Introduction To Natural Language Processing Mid-Semester Exam
7 pages
0 Yqn EK3 VG 4 He OTv 089 KX SI1 Ij Wzu Ax T1 Ag Gev OKKJE
No ratings yet
0 Yqn EK3 VG 4 He OTv 089 KX SI1 Ij Wzu Ax T1 Ag Gev OKKJE
4 pages
24CA4023
No ratings yet
24CA4023
6 pages
CS6314
No ratings yet
CS6314
2 pages
CS341 HomeworkSol PDF
No ratings yet
CS341 HomeworkSol PDF
5 pages
Ucs672 MST 23
No ratings yet
Ucs672 MST 23
3 pages
08 Exercises Word2vec MUD SOLVED
No ratings yet
08 Exercises Word2vec MUD SOLVED
3 pages
A Semiotic Theory of Life Lotman S Princ PDF
No ratings yet
A Semiotic Theory of Life Lotman S Princ PDF
13 pages
Question Paper midTernNLPOct26 2024
No ratings yet
Question Paper midTernNLPOct26 2024
2 pages
MUD Exam 2024 SOLVED
No ratings yet
MUD Exam 2024 SOLVED
6 pages
10 2023
No ratings yet
10 2023
53 pages
CT3
No ratings yet
CT3
3 pages
NLP Paper
No ratings yet
NLP Paper
4 pages
Word2Vec - A Baby Step in Deep Learning But A Giant Leap Towards Natural Language Processing
100% (1)
Word2Vec - A Baby Step in Deep Learning But A Giant Leap Towards Natural Language Processing
12 pages
It-3035 (NLP) - CS Mid Feb 2024
No ratings yet
It-3035 (NLP) - CS Mid Feb 2024
6 pages
Tip 1: Conversion Rules As Per The Reporting Verb: What Is Direct & Indirect Speech?
No ratings yet
Tip 1: Conversion Rules As Per The Reporting Verb: What Is Direct & Indirect Speech?
9 pages
Applied NLP - Project - Learner Template
No ratings yet
Applied NLP - Project - Learner Template
5 pages
18CS71
No ratings yet
18CS71
4 pages
Applying Deep Learning To Answer Selection - A Study and An Open Task
No ratings yet
Applying Deep Learning To Answer Selection - A Study and An Open Task
8 pages
CS 540: Introduction To Artificial Intelligence: Final Exam: 5:30-7:30pm, December 17, 2015 Beatles Room at Epic
No ratings yet
CS 540: Introduction To Artificial Intelligence: Final Exam: 5:30-7:30pm, December 17, 2015 Beatles Room at Epic
11 pages
CS 540: Introduction To Artificial Intelligence: Final Exam: 8:15-9:45am, December 21, 2016 132 Noland
No ratings yet
CS 540: Introduction To Artificial Intelligence: Final Exam: 8:15-9:45am, December 21, 2016 132 Noland
8 pages
The Art of Music Production - The Theory and Practice (PDFDrive) - 105-119
No ratings yet
The Art of Music Production - The Theory and Practice (PDFDrive) - 105-119
15 pages
CS 224n Assignment #2: Word2vec (43 Points)
No ratings yet
CS 224n Assignment #2: Word2vec (43 Points)
4 pages
Typing Keyboard Lmg-Arun
No ratings yet
Typing Keyboard Lmg-Arun
2 pages
Sample Midterm Questions Answers
No ratings yet
Sample Midterm Questions Answers
5 pages
Faculty of Engineering: - Answer Any Four Full Questions Missing Data, If Any, May Be Assumed Suitably. 1. (A)
No ratings yet
Faculty of Engineering: - Answer Any Four Full Questions Missing Data, If Any, May Be Assumed Suitably. 1. (A)
2 pages
Programming in Visual Basic (VB): For Visual Studio
From Everand
Programming in Visual Basic (VB): For Visual Studio
Olga Maria Stefania Cucaro
No ratings yet
Narrative Report - INSET DAY 4
No ratings yet
Narrative Report - INSET DAY 4
2 pages
IGNOU BCA Introduction to Algorithm Design Previous Year Unsolved Papers BCS 042
From Everand
IGNOU BCA Introduction to Algorithm Design Previous Year Unsolved Papers BCS 042
Manish Soni
No ratings yet
Advanced C++ Interview Questions You'll Most Likely Be Asked
From Everand
Advanced C++ Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

NLP Midsem Paper Jan 2024 Regular Exam

Uploaded by

NLP Midsem Paper Jan 2024 Regular Exam

Uploaded by

Birla Institute of Technology & Science, Pilani

Work Integrated Learning Programmes Division

Course No. : AIMLCZG530

Question 1. [3+2+2=7 Marks]

Question 2. [4+3 = 7 Marks]

Question 3. [1.5+3+1.5=6 Marks]

Word to Tag combination

Tag to Tag combination

Tag transition probabilities MD NN VB PRP

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.