Bai601 Simp
3. Discuss the major challenges in NLP, such as ambiguity, idioms, evolving language,
and ellipses, and explain how context helps in resolving these issues.
4. What is the difference between language and grammar? How does Chomsky’s
transformational grammar help in parsing natural language?
5. Explain the differences between Indian languages and English that affect NLP, and
describe how the Paninian framework addresses them.
6. What is Karaka Theory? Illustrate at least four Karaka roles with examples in an
Indian language sentence.
9. What are the applications of NLP in real-world systems? Briefly explain at least
three: Machine Translation, Question Answering, and Text Summarization.
1. Define regular expressions. Explain how they are implemented using Finite-State
Automata (FSA) with examples.
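As a study aid, here is a minimal Python sketch of a model answer: a regular expression and a hand-coded DFA that accepts the same language, the classic "sheeptalk" pattern /baa+!/ (the pattern and state numbering are chosen only for illustration):

```python
import re

# Regex for "sheeptalk": one 'b', at least two 'a's, then '!'
pattern = re.compile(r"baa+!")

def dfa_accepts(s: str) -> bool:
    """Hand-coded DFA equivalent to /baa+!/ (illustrative sketch)."""
    # States: 0 = start, 1 = saw 'b', 2 = saw 'ba', 3 = saw 'baa...', 4 = accept
    state = 0
    for ch in s:
        if state == 0 and ch == "b":
            state = 1
        elif state == 1 and ch == "a":
            state = 2
        elif state in (2, 3) and ch == "a":
            state = 3
        elif state == 3 and ch == "!":
            state = 4
        else:
            return False  # no valid transition: reject immediately
    return state == 4

# The DFA and the regex agree on every test string
for s in ["baa!", "baaaa!", "ba!", "b!", "abaa!"]:
    assert (pattern.fullmatch(s) is not None) == dfa_accepts(s)
```

The equivalence checked in the loop is the point of the question: every regular expression can be compiled into a finite-state automaton that recognizes the same strings.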
7. Explain Hidden Markov Model (HMM) tagging using unigram and bigram
probabilities. Show how Viterbi decoding is approximated.
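A compact worked sketch of bigram HMM tagging with Viterbi decoding; the two-tag tagset and all probabilities below are invented for illustration:

```python
# Toy bigram HMM tagger decoded with Viterbi (all probabilities invented).
tags = ["N", "V"]
start = {"N": 0.7, "V": 0.3}                                # P(tag | <s>)
trans = {("N", "N"): 0.3, ("N", "V"): 0.7,
         ("V", "N"): 0.8, ("V", "V"): 0.2}                  # P(t_i | t_{i-1})
emit = {("N", "time"): 0.4, ("V", "time"): 0.1,
        ("N", "flies"): 0.2, ("V", "flies"): 0.5}           # P(word | tag)

def viterbi(words):
    # v[t] = probability of the best tag path ending in tag t
    v = {t: start[t] * emit.get((t, words[0]), 1e-8) for t in tags}
    back = []                        # backpointers, one dict per position
    for w in words[1:]:
        nv, bp = {}, {}
        for t in tags:
            prev = max(tags, key=lambda p: v[p] * trans[(p, t)])
            nv[t] = v[prev] * trans[(prev, t)] * emit.get((t, w), 1e-8)
            bp[t] = prev
        v = nv
        back.append(bp)
    seq = [max(tags, key=lambda t: v[t])]
    for bp in reversed(back):        # follow backpointers right to left
        seq.append(bp[seq[-1]])
    return seq[::-1]

print(viterbi(["time", "flies"]))    # → ['N', 'V'] with these numbers
```

A unigram tagger would simply pick argmax P(word | tag) per word; the bigram transition table is what lets Viterbi trade off a worse emission against a better tag sequence.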
8. What is Context-Free Grammar (CFG)? Write CFG rules for sentence generation and
parse the sentence: “Hena reads a book.”
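A sketch answer for the CFG question: a tiny grammar covering only this sentence, parsed with a naive top-down parser (grammar and lexicon are made up for this one example; real grammars need much broader coverage):

```python
# Tiny CFG and naive top-down parser for "Hena reads a book" (sketch only).
lexicon = {"Hena": "PropN", "reads": "V", "a": "Det", "book": "N"}
rules = {
    "S":  [["NP", "VP"]],
    "NP": [["PropN"], ["Det", "N"]],
    "VP": [["V", "NP"]],
}

def parse(sym, words, i):
    """Return (tree, next_index) if sym derives words[i:...], else None."""
    if sym not in rules:                      # preterminal: check the lexicon
        if i < len(words) and lexicon.get(words[i]) == sym:
            return (sym, words[i]), i + 1
        return None
    for rhs in rules[sym]:                    # try each expansion in turn
        trees, j, ok = [], i, True
        for r in rhs:
            res = parse(r, words, j)
            if res is None:
                ok = False
                break
            t, j = res
            trees.append(t)
        if ok:
            return (sym, trees), j
    return None

tree, end = parse("S", "Hena reads a book".split(), 0)
assert end == 4                               # all four words consumed
print(tree)
```

The printed tree nests (S (NP (PropN Hena)) (VP (V reads) (NP (Det a) (N book)))), which is exactly the derivation the CFG rules license. Top-down parsing works here because the grammar has no left recursion.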
1. Explain the Naive Bayes Classifier. Derive the final equation for text classification
and explain the bag-of-words and conditional independence assumptions.
2. How is a Naive Bayes classifier trained? Explain how to estimate the prior and
likelihood probabilities using Maximum Likelihood Estimation and Laplace
Smoothing.
3. Perform a step-by-step Naive Bayes classification for a given test document using a
small training set.
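The training and classification questions above can be worked end-to-end in a few lines; the four-document training set below is invented, and add-1 (Laplace) smoothing handles words unseen in a class:

```python
import math
from collections import Counter

# Tiny Naive Bayes with add-1 smoothing on an invented two-class set.
train = [("just plain boring", "neg"),
         ("entirely predictable and lacks energy", "neg"),
         ("very powerful", "pos"),
         ("the most fun film of the summer", "pos")]

docs = {"pos": [], "neg": []}
for text, c in train:
    docs[c].extend(text.split())
vocab = set(w for ws in docs.values() for w in ws)
counts = {c: Counter(ws) for c, ws in docs.items()}        # word counts per class
prior = {c: math.log(sum(1 for _, y in train if y == c) / len(train))
         for c in docs}                                    # log P(c), by MLE

def loglik(w, c):
    # add-1 smoothed log P(w | c); |V| is added to the denominator
    return math.log((counts[c][w] + 1) / (sum(counts[c].values()) + len(vocab)))

def classify(text):
    words = [w for w in text.split() if w in vocab]        # skip unknown words
    return max(docs, key=lambda c: prior[c] + sum(loglik(w, c) for w in words))

print(classify("predictable and boring"))                  # prints "neg"
```

Working in log space avoids underflow from multiplying many small probabilities; summing logs over tokens is exactly the bag-of-words plus conditional-independence assumption of the derivation in Question 1.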
4. What is binary Naive Bayes? Explain how clipping word counts and handling
negation improve sentiment classification.
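Both tricks in this question are simple preprocessing steps; a sketch of each (the negation-word list and "mark until the next punctuation" scope are common heuristics, not the only choices):

```python
import re

def binarize(tokens):
    """Binary NB: clip each word to at most one occurrence per document."""
    seen, out = set(), []
    for w in tokens:
        if w not in seen:
            seen.add(w)
            out.append(w)
    return out

def mark_negation(text):
    """Prefix tokens after a negation word with NOT_ until punctuation."""
    out, negating = [], False
    for tok in re.findall(r"\w+|[.,!?]", text.lower()):
        if tok in {".", ",", "!", "?"}:
            negating = False                 # punctuation ends negation scope
            out.append(tok)
        else:
            out.append("NOT_" + tok if negating else tok)
            if tok in {"not", "no", "never"}:
                negating = True
    return out

print(binarize(["great", "great", "fun"]))
print(mark_negation("it was not fun, it was great"))
# "fun" becomes "NOT_fun"; the comma ends the negation scope
```

Clipping makes repeated sentiment words count once per document, which empirically helps sentiment tasks; NOT_ prefixing turns "not fun" into a feature distinct from "fun", so the classifier can learn opposite weights for the two.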
5. What are the common issues faced in sentiment analysis using Naive Bayes?
Discuss solutions like negation handling, stop-word removal, and use of sentiment
lexicons.
6. Describe the use of Naive Bayes in spam detection and language identification.
Mention examples of features used in these tasks.
7. Explain how Naive Bayes can be viewed as a language model. How does it assign
probabilities to entire sentences?
8. How is text classification performance evaluated? Define precision, recall, and
F1-score, and explain the importance of the confusion matrix in classification.
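The definitions reduce to arithmetic on the confusion-matrix cells; the counts below are invented to make the formulas concrete:

```python
# Precision, recall, and F1 from a 2x2 confusion matrix (invented counts).
tp, fp, fn, tn = 30, 10, 5, 55

precision = tp / (tp + fp)          # of predicted positives, fraction correct: 0.75
recall = tp / (tp + fn)             # of true positives, fraction found: ~0.857
f1 = 2 * precision * recall / (precision + recall)   # harmonic mean: 0.8

print(precision, round(recall, 3), f1)
```

Note that accuracy ((tp + tn) / total) can look high on skewed classes while precision or recall is poor, which is why the confusion matrix, not a single number, is the right starting point.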
9. What is the role of cross-validation and statistical significance testing in evaluating
classifiers? Explain the paired bootstrap test with an example.
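A sketch of the paired bootstrap test in the style of Jurafsky and Martin: given per-example correctness for two classifiers A and B on the same test set (the score vectors below are invented), resample the test set many times and count how often the resampled advantage of A reaches twice the observed one:

```python
import random

def paired_bootstrap(a, b, n_boot=10_000, seed=0):
    """One-sided paired bootstrap: a, b are per-example 0/1 scores."""
    rng = random.Random(seed)
    n = len(a)
    delta = (sum(a) - sum(b)) / n            # observed accuracy difference
    count = 0
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]   # resample with replacement
        d = sum(a[i] - b[i] for i in idx) / n
        if d >= 2 * delta:                   # recenter under the null hypothesis
            count += 1
    return delta, count / n_boot             # (difference, p-value estimate)

a = [1, 1, 1, 0, 1, 1, 0, 1, 1, 1]           # classifier A: 8/10 correct
b = [1, 0, 1, 0, 1, 0, 0, 1, 1, 0]           # classifier B: 5/10 correct
delta, p = paired_bootstrap(a, b)
print(delta, p)
```

Pairing matters: both classifiers are scored on the same resampled examples, so the test measures the difference directly rather than comparing two noisy accuracies.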
1. Explain the architecture and design features of an Information Retrieval (IR) system.
How do indexing, stop-word removal, and stemming contribute to its efficiency?
2. Compare and contrast the three classical IR models: Boolean, Probabilistic, and
Vector Space. Include examples and evaluation criteria.
3. What is TF-IDF weighting? Derive the formula and explain its significance with an
example.
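A worked sketch of the tf-idf formula on an invented three-document collection (using raw term frequency and idf = log10(N / df); textbooks also use log-scaled tf variants):

```python
import math
from collections import Counter

# TF-IDF on a toy 3-document collection (documents invented).
docs = [["the", "cat", "sat"], ["the", "dog", "sat"], ["the", "cat", "ran"]]
N = len(docs)
df = Counter(w for d in docs for w in set(d))   # document frequency per term

def tfidf(word, doc):
    tf = doc.count(word)                        # raw term frequency
    idf = math.log10(N / df[word])              # rarer terms get higher idf
    return tf * idf

print(tfidf("the", docs[0]))   # 0.0 — "the" occurs in every document
print(tfidf("cat", docs[0]))   # positive — "cat" occurs in 2 of 3 documents
```

The zero weight for "the" is the significance of idf: terms that appear in every document carry no discriminating power, however frequent they are within one document.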
4. Describe the Cluster, Fuzzy, and LSI (Latent Semantic Indexing) models in IR. How
do they address limitations of classical models?
5. Explain Zipf’s Law and how it applies to term selection and index size reduction in IR
systems.
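Zipf's law says the frequency of the r-th most common word is roughly proportional to 1/r, so rank × frequency is roughly constant. A sketch on a deliberately tiny invented text (on a real corpus the product is far more stable):

```python
from collections import Counter

# Rank-frequency sketch for Zipf's law (tiny invented text).
text = ("the cat sat on the mat and the dog sat on the rug "
        "and the cat ran").split()
freqs = sorted(Counter(text).values(), reverse=True)

for rank, f in enumerate(freqs[:5], start=1):
    print(rank, f, rank * f)      # rank * frequency should stay roughly flat
```

For IR, the head of the distribution (very frequent words like "the") is removed as stop words, and the long tail of hapax legomena can often be pruned too, which is how Zipf's law justifies large index-size reductions.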
8. Describe FrameNet and its role in semantic role labeling. Use examples from frames
like ARREST or COMMUNICATION.
9. List different POS taggers (e.g., HMM, Brill, TreeTagger, Stanford Tagger). Compare
their approaches and applications in IR/NLP tasks.
3. Explain, with examples, how languages differ along the following dimensions and
why these divergences matter for Machine Translation:
o Lexical divergences
o Morphological typology
o Referential density
4. What are parallel corpora and how are they used to train MT systems? Discuss the
role of sentence alignment in creating bilingual datasets.
7. What are the two key criteria for evaluating MT systems? How do BLEU and chrF
metrics work? Compare their strengths and limitations.
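A sketch of a chrF-style score to make the metric concrete: an F-score over character n-grams, here simplified to bigrams only with no whitespace handling, whereas real chrF averages over n = 1..6 and weights recall more heavily via beta = 2:

```python
from collections import Counter

def char_ngrams(s, n=2):
    """Multiset of character n-grams of s."""
    return Counter(s[i:i + n] for i in range(len(s) - n + 1))

def chrf2(hyp, ref, beta=2.0):
    """Simplified chrF: F-score over character bigrams (sketch only)."""
    h, r = char_ngrams(hyp), char_ngrams(ref)
    overlap = sum((h & r).values())          # clipped n-gram matches
    if not overlap:
        return 0.0
    prec = overlap / sum(h.values())         # matched / hypothesis n-grams
    rec = overlap / sum(r.values())          # matched / reference n-grams
    return (1 + beta**2) * prec * rec / (beta**2 * prec + rec)

print(chrf2("the cat sat", "the cat sat"))   # 1.0 for an exact match
```

Character n-grams are why chrF is more robust than BLEU for morphologically rich languages: a near-miss inflection still shares most of its characters with the reference, while BLEU's word n-grams score it as a complete mismatch.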
8. What are the bias and ethical concerns in Machine Translation? Explain with
examples how gender bias can manifest in MT outputs and how it is evaluated.