Viva Q&A
IMPORTANT Q&A:
Define Tokenization.
Tokenization is the process of breaking text into smaller units like words or sentences.
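For illustration, a minimal tokenization sketch, assuming NLTK (and its 'punkt' tokenizer data) is the toolkit used in these practicals:

import nltk
nltk.download("punkt", quiet=True)

text = "NLP is fun. It breaks text into tokens."
print(nltk.sent_tokenize(text))  # ['NLP is fun.', 'It breaks text into tokens.']
print(nltk.word_tokenize(text))  # ['NLP', 'is', 'fun', '.', 'It', 'breaks', ...]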
What is Lemmatization?
Lemmatization reduces words to their base or dictionary form (lemma).
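A minimal lemmatization sketch, assuming NLTK's WordNetLemmatizer (the 'wordnet' data must be downloaded first):

import nltk
nltk.download("wordnet", quiet=True)
from nltk.stem import WordNetLemmatizer

lemmatizer = WordNetLemmatizer()
print(lemmatizer.lemmatize("mice"))          # 'mouse' (noun is the default POS)
print(lemmatizer.lemmatize("running", "v"))  # 'run' ('v' marks the word as a verb)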
Which data structure did you use to store word counts? Why?
A dictionary (hash map), because it maps each word directly to its count with constant average-time lookup and insertion.
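A minimal counting sketch; Python's collections.Counter is a dictionary subclass built for exactly this:

from collections import Counter

tokens = ["the", "cat", "sat", "on", "the", "mat"]
counts = Counter(tokens)
print(counts["the"])          # 2
print(counts.most_common(2))  # [('the', 2), ('cat', 1)]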
4. Implementing N-Grams
What is an N-gram?
An N-gram is a contiguous sequence of N tokens from a text; for example, bigrams (N = 2) like "natural language" and trigrams (N = 3) like "natural language processing".
How does your program handle the start and end of a sentence?
By padding each sentence with special start (<s>) and end (</s>) markers, so that N-grams at sentence boundaries are still counted, as shown in the sketch below.
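A minimal sketch of bigram extraction with boundary padding; the <s> and </s> marker strings are a common convention, assumed here:

def bigrams_with_padding(sentence_tokens):
    # Pad so the first and last words also appear in boundary bigrams.
    padded = ["<s>"] + sentence_tokens + ["</s>"]
    return list(zip(padded, padded[1:]))

print(bigrams_with_padding(["the", "cat", "sat"]))
# [('<s>', 'the'), ('the', 'cat'), ('cat', 'sat'), ('sat', '</s>')]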
5. N-Grams Smoothing
What is smoothing, and why is it needed?
Smoothing reassigns a small amount of probability mass to unseen N-grams so the model never gives zero probability to a new word sequence; add-one (Laplace) smoothing is the simplest method.
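A minimal add-one (Laplace) smoothing sketch for bigram probabilities; the counts and vocabulary below are illustrative assumptions, not trained values:

from collections import Counter

unigram_counts = Counter({"the": 2, "cat": 1, "sat": 1})
bigram_counts = Counter({("the", "cat"): 1, ("cat", "sat"): 1})
V = len(unigram_counts)  # vocabulary size

def laplace_prob(w1, w2):
    # P(w2 | w1) = (count(w1, w2) + 1) / (count(w1) + V)
    return (bigram_counts[(w1, w2)] + 1) / (unigram_counts[w1] + V)

print(laplace_prob("the", "cat"))  # seen bigram
print(laplace_prob("the", "sat"))  # unseen bigram still gets non-zero probability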
What is a Hidden Markov Model (HMM)?
A statistical model where the system has hidden states (like POS tags) that emit the observed outputs (like words).
What is the Viterbi algorithm?
A dynamic programming algorithm for finding the most likely sequence of hidden states given a sequence of observations.
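A minimal Viterbi sketch over a toy two-tag HMM; all probabilities here are illustrative assumptions, not trained values:

def viterbi(obs, states, start_p, trans_p, emit_p):
    # V[t][s] = (probability of the best path ending in state s at step t, that path)
    V = [{s: (start_p[s] * emit_p[s].get(obs[0], 0.0), [s]) for s in states}]
    for t in range(1, len(obs)):
        V.append({})
        for s in states:
            # Pick the best previous state to transition from.
            prob, path = max(
                (V[t - 1][ps][0] * trans_p[ps][s] * emit_p[s].get(obs[t], 0.0),
                 V[t - 1][ps][1] + [s])
                for ps in states
            )
            V[t][s] = (prob, path)
    return max(V[-1].values())  # (probability, best tag sequence)

states = ("NOUN", "VERB")
start_p = {"NOUN": 0.6, "VERB": 0.4}
trans_p = {"NOUN": {"NOUN": 0.3, "VERB": 0.7}, "VERB": {"NOUN": 0.6, "VERB": 0.4}}
emit_p = {"NOUN": {"dogs": 0.5, "bark": 0.1}, "VERB": {"dogs": 0.1, "bark": 0.6}}
print(viterbi(["dogs", "bark"], states, start_p, trans_p, emit_p))
# (0.126, ['NOUN', 'VERB'])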
How would you extend the chunker to handle more complex syntactic structures?
By adding rules for more phrase types (verb and prepositional phrases), cascading the chunk grammar over several passes, or replacing the regular-expression grammar with a trained classifier-based chunker.
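A minimal cascaded-chunking sketch using NLTK's RegexpParser; the grammar rules and example sentence are illustrative assumptions, not the original lab's grammar:

import nltk

grammar = r"""
  NP: {<DT>?<JJ>*<NN.*>+}   # noun phrase: optional determiner, adjectives, nouns
  PP: {<IN><NP>}            # prepositional phrase: preposition followed by an NP
  VP: {<VB.*><NP|PP>*}      # verb phrase: verb followed by NPs and/or PPs
"""
parser = nltk.RegexpParser(grammar, loop=2)  # loop=2 re-applies the cascade

tagged = [("the", "DT"), ("dog", "NN"), ("sat", "VBD"),
          ("on", "IN"), ("the", "DT"), ("mat", "NN")]
print(parser.parse(tagged))  # nested tree with NP, PP, and VP chunks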