A. Derivational Morphology

Uploaded by

Huy Gia
Copyright: © All Rights Reserved

NLP301c by Hon Pg

Study online at https://quizlet.com/_g4qrfr

1. ___ morphology is a type of word formation that creates new lexemes
A. Derivational morphology
B. Compound morphology
C. Inflectional morphology
D. Complex morphology
Answer: A. Derivational morphology

2. What kind of ambiguities are faced by NLP?
A. Lexical and syntactical
B. NLP does not face any ambiguity
C. Semantical, discourse and Pragmatic
D. Both A & C
Answer: D. Both A & C

3. When the meaning of the words themselves can be misinterpreted, then ___ ambiguity occurs.
A. Scope Ambiguity
B. Pragmatic Ambiguity
C. Semantic Ambiguity
D. None of the others
Answer: C

4. Which of the following is not a word embedding library?
A. Word2vec
B. GloVe
C. fastText
D. TextBlob
Answer: D

5. What is a corpus?
A. A corpus is a collection of parameters and arguments
B. A corpus is a large and structured set of machine-readable texts that have been produced in a natural communicative setting
C. It refers to a situation where the context of a phrase gives it multiple interpretations
D. All of the others
Answer: B

6. In NLP, the process of removing words like "and", "is", "a", "an", "the" from a sentence is called
A. Stemming
B. Lemmatization
C. Stop word
D. derivation
Answer: C

7. What is the number of trigrams in a normalized sentence of length n words?
A. n
B. n-1
C. n-2
D. n-3
Answer: C
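
The n-2 count can be checked with a minimal sketch. The helper name `ngrams` and the example sentence are illustrative assumptions, not part of the original card.

```python
def ngrams(tokens, n):
    """Return every contiguous window of n tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

# Hypothetical 5-word sentence, already tokenized by whitespace.
sentence = "the quick brown fox jumps".split()
trigrams = ngrams(sentence, 3)

# A sentence of n words yields n - 2 trigrams (here 5 - 2 = 3).
print(len(trigrams))
```

Sentences shorter than three words produce an empty list, so the n-2 rule only applies for n >= 2.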

8. What is FSA (Finite State Automata)?
A. Finite state automaton is a model of behavior composed of states, transitions and actions.
B. This consists of rules which map the two representations to each other. Each rule is described through a finite-state transducer
C. It takes raw corpus as input and produces a segmentation of the word forms observed in the text
D. None of the others
Answer: A

9. In NLP, the process of converting a sentence or paragraph into tokens is referred to as Stemming
A. True
B. False
Answer: B

10. What are the approaches of NLP?
A. Morphological and Lexical Analysis, Syntactic Analysis, Semantic Analysis, Discourse Integration, Pragmatic Analysis
B. Symbolic, Statistical, Connectionist and Hybrid
C. Machine Learning, Deep Learning & AI
D. None of the others
Answer: A

11. ___ Is The Type Of Morphology That Changes The Word Category And Affects The Meaning.
A. Inflectional
B. Derivational
C. Cliticization
D. Text-Proofing
Answer: B

12. When we encounter two or more words with the same form and related meanings, we have what is known as ___
A. Hyponymy
B. Polysemy
C. Homonyms
D. Source
Answer: B

13. P(rolling an even number or a prime number) on a die
A. 5/6
B. 1/6
C. 1/3
D. 2/3
Answer: A

14. P(rolling an odd number or a number > 4) on a die
A. 5/6
B. 2/3
C. 1/6
D. 1/3
Answer: B
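
Both die questions (13 and 14) can be verified by enumerating the six faces. The sketch below assumes a fair six-sided die; the helper name `prob` is illustrative.

```python
from fractions import Fraction

faces = range(1, 7)  # fair six-sided die (assumption)

def prob(event):
    """Probability of an event given as a predicate over die faces."""
    favourable = [f for f in faces if event(f)]
    return Fraction(len(favourable), 6)

def is_prime(f):
    return f in (2, 3, 5)

# Question 13: even or prime -> faces {2, 3, 4, 5, 6}
p_even_or_prime = prob(lambda f: f % 2 == 0 or is_prime(f))
# Question 14: odd or greater than 4 -> faces {1, 3, 5, 6}
p_odd_or_gt4 = prob(lambda f: f % 2 == 1 or f > 4)

print(p_even_or_prime, p_odd_or_gt4)
```

Counting the union directly avoids double-counting faces that satisfy both conditions (2 in the first case, 5 in the second).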

15. Syntactic analysis or parsing may be defined as the process of ___ the ___ of symbols in natural language conforming to the rules of formal grammar.
A. Analyzing & Strings
B. Defining & Groups
C. Reducing & Arrays
D. Reviewing & Letters
Answer: A

16. The branches of linguistics that focus on the meaning of a language
A. Semantics & phonology
B. Semantics & pragmatics
C. Morphology & pragmatics
D. Pragmatics & phonology
Answer: B

17. This involves analysis of the words in a sentence by following the grammatical structure of the sentence
A. Tokens
B. Lexical Analysis
C. Discourse
D. Syntax Analysis
Answer: D

18. What Can Be Called "The Knowledge Of What Has Been Said Earlier"?
A. Situational Context
B. Background Knowledge
C. Co-Textual Context
D. Operational Knowledge
Answer: C

19. Discrete representation, aka integerized words
A. pre-cursor to words as vectors
B. distributed word representations
C. advantages of distributed word representations
D. notion of context as meaning
Answer: A

20. ___ ambiguity refers to a situation where the context of a phrase gives it multiple interpretations
A. Pragmatic
B. Anaphoric
C. Discourse
D. Cataphoric
Answer: A

21. How to use WordNet to measure semantic relatedness between words:
A. Measure the shortest path between two words on WordNet
B. Count the number of shared parent nodes
C. Measure the difference between their depths in WordNet
D. Measure the difference between the size of child nodes they have
Answer: A

22. Suppose you have the following training data for Naive Bayes:
I liked the dish [LABEL = POS]
I disliked the dish because it contains sugar [LABEL = NEG]
Really tasty dish [LABEL = POS]
What is the unsmoothed Maximum Likelihood Estimate (MLE) of P(POS) for this data?
A. 1/2
B. 1/3
C. 2/3
D. 1
Answer: C
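
The unsmoothed class prior in question 22 is just the relative frequency of each label. A minimal sketch, using the three training sentences from the card (the variable names are illustrative):

```python
from collections import Counter
from fractions import Fraction

training = [
    ("I liked the dish", "POS"),
    ("I disliked the dish because it contains sugar", "NEG"),
    ("Really tasty dish", "POS"),
]

# MLE prior: count of documents with a label / total number of documents.
label_counts = Counter(label for _, label in training)
prior = {label: Fraction(count, len(training))
         for label, count in label_counts.items()}

print(prior["POS"])
```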

23. Which application is used to identify people in context?
A. Stemming
B. Lemmatization
C. Stop word removal
D. Named entity recognition
Answer: D

24. The Area Of AI That Investigates Methods Of Facilitating Communication Between People And Computers Is:
A. Natural Language Processing
B. Symbolic Processing
C. Decision Support
D. Robotics
Answer: A

25. ___ is used for stripping of affixes. It uses a set of rules containing a list of stems and replacement rules.
A. Two-level morphology model
B. Chomsky Model
C. Finite State Automata
D. Stemmer
Answer: D

26. Morphotactics is a model of
A. Spelling modifications that may occur during affixation
B. How and which morphemes can be affixed to a stem
C. All affixes in the English language
D. N-grams of affixes and stems
Answer: B

27. Which of the following is a component of NLP?
A. Pragmatic analysis
B. Entity extraction
C. Syntactic analysis
D. All of the others
Answer: D

28. What are the two subfields of Natural Language Processing?
A. Context and Expectations
B. Recognition and Synthesis
C. Semantics of Pragmatics
D. Generation and Understanding
Answer: D

29. What is the main challenge of NLP?
A. Handling Ambiguity of Sentences
B. Handling Tokenization
C. Handling POS-Tagging
D. All of the mentioned
Answer: A

30. An example of unstructured data is ___.
A. age information
B. customer reviews
C. movie rating score
D. gender of customers
Answer: B

31. What are the components of a Morphological Analyzer according to Shrivastava et al. 2005?
A. The recognition engine, identifying suffixes, and finding a stem within the input word algorithms
B. Morpheme lexeme, Set of rules governing the spelling and composition of morphologically complex words & Decision algorithm
C. The recognition engine, set of rules & Algorithm
D. All of the others
Answer: A

32. Dictionary-based sentiment analysis is a computational approach that relies on a pre-defined list (or dictionary) of sentiment-laden words.
A. probability model.
B. a pre-defined list of sentiment-laden words.
C. CRF
D. HMM
Answer: B

33. Two or more words with the same form and related meanings by extension (foot of a person, of a bed, of a mountain); based on similarity
A. Metonymy
B. hyponymy
C. polysemy
D. hyponym
Answer: C

34. Who discovered "Turing Test"?
A. Alan Turing
B. Venessa Turing
C. Leibniz
D. Descartes
Answer: A

35. What does pragmatic ambiguity refer to?
A. It refers to a situation where the context of a phrase gives it multiple interpretations
B. It refers to Statistical analysis
C. It refers to only Misinterpreted words
D. All of the others
Answer: A

36. NLP stands for Natural Language Processing.
A. True
B. False
Answer: A

37. Which of the following techniques is used to remove semantic ambiguity?
A. Fuzzy Logic
B. Shallow Semantic Analysis
C. Syntactic analysis
D. Word Sense Disambiguation
Answer: D

38. Which of the following are advantages of normalizing a word? (select 2)
A. It helps in reducing the randomness in the word
B. It increases the false negatives
C. It reduces the dimensionality of the input
D. All of the others
Answer: A, C

39. What is outcome thinking?
A. Knowing what you want rather than what you don't want.
B. Know about others
C. Know about the society None
D. language
Answer: A

40. ___ is also known as shallow parsing.
A. Rooting
B. Chunking
C. Steaming
D. Lemmatization
Answer: B

41. Parts-of-Speech tagging does not determine ___
A. part-of-speech for each word dynamically as per meaning of the sentence
B. part-of-speech for each word dynamically as per sentence structure
C. all part-of-speech for a specific word given as input
D. all part-of-speech for a specific stem from input
Answer: D

42. The process of converting data to something a computer can understand is referred to as ___
A. Post processing
B. Pre processing
C. Pre defined
D. Post defined
Answer: B

43. "He doesn't know" is an example of ___ type of deixis
A. Personal
B. Time
C. Social
D. Space
Answer: A

44. IR (Information Retrieval) and IE (Information Extraction) are the same thing
A. TRUE
B. FALSE
Answer: B

45. How many trigram phrases can be generated from the following sentence after performing stop word removal?
Google is one of the most widely used search engine in Vietnam.
A. 2
B. 3
C. 4
D. 5
Answer: A. 2 (according to the answer key of the foreign textbook); C. 4 (according to GPT)
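
The count for question 45 depends entirely on which stop word list is used, which may explain why two different answers circulate. The sketch below is illustrative only: both stop word sets are hypothetical choices made for this example, not the lists used by the textbook or by any particular library.

```python
import string

def trigrams_after_stopwords(sentence, stop_words):
    """Lowercase, strip punctuation, drop stop words, then build trigrams."""
    tokens = [w.strip(string.punctuation).lower() for w in sentence.split()]
    kept = [t for t in tokens if t and t not in stop_words]
    return [tuple(kept[i:i + 3]) for i in range(len(kept) - 2)]

sentence = "Google is one of the most widely used search engine in Vietnam."

# Hypothetical small stop list: keeps 6 content words.
small_list = {"is", "one", "of", "the", "most", "in"}
# Hypothetical more aggressive list: also drops "widely" and "used".
large_list = small_list | {"widely", "used"}

print(len(trigrams_after_stopwords(sentence, small_list)))
print(len(trigrams_after_stopwords(sentence, large_list)))
```

With the smaller list six tokens survive, giving four trigrams; with the larger list four tokens survive, giving two.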

46. What Creates Problems In Machine Translation?
A. Different Level Of Ambiguities
B. Processing Power
C. Memory
D. Diversity
Answer: A

47. The collection of documents required for text analysis is known as ___
A. Dictionary
B. Lexicon
C. Corpus
D. Stemming
Answer: C

48. Which of the following is not a problem when using Maximum Likelihood Estimation to obtain parameters in a language model?
A. Unreliable estimates where there is little training data
B. Out-of-vocabulary terms
C. Overfitting
D. Smoothing
Answer: D

49. ___ is an advanced version of FSA (finite state automata) and is used to represent the lexicon computationally.
A. FST
B. FSA
C. DAWG
D. Stemmer Algorithm
Answer: A

50. Which is not a type of Sentiment Analysis?
A. Emotion Detection
B. Aspect based
C. Word based
D. Bilingual
Answer: D

51. This type of automata maps between two sets of symbols.
A. DFA
B. Turing Machine
C. FST
D. NFA
Answer: C

52. ___ is a phrase whose head is a noun or a pronoun, optionally accompanied by a set of modifiers.
A. Pronoun Phrase
B. Adverb Phrase
C. Noun Phrase
D. Preposition Phrase
Answer: C

53. Elements of Semantic analysis
A. Hyponymy
B. Homonymy
C. Polysemy
D. Hyponymy, Homonymy, Polysemy
Answer: D. Hyponymy, Homonymy, Polysemy

54. ___ is the type of morphology that changes the word category and affects the meaning.
A. Inflectional
B. Derivational
C. Cliticization
D. All of the others
Answer: B. Derivational

55. What are the input and output of an NLP system?
A. Speech and noise
B. Speech and Written Text
C. Noise and Written Text
D. Noise and value
Answer: B. Speech and Written Text

56. Which of the following statements is (are) true for the Word2Vec model?
A. The architecture of word2vec consists of only two layers - continuous bag of words and skip-gram model
B. Continuous bag of words (CBOW) is a Recurrent Neural Network model
C. Both CBOW and Skip-gram are shallow neural network models
D. All of the others
Answer: C. Both CBOW and Skip-gram are shallow neural network models

57. Software designed for taking input data (text) and giving a structural representation of the input after checking for correct syntax or grammar is
A. Compiler
B. Parser
C. Painter
D. Easydraw
Answer: A. Compiler

58. OCR (Optical Character Recognition) uses NLP.
A. TRUE
B. FALSE
Answer: A. TRUE

59. Given a sequence of observations and an HMM model, which of the following fundamental problems of HMM finds the most likely sequence of states that produced the observations in an efficient way?
A. Evaluation problem
B. Likelihood estimation problem
C. Decoding problem
D. Learning problem
Answer: C. Decoding problem

60. Which of the following smoothing techniques assigns too much probability to unseen events?
A. Add-1 smoothing
B. Add-k smoothing
C. Witten-Bell smoothing
D. Good-Turing smoothing
Answer: A. Add-1 smoothing
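
A small numeric sketch can show why add-1 smoothing is said to over-reward unseen events. The counts and the vocabulary size below are made-up toy values; the formula used is the standard add-k estimate (count + k) / (total + k * V).

```python
from fractions import Fraction

def add_k_prob(count, total, vocab_size, k=1):
    """Add-k smoothed probability of a word with the given count."""
    k = Fraction(k)
    return (count + k) / (total + k * vocab_size)

# Toy data (assumption): 2 observed word types out of a 10-word vocabulary.
counts = {"a": 3, "b": 1}
vocab_size = 10
total = sum(counts.values())
unseen_types = vocab_size - len(counts)

# Total probability mass handed to words never seen in training.
mass_add1 = unseen_types * add_k_prob(0, total, vocab_size, k=1)
mass_add_tenth = unseen_types * add_k_prob(0, total, vocab_size, k=Fraction(1, 10))

print(mass_add1, mass_add_tenth)
```

In this toy case add-1 moves more than half of the probability mass to unseen words, while k = 0.1 moves far less, which is the usual motivation for add-k and the more refined techniques listed in the options.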

61. ___ concerns how the immediately preceding sentences affect the interpretation of the next sentence
A. Pragmatics
B. Syntax
C. Discourse
D. Semantics
Answer: C. Discourse

62. Where are the additional variables added in HMM?
A. Temporal model
B. Reality model
C. Probability model
D. In all three models, temporal, reality and probability model
Answer: A. Temporal model

63. Given a sentence or larger chunk of text, determine which words ("mentions") refer to the same objects ("entities")
A. Anaphora Resolution
B. Coreference Resolution
C. Noun Resolution
D. Pronoun Resolution
Answer: B. Coreference Resolution

64. Which of the following is used to map a sentence plan into sentence structure?
A. Text planning
B. Sentence planning
C. Text Realization
D. All of the others
Answer: C. Text Realization

65. Lexicon, orthographic rules and spelling variations are the components of ___
A. FST
B. FSA
C. Two-level morphology
D. Stemmer Algorithm
Answer: A. FST

66. Speech Segmentation is a subtask of Speech Recognition.
A. TRUE
B. FALSE
Answer: A. TRUE

67. Which Among The Following Is An Important Component Of Natural Language Processing?
A. Representation
B. Description
C. Exposition
D. Narration
Answer: A. Representation

68. The word "bank" can mean a river bank or a financial institution. This denotes
A. Antonymy
B. Polysemy
C. Homonyms
D. Synonymy
Answer: B. Polysemy

69. Which of the following includes major tasks of NLP?
A. Automatic Summarization
B. Discourse Analysis
C. Machine Translation
D. All of the others
Answer: D. All of the others

70. Dissimilarity between words expressed using cosine similarity will have values significantly higher than 0.5
A. True
B. False
Answer: A. True

71. What is Natural Language Processing good for?
A. Summarize blocks of text
B. Automatically generate keyword tags
C. Identify the type of entity extracted
D. All of the others
Answer: D. All of the others

72. The more hidden layers a neural network has, the better it can predict desired outputs for new inputs that it was not trained with.
A. True
B. False
Answer: B. False

73. In the English language inflectional morphemes can be
A. Prefixes, Suffixes and Infixes
B. Suffixes only
C. Infixes Only
Answer: B. Suffixes only

74. The words "there" and "their" cause which of the following types of ambiguity?
A. Syntactic
B. Semantic
C. Phonological
D. Pragmatic
Answer: C. Phonological

75. Which of the following models can be used for the purpose of document similarity?
A. Training a word2vec model on the corpus that learns context present in the document
B. Training a bag of words model that learns occurrence of words in the document
C. Creating a document-term matrix and using cosine similarity for each document
D. All of the others
Answer: D. All of the others

76. What is Machine Translation?
A. Converts one human language to another
B. Converts human language to machine language
C. Converts any human language to English
D. Converts Machine language to human language
Answer: A. Converts one human language to another

77. Words may have multiple meanings. This leads to what type of ambiguity in NLP?
A. Syntactic ambiguity
B. Anaphoric ambiguity
C. Semantic ambiguity
D. Lexical ambiguity
Answer: D. Lexical ambiguity

78. You created a document term matrix on the input data of 20K documents for a machine learning model. Which of the following can be used to reduce the dimensions of data?
1. Keyword Normalization
2. Latent Semantic Indexing
3. Latent Dirichlet Allocation
A. only 1
B. 2, 3
C. 1, 3
D. 1, 2, 3
Answer: D. 1, 2, 3

79. What does 'discourse' refer to in the study of language?
A. the vocabulary of a text
B. the structure, organisation and layout of a text
C. the meaning behind the vocabulary of a text
D. the mode of a text
Answer: B. the structure, organisation and layout of a text

80. Morphemes that cannot stand alone and are typically attached to another to become a meaningful word are called
A. Free morphemes
B. Bound morphemes
C. Derived morphemes
D. Lexical morphemes
Answer: B. Bound morphemes

81. Which of the following is not an error correction and detection code?
A. Block code
B. Convolutional codes
C. Passive codes
D. Turbo codes
Answer: C. Passive codes

82. The most widely used stemmer algorithm is
A. Viterbi Algorithm
B. Porter Algorithm
C. Decision Algorithm
D. Stemmer Algorithm
Answer: B. Porter Algorithm

83. Which of the following is not a primitive operation of a regular expression?
A. Concatenation
B. Closure
C. Union
D. Projection
Answer: D. Projection

84. ___ transfers linear sequences of words into structures
A. Semantic Analysis
B. Tokens
C. Lexical Analysis
D. Discourse
Answer: A. Semantic Analysis

85. Which data is used in a supervised approach to machine translation?
A. Plain text
B. Labelled text
C. Dictionary
D. Vectors
Answer: C. Dictionary

86. What does scope ambiguity involve?
A. Operators and quantifiers
B. Parameters and arguments
C. Tokens
D. None of Above
Answer: A. Operators and quantifiers

87. Given a sound clip of a person or people speaking, determine the textual representation of the speech.
A. Text-to-speech
B. Speech-to-text
C. All of the mentioned
D. None of the others
Answer: B. Speech-to-text

88. There are 7 boys in the class and 3 play in the band. There are 8 girls in the class and 2 play in the band. What is the probability of selecting a boy band member?
A. 1/5
B. 2/5
C. 3/5
D. 4/5
Answer: A. 1/5
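
The arithmetic for question 88 can be written out as a joint probability over the whole class. This sketch assumes one student is drawn uniformly at random from all 15 students.

```python
from fractions import Fraction

boys, girls = 7, 8
boys_in_band = 3

# P(boy and in band) = boys in band / all students in the class.
p_boy_band_member = Fraction(boys_in_band, boys + girls)

print(p_boy_band_member)
```

Note that this is the joint probability 3/15, not the conditional probability 3/7 of being in the band given that the student is a boy.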

89. Which one of the following is not a pre-processing technique in NLP?
A. Stemming and Lemmatization
B. converting to lowercase
C. removing punctuations
D. removal of stop words
E. Sentiment analysis
Answer: E. Sentiment analysis

90. Which is NOT a conjunction?
A. but
B. and
C. or
D. that
Answer: D. that

91. Given a stream of text, Named Entity Recognition determines which pronoun maps to which noun.
A. TRUE
B. FALSE
Answer: B. FALSE

92. How many unigram phrases can be generated from the following sentence after performing the following text cleaning steps: stop word removal and replacing punctuation by a single space?
"Delhi is the capital of but Mumbai is the financial capital of India."
A. 8
B. 7
C. 6
D. 5
Answer: C. 6
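
The cleaning steps in question 92 can be sketched as follows. The stop word set is a hypothetical minimal list chosen for this sentence; a different list could change the count.

```python
import string

sentence = "Delhi is the capital of but Mumbai is the financial capital of India."

# Step 1: replace every punctuation character with a single space.
table = str.maketrans(string.punctuation, " " * len(string.punctuation))
cleaned = sentence.translate(table)

# Step 2: remove stop words (hypothetical list, an assumption of this sketch).
stop_words = {"is", "the", "of", "but"}
unigrams = [w for w in cleaned.split() if w.lower() not in stop_words]

print(unigrams)
print(len(unigrams))
```

The repeated word "capital" is counted twice here because the question asks for unigram tokens, not distinct word types.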

93. In a corpus of N documents, one randomly chosen document contains a total of T terms and the term "hello" appears K times. What is the correct value for the product of TF (term frequency) and IDF (inverse document frequency), if the term "hello" appears in approximately one-third of the total documents?
A. KT * Log(3)
B. T * Log(3) / K
C. K * Log(3) / T
D. Log(3) / KT
Answer: C. K * Log(3) / T
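
The answer follows from TF = K / T and IDF = log(N / (N/3)) = log(3). A minimal numeric check, assuming the plain (unsmoothed) definitions and a natural logarithm, since the card does not state the base; the sample numbers are arbitrary:

```python
import math

def tf_idf(k, t, n_docs, doc_freq):
    """TF-IDF with TF = k / t and IDF = log(n_docs / doc_freq)."""
    return (k / t) * math.log(n_docs / doc_freq)

# Arbitrary example: the term occurs 5 times in a 100-term document and
# appears in 100 of 300 documents (one third of the corpus).
value = tf_idf(k=5, t=100, n_docs=300, doc_freq=100)

print(value)
```

Library implementations often add smoothing terms to the IDF, so their numbers may differ from this textbook form.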

94. Operators and Quantifiers are mostly responsible for
A. Scope Ambiguity
B. Pragmatic Ambiguity
C. Semantic Ambiguity
D. None of These
Answer: A. Scope Ambiguity

95. ___ extracts all the documents containing the key words
A. Information Extraction
B. Information Retrieval
C. Inflection
D. Inflation
Answer: B. Information Retrieval

96. Connectionist Approach is based on
A. The interconnection of networks having simple processing units with knowledge stored in weights to identify connections between units.
B. It performs extensive analysis of linguistic phenomena through explicit representation of facts about language and well-understood knowledge representation schemas and associated algorithms.
C. It harnesses various mathematical techniques and often uses large text corpora to develop approximately generalized models of linguistic phenomena based on actual examples.
D. None of Above
Answer: A

97. Markov chains can have more than one invariant distribution.
A. True
B. False
Answer: B. False

98. What is morphology?
A. The study of the rules governing the sounds that form words
B. The study of the rules governing sentence formation
C. The study of the rules governing word formation
D. The study of the rules governing sounds
Answer: C. The study of the rules governing word formation

99. Natural language processing is divided into the two subfields of
A. symbolic and numeric
B. algorithmic and heuristic
C. time and motion
D. understanding and generation
Answer: D. understanding and generation

100. What are 'indefinite noun phrases' in reference phenomena?
A. Introduces entities that are new to the hearer into the discourse context
B. Introduces entities that are previous or old to the hearer into the discourse context
C. Entities that accept the irregular phrases
D. Entities that accept the regular phrases
Answer: A. Introduces entities that are new to the hearer into the discourse context

101. It focuses on the proper ordering of words, which can affect meaning.
A. Syntax Analysis
B. Semantic Analysis
C. Lexical Analysis
D. Pragmatic Analysis
Answer: A. Syntax Analysis

102. Which of the following NLP tasks use sequential labeling technique?
A. POS tagging
B. Named Entity Recognition
C. Speech recognition
D. All of the others
Answer: D. All of the others

103. Connectionist Approach is widely known as ___
A. Statistical
B. Symbolical
C. Neural Network
D. All of the others
Answer: C. Neural Network

104. Which of the following is not a true input for NLP?
A. Image
B. Text
C. Types input cent
D. Speech
Answer: A. Image

105. Which of the following is not an algorithm for decoding convolution codes?
A. Viterbi algorithm
B. Stack algorithm
C. Fano's sequential coding
D. Ant colony optimization
Answer: D. Ant colony optimization

106. Typing "buckled" when you meant "bucked" is which type of spelling error?
A. Non-word Errors
B. Real Word Errors
C. Cognitive Errors
D. Short forms/Slang/Lingo
Answer: B. Real Word Errors

107. "The car hit the pole while it was moving." What type of ambiguity exists in the above sentence?
A. Semantic
B. Syntactic
C. Lexical
D. Pragmatic
Answer: A. Semantic

108. - Cluster analysis: a category of unsupervised learning techniques that allow us to discover hidden structures in data where we do not know the right answer up front
- Goal: find natural groupings in data such that items in the same cluster are more similar to each other than to those in different clusters
- K-Means is the most popular clustering method
A. How multi-layer perceptron (MLP) functions (steps)
B. Concept-based Approach (what, how, weaknesses)
C. K-Means Clustering (define clustering and goal)
D. Dictionary-based/Keyword Spotting (what, how, weaknesses)
Answer: C. K-Means Clustering (define clustering and goal)

109. ___ deals with the overall communicative and social content and its effect on interpretation.
A. Tokens
B. Pragmatic Analysis
C. Symbolic Analysis
D. Morphical And Lexical Analysis
Answer: B. Pragmatic Analysis

110. Word embeddings capture multiple dimensions of data and are represented as vectors
A. True
B. False
Answer: A. True

111. ___ also converts a word to its root form.
A. Rooting
B. Dreaming
C. Steaming
D. Lemmatization
Answer: D. Lemmatization

112. Function morphemes are also called ___
A. open-class morphemes
B. sub-class morphemes
C. super-class morphemes
D. closed-class morphemes
Answer: D. closed-class morphemes

113. What are the components of NLP?
A. Morphological and Lexical Analysis
B. Syntactic Analysis and Semantic Analysis
C. Discourse Integration, Pragmatic Analysis
D. All of the others
Answer: D. All of the others

114. The reference to an entity that has been previously introduced into the sentence is called
A. discourse
B. anaphora
C. co refer
D. referent
Answer: B. anaphora

115. In the sentence, "They bought a blue house", the underlined part (a blue house) is an example of
A. Noun phrase
B. Verb phrase
C. Prepositional phrase
D. Adverbial phrase
Answer: A. Noun phrase

116. A Bidirectional Feedback Loop Links Computer Modelling With:
A. Artificial Science
B. Heuristic Processing
C. Human Intelligence
D. Cognitive Science
Answer: D. Cognitive Science

117. Statistical Approach is also called
A. Corpus Based Approach
B. Rule Based Approach
C. CNN
D. K-nearest
Answer: A. Corpus Based Approach

118. How to implement NLP?
A. Machine Learning & Statistical Inference.
B. Machine Learning & AI
C. Deep Learning
D. Python & R
Answer: A. Machine Learning & Statistical Inference.

119. What is the right order for a text classification model's components?
1. Text cleaning
2. Text annotation
3. Gradient descent
4. Model tuning
5. Text to predictors
A. 12345
B. 13425
C. 12534
D. 13452
Answer: C. 12534

120. Finite State Transducer is an advanced version of ___ and is used to represent the lexicon computationally.
A. FST
B. FSA
C. DAWG
D. Stemmer Algorithm
Answer: B. FSA

121. Which of the following is/are the input(s) to the k-means algorithm? (select 3)
A. Number of clusters
B. Class labels
C. Distance metric
D. Number of centroids
Answer: A. Number of clusters; C. Distance metric; D. Number of centroids

122. ___ is the process of extracting phrases from unstructured text and adding more structure to it.
A. Rooting
B. Chunking
C. Steaming
D. Lemmatization
Answer: B. Chunking

123. Generating natural, conversational language that explains complex concepts in a way that is easy to consume.
A. Intuitive
B. Relevant
C. Timely
D. Space
Answer: B. Relevant

124. What is Syntax Analysis?
A. This only abstracts the dictionary meaning or the real meaning from the given context.
B. This component transfers linear sequences of words into structures. It shows how the words are associated with each other.
C. It deals with the overall communicative and social content and its effect on interpretation. It means abstracting or deriving the meaningful use of language in situations.
D. It focuses on the proper ordering of words, which can affect meaning. This involves analysis of the words in a sentence by following the grammatical structure of the sentence. The words are transformed into the structure to show how the words are related to each other.
Answer: D

125. Which of the text parsing techniques can be used for noun phrase detection, verb phrase detection, subject detection, and object detection in NLP?
A. Part of speech tagging
B. Skip Gram and N-Gram extraction
C. Continuous Bag of Words
D. Dependency Parsing and Constituency Parsing
Answer: D. Dependency Parsing and Constituency Parsing

126. The study of the sound patterns in natural language and the rules that govern them is:
A. Phonetics
B. Morphology
C. Phonology
D. Syntax
Answer: C. Phonology

127. Consider the language L = {a^n b^n c^m | m, n >= 0}. Which of the following strings are in L?
A. abbe
B. ab
C. aabc
D. abbec
Answer: B. ab
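
Membership in L = {a^n b^n c^m | m, n >= 0} can be tested with a short sketch: a regular expression checks the overall a...b...c shape, and a length comparison enforces the equal number of a's and b's, which a regular expression alone cannot do. The function name `in_language` is illustrative.

```python
import re

def in_language(s):
    """True if s has the form a^n b^n c^m with m, n >= 0."""
    match = re.fullmatch(r"(a*)(b*)(c*)", s)
    return bool(match) and len(match.group(1)) == len(match.group(2))

# The four options exactly as printed on the card.
for candidate in ["abbe", "ab", "aabc", "abbec"]:
    print(candidate, in_language(candidate))
```

Only "ab" passes: "aabc" has two a's but one b, and the other two options contain a symbol outside {a, b, c} as printed.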

128. "He was running quickly into the stadium". What type of phrase is this?
A. Noun phrase
B. Verb phrase
C. Prepositional phrase
D. Adjectival phrase
Answer: B. Verb phrase

129. Which of the following techniques can be used for the purpose of keyword normalization, the process of converting a keyword into its meaningful base form?
A. Lemmatization
B. Levenshtein distance
C. Morphing
D. Stemming
Answer: A. Lemmatization

130. Corpus Based Approach is also called
A. Statistical Approach
B. Rule Based Approach
C. CNN
D. K-nearest
Answer: A. Statistical Approach

131. Which is a finite state machine with two tapes: an input tape and an output tape?
A. Finite State Transducers (FSTs)
B. Finite State Translators (FSTs)
C. Finite Automata
D. Deterministic Finite Automaton
Answer: A. Finite State Transducers (FSTs)

132. NLP is concerned with the interactions between computers and human (natural) languages.
A. True
B. False
Answer: A. True

133. What are the possible features of a text corpus in NLP?
A. Count of the word in a document
B. Vector notation of the word
C. Part of Speech Tag
D. Basic Dependency Grammar
E. All of the others
Answer: E. All of the others

134. What will be the perplexity value if you calculate the perplexity of an unsmoothed language model on a corpus with unseen words?
A. Zero
B. Infinity
C. Any Non-Zero Value
D. Inefficient
Answer: B. Infinity
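
The infinite perplexity arises because an unsmoothed model assigns probability zero to any unseen word. A minimal sketch with an unsmoothed unigram model; the toy training text and the function name are assumptions made for illustration.

```python
import math
from collections import Counter

def unigram_perplexity(train_tokens, test_tokens):
    """Perplexity of test_tokens under an unsmoothed unigram MLE model."""
    counts = Counter(train_tokens)
    total = len(train_tokens)
    log_prob = 0.0
    for token in test_tokens:
        p = counts[token] / total
        if p == 0.0:
            # An unseen word has zero probability, so perplexity diverges.
            return math.inf
        log_prob += math.log(p)
    return math.exp(-log_prob / len(test_tokens))

train = "the cat sat on the mat".split()
seen_only = unigram_perplexity(train, ["the", "cat"])
with_unseen = unigram_perplexity(train, ["the", "dog"])

print(seen_only, with_unseen)
```

Smoothing (for example add-k) gives unseen words a small non-zero probability, which keeps the perplexity finite.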

135. Computer vs computational is an example of ___ morphology.
A. Inflectional
B. Derivational
C. Cliticization
D. All of the others
Answer: B. Derivational

136. Which of the following is NOT a good example of a cohesive device?
A. Discourse markers
B. Pronouns
C. Prepositions
D. Demonstratives
Answer: C. Prepositions

137. The Hidden Markov Model directly models the dependency of each hidden state on all previous hidden states
A. True
B. False
Answer: B. False

138. A frequently used statistical model in NLP
A. Stochastic
B. Hybrid
C. HMM
D. Linguistic
Answer: C. HMM

139. In linguistic morphology ___ is the process for reducing inflected words
A. Rooting
B. Stemming
C. Text-Proofing
D. Both Rooting & Stemming
Answer: B. Stemming

140. Determining whether "duck" is a verb or a noun can be solved by ___
A. Part-of-speech tagging.
B. Lexical analysis
C. Semantic analysis
D. Pragmatic analysis
Answer: A. Part-of-speech tagging.

141. Which sentence describes inflectional morphology?
A. Adding a morpheme to produce a new word but the same lexeme.
B. Adding a morpheme to produce a new word and different lexeme.
C. Adding a morpheme to produce the same word but different lexeme.
D. Adding a morpheme to produce the same sentence but different lexeme.
Answer: A. Adding a morpheme to produce a new word but the same lexeme.

142. What refers to a situation where the context of a phrase gives it multiple interpretations?
A. Lexical Ambiguity
B. Scope Ambiguity
C. Semantic Ambiguity
D. Pragmatic Ambiguity
Answer: D. Pragmatic Ambiguity

143. Which of the following techniques can be used for keyword normalization in NLP, the process of converting a keyword into its base form?
A. Lemmatization
B. Soundex
C. Cosine Similarity
D. N-grams
Answer: A. Lemmatization

144. Symbolic Approach is also called
A. Convolutional Neural Networks.
B. Rule based Approach.
C. Corpus based.
D. Hybrid
Answer: B. Rule based Approach.

145. Parsing determines Parse Trees (Grammatical Analysis) for a given sentence.
A. TRUE
B. FALSE
Answer: A. TRUE

146. "He lifted the beetle with red cap." contains which type of ambiguity?
A. Lexical ambiguity
B. Syntax Level ambiguity
C. Referential ambiguity
D. All of the mentioned
Answer: B. Syntax Level ambiguity

147. Examples of NLP?
A. Digital assistance, chatbots, Text summarization, text retrieval, sentiment analysis...
B. Clustering and differentiating patterns.
C. Deep Learning, Machine Learning, AI etc.
D. None of Above
Answer: A. Digital assistance, chatbots, Text summarization, text retrieval, sentiment analysis...

148. Dog is a hyponym of
A. Forest
B. Human
C. Animal
D. Automobile
Answer: C. Animal

149. Choose from the following areas where NLP can be useful.
A. Automatic Text Summarization
B. Automatic Question-Answering Systems
C. Information Retrieval
D. All of the others
Answer: D. All of the others

150. In NLP, the process of removing words like "and", "is", "a", "an", "the" from a sentence is called
A. Stemming
B. Lemmatization
C. Stop word removal
D. All of the others
Answer: C. Stop word removal
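
A minimal sketch of stop word removal, assuming whitespace tokenization and a stop list made up of only the five words quoted in the question. The `STOP_WORDS` set and `remove_stop_words` helper are illustrative names, not part of any library.

```python
# Stop list limited to the words named in the question (an assumption;
# practical stop lists are much longer).
STOP_WORDS = {"and", "is", "a", "an", "the"}


def remove_stop_words(sentence: str) -> list[str]:
    """Lowercase, split on whitespace, and drop tokens found in STOP_WORDS."""
    return [tok for tok in sentence.lower().split() if tok not in STOP_WORDS]


filtered = remove_stop_words("The cat is on a mat and the dog is outside")
print(filtered)
```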

151. Which of the following techniques can be used to compute the distance between two word.. (select 2)
A. Lemmatization
B. Euclidean distance
C. Cosine Similarity
D. N-grams
Answer: B. Euclidean distance; C. Cosine Similarity
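
Both measures can be sketched in a few lines of standard-library Python. The two vectors below are made-up toy embeddings, chosen so that they point in the same direction but differ in length.

```python
import math


def euclidean_distance(u, v):
    """Straight-line distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def cosine_similarity(u, v):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Hypothetical toy word vectors: vec_b is vec_a scaled by 2.
vec_a = [1.0, 2.0, 2.0]
vec_b = [2.0, 4.0, 4.0]

print(euclidean_distance(vec_a, vec_b))
print(cosine_similarity(vec_a, vec_b))
```

The example illustrates the usual design trade-off: Euclidean distance is sensitive to vector length, while cosine similarity depends only on direction.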

152. The area of AI that investigates methods of facilitating communication between people and computers.
A. Natural Language Processing
B. Symbolic Processing
C. Decision Support
D. Robotics
Answer: A. Natural Language Processing

153. ____ depicts analyzing, identifying and describing the structure of words.
A. Tokens
B. Semantic Analysis
C. Symbolic Analysis
D. Morphological and Lexical Analysis
Answer: D. Morphological and Lexical Analysis

154. What are the possible values of the variable?
A. Variables
B. Literals
C. Discrete variable
D. Possible states of the world
Answer: D. Possible states of the world

155. Which of the following is a component of NLP?
A. Pragmatic analysis
B. Entity extraction
C. Syntactic analysis
D. All of the others
Answer: D. All of the others

156. What is Morphological and Lexical Analysis?
A. It depicts analyzing, identifying and describing the structure of words. It includes dividing a text into paragraphs, sentences and words.
B. This component transfers linear sequences of words into structures.
C. This only abstracts the dictionary meaning or the real meaning from the given context.
D. All of the others
Answer: A

157. What does Pragmatic Analysis do?
A. This component transfers linear sequences of words into structures.
B. This only abstracts the dictionary meaning or the real meaning from the given context.
C. This component transfers linear sequences of words into structures. It shows how the words are associated with each other.
D. It deals with the overall communicative and social content and its effect on interpretation, abstracting or deriving the meaningful use of language in situations.
Answer: D

158. Morphological analysis is also known as ___
A. Sentiment Analysis
B. Pragmatic Analysis
C. CNN
D. Lexical Analysis
Answer: D. Lexical Analysis

159. Different learning methods do not include:
A. Memorization
B. Analogy
C. Deduction
D. Introduction
Answer: D. Introduction

160. The words "bank/data bank/blood bank" are an example of ___
A. Homophony
B. Synonymy
C. Polysemy
D. Hyponymy
Answer: C. Polysemy

161. Natural Language Processing (NLP) is the field of
A. Artificial Intelligence
B. Computer Science
C. Linguistics
D. All of the others
Answer: D. All of the others

162. How does the Statistical Approach work?
A. It uses statistical methods to resolve some of the difficulties in symbolic approach. It does this by harnessing various mathematical techniques and often using large text corpora to develop approximately generalized models of linguistic phenomena based on actual examples.
B. It performs extensive analysis of linguistic phenomena through explicit representation of facts about language and well-understood knowledge representation schemas and associated algorithms.
C. It harnesses various mathematical techniques and often uses large text corpora to develop approximately generalized models of linguistic phenomena based on actual examples.
D. All of the others
Answer: A

163. What will be the perplexity value if you calculate the perplexity of an unsmoothed language model on a test corpus with unseen words?
A. Zero
B. Infinity
C. Any Non-Zero Value
D. Inefficient
Answer: B. Infinity
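
The reason is that an unsmoothed model assigns probability 0 to any unseen word, and perplexity is the inverse probability of the test set normalized by its length. A minimal sketch with a maximum-likelihood unigram model follows; the toy corpus and the `unigram_perplexity` helper are assumptions made for illustration.

```python
import math
from collections import Counter


def unigram_perplexity(train_tokens, test_tokens):
    """Perplexity of an unsmoothed (maximum-likelihood) unigram model."""
    counts = Counter(train_tokens)
    total = sum(counts.values())
    log_prob = 0.0
    for tok in test_tokens:
        p = counts[tok] / total  # 0.0 for any token never seen in training
        if p == 0.0:
            # One zero-probability token makes the whole test probability 0,
            # so its inverse (the perplexity) is infinite.
            return math.inf
        log_prob += math.log(p)
    return math.exp(-log_prob / len(test_tokens))


train = "the cat sat on the mat".split()
seen_ppl = unigram_perplexity(train, "the cat".split())    # finite
unseen_ppl = unigram_perplexity(train, "the dog".split())  # "dog" is unseen
print(seen_ppl, unseen_ppl)
```

Smoothing methods such as add-k exist precisely to keep every probability above zero so that the perplexity stays finite.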

164. Which of the following techniques can be used for the purpose of keyword normalization, the process of converting a keyword into its meaningful base form?
A. Lemmatization
B. Levenshtein distance
C. Morphing
D. Stemming
Answer: A. Lemmatization
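
Unlike suffix stripping, lemmatization maps a word to a meaningful dictionary form. The sketch below assumes a tiny hand-written lookup table, `LEMMA_TABLE`, standing in for a real lexicon; both the table and the `lemmatize` helper are hypothetical.

```python
# Hypothetical lookup table standing in for a real lexicon.
LEMMA_TABLE = {
    "went": "go",
    "better": "good",
    "mice": "mouse",
    "running": "run",
}


def lemmatize(word: str) -> str:
    """Return the dictionary base form if known, else the lowercased word."""
    key = word.lower()
    return LEMMA_TABLE.get(key, key)


lemmas = [lemmatize(w) for w in ["Went", "mice", "table"]]
print(lemmas)
```

Irregular forms such as "went" or "mice" show why a lexicon is needed: no suffix rule recovers "go" or "mouse".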

165. _____ deals with the overall communicative and social content and its effect on interpretation.
A. Tokens
B. Pragmatic Analysis
C. Symbolic Analysis
D. Morphological and Lexical Analysis
Answer: B. Pragmatic Analysis

166. Which of the following is not a valid input for NLP?
A. Image
B. Text
C. Typed input
D. Speech
Answer: A. Image

167. What type of architecture does named entity recognition use?
A. Many to many
B. Many to one
C. One to many
Answer: A. Many to many

168. 'do', 'eat', 'go' are examples of which type of verb?
A. Regular verb
B. Irregular verb
C. Complex verb
D. Normal verb
Answer: B. Irregular verb

169. Input layer (w. one input per feature), one or more hidden layers and an output layer (w. one output per target)
A. What needs to be chosen?
B. Feedforward network
C. Neural network
D. Recurrent network
Answer: C. Neural network

170. In the add-k smoothing method, for a small k value, what would the perplexity be?
A. High perplexity
B. Zero perplexity
C. Low perplexity
D. Perplexity is not disturbed
Answer: A. High perplexity

171. Syntactical analysis is done at ___ level
A. Sentence
B. Word
C. Lexicon
D. Symbol
Answer: A. Sentence

172. For an HMM with N hidden states and V observable states, what is the dimension of the State Transition Probability Matrix?
A. N×V
B. N×1
C. N×N
D. 1×N
Answer: C. N×N
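
The shapes follow from what each matrix stores: the transition matrix holds one probability for every ordered pair of hidden states (N×N), while the emission matrix holds one probability for every (hidden state, observation) pair (N×V). A minimal sketch with randomly filled matrices is shown below; the sizes N = 3 and V = 5 and the helper name are arbitrary assumptions.

```python
import random


def random_stochastic_matrix(rows, cols, rng):
    """Build a rows x cols matrix whose rows each sum to 1."""
    matrix = []
    for _ in range(rows):
        weights = [rng.random() + 1e-9 for _ in range(cols)]
        total = sum(weights)
        matrix.append([w / total for w in weights])
    return matrix


N, V = 3, 5  # hypothetical numbers of hidden states and observable symbols
rng = random.Random(0)

# transition[i][j] = P(next state j | current state i)  -> N x N
transition = random_stochastic_matrix(N, N, rng)
# emission[i][k] = P(observation k | state i)            -> N x V
emission = random_stochastic_matrix(N, V, rng)

transition_shape = (len(transition), len(transition[0]))
emission_shape = (len(emission), len(emission[0]))
print(transition_shape, emission_shape)
```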

173. The English words "through" and "threw" are examples of ___
A. Antonymy
B. Polysemy
C. Synonymy
D. Homophony
Answer: D. Homophony

174. The words 'there' and 'their' cause which of the following types of ambiguity?
A. Syntactic
B. Semantic
C. Phonological
D. Pragmatic
Answer: C. Phonological

175. Markov chains can have more than one invariant distribution.
A. True
B. False
Answer: A. True
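
A quick way to see why this is true: in a chain whose states never communicate, such as one with the identity matrix as its transition matrix, every distribution is left unchanged by a step. The sketch below checks this for two different distributions; the matrix and the `step` helper are illustrative assumptions.

```python
def step(dist, P):
    """One step of a Markov chain: returns the row vector dist times P."""
    n = len(P)
    return [sum(dist[i] * P[i][j] for i in range(n)) for j in range(n)]


# Two absorbing states that never communicate (identity transition matrix).
P = [
    [1.0, 0.0],
    [0.0, 1.0],
]

pi_a = [1.0, 0.0]
pi_b = [0.3, 0.7]

# Both distributions are unchanged by a step, so both are invariant.
print(step(pi_a, P), step(pi_b, P))
```

Uniqueness of the invariant distribution needs extra conditions, for example irreducibility of a finite chain.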

176. Which of the following is not correct with respect to levels of semantic analysis?
A. Word level
B. Character level
C. Sentence level
D. Utterance level
Answer: B. Character level

177. WordNet is a ___ database
A. Symbol
B. Word
C. Lexical
D. Annotation
Answer: C. Lexical

178. Rule-based POS taggers do not possess which of the following properties?
A. The rules in rule-based POS tagging are built automatically
B. These taggers are knowledge-driven taggers
C. These taggers consist of many handwritten rules
D. The information is coded in the form of rules
Answer: A. The rules in rule-based POS tagging are built automatically

179. Two words with very closely related meanings
A. Antonyms
B. Homonyms
C. Synonyms
D. Hyponyms
Answer: C. Synonyms

180. SVD, PCA
A. What does a co-occurrence matrix look like?
B. Types of Prediction embeddings
C. What are some examples of co-occurrence matrix technologies?
D. Word2vec is not a single algorithm but a combination of two techniques
Answer: D. Word2vec is not a single algorithm but a combination of two techniques

181. Which one of the following is a morpheme of the word "unbelievable"?
A. un
B. unbe
C. evable
D. able
Answer: A. un

182. Video summarization extracts the most important frames from the ___ content
A. Video
B. Image
C. Sound
D. Document
Answer: A. Video

183. Every word is a row, every word is a column, the number is the number of times the two words occur in the same context
A. CountVector could also be a
B. What are some examples of co-occurrence matrix technologies?
C. Two types of word embeddings
D. What does a co-occurrence matrix look like?
Answer: D. What does a co-occurrence matrix look like?
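
The description above can be sketched as follows, under the assumption that "same context" means "within one token to the left or right in the same sentence". The two-sentence corpus, the window size, and the `cooccurrence_counts` helper are all made up for illustration.

```python
from collections import defaultdict


def cooccurrence_counts(sentences, window=1):
    """Count how often each ordered word pair appears within `window` tokens."""
    counts = defaultdict(int)
    for sentence in sentences:
        tokens = sentence.lower().split()
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    counts[(word, tokens[j])] += 1
    return counts


corpus = ["I like NLP", "I like deep learning"]  # hypothetical toy corpus
counts = cooccurrence_counts(corpus, window=1)

# Every word is a row and a column; each cell holds the pair's count.
vocab = sorted({w for pair in counts for w in pair})
matrix = [[counts.get((r, c), 0) for c in vocab] for r in vocab]

print(vocab)
for row in matrix:
    print(row)
```

With a symmetric window the resulting matrix is symmetric, which is why factorization methods such as SVD are commonly applied to it.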

184. Discrete representation, aka integerized words
A. pre-cursor to words as vectors
B. distributed word representations
C. advantages of distributed word representations
D. notion of context as meaning
Answer: A. pre-cursor to words as vectors
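
"Integerized words" simply means replacing each distinct word with an integer id. A minimal sketch follows, assuming ids are assigned in order of first appearance; the `build_vocab` helper and the sentence are hypothetical.

```python
def build_vocab(tokens):
    """Assign each distinct token an integer id in order of first appearance."""
    vocab = {}
    for tok in tokens:
        if tok not in vocab:
            vocab[tok] = len(vocab)
    return vocab


tokens = "the cat sat on the mat".split()
vocab = build_vocab(tokens)
ids = [vocab[t] for t in tokens]
print(ids)
```

The ids carry no notion of similarity (id 1 is no closer to id 2 than to id 4), which is the limitation that vector representations address.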

185. Which of the following techniques can be used for keyword normalization in NLP, the process of converting a keyword into its base form?
A. Lemmatization
B. Soundex
C. Cosine Similarity
D. N-grams
Answer: A. Lemmatization

186. What is a corpus?
A. A corpus is a collection of parameters and arguments
B. A corpus is a large collection of written text belonging to a particular language
C. It refers to a situation where the context of a phrase gives it multiple interpretations
D. All of the others
Answer: B. A corpus is a large collection of written text belonging to a particular language

187. Which of the following instances can the regular expression "\b(one|two|three)\b" recognize?
A. "one"
B. "onetwo"
C. "TWO"
D. "Onetwothree"
Answer: A. "one"
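
Assuming the pattern in this card is the word-boundary expression `\b(one|two|three)\b` (the extracted text renders the backslashes ambiguously), the four candidate strings can be checked directly with Python's standard `re` module. Matching is case-sensitive by default.

```python
import re

# Assumed reading of the card's pattern: one of three whole words.
PATTERN = re.compile(r"\b(one|two|three)\b")

candidates = ["one", "onetwo", "TWO", "Onetwothree"]
matches = {text: bool(PATTERN.search(text)) for text in candidates}

for text, matched in matches.items():
    print(f"{text!r}: {matched}")
```

Under that reading only "one" matches: the `\b` anchors require a word boundary on both sides, so an alternative embedded in a longer run of letters is rejected, and "TWO" fails on case.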

188. A ____, often called a pattern, specifies a set of strings required for a particular purpose. A simple way to specify a finite set of strings is to list its elements or members.
A. Regular Expression
B. Non regular Expression
C. Finite Automata
D. None of the others
Answer: A. Regular Expression

189. ____ are created by removing the suffixes or prefixes used with a word. This process is called ____.
A. Stems, Stemming
B. Lemma, Lemmatization
C. Corpus
D. Suffix stripping
Answer: A. Stems, Stemming

190. The words are transformed into a structure that shows how the words are related to each other. This process is called _____
A. Syntax Analysis
B. Semantic Analysis
C. Lexical Analysis
D. Pragmatic Analysis
Answer: A. Syntax Analysis

191. Which of the following is an advantage of normalizing a word?
A. It guarantees the word to be inconsistent
B. It helps in reducing the randomness in the word
C. It increases the false negatives
D. It increases the dimensionality of the input
Answer: B. It helps in reducing the randomness in the word

192. Which of the following measurements are used to evaluate the quality of entity recognition?
A. Precision
B. Recall
C. F-measure
D. All of the others
Answer: D. All of the others
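
All three measures can be computed from the sets of gold and predicted entities. The sketch below assumes exact-match scoring, where an entity counts as correct only if both its text and its label match; the entities and the `precision_recall_f1` helper are made up for illustration.

```python
def precision_recall_f1(gold, predicted):
    """Exact-match precision, recall and F1 over sets of (text, label) pairs."""
    gold, predicted = set(gold), set(predicted)
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical annotations: one correct entity, one wrong label, one missed.
gold = {("Paris", "LOC"), ("Alice", "PER"), ("Google", "ORG")}
predicted = {("Paris", "LOC"), ("Alice", "ORG")}

precision, recall, f1 = precision_recall_f1(gold, predicted)
print(precision, recall, f1)
```

Here precision is 1/2 (one of two predictions is right) and recall is 1/3 (one of three gold entities is found); F1 is their harmonic mean.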

193. When training a language model, if we use an overly narrow corpus, the probabilities
A. Don't reflect the task
B. Reflect all possible wordings
C. Reflect intuition
D. Don't generalize
Answer: D. Don't generalize

194. Which of the following is not a learning approach for a QA system?
A. Unsupervised approach
B. Supervised approach
C. Knowledge based approach
D. Sense disambiguation approach
Answer: D. Sense disambiguation approach

195. One of the main challenges of NLP is
A. Handling Tokenization
B. Handling Ambiguity of Sentences
C. Handling POS-Tagging
D. All of the others
Answer: D. All of the others

196. History of Natural Language Processing does not include
A. Automata Theory
B. Compression Algorithms
C. CFG by Chomsky
D. Predicate and First Order Logic
Answer: B. Compression Algorithms

197. Regular expressions are combinations of simple units as given in the options; select the incorrect unit.
A. Character or string
B. Concatenation
C. Kleene star
D. Conjunction
Answer: D. Conjunction

198. Which of the below are NLP use cases?
A. Detecting objects from an image
B. Facial Recognition
C. Speech Biometric
D. Text Summarization
Answer: D. Text Summarization
