NLP MCQ 153 Out of 427 - Part One
NLP MCQ 153 Out of 427 - Part One
Details Contents
1.
Introduction
History of NLP, Generic NLP system, levels of NLP , Knowledge in
language processing , Ambiguity in Natural language , stages in
NLP, challenges of NLP ,Applications of NLP
2.
Word Level Analysis
Morphology analysis –survey of English Morphology, Inflectional
morphology & Derivational morphology, Lemmatization, Regular
expression, finite automata, finite state transducers (FST)
,Morphological parsing with FST , Lexicon free FST Porter
stemmer. N –Grams- N-gram language model, N-gram for
spelling correction.
3.
Syntax analysis
Part-Of-Speech tagging( POS)- Tag set for English ( Penn
Treebank ) , Rule based POS tagging, Stochastic POS tagging,
Issues –Multiple tags & words, Unknown words. Introduction to
CFG, Sequence labeling: Hidden Markov Model (HMM), Maximum
Entropy, and Conditional Random Field (CRF).
4.
Semantic Analysis
Lexical Semantics, Attachment for fragment of English-
sentences, noun phrases, Verb phrases, prepositional phrases,
Relations among lexemes & their senses –Homonymy, Polysemy,
Synonymy, Hyponymy, WordNet, Robust Word Sense
Disambiguation (WSD) ,Dictionary based approach
5.
Pragmatics
Discourse –reference resolution, reference phenomenon ,
syntactic & semantic constraints on co reference
6.
Applications
(preferably for
Indian regional
languages)
Machine translation, Information retrieval, Question answers
system, categorization, summarization, sentiment analysis,
Named Entity Recognition
______________________________________________________----------------------------------------------
NLTK stands for _____.
a. Artificial Intelligence
b. Machine Learning
c. Deep Learning
d. None of Above
c. recognizing the sentiment among several online posts and comments using NLP.
d. recognizing the sentiment among several online posts and comments using NLTK.
4. Examples of NLP?
b. Mid 1930
c. Mid 2000
d. Before 1930
a. Alan Turing
b. Venessa Turing
8. NLP breaks down language into shorter, more basic pieces, called _____.
a. Parameters
b. Tokens.
c. None.
d. Arguments.
b. This component transfers linear sequences of words into structures. It shows how the
words
are associated with each other. And focuses only on the literal meaning of words, phrases,
and sentences.
c. deals with the overall communicative and social content.
d. None of Above
d. It deals with the overall communicative and social content and its effect on
interpretation.
It means abstracting or deriving the meaningful use of language in situations.
13. What is Syntax Analysis?
a. This only abstracts the dictionary meaning or the real meaning from the given context.
b. This component transfers linear sequences of words into structures. It shows how the words
are
associated with each other.
c. It deals with the overall communicative and social content and its effect on interpretation. It
means abstracting or deriving the meaningful use of language in situations.
d. It focuses about the proper ordering of words which can affect its meaning. This
involves
analysis of the words in a sentence by following the grammatical structure of the
sentence.
The words are transformed into the structure to show how the words are related to each
other.
a. It means a sense of the context. The meaning of any single sentence which depends
upon those sentences. It also considers the meaning of the following sentence.
b. It depicts analyzing, identifying and description of the structure of words. It includes dividing
a
text into paragraphs, words and the sentences.
c. This component transfers linear sequences of words into structures. It shows how the words
are
associated with each other. And focuses only on the literal meaning of words, phrases,
and
sentences.
d. All of Above.
a. The interconnection of networks having simple processing units with knowledge stored
in weights to identify connections between units.
b. It performs extensive analysis of linguistic phenomena through explicit representation of
facts
about language and well-understood knowledge representation schemas and associated
algorithms.
c. It harnesses various mathematical techniques and often uses large text corpora to
develop
approximately generalized models of linguistic phenomena based on actual examples.
d. None of Above
c. Neural Network
d. All of above
d. Both a & c
b. A corpus is a large and structured set of machine-readable texts that have been
produced in a natural communicative setting.
c. It refers to a situation where the context of a phrase gives it multiple interpretation
29. ______________ depicts analyzing, identifying and description of the structure of words.
a. Tokens
b. Semantic Analysis
c. Symbolic Analysis
30. _____________ includes dividing a text into paragraphs, words and the sentences.
a. Semantic Analysis
b. Tokens
c. Lexical Analysis
d. Discourse
32. _______________ shows how the words are associated with each other.
a. Semantic Analysis
b. Tokens
c. Lexical Analysis
d. Discourse
33. _______________ focuses only on the literal meaning of words, phrases, and sentences.
a. Morphological and Lexical Analysis
b. Semantic Analysis
c. Quantifiers
d. None of the above.
34. ________ deals with the overall communicative and social content and its effect on
interpretation.
a. Tokens
b. Pragmatic Analysis
c. Symbolic Analysis
d. Morphical And Lexical Analysis
35. ____ means abstracting or deriving the meaningful use of language in situations.
a. Semantic Analysis
b. Tokens
c. Lexical Analysis
d. Pragmatic Analysis
36 .It focuses about the proper ordering of words which can affect its meaning.
a. Syntax Analysis
b. Semantic Analysis
c. Lexical Analysis
d. Pragmatic Analysis
37. This involves analysis of the words in a sentence by following the grammatical
structure of the
sentence.
a. Tokens
b. Lexical Analysis
c. Discourse
d. Syntax Analysis
38. The words are transformed into the structure to show how the words are related to each
other. This
process is called as ____________
a. Syntax Analysis
b. Semantic Analysis
c. Lexical Analysis
d. Pragmatic Analysis
39. ____means a sense of the context. The meaning of any single sentence which depends upon
those sentences. It also considers the meaning of the following sentence.
a. Discourse
b. Semantic Analysis
c. Lexical Analysis
d. Pragmatic Analysis
40. Machine Learning & Statistical Inference are the popular methods for implementing___.
a. Lexical Analysis
b. Tokens and Quantifiers
c. NLP
d. Statistical Approach
43. The interconnection of networks having simple processing units with knowledge stored in
weights
to identify connections between units.
a. Connectionist Approach
b. Neural Networks
c. Hybrid approach
d. Both a & b.
b. Symbolic Approach.
c. Corpus based.
d. Hybrid.
a. Statistical Approach.
b. Rule Based Approach
c. CNN
d. K- nearest
47. Ambiguity of a single word when it can be used as a verb, noun or an adjective is called as
________
a. Lexical Ambiguity
b. Pragmatic Ambiguity
c. Semantic Ambiguity
d. None of Thes
48. Operators and Quantifiers are mostly responsible for ___________
a. Scope Ambiguity
b. Pragmatic Ambiguity
c. Semantic Ambiguity
d. None of These
49. when the meaning of the words themselves can be misinterpreted then ______ ambiguity
occurs.
a. Scope Ambiguity
b. Pragmatic Ambiguity
c. Semantic Ambiguity
d. None of These
50. What refers to a situation where the context of a phrase gives it multiple interpretations?
a. Lexical Ambiguity
b. Scope Ambiguity
c. Semantic Ambiguity
d. Pragmatic Ambiguity
52. What are the components of Morphological Analyzer acc., to Shrivastava et. al 2005?
a. The recognition engine, identifying suffixes, and finding a stem within the input
word algorithms
b. Morpheme lexeme, Set of rules governing the spelling and composition of
morphologically
complex words & Decision algorithm
c. The recognition engine, set of rules & Algorithm.
d. All of the above.
53. Morphological analyzer is composed of the following Three parts according to Kumar, 2013?
a. The recognition engine, identifying suffixes, and finding a stem within the input word
algorithms
b. FSA
c. DAWG
d. Stemmer Algorithm
a. Morphophonemic
b. Morphotacties
c. Both a & b
d. None of the above
c. Kimmo Koskenniemi
d. Chomsky
60. The two-level morphology model consists of two representations and one rule. What are
they?
d. Both a & c
62. _____is an advanced version of FSA(finite state automata)and is used to represent the
lexicon computationally.
a. FST
b. FSA
c. DAWG
d. Stemmer Algorithm
63. Finite State Transducer is an advanced version of _____ and is used to represent the
lexicon
computationally.
a. FST
b. FSA
c. DAWG
d. Stemmer Algorithm
65. lexicon, orthographic rules and spelling variations are the components of ______.
a. Stemmer Algorithm
b. Two-level morphology
c. FST
d. FSA
66. An FST is simply a classical finite-state automaton whose transitions are ____ , rather than
with single
symbols.
a. Labeled with tokens
c. It is used for stripping of affixes. It uses a set of rules containing a list of stems
and replacement rules.
d. All of the above
b. Porter Algorithm
c. Decision Algorithm
d. Both a & b
69. _____ used for stripping of affixes. It uses a set of rules containing a list of stems and
replacement rules.
a. Two-level morphology model
b. Chomsky Model
c. Finite State Automata
d. Stemmer
73. A ____ defines all the word forms of a given stem and also provides a feature structure with
every word
form. The _____ is efficient for inflectional rich languages.
a. Paradigm, paradigm-based approach
b. Paradigm based approach, paradigm
c. Paradigm, rule-based Approach
d. None of the above
74. The ANUSAARAKA research group has developed a language independent paradigm
based morphological compiler program for ____. This or a variant of this scheme has been used
widely in NLP.
a. European Languages
b. English Languages
c. Indian Languages
d. All of the above
75. The _______research group has developed a language independent paradigm based
morphological compiler program for Indian languages. This or a variant of this scheme has been
used widely in NLP.
a. ANUSAARAKA
b. Anusarka
c. ANUSARKAR
d. All of the above
77. _____is a change in word form. This usually means the use of endings. For eg., He works, he
worked, and he is working
a. Derivational Morphology
b. Inflectional Morphology
c. Both a & b
d. None of the above
78. ________creates new words. For example, beauty becomes beautiful. The affix -ful changes the
word from a noun to an adjective.
a. Derivational Morphology
b. Inflectional Morphology
c. Both a & b
d. None of the above
81. In _____, the words are replaced by the root words or the words with similar context.
E.g.- Walking will be replaced by Walk(walk is the root word of walking)
a. Stemming
b. Lemmatization
c. Both a & b
d. None of the above
82. ____are created by removing the suffixes or prefixes used with a word. This process is called
as____
a. Stems, Stemming
b. Lemma, Lemmatization
c. Corpus
d. Suffix stripping
84. A _____, often called a pattern, specifies a set of strings required for a particular purpose. A
simple way
to specify a finite set of strings is to list its elements or members.
a. Regular Expression
90. ____is the simplest machine to recognize patterns. It has a set of states and rules for moving
from one
state to another but it depends upon the applied input symbol.
a. Finite Automata
b. DFA
c. NFA
d. None of the above
95. In ____null (or ε) move is allowed i.e., it can move forward without reading symbols.
a. NFA
b. DFA
c. Cyclic graphs
d. None of the above
101. The ______ algorithm is a process of removing suffixes from words in English.
a. Lovins Stemmer
b. Porter Stemmer
c. paice/ Husk Stemmer
d. None of the above
108. __________ basically, means a body, and in the context of Natural Language Processing (NLP),
it means a body of text.
a. Corpus
b. WordNet
c. Lexicon
d. None of the above
109. What are the applications of FST?
a. Word Inflections. For example, pluralizing words (cat -> cats)
b. Morphological Parsing; i.e., extracting the “properties” of a word (e.g., computers -> computer
+ [Noun] + [Plural])
c. Simple Word Translation, e.g., translating US English to UK English
d. All of the above.
110. ______are general rules used when breaking a word into its stem and modifiers. An
example would be: singular English words ending with -y, when pluralized, end with -ies.
a. Morphological Rules
b. Orthographic Rules
c. Parsing
d. None of the above
112. ____ is when two words with different stems are stemmed to the same root. This is also
known as a
_____.
a. Under Stemming, False Positive
b. Over Stemming, False Positive
c. Under Stemming, False Negative
d. Over Stemming, False Negative.
113. ______ is when two words that should be stemmed to the same root are not. This is also
known as
a____.
a. Under Stemming, False Positive
b. Over Stemming, False Positive
c. Under Stemming, False Negative
d. Over Stemming, False Negative.
115. It is not possible to use backtracking at all times in the case of _____.
a. NFA
b. DFA
c. DAG
d. FA
116. The full form of DFA is____.
a.Derived Finite Automata
Deterministic Final Automata
c. Deterministic Finite Automata
d. Duplicate Final Automation
118. The generally accepted approach to morphological parsing is through the use of a _____,
which inputs words and outputs their stem and modifiers.
a. Non-deterministic Finite Automata(NFA)
b. Deterministic Finite Automata(DFA)
c. Directed Acyclic Graph(DAG)
d. Finite StateTransducer(FST)
123. To construct a regular expression from a DFA, we replace each state in the ___one by
one with a
corresponding ___.
a. NFA, regular expression
b. DFA, regular expression
c. Regular expression, DFA
d. None of the above
124. If we can eliminate ________ from an FA, then our construction of an FA from a regular
expression can
be completed.
a. Epsilon transitions
b. States
c. DFA
d. NFA
125. In the fields of computational linguistics and probability, an_____ is a contiguous sequence
of n items from a given sample of text or speech. The items can be phonemes, syllables, letters,
words or base pairs according to the application.
a. corpus
b. Epsilon Transitions
c. N-grams
d. Lemma
a. design kernels that allow machine learning algorithms such as support vector machines to
learn from string data
b. find likely candidates for the correct spelling of a misspelled word
c. improve compression in compression algorithms where a small area of data requires n-grams
of greater length
d. All of the above
127. Syntactic n-grams are n-grams defined by paths in _________ dependency or constituent
trees rather than the linear structure of the text.
a. Syntactic
128. NLP is a subfield of ____________, computer science, and artificial intelligence concerned
with the
interactions between computers and human language
a. Definitions
b. Texts
c. Contexts
d. Linguistics
139. ____________________ is the step in which an input sentence is converted into a hierarchical
structure
that corresponds to the units of meaning in the sentence.
a. Graph Processing
b. Syntactic Processing
c. Semantic Processing
d. All of the mentioned
140. Choose form the following areas where NLP can be useful.
a. Information Retrieval
b. Automatic Text Summarization
c. Automatic Question-Answering Systems
d. All of the mentioned
143. In linguistic morphology _____________ is the process for reducing inflected words to their
root form.
a. Rooting
b. Stemming
c. Text-Proofing
d. Both Rooting & Stemming
146. Given a sound clip of a person or people speaking, determine the textual
representation of the
speech.
a. Text-to-speech
b. Speech-to-text
c. Both A and B
d. None of the Above
147. Which of the following is used to mapping sentence plan into sentence structure?
a. Text planning
Sentence planning
c. Text Realization
d. None of the Above
149. In morphology, we care about the ______ that make up the sentence
a. Characters
b. Words
c. Symbols
d. Lexicons
151. In parsing stage, we focus more on the _________ of the words within a sentence
a. Sequence
b. Group
c. Relationship
d. None of the above