Machine Learning Q&A
The process of splitting a dataset into training and testing sets in a given ratio,
using scikit-learn's 'train_test_split()' function, is called:
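For reference, a minimal sketch of such a split, assuming scikit-learn is installed. The toy data, the 80/20 ratio, and the 'random_state' value are illustrative choices, not taken from the quiz.

```python
from sklearn.model_selection import train_test_split

# Toy data (illustrative values only): ten samples with one feature each.
X = [[i] for i in range(10)]
y = [0, 1] * 5

# test_size=0.2 holds out 20% of the rows for testing;
# random_state fixes the shuffle so the split is repeatable.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

print(len(X_train), len(X_test))  # 8 2
```

Only the sizes of the two sets are fixed by 'test_size'; which rows land in each set depends on the shuffle.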
It's important to clean out duplicate or unhelpful data to help your classifiers solve
your problem.
a. true
b. false
A confusion matrix is also known as a:
a. error matrix
b. truth matrix
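For readers who have not built one, a confusion matrix can be sketched in plain Python as below. The labels are hypothetical, and the layout (rows for actual classes, columns for predicted classes) is one common convention that is assumed here.

```python
from collections import Counter

# Hypothetical labels for a binary classifier (1 = spam, 0 = not spam).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

# Count how often each (actual, predicted) pair occurs.
counts = Counter(zip(y_true, y_pred))

# Rows are actual classes, columns are predicted classes, both ordered [0, 1].
labels = [0, 1]
matrix = [[counts[(actual, predicted)] for predicted in labels] for actual in labels]

for row in matrix:
    print(row)
# [3, 1]
# [1, 3]
```

The diagonal holds the correct predictions; the off-diagonal cells hold the misclassifications.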
What question can classification help answer?
a. Is this email spam or not?
b. Can pigs fly?
What is the first step to using Classification techniques?
a. creating classes of a dataset
b. cleaning and balancing your data
What is a multiclass question?
a. the task of classifying data points into multiple classes
b. the task of classifying data points into one of several classes
N-grams refer to
a. A word can be split into sequences of characters of a set length
b. A text can be split into sequences of words of a set length
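A minimal sketch of word-level n-grams, assuming whitespace is an acceptable word boundary. The function name 'word_ngrams' is hypothetical and not part of any library.

```python
def word_ngrams(text, n):
    """Return every run of n consecutive words in text, in order."""
    words = text.split()
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]


bigrams = word_ngrams("the quick brown fox jumps", 2)
print(bigrams)
# [('the', 'quick'), ('quick', 'brown'), ('brown', 'fox'), ('fox', 'jumps')]
```

A text of k words yields k - n + 1 n-grams, and none when the text is shorter than n words.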
A conversational bot gives output based on
a. Analyzing the input and using machine intelligence
b. Randomly choosing predefined choices
Eliza was an early bot that acted as a computer
a. doctor
b. therapist
Joseph Weizenbaum invented the bot
a. Eliza
b. Elisha
Sentiment analysis:
a. analyzes a phrase for positivity or negativity
b. analyzes a phrase for sentimentality
Embeddings
a. embeds words into phrases
b. converts text data to numerical form so that words can cluster
Build a dictionary of how often words reoccur using:
a. Word and Phrase Dictionary
b. Word and Phrase Frequencies
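Counting how often words recur can be sketched with the standard library alone. The sample sentence is made up, and splitting on whitespace is a simplifying assumption.

```python
from collections import Counter

# Made-up sample text; real input would normally be tokenized more carefully.
text = "the cat sat on the mat and the dog sat too"

# Map each word to the number of times it appears.
frequencies = Counter(text.split())

print(frequencies.most_common(2))  # [('the', 3), ('sat', 2)]
```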
Tokenization
a. Splits text by means of punctuation
b. Splits text into separate tokens (words)
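A rough sketch of a tokenizer using a regular expression. The function name 'tokenize' and the rule of keeping only letters, digits, and apostrophes are assumptions made for illustration; NLP libraries apply more careful rules.

```python
import re


def tokenize(text):
    """Split text into lowercase word tokens, dropping punctuation."""
    return re.findall(r"[a-z0-9']+", text.lower())


tokens = tokenize("Hello, world! Isn't NLP fun?")
print(tokens)  # ['hello', 'world', "isn't", 'nlp', 'fun']
```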
What does NLP stand for in these lessons?
a. natural language processing
b. neural language processing
Alan Turing's 'Turing Test' tried to determine if a computer was
a. indistinguishable from a human
b. thinking
Your data's ___ will impact the quality of your ML model
a. quantity
b. shape
Which ML technique can be used to detect credit card fraud?
a. Regression
b. Clustering