Machine Learning Lab Viva QA
Data analysis is the process of inspecting, cleaning, transforming, and modeling data to
discover useful information and support decision-making.
4. What is statistics?
Descriptive statistics summarizes and describes the features of a dataset (e.g., mean, median, mode).
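As a minimal sketch of the three measures named above, the snippet below uses Python's built-in statistics module. The list of marks is made-up sample data, not taken from any lab dataset.

```python
import statistics

# Hypothetical sample: marks of seven students.
marks = [72, 85, 85, 90, 64, 78, 85]

mean_mark = statistics.mean(marks)      # arithmetic average
median_mark = statistics.median(marks)  # middle value after sorting
mode_mark = statistics.mode(marks)      # most frequent value

print(mean_mark, median_mark, mode_mark)
```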
Interquartile Range (IQR) = Q3 − Q1; it measures the spread of the middle 50% of the data.
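The formula can be checked on a small sample. This sketch assumes the "inclusive" quartile convention of statistics.quantiles; other conventions can give slightly different Q1 and Q3 for the same data. The data values are illustrative only.

```python
import statistics

# Hypothetical sample, already sorted for readability.
data = [2, 4, 4, 5, 7, 9, 10, 12, 15]

# n=4 splits the data into quartiles and returns the cut points Q1, Q2, Q3.
q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")
iqr = q3 - q1

print(q1, q3, iqr)  # 4.0 10.0 6.0
```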
Exploratory Data Analysis (EDA) is the initial process of summarizing and visualizing data to find patterns, anomalies, and insights before modeling.
Outliers are data points that lie far from the other observations. They can be detected
using a box plot, the Z-score, or the IQR method.
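The Z-score and IQR checks can be sketched as follows. The sample values, the Z-score cutoff of 2.0, and the conventional 1.5 × IQR fences are assumptions chosen for illustration. In a sample this small a single extreme value cannot reach a Z-score of 3, so a lower cutoff is used here.

```python
import statistics

# Hypothetical readings with one obviously extreme value.
values = [10, 12, 11, 13, 12, 11, 10, 12, 95]

# Z-score method: flag points far from the mean in units of standard deviation.
mean = statistics.mean(values)
std = statistics.pstdev(values)
z_threshold = 2.0  # assumed cutoff; 3.0 is common for larger samples
z_outliers = [v for v in values if abs(v - mean) / std > z_threshold]

# IQR method: flag points outside the fences Q1 - 1.5*IQR and Q3 + 1.5*IQR.
q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1
lower, upper = q1 - 1.5 * iqr, q3 + 1.5 * iqr
iqr_outliers = [v for v in values if v < lower or v > upper]

print(z_outliers, iqr_outliers)  # [95] [95]
```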
Principal Component Analysis (PCA) transforms the features into a smaller set of
uncorrelated components that retain most of the variance.
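To make the idea concrete, here is a minimal pure-Python sketch that reduces two features to one principal component. It uses the closed-form eigen-decomposition of a 2x2 covariance matrix. The function name pca_2d_to_1d and the sample points are hypothetical, and a lab program would more likely rely on a numerical library.

```python
import math

def pca_2d_to_1d(points):
    """Project 2-D points onto their first principal component."""
    n = len(points)
    mean_x = sum(p[0] for p in points) / n
    mean_y = sum(p[1] for p in points) / n
    centered = [(x - mean_x, y - mean_y) for x, y in points]

    # Sample covariance matrix [[sxx, sxy], [sxy, syy]].
    sxx = sum(x * x for x, _ in centered) / (n - 1)
    syy = sum(y * y for _, y in centered) / (n - 1)
    sxy = sum(x * y for x, y in centered) / (n - 1)

    # Eigenvalues of a symmetric 2x2 matrix in closed form.
    trace = sxx + syy
    det = sxx * syy - sxy * sxy
    gap = math.sqrt(max(trace * trace / 4 - det, 0.0))
    lam1, lam2 = trace / 2 + gap, trace / 2 - gap

    # Unit eigenvector belonging to the largest eigenvalue.
    if sxy != 0:
        vx, vy = lam1 - syy, sxy
    else:
        vx, vy = (1.0, 0.0) if sxx >= syy else (0.0, 1.0)
    norm = math.hypot(vx, vy)
    vx, vy = vx / norm, vy / norm

    # One score per point, plus the share of variance the component keeps.
    scores = [x * vx + y * vy for x, y in centered]
    explained = lam1 / (lam1 + lam2)
    return scores, (vx, vy), explained

# Hypothetical data in which the second feature is roughly twice the first.
points = [(1, 2.1), (2, 3.9), (3, 6.2), (4, 7.8), (5, 10.1)]
scores, direction, explained = pca_2d_to_1d(points)
print(direction, explained)
```

Because the two features are almost perfectly correlated, a single component keeps nearly all of the variance, which is the situation in which PCA is most useful.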
20. Give examples for univariate, multivariate and bivariate data analysis.
A heatmap is a graphical representation that shows values using colors, often used for correlation matrices.
Find-S is a concept learning algorithm that finds the most specific hypothesis consistent
with the positive training examples.
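The algorithm described above is commonly known as Find-S. The sketch below starts from the first positive example and generalizes an attribute to "?" whenever a later positive example disagrees. The weather-style examples and the "yes"/"no" labels are assumed sample data.

```python
def find_s(examples):
    """Return the most specific hypothesis covering all positive examples."""
    hypothesis = None
    for attributes, label in examples:
        if label != "yes":
            continue  # Find-S ignores negative examples
        if hypothesis is None:
            hypothesis = list(attributes)  # start with the first positive example
            continue
        for i, value in enumerate(attributes):
            if hypothesis[i] != value:
                hypothesis[i] = "?"  # generalize the mismatching attribute
    return hypothesis

# Hypothetical training set: (sky, air temp, humidity, wind, water, forecast).
examples = [
    (("Sunny", "Warm", "Normal", "Strong", "Warm", "Same"), "yes"),
    (("Sunny", "Warm", "High", "Strong", "Warm", "Same"), "yes"),
    (("Rainy", "Cold", "High", "Strong", "Warm", "Change"), "no"),
    (("Sunny", "Warm", "High", "Strong", "Cool", "Change"), "yes"),
]

hypothesis = find_s(examples)
print(hypothesis)  # ['Sunny', 'Warm', '?', 'Strong', '?', '?']
```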
Non-parametric algorithms do not assume that the data follows a specific distribution (e.g., KNN).
30. How do you compute the distance between data points?
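Two widely used choices are Euclidean and Manhattan distance. The sketch below shows both; the helper names and the sample points are illustrative assumptions, and other metrics may suit other data.

```python
import math

def euclidean(p, q):
    """Straight-line distance: square root of the sum of squared differences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def manhattan(p, q):
    """City-block distance: sum of absolute differences."""
    return sum(abs(a - b) for a, b in zip(p, q))

p, q = (0, 0), (3, 4)
print(euclidean(p, q))  # 5.0
print(manhattan(p, q))  # 7
```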
A decision tree is a tree-like model in which internal nodes test features, branches represent feature values, and leaves represent outcomes.
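The structure can be illustrated with a small hand-built tree stored as nested dictionaries. The tree, its feature names, and the sample are hypothetical. The sketch shows only how a finished tree is traversed, not how one is learned from data.

```python
# Internal nodes are dicts naming a feature; leaves are plain outcome strings.
tree = {
    "feature": "outlook",
    "branches": {
        "sunny": {"feature": "humidity", "branches": {"high": "no", "normal": "yes"}},
        "overcast": "yes",
        "rainy": {"feature": "wind", "branches": {"strong": "no", "weak": "yes"}},
    },
}

def predict(node, sample):
    """Follow the branch matching each tested feature until a leaf is reached."""
    while isinstance(node, dict):
        node = node["branches"][sample[node["feature"]]]
    return node

sample = {"outlook": "sunny", "humidity": "normal", "wind": "weak"}
print(predict(tree, sample))  # yes
```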
Locally weighted regression is a regression in which training points close to the query point are weighted more heavily.
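A minimal one-feature sketch of this idea follows. It assumes a Gaussian kernel with bandwidth tau and fits a weighted straight line around each query point. The function name lwr_predict, the value of tau, and the sample data are illustrative assumptions.

```python
import math

def lwr_predict(xs, ys, query, tau=0.5):
    """Predict y at `query` with a weighted least-squares line fitted around it."""
    # Gaussian kernel: weights decay as training points move away from the query.
    weights = [math.exp(-((x - query) ** 2) / (2 * tau ** 2)) for x in xs]
    total = sum(weights)

    # Weighted means of x and y.
    mean_x = sum(w * x for w, x in zip(weights, xs)) / total
    mean_y = sum(w * y for w, y in zip(weights, ys)) / total

    # Weighted least-squares slope and intercept.
    sxx = sum(w * (x - mean_x) ** 2 for w, x in zip(weights, xs))
    sxy = sum(w * (x - mean_x) * (y - mean_y) for w, x, y in zip(weights, xs, ys))
    slope = sxy / sxx if sxx > 0 else 0.0
    intercept = mean_y - slope * mean_x
    return intercept + slope * query

# Hypothetical curved data, roughly y = x squared with a little noise.
xs = [0, 1, 2, 3, 4, 5]
ys = [0.1, 0.9, 4.2, 8.8, 16.1, 25.3]
print(lwr_predict(xs, ys, query=2.5))
```

A smaller tau makes the fit follow nearby points more closely, while a larger tau moves the result toward ordinary linear regression.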
The elbow method is used with K-Means to choose the number of clusters: plot the cost (within-cluster sum of squares) against k and pick the k where the curve bends.
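The cost-versus-k values behind such a plot can be sketched in pure Python. This is a simplified K-Means that assumes a deterministic farthest-point initialization, chosen here only to make the output reproducible, and a fixed number of iterations. The helper names and the three-cluster sample data are hypothetical.

```python
def squared_distance(p, q):
    return sum((a - b) ** 2 for a, b in zip(p, q))

def farthest_point_init(points, k):
    """Assumed init: start at the first point, then keep adding the farthest point."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(squared_distance(p, c) for c in centroids))
        )
    return centroids

def kmeans_cost(points, k, iterations=20):
    """Run a basic K-Means and return the within-cluster sum of squares."""
    centroids = farthest_point_init(points, k)
    for _ in range(iterations):
        # Assignment step: attach every point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: squared_distance(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to its cluster mean (keep it if empty).
        centroids = [
            tuple(sum(col) / len(cluster) for col in zip(*cluster)) if cluster else centroids[i]
            for i, cluster in enumerate(clusters)
        ]
    return sum(min(squared_distance(p, c) for c in centroids) for p in points)

# Hypothetical data: three well-separated groups of four points each.
points = [
    (0, 0), (0, 1), (1, 0), (1, 1),
    (10, 0), (10, 1), (11, 0), (11, 1),
    (5, 10), (5, 11), (6, 10), (6, 11),
]

costs = {k: kmeans_cost(points, k) for k in range(1, 6)}
for k, cost in costs.items():
    print(k, round(cost, 2))
```

On this data the cost falls sharply up to k = 3 and only slightly afterwards, so the bend of the curve sits at k = 3.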