Machine Learning Lab Viva QA

The document contains a list of machine learning lab viva questions and answers, covering fundamental concepts such as data analysis, statistics, machine learning, and data visualization techniques. It differentiates between various statistical methods, types of data, and machine learning algorithms, including supervised and unsupervised learning. Additionally, it discusses practical applications of techniques like decision trees, clustering, and regression analysis.

Uploaded by

skshyam106

Machine Learning Lab Viva Questions and Answers (BSCL606)

1. What is data analysis?

The process of inspecting, cleaning, transforming, and modeling data to discover useful
information and support decision-making.

2. Why do we need to visualize data?

To understand patterns, trends, and outliers in data quickly and effectively.

3. Mention the differences between box plots and histograms.

- Box plot shows distribution using quartiles; highlights outliers.

- Histogram shows frequency of data intervals (bins).

4. What is statistics?

The science of collecting, analyzing, interpreting, and presenting data.

5. What is descriptive statistics?

It summarizes and describes the features of a dataset (e.g., mean, median, mode).

6. What is inferential statistics?

It makes predictions or inferences about a population based on a sample.

7. Differentiate between machine learning and AI.

- AI is the broader concept of machines mimicking human behavior.

- ML is a subset where machines learn from data.

8. What are the different types of data?

- Numerical (continuous, discrete)

- Categorical (nominal, ordinal)
- Time series
- Text

9. Illustrate the use of histograms.

Used to visualize frequency distribution of continuous data, e.g., student scores.

10. Mention the use of box plots.

To detect spread and outliers in a dataset using min, Q1, median, Q3, and max.

11. What is IQR?

Interquartile Range = Q3 − Q1; it shows the middle 50% spread of the data.
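The IQR formula can be sketched with Python's standard library. The sample scores are made up for illustration, and the "inclusive" quartile method is only one of several conventions, so other tools may report slightly different quartiles.

```python
import statistics

def iqr(values):
    """Return (Q1, Q3, IQR) using the 'inclusive' quartile convention."""
    q1, _median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return q1, q3, q3 - q1

scores = [1, 2, 3, 4, 5, 6, 7, 8, 9]  # hypothetical sample data
q1, q3, spread = iqr(scores)
print(q1, q3, spread)  # 3.0 7.0 4.0
```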

12. How do you handle missing values?

By removal, mean/median imputation, forward/backward fill, or using ML models.
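Two of the strategies listed above, mean imputation and forward fill, can be sketched in plain Python. Missing entries are assumed to be represented by `None`, and the readings are invented for the example.

```python
import statistics

def impute_mean(values):
    """Replace None entries with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    mean = statistics.fmean(observed)
    return [mean if v is None else v for v in values]

def forward_fill(values):
    """Replace each None with the most recent observed value.

    Leading None entries have nothing to copy from and stay None.
    """
    filled, last = [], None
    for v in values:
        if v is not None:
            last = v
        filled.append(last)
    return filled

readings = [10.0, None, 14.0, None, None, 18.0]  # hypothetical data
print(impute_mean(readings))   # [10.0, 14.0, 14.0, 14.0, 14.0, 18.0]
print(forward_fill(readings))  # [10.0, 10.0, 14.0, 14.0, 14.0, 18.0]
```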

13. What is EDA?

Exploratory Data Analysis is the initial step of summarizing and visualizing a dataset to uncover patterns, spot anomalies, and check assumptions before modeling.

14. Define outliers and how to detect them.

Outliers are data points that lie far from the other observations. They can be detected using a box plot, Z-scores, or the IQR method.
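A minimal sketch of the IQR and Z-score checks follows. The 1.5 multiplier and the Z-score threshold are common conventions rather than fixed rules, and the data is made up. With only eight points a Z-score can never exceed about 2.65, so the demo uses a threshold of 2.5 instead of the usual 3.

```python
import statistics

def iqr_outliers(values, k=1.5):
    """Flag points outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    spread = q3 - q1
    low, high = q1 - k * spread, q3 + k * spread
    return [v for v in values if v < low or v > high]

def zscore_outliers(values, threshold=3.0):
    """Flag points whose |z| = |x - mean| / std exceeds the threshold."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)
    if std == 0:
        return []  # all values identical, nothing stands out
    return [v for v in values if abs(v - mean) / std > threshold]

data = [10, 12, 11, 13, 12, 11, 10, 95]  # hypothetical data with one extreme value
print(iqr_outliers(data))                    # [95]
print(zscore_outliers(data, threshold=2.5))  # [95]
```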

15. What is dimensionality reduction?

Reducing the number of input variables/features to simplify models (e.g., PCA).

16. Explain PCA.

Principal Component Analysis transforms the original features into a smaller set of uncorrelated components that retain most of the variance.
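For two features, the first principal component can be worked out by hand from the 2x2 covariance matrix. The sketch below is illustrative only: `pca_2d` is a hypothetical helper, it assumes the points are not all identical, and real projects would normally use a library routine that handles any number of features.

```python
import math

def pca_2d(xs, ys):
    """Return (direction, explained_ratio, scores) for the first principal component."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    # Entries of the sample covariance matrix [[a, b], [b, c]].
    a = sum((x - mx) ** 2 for x in xs) / (n - 1)
    c = sum((y - my) ** 2 for y in ys) / (n - 1)
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (n - 1)
    # Closed-form eigenvalues of a symmetric 2x2 matrix.
    mid = (a + c) / 2
    radius = math.sqrt(((a - c) / 2) ** 2 + b ** 2)
    lam1, lam2 = mid + radius, mid - radius
    # Eigenvector belonging to the larger eigenvalue lam1.
    if b != 0:
        vx, vy = b, lam1 - a
    else:
        vx, vy = (1.0, 0.0) if a >= c else (0.0, 1.0)
    norm = math.hypot(vx, vy)
    direction = (vx / norm, vy / norm)
    # Project the centered points onto the component (2 features -> 1).
    scores = [(x - mx) * direction[0] + (y - my) * direction[1]
              for x, y in zip(xs, ys)]
    return direction, lam1 / (lam1 + lam2), scores

xs = [1, 2, 3, 4, 5]
ys = [2, 4, 6, 8, 10]  # hypothetical data lying exactly on y = 2x
direction, ratio, scores = pca_2d(xs, ys)
print(direction, ratio)
```

Because the sample points lie on a straight line, one component captures all of the variance here; noisier data would give a ratio below 1.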

17. Differentiate between supervised and unsupervised learning.

- Supervised: Labeled data (e.g., classification)

- Unsupervised: Unlabeled data (e.g., clustering)

18. Differentiate between regression and classification.

- Regression predicts continuous values.

- Classification predicts categories.

19. Give examples of binary and multi-class classification.

- Binary: Spam vs. Not spam

- Multi-class: Classifying fruits as apple, banana, or orange

20. Give examples of univariate, bivariate, and multivariate data analysis.

- Univariate: Histogram of age

- Bivariate: Scatter plot of height vs. weight
- Multivariate: Dataset with age, income, and score

21. Name the measures of central tendency.

Mean, Median, Mode.

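22. What are variance, bias, and standard deviation?

- Variance: The average squared deviation of the data from its mean; it measures spread.

- Bias: Error introduced when a model makes overly simple or wrong assumptions about the data.
- Std Deviation: Square root of the variance, expressed in the same units as the data.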
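Variance and standard deviation can be checked quickly with the standard library. The sketch uses the population formulas (divide by n) on made-up data; the sample versions (`statistics.variance`, `statistics.stdev`) divide by n - 1 instead. Bias is a property of a model rather than of a dataset, so it is not computed here.

```python
import statistics

data = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data with mean 5

variance = statistics.pvariance(data)  # mean of squared deviations
std_dev = statistics.pstdev(data)      # square root of the variance

print(variance, std_dev)  # 4 2.0
```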

23. What is a heatmap?

A graphical representation showing values using colors, often for correlation matrices.

24. Define correlation matrix.

A table showing correlation coefficients between variables.
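As a rough illustration, a correlation matrix can be built from pairwise Pearson coefficients. The column names and values below are invented, and `pearson` assumes neither column is constant (otherwise the denominator is zero).

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def correlation_matrix(columns):
    """Map every pair of column names to their correlation coefficient."""
    names = list(columns)
    return {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}

columns = {  # hypothetical dataset
    "hours": [1, 2, 3, 4, 5],
    "score": [2, 4, 6, 8, 10],   # rises with hours (positive correlation)
    "errors": [10, 8, 6, 4, 2],  # falls with hours (negative correlation)
}
matrix = correlation_matrix(columns)
for name, row in matrix.items():
    print(name, {k: round(v, 2) for k, v in row.items()})
```

The diagonal is always 1 because every variable is perfectly correlated with itself; this table is what a heatmap of correlations colors in.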

25. Mention the types of correlation.

Positive, Negative, and No correlation.

26. Explain the importance of a pair plot.

Shows relationships between variables pairwise using scatter plots.

27. Explain the Find-S algorithm and its importance.

A concept learning algorithm that finds the most specific hypothesis consistent with the positive training examples; negative examples are ignored.
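A compact sketch of Find-S is shown below. It assumes each example is a tuple of attribute values with a boolean label, uses `"?"` to mean "any value is acceptable", and runs on a small made-up weather-style dataset.

```python
def find_s(examples):
    """Return the most specific hypothesis covering all positive examples.

    examples: list of (attributes, is_positive) pairs.
    Returns None if there are no positive examples.
    """
    hypothesis = None
    for attributes, is_positive in examples:
        if not is_positive:
            continue  # Find-S ignores negative examples
        if hypothesis is None:
            hypothesis = list(attributes)  # start from the first positive example
        else:
            # Generalize only the attributes that disagree.
            hypothesis = [h if h == a else "?" for h, a in zip(hypothesis, attributes)]
    return hypothesis

training = [  # hypothetical toy data
    (("Sunny", "Warm", "Normal", "Strong", "Warm", "Same"), True),
    (("Sunny", "Warm", "High", "Strong", "Warm", "Same"), True),
    (("Rainy", "Cold", "High", "Strong", "Warm", "Change"), False),
    (("Sunny", "Warm", "High", "Strong", "Cool", "Change"), True),
]
print(find_s(training))  # ['Sunny', 'Warm', '?', 'Strong', '?', '?']
```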

28. What do you mean by non-parametric algorithms?

Algorithms that do not assume a fixed functional form or a fixed number of parameters for the data; the model's complexity can grow with the training set (e.g., KNN).

29. Explain the importance of KNN.

It is simple and effective for both classification and regression; predictions are based on the distances to the nearest neighbors.
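A minimal KNN classifier, assuming Euclidean distance and a simple majority vote, might look like this. The training points and labels are invented for the example.

```python
import math
from collections import Counter

def knn_predict(train, query, k=3):
    """Predict a label by majority vote among the k nearest training points.

    train: list of (point, label) pairs; points are equal-length tuples.
    """
    neighbors = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _point, label in neighbors)
    return votes.most_common(1)[0][0]

train = [  # hypothetical labeled points
    ((1.0, 1.0), "A"), ((1.5, 2.0), "A"), ((2.0, 1.5), "A"),
    ((6.0, 6.0), "B"), ((6.5, 7.0), "B"), ((7.0, 6.5), "B"),
]
print(knn_predict(train, (2.0, 2.0)))  # A
print(knn_predict(train, (6.0, 6.5)))  # B
```

There is no training step beyond storing the data, which is why KNN is often described as a lazy learner.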

30. How do you compute the distance between data points?

Using metrics like Euclidean, Manhattan, or Minkowski distance.
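The three metrics are closely related: Minkowski distance with p = 1 is Manhattan distance and with p = 2 is Euclidean distance. A short sketch, with made-up points:

```python
def minkowski(a, b, p):
    """Minkowski distance of order p between two equal-length points."""
    return sum(abs(x - y) ** p for x, y in zip(a, b)) ** (1 / p)

def manhattan(a, b):
    return minkowski(a, b, 1)  # sum of absolute differences

def euclidean(a, b):
    return minkowski(a, b, 2)  # straight-line distance

a, b = (0, 0), (3, 4)  # hypothetical points
print(manhattan(a, b))     # 7.0
print(euclidean(a, b))     # 5.0
print(minkowski(a, b, 3))  # between the Chebyshev (4) and Euclidean (5) values
```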

31. Explain Decision Tree.

A tree-like model in which internal nodes test a feature, branches represent the outcomes of the test, and leaves hold the predicted outcome.

32. What is information gain?

A measure of reduction in entropy/surprise from splitting a dataset.

33. What is entropy?

A measure of disorder or impurity in a dataset.
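Entropy and the information gain of a split can be computed directly from class counts. The sketch below uses base-2 logarithms (entropy in bits) and a hypothetical split of ten labels into two groups.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())

def information_gain(parent, subsets):
    """Entropy of the parent minus the size-weighted entropy of the subsets."""
    total = len(parent)
    weighted = sum(len(s) / total * entropy(s) for s in subsets)
    return entropy(parent) - weighted

parent = ["yes"] * 5 + ["no"] * 5         # evenly mixed: entropy of 1 bit
split = [["yes"] * 4 + ["no"],            # mostly "yes"
         ["yes"] + ["no"] * 4]            # mostly "no"
print(entropy(parent))
print(round(information_gain(parent, split), 3))
```

A decision tree learner that uses information gain picks the split with the largest value of this quantity at each node.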

34. What is the Gini index?

A metric to measure impurity used in decision trees (CART).
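Gini impurity is one minus the sum of squared class proportions. A minimal sketch with made-up labels:

```python
from collections import Counter

def gini(labels):
    """Gini impurity: 0 for a pure node, larger for more mixed nodes."""
    total = len(labels)
    return 1.0 - sum((c / total) ** 2 for c in Counter(labels).values())

print(gini(["a", "a", "b", "b"]))  # 0.5 (evenly mixed, two classes)
print(gini(["a", "a", "a", "a"]))  # 0.0 (pure)
```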

35. What is Bayesian learning?

Probabilistic learning based on Bayes' Theorem.
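Bayes' Theorem, P(H|E) = P(E|H) P(H) / P(E), can be illustrated with a small spam example. All probabilities below are invented for the sketch, and `posterior` is a hypothetical helper for the two-hypothesis case.

```python
def posterior(prior, likelihood, likelihood_if_not):
    """P(H | E) for a hypothesis H and its complement.

    prior:             P(H)
    likelihood:        P(E | H)
    likelihood_if_not: P(E | not H)
    """
    evidence = likelihood * prior + likelihood_if_not * (1 - prior)  # P(E)
    return likelihood * prior / evidence

# Assumed numbers: 20% of mail is spam; the word "offer" appears in
# 60% of spam and in 5% of legitimate mail.
p_spam_given_offer = posterior(prior=0.2, likelihood=0.6, likelihood_if_not=0.05)
print(round(p_spam_given_offer, 2))  # 0.75
```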

36. Differentiate between linear regression and polynomial regression.

- Linear: Straight-line fit

- Polynomial: Fits nonlinear curves using higher-order terms
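The difference shows up clearly when both models are fitted to curved data. The sketch below solves the least-squares normal equations in plain Python; `fit_polynomial`, `solve`, and `sse` are hypothetical helpers written for this example, not library functions, and the data is made up.

```python
def solve(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x

def fit_polynomial(xs, ys, degree):
    """Least-squares coefficients [c0, c1, ...] for y ~ c0 + c1*x + c2*x^2 + ..."""
    size = degree + 1
    a = [[sum(x ** (i + j) for x in xs) for j in range(size)] for i in range(size)]
    b = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(size)]
    return solve(a, b)

def sse(coeffs, xs, ys):
    """Sum of squared errors of the fitted polynomial."""
    return sum((y - sum(c * x ** k for k, c in enumerate(coeffs))) ** 2
               for x, y in zip(xs, ys))

xs = [-2, -1, 0, 1, 2]
ys = [x * x for x in xs]            # hypothetical data on the curve y = x^2
line = fit_polynomial(xs, ys, 1)    # degree 1: straight line
curve = fit_polynomial(xs, ys, 2)   # degree 2: adds the x^2 term
print(line, sse(line, xs, ys))
print(curve, sse(curve, xs, ys))
```

The straight line can only return a flat average of the curve, while the degree-2 fit recovers it with essentially zero error.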

37. Explain locally weighted regression.

A regression where points close to the query point are weighted more heavily.
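One common way to realize this idea is to fit a weighted straight line around each query point, with Gaussian weights that decay with distance. The sketch below assumes one input feature; the bandwidth `tau` and the data are illustrative choices.

```python
import math

def lwr_predict(xs, ys, query, tau=1.0):
    """Predict y at `query` with a locally weighted straight-line fit.

    Each training point gets weight exp(-(x - query)^2 / (2 * tau^2)),
    so nearby points dominate the fit.
    """
    weights = [math.exp(-((x - query) ** 2) / (2 * tau ** 2)) for x in xs]
    total = sum(weights)
    mx = sum(w * x for w, x in zip(weights, xs)) / total  # weighted mean of x
    my = sum(w * y for w, y in zip(weights, ys)) / total  # weighted mean of y
    sxx = sum(w * (x - mx) ** 2 for w, x in zip(weights, xs))
    sxy = sum(w * (x - mx) * (y - my) for w, x, y in zip(weights, xs, ys))
    slope = sxy / sxx
    return my + slope * (query - mx)

xs = list(range(11))                 # 0, 1, ..., 10
line_ys = [2 * x + 1 for x in xs]    # hypothetical linear data
curve_ys = [x * x for x in xs]       # hypothetical curved data

print(lwr_predict(xs, line_ys, 2.5))            # recovers the line: about 6.0
print(lwr_predict(xs, curve_ys, 5.0, tau=0.5))  # close to 5^2 = 25
```

A smaller `tau` makes the fit more local (and more sensitive to noise); a very large `tau` approaches ordinary linear regression.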

38. Why do we use standard scaler?

To standardize each feature to mean = 0 and standard deviation = 1, so that features on larger scales do not dominate distance-based or gradient-based algorithms.
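The transformation itself is z = (x - mean) / std. A minimal sketch for one feature, using the population standard deviation (which is also what scikit-learn's `StandardScaler` uses) and made-up values:

```python
import statistics

def standardize(values):
    """Rescale values to mean 0 and standard deviation 1."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)  # assumes the values are not all equal
    return [(v - mean) / std for v in values]

feature = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical feature column
scaled = standardize(feature)
print(scaled)  # [-1.5, -0.5, -0.5, -0.5, 0.0, 0.0, 1.0, 2.0]
```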

39. Mention the applications of polynomial regression.

Modeling growth curves, market trends, or any nonlinear patterns.

40. Mention the applications of clustering algorithms.

Customer segmentation, image compression, anomaly detection.

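41. What is the elbow technique?

A method used with K-Means to choose the number of clusters: plot the cost (within-cluster sum of squares) against k and pick the k at the "elbow", where adding more clusters stops reducing the cost sharply.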
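The cost-versus-k idea can be sketched with a tiny one-dimensional K-Means. This is a simplified illustration: the deterministic initialisation is an assumption made to keep the output reproducible (real implementations usually use random or k-means++ starts), and the data is invented with three obvious groups.

```python
def kmeans_1d(data, k, iterations=100):
    """Lloyd's algorithm on 1-D data with a simple deterministic start."""
    points = sorted(data)
    n = len(points)
    # Assumed initialisation: evenly spaced picks from the sorted data.
    centroids = [points[int((i + 0.5) * n / k)] for i in range(k)]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: abs(p - centroids[c]))
            clusters[nearest].append(p)
        # Move each centroid to the mean of its cluster (keep it if empty).
        updated = [sum(c) / len(c) if c else centroids[i]
                   for i, c in enumerate(clusters)]
        if updated == centroids:
            break
        centroids = updated
    return centroids, clusters

def wcss(data, k):
    """Within-cluster sum of squares: the cost plotted in the elbow method."""
    centroids, clusters = kmeans_1d(data, k)
    return sum((p - centroids[i]) ** 2
               for i, cluster in enumerate(clusters) for p in cluster)

data = [1, 2, 3, 11, 12, 13, 21, 22, 23]  # hypothetical data, three groups
costs = {k: wcss(data, k) for k in range(1, 5)}
for k, cost in costs.items():
    print(k, round(cost, 2))
```

The cost drops steeply up to k = 3 and barely changes afterwards, so the elbow suggests three clusters for this data.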

42. What is normalization?

Scaling data to a fixed range (like 0 to 1) to bring uniformity.
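Min-max scaling is the usual way to do this: x' = (x - min) / (max - min). A minimal sketch on made-up values, assuming the feature is not constant (otherwise max - min is zero):

```python
def min_max_scale(values, low=0.0, high=1.0):
    """Linearly rescale values so the minimum maps to `low` and the maximum to `high`."""
    smallest, largest = min(values), max(values)
    span = largest - smallest  # assumed non-zero
    return [low + (v - smallest) * (high - low) / span for v in values]

print(min_max_scale([10, 20, 30, 40, 50]))  # [0.0, 0.25, 0.5, 0.75, 1.0]
```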

43. List the applications of decision trees.

Credit scoring, medical diagnosis, loan approval, fraud detection.
