ML Lab Question Set - 21

The document outlines a series of programming tasks involving data science and machine learning using Python and R. It includes tasks such as dataset splitting, regression and classification model implementation, data preprocessing, and evaluation of model performance across various datasets. Specific algorithms mentioned include Linear Regression, Naive Bayes, Random Forest, K-Means, and others, along with techniques like PCA and feature scaling.

Uploaded by

Bala Krish

1. Write a Python program using Scikit-learn to split the Iris dataset into 70% training data and 30%
test data. Of the 150 records in total, the training set will contain 105 and the test set 45.
Predict the response for the test dataset from the four features (SepalLengthCm, SepalWidthCm,
PetalLengthCm, PetalWidthCm).
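One possible approach to question 1 is sketched below. The question does not name a classifier, so k-nearest neighbours with k=5 is an assumption, as are the `random_state` and `stratify` settings. The copy of Iris bundled with scikit-learn is used, so the column names differ from the CSV-style names in the question.

```python
from sklearn.datasets import load_iris
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Bundled Iris data: 150 rows, 4 features (sepal/petal length and width in cm).
iris = load_iris()
X, y = iris.data, iris.target

# test_size=0.3 yields 105 training rows and 45 test rows.
# random_state and stratify are illustrative choices, not part of the task.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42, stratify=y
)

# Assumed classifier: k-nearest neighbours with k=5.
model = KNeighborsClassifier(n_neighbors=5)
model.fit(X_train, y_train)

# Predict the species for the 45 held-out rows.
y_pred = model.predict(X_test)
accuracy = accuracy_score(y_test, y_pred)
print(f"train={len(X_train)} test={len(X_test)} accuracy={accuracy:.3f}")
```

Any other scikit-learn classifier could be dropped in for `KNeighborsClassifier` without changing the split.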

2. Consider the below salary dataset:

YearsExperience   Salary
1.1               45000
3.5               60000
6.8               85000
8.5               95000
10.0              120000
9.0               105000
2.0               52000
3.8               65000
12.5              150000
15.6              180000

Write an R program to predict the salary for 5 years of experience using Linear Regression.

3. Write a Python program using Scikit-learn to split the Motorcycle dataset into 80% training data and
20% test data. Of the 303 records in total, the training set will contain 80% and the test set the
remaining 20%. Predict the response for the test dataset using a classifier.

4. Consider the given dataset with an odd number of observations arranged in descending order: 23,
21, 18, 16, 15, 13, 12, 10, 9, 7, 6, 5, and 2. Find the mean, median, mode, range, standard deviation,
variance, and five-number summary, and draw a boxplot, using Python's numpy, scipy and matplotlib libraries.
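A minimal sketch of the numeric part of question 4 follows. It uses NumPy for most statistics and, as a swapped-in choice, the standard library's `statistics.multimode` for the mode, because every value in this dataset occurs exactly once. Treating the data as a sample (`ddof=1`) is an assumption.

```python
import statistics

import numpy as np

data = np.array([23, 21, 18, 16, 15, 13, 12, 10, 9, 7, 6, 5, 2])

mean = data.mean()
median = np.median(data)
# Every value occurs once, so all 13 values tie as modes.
modes = statistics.multimode(data.tolist())
value_range = data.max() - data.min()

# ddof=1 gives the sample variance and standard deviation (an assumption);
# use ddof=0 for the population versions.
variance = data.var(ddof=1)
std_dev = data.std(ddof=1)

# Five-number summary with NumPy's default linear-interpolation quartiles.
q1, q3 = np.percentile(data, [25, 75])
five_number = (data.min(), q1, median, q3, data.max())

print(f"mean={mean:.3f} median={median} range={value_range}")
print(f"variance={variance:.3f} std_dev={std_dev:.3f}")
print(f"five-number summary={five_number}")
```

For the plot, `matplotlib.pyplot.boxplot(data)` followed by `matplotlib.pyplot.show()` draws the boxplot. `scipy.stats.mode` is an alternative for the mode, though for data with no repeated values it reports only one of the tied values.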

5. Write a Python program to apply pre-processing techniques such as handling missing data, scaling,
and encoding categorical variables for a retail sales dataset. Then implement the backpropagation
algorithm using a neural network to predict customer purchase behavior.

6. While evaluating a regression model, you notice that the Mean Squared Error (MSE) is significantly
higher than the Mean Absolute Error (MAE). What does this indicate about your data, and how would
you address it?
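The effect behind question 6 can be illustrated with a small sketch. The values below are hypothetical, chosen so that four predictions are off by 1 and one is off by 10. Because MSE squares each error, the single large error dominates it, while MAE and the median absolute error move much less.

```python
import math
import statistics

# Hypothetical values for illustration: four small errors and one outlier.
y_true = [10.0, 12.0, 14.0, 16.0, 18.0]
y_pred = [11.0, 11.0, 15.0, 15.0, 28.0]

errors = [p - t for p, t in zip(y_pred, y_true)]

mae = sum(abs(e) for e in errors) / len(errors)      # mean absolute error
mse = sum(e * e for e in errors) / len(errors)       # mean squared error
rmse = math.sqrt(mse)                                # same units as MAE
median_ae = statistics.median(abs(e) for e in errors)  # robust to the outlier

print(f"MAE={mae:.2f} MSE={mse:.2f} RMSE={rmse:.2f} median AE={median_ae:.2f}")
```

Since MSE is in squared units, comparing RMSE with MAE is the fairer check: a large gap between the two suggests a few large errors, often caused by outliers. Common responses include inspecting those rows, transforming the target, or training with a more robust loss.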

7. Import a dataset of employee job satisfaction, which has attributes such as age, years_at_company,
job_role, education_level, and salary. Implement the Naive Bayes classification algorithm using Python
to predict whether an employee will stay with the company or leave (e.g., predicting if an employee will
leave based on job satisfaction).

8. Write a Python program to apply pre-processing techniques for the vote dataset, which includes
details like population, gender, education, age, superfund, crime, etc. Then implement a Decision Tree
classifier for predicting class outcomes and display the decision tree visually.

9. Evaluate a regression model where the target variable has a skewed distribution. What metrics
would be most appropriate for assessing model performance?

10. Import a dataset for student performance, with attributes containing study_hours, attendance,
previous_grades, participation, study_material_usage, and age. Implement the Bagging ensemble
algorithm using Python's scikit-learn to predict the final exam score of a student based on these
features.

11. Import a dataset of species in a forest ecosystem and apply the K-Means clustering algorithm
with n_clusters=2, n_clusters=3, and n_clusters=4 using Python. The dataset includes attributes like
height, leaf_size, growth_rate, and environmental_factors. Analyze the accuracy of the clustering and
visualize the distribution of the species into different groups based on these features.

12. Consider the below salary dataset:

Years of Experience - Salary
10.1 - 39343.00
10.3 - 46205.00
10.5 - 37731.00
20.0 - 43525.00
20.2 - 69891.00
22.9 - 118882.00
34.0 - 110150.00
35.2 - 134445.00
36.2 - 144445.00
38.7 - 157189.00

Predict the salary for 55 years of experience using a regression model in Python.
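A sketch for question 12 follows, assuming a simple linear regression is acceptable as the "regression model". Note that 55 years lies well outside the observed range (10.1 to 38.7), so the prediction is an extrapolation.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Data copied from the table in question 12.
years = np.array([10.1, 10.3, 10.5, 20.0, 20.2, 22.9, 34.0, 35.2, 36.2, 38.7])
salary = np.array([39343.0, 46205.0, 37731.0, 43525.0, 69891.0,
                   118882.0, 110150.0, 134445.0, 144445.0, 157189.0])

# scikit-learn expects a 2D feature matrix, hence the reshape.
model = LinearRegression()
model.fit(years.reshape(-1, 1), salary)

# Predict the salary for 55 years of experience (an extrapolation).
predicted = model.predict(np.array([[55.0]]))[0]
print(f"slope={model.coef_[0]:.2f} intercept={model.intercept_:.2f}")
print(f"predicted salary at 55 years: {predicted:.2f}")
```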

13. Apply pre-processing techniques for a dataset that includes details of employee_age,
years_at_company, department, job_satisfaction, and performance_score. Implement the Support
Vector Machine (SVM) algorithm using Python's scikit-learn to classify whether an employee is likely
to be promoted or not. Visualize the decision boundaries and assess the model's accuracy.

14. Write a Python program to split the customer churn dataset into 70% train data and 30% test data.
Out of a total of 1000 records, the training set will contain 700 records and the test set will contain 300
records. Predict whether a customer will churn using a Random Forest classifier.

15. Consider the following housing dataset:

House Age (Years) - Number of Bedrooms - Price
1 - 2 - 300000
3 - 3 - 250000
5 - 3 - 200000
8 - 4 - 180000
10 - 4 - 150000

Write a Python program to predict the house price based on the number of bedrooms and house age
using a Linear Regression model.
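A minimal sketch for question 15 is shown below. The query house (6 years old, 3 bedrooms) is a hypothetical input chosen for illustration; the question does not specify one.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Columns: house age in years, number of bedrooms (from the table in question 15).
X = np.array([[1, 2], [3, 3], [5, 3], [8, 4], [10, 4]], dtype=float)
price = np.array([300000, 250000, 200000, 180000, 150000], dtype=float)

model = LinearRegression()
model.fit(X, price)

# Hypothetical house to price: 6 years old with 3 bedrooms.
query = np.array([[6.0, 3.0]])
predicted_price = model.predict(query)[0]

age_coef, bedrooms_coef = model.coef_
print(f"age coefficient={age_coef:.2f} bedrooms coefficient={bedrooms_coef:.2f}")
print(f"predicted price: {predicted_price:.2f}")
```

With only five rows, and with age and bedroom count rising together, the individual coefficients are not reliable estimates of each feature's effect; the sketch only demonstrates the mechanics.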

16. Write a Python program to perform feature selection using Recursive Feature Elimination (RFE)
for a dataset and implement classification using Support Vector Machines.
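Question 16 does not name a dataset, so the sketch below assumes the Wine dataset bundled with scikit-learn and keeps 5 of its 13 features; both choices are illustrative. RFE needs an estimator that exposes feature weights, so a linear-kernel SVM is used for the elimination step.

```python
from sklearn.datasets import load_wine
from sklearn.feature_selection import RFE
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Assumed dataset: bundled Wine data (178 rows, 13 features, 3 classes).
X, y = load_wine(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y
)

pipeline = Pipeline([
    ("scale", StandardScaler()),  # SVMs are sensitive to feature scale
    # Linear kernel so RFE can rank features by their coefficients.
    ("rfe", RFE(estimator=SVC(kernel="linear"), n_features_to_select=5)),
    ("svm", SVC(kernel="linear")),  # final classifier on the kept features
])
pipeline.fit(X_train, y_train)

selected_mask = pipeline.named_steps["rfe"].support_
accuracy = pipeline.score(X_test, y_test)
print(f"selected feature mask: {selected_mask}")
print(f"test accuracy: {accuracy:.3f}")
```

Placing the scaler and RFE inside the pipeline means both are fitted on the training split only, which avoids leaking test information into feature selection.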

17. Apply pre-processing techniques for a vehicle dataset containing details such as car_model,
year_of_manufacture, mileage, fuel_type, and service_history. Implement the Random Forest algorithm
to predict whether a car will require major repairs in the next 12 months based on these features. Use
Python to preprocess the data and evaluate the model's performance.

18. Import the weather dataset with attributes like "temperature, humidity, wind speed, and
condition" and implement a Naive Bayes classifier to predict the weather condition (Sunny, Cloudy,
Rainy) using Python.

19. Consider the following dataset on online transactions:

Transaction Amount - Fraudulent (0 = No, 1 = Yes)
50 - 0
100 - 1
75 - 0
200 - 1
150 - 0
Write a Python program to predict whether a transaction is fraudulent based on the transaction amount
using Logistic Regression.
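A sketch for question 19 using the five transactions listed above. The query amount of 120 is a hypothetical input, and `max_iter=1000` is an illustrative setting to give the solver room on unscaled data.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Data from the table in question 19.
amount = np.array([[50.0], [100.0], [75.0], [200.0], [150.0]])
fraudulent = np.array([0, 1, 0, 1, 0])

model = LogisticRegression(max_iter=1000)
model.fit(amount, fraudulent)

# Hypothetical new transaction of amount 120.
query = np.array([[120.0]])
predicted_label = model.predict(query)[0]
fraud_probability = model.predict_proba(query)[0, 1]
print(f"predicted label={predicted_label} P(fraud)={fraud_probability:.3f}")
```

Five rows are far too few for a meaningful fraud model, and the classes overlap (100 is fraudulent while 150 is not), so the fitted probabilities should be read as a demonstration of the API only.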

20. Import the social media dataset with features such as "age, number of followers, likes per post,
and posts per day". Implement the K-Nearest Neighbors (KNN) algorithm to predict the category of the
user (Influencer, Regular User) using Python.

21. Write a Python program using Scikit-learn to split the MNIST dataset into 80% train data and 20%
test data. Train a Logistic Regression classifier on the dataset and evaluate the accuracy on the test set.

22. Implement a Python program to apply Principal Component Analysis (PCA) on the Iris dataset
to reduce the dimensionality to 2 principal components and visualize the results in a 2D scatter plot.
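The projection step of question 22 can be sketched as follows, using the Iris copy bundled with scikit-learn. PCA is applied to the raw measurements here; standardizing first is a common alternative and would change the components.

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA

iris = load_iris()
X, y = iris.data, iris.target

# Project the 4 original features onto the 2 directions of largest variance.
pca = PCA(n_components=2)
X_2d = pca.fit_transform(X)

explained = pca.explained_variance_ratio_
print(f"projected shape: {X_2d.shape}")
print(f"variance explained by each component: {explained}")
```

For the scatter plot, `matplotlib.pyplot.scatter(X_2d[:, 0], X_2d[:, 1], c=y)` colours each point by species.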

23. Write a Python program using Scikit-learn to split a loan approval dataset into 75% train data and
25% test data. The dataset includes features like income, credit_score, loan_amount,
employment_status, and debt_to_income_ratio. Preprocess the data and train a Random Forest classifier
to predict whether a loan application will be approved or denied.

24. Implement a Python program to perform feature scaling using StandardScaler on a dataset with
numerical features and then train a K-Nearest Neighbors (KNN) classifier for classification tasks.
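Question 24 leaves the dataset open, so this sketch assumes the bundled Wine dataset, whose 13 numerical features sit on very different scales. The choice of k=5 and the 70/30 split are also assumptions.

```python
from sklearn.datasets import load_wine
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Assumed dataset: bundled Wine data with purely numerical features.
X, y = load_wine(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y
)

# The scaler is fitted on the training split only, then applied to both splits.
model = make_pipeline(StandardScaler(), KNeighborsClassifier(n_neighbors=5))
model.fit(X_train, y_train)

y_pred = model.predict(X_test)
accuracy = model.score(X_test, y_test)
print(f"test accuracy with scaling: {accuracy:.3f}")
```

Scaling matters for KNN because distances are otherwise dominated by whichever feature has the largest numeric range.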

25. Write a Python program to implement the Naive Bayes classifier on a spam email dataset.
Predict whether an email is spam or not based on features like word frequency, length, etc.

26. Apply Linear Discriminant Analysis (LDA) on a fraud detection dataset using Python, and compare
the classification results with a Logistic Regression model. The dataset includes features such as
transaction_amount, transaction_time, user_id, location, and device_used. Evaluate both models to
predict whether a transaction is fraudulent or not.

27. Write a Python program to implement the AdaBoost algorithm using Scikit-learn on a binary
classification dataset (e.g., predicting whether a customer will purchase or not).

28. Implement a Python program to use the Gradient Boosting classifier on a dataset, predict the class
labels, and evaluate its accuracy.
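A sketch for question 28, assuming the Breast Cancer dataset bundled with scikit-learn since the question names none. The estimator uses scikit-learn's default hyperparameters apart from a fixed `random_state` for repeatability.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Assumed dataset: bundled Breast Cancer data (binary classification).
X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0, stratify=y
)

# Default settings; random_state only fixes the run for repeatability.
model = GradientBoostingClassifier(random_state=0)
model.fit(X_train, y_train)

# Predict class labels for the held-out rows and score them.
y_pred = model.predict(X_test)
accuracy = accuracy_score(y_test, y_pred)
print(f"test accuracy: {accuracy:.3f}")
```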

29. Import the Boston Housing dataset, and implement a Random Forest regressor to predict housing prices based on
features such as crime rate, average number of rooms, etc.
