0% found this document useful (0 votes)

45 views2 pages

Capstone Project-Naan Mudlvan

The document provides instructions for a capstone project to build a predictive model using a CPU performance dataset. The dataset contains 209 rows and 9 columns with details on CPU vendors, models, specifications and performance scores. The tasks are to preprocess the data by encoding categorical variables, dropping rare vendors and the model column, split the data into train and test sets, build a linear regression model on the training set, calculate performance metrics on both sets, check for multicollinearity using VIF, and predict the performance score for a new CPU instance.

Uploaded by

mohanraj28174

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

45 views2 pages

Capstone Project-Naan Mudlvan

Uploaded by

mohanraj28174

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 2

Capstone Project

General Instructions:
1. Provide appropriate comments in your code.
2. Perform all task programmatically using Python libraries.

‘XYZ’ hardware service center is specialized in servicing the CPUs. The center has maintained the details
about the CPUs they have serviced which is available in “machine_data.csv”. The dataset has 209 rows
and 9 columns. [Source of raw dataset: UCI repository] The details of the columns are as follows:

 vendor: represents the manufacturer of the CPU.

 model: represents the model number of the CPU.
 cycle_time: represents the time taken for internal data transfer in nanoseconds of the CPU.
 min_memory: represents the minimum main memory required by the CPU.
 max_memory: represents the maximum main memory supported by the CPU.
 cache: represents the size of cache memory required by the CPU.
 min_threads: represents the number of threads that run in the CPU when it is just switched on.
 max_threads: represents the maximum number of threads that can be run on the CPU.
 score: represents the performance score of the CPU.

Based on this data, ‘XYZ’ hardware service center would like to build a predictive model that predicts the
performance score for the new CPUs.
As a data science expert, you are expected to build the best model for the given scenario.

Problem statement:

Perform the following activities to build the model:

1. Import the data set “machine_data.csv”.

2. As part of data preprocessing, perform the following activities:
a. Encode the categorical column – ‘vendor’ using label encoder.
b. Identify the vendors who have manufactured less than 5 CPUs and drop those rows from the
given dataset, corresponding to the identified vendors.
c. Drop the column ‘model’.

[Note: The preprocessed dataset should be used further.]

3. Select ‘score’ as the target variable to be predicted and remaining features as predictors.
4. Split the data into training and testing data set in the ratio 80:20.
5. As part of model building, perform the following activities:
a. Based on the training data, build a Linear Regression model.
b. Find the train and the test score for the built model.
c. Calculate the adjusted R-Squared values on both the train and the test data.
6. Calculate the VIF values for all the features considered while building the model using the train data.
7. Based on the model built, predict the performance score of a new test sample/ new CPU instance
which is given below:

(Note: In the above hardware instance, the vendor value '14' is the label encoded value for vendor
'harris'.)

Predictive Modeling Projectt
No ratings yet
Predictive Modeling Projectt
109 pages
Basic Python Notes
100% (3)
Basic Python Notes
8 pages
How To Turn On or Off The Call and SMS Blocking Feature in Your Phone. by Moses Grey Medium
No ratings yet
How To Turn On or Off The Call and SMS Blocking Feature in Your Phone. by Moses Grey Medium
1 page
Laptop Price Analysis
No ratings yet
Laptop Price Analysis
37 pages
XSTK
No ratings yet
XSTK
36 pages
Predictive Modelling
No ratings yet
Predictive Modelling
28 pages
Laptop Price Analysis (Finance Analyst)
No ratings yet
Laptop Price Analysis (Finance Analyst)
36 pages
First
No ratings yet
First
35 pages
Business+Report Linear
No ratings yet
Business+Report Linear
20 pages
Report
No ratings yet
Report
14 pages
ML5&6&7&8&9&10
No ratings yet
ML5&6&7&8&9&10
35 pages
MSC Academic Internship Config Manual IDS Improvement Using MIGBM Feature Selection
No ratings yet
MSC Academic Internship Config Manual IDS Improvement Using MIGBM Feature Selection
19 pages
MLPC Group Assignment
No ratings yet
MLPC Group Assignment
16 pages
Generative AI For Models Development
No ratings yet
Generative AI For Models Development
8 pages
Data Mining
No ratings yet
Data Mining
10 pages
Saayna Narvekar ESE
No ratings yet
Saayna Narvekar ESE
12 pages
Pravesh 6301
No ratings yet
Pravesh 6301
11 pages
Plate Notebook Guided Project 1 1
No ratings yet
Plate Notebook Guided Project 1 1
58 pages
Final Exam MPML
No ratings yet
Final Exam MPML
5 pages
Manual (2023 CS 156)
No ratings yet
Manual (2023 CS 156)
26 pages
Capstone 2 Corizo
No ratings yet
Capstone 2 Corizo
2 pages
Practical Assignment. Applying Methods of Machine Learning With Example
No ratings yet
Practical Assignment. Applying Methods of Machine Learning With Example
2 pages
Juspay Interview Experience
No ratings yet
Juspay Interview Experience
3 pages
Assignment
No ratings yet
Assignment
5 pages
Business Report
No ratings yet
Business Report
30 pages
Report
No ratings yet
Report
4 pages
Tushar ML
No ratings yet
Tushar ML
52 pages
EST - Problem Statement-3
No ratings yet
EST - Problem Statement-3
3 pages
Digital Transformation in Banking
No ratings yet
Digital Transformation in Banking
4 pages
Task - Case Study - DLMDSME01
No ratings yet
Task - Case Study - DLMDSME01
7 pages
Lab08 ML
No ratings yet
Lab08 ML
6 pages
Project Description Document
No ratings yet
Project Description Document
7 pages
Artificial Intelligence Semester Project: Topic: Car Mileage Predictor Presented by Abdullah Farooq
No ratings yet
Artificial Intelligence Semester Project: Topic: Car Mileage Predictor Presented by Abdullah Farooq
17 pages
MS5107 DM Regression Models Workshop - NUIG
No ratings yet
MS5107 DM Regression Models Workshop - NUIG
8 pages
Midterm Task
No ratings yet
Midterm Task
1 page
A1991370857 65680 10 2025 Csm355ca1
No ratings yet
A1991370857 65680 10 2025 Csm355ca1
6 pages
Sunita Pradhan Previous Resume
No ratings yet
Sunita Pradhan Previous Resume
2 pages
ML Assignment 2
No ratings yet
ML Assignment 2
3 pages
Power Consumption Forecasting - 191030052
No ratings yet
Power Consumption Forecasting - 191030052
6 pages
Assignment 2
No ratings yet
Assignment 2
3 pages
FMT - Problem - Statement
No ratings yet
FMT - Problem - Statement
2 pages
DS Assignment
No ratings yet
DS Assignment
2 pages
Predictive Modelling ALOK KUMAR
100% (1)
Predictive Modelling ALOK KUMAR
25 pages
Python - Project 2 Problem Statement
No ratings yet
Python - Project 2 Problem Statement
3 pages
Sari Go MM Ulaan U Deep Resume
No ratings yet
Sari Go MM Ulaan U Deep Resume
3 pages
Subject - Machine Learning Group - E27-24 Name
No ratings yet
Subject - Machine Learning Group - E27-24 Name
18 pages
CS-605-MJPLab Course On CS-602-MJ (Machine Learning)
No ratings yet
CS-605-MJPLab Course On CS-602-MJ (Machine Learning)
2 pages
Mobile Computing Full Notes
No ratings yet
Mobile Computing Full Notes
195 pages
Ce473 Project - Fall 2024
No ratings yet
Ce473 Project - Fall 2024
8 pages
Example 2 SPM Lec#1
No ratings yet
Example 2 SPM Lec#1
3 pages
Assignment 2
No ratings yet
Assignment 2
3 pages
Predictive Modelling Project
No ratings yet
Predictive Modelling Project
29 pages
ADS Phase2
No ratings yet
ADS Phase2
2 pages
Cours 3 - TP
No ratings yet
Cours 3 - TP
3 pages
Capstone Project - Jaro-Prof. Babji
No ratings yet
Capstone Project - Jaro-Prof. Babji
5 pages
Google Ai ML Virtual Internship Report
No ratings yet
Google Ai ML Virtual Internship Report
29 pages
Important Questions
No ratings yet
Important Questions
4 pages
Machine Learning Assignment-02
No ratings yet
Machine Learning Assignment-02
2 pages
3529201
No ratings yet
3529201
3 pages
Urban Freestyle Soccer ReadMe 15th October 2003 Version 1.0 TABLE
No ratings yet
Urban Freestyle Soccer ReadMe 15th October 2003 Version 1.0 TABLE
5 pages
Siemens - Profibus and Modbus Comparison
No ratings yet
Siemens - Profibus and Modbus Comparison
5 pages
Trackwise® Customer Complaint Solutions: A Global Approach Integrated Corrective & Preventive Actions (Capa)
100% (1)
Trackwise® Customer Complaint Solutions: A Global Approach Integrated Corrective & Preventive Actions (Capa)
2 pages
Database Testing Interview Questions
No ratings yet
Database Testing Interview Questions
7 pages
Scenario Overview - 20250525 - 233522 - 0000
No ratings yet
Scenario Overview - 20250525 - 233522 - 0000
3 pages
AI-Powered Research & Report-Making
No ratings yet
AI-Powered Research & Report-Making
34 pages
Group 08 - Progress Review I - Online Movie Ticket Booking System For Southern Province
No ratings yet
Group 08 - Progress Review I - Online Movie Ticket Booking System For Southern Province
18 pages
Manual CKDZ - Procesador Video
No ratings yet
Manual CKDZ - Procesador Video
29 pages
Skyline Health Diagnostics
No ratings yet
Skyline Health Diagnostics
99 pages
Leco Gis Spec
No ratings yet
Leco Gis Spec
8 pages
Reviewed Basic Units OS Level 5 and 6
No ratings yet
Reviewed Basic Units OS Level 5 and 6
27 pages
Bca 2021 - 22
No ratings yet
Bca 2021 - 22
123 pages
Null 1
No ratings yet
Null 1
27 pages
2012 - SCADA Security in The Light of Cyber-Warfare
100% (1)
2012 - SCADA Security in The Light of Cyber-Warfare
19 pages
Principales of Ac One Mark 5 Units NEW-1
No ratings yet
Principales of Ac One Mark 5 Units NEW-1
11 pages
FusionHub User Manual and Installation Guide PDF
No ratings yet
FusionHub User Manual and Installation Guide PDF
116 pages
Design and Implementation Online Software Store
No ratings yet
Design and Implementation Online Software Store
55 pages
ReleaseNotes S1AgileV1.4.1 RC
No ratings yet
ReleaseNotes S1AgileV1.4.1 RC
20 pages
BGP Frequently Asked Questions
No ratings yet
BGP Frequently Asked Questions
9 pages
Video Graphics Array (VGA) : (VGA) Port HD-15 DE-15 DB-15
No ratings yet
Video Graphics Array (VGA) : (VGA) Port HD-15 DE-15 DB-15
8 pages
A New Automobile Sales Marketing Model For Innovat
No ratings yet
A New Automobile Sales Marketing Model For Innovat
24 pages
INTERNSHIPppp Presentation
No ratings yet
INTERNSHIPppp Presentation
13 pages
Volume7 Issue2 Paper6 2023
No ratings yet
Volume7 Issue2 Paper6 2023
17 pages
Assignment 03
No ratings yet
Assignment 03
11 pages
Quantitative Aptitude
No ratings yet
Quantitative Aptitude
4 pages
AN-276 IEC 61131-3 Flying Shear
No ratings yet
AN-276 IEC 61131-3 Flying Shear
6 pages
UBIX Roadmap
No ratings yet
UBIX Roadmap
1 page
TOC For Industries OM
No ratings yet
TOC For Industries OM
2 pages
Using Low-Code Solutions To Make The Most of Industrial IoT
No ratings yet
Using Low-Code Solutions To Make The Most of Industrial IoT
9 pages
Power Off Reset Reason Backup
No ratings yet
Power Off Reset Reason Backup
5 pages
FortiGate 1500D Spec
No ratings yet
FortiGate 1500D Spec
6 pages
Debugging Imagej Plugins in Netbeans
No ratings yet
Debugging Imagej Plugins in Netbeans
4 pages
SIMATIC Field PG M2 - Guideline To The Operating Instructions
No ratings yet
SIMATIC Field PG M2 - Guideline To The Operating Instructions
1 page
Python Beyond Limits: Python, #3
From Everand
Python Beyond Limits: Python, #3
AnwaarX
No ratings yet
C Programming for the Pc the Mac and the Arduino Microcontroller System
From Everand
C Programming for the Pc the Mac and the Arduino Microcontroller System
Peter D Minns
No ratings yet
MICROSOFT AZURE ADMINISTRATOR EXAM PREP(AZ-104) Part-4: AZ 104 EXAM STUDY GUIDE
From Everand
MICROSOFT AZURE ADMINISTRATOR EXAM PREP(AZ-104) Part-4: AZ 104 EXAM STUDY GUIDE
Devi Prasad
No ratings yet
Administering Microsoft Azure SQL Solutions DP 300
From Everand
Administering Microsoft Azure SQL Solutions DP 300
Manish Soni
No ratings yet
AZ-801 Exam Prep: Configuring Windows Server Hybrid Services
From Everand
AZ-801 Exam Prep: Configuring Windows Server Hybrid Services
Steve Brown
No ratings yet
SAS Interview Questions You'll Most Likely Be Asked
From Everand
SAS Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked
From Everand
SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Capstone Project-Naan Mudlvan

Uploaded by

Capstone Project-Naan Mudlvan

Uploaded by

Capstone Project

 vendor: represents the manufacturer of the CPU.

Perform the following activities to build the model:

1. Import the data set “machine_data.csv”.

[Note: The preprocessed dataset should be used further.]

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.