0% found this document useful (0 votes)

36 views

ML Practical 04

This document provides steps for building a simple linear regression model to predict house prices based on house area using Python. The steps include: 1) Importing necessary libraries and loading the dataset 2) Exploring and visualizing the data 3) Splitting the data into training and testing sets 4) Creating and training a linear regression model on the training set 5) Making predictions on the test set and evaluating the model's performance 6) Visualizing the regression line and predicted prices 7) Allowing users to input an area to predict the corresponding house price.

Uploaded by

chatgptlogin2001

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

36 views

ML Practical 04

Uploaded by

chatgptlogin2001

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

ITC 2252 - Introduction

to Machine Learning
Practical Session - 04
Steps of the process
01 Import Data
02 Clean the Data
03 Split the data to testing & Training
04 Design the model
05 Train the Model
06 Make Predictions
07 Evaluate and Improve
Today’s session
Simple linear regression model building
01 Split the data to testing & Training
02 Design the model
03 Train the Model
04 Make Predictions
01
Linear Regression
What is Linear Regression?
Linear regression is a statistical method used for modeling the
relationship between a dependent variable and one or more independent
variables by fitting a linear equation to the observed data.
How to predict house price according to the area of the house?
House area (m2) Price ($)

10000 4000

20000 5000

30000 6000

40000 7000

50000 8000

We can use linear regression model for the price prediction.

Y = a +bx
Y = Dependent variable (Price)
X = Independent variable (House area)
a = y intercept (value of the dependent variable when x = 0)
b = coefficient of the independent variable
02
Model building
Step 1: Import necessary libraries

This step imports the required Python libraries:

➔ pandas for data frame creation and manipulation.
➔ matplotlib.pyplot for data visualization.
➔ train_test_split from sklearn.model_selection to split the dataset into training and
testing sets.
➔ LinearRegression from sklearn.linear_model for building a linear regression model.
➔ mean_squared_error from sklearn.metrics to evaluate the model's performance.
Step 2: Load the dataset

This step reads the Housing dataset from a CSV ﬁle into a pandas DataFrame named
data.

Step 3: Explore the data

This prints the ﬁrst few rows of the dataset(head), giving you an idea about its structure.
Step 4: Visualize the data

This step creates a scatter plot to visually represent the relationship between the 'Area'
and 'Price' columns.
Step 5: Prepare the data for training

This separates the independent variable (X - 'Area') and the dependent variable (y - 'Price').
Step 6: Split the data into training and testing sets

This splits the data into training and testing sets.

➔ The test_size parameter determines the proportion of the data used for testing (in
this case, 20%).
➔ The random _state parameter ensures that the split is fixed, meaning that the same
split will be produced every time you run the code.
Step 6: Split the data into training and testing sets cont.
The purpose of splitting the data into training and testing sets is to evaluate how well the machine
learning model generalizes to new, unseen data.
Training Set:
❖ Purpose: The model learns the patterns and relationships within the training data.
❖ Benefit: The model adjusts its parameters based on this data to make accurate predictions.
Testing Set:
❖ Purpose: The model's performance is assessed on data it has never seen before.
❖ Benefit: This evaluation provides an estimate of how well the model is likely to perform on new,
real-world data.
Test Size Parameter:
❖ Purpose: It determines the proportion of the data allocated to the testing set.
❖ Benefit: A larger test set can provide a more reliable evaluation, but a smaller test set may lead
to more data for training.
Random State Parameter:
❖ Purpose: It ensures reproducibility by fixing the random seed for the data split.
❖ Benefit: With the same random state, the data split remains consistent across runs, making
experiments reproducible.
Step 7: Create and train the linear regression model

This step creates a Linear Regression Model:

model = LinearRegression(): This line creates an instance of the LinearRegression class from the scikit-learn
library. This instance (model) will be used to represent the linear regression model.
Train the Model: model.fit(X_train, y_train): This line trains the linear regression model using the training data.
The fit method takes two main parameters:
● X_train: The input features (independent variable) from the training set. In the context of house price
prediction, it represents the 'Area' of the house.
● y_train: The target variable (dependent variable) from the training set. In this case, it represents the
corresponding house prices.
The fit method adjusts the model's parameters (slope and intercept) to find the best-fit line that minimizes the
difference between the predicted values and the actual values in the training data.
After this line is executed, the model object is now trained and can be used to make predictions on new,
unseen data.
Step 8: Make predictions on the test set

This step uses the trained model to make predictions on the test set.
● model.predict(X_test): This line uses the trained model to predict the dependent
variable (y) based on the independent variable (X_test), which represents the 'Area'
of the houses in the test set.
● y_pred : The predicted values are stored in the variable y_pred.
Step 9: Evaluate the model

This calculates the Mean Squared Error, a metric to evaluate how well the model is
performing on the test data.
Step 10: Visualize the regression line

This step visualizes the regression line along with the test set to understand how well
the model ﬁts the data.
Step 11: Predict house price for user input

1. This step takes user input for the area of the house, converts it to a DataFrame with
the column name 'Area'.
2. Then uses the trained model to predict the house price based on the user's input.
3. The predicted price is then displayed.
4. This allows users to get a predicted house price for a speciﬁc area without having
to look at the entire dataset.
Thanks
Do you have any questions?

Predicting House Prices
No ratings yet
Predicting House Prices
9 pages
Phase 5
No ratings yet
Phase 5
5 pages
House Price Prediction Using Linear Regression in ML
No ratings yet
House Price Prediction Using Linear Regression in ML
9 pages
ml record
No ratings yet
ml record
21 pages
1_Lab Manual (ML)
No ratings yet
1_Lab Manual (ML)
42 pages
Aastha Mahajan Python File
No ratings yet
Aastha Mahajan Python File
17 pages
ADS_LAB8
No ratings yet
ADS_LAB8
5 pages
day 5
No ratings yet
day 5
2 pages
Lab 2 Linear Regression Representation
No ratings yet
Lab 2 Linear Regression Representation
6 pages
Regression Dataset
No ratings yet
Regression Dataset
3 pages
For House Price Prediction Model
No ratings yet
For House Price Prediction Model
9 pages
House price predictor ppt Project
No ratings yet
House price predictor ppt Project
13 pages
Shub Neet Dt
No ratings yet
Shub Neet Dt
12 pages
Exp4(Linear Regression)
No ratings yet
Exp4(Linear Regression)
2 pages
Report
No ratings yet
Report
40 pages
Project
No ratings yet
Project
10 pages
ML MANUAL
No ratings yet
ML MANUAL
24 pages
Machine learning lab manual
No ratings yet
Machine learning lab manual
22 pages
Real-Estate Property
No ratings yet
Real-Estate Property
11 pages
Report On Java Chatting
No ratings yet
Report On Java Chatting
10 pages
AIML
No ratings yet
AIML
5 pages
ML Assignment2 33418
No ratings yet
ML Assignment2 33418
6 pages
Real Estate Price Prediction Model
No ratings yet
Real Estate Price Prediction Model
3 pages
Dl Assignment 1ms24rai03
No ratings yet
Dl Assignment 1ms24rai03
10 pages
Title Predicting House Pricing Using AIML (KASHISH)
No ratings yet
Title Predicting House Pricing Using AIML (KASHISH)
2 pages
Comparing Linear Regression and Decision Trees For Housing Price Prediction
No ratings yet
Comparing Linear Regression and Decision Trees For Housing Price Prediction
8 pages
Regression House Price
No ratings yet
Regression House Price
34 pages
Utkarsh Gupta - House Price Prediction
No ratings yet
Utkarsh Gupta - House Price Prediction
6 pages
End To End Machine Learning Project-2
No ratings yet
End To End Machine Learning Project-2
10 pages
House Price Prediction PyCharm
No ratings yet
House Price Prediction PyCharm
9 pages
ml project clg (2)
No ratings yet
ml project clg (2)
62 pages
Price Prediction
No ratings yet
Price Prediction
4 pages
House Price Prediction Report
No ratings yet
House Price Prediction Report
2 pages
HOUSE PRICE PREDICTION
No ratings yet
HOUSE PRICE PREDICTION
17 pages
Linear Regression in python
No ratings yet
Linear Regression in python
9 pages
UtkarshGupta (House Price Prediction)
No ratings yet
UtkarshGupta (House Price Prediction)
14 pages
House Price Prediction Using Machine Learning Techniques
No ratings yet
House Price Prediction Using Machine Learning Techniques
5 pages
House Price Prediction Using Machine Learning Techniques
No ratings yet
House Price Prediction Using Machine Learning Techniques
5 pages
CP4252 Machine Learning Lab Manual
No ratings yet
CP4252 Machine Learning Lab Manual
26 pages
intership report
No ratings yet
intership report
20 pages
House Pricing
No ratings yet
House Pricing
15 pages
Coding Question
No ratings yet
Coding Question
6 pages
C1 W1 Lab02 Model Representation Soln
No ratings yet
C1 W1 Lab02 Model Representation Soln
5 pages
Seminar Ppt4
No ratings yet
Seminar Ppt4
19 pages
ml project part a 1
No ratings yet
ml project part a 1
6 pages
C1 W1 Lab03 Model Representation Soln-Copy1
No ratings yet
C1 W1 Lab03 Model Representation Soln-Copy1
7 pages
Project Presentation On House Price Prediction System: Presented by Name: Simran B Solanki Roll No: 19020
100% (1)
Project Presentation On House Price Prediction System: Presented by Name: Simran B Solanki Roll No: 19020
32 pages
Solution Methodology
No ratings yet
Solution Methodology
5 pages
House Prices
No ratings yet
House Prices
5 pages
IoT Task4 21BEC0384
No ratings yet
IoT Task4 21BEC0384
9 pages
AI_ML
No ratings yet
AI_ML
2 pages
C1 W1 Lab02 Model Representation Soln
No ratings yet
C1 W1 Lab02 Model Representation Soln
5 pages
House Price Prediction - Research Paper FINAL DRAFT
100% (1)
House Price Prediction - Research Paper FINAL DRAFT
10 pages
P05 The Regression Pipeline - Training and Testing Ans
No ratings yet
P05 The Regression Pipeline - Training and Testing Ans
13 pages
Unit 5
No ratings yet
Unit 5
18 pages
Model_learning_steps
No ratings yet
Model_learning_steps
12 pages
ISMLA_Module5
No ratings yet
ISMLA_Module5
25 pages
L03 The Regression Pipeline
No ratings yet
L03 The Regression Pipeline
94 pages
MY PRO DAY 9 Copy
No ratings yet
MY PRO DAY 9 Copy
59 pages
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
The Jackknife: Patrick Breheny
No ratings yet
The Jackknife: Patrick Breheny
23 pages
Final Exam January 2019 Ines Barkia PDF
No ratings yet
Final Exam January 2019 Ines Barkia PDF
10 pages
A1w2017 PDF
No ratings yet
A1w2017 PDF
2 pages
The Spearman Rho Rank Correlation Coefficient
No ratings yet
The Spearman Rho Rank Correlation Coefficient
22 pages
Results
No ratings yet
Results
12 pages
Assignment
No ratings yet
Assignment
20 pages
Frequency Table: Statistics
No ratings yet
Frequency Table: Statistics
7 pages
A Comprehensive Guide To Time Series Analysis
No ratings yet
A Comprehensive Guide To Time Series Analysis
18 pages
Case Analysis No. 5-Regression
No ratings yet
Case Analysis No. 5-Regression
5 pages
Biostatistics 202: Logistic Regression Analysis: Yhchan
No ratings yet
Biostatistics 202: Logistic Regression Analysis: Yhchan
5 pages
Chapter 2
No ratings yet
Chapter 2
59 pages
MOOC Econometrics 6
100% (1)
MOOC Econometrics 6
4 pages
Hypothesis Testing Roadmap
No ratings yet
Hypothesis Testing Roadmap
2 pages
Regression
No ratings yet
Regression
48 pages
SPSS Project
0% (1)
SPSS Project
12 pages
6 金融大数据只信用风险管理-骆司融老师 (9 10pm)
No ratings yet
6 金融大数据只信用风险管理-骆司融老师 (9 10pm)
14 pages
STAT501 Online - Spring2024 - FinalExam
No ratings yet
STAT501 Online - Spring2024 - FinalExam
14 pages
Output Spss
No ratings yet
Output Spss
5 pages
MATH 1281 Discussion Assignment Unit 2
No ratings yet
MATH 1281 Discussion Assignment Unit 2
3 pages
GLS e FGLS
No ratings yet
GLS e FGLS
10 pages
Assumption of Anova
No ratings yet
Assumption of Anova
8 pages
Forecasting The Quarterly Production of Rice and Corn in The Philippines: A Time Series Analysis
No ratings yet
Forecasting The Quarterly Production of Rice and Corn in The Philippines: A Time Series Analysis
11 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
14 pages
Correlation and Regression
No ratings yet
Correlation and Regression
18 pages
What Uncertainties Do We Need in Bayesian Deep Learning For Computer Vision?
No ratings yet
What Uncertainties Do We Need in Bayesian Deep Learning For Computer Vision?
12 pages
Week 1 Stats
No ratings yet
Week 1 Stats
23 pages
Weekly Usage Hrs Annual Maintenance Expense (1000s)
No ratings yet
Weekly Usage Hrs Annual Maintenance Expense (1000s)
5 pages
Assignment-Regression Analysis
No ratings yet
Assignment-Regression Analysis
6 pages
Lecture 4 Linear Regression 1 07032024 082032pm
No ratings yet
Lecture 4 Linear Regression 1 07032024 082032pm
32 pages
Advanced Econometric Methods I: Problem Set 1: Geert Mesters September 26, 2020
No ratings yet
Advanced Econometric Methods I: Problem Set 1: Geert Mesters September 26, 2020
2 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

ML Practical 04

Uploaded by

ML Practical 04

Uploaded by

ITC 2252 - Introduction

We can use linear regression model for the price prediction.

This step imports the required Python libraries:

Step 3: Explore the data

This splits the data into training and testing sets.

This step creates a Linear Regression Model:

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.