0% found this document useful (0 votes)

11 views9 pages

Logistic Regression Using Python

LOGISTIC REGRESSION USING PYTHON -

Uploaded by

Ishant

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views9 pages

Logistic Regression Using Python

LOGISTIC REGRESSION USING PYTHON -

Uploaded by

Ishant

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

12/22/23, 3:45 AM practice

In [83]: import pandas as pd

import numpy as np
data = pd.read_csv("mbasalary.csv")
data.head()
#data.iloc[:,0:1]

Out[83]: S. No. Percentage in Grade 10 Salary

0 1 62.00 270000

1 2 76.33 200000

2 3 72.00 240000

3 4 60.00 250000

4 5 61.00 180000

In [84]: data = data[['Percentage in Grade 10','Salary']]

x = data[['Percentage in Grade 10']]
y= data[['Salary']]
data.describe()

Out[84]: Percentage in Grade 10 Salary

count 50.000000 50.000000

mean 63.922400 258192.000000

std 9.859937 76715.790993

min 37.330000 120000.000000

25% 57.685000 204500.000000

50% 64.700000 250000.000000

75% 70.000000 300000.000000

max 83.000000 450000.000000

In [85]: import matplotlib.pyplot as plt

plt.scatter(data.iloc[:,0:1],data.iloc[:,-1])

<matplotlib.collections.PathCollection at 0x26fcab243d0>
Out[85]:

localhost:8888/nbconvert/html/practice .ipynb?download=false 1/9

12/22/23, 3:45 AM practice

In [ ]:

In [86]: data.mean()

Percentage in Grade 10 63.9224

Out[86]:
Salary 258192.0000
dtype: float64

In [87]: data.mode()

Out[87]: Percentage in Grade 10 Salary

0 68.0 300000

In [88]: data.median()

Percentage in Grade 10 64.7

Out[88]:
Salary 250000.0
dtype: float64

In [89]: data.std()

Percentage in Grade 10 9.859937

Out[89]:
Salary 76715.790993
dtype: float64

In [90]: data.quantile() #by default q=0.5

Percentage in Grade 10 64.7

Out[90]:
Salary 250000.0
Name: 0.5, dtype: float64

In [91]: data.quantile(q=0.25)

localhost:8888/nbconvert/html/practice .ipynb?download=false 2/9

12/22/23, 3:45 AM practice
Percentage in Grade 10 57.685
Out[91]:
Salary 204500.000
Name: 0.25, dtype: float64

In [92]: data.quantile(q=[0.25,0.5])

Out[92]: Percentage in Grade 10 Salary

0.25 57.685 204500.0

0.50 64.700 250000.0

In [93]: data.var()

Percentage in Grade 10 9.721836e+01

Out[93]:
Salary 5.885313e+09
dtype: float64

LOGISTIC REGRESSION
In [151… # Import necessary libraries
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, classification_report, f1_score

# Create a simple dataset

data = pd.DataFrame({
'Hours_Studied': [2, 3, 4, 5, 6, 7, 8, 9, 10],
'Hours_Slept': [5, 6, 5, 7, 8, 7, 8, 9, 10],
'Pass': [0, 0, 0, 0, 1, 1, 1, 1, 1] # 1 indicates pass, 0 indicates fail
})

# Display the dataset

print("Dataset:")
print(data)

# Split the data into training and testing sets

X = data[['Hours_Studied', 'Hours_Slept']]
y = data['Pass']
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_sta

# Create a logistic regression model

model = LogisticRegression()

# Train the model on the training data

model.fit(X_train, y_train)

# Make predictions on the testing data

y_pred = model.predict(X_test)

# Evaluate the model

accuracy = accuracy_score(y_test, y_pred)
print(f"\nAccuracy: {accuracy:.2f}")
print("f1 score: ", f1_score(y_test,y_pred))

localhost:8888/nbconvert/html/practice .ipynb?download=false 3/9

12/22/23, 3:45 AM practice
Dataset:
Hours_Studied Hours_Slept Pass
0 2 5 0
1 3 6 0
2 4 5 0
3 5 7 0
4 6 8 1
5 7 7 1
6 8 8 1
7 9 9 1
8 10 10 1

Accuracy: 1.00
f1 score: 1.0

In [95]: # Display classification report

print("Classification Report:")
print(classification_report(y_test, y_pred))

Classification Report:
precision recall f1-score support

0 1.00 1.00 1.00 1

1 1.00 1.00 1.00 1

accuracy 1.00 2
macro avg 1.00 1.00 1.00 2
weighted avg 1.00 1.00 1.00 2

In [ ]:

MULTINOMIAL REGRESSION
In [115… # Import necessary libraries
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, classification_report

# Create a simple dataset

data = pd.DataFrame({
'Hours_Studied': [2, 3, 4, 5, 6, 7, 8, 9, 10],
'Hours_Slept': [5, 6, 5, 7, 8, 7, 8, 9, 10],
'Grade': ['F', 'F', 'F', 'C', 'B', 'C', 'B', 'A', 'A'] # Three classes: F, C,
})

# Display the dataset

print("Dataset:")
print(data)

# Split the data into training and testing sets

localhost:8888/nbconvert/html/practice .ipynb?download=false 4/9

12/22/23, 3:45 AM practice
X = data[['Hours_Studied', 'Hours_Slept']]
y = data['Grade']

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_sta

# Create a multinomial logistic regression model

model = LogisticRegression(multi_class='multinomial', solver='lbfgs')

# Train the model on the training data

model.fit(X_train, y_train)

# Make predictions on the testing data

y_pred = model.predict(X_test)

# Evaluate the model

accuracy = accuracy_score(y_test, y_pred)
print(f"\nAccuracy: {accuracy:.2f}")

Dataset:
Hours_Studied Hours_Slept Grade
0 2 5 F
1 3 6 F
2 4 5 F
3 5 7 C
4 6 8 B
5 7 7 C
6 8 8 B
7 9 9 A
8 10 10 A

Accuracy: 1.00

In [119… # Display classification report

print("Classification Report:")
print(classification_report(y_test, y_pred))

Classification Report:
precision recall f1-score support

A 1.00 1.00 1.00 1

F 1.00 1.00 1.00 1

accuracy 1.00 2
macro avg 1.00 1.00 1.00 2
weighted avg 1.00 1.00 1.00 2

Multiple Linear Regression(using sklearn)

In [113… # Import necessary libraries
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score

# Create a simple dataset

data = pd.DataFrame({
'Hours_Studied': [2, 3, 4, 5, 6, 7, 8, 9, 10],
'Hours_Slept': [5, 6, 5, 7, 8, 7, 8, 9, 10],
'Score': [55, 65, 50, 80, 90, 75, 85, 95, 100] # Dependent variable
})

localhost:8888/nbconvert/html/practice .ipynb?download=false 5/9

12/22/23, 3:45 AM practice

# Display the dataset

print("Dataset:")
display(data)

# Split the data into training and testing sets

X = data[['Hours_Studied', 'Hours_Slept']]
y = data['Score']

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_sta

# Create a multiple linear regression model

model = LinearRegression()

# Train the model on the training data

model.fit(X_train, y_train)

# Make predictions on the testing data

y_pred = model.predict(X_test)

# Evaluate the model

mse = mean_squared_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)

print(f"\nMean Squared Error: {mse:.2f}")

print(f"R-squared: {r2:.2f}")

# Display the coefficients and intercept

print("\nCoefficients:")
print(model.coef_)
print("Intercept:", model.intercept_)

Dataset:
Hours_Studied Hours_Slept Score

0 2 5 55

1 3 6 65

2 4 5 50

3 5 7 80

4 6 8 90

5 7 7 75

6 8 8 85

7 9 9 95

8 10 10 100

Mean Squared Error: 6.60

R-squared: 0.97

Coefficients:
[-2.43842365 13.36206897]
Intercept: -4.384236453201979

MLR Using statsmodel

In [140… import statsmodels.api as sm
dataa = pd.DataFrame({

localhost:8888/nbconvert/html/practice .ipynb?download=false 6/9

12/22/23, 3:45 AM practice
'Hours_Studied': [2, 3, 4, 5, 6, 7, 8, 9, 10],
'Hours_Slept': [5, 6, 5, 7, 8, 7, 8, 9, 10],
'Score': [55, 65, 50, 80, 90, 75, 85, 95, 100] # Dependent variable
})
print(dataa)
m = dataa[['Hours_Studied','Hours_Slept']]
n = dataa['Score']

mlr = sm.OLS(n,m).fit()
print("Params:")
print(mlr.params)
y_pred = mlr.predict(m)
print('Y Pred: ')
print(y_pred)

Hours_Studied Hours_Slept Score

0 2 5 55
1 3 6 65
2 4 5 50
3 5 7 80
4 6 8 90
5 7 7 75
6 8 8 85
7 9 9 95
8 10 10 100
Params:
Hours_Studied -1.428571
Hours_Slept 11.890756
dtype: float64
Y Pred:
0 56.596639
1 67.058824
2 53.739496
3 76.092437
4 86.554622
5 73.235294
6 83.697479
7 94.159664
8 104.621849
dtype: float64

In [142… rsquare = r2_score(y_pred,n)

print(r2)

0.9706465820573179

In [143… mlr.summary2()

C:\Users\Ishant\anaconda3\Lib\site-packages\scipy\stats\_stats_py.py:1736: UserWar
ning: kurtosistest only valid for n>=20 ... continuing anyway, n=9
warnings.warn("kurtosistest only valid for n>=20 ... continuing "

localhost:8888/nbconvert/html/practice .ipynb?download=false 7/9

12/22/23, 3:45 AM practice

Out[143]: Model: OLS Adj. R-squared (uncentered): 0.998

Dependent Variable: Score AIC: 48.5980

Date: 2023-12-21 18:23 BIC: 48.9925

No. Observations: 9 Log-Likelihood: -22.299

Df Model: 2 F-statistic: 2623.

Df Residuals: 7 Prob (F-statistic): 8.64e-11

R-squared (uncentered): 0.999 Scale: 10.684

Coef. Std.Err. t P>|t| [0.025 0.975]

Hours_Studied -1.4286 0.7787 -1.8346 0.1092 -3.2699 0.4127

Hours_Slept 11.8908 0.6872 17.3024 0.0000 10.2657 13.5158

Omnibus: 1.260 Durbin-Watson: 1.268

Prob(Omnibus): 0.533 Jarque-Bera (JB): 0.686

Skew: -0.168 Prob(JB): 0.710

Kurtosis: 1.689 Condition No.: 9

Notes:
[1] R² is computed without centering (uncentered) since the model does not contain a
constant.
[2] Standard Errors assume that the covariance matrix of the errors is correctly specified.

In [ ]:

In [107… #Reshaping the data

X_train.values.reshape(-1,1)

array([[ 7],
Out[107]:
[ 7],
[ 2],
[ 5],
[10],
[10],
[ 4],
[ 5],
[ 6],
[ 8],
[ 5],
[ 7],
[ 8],
[ 8]], dtype=int64)

DataFrame from Dictionary

In [106… dataas = pd.DataFrame(data)
dataas

localhost:8888/nbconvert/html/practice .ipynb?download=false 8/9

12/22/23, 3:45 AM practice

Out[106]: Hours_Studied Hours_Slept Pass

0 2 5 0

1 3 6 0

2 4 5 0

3 5 7 0

4 6 8 1

5 7 7 1

6 8 8 1

7 9 9 1

8 10 10 1

In [165…

Cell In[165], line 1

jupyter nbconvert --to FORMAT notebook.ipynb
^
SyntaxError: invalid syntax

In [ ]:

localhost:8888/nbconvert/html/practice .ipynb?download=false 9/9

2020 Electrical Engineering Paper-1 (PCC-EE-301) : Circuit Theory Total Marks - 70 Duration:3 Hrs
No ratings yet
2020 Electrical Engineering Paper-1 (PCC-EE-301) : Circuit Theory Total Marks - 70 Duration:3 Hrs
5 pages
22K61A0654 2 Sasi Auto
No ratings yet
22K61A0654 2 Sasi Auto
24 pages
Maths Class Ix Chapter 01 02 and 03 Practice Paper 01 Answers
67% (3)
Maths Class Ix Chapter 01 02 and 03 Practice Paper 01 Answers
6 pages
ML Journal
No ratings yet
ML Journal
45 pages
ML Lab 8
No ratings yet
ML Lab 8
9 pages
Assignment 2 Regression2
No ratings yet
Assignment 2 Regression2
4 pages
Swadhyay Assignment Logarithm Allen
No ratings yet
Swadhyay Assignment Logarithm Allen
12 pages
Intro To Neural Networks Explained For Beginners: Sajjad Mustafa
No ratings yet
Intro To Neural Networks Explained For Beginners: Sajjad Mustafa
110 pages
Machine Intelligence
No ratings yet
Machine Intelligence
24 pages
AIML Week7 Week8 Week9
No ratings yet
AIML Week7 Week8 Week9
6 pages
ML Mini Project
No ratings yet
ML Mini Project
9 pages
MATH1152 - Set Theory Notes
No ratings yet
MATH1152 - Set Theory Notes
6 pages
Deep Learning Assignments
No ratings yet
Deep Learning Assignments
5 pages
ML Lab Assessment 4
No ratings yet
ML Lab Assessment 4
4 pages
SPPUML5
No ratings yet
SPPUML5
4 pages
Ai Practicle
No ratings yet
Ai Practicle
8 pages
Shubham Pract 6 - Merged
No ratings yet
Shubham Pract 6 - Merged
12 pages
Machine Learning Assignment
No ratings yet
Machine Learning Assignment
7 pages
TP Minor Test 4 P 1 Enthuse Jee (Advanced) 26.08.2024 f1
No ratings yet
TP Minor Test 4 P 1 Enthuse Jee (Advanced) 26.08.2024 f1
19 pages
P17111204047 - Andini Ibriliyanti - 3B - Epiinfo
No ratings yet
P17111204047 - Andini Ibriliyanti - 3B - Epiinfo
7 pages
Open Lab 2
No ratings yet
Open Lab 2
15 pages
Machine Learnin1
100% (1)
Machine Learnin1
41 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
30 pages
Simple and Multiple Regression
No ratings yet
Simple and Multiple Regression
9 pages
Aiml Exp 7
No ratings yet
Aiml Exp 7
10 pages
Science of The Egg Drop1
No ratings yet
Science of The Egg Drop1
2 pages
Print Version
No ratings yet
Print Version
29 pages
LR, Decision Tree
No ratings yet
LR, Decision Tree
48 pages
Data Mining Lab Manual
No ratings yet
Data Mining Lab Manual
7 pages
Regress Project
No ratings yet
Regress Project
5 pages
Datascience PR 6 Veda
No ratings yet
Datascience PR 6 Veda
6 pages
TSF Task1 GRIP
No ratings yet
TSF Task1 GRIP
4 pages
Ary Reg
No ratings yet
Ary Reg
10 pages
Friction - DPPs
No ratings yet
Friction - DPPs
11 pages
Lab5 Linear Regression
No ratings yet
Lab5 Linear Regression
1 page
ML Lab Record - 250625 - 105014
No ratings yet
ML Lab Record - 250625 - 105014
29 pages
Mini-Frac Analysis For Unconventional Reservoirs Using Fast Welltest 16-Aug-2013 0
100% (1)
Mini-Frac Analysis For Unconventional Reservoirs Using Fast Welltest 16-Aug-2013 0
44 pages
Trading Strategies Market Colour Ravi Kashyap 2018
No ratings yet
Trading Strategies Market Colour Ravi Kashyap 2018
26 pages
ML Interview Questions
No ratings yet
ML Interview Questions
10 pages
Garishav Basra 102103129 2CO5
No ratings yet
Garishav Basra 102103129 2CO5
8 pages
Sample QP
No ratings yet
Sample QP
4 pages
Ec8352 Ss Model 1
No ratings yet
Ec8352 Ss Model 1
2 pages
CH2114
No ratings yet
CH2114
2 pages
Machine Learning
No ratings yet
Machine Learning
10 pages
01 Machine Learning
No ratings yet
01 Machine Learning
25 pages
Payal Practical5 Edited
No ratings yet
Payal Practical5 Edited
5 pages
Unit 13: Bernoulli, Binomial, Geometric and Poisson Distributions and Their Applications
No ratings yet
Unit 13: Bernoulli, Binomial, Geometric and Poisson Distributions and Their Applications
4 pages
Week 1 - Introduction To Discrete Structures
No ratings yet
Week 1 - Introduction To Discrete Structures
3 pages
Ai Lab
No ratings yet
Ai Lab
19 pages
Ann Experiential Learning
No ratings yet
Ann Experiential Learning
43 pages
Ricco Serial Verb Constructions in Three-Participant Event
No ratings yet
Ricco Serial Verb Constructions in Three-Participant Event
50 pages
I Avaliação Parcial - 25.0 PTS - Gabarito
No ratings yet
I Avaliação Parcial - 25.0 PTS - Gabarito
9 pages
DSBDA05
No ratings yet
DSBDA05
5 pages
1.3 Translational Equilibrium Statics
No ratings yet
1.3 Translational Equilibrium Statics
55 pages
ML Lab
No ratings yet
ML Lab
23 pages
Introduction To Artificial Intelligence: by Tanu Dixit CS-3 Year
No ratings yet
Introduction To Artificial Intelligence: by Tanu Dixit CS-3 Year
33 pages
Bitsat Paper 4
No ratings yet
Bitsat Paper 4
19 pages
Machine Learning Strategies
No ratings yet
Machine Learning Strategies
59 pages
Import As Import As Import As: "Default - CSV"
No ratings yet
Import As Import As Import As: "Default - CSV"
9 pages
Math 1210 Project 2
No ratings yet
Math 1210 Project 2
3 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
47 pages
St. John College of Engineering and Management, Palghar - Maharashtra
No ratings yet
St. John College of Engineering and Management, Palghar - Maharashtra
11 pages
Index: Name - JINESH PRAJAPAT Class - B. Tech, III Year Branch - AI & DS Sem - V
No ratings yet
Index: Name - JINESH PRAJAPAT Class - B. Tech, III Year Branch - AI & DS Sem - V
35 pages
Dsbda 5
No ratings yet
Dsbda 5
4 pages
Student - Linear Regression Example - Colaboratory
No ratings yet
Student - Linear Regression Example - Colaboratory
6 pages
DataAnalytics Lab Manual
No ratings yet
DataAnalytics Lab Manual
35 pages
ML Practical File
No ratings yet
ML Practical File
30 pages
ML Record Print
No ratings yet
ML Record Print
20 pages
5 - A/D and D/A Conversion: Systems For Digital Signal Processing
No ratings yet
5 - A/D and D/A Conversion: Systems For Digital Signal Processing
35 pages
Machine Learning Lab Assignment CSE-716: S. M. Shafkat Raihan ID: 16701041 SESSION: 2015-16
No ratings yet
Machine Learning Lab Assignment CSE-716: S. M. Shafkat Raihan ID: 16701041 SESSION: 2015-16
9 pages
G 203008076 - 4 - Christhian Quiñonez - Ex1 - 2 A PDF
No ratings yet
G 203008076 - 4 - Christhian Quiñonez - Ex1 - 2 A PDF
20 pages
19BCS2059 DL1
No ratings yet
19BCS2059 DL1
4 pages
Multi - Class - Scaled - Down - Data - Colaboratory
No ratings yet
Multi - Class - Scaled - Down - Data - Colaboratory
2 pages
1.3 - Super Elevation Equilibrium Cant Etc.
No ratings yet
1.3 - Super Elevation Equilibrium Cant Etc.
53 pages
ML RECORD - Merged
No ratings yet
ML RECORD - Merged
33 pages
FYMCA IDSLab A6 Submission
No ratings yet
FYMCA IDSLab A6 Submission
9 pages
M911 G11 - Transformation Geometry
No ratings yet
M911 G11 - Transformation Geometry
12 pages
Bda Assign
No ratings yet
Bda Assign
15 pages
Physics
No ratings yet
Physics
68 pages
2D Shapes
No ratings yet
2D Shapes
62 pages
Oracle SQL Cheatsheet
No ratings yet
Oracle SQL Cheatsheet
2 pages
GATE Electromagnetic Theory Book
No ratings yet
GATE Electromagnetic Theory Book
12 pages
Methods and Tools Used in Criticality Analysis in Industrial Systems
100% (1)
Methods and Tools Used in Criticality Analysis in Industrial Systems
18 pages
(Download) SSC - CGL Tier-II Exam Paper-I (Arithmetical Ability) Held On - 16-09-2012 - SSCPORTAL PDF
No ratings yet
(Download) SSC - CGL Tier-II Exam Paper-I (Arithmetical Ability) Held On - 16-09-2012 - SSCPORTAL PDF
12 pages
20dit073 Jay Prajapati ML
No ratings yet
20dit073 Jay Prajapati ML
68 pages
ZXF01U03
No ratings yet
ZXF01U03
4 pages
Project Planning and Approval Worksheet
100% (2)
Project Planning and Approval Worksheet
8 pages
Chandigarh Group of Colleges College of Engineering Landran, Mohali
No ratings yet
Chandigarh Group of Colleges College of Engineering Landran, Mohali
47 pages
Blazor and API Example: Classroom Quiz Application
From Everand
Blazor and API Example: Classroom Quiz Application
Taurius Litvinavicius
No ratings yet
Apache Cassandra Administrator Associate - Exam Practice Tests
From Everand
Apache Cassandra Administrator Associate - Exam Practice Tests
Cristian Scutaru
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.