This document analyzes a diabetes dataset with 768 rows and 9 columns containing information like pregnancies, glucose level, blood pressure, age, and patient outcome. The dataset is loaded and explored, including checking data types and calculating the correlation between columns. Zero values are replaced with column means. A neural network model is created and trained to predict patient outcomes based on the other column values.

8/31/2021 Untitled6.ipynb - Colaboratory

Name : YANDRAPU MANOJ NAIDU

Roll no: 20MDT1017

from google.colab import files
uploaded = files.upload()

Diabetes.csv(application/vnd.ms-excel) - 23873 bytes, last modified: 8/31/2021 - 100% done
Saving Diabetes.csv to Diabetes (1).csv

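import io
import pandas as pd

# Colab may rename a re-uploaded file (e.g. "Diabetes (1).csv"),
# so read whichever file was just uploaded instead of hard-coding the key
filename = next(iter(uploaded))
df = pd.read_csv(io.BytesIO(uploaded[filename]))
print(df)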

     Pregnancies  Glucose  ...  Age  Outcome
0              6      148  ...   50        1
1              1       85  ...   31        0
2              8      183  ...   32        1
3              1       89  ...   21        0
4              0      137  ...   33        1
..           ...      ...  ...  ...      ...
763           10      101  ...   63        0
764            2      122  ...   27        0
765            5      121  ...   30        0
766            1      126  ...   47        1
767            1       93  ...   23        0

[768 rows x 9 columns]

df.head()

   Pregnancies  Glucose  BloodPressure  SkinThickness  Insulin   BMI  DiabetesPedigre
0            6      148             72             35        0  33.6
1            1       85             66             29        0  26.6
2            8      183             64              0        0  23.3
3            1       89             66             23       94  28.1
4            0      137             40             35      168  43.1

The Diabetes dataset has 9 columns in total.

df.dtypes

Pregnancies                   int64
Glucose                       int64
BloodPressure                 int64
SkinThickness                 int64
Insulin                       int64
BMI                         float64
DiabetesPedigreeFunction    float64
Age                           int64
Outcome                       int64
dtype: object

Data type of each column.

correlation=df.corr()
correlation.style.background_gradient(cmap='coolwarm')

Pregnancies Glucose BloodPressure SkinThickness Insulin BM

Pregnancies 1.000000 0.129459 0.141282 -0.081672 -0.073535 0.0176
Glucose 0.129459 1.000000 0.152590 0.057328 0.331357 0.2210
BloodPressure 0.141282 0.152590 1.000000 0.207371 0.088933 0.2818
SkinThickness -0.081672 0.057328 0.207371 1.000000 0.436783 0.3925
Insulin -0.073535 0.331357 0.088933 0.436783 1.000000 0.1978
BMI 0.017683 0.221071 0.281805 0.392573 0.197859 1.0000
DiabetesPedigreeFunction -0.033523 0.137337 0.041265 0.183928 0.185071 0.1406
Age 0.544341 0.263514 0.239528 -0.113970 -0.042163 0.0362
Outcome 0.221898 0.466581 0.065068 0.074752 0.130548 0.2926

Correlation matrix for the diabetes dataset.

# Styler.set_precision was deprecated in pandas 1.3; format(precision=2) is the replacement
correlation.style.background_gradient(cmap='coolwarm').format(precision=2)

Pregnancies Glucose BloodPressure SkinThickness Insulin BMI Diab

Pregnancies 1.00 0.13 0.14 -0.08 -0.07 0.02 -0.03
Glucose 0.13 1.00 0.15 0.06 0.33 0.22 0.14
BloodPressure 0.14 0.15 1.00 0.21 0.09 0.28 0.04
SkinThickness -0.08 0.06 0.21 1.00 0.44 0.39 0.18
Insulin -0.07 0.33 0.09 0.44 1.00 0.20 0.19
BMI 0.02 0.22 0.28 0.39 0.20 1.00 0.14
DiabetesPedigreeFunction -0.03 0.14 0.04 0.18 0.19 0.14 1.00
Age 0.54 0.26 0.24 -0.11 -0.04 0.04 0.03
Outcome 0.22 0.47 0.07 0.07 0.13 0.29 0.17

Rounding the values to two decimal places.
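The Outcome row of the matrix is the part that matters most for prediction, so it can help to sort the features by the strength of their correlation with the target. The following is only a sketch: the helper name `rank_by_target_correlation` is hypothetical, and the small frame reuses the Glucose, BloodPressure and Outcome values from the first five rows printed earlier rather than the full dataset.

```python
import pandas as pd


def rank_by_target_correlation(frame, target):
    """Return each feature's correlation with `target`, strongest first."""
    corr = frame.corr()[target].drop(target)
    # Order by absolute value so strong negative correlations rank high too.
    order = corr.abs().sort_values(ascending=False).index
    return corr.reindex(order)


# First five rows shown by df.head(), three columns only (illustration).
sample = pd.DataFrame({
    "Glucose": [148, 85, 183, 89, 137],
    "BloodPressure": [72, 66, 64, 66, 40],
    "Outcome": [1, 0, 1, 0, 1],
})

ranking = rank_by_target_correlation(sample, "Outcome")
print(ranking)
```

On the full data the same call would be `rank_by_target_correlation(df, "Outcome")`; in the rounded matrix, Glucose (0.47) has the largest Outcome entry among the columns that are visible.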

import matplotlib.pyplot as plt
plt.matshow(df.corr())
plt.show()


age=df['Age']
out=df['Outcome']

import matplotlib.pyplot as plt
plt.bar(age,out)
plt.show()

Bar chart of age versus outcome.

df.boxplot(by ='Outcome', column =['Insulin'], grid = False)


/usr/local/lib/python3.7/dist-packages/numpy/core/_asarray.py:83: VisibleDeprecationW
return array(a, dtype, copy=False, order=order)

<matplotlib.axes._subplots.AxesSubplot at 0x7fdd06376050>

# count the zero entries in each column
for i in df.columns:
    print(i, ":", (df[i] == 0).sum())

Pregnancies : 111
Glucose : 5
BloodPressure : 35
SkinThickness : 227
Insulin : 374
BMI : 11
DiabetesPedigreeFunction : 0
Age : 0
Outcome : 500

Number of zeros in each column.

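# replace zeros with the column mean (the mean is computed with the zeros included)
# note: this runs over every column, so valid zeros in Pregnancies and Outcome are replaced too
for col in df.columns:
    val = df[col].mean()
    df[col] = df[col].replace(0, val)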

Zeros replaced with the column means.
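Because a zero is a legitimate value for Pregnancies and for Outcome, a more selective variant would touch only the measurement columns, where a zero most plausibly stands for a missing reading. The sketch below is an alternative to the loop in the notebook, not what the notebook ran. The column list `ZERO_AS_MISSING` and the function name are assumptions, and the means are taken over the non-zero entries only. The sample frame reuses the first five rows printed by `df.head()`.

```python
import numpy as np
import pandas as pd

# Assumption: a zero in these columns means "not measured".
ZERO_AS_MISSING = ["Glucose", "BloodPressure", "SkinThickness", "Insulin", "BMI"]


def impute_zero_measurements(frame, columns=ZERO_AS_MISSING):
    """Replace zeros in `columns` with the mean of the non-zero entries."""
    out = frame.copy()
    # Mark zeros as missing so they do not pull the mean down.
    out[columns] = out[columns].replace(0, np.nan)
    # fillna with a Series fills each column with its own mean.
    out[columns] = out[columns].fillna(out[columns].mean())
    return out


sample = pd.DataFrame({
    "Pregnancies": [6, 1, 8, 1, 0],
    "Glucose": [148, 85, 183, 89, 137],
    "BloodPressure": [72, 66, 64, 66, 40],
    "SkinThickness": [35, 29, 0, 23, 35],
    "Insulin": [0, 0, 0, 94, 168],
    "BMI": [33.6, 26.6, 23.3, 28.1, 43.1],
})

fixed = impute_zero_measurements(sample)
print(fixed)
```

In this sample the zero in Pregnancies is left alone, while the Insulin zeros become the mean of the two recorded readings.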

df.head(10)

   Pregnancies  Glucose  BloodPressure  SkinThickness     Insulin        BMI  Diabete
0     6.000000    148.0      72.000000      35.000000   79.799479  33.600000
1     1.000000     85.0      66.000000      29.000000   79.799479  26.600000
2     8.000000    183.0      64.000000      20.536458   79.799479  23.300000
3     1.000000     89.0      66.000000      23.000000   94.000000  28.100000
4     3.845052    137.0      40.000000      35.000000  168.000000  43.100000
5     5.000000    116.0      74.000000      20.536458   79.799479  25.600000
6     3.000000     78.0      50.000000      32.000000   88.000000  31.000000
7    10.000000    115.0      69.105469      20.536458   79.799479  35.300000
8     2.000000    197.0      70.000000      45.000000  543.000000  30.500000
9     8.000000    125.0      96.000000      20.536458   79.799479  31.992578

df.boxplot(by ='Outcome', column =['Insulin'], grid = False)


/usr/local/lib/python3.7/dist-packages/numpy/core/_asarray.py:83: VisibleDeprecationW
return array(a, dtype, copy=False, order=order)

<matplotlib.axes._subplots.AxesSubplot at 0x7fdd085f80d0>

# split into input and output columns
X, y = df.values[:, :-1], df.values[:, -1]

type(X)

numpy.ndarray

from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder
from tensorflow.keras import Sequential
from tensorflow.keras.layers import Dense
# ensure all data are floating point values
X = X.astype('float32')
# encode the two distinct outcome values as integer class labels 0 and 1
y = LabelEncoder().fit_transform(y)

# split into train and test datasets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.33)
print(X_train.shape, X_test.shape, y_train.shape, y_test.shape)

(514, 8) (254, 8) (514,) (254,)
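The shapes above follow from the arithmetic of the split: 33% of 768 rows is 253.44, which rounds up to 254 test rows and leaves 514 for training. The NumPy sketch below reproduces those sizes with a seeded shuffle. It is an illustration under assumptions, not scikit-learn's implementation: the function name `shuffle_split` and the `seed` parameter are hypothetical, and the arrays are zero-filled placeholders with the dataset's dimensions.

```python
import math

import numpy as np


def shuffle_split(X, y, test_size=0.33, seed=0):
    """Shuffle the row indices once, then cut them into test and train parts."""
    n_samples = len(X)
    n_test = math.ceil(test_size * n_samples)  # 253.44 rounds up to 254
    rng = np.random.default_rng(seed)          # fixed seed gives a repeatable split
    order = rng.permutation(n_samples)
    test_idx, train_idx = order[:n_test], order[n_test:]
    return X[train_idx], X[test_idx], y[train_idx], y[test_idx]


# Placeholder arrays with the same shape as the diabetes features and labels.
X_demo = np.zeros((768, 8), dtype="float32")
y_demo = np.zeros(768, dtype="int64")

X_tr, X_te, y_tr, y_te = shuffle_split(X_demo, y_demo)
print(X_tr.shape, X_te.shape, y_tr.shape, y_te.shape)
```

With scikit-learn itself, passing `random_state` to `train_test_split` serves the same purpose of making the split, and therefore the reported accuracy, repeatable between runs.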

# determine the number of input features
n_features = X_train.shape[1]

# define model
model = Sequential()
model.add(Dense(10, activation='relu', kernel_initializer='he_normal', input_shape=(n_features,)))
model.add(Dense(8, activation='relu', kernel_initializer='he_normal'))
model.add(Dense(1, activation='sigmoid'))

# compile the model
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])

# fit the model
model.fit(X_train, y_train, epochs=150, batch_size=32, verbose=0)


<keras.callbacks.History at 0x7fdd06210110>

# evaluate the model
loss, acc = model.evaluate(X_test, y_test, verbose=0)
print('Test Accuracy: %.3f' % acc)

Test Accuracy: 0.661
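An accuracy figure is easier to judge next to a trivial baseline. The zero count earlier shows that 500 of the 768 rows have Outcome 0, so a classifier that always predicts 0 would be right on roughly 65% of the full dataset. The sketch below computes that figure; the function name is hypothetical, and the label array is rebuilt from the two class counts rather than read from the file. The test split has its own class ratio, so this is a rough reference point, not an exact comparison.

```python
import numpy as np


def majority_baseline_accuracy(labels):
    """Accuracy obtained by always predicting the most frequent class."""
    labels = np.asarray(labels)
    _, counts = np.unique(labels, return_counts=True)
    return counts.max() / labels.size


# 500 rows with Outcome 0 and the remaining 268 rows with Outcome 1.
y_all = np.array([0] * 500 + [1] * 268)

baseline = majority_baseline_accuracy(y_all)
print('Majority-class baseline: %.3f' % baseline)
```

Seen this way, a test accuracy of 0.661 is only marginally above what always predicting the majority class would give.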

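# make a prediction
import numpy as np
row = np.array([[1,0,0.99539,-0.05889,0.85243,0.02306,0.83398,-0.37708]])
# row is already 2-D with shape (1, 8), so pass it to predict directly
yhat = model.predict(row)
# predict returns an array of shape (1, 1); index it to get the scalar
print('Predicted: %.3f' % yhat[0, 0])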

Predicted: 0.204

import numpy as np
row1=np.array([[1,0,0.99539,-0.05889,0.85243,0.02306,0.83398,-0.37708]])
row1.shape

(1, 8)

yhat = model.predict(row1)
print('Predicted: %.3f' % yhat[0, 0])

Predicted: 0.204
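The sigmoid output is a probability for class 1, not a class label. A minimal sketch of the final step, assuming the conventional cut-off of 0.5 (the notebook does not state a threshold, and `to_class_labels` is a hypothetical helper):

```python
import numpy as np


def to_class_labels(probabilities, threshold=0.5):
    """Map sigmoid outputs to 0/1 labels; values at or above the threshold become 1."""
    probabilities = np.asarray(probabilities, dtype=float)
    return (probabilities >= threshold).astype(int)


# The prediction printed above, in the (1, 1) shape that predict returns.
print(to_class_labels([[0.204]]))
```

With that threshold, the predicted value of 0.204 maps to class 0.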

model.summary()

Model: "sequential_1"
_________________________________________________________________
Layer (type)                 Output Shape              Param #
=================================================================
dense_3 (Dense)              (None, 10)                90
_________________________________________________________________
dense_4 (Dense)              (None, 8)                 88
_________________________________________________________________
dense_5 (Dense)              (None, 1)                 9
=================================================================
Total params: 187
Trainable params: 187
Non-trainable params: 0
_________________________________________________________________
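The parameter counts in the summary can be checked by hand: a fully connected layer has one weight per input for each unit, plus one bias per unit, which gives (inputs + 1) × units. The short sketch below applies that formula to the layer widths of this model (8 input features, then 10, 8 and 1 units); the helper name `dense_param_count` is hypothetical.

```python
def dense_param_count(n_inputs, n_units):
    """Weights (n_inputs * n_units) plus one bias per unit."""
    return (n_inputs + 1) * n_units


# Input width followed by the width of each Dense layer.
layer_sizes = [8, 10, 8, 1]

counts = [dense_param_count(a, b) for a, b in zip(layer_sizes, layer_sizes[1:])]
print(counts, sum(counts))  # [90, 88, 9] 187
```

The result matches the 90, 88 and 9 parameters reported per layer and the total of 187.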

df.boxplot(by ='Outcome', column =['Insulin'], grid = False)


/usr/local/lib/python3.7/dist-packages/numpy/core/_asarray.py:83: VisibleDeprecationW
return array(a, dtype, copy=False, order=order)

<matplotlib.axes._subplots.AxesSubplot at 0x7fdd086bc450>
