ADS Exp-1
Rohit Goud
Import Libraries
import pandas as pd
import numpy as np
import seaborn as sns # For visualization
import matplotlib.pyplot as plt # For plotting graphs
from scipy import stats # For statistical analysis
from scipy.stats import poisson, chi2_contingency, f_oneway
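The cells that follow use `df`, but the cell that loads the dataset does not appear in this export. A minimal sketch of how it might be loaded is shown here. The filename `diabetes.csv`, the helper `load_dataset`, and the placeholder row in the demo are assumptions for illustration only; the column names are taken from the `df.info()` output.

```python
import io
import pandas as pd

# Column names as listed in the df.info() output.
EXPECTED_COLUMNS = [
    "Pregnancies", "Glucose", "BloodPressure", "SkinThickness", "Insulin",
    "BMI", "DiabetesPedigreeFunction", "Age", "Outcome",
]

def load_dataset(source):
    """Read a CSV (path or file-like object) and check the expected columns.

    Hypothetical helper: it is not part of the original notebook.
    """
    frame = pd.read_csv(source)
    missing = [col for col in EXPECTED_COLUMNS if col not in frame.columns]
    if missing:
        raise ValueError(f"Missing expected columns: {missing}")
    return frame

# In the notebook this might read (hypothetical filename):
# df = load_dataset("diabetes.csv")

# Self-contained demo using one made-up placeholder row, not real data.
sample_csv = io.StringIO(",".join(EXPECTED_COLUMNS) + "\n1,2,3,4,5,6.5,0.5,30,0\n")
sample_df = load_dataset(sample_csv)
print(sample_df.shape)
```

Checking the columns up front makes a wrong or truncated file fail early instead of surfacing later as a `KeyError` in the cleaning or statistics cells.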
print("Dataset Info:")
df.info()
Dataset Info:
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 768 entries, 0 to 767
Data columns (total 9 columns):
 #   Column                    Non-Null Count  Dtype
---  ------                    --------------  -----
 0   Pregnancies               768 non-null    int64
 1   Glucose                   768 non-null    int64
 2   BloodPressure             768 non-null    int64
 3   SkinThickness             768 non-null    int64
 4   Insulin                   768 non-null    int64
 5   BMI                       768 non-null    float64
 6   DiabetesPedigreeFunction  768 non-null    float64
 7   Age                       768 non-null    int64
 8   Outcome                   768 non-null    int64
dtypes: float64(2), int64(7)
memory usage: 54.1 KB
print("\nFirst 5 rows:")
print(df.head())
First 5 rows:
   Pregnancies  Glucose  BloodPressure  SkinThickness  Insulin   BMI  \
0            6      148             72             35        0  33.6
1            1       85             66             29        0  26.6
2            8      183             64              0        0  23.3
3            1       89             66             23       94  28.1
4            0      137             40             35      168  43.1
Data Cleaning
https://colab.research.google.com/drive/1aL4Hcol5NWRAh7AJ9uJv6plSMcoCg8Q6#scrollTo=st01zBF7kzvh&printMode=true 1/3
2/4/25, 9:00 PM ADS.ipynb - Colab
Replace the zero values in each column with the mean of that column (excluding the Outcome column).
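The code for this step is not included in the export, so the following is only a sketch of one way to do it. The helper name `replace_zeros_with_mean` and the toy frame are hypothetical. It also assumes the mean is computed from the non-zero entries only (treating zeros as missing); if the plain column mean is intended, `values.mean()` can be used instead.

```python
import pandas as pd

def replace_zeros_with_mean(frame, exclude=("Outcome",)):
    """Return a copy where zeros in each non-excluded column are replaced
    by that column's mean.

    Assumption: the mean is taken over the non-zero entries only.
    """
    cleaned = frame.copy()
    for col in cleaned.columns:
        if col in exclude:
            continue
        # Cast to float so the (generally fractional) mean fits the column.
        values = cleaned[col].astype(float)
        nonzero_mean = values[values != 0].mean()
        # Keep non-zero entries, substitute the mean where the value is zero.
        cleaned[col] = values.where(values != 0, nonzero_mean)
    return cleaned

# Toy frame for illustration only (not the real dataset).
demo = pd.DataFrame({
    "Glucose": [148, 0, 183, 89],
    "BMI": [33.6, 26.6, 0.0, 28.1],
    "Outcome": [1, 0, 1, 0],
})
cleaned_demo = replace_zeros_with_mean(demo)
print(cleaned_demo)
```

In the notebook this would be applied as `df = replace_zeros_with_mean(df)`. The function works on a copy, so the original frame stays available for comparison.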
Basic Statistics
print("\nDescriptive Statistics:")
print(df.describe())
Descriptive Statistics:
       Pregnancies     Glucose  BloodPressure  SkinThickness     Insulin  \
count   768.000000  768.000000     768.000000     768.000000  768.000000
mean      3.845052  120.894531      69.105469      20.536458   79.799479
std       3.369578   31.972618      19.355807      15.952218  115.244002
min       0.000000    0.000000       0.000000       0.000000    0.000000
25%       1.000000   99.000000      62.000000       0.000000    0.000000
50%       3.000000  117.000000      72.000000      23.000000   30.500000
75%       6.000000  140.250000      80.000000      32.000000  127.250000
max      17.000000  199.000000     122.000000      99.000000  846.000000
Poisson Distribution
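The plotting cell in this section uses `lambda_val`, `x_vals`, and `poisson_probs`, whose definitions are not part of the export. One way they could be defined is sketched here. It assumes λ is the mean of a count-valued column such as Pregnancies (3.845052 in the descriptive statistics) and that the PMF is evaluated over 0 to 17, the observed range of that column. Both choices are assumptions, not the notebook's confirmed values.

```python
import numpy as np
from scipy.stats import poisson

# Assumption: lambda is the Pregnancies mean reported by df.describe().
# With the dataset loaded, this could instead be df["Pregnancies"].mean().
lambda_val = 3.845052

# Assumption: evaluate the PMF on the integers 0..17 (observed min and max).
x_vals = np.arange(0, 18)

# P(X = k) = exp(-lambda) * lambda**k / k! for each k in x_vals.
poisson_probs = poisson.pmf(x_vals, mu=lambda_val)

print(f"P(X = 0) = {poisson_probs[0]:.4f}")
print(f"Most likely count: {x_vals[poisson_probs.argmax()]}")
```

For a non-integer λ the most likely count is the integer part of λ, which gives a quick sanity check on the bar chart.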
plt.figure(figsize=(6, 4))
plt.bar(x_vals, poisson_probs, color='blue', alpha=0.7)
plt.title(f"Poisson Distribution (λ = {lambda_val:.2f})")
plt.xlabel("Number of Events")
plt.ylabel("Probability")
plt.show()