Eda - 1@3pm 8th Nov
Eda - 1@3pm 8th Nov
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
Conclusion:After printing head of the data we get to know these data represent
about US adult citizens, containing information about demographics, employment,
education, and salary brackets and also we get to know what are the columns
name(age,workclass etc) and there pupose. and also get to know what type of data
(cat or num) in each column
CONCLUSION: After finding the shape of the data we get to know that these dataset
has 32,561 rows and 15 columns. This means it contains information about 32,561
people, including their age, job type, education, and salary.
Conclusion:After finding the info of the data we get to know that these dataset
contains 32,561 entries and 15 columns. and also type of data in each column.
There are 5 int columns, 1 float column, and 9 object columns
And we get to know count of non null entries in each column. we get to know there
are some missing values in education-num, relationship columns.
and also we get to know that 3.7+MB of memory was used by these dataset.
Conclusion: After finding the no of duplicated rows we get to know that out of
32561 rows we have 24 duplicate rows and 32537 non duplicate rows.
Conclusion:
The average age is about 38.6 years, with ages ranging from 17 to 90.
The average final weight (fnlwgt) is around 189,778.
The average capital gain is 1,078, and the average capital loss is 87.