Data - Analysis On Rape Victims
DATA ANALYSIS
Name: Vaishnavi. K
Roll. No: 20BIT060
Department: IT
OBJECTIVE:
The objective of the data analysis is to present the records of rape cases
grouped by state and by victim age group.
The dataset contains 1051 records under 11 attributes: area name, year,
subgroup, rape cases reported, victims above 50 years, victims aged 10-14 years,
14-18 years, 18-30 years and 30-50 years, total cases of rape, and victims below
the age of 10.
The analysis gives an overview of the rape cases reported by area and by age
group, and presents the data statistically with the help of graphs.
The main purpose of this analysis is to spread awareness about the crime and to
encourage better steps to ensure the safety of women.
Reading the CSV file and displaying the result shows the data set as it appears
in the Excel sheet.
Here I have used head(51), so the first 51 rows of the data sheet are displayed.
Each row is the record for one state, and the columns give the area name, the
year, the age-wise victim counts, the total cases and the cases that have been
reported.
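The loading step described above can be sketched as follows. This is a minimal illustration, not the original notebook code: the rows are made-up sample values, and the column names Cases_Reported and Victims_Total are assumptions (only Area_Name and Subgroup are named in this report).

```python
import io
import pandas as pd

# Made-up sample rows standing in for the real CSV file. Cases_Reported and
# Victims_Total are hypothetical column names used only for illustration.
SAMPLE_CSV = """Area_Name,Year,Subgroup,Cases_Reported,Victims_Total
Assam,2001,Total Rape Victims,10,10
Bihar,2001,Total Rape Victims,8,9
Delhi,2001,Total Rape Victims,6,6
Assam,2002,Total Rape Victims,12,12
"""

# With the real file this would be pd.read_csv("<path to the csv file>").
df = pd.read_csv(io.StringIO(SAMPLE_CSV))

# head(n) returns the first n rows, so head(3) gives exactly 3 rows.
first_rows = df.head(3)
print(first_rows)
```

If n is larger than the number of rows, head(n) simply returns the whole data set.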
2. Getting to know about the data, its basic statistics and shape
The summary statistics show the count, mean and other measures for each column
of the data set. The mean of the rape cases reported is around 361.920 and that
of the total cases is around 362.198.
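A small sketch of how the shape and basic statistics might be obtained with pandas is given here. The data frame holds made-up values, and Cases_Reported and Victims_Total are assumed column names, so the numbers do not match the real data set.

```python
import pandas as pd

# Made-up values; Cases_Reported and Victims_Total are hypothetical names.
df = pd.DataFrame({
    "Area_Name": ["Assam", "Bihar", "Delhi", "Assam"],
    "Cases_Reported": [10.0, 8.0, 6.0, 12.0],
    "Victims_Total": [10.0, 9.0, 6.0, 12.0],
})

# shape is a (number of rows, number of columns) tuple.
print(df.shape)

# describe() summarises the numeric columns: count, mean, std, min,
# quartiles and max. Text columns such as Area_Name are left out by default.
stats = df.describe()
print(stats)
```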
GROUPING THE DATA
1. The next step was to gather information about the data types in the dataset
i. .info() displays the data type of each column in the dataset
ii. Except for Area_Name and Subgroup, which are of string data type, the
rest of the columns are of float data type
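The data type check described above could look like the sketch below. The values are made up and Cases_Reported is an assumed column name; pandas usually reports text columns such as Area_Name and Subgroup as object or string type, depending on the version.

```python
import pandas as pd
from pandas.api.types import is_float_dtype, is_numeric_dtype

# Made-up values; Cases_Reported is a hypothetical column name.
df = pd.DataFrame({
    "Area_Name": ["Assam", "Bihar", "Delhi"],
    "Subgroup": ["Total Rape Victims"] * 3,
    "Cases_Reported": [10.0, 8.0, 6.0],
})

# info() prints the column names, non-null counts and data types.
df.info()

# dtypes gives the same type information as a Series, which is easier
# to inspect in code.
print(df.dtypes)

# Collect the names of the float columns and of the non-numeric columns.
float_cols = [c for c in df.columns if is_float_dtype(df[c])]
text_cols = [c for c in df.columns if not is_numeric_dtype(df[c])]
print(float_cols, text_cols)
```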
2. The next step was to group the data and display the sum of the cases for
each area
i. .groupby() groups the data according to the attribute given. Since I
have given the attribute as Area_Name, the data is grouped area wise
ii. The result was then displayed using head(). Here I have given the
parameter as 10, so it displays the first 10 rows of the grouped data
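The grouping step above can be sketched as follows, assuming a hypothetical Cases_Reported column and made-up values. This is an illustration of groupby() followed by sum() and head(), not the exact code used for the report.

```python
import pandas as pd

# Made-up values; Cases_Reported is a hypothetical column name.
df = pd.DataFrame({
    "Area_Name": ["Assam", "Bihar", "Delhi", "Assam"],
    "Year": [2001, 2001, 2001, 2002],
    "Cases_Reported": [10.0, 8.0, 6.0, 12.0],
})

# Group the rows by area and add up the cases within each group.
by_area = df.groupby("Area_Name")["Cases_Reported"].sum()

# head(10) shows at most the first 10 groups (here there are only 3).
print(by_area.head(10))
```

By default groupby() sorts the group keys, so the areas appear in alphabetical order.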
3. The next step was to display the maximum number of cases in the data set
4. Next, I wanted to show the maximum cases that were reported area wise
As shown above, Assam and Bihar show higher reported cases than Delhi. However,
these figures cover only the cases that were reported. Incidents that are never
reported lower the apparent crime rate of an area.
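The two maximum steps described above (the overall maximum and the maximum for each area) might be written as in this sketch. The values are made up purely for illustration and Cases_Reported is an assumed column name.

```python
import pandas as pd

# Made-up values; Cases_Reported is a hypothetical column name.
df = pd.DataFrame({
    "Area_Name": ["Assam", "Bihar", "Delhi", "Assam"],
    "Year": [2001, 2001, 2001, 2002],
    "Cases_Reported": [10.0, 8.0, 6.0, 12.0],
})

# Largest single value in the column, across all areas and years.
overall_max = df["Cases_Reported"].max()
print(overall_max)

# Largest value within each area, sorted so the highest area comes first.
max_by_area = (
    df.groupby("Area_Name")["Cases_Reported"]
    .max()
    .sort_values(ascending=False)
)
print(max_by_area)
```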
There are also individual line graphs of certain states to show the cases reported
age wise.
FOR DELHI:
FOR KARNATAKA:
FOR BIHAR:
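As a rough sketch of how the data behind one of these per-state line graphs could be prepared: the helper age_wise_by_year, the age-group column names and all values below are assumptions made for illustration only.

```python
import pandas as pd

# Made-up values; the age-group column names are hypothetical.
df = pd.DataFrame({
    "Area_Name": ["Delhi", "Delhi", "Bihar", "Bihar"],
    "Year": [2002, 2001, 2001, 2002],
    "Victims_14_18_Yrs": [3.0, 2.0, 1.0, 2.0],
    "Victims_18_30_Yrs": [5.0, 4.0, 3.0, 3.0],
})

def age_wise_by_year(frame, area):
    """Return one row per year and one column per age group for one area."""
    subset = frame[frame["Area_Name"] == area]
    return subset.drop(columns="Area_Name").set_index("Year").sort_index()

delhi = age_wise_by_year(df, "Delhi")
print(delhi)
```

With matplotlib installed, calling delhi.plot() on a table of this form draws one line per age group against the year.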
CORRELATION MATRIX:
1. A correlation matrix is a table showing the correlation coefficients between
pairs of variables in a dataset
2. The correlation matrix is an important data analysis tool that summarises the
data, helps to understand the relationship between the variables and supports
decisions made accordingly.
Correlation matrix in the form of a figure:
Each row and column represents a variable, and each value in the matrix is the
correlation coefficient between the variables of the corresponding row and column.
For the figure below we have computed the correlation between the cases reported
age wise for a particular area name (state / district), so that it is easier to
consolidate the data, arrive at the final result and draw awareness accordingly.
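A minimal sketch of computing such a correlation matrix with pandas is shown here. The age-group column names and values are made up: the second column is exactly twice the first and the third falls as the first rises, so the expected coefficients are easy to check by eye.

```python
import pandas as pd

# Made-up values; the age-group column names are hypothetical.
df = pd.DataFrame({
    "Area_Name": ["Assam", "Bihar", "Delhi", "Sikkim"],
    "Victims_14_18_Yrs": [1.0, 2.0, 3.0, 4.0],
    "Victims_18_30_Yrs": [2.0, 4.0, 6.0, 8.0],   # rises with the first column
    "Victims_30_50_Yrs": [4.0, 3.0, 2.0, 1.0],   # falls as the first rises
})

age_cols = ["Victims_14_18_Yrs", "Victims_18_30_Yrs", "Victims_30_50_Yrs"]

# corr() computes the pairwise Pearson correlation of the selected columns.
# The result is a square table whose rows and columns are both the variables.
corr = df[age_cols].corr()
print(corr)
```

Values near 1 mean two age groups rise together, values near -1 mean one rises as the other falls, and the diagonal is always 1 because each variable is perfectly correlated with itself.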
The analysis so far shows that between 2000 and 2010 there was a visible increase
in the number of rape cases reported.
The highest numbers were reported from Sikkim, Delhi and Bihar, in the 14-30 age
group, whereas the fewest were reported from the state of Tamil Nadu. We must also
keep in mind that the analysis covers only the cases that were reported; there
have been crimes that took place but were never reported.
Through this analysis I hope to draw attention to this hideous crime, which has
been happening through the ages, and to the fact that we as citizens must take the
necessary steps to reduce it and eventually bring it to an end.