Business Moments Graphic Assignment

Uploaded by

Anakha Prasad

Exploratory Data Analysis

Instructions:
Please share your answers filled in-line in the word document. Submit code
separately wherever applicable.

Please ensure you update all the details:


Name: Anakha.P.Nair Batch ID: ___________
Topic: Exploratory Data Analysis

Guidelines:
1. An assignment submission is considered complete only when correct, executable code is
submitted along with documentation explaining the method and results. A submission missing either
of these will be considered invalid.

2. Ensure that you submit your assignments correctly. Resubmission is not allowed.

3. After submission, you can evaluate your work by referring to the keys provided (available
only after submission).

Hints: Follow the CRISP-ML(Q) methodology steps wherever appropriate.


1. Data Understanding: work on each feature of the dataset to create a data
dictionary as displayed in the image below:

Make a table as shown above and provide information about the features such as its data
type and its relevance to the model building. And if not relevant, provide reasons and a
description of the feature.

Problem Statements:

© 2013 - 2022 360DigiTMG. All Rights Reserved.


Q1) Calculate Skewness, Kurtosis using R/Python code & draw inferences on the following data.
Refer to the Datasets attachment for data file.
Hint: [Insights drawn from the data such as data is normally distributed/not, outliers, measures
like mean, median, mode, variance, std. deviation]
a. Cars speed and distance


[Figure: Histogram of Speed (x-axis: Speed, bins 5 to 25 and "More"; y-axis: Frequency)]

[Figure: Histogram of Distance (x-axis: Distance, bins 20 to 120 and "More"; y-axis: Frequency)]

Ans=
            Speed       Distance
Skewness    -0.11751    0.806895
Kurtosis    -0.50899    0.405053
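
The skewness and kurtosis figures asked for in Q1 can be computed in a few lines of Python. The sketch below is a minimal standard-library version that assumes the bias-corrected sample estimators (the convention used by spreadsheet functions such as SKEW and KURT); the function names and the small `sample` list are illustrative and are not the assignment data. With pandas, `Series.skew()` and `Series.kurt()` would normally be used instead.

```python
from statistics import mean, stdev


def sample_skewness(values):
    """Bias-corrected sample skewness (adjusted Fisher-Pearson coefficient)."""
    n = len(values)
    if n < 3:
        raise ValueError("skewness needs at least 3 observations")
    m, s = mean(values), stdev(values)  # stdev uses the n - 1 denominator
    cubed = sum(((x - m) / s) ** 3 for x in values)
    return n / ((n - 1) * (n - 2)) * cubed


def sample_excess_kurtosis(values):
    """Bias-corrected sample excess kurtosis (0 for a normal distribution)."""
    n = len(values)
    if n < 4:
        raise ValueError("kurtosis needs at least 4 observations")
    m, s = mean(values), stdev(values)
    fourth = sum(((x - m) / s) ** 4 for x in values)
    scale = n * (n + 1) / ((n - 1) * (n - 2) * (n - 3))
    correction = 3 * (n - 1) ** 2 / ((n - 2) * (n - 3))
    return scale * fourth - correction


# Illustrative values only; load the real columns from the dataset file.
sample = [1, 2, 3, 4, 5]
print("skewness:", sample_skewness(sample))
print("excess kurtosis:", sample_excess_kurtosis(sample))
```

A skewness near zero suggests a roughly symmetric column, while a clearly positive or negative value points to a longer right or left tail. Both functions fail with a division by zero if every value is identical, which this sketch does not guard against.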


b. Top Speed (SP) and Weight (WT)


ANS=
            SP          WT
Skewness    1.61145     -0.61475
Kurtosis    2.977329    0.950291


Q2) Draw inferences about the following boxplot & histogram.

ANS= The data is right-skewed (positively skewed).


ANS= The inference from this boxplot is that the data is positively skewed.

Q3) Below are the scores obtained by a student in tests


34,36,36,38,38,39,39,40,40,41,41,41,41,42,42,45,49,56
1) Find mean, median, variance, standard deviation.
2) What can we say about the student marks?

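1) ANS:

MEAN 41

MEDIAN 40.5

VARIANCE 25.53 (sample variance, n - 1 denominator)

STANDARD DEVIATION 5.05

2) ANS= Most scores cluster between 38 and 42, close to the mean of 41 and the median of 40.5. The two highest scores, 49 and 56, lie well above the rest and pull the mean slightly above the median, so the distribution is mildly right-skewed rather than uniform.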
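
The Q3 figures can be checked with Python's standard `statistics` module. This is a minimal sketch that assumes the sample (n - 1) definitions of variance and standard deviation, and it uses the common 1.5 x IQR rule to flag unusually high or low scores. The quartiles come from the module's default "exclusive" method, so other quartile conventions may give slightly different fences.

```python
from statistics import mean, median, quantiles, stdev, variance

scores = [34, 36, 36, 38, 38, 39, 39, 40, 40, 41, 41, 41, 41, 42, 42, 45, 49, 56]

score_mean = mean(scores)
score_median = median(scores)
score_variance = variance(scores)  # sample variance, n - 1 denominator
score_stdev = stdev(scores)        # square root of the sample variance

# Quartiles with the module's default ("exclusive") method.
q1, _, q3 = quantiles(scores, n=4)
iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [x for x in scores if x < lower_fence or x > upper_fence]

print("mean:", score_mean)
print("median:", score_median)
print("variance:", round(score_variance, 2))
print("standard deviation:", round(score_stdev, 2))
print("scores outside the 1.5 x IQR fences:", outliers)
```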


Q5) What is the nature of skewness when the mean and median of the data are equal?

ANS= Zero skewness; the distribution is approximately symmetric.

Q6) What is the nature of skewness when mean > median?

ANS= Right Skewed.

Q7) What is the nature of skewness when median > mean?

ANS= Left Skewed.
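
The mean-versus-median rule used in Q5 to Q7 can be written as a small check. This is only a rule-of-thumb sketch: the function name `skew_direction` and the `tolerance` parameter are hypothetical, and comparing the mean with the median is a heuristic that usually, but not always, agrees with the sign of the computed skewness.

```python
from statistics import mean, median


def skew_direction(values, tolerance=1e-9):
    """Classify skew from the gap between mean and median (a heuristic)."""
    gap = mean(values) - median(values)
    if abs(gap) <= tolerance:
        return "approximately symmetric"
    return "right-skewed" if gap > 0 else "left-skewed"


# Test scores from Q3: mean 41 is above median 40.5.
scores = [34, 36, 36, 38, 38, 39, 39, 40, 40, 41, 41, 41, 41, 42, 42, 45, 49, 56]
print(skew_direction(scores))
```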

Q8) What does a positive kurtosis value indicate for the data?

ANS= A sharper peak and heavier tails than a normal distribution (leptokurtic).

Q9) What does a negative kurtosis value indicate for the data?

ANS= A broader, flatter peak and lighter tails than a normal distribution (platykurtic).

Q10) Answer the below questions using the below boxplot visualization.

What can we say about the distribution of the data?

ANS= The data is not symmetrically (normally) distributed.


What is the nature of skewness of the data?

ANS= Left-skewed (negatively skewed).


What will be the IQR of the data (approximately)?

ANS= IQR = Q3 - Q1
= 18 - 10
= 8

Q11) Comment on the below Boxplot visualizations?

Draw an inference from the distribution of data for Boxplot 1 with respect to Boxplot 2.

ANS= Boxplot 1 has a range of 3.

Boxplot 2 has a range of 1.5.

Q12)


Answer the following three questions based on the boxplot above.
(i) What is inter-quartile range of this dataset? [Hint: IQR = Q3 – Q1]
In one line, explain what this value implies. (Hint: Based on IQR definition)
ANS= IQR = Q3 - Q1
= 12 - 5
= 7
The middle 50% of the observations span a range of 7 units.

(ii) What can we say about the skewness of this dataset?


ANS= The data is positively skewed.

(iii) If it were found that the data point with the value 25 is 2.5, how would the new
boxplot be affected?
ANS= 3
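
Part (iii) can be reasoned about with the usual 1.5 x IQR whisker rule. The sketch below assumes the quartiles read off the plot in part (i), Q1 = 5 and Q3 = 12, and assumes the boxplot follows that whisker convention; the helper names are hypothetical. Under those assumptions, 25 falls above the upper fence and is drawn as an outlier, whereas 2.5 falls inside the fences.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return the (lower, upper) outlier fences for the given quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def is_outlier(value, q1, q3, k=1.5):
    """True when the value lies outside the k x IQR fences."""
    lower, upper = tukey_fences(q1, q3, k)
    return value < lower or value > upper


# Quartiles as read from the boxplot in part (i); treat them as approximate.
Q1, Q3 = 5, 12
lower, upper = tukey_fences(Q1, Q3)
print("fences:", lower, upper)
print("25 is an outlier:", is_outlier(25, Q1, Q3))
print("2.5 is an outlier:", is_outlier(2.5, Q1, Q3))
```

Replacing 25 with 2.5 would also shift the quartiles slightly. The sketch ignores that shift because the raw data behind the plot are not available here.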

Q13)


Answer the following three questions based on the histogram above.
(i) Where would the mode of this dataset lie?
ANS = The mode lies at about 7 on the x-axis.

(ii) Comment on the skewness of the dataset.


ANS= The data is right-skewed.

(iii) Suppose that the above histogram and the boxplot in question 2 are plotted for
the same dataset. Explain how these graphs complement each other in providing
information about any dataset.
ANS =
