Intro To Descriptive Statistics: By: Mahmoud Galal
01 History
Knowing the past …
Respecting the present.
02 Data Types and Terminologies
Knowing the type of the data
is knowing what to do with it.
03 Summary Statistics
Representing a great sum of data with just one number!
04
Using the swords we mastered to slice down the beast, just to know him better (we are not savages!)
-- It’s said that data is the biggest beast of our age --
01
History…
the past … the present.
How did statistics evolve?
Gender
Data Terminologies
• Variables are also called Columns, Features, Dimensions, Fields and Attributes.
• Samples are also called Observations, Records, Instances and Rows.
• Variables and Samples together make up a “Data Set” or “Data Frame”.

Country   Age   Score
Egypt     30    4
Morocco   21    4
Germany   29    3
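To make the terminology concrete, here is a minimal sketch of the three rows above in plain Python. Storing each sample as a dict is an illustrative choice, and the names `data_set`, `variables` and `ages` are assumptions, not something the slides prescribe.

```python
# Each dict is one sample (observation / record / instance / row).
# Each key is one variable (column / feature / field / attribute).
data_set = [
    {"Country": "Egypt", "Age": 30, "Score": 4},
    {"Country": "Morocco", "Age": 21, "Score": 4},
    {"Country": "Germany", "Age": 29, "Score": 3},
]

variables = list(data_set[0].keys())      # the column names
n_samples = len(data_set)                 # the number of rows
ages = [row["Age"] for row in data_set]   # one variable across all samples

print(variables, n_samples, ages)
```

Reading down one key gives a variable; reading across one dict gives a sample.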
Outliers
• Any data point < (Q1 - 1.5 * IQR) or > (Q3 + 1.5 * IQR) is considered an outlier, where IQR = Q3 - Q1.
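The rule above can be sketched with Python's standard `statistics` module. The function name `iqr_outliers` and the sample ages are hypothetical, and the quartiles use the "inclusive" method as an assumption; other quartile conventions can give slightly different fences.

```python
import statistics

def iqr_outliers(values):
    """Return the values that fall outside the 1.5 * IQR fences."""
    # quantiles(n=4) returns the three cut points Q1, Q2 (median), Q3.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower_fence = q1 - 1.5 * iqr
    upper_fence = q3 + 1.5 * iqr
    return [v for v in values if v < lower_fence or v > upper_fence]

# Hypothetical ages with one suspicious entry.
ages = [21, 25, 27, 29, 30, 31, 33, 95]
outliers = iqr_outliers(ages)
print(outliers)
```

With this sample the fences come out at 19 and 39, so only the value 95 is flagged.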
2. Median
3. Mode Frequency(X)
Mean
It’s the sum of all values in the data set divided by the number of records.
Median and Mode cannot be algebraically manipulated.
of data points.
Practice
5-Min Break
Please submit your attendance while
making yourself something hot.
Median
It’s the middle value of our data set once the values are sorted.
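A minimal sketch of that idea follows. The function name `median` is chosen here for illustration, and averaging the two middle values for an even number of records is the common convention, assumed rather than taken from the slides.

```python
def median(values):
    """Middle value of the sorted data; mean of the two middle values if even."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2

ages = [30, 21, 29]           # the Age column from the example table
median_age = median(ages)     # sorted: 21, 29, 30
even_case = median([1, 2, 3, 4])

print(median_age, even_case)
```

Sorting first is the important step: taking the middle of the unsorted list would return 21 here instead of 29.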
CONCLUSION
2. Outliers are generally bad for our analysis, but sometimes they are
the most important data points.
5. We can use other summaries, such as the median and mode, in special
cases like the presence of outliers.
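To see why the median and mode help when outliers are present, here is a small sketch using the standard `statistics` module. The scores and the extreme value 100 are hypothetical data made up for the illustration.

```python
import statistics

scores = [3, 4, 4, 5, 4]          # hypothetical scores
with_outlier = scores + [100]     # the same scores plus one extreme value

mean_before = statistics.mean(scores)
mean_after = statistics.mean(with_outlier)

median_before = statistics.median(scores)
median_after = statistics.median(with_outlier)

mode_before = statistics.mode(scores)
mode_after = statistics.mode(with_outlier)

print("mean:  ", mean_before, "->", mean_after)
print("median:", median_before, "->", median_after)
print("mode:  ", mode_before, "->", mode_after)
```

In this example a single extreme value moves the mean from 4 to 20, while the median and mode both stay at 4.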