Session 1 - Introduction (Updated)
Session 1 - Introduction (Updated)
7 8
Reading Algebraic Equations Basic Concepts in Statistics
As mathematics is a “language”, we can express complex mathematical relationships by Statistics is a form of mathematical analysis that uses quantified models and
mathematical notations. Here are some commonly used forms of equations: representations for a given set of experimental data or real-life studies. The main
advantage of statistics is that information is presented in an easy way. Here are 8
"! + $ ! = & ! basic concepts used in machine learning and data science [Ref: Shirley Chen].
' = () !
• Type of Statistical Analytics
,+ , , − 1 + !
1++ " =1+ + +⋯ • Probability
1! 2!
• Central Tendency
&
,(+ ,(+ • Variability
1 + = "# + 2 "" cos + $" sin
6 6
"$% • Relationship Between Variables
+ +! +( • Probability Distribution
9' = 1 + + + +⋯ , −∞ < + < ∞
1! 2! 3! • Hypothesis Testing and Statistical Significance
" • Regression
"
, ) "*)
++" =2 + "
>
)$# (ref: https://www.kdnuggets.com/2020/06/8-basic-statistics-concepts.html)
9 10
11 12
Probability of multiple events Probability (Baye’s Theorem)
CondiKonal Probability: P(A|B) is a measure of the probability of one Bayes’ Theorem describes the probability of an event based on prior
event occurring with some relaeonship to one or more other events. knowledge of conditions that might be related to the event.
P(A|B)=P(A∩B)/P(B), when P(B)>0. Prior probability of a random event is the unconditional probability that is
assigned before any relevant evidence is taken into account.
Independent Events: Two events are independent if the occurrence of one Assume: P(A) and P(B) are the probabilities of observing A and B
does not affect the probability of occurrence of the other. P(A∩B)=P(A)P(B) respectively without any given conditions, they are known as prior
where P(A) != 0 and P(B) != 0 , P(A|B)=P(A), P(B|A)=P(B) probability
P(A∣B) is a conditional probability = the probability of event A occurring
Mutually Exclusive Events: Two events are mutually exclusive if they given that B is true. It is also called the posterior probability of A given B.
cannot both occur at the same eme. P(A∩B)=0 and P(A∪B)=P(A)+P(B). Similarly, P(B∣A) = the probability of event B occurring given that A is true.
It is also called the posterior probability of A given B.
Baye’s Theorem:
13 14
What is the probability that a person test posiOve in this test is a true cannabis user, or
in equaOon form, find P(A∣B)?
! #" !(#)
Using Baye’s Theorem: ! " # =
!(%)
But P(B) = P(B|A) P(A) + P(B|A’)P(A’) = 0.9 x 0.05 + 0.2 x 0.95 = 0.235
Assuming 1,000 random people were tested using this cannabis test:
• About 950 are non-users and 190 of them will test false positive (0.2 x
950); 50 of them are users (5%) and 45 of them will test true positive (0.9
x 50); 1,000 people will yield 235 positive results (45 + 190), of which only
45 are genuine drug users, or you are only 19% confident that they are a
true drug user!!
17 Probability Density CumulaIve DistribuIon 18
Function FuncIon
Sample Mean : Arithmeec mean of random sample values drawn from the
populaeon - x̄ (pronounced as x bar).
Goto Colab 23