0% found this document useful (0 votes)

19 views53 pages

Stats 1 Module Updated

Statistics for Data Analytics Part 1

Uploaded by

nandgaonkarmahesh041

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views53 pages

Stats 1 Module Updated

Statistics for Data Analytics Part 1

Uploaded by

nandgaonkarmahesh041

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 53

Statistics

1. Statistics: It is the science of conducting studies to collect,

organize, summarize, analyze, and draw a conclusion out
of data.
2. It deals with collective informative data, interpreting those
data, and drawing a conclusion from that data.
3. It is used in many disciplines like marketing, business,
healthcare, telecom, etc.

In any data science project, data helps us to analyze the initial

level of insight.

Types of Statistics

1. Descriptive
2. Inferential

Descriptive

1. It helps us to organize and summarize data using numbers

and graphs to look for a pattern in the data set.
2. Measures of Central tendency: Mean, Median, Mode.
3. The measure of Variability: Standard Deviation, Variance
& Range

(*Central Tendency- it is the single value which attempts to

describe a set of data)

Inferential

1. To make an inference or draw a conclusion from the

population, sample data is used.
2. Using probability to determine how confident we can be
that the conclusion we make is correct. (Confidence
Interval & margin of error)

Example: Our primary concern is to find out how many people

like blue cars in the data set.

Suppose, in a city, 1 lakh people are there. For our analysis, we

have taken 100 people from the data set. Out of 100, 20 people
like blue cars. i.e., 20/100 means 20% population like blue cars.
This 20% is descriptive Statistics.

If we say 20% +/- 2%, i.e., 20% people with 2% margin of error
like blue cars. So in this, we are 98% sure that this is correct.
This is called inferential.

Introduction to basic Statistics terms

• Population: A collection, or set, of individuals or objects or
events whose properties are to be analyzed ( Raw data that we
get from the client).

• Sample: A subset of the population. It should be

representative of the population.

• Variable: A characteristic of each element of a population or

sample.

• Data (singular): The value of the variable associated with one

element of a population or sample. This value may be a
number, a word, or a symbol.

• Data (plural): The set of values collected for the variable from
each of the elements belonging to the sample.

• Experiment: A planned activity whose results yield a

collection of data.

• Parameter: A numerical value summarizing all the data of an

entire population (Mean, Median, Mode).

• Statistic: A numerical value summarizing the sample data.

Descriptive Statistics
Descriptive Statistics: It is a method of organizing,
summarizing, and presenting data in an informative way.

Example: You have all the data on how the business is going
on, how much inventory you keep, how many customers come
to your store, In which month it has been more, at what day of
the week it occurs more. Which product is being sold more at
what point of time, on what hours is your product sold more.
What kind of customers come, do male customers come more
at a certain point in time, or female customers come then.
People with children come more, cigarette buyers come more,
or beer buyers come more, or grocery item buyers come more.

Descriptive Statistics answers all these questions based on

data.
Types of data
1) Categorical
2) Numerical

1) Categorical Data represents groups or categories

Examples: * Car brands: Audi, BMW and Mercedes.

* Answers to yes/no questions: yes or no.

2) Numerical data represent numbers. It is divided into two

groups: Discrete and Continuous.

Discrete data can be usually counted finitely, while

continuous is infinite and impossible to count.

Examples:

Discrete: * Number of children you want to have: 1, 2, 3.

* Grades at University: 0 to 100 %

* Number of Objects: Bottles, glasses, tables

or cars

Continuous: Height, Area, Distance, Time

Levels of measurement
1) Qualitative: A variable that categorizes or describes a
population element.

Note that arithmetic operations such as addition and averaging

are meaningless for data resulting from a qualitative variable.

2) Quantitative: A variable that quantifies a population element.

Note that arithmetic operations, such as addition and average,
are meaningful for data resulting from a quantitative variable.
1) Qualitative is classified into the following two levels:

 Nominal
 Ordinal

The nominal level represents categories that cannot be put in

any order, while ordinal represents categories that can be
ordered.

Examples: * Nominal: four seasons (winter, spring, summer,

autumn)

* Ordinal: rating your meal (disgusting,

unappetizing, neutral, tasty, and delicious)

2) There are two quantitative levels: interval and ratio.

They both represent “numbers”; however, ratios have a true

zero, while intervals don’t.

Examples:

* Interval: degrees Celsius and Fahrenheit

* Ratio: degree Kelvin.

Measures of Central Tendency

Mean

Here n is the size of the data set, x̄ is the sample mean, and
x¡ the numbers in sequence.

∑ is the summation of the entire data set

Similarly, for a data population of size N, the population mean

is:

Example: The systolic blood pressure of seven middle-aged

men in:

150, 123, 134, 170, 146, 124 and 113.

The Mean is =
(150+123+134+170+146+124+113)/7 = 137.14

Mode and Median

 The median for the sample data arranged in increasing

order is defined as :
 i. If"n" is an odd number - Middle value
 ii. If "n" is an even number - Midway between the two
middle values
 The mode is the most commonly occurring value.
 Mode exists as a data point.
 Useful for qualitative data.

Example – if n is odd

The re-ordered systolic blood pressure data:

113,124,125,132,146,151 and 170.

-> The median here is 132.

-> Two individuals have systolic blood pressure =

124mm Hg, so the Mode is 124.

Example – if n is even

Six men with high cholesterol participated in the study to

investigate the effects of diet on cholesterol levels. At the
beginning of the study, their cholesterol levels (mg/dl) were as
follows:

366, 327, 274, 292, 274 and 230

Rearrange the data in ascending order as follows:

230, 274, 274, 292, 327 and 366.

-> The median is 283(average of 274 and 292).

-> The mode between the two men having the same
cholesterol level = 274.
Mean, Mode and Median in Brief:

Note:
 Mean is highly sensitive to outliers
 Example:
o 1,2,3,4,5

o -> Mean: 3

-> Median: 3

o 1,2,3,4,5,100

o -> Mean: 51.5

-> Median: 3.5

Caculating Mean (Python Code):

import numpy as np
expenditure = np.random.normal(25000, 15000, 10000)
np.mean(expenditure)

24820.02338888958

Caculating Median:
np.median(expenditure)

24691.98032372038
Now, we are adding a large number to the sample.
expenditure = np.append(expenditure, [10000000000])

np.median(expenditure)

24698.883118187983

np.mean(expenditure)

424650.16332356015
Here, the Median did not change much, but the Mean did.

Calculating Mode:

Let’s generate a random expenditure set data using the script

below.

expenditure = np.random.randint(15, high=50, size=200)

expenditure
This gives us the folliwng output:
array([40, 15, 46, 22, 17, 45, 23, 29, 15, 24, 41, 34, 36, 37, 32,
17, 38,
31, 25, 28, 37, 48, 45, 31, 41, 47, 36, 39, 37, 29, 48, 38,
46, 28,
26, 39, 17, 18, 37, 22, 31, 30, 15, 24, 15, 23, 19, 31, 20,
38, 39,
42, 47, 27, 19, 24, 27, 34, 21, 20, 38, 15, 28, 48, 32, 41,
29, 21,
32, 19, 38, 28, 39, 32, 45, 22, 15, 24, 41, 23, 15, 30, 31,
47, 29,
48, 35, 32, 16, 33, 37, 43, 34, 25, 39, 22, 45, 34, 20, 45,
36, 15,
24, 24, 29, 42, 41, 47, 48, 33, 44, 25, 49, 39, 41, 41, 39,
34, 48,
47, 29, 22, 45, 24, 32, 46, 44, 47, 43, 24, 28, 15, 15, 47,
46, 29,
35, 21, 15, 40, 31, 37, 44, 38, 15, 48, 48, 17, 15, 29, 25,
40, 37,
35, 33, 47, 26, 48, 16, 20, 37, 32, 37, 30, 44, 25, 49, 19,
41, 15,
19, 27, 36, 25, 16, 49, 21, 49, 36, 44, 31, 23, 34, 35, 15,
43, 44,
36, 29, 22, 27, 49, 46, 31, 24, 40, 43, 41, 36, 28])

from scipy import stats

stats.mode(expenditure)

ModeResult(mode=array([15]), count=array([15]))
Measures of Dispersion
Range

 The difference between the smallest and the largest

observations in the sample is called Range.
 For example, the minimum and maximum blood pressure
are 113 and 170, respectively. Hence the range is 57.
 It is easy to calculate.
 It’s implemented for both "best" or " worst" case
scenarios.
 Too sensitive for extreme values.

Sample and Population

 Population - The entire set of objects or individuals or

interests or the measurements obtained from all
individuals or objects of interest.

-> Finite

-> Infinite

 Sample - A portion, or part, of the population of interest.

Variance:
Variance is a measure of dispersion in a data set.

It is measured by first finding the Deviation of each element in

a data set from the mean, and then by squaring it. Variance is
an average of all squared deviations.
The below figure shows that On an avg how far point is
distributed from the mean (x̄)

 Here(left side) variance is high because, from the

mean(x̄), the points are distributed at a longer distance as
compared to the right side, where the distance is a bit
smaller.

Q. Consider a list of random integers 3,3,3,5,6,1. We will

now calculate the variance using the numpy library.
import numpy as np
results = [3,3,3,5,6,1]
np.var(results)

2.5833333333333335

The variance of the random data set is 2.58.

The sample variance, s squared, is the arithmetic mean
of the squared deviations from the sample mean:

Standard deviation

 Standard deviation tells us about the concentration of

data around the mean of the data set.
 Standard deviation (S) is the square root of the variance.

Formula to calculate standard deviation:

 The sample standard deviation has the advantage of being

in the same units as the original variable (x).

To measure standard deviation, use the inbuilt function “std”

from numpy, as shown below:
np.std(results)
1.6072751268321592

Facts about Standard Deviation:

 If the standard deviation is small, the data has little
spread (i.e., the majority of points fall very near the
mean).
 If standard deviation = 0, there is no spread. This only
happens when all data items are the same value.
 The standard deviation is significantly affected by outliers
and skewed distributions.

Sample Vs. Population

The variance of Population and Sample

Note: In the sample variance formula, the denominator has n-1

instead of n, where n is the number of observations in the
sample. This use of ‘n-1’ is the Bessel’s correction method. The
reason behind using this method is, it corrects the bias in the
estimation of the population variance.
Symmetrical and asymmetrical distribution

# Using Numpy & Scipy to calculate Mean, Mode, and Median

import numpy as np
from scipy import stats

dataset = [5,5,2,3,4,6,18]

#mean value
mean = np.median(dataset)
#median value
median = np.median(dataset)
#mode value
mode = stats.mode(dataset)

std = np.std(dataset)
vr = np.var(dataset)

print("Mean:", mean)
print("Median:", median)
print("Mode:", mode)
print("STD", std)
print("Var:", vr)

Outputs:
Mean: 5.0
Median: 5.0
Mode: ModeResult(mode=array([5]), count=array([2]))
STD 4.997958767010258
Var: 24.9795918367347

Random Variables
A random variable is a set of all the possible values from a
random experiment.

A random variable is a variable whose possible values are

outcomes of a random phenomenon. Random variables can be
discrete or continuous. Discrete variables can only take specific
values, while continuous random variables can take any value
(within a range).

Discrete Random Variable

 The discrete random variable is one that may take on only
a countable number of distinct values.
 If the random variable can take only the finite number of
distinct values, then it must be a discrete random
variable.

Examples of discrete random variables :

* Number of children in the family.
* The Friday night attendance at a Multiplex.
* The number of patients in the doctor's surgery.
* The number of defective light bulbs in the box.
 The probability distribution of the discrete variable is the
list of the probabilities associated with every possible
value.

Suppose a random variable X may take k different values, with

a probability that X = xi defined to be P(X = xi) = pi. The
probabilities pi must satisfy the following:

1: 0 <= pi <= 1 for each i

2: p1 + p2 + ... + pk = 1.

Continuous Random Variables:

It is the one that takes an infinite number of possible values.
Continuous random variables usually are the measurements.
For example, the height of a person, the weight of a machine,
the amount of sugar in tea, etc. A continuous random variable
is defined throughout values and is represented by the area
under a curve.

Set
Set Definition
 A set is a well-defined collection of objects.
 A set that contains zero elements is called a null set
(empty set).
 Let A and B be two sets. Then A is said to be a subset of B
(or B is a superset of A) if every element of A belongs to B.

 A set may be defined by mentioning its elements written

in brackets.

For example, if Set X consists of the numbers 1, 2, 3, and

8, we may say X = {1, 2, 3, 8}.

 An empty set is denoted by { }, which is called a null set.

Operations of Set
 The union of two sets is defined as the set of elements
that are present in one or both sets. Thus, if A is {1, 2}
and B is {2, 3, 4}, the union of sets A and B is:

A ∪ B = {1, 2, 3, 4}

 The common elements that are present in both sets are

known as the intersection of two sets. Thus, if A is {1, 2,5}
and Y is {2, 3, 4,5}, the union of sets A and B is: The
intersection of two sets is

A ∩ B = {2,5}

 The complement of an event is defined as it is the set of

all the elements but not in the event.
 Thus, if the sample space is {0, 2, 3, 4, 8, 9}, and X is {2,
3, 4}, then the complement of set X is

X' = {0, 8, 9}

Examples
1. The set of consonants.

If B is the set of consonants, then B could be described as

B = {b,c,d,f,g,h,j,k,l,m,n,p,q,r,s,t,v,w,x}

2. Set X = {1, 2, 3} and Set Y = {3, 2, 1}. Is Set X equal to

Set Y?

Yes. Two sets are equal only if they have identical

elements.

3. Set X = {1, 2, 4} and Set Y = {1, 2, 3, 5, 6}. Is Set X a

subset of Set Y?
Set X would be a subset of set Y if every element from
Set X were also in Set Y.

Skewness: Skewness is defined as a measure of the

dataset’s symmetry. A perfectly symmetrical data set will
have zero skewness.
The skewness is defined as:

Where Xi is the ith X value, n is the sample size, x̄ is the

average, and s is the sample standard deviation.

This sample size formula is used here.

The figure above is for an asymmetrical data set. This data set
was created by generating the data from 65 to 135 in 5 number
of steps with the number of each value, as shown in Figure
above.
The above figure shows Symmetrical Data set with
Skewness equals to 0

For example, there are three 65's, six 70's, and nine 75's, etc.

The set of symmetrical data has a skewness equal to 0, where

Each X value is subtracted from the average. So, if a collection
of data is symmetrical, for each point that is a distance “d”
above the average, there will be a point that is a distance “-d”
below the average.

Consider 65 value and 135 value. The average of the data in

the above figure is 100.

when X = 65

For x = 135 then:

So, the -4278 value and the value of+4278 even out at 0. So, a
Symmetrical data set will have 0 skewness.

To explore +ve & -ve values of skewness, let’s define the

following terms:

So, when Xi is above the average, Sabove is the “size” of the

deviations from average. Likewise, when X i is below the
average, Sbelow can be viewed as the “size” of the deviations
from average.

Then the skewness becomes:

The skewness will be positive if S above is larger than Sbelow. This
means that the right-hand tail will be longer than the left-hand
tail. Figure 2 is an example of this. Skewness for this dataset is
0.514. Positive skewness indicates that the size of the right-
handed tail is larger than the left-handed tail.

Fig 2: A dataset with Positive Skewness

Figure 3 is an example of the dataset with negative skewness.

It is the mirror image, necessarily of Figure 2. The skewness is -
0.514. In this case, Sabove is smaller than Sbelow. The left-hand
tail will typically be longer than the right-hand tail.

Figure 3: Negative Skewness with Dataset

So, when is the skewness too much?

 If the skewness is between -0.5 and 0.5, the data are fairly
symmetrical. If the skewness is between -1 and – 0.5 or
between 0.5 and 1, the data is moderately skewed.

 If the skewness is greater than 1or less than -1, the data is
highly skewed.

Example: Heights of men in a college

Heig Clas Freque
ht s ncy,
(inch Mar
es) k, x f

59.5–
62.5 61 5

62.5–
65.5 64 18

65.5–
68.5 67 42

68.5–
71.5 70 27
Heig Clas Freque
ht s ncy,
(inch Mar
es) k, x f

71.5–
74.5 73 8

In the below histogram the data are skewed towards left:

To know how the dataset is highly skewed compared to other

data sets, we need to compute the skewness.

n = 4+19+42+27+8 = 100
x̅ = (61×5 + 64×18 + 67×42 + 70×27 + 73×8) ÷ 100
x̅ = 9305 + 1152 + 2814 + 1890 + 584) ÷ 100
x̅ = 6745÷100 = 67.45

Now, with the mean in hand, we can compute the skewness.

(x
Class Freque (x− (x−x̅
x*f −x̅
Marks, x ncy, f x̅)²f )³f
)

- -
30 6.4 208. 1341.
61 5 5 5 01 68

- -
11 3.4 214. 739.1
64 18 52 5 25 5
(x
Class Freque (x− (x−x̅
x*f −x̅
Marks, x ncy, f x̅)²f )³f
)

-
28 0.4
67 42 14 5 8.51 -3.83

18 2.5 175. 447.7

70 27 90 5 57 0

58 5.5 246. 1367.

73 8 4 5 42 63

∑ 67 852. −269
45 n/a 75 .33

x̅, m2, m3 67. 8.52 −2.6

45 n/a 75 933

Finally, the skewness is

g1 = m3 / m23/2 = −2.6933 / 08.52753/2

= − 0.1082
Probability Density Function

A Probability Distribution is a mathematical function through

which the probability of occurrence of different possible
outcomes in an experiment can be calculated.

If the probability of an event is higher, it’s more likely that the

event will occur.
For Example:
While tossing a fair (unbiased) coin, there could be a chance of
the possibility of two outcomes (“heads” and “tails”), which are
equally probable.

The probability of getting a head or a tail is 50 % or 0.5.

Types of the probability distribution
There are many different types of probability distribution. Some
of them which we will be covering in this blog are listed below:

 Normal Distribution
 Bernoulli’s Distribution
 Binomial Distribution
 Uniform Distribution
 Student’s T Distribution
 Poisson Distribution

Expected value

 E(C) = C
 The Expected Value of a Constant is only a value of a
constant.

 E (X + C) = E(X) + C

 E(CX) = cE(X)
 We can “pull” a constant out of an expected value
expression as a part of a sum with a random variable
X.

Binomial Distribution
The binomial distribution is used when there is more than one
outcome of a trial. These outcomes are labeled as “Success”
and “Failure.”

Here, the probability of both outcomes is the same for all the
trials.

Each trial is independent. The parameters of a binomial

distribution are p and n, where p is the probability of success in
the individual trial, and n is the total number of trials.

Binomial distribution:

Pyt
hon binomial distribution tells us the probability how often
there will be a success in ‘n’ independent experiments. Such
experiments are yes-no questions. One example may be
tossing a coin.
import seaborn
from scipy.stats import binom
data=binom.rvs(n=17,p=0.7,loc=0,size=1010)
ax=seaborn.distplot(data,kde=True,color='pink',hist_kws={"li
newidth": 22,'alpha':0.77})
ax.set(xlabel='Binomial',ylabel='Frequency')

How to Work a Binomial Distribution Formula:

Examples

Question: Hospital records show that of patients suffering

from a specific disease, `75%` die of it. What is the probability
that of six randomly selected Patients, four will recover?

Solution: This is a binomial distribution because the reason is

that there are only two outcomes (the patient dies, or does
not).

Let X = number who recover.

Here, n=6 and x=4.

p=0.25 (success, i.e., they live), q=0.75 (failure, i.e., they die).

The probability that four will recover:

Question: A die is rolled 3 times. What is the probability of

(a) No fives turning up?

(b) 1 five?

Solution:
There are only two possible outcomes (we get a 5, or we do
not).

Now, n=3 for each part.

Let X= number of fives appearing.

(a) When, x = 0.

(b) When, x = 1.

Question: In the old days, there was a probability of `0.8` of

success in any attempt to make a telephone call. (This often
depended on the importance of a person making the call, or the
operator's curiosity!)

Calculate the probability of having seven successes in just ten

attempts.

Solution:

Probability of success p=0.8, so probability of failure q = 0.2.

X= success in getting through.

Probability of 7 successes in 10 attempts:

Probability = P(X=7)

= 0.20133

Question: A (blindfolded) marksman finds that on the average,

he hits the target `4` times out of `5`. If he fires `4` shots,
what is a probability of

(a) more than `2` hits?

(b) at least `3` misses?

Solution:

Here, n=4, p=0.8, ‘q’ = 0.2.

Let x be the number of hits.

Let x0 = no hits, x1 = one hit, x2 = two hits, etc.

(a) P(X) = P(x3) + P(x4)

= 0.8192

(b) 3 misses mean 1 hit, and 4 misses mean 0 hits.

P(X) = P(x1) + P(x0)

= 0.0272

Question: A manufacturer of metal pistons finds that on the

average, `12%` of his pistons get rejected because either they
are oversize or they are undersize. What is a probability that a
batch of ten pistons will contain?

(a) no more than `2` rejects? (b) at least `2` rejects?

Solution: Let X= number of rejected pistons

(In this case, "success" means rejection!)

Here, n=10p=0.12, q=0.88.

(a) No rejects. That is when x=0:

One reject. That is when x=1:

Two rejects. That is when x=2:

So the probability is:

P(X ≤ 2)
= 0.2785 + 0.37977 + 0.23304
= 0.89131

(b) We could work out every case for X = 1,2,3,4,5,6,7,8,9,10,

but it is much easier to solve as follows:

Probability of at least 2 rejects

=1−P(X≤1)
=1−(P(x0)+P(x1))
=1−(0.2785+0.37977)
=0.34173

Question:
A company drills 9 wild-cat oil exploration wells, each with an
estimated probability of success of 0.1. What is the probability
that all nine wells fail?
Solution:
Let’s do 20,000 trials of the model, and count the number that
generates zero positive results.

sum(np.random.binomial(9, 0.1, 20000) == 0)/20000.

0.3918

Note: Suppose a binomial experiment consists of n trials and

results in x successes. If the probability of success on an
individual trial is ‘P’, binomial probability:

b(x; n, P) = nCx * Px * ( ‘1’ – P )n - x

Question:

1. Generate a random number between ‘0’ and ‘1’. If that

number is 0.5, then it will count it as heads, otherwise tails.
Do this n times using a list comprehension of python. This
should happen within the function run_binom.

2. Repeat this a specified number of times (the input

variable trials specify the number of trials). We will perform
1,000 trials.

Solution:
# Import libraries
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
# Input variables

# Number of trials
trials = 1000

# Number of independent experiments in each trial

n = 10

# Probability of success for each experiment

p = 0.5

# Function that runs our coin toss trials

def run_binom(trials, n, p):
head = []
for i in range(trials):
toss = [np.random.random() for i in range (n) ]
head.append (len ([i for i in toss if i>=0.50]))
return head

# Run the function

heads = run_binom(trials, n, p)

# Plot and save the results as the histogram

fig, ax = plt.subplots(figsize=(14,7))
ax = sns.distplot(heads, bins=11, label='simulation results')

ax.set_xlabel("Number of Heads",fontsize=16)
ax.set_ylabel("Frequency",fontsize=16)
# Probability of getting 5 heads

runs = 10000
prob_5 = sum([1 for i in np.random.binomial(n, p, size = runs)
if i==5])/runs
print('The probability of 6 heads is: ' + str(prob_5))

The probability of 5 heads is: 0.2035

Normal Distribution (Gaussian Distribution)

Normal Distribution is one of the most common continuous
probability distribution. This type of distribution is important in
statistics and is often used to represent random variables
whose distribution is not known.

This type of distribution is symmetric, and it's mean, median,

and mode are equal.

Mathematically, Gaussian Distribution is represented as:

N~(μ, σ2 )

Where N stands for Normal, symbol ~ for distribution, whereas

symbol μ stands for mean and σ2 stands for the variance.

In the above image, we can view the highest point is located at

the mean μ, and the spread of the graph can be observed by
the standard deviation ‘σ’.

Let us understand this with the easiest example where we can

have the random variable X with distribution:
X = {1, 2, 3, 4, 5}
When we take the mean and the standard deviation of the
above data set, we get mean(μ) = 3 and standard deviation(σ)
= 1.

When we plot it, we get a few distributions like this mentioned

below:
This Bell curve specifies the Gaussian distribution.
1.1 Empirical Formula
The empirical rule states that for the Normal Distribution,
nearly all of the data will fall within three range of standard
deviations of a mean. The empirical rule can be understood
through the following:

 68% of the data falls within the 1st standard deviation

from the mean.
 95% fall within two standard deviations.
 99.7% fall within three standard deviations.

 Approximately 68% of the data falls within one standard

deviation of a mean In mathematical notation, this is
represented as μ ± 1σ

 About 95% of the data falls within 2 standard deviations of

the mean (i.e., between the mean – 2 times the standard
deviation, & mean + 2 times the standard deviation). The
mathematical notation for this is: μ ± 2σ

 Approximately 99.7% of the data lies in 3 standard

deviation of a mean (i.e., between the mean –3 times the
standard deviation and the mean + 3 times the standard
deviation).
 The following notation is used to represent: μ ± 3σ

The Empirical Rule is often used to forecast when obtaining the

right data is difficult or impossible to get.
1.2 Standard Normal Distribution
Understanding Standardization in the context of statistics.
Every distribution can be standardized. Let say if the mean and
variance of a variable are μ and σ2, respectively.

Standardization is the process of transforming the distribution

to one with a mean of 0 and a standard deviation of 1.

i.e., ~(μ, σ2 ) → ~ (0, 1)

When a Normal Distribution is standardized, a result is called a

Standard Normal Distribution.

i.e., N~(μ, σ2 ) → ~ N(0, 1)

We use the following formula for standardization:

Where x is a data element, μ is mean & ‘σ’ is the standard

deviation, and Z is used to denote standardization, and Z is
known as the z-score.

With the help of Z scores, we can come to know how far a value
is from the mean. When you standardize a random variable, its
μ becomes 0, and its standard deviation becomes 1.

If the Z score of x is 0, then the value of x is equal to the mean.

Let us understand the steps in Standardization with the help of
a simple example.

Suppose we have a dataset with elements

X = {1,1,1,2,2,2,3,3,4,4,4,4,5}

And uniformly distributed as:

We get mean as 3, variance = 1.49 & standard deviation as

1.22 i.e., N ~ (3, 1.49).

Now we will subtract the mean from every the data points, that
is, x – μ.

We will get a new data set mentioned below:

X1 = {-2, -1, -1, 0, 0, 1, 1 ,1, 2,2}

μ as 0, but the variance and std dev still as 1.49 and 1.22
respectively

i.e., N ~ (0, 1.49)

So in the next step of standardization, dividing all data points
by the standard deviation, i.e., (x – μ)/σ

Dividing each datapoint by 1.22(standard deviation) we get a

new data set as :

X2 = {-1.6, -0.82, 0, 0.82, 0, , 0.82,0,0.82, 0.82, 0 and 1.63.}

Now if we calculate the mean as 1 i.e., N ~ (0, 1)

Plotting it on a graph :

Using this standardized normal distribution makes inferences &

predictions much easier.

1.3 Probability Density Function and Mass

Function
Probability density function and Probability mass function
defines a Probability Distribution for a random variable.

If we know the variance and the mean of the dataset, then we

can calculate the PDF and PMF. PDF and PMF tell how well data
is distributed around mean and standard deviation within a
particular curve.
1.4 Cumulative Density Function

The cumulative distribution function (CDF) of a random variable

is another method to describe the distribution of random
variables.

The cumulative frequency is the sum of the frequencies.

Cumulative frequency starts at the frequency of the 1st brand,
and then we add the 2nd, then 3rd, and so on until it finishes
100%.

For all kinds of random variables (discrete, continuous, and

mixed), CDF can be defined.

Examples of Normal Distribution:

Question: An average light bulb manufactured by the Acme
Corporation lasts 300 days with a standard deviation of 50
days. Assuming that bulb life is normally distributed. What is a
probability that an acme light bulb will last at most 365 days?

Solution: Given a mean score of 300 Days & A standard

deviation of fifty days, we want to find the cumulative
probability that bulb Life is higher than equal to 365 days. Thus,
we know the following:

 The value of the standard random variable is 365 days.

 The mean is equal to 300 days.
 The standard deviation is equal to 50days.

We enter these values into the Normal Distribution Calculator

and compute the cumulative probability. The answer is
P( X < 365) equal to 0.90. That means 90% of the chance that
a light bulb will burn-out within 365 days.

Question: Suppose that the scores on an IQ test are normally

distributed(bell-shaped). If the test has a mean is 100 and a
standard deviation(sigma^2) of 10, what is the probability that
a particular person who takes the test will score between 90
and 110.?

Solution: Here, we came to know the probability that the test

score comes between 90 and 110. The "trick" to solving this
problem is the following:

P( 90 < X < 110 )

= P( X < 110 ) - P( X < 90 )

 Here, the value of the normal random variable is 110, the

mean is 100, and the standard deviation = 10. Where
P( X < 110 ) is 0.84.
 To compute P( X < 90 ), The value of the normal random
variable is 90, the mean is 100, and the standard
deviation = 10. P( X < 90 ) is 0.16.

About 68% of the test scores will lie between 90 and

110.

Bernoulli Distribution
Bernoulli distribution is a discrete probability distribution of a
random variable that has only two outcomes.,
namely 1 (success) and 0 (failure). where n = 1 occurs with
probability p and n = 0 (usually called a “failure”) occurs with
probability q = 1 – p,

where 0 < p < 1.

Therefore, the probability density function(PDF) & the graph for
Bernoulli’s Distribution is shown in the figure below:

In the above diagram, 1 refers to ‘success’ & 0 refers to the

failure. The head and tail distribution in tossing a coin is an
example of Bernoulli’s Distribution with p = q = ½.

For example, probability (p) of scoring a goal in the last 10

minutes is 0.35 (success); the probability of not scoring a goal
in the previous 10 minutes (failure) is 1 - p = 0.65.

Plotting Bernoulli distribution with probability for p = 0.65

pyplot.bar(['0', '1'], [0.35, 0.65])

<BarContainer object of 2 artists>
Uniform Distribution
The probability distribution function of the continues uniform
distribution:

Since the area under the curve must be equal to 1, and the
length of the interval determines the height of the curve, the
following figure shows a uniform distribution (a,b).

Note that since the area needs to be 1. The height is set

to 1/(b−a)1/(b−a).

EXAMPLE:

 If we roll a die(numbered 1 to 8), then the probability of

getting 1 is one out of 8.
 Similarly, the probability of getting 2 to 6 is 1/6. There is
an equal chance to get each of 8 results (outcomes).
Poisson distribution:

Poisson Distribution can be used to find the probability of

several events in a time period.

Conditions for Poisson Distribution:

 Here the events can occur independently.

 An event can occur any number of times.
 The rate of occurrence is constant; i.e., the rate does not
change based on time.

Example:

A specific fast-food restaurant gets an average of three visitors

to the drive per minute. This is just an average. However, the
actual amount can vary.

Suppose we conduct the Poisson experiment, where the

average number of success in a given region is μ.

Then, the Poisson probability is:

P(x; μ) = (e-μ) (μx) / x!

The graph of Poisson Distribution:

Examples:
Question: A life insurance salesman sells on the average `3`
life insurance policies per week. Use Poisson's law to calculate
the probability

a. In a given week he will sell how may policies

b. In a given week, he will sell `2` or more policies but not
more than five policies.
c. Assuming that per week, there are `5` working days, what
is the probability that on a given day, he will sell one
policy?

Solution: Here, μ = 3

(a) "Some policies" means "1 or more policies". We can work

this out by computing to find 1 - "zero policies" probability:

P(X > 0) = 1 − P(x0)

Therefore, the probability of 1 or more than 1 policies:

P (X ≥ 0)

=1−4.9787×10−2 because of 1−P(x0)

=0.95021

(b) The probability of selling 2 or more policies but less than 5

policies are:

P(2≤X<5)
= P(x2)+P(x3)+P(x4)

= 0.61611

Q. Aluminium alloy sheets(20 sheets) were examined for

surface flaws. The frequency of sheets, along with the number
of flaws, is as follows:

Number Frequ
of flaws ency
`0` `4`
`1` `3`
`2` `5`
`3` `2`
`4` `4`
`5` `1`
`6` `1`

When chosen at random, what is the probability of finding a

sheet that contains three or more surface flaws?

Solution: The total number of flaws:

The average number of flaws for twenty sheets is given by:

(0×4)+ (1×3)+ (2×5)+ (3×2)+ (4×3) + (5×1)+ (6×1) = 44

An average number of flaws for twenty sheets is given by:
μ = 44/20 = 2.3

The required probability is:

Probability=P(X≥3)

=1−(P(x0)+P(x1)+P(x2))
=0.40396

Question:

If electrical power failures occur according to the Poisson

distribution with an average of `3` failures every twenty weeks,
calculate the probability of more than 1 failure during a
particular week.
Solution:
Per week, the average number of failures μ = 3/20 =00.15
We need to include the probabilities for zero failures plus
“1 failure".

Z Stats

(a) It allows us to calculate the score probability, which

occurs within our normal distribution.
(b) It enables us to compare two scores of different normal
distributions.
The basic z score formula is:

z= (x – μ) / σ

When you have multiple samples and their sample means (the
standard error), you will use the following z-score formula:

z = (x – μ) / (σ / √n)

Calculating z-scores
Question.
The grades on a physics midterm at Covington are roughly
symmetric with μ = 72 and σ=2.0.
 Stephanie scored 74 on the exam.
 Find the z-score for Stephanie's exam grade. Round to two
decimal places.
Solution:

z-score is defined as the number of the standard deviations a

particular point is away from the mean.

z= (74-72)/2.0

z ≈ 1.00

The z-score is 1.00. In other words, Stephanie's score was 1.00

standard deviation above the mean.
Question:

The grades on a math midterm at Gardner Bullis are roughly

symmetric with μ=76 and σ=4.5.
 Daniel scored 64 on the exam.
 Find the z-score for Daniel's exam grade. Round to two
decimal places.
Solution:

z-score is defined as the number of the standard deviations a

particular point is away from the mean.
z= (64-76)/4.5
z ≈ -2.67
The z-score is -2.67. In other words, Daniel's score was 2.67
standard deviations below the mean.

Question: Molly earned a score of 940 on a national

achievement test. The mean test score was 850, along with the
standard deviation of 100. What proportion of students had a
higher score than Molly?
(Assume that test scores are normally distributed.)

(A) 0.10
(B) 0.18
(C) 1.50
(D) 5.82
(E) 2.90

Solution: The correct answer is B. We assume that test scores

are normally distributed. Given an assumption of normality, It
involves the following steps:

 We transform Molly's test score into the z-score using the

z-score transformation equation.

z = (X - μ) / σ = (940 - 850) / 100 = 000.90

 Then, using the standard normal distribution table, we find

the cumulative probability associated with the z-score. In
this case, we find P(Z < 00.90) = 00.8159.

 Therefore, the P(z > 0.90)

= 1 - P( z < 0.90)

=1 - 0.8159 = 0.1841.

Thus, we estimate that 18.41% of the students tested had a

higher score than Molly.

Central Limit Theorem

After fetching different samples, which are enough in numbers,

we can then calculate the mean of each sample and then plot
the various distributions.
Also, if we take the average of the sample mean, then the
result will be equal to the actual population mean & the
standard deviation equals σ/√n.

Where,

‘σ’ is the population of std deviation

n = the sample size(i.e., # of observations in our sample)

Important points while applying the Central Limit Theorem:

 The distribution of the original(population) dataset does
not matter. It could be normal, uniform, binomial, etc.
 The distribution of the sample means would be Normal
Distribution

 Larger the number of samples taken from the population,

the closer to a Normal Distribution the sample means will

be.

 The samples extracted should be more significant than 30

observations.
 The sample mean average extracted will be approximately
equal to the mean of the population, and its variance
would be similar to the original variance, which is divided
by the sample size, i.e., ‘n’.
Mathematical Explanation of Central limit
Theorem
The central limit theorem states that the mean ( )follows
approximately the Normal distribution with mean µ and
standard deviation , where µ and σ are the mean and
standard deviation of the population.

Example: Let X be a random variable along with µ = 10 and σ

= 4. A sample of size 100. Find the probability that the sample
mean ‘µ’ of 100 number of observations is not more than (less
than) 9.

We write

Similarly, the central limit theorem states that the sum T

follows approximately a normal distribution,

Where µ and σ are the mean and standard deviation of a

population. To transform T into z, we use:

Example: Let X be a random variable along with ‘µ’ = 10 and σ

= 4. A sample of size 100 is taken from the population. Find the
probability that the sum of these 100 number of observations is
less than 900.
We write
P(T < 900)

= P(z < −2.5)

= 0.0062 (from the table ofstandard normal probabilities)

Applications of the central limit theorem

Question: A large freight elevator can transport a maximum of
9800 pounds. Suppose a load of cargo containing 49 boxes
must be carried through the elevator. Experience has shown
that the weight of some boxes of this type of cargo follows a
distribution with mean µ = 205 pounds & the standard
deviation ‘σ’ = 15 pounds. Based on this information, what is a
probability that all 49 boxes can be safely loaded onto the
freight elevator and transported?

Solution: We are given n = 49, µ = 205, σ = 15.

The elevator can transport up to 9,800 Pounds. Therefore,
these 49 boxes will be safely transported if they weigh in total,
not more than(less than) 9800 pounds.
The probability that the total weight of 49 boxes is not more
than(less than) 9800 pounds is
P(T < 9800)

=
= P(z < −2.33)
= 1 − 0.9901
= 0.0099.

Question: It is known that the number of tickets purchased by

a student standing in line at the ticket counter to buy the
tickets for the Football match of U.C.L.A. against USC follows a
distribution that has a mean µ = 2.4 & the standard deviation
‘σ’ = 2.0.
Suppose 100 eager students are standing in line to purchase
the tickets. If only 250 tickets remain unbooked, what is a
Probability that all 100 students will be able to buy the tickets
they desire?

Solution: We are given that µ = 2.4, σ = 2, n = 100.

There are 250 tickets available, so a hundred students will be
able to purchase the tickets they want if all together ask for not
more than 250 tickets. Probability for that is
P(T < 250)
= P(z < 0.5)
= 0.6915.
Question: Suppose that you have a sample of 100 values from
a population with mean µ = 500 and with standard deviation σ
= 80.
a. What is the probability that the µ will be in the interval (490,
510)?
b. Give the interval that covers the average 95 percent of the
distribution of the sample mean.
Solution: We are given µ = 500, σ = 80, n = 100.

= P(−1.25 < z < 1.25)

= 0.8944 − (1 − 0.8944)
= 0.7888.

= 0.95.

Question: The amount of regular unleaded gasoline purchased

every week at a gas station near UCLA follows the normal
distribution with mean 50,000 gallons and a standard deviation
of 10,000 gallons. The starting supply of gasoline is 74,000
Gallons, and there is a scheduled weekly delivery of 47000
gallons.
a. Find the probability that, after 11 weeks, a supply of gasoline
will be below 20000 gallons.
b. How much should the weekly delivery be so that after 11
weeks, the probability that the supply is only 0.5% below 20000
gallons is?
Solution:
Given: ‘µ’ = 50000, ‘σ’ = 10000, n = 11. The starting supply is
74000 gallons. 47000 gallons is the weekly delivery. Therefore,
the total supply for the eleven weeks is 74000 + 11 × 47000 =
591000 Gallons.
a. The supply will be below 20000 gaallons if the gasoline
purchased in these 11 weeks is more than 591000 − 20000 =
571000 Gallons
Therefore, we need to find

P(T > 571000) =

= P(z > 0.63)

= 1 − 0.7357
= 0.2643.

b. Let A be the unknown scheduled delivery.

Now, the total gasoline purchased must be higher than 74000
+ 11 × A − 20000.
We need this with probability 0.5% or Probability P(T > 74000
+ 11A − 20000) = 0.005.
The z value is 2.57566.

A = 52854.88
The weekly delivery must be 52854.8 gallons.

Safari
No ratings yet
Safari
385 pages
Ai - Ssmda
No ratings yet
Ai - Ssmda
142 pages
Statistics
No ratings yet
Statistics
21 pages
Ssmda End Sem
No ratings yet
Ssmda End Sem
152 pages
Biostatistics 1
No ratings yet
Biostatistics 1
120 pages
Statistics
No ratings yet
Statistics
45 pages
Statistics For Data Science
No ratings yet
Statistics For Data Science
93 pages
KLM k3
No ratings yet
KLM k3
929 pages
STA641 Book All Lectures
No ratings yet
STA641 Book All Lectures
338 pages
Unit 2 DS PDF
No ratings yet
Unit 2 DS PDF
97 pages
Statistics
No ratings yet
Statistics
152 pages
UNIT II - Statistics For Data Science - New
No ratings yet
UNIT II - Statistics For Data Science - New
153 pages
Advanced Statistics1
No ratings yet
Advanced Statistics1
19 pages
Lesson 5 (Descriptive Statistics Part 1) - Oct 2024
No ratings yet
Lesson 5 (Descriptive Statistics Part 1) - Oct 2024
72 pages
SSM & Da All Unit Notes
No ratings yet
SSM & Da All Unit Notes
152 pages
Statistics, Statistical Modelling & Data Analytics
No ratings yet
Statistics, Statistical Modelling & Data Analytics
68 pages
UNIT II - Statistics For Data Science - New
No ratings yet
UNIT II - Statistics For Data Science - New
153 pages
Statistics
No ratings yet
Statistics
152 pages
Statistics Notes Self Made
100% (1)
Statistics Notes Self Made
41 pages
Basic Statistics (3685) PPT - Lecture On 20-01-2019
100% (1)
Basic Statistics (3685) PPT - Lecture On 20-01-2019
64 pages
Introduction and Descriptive Statistics
No ratings yet
Introduction and Descriptive Statistics
50 pages
Week 01
No ratings yet
Week 01
71 pages
Advance Statistics For Data Science and Data Analysis
No ratings yet
Advance Statistics For Data Science and Data Analysis
47 pages
Full Slides Beginselen2019
No ratings yet
Full Slides Beginselen2019
364 pages
Introduction To Biostatistics
No ratings yet
Introduction To Biostatistics
53 pages
Statistics
No ratings yet
Statistics
68 pages
Introduction To Statistical Analysis
No ratings yet
Introduction To Statistical Analysis
41 pages
Mode of Grouped Data and Midrange
No ratings yet
Mode of Grouped Data and Midrange
16 pages
Lecture 1
No ratings yet
Lecture 1
32 pages
Statistical Foundations - Intro 64zlf
100% (2)
Statistical Foundations - Intro 64zlf
86 pages
Importance of Descriptive Statistics
No ratings yet
Importance of Descriptive Statistics
59 pages
Chapter 1
No ratings yet
Chapter 1
23 pages
Psyc 103 (Stats)
No ratings yet
Psyc 103 (Stats)
75 pages
Intro SRM
No ratings yet
Intro SRM
73 pages
Unit 8. Data Analysis
No ratings yet
Unit 8. Data Analysis
69 pages
Data Management
No ratings yet
Data Management
48 pages
43hyrs Principles of Statistics 3
No ratings yet
43hyrs Principles of Statistics 3
56 pages
Stats Lect
No ratings yet
Stats Lect
77 pages
Assignment 1 (2023-24)
No ratings yet
Assignment 1 (2023-24)
5 pages
Article Review 1 Eng
No ratings yet
Article Review 1 Eng
30 pages
X X X X X X: Data Presentation and Interpretation
No ratings yet
X X X X X X: Data Presentation and Interpretation
89 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
26 pages
Statistics For Data Science
100% (1)
Statistics For Data Science
27 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
18 pages
Particle Size Analysis
100% (1)
Particle Size Analysis
27 pages
Handout-A-Preliminaries (Advance Statistics)
No ratings yet
Handout-A-Preliminaries (Advance Statistics)
29 pages
Session 1 On Descriptive Statistics
No ratings yet
Session 1 On Descriptive Statistics
24 pages
Comm 215.MidtermReview
No ratings yet
Comm 215.MidtermReview
71 pages
Maa SL 4.1-4.3 Statistics - Basic Concepts
No ratings yet
Maa SL 4.1-4.3 Statistics - Basic Concepts
30 pages
C1S1 Statistics Packet
No ratings yet
C1S1 Statistics Packet
24 pages
PowerPoint Presentation
No ratings yet
PowerPoint Presentation
10 pages
Measure of Dispersion and Location
No ratings yet
Measure of Dispersion and Location
51 pages
MealKit4U Stats Student-1 (1) (AutoRecovered)
No ratings yet
MealKit4U Stats Student-1 (1) (AutoRecovered)
15 pages
Bma301 Cat1 Questions
No ratings yet
Bma301 Cat1 Questions
8 pages
Determining Optimum Room Dimensions For Critical L
No ratings yet
Determining Optimum Room Dimensions For Critical L
7 pages
Midterms Gec Math Adooooor
No ratings yet
Midterms Gec Math Adooooor
6 pages
Ecs Notes
No ratings yet
Ecs Notes
10 pages
Swedge Manual
100% (1)
Swedge Manual
92 pages
Statistical Analysis - Descriptive Stat
No ratings yet
Statistical Analysis - Descriptive Stat
6 pages
Central Tendency
No ratings yet
Central Tendency
105 pages
Statistics and Probabilities Quarter 1
No ratings yet
Statistics and Probabilities Quarter 1
6 pages
Statistics Practice
No ratings yet
Statistics Practice
7 pages
Assignment
75% (4)
Assignment
13 pages
Chapter1 Statistics
No ratings yet
Chapter1 Statistics
17 pages
Measures of Central Tendency and Dispersion
No ratings yet
Measures of Central Tendency and Dispersion
4 pages
NITKclass 1
No ratings yet
NITKclass 1
50 pages
Unit II: Basic Data Analytic Methods
No ratings yet
Unit II: Basic Data Analytic Methods
38 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
3 pages
MMW Data Management
No ratings yet
MMW Data Management
2 pages
DR - Nesrin H. Darwesh University of Duhok-College of Dentistry
No ratings yet
DR - Nesrin H. Darwesh University of Duhok-College of Dentistry
15 pages
Name: Siti Mursyida Abdul Karim (Data Science Program) Topic: Assignment - EDA
100% (1)
Name: Siti Mursyida Abdul Karim (Data Science Program) Topic: Assignment - EDA
13 pages
Basic Concepts in Statistics
No ratings yet
Basic Concepts in Statistics
42 pages
© The Institute of Chartered Accountants of India
No ratings yet
© The Institute of Chartered Accountants of India
18 pages
Module 001 Basic Statistical Concept
No ratings yet
Module 001 Basic Statistical Concept
12 pages
35 - 13 Normal Distribution
No ratings yet
35 - 13 Normal Distribution
6 pages
W1 Lesson 1 - Basic Statistical Concepts - Module PDF
No ratings yet
W1 Lesson 1 - Basic Statistical Concepts - Module PDF
11 pages
Assignment On Stem and Leaf: JANUARY 13, 2017
No ratings yet
Assignment On Stem and Leaf: JANUARY 13, 2017
5 pages
OMAE 005022: Offshore Viv Monitoring at Schiehallion - Analysis of Riser Viv Response
No ratings yet
OMAE 005022: Offshore Viv Monitoring at Schiehallion - Analysis of Riser Viv Response
9 pages
Averages: Mean, Median, And Mode (Group Data) : εf cf< fm
No ratings yet
Averages: Mean, Median, And Mode (Group Data) : εf cf< fm
3 pages
Why We Need More Than One Measure of Central Tendency (Mean, Median and Mode) 8 Grade Lesson Plan
No ratings yet
Why We Need More Than One Measure of Central Tendency (Mean, Median and Mode) 8 Grade Lesson Plan
6 pages
Statistics and Set Theory
No ratings yet
Statistics and Set Theory
7 pages
Grade 12 3rd Quarter
No ratings yet
Grade 12 3rd Quarter
4 pages
The Ultimate Data Literacy Cheat Sheet
100% (1)
The Ultimate Data Literacy Cheat Sheet
2 pages
Mean Median Mode Range-Lesson Plan
100% (1)
Mean Median Mode Range-Lesson Plan
3 pages
Statistics For Management
No ratings yet
Statistics For Management
4 pages
Chapter1 Introduction To Statistics
No ratings yet
Chapter1 Introduction To Statistics
27 pages
Introduction To Business Statistics Through R Software: Software
From Everand
Introduction To Business Statistics Through R Software: Software
Editor IJSMI
No ratings yet
Introduction To Non Parametric Methods Through R Software
From Everand
Introduction To Non Parametric Methods Through R Software
Editor IJSMI
No ratings yet
Machine Learning - A Complete Exploration of Highly Advanced Machine Learning Concepts, Best Practices and Techniques: 4
From Everand
Machine Learning - A Complete Exploration of Highly Advanced Machine Learning Concepts, Best Practices and Techniques: 4
Peter Bradley
No ratings yet
Descriptive Statistics: Six Sigma Thinking, #3
From Everand
Descriptive Statistics: Six Sigma Thinking, #3
Sumeet Savant
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.