Lecture 1
Lecture 1
10
Statistics Supports The Creative Process
11
Why Do We Study Probability and Statistics?
The Five Words of Statistics
A population is the collection of all items of interest or under
investigation
N represents the population size
A sample is an observed subset of the population
n represents the sample size
A variable is characteristic of an item or an individual that will be
analyzed using statistics.
A parameter is a specific characteristic of a population
A statistic is numerical measure that describes a variable
(characteristic) of a sample (part of a population).
Population vs. Sample
Population Sample
a b cd b c
ef gh i jk l m n gi n
o p q rs t u v w o r u
x y z y
Measurement scales:
- nominal
- ordinal
- interval
- ratio
The branches of statistics
Descriptive statistics
• Collecting, summarizing, and processing data to transform
data into information
Inferential statistics
• Provide the bases for predictions, forecasts, and estimates
that are used to transform information into knowledge
Data comprises raw, unprocessed facts that need context to become useful,
while information is data that has been processed, organized, and
interpreted to add meaning and value.
Descriptive Statistics
• The daily average temperature range of Mostar was 25 last
week
• The average age of athletes who participated in the Berlin
Marathon was 25 years
• 50% of teachers at IBU are female
• The score of 200 students in Calculus 1 are found to range
from 0 to 80
Descriptive
Collect data
e.g., Survey
Present data
e.g., Tables and graphs
Summarize data
e.g., Sample mean = X i
Surveys
A process that uses questionnaires or similar means to gather values for the responses from a set of
participants.
Experiments
A study that examines the effect on a variable of varying the value(s) of another variable or variables, while
keeping all other things equal. A typical experiment contains both a treatment group and a control group. The
treatment group consists of those individuals or things that receive the treatment(s) being studied. The control
group consists of those individuals or things that do not receive the treatment(s) being studied.
Observations
Individuals are conducted in a way that doesn't change the response or the variable being measured.
Data collection
Sampling Concepts
• Frame The list of all items in the population from which the sample will be selected.
• Sampling The process by which members of a population are selected for a sample.
• Probability Sampling A sampling process that considers the chance of selection of each
item. Probability sampling increases your chance that the sample will be representative
of the population.
• Simple Random Sampling Every possible sample of a certain size has the same chance of
being selected as every other sample of that size.
Probability
sampling
Non-
probability
sampling
Sampling Concepts
• Stratified sampling: Divide the entire population into distinct subgroups called strata.
The strata are based on a specific characteristic such as age, income, education level,
and so on. All members of the stratum share the specific characteristic. Draw random
samples from each stratum.
• Systematic sampling: Number of all members of the population sequentially. Then, from
a starting point selected at random, include every kth member of the population in the
sample.
• Cluster sampling: Divide the entire population into pre-existing segments or clusters.
The clusters are often geographic. Make a random selection of clusters. Include every
member of each selected cluster in the sample.
• Convenience sampling: Create a sample by using data from population members that
are readily available.
Random Sampling
Simple random sampling is a procedure in which:
Knowledge
Experience, Theory,
Literature, Inferential
Statistics, Computers
Information
Descriptive Statistics,
Begin Here: Probability, Computers