Elementary Statistics and Probability Chapter 1 3
Elementary Statistics and Probability Chapter 1 3
2 Types of Statistics:
● Descriptive- includes the collection, presentation, and description of sample data
● Inferential- the technique of interpreting the values resulting from the descriptive techniques and
making the decisions and drawing conclusions
Process:
1. Defining the situations/ problems
2. Collecting the data
3. Summarize and calculate
4. Analyze and make a decision
Types of Variables
1. Qualitative Variables- describes or categorizes an element of a population
a. Nominal- characterize (or describes, or names) an element of a population
i. Examples: hair color, hometown, gender, zip code, plate number
b. Ordinal- incorporates an ordered position, or ranking
i. Examples: level of satisfaction, class rank, Olympic medals
2. Quantitative Variables- quantifies an element of a population (numbers)
a. Discrete- can assume a countable number of values
i. Examples: Number of planets around the Sun, number of students in a class
b. Continuous- assume an uncountable number of values
i. Examples: Number of stars in the space , height or weight of the students in a
particular class
Data Collection:
Sampling Methods- the process of selecting items or events that will become the sample
1. Biased Sampling- produces data that systematically differ from the sampled population. Repeated
sampling will not correct the bias
a. Convenience samples- sometimes called a grab sample, occurs when items are chosen
arbitrarily and in an unstructured manner from a population
b. Volunteer samples- consist of results collected from those elements of the population on
their own initiative
2. Unbiased Sampling- not biased and produces data that are the representative or the sampled
population
a. Single-stage sampling- a sample design in which the elements of the sampling frame are
treated equally and there is no subdividing or partitioning of the frame
i. Simple random sample- every element in the population or sampling frame has an
equal probability of being chosen. Equivalently, all samples of size n have an equal
chance of being selected.
ii. Systematic sampling method- every kth item of the sampling frame is selected,
starting from a first element, which is randomly selected from the first k elements.
b. Multiple-stage methods- a sample design in which the elements of the sampling frame are
subdivided and the sample is chosen in more than one stage
i. Stratified random sample- obtained by stratifying the population, and then selecting
a number of items from each of the strata by means of a simple random sampling
technique (known as probability sampling)
ii. Proportional stratified sampling- obtained by stratifying the population, and then
selecting a number of items in proportion to the size of the strata from each strata by
means of a simple random sampling technique (known as quota sampling)
iii. Cluster sample- obtained by stratifying the population, and then selecting some or
all of the items from some, but not all, of the strata
*Strata- a subset of a population that shares common characteristics, such as gender, income, or location.
● Cumulative frequency distribution- a frequency distribution that pairs cumulative frequencies with
values of the variable
● Ogive- a line graph of a cumulative frequency or cumulative relative frequency distribution
Measure of Central Tendency- numerical values that locate, in some sense, the center of a set of data
● Mean- also called the arithmetic mean, represented by x̄, the middle point by the weight
● Median- the middle position when the data are ranked in order according to size, represented by x͂
○ Finding the mean involves three basic steps:
■ Rank the data
■ Determine the depth (position) of the median
Formula for ungrouped data: Formula for grouped data: (to find the median class)
● Midrange- the number exactly midway between the lowest data value (L) and the highest data
value (H)
● Range- the difference in value between the highest data value (H) and the lowest data value (L)
Measure of Dispersion- describe the amount of spread that is found among the data
● Variance
● Standard deviation
○ Formula: *use the complete answer (decimal) in variance
Measure of Position- used to describe the position a specific data value possesses in relation to the rest
of the data
*First, rank the data from lowest to highest
● Quartile- divide the data into four equal parts, or quarters
● Decile- divide the data into ten equal parts
● Percentile- divide data into one hundred equal parts
For Ungrouped data:
Step 2: substitute the value of the median class to the median formula
* use the answer in step 1 to substitute in n/2
CHAPTER 3
Linear Correlation- measures the strength (and direction) of a linear relationship between two variables
Interpretation:
● ± 0.0 - ± 0.10 No correlation
● ± 0.11 - ± 0.39 Weak ± Correlation
● ± 0.40 - ±0.69 Moderate ± Correlation
● ± 0.70 - ±0.89 Strong ± Correlation
● ± 0.90 - ± 1 Perfect ± Correlation
Linear Regression
1. Slope
2. Y-intercept