0% found this document useful (0 votes)
19 views4 pages

Statistics and Probability

Statistics
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
19 views4 pages

Statistics and Probability

Statistics
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Tangub City Global College Levels of Measurement

Maloro, Tangub City, Misamis Occidental 7214 Nominal variable is a qualitative variable that
characterizes/describes an element of a population. These are
numbers that do not mean anything; they are just labels. It is the
Mathematics lowest level of measurement.
Statistics and Probability Example: SSS number, gender, hair color, hometown
Descriptive statistics-concerned with the collection,
organization, and presentation of data. Thus, descriptive Ordinal variable is a qualitative variable that incorporates an
statistical analysis aims to summarize some of the important ordered position or ranking. These are numbers that are used to
features of a set of data. Construction of tables, charts, and label and rank. The order/rank is meaningful.
graphs, and computations of measures such as averages and Example: satisfaction ratings, class ranking, size of t-shirt
percentages fall within this area of statistics.
Interval numbers are used to label + rank and do not have a
Inferential statistics-concerned with the formulation of true zero.
conclusions or generalizations about a population based on an Example: temperature
observation or a series of observations of a sample drawn from
the population. Ratio numbers are used to label + rank + equal unit of interval
and do have true zero.
Variables are properties or characteristics of some event, Example: number of votes
object, or person that can take on different values or amounts.
Sampling Techniques
Data are the values (measurements or observations) that the
variables can assume. Data are facts, or a set of information Probability Sampling
gathered or under study. 1.) Simple Random Sampling
This is the simplest form of random sampling where
Independent and dependent variables every subset of size n of the population has an equal chance of
When conducting research, experimenters often being selected.
manipulate variables. For example, an experimenter might
compare the effectiveness of four types of antidepressants. In 2.) Systematic Sampling
this case, the variable is “the type of antidepressant.” When a This is also called interval sampling. It means that there
variable is manipulated by an experimenter, it is called an is a gap or interval between each selection. Samples are
independent variable. randomly chosen following certain rules set by the researchers.
This involves choosing the kth member of the population, with
The experiment seeks to determine the effect of the =N/n , but there should be a random start.
independent variable on relief from depression. In this example,
relief from depression is called a dependent variable. Example: Choose a sample of size 10 from N= 500, using
systematic random sampling.
In an experiment on the effect of sleep on memory, the Step 1: Determine k (period); =500/10=50 , so this means that
independent variable is you have to include every 50th member of N after choosing a
a. number of hours of sleep random start.
b. recall score on a memory test Step 2: Put the random start at 15.
c. gender of the respondents Step 3: Include in the samples the following: 15,
d. gender of the researcher 65,115,165,215,265,315,365,415, and 465.

Qualitative Variables are categories that differ according to the 3.) Stratified Sampling
characteristics that they possess. They describe some This method is used when the population is too big to
characteristics of the person. These are attributes that cannot handle, thus dividing N into subgroups, called strata, is
be subjected to meaningful arithmetic. necessary. Samples per stratum are then randomly selected,
Example: gender, religion, occupation but consideration must be given to the sizes of the random
samples to be selected from the subgroups.
Quantitative Variables is a variable that quantifies an element
of a population. They are numerical in nature and therefore 4.) Cluster Sampling
meaningful arithmetic can be done. Cluster sampling is sometimes called area sampling
Example: age, salary, length of service because it is usually applied when the population is large. In this
technique, groups, or clusters instead of individuals are
randomly chosen. Recall that in simple random sampling, you
select members of the samples individually. In cluster sampling,
you will draw the members of the sample by group or cluster and The median is 84.5
then you select a sample from each group or cluster individually. Mode: since 87 appears twice and each of the other
number appears only one, the mode is 87.
Non-probability sampling
1.) Convenience Sampling Measures of Dispersion
This type is used because of the convenience it offers
to the researcher. Range: The range is the simplest measure of dispersion. It is
Example: Gathering of data through the telephone. the difference between the maximum and minimum
values in a dataset. For example, if you have a
2.) Quota Sampling dataset of test scores ranging from 60 to 90, the
This is very similar to stratified random sampling. The range is 90 - 60 = 30.
only difference is that the selection of the members of the
samples in stratified sampling is done randomly. Interquartile Range (IQR): The IQR is the range of the middle
Example: To get the most popular noontime show, each field 50% of data values when the dataset is ordered. It
researcher is given a quota of say 200 viewers per area. is calculated as the difference between the third
quartile (Q3) and the first quartile (Q1) and is less
3.) Purposive Sampling sensitive to outliers than the range.
Choosing the respondents based on pre-determined
criteria set by the researcher. Variance: Variance measures the average of the squared
Example: Suppose the research is all about the level of maturity differences between each data point and the mean.
of teenage parents in a particular school. Of course, only It provides a measure of how data points deviate
teenage parents in that school will be the respondents. from the mean.

4.) Snowball Sampling Standard Deviation: The standard deviation is the square root
The survey subjects of snowball sampling are selected of the variance. It is used to measure the spread of
based on referral from other survey respondents or selecting a data in the same units as the data. A lower standard
sample using networks. deviation indicates less variability, while a higher
Example: A researcher wanted to study the factors why some standard deviation indicates greater variability.
students occasionally use prohibited drugs. He intended to get
50 students, but he only knew 5 students who used it. By getting Coefficient of Variation (CV): The coefficient of variation is the
the cooperation of these 5 students, he was referred to other ratio of the standard deviation to the mean,
drug users, who in turn also provided additional contacts. In this expressed as a percentage. It's used to compare
way, he was able to get a sufficient number of students he the relative variability of different datasets,
needed. especially when they have different units or scales.

Measures of Position
Measures of Central Tendency Provide a way to understand the position of a specific data
Mean – the sum of all the data values divided by the number point or observation within a dataset in relation to
of values. the rest of the data.
Median – the number that separates the list of data into two
equal parts. To find the median, list the data in order Percentiles:
from the smallest to largest. If the number of data is Percentiles divide data into 100 equal parts. For example,
odd, the median is the middle number. If the the 25th percentile (P25) is the value below which
number of data is even, the median is the mean of 25% of the data falls.
the two middle numbers. If a student scores in the 75th percentile on a standardized
Mode – the number in the list that occurs the most test, it means they performed better than 75% of
frequently. the test-takers.

Example: Consider the following list of test scores: Quartiles:


87, 56, 69, 87, 93, 82. Quartiles divide data into four equal parts (quarters). The
Mean:
87+56+69+87+93+82
=
474
= 79 three quartiles are Q1 (the 25th percentile), Q2 (the
6 6 median, or 50th percentile), and Q3 (the 75th
The mean is 79.
percentile).
Median: 56, 69, 82, 87, 87, 93
Q1 represents the lower 25% of the data, and Q3
Take the mean of 82 and 87,
82 + 87 169 represents the upper 25% of the data.
= = 84.5
2 2
Z-Scores (Standard Scores): Example: Testing if there is a significant difference in the
Z-scores measure how many standard deviations a data mean test scores of students before and after a
point is from the mean of a dataset. specific intervention.
A z-score of 1 indicates a data point one standard deviation
above the mean, while a z-score of -1 indicates a Nonparametric Hypothesis Tests
data point one standard deviation below the mean.
Mann-Whitney U Test
The Mann-Whitney U test is a nonparametric test
Hypothesis Testing: used to compare two independent samples to determine if they
are significantly different, especially when the data is not
Null Hypothesis: normally distributed.
• There is no significant difference, there is no Example: Testing if there is a significant difference in the
significant relationship. distribution of exam scores between two groups of
• Uses =, ≥ or ≤ students.

Alternative Hypothesis: Wilcoxon Signed-Rank Test


• There is a significant difference, there is a The Wilcoxon Signed-Rank test is used to compare
significant relationship. two related or paired samples to determine if they are
• Uses ≠, > or < significantly different from each other, especially when the data
is not normally distributed.
Reject null hypothesis: Example: Testing if there is a significant difference in the
• If p-value is less than the significance level. rankings of product preferences before and after a
• If the test statistic is within the critical region. marketing campaign.

Parametric Hypothesis Tests Kruskal-Wallis Test


The Kruskal-Wallis test is a nonparametric test used
One-Sample T-Test to compare three or more independent samples to determine if
A one-sample T-test is used to assess whether the there are significant differences among them, especially when
mean of a sample is significantly different from a known or the data is not normally distributed.
hypothesized population mean when the population standard Example: Testing if there are differences in the pain tolerance
deviation is unknown. levels of patients under different treatment methods.
Example: Testing if the mean exam scores of a sample of
students are significantly different from a Chi-Squared Test for Independence
hypothesized population mean score. The chi-squared test for independence is used to
determine whether there is an association between two
Independent Samples T-Test categorical variables in a contingency table.
An independent samples T-test is used to compare Example: Testing if there is an association between gender
the means of two independent samples to determine if they are (male or female) and voting preference (e.g.,
significantly different from each other. Democrat, Republican, or Independent).
Example: Testing if there is a significant difference in the
mean test scores between two groups of students Spearman's Rank-Order Correlation
who received different teaching methods. Spearman's rank-order correlation is a nonparametric
test used to assess the strength and direction of the
One-Way Analysis of Variance (ANOVA) association between two ordinal variables.
ANOVA is used to compare the means of three or Example: Examining the association between the ranking of
more independent groups to determine if there are significant different brands and consumer preferences (e.g.,
differences among them. ranked as first, second, third).
Example: Testing if there are differences in the mean test
scores among three or more groups of students who Kendall's Tau Rank Correlation
received different types of instruction. Kendall's Tau rank correlation is another
nonparametric test used to measure association between two
Paired Samples T-Test ordinal variables, taking into account the concordant and
A two-sample paired T-test is used to compare the discordant pairs.
means of two related or paired samples to determine if they Example: Investigating the association between the rankings
are significantly different from each other. of various restaurants by food critics and customer
reviews (e.g., rankings by excellent, good, or poor)
Probability 4. Question: If you flip a fair coin three times, what is the
probability of getting at least one head?
Probability is a fundamental concept in mathematics and Answer: c) 7/8
statistics that quantifies the likelihood of an event
occurring. It is expressed as a number between 0
and 1, where 0 represents an impossible event, and Counting principles
1 represents a certain event. Here's a formal Also known as counting techniques or combinatorial
definition: principles, are fundamental concepts in combinatorics, a
branch of mathematics that deals with counting, arranging, and
Probability (P) of an event A is defined as: selecting objects. These principles provide systematic methods
for determining the number of possible outcomes in various
counting scenarios. Here are some key counting principles and
examples:

Union (OR) 1. Multiplication Principle (Fundamental Counting


The union of two events A and B, denoted as A ∪ B, Principle):
represents the event that either A or B (or both) occurs.
The multiplication principle states that if one event can occur in
The probability of the union of two events is calculated as m ways, and a second independent event can occur in n ways,
P(A∪B)=P(A)+P(B)−P(A∩B) then there are m×n ways for both events to occur together.
if A and B are not mutually exclusive (they can both occur). Example: If you have 4 choices for a meal (e.g., burger, pizza,
salad, or sandwich) and 3 choices for a drink (e.g., water,
Intersection (AND): soda, or juice), you have
The intersection of two events A and B, denoted as A ∩ B, 4×3=12 meal and drink combinations.
represents the event that both A and B occur simultaneously.
The probability of the intersection of two events is calculated 2. Permutations:
as
P(A∩B)=P(A)⋅P(B) Permutations are arrangements of objects in a specific order.
if A and B are independent events. The number of permutations of n distinct objects taken r at a
time is denoted as
Complement (NOT): P(n,r) and calculated as
The complement of an event A, denoted as A', represents the P(n,r)= n! / (n−r)!
event that A does not occur. where
The probability of the complement of an event is calculated as n is the total number of objects,
P(A′)=1−P(A). r is the number of objects chosen, and
n! is the factorial of n.
Conditional Probability: Example: How many ways can you arrange 3 books from a set
Conditional probability, denoted as P(A | B), of 6?
represents the probability of event A occurring given that event
B has occurred. 3. Combinations:
The conditional probability is calculated as Combinations are selections of objects without regard to order.
P(A∣B)= P(A∩B) / P(B) The number of combinations of n distinct objects taken r at a
where P(A ∩ B) is the joint probability of both A and B time is denoted as C(n,r) and calculated as
occurring, and P(B) is the probability of B occurring. C(n,r)= n!/ [r!⋅(n−r)!]

Examples: .
1.) What is the probability of rolling a 6 on a fair six-sided die? Example: How many ways can you choose 2 players from a
Answer: a) 1/6 group of 10 to form a doubles tennis team?
2. Question: In a deck of 52 playing cards, what is the
probability of drawing a red card (hearts or diamonds)?
Answer: b) 1/2 Arrange in circle
3. Question: You are drawing cards from a deck without (n-1)!
replacement. What is the probability of drawing two
consecutive aces?
Answer: b) 1/221

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy