Psychological Statistics
Psychological Statistics
- the basis of statistician on how they get the sample size whenever they conduct a study or
experiment
- if you have a population w/ a given mean [M] and standard deviation and take SUFFICIENTLY
LARGE RANDOM SAMPLE from the population w/ replacement, then the DISTRIBUTION of
the sample means will be approximately normally distributed
- based on z score
1. Parametric T
- more efficient to use whenever we use a survey ; - to use this, dapat normally distributed ang
distribution
- Kapag sapat ang nakuha na random sample, pwede i-assume na normally distributed ang
distribution ng sample mean
Q: Gaano kalaki "sufficiently large" random samples? How large is sufficiently large?
2. Non-Parametric T
1. Slovin's Formula
- works for bigger population na 'di natin alam o wala pa idea sa population
Example:
- 5% margin of error is not constant, it's up to you. Depende sa ilalagay na level of significant
sa study
- mas maliit margin of error ng mga nasa med field like expert sa paggawa ng gamot kasi buhay
na pinag uusapan at 1% lang sini set na margin of error. Almost all 5%
- 400 lang sinu-suggest na formula ni slovin na sample size na pwedeng kunin of popu size ay
54 m
- kapag super laki ng popul size, 'di na ok gamitin slovin's. Use the 2nd formula
Example:
Q: how to distribute 316 sa 6 provinces e ang popul size ng bawat prov ay iba iba?
- kapag mas malaki ang popul, mas malaki rin 'yong kukuning
sample size to ensure na ung proportion o ratio ng sample
size ay same at sample size of each province is proportionate
to their population size. How do we ensure that? With the use
of this formula:
- usually ginagamit 0.5 but pwede magbago like if may nabasa kang past study at may sinabi
abt sa percentage kung ilang percent mangyayari kaya pwedeng gawing p is 0.8/80%
Confidence level (CL) - gives us the percentage kung gaano ka confident na totoo o accurate
ang study natin ; beta value (% accurate study)
- by default, we use the CL at 5% bc 95% means it's the % na confidence ka na accurate ang
study mo. 5% lang chance na magkaka error. So how to get 95% CL? Sa z table, ang equiv ng
95% ay 1.96. z= 1.96 (95% is the area in normal distrib.
Ex: [46:56]
--> correlational study, may 2 grp or variables gusto mo malaman kung may kinalaman ba o
relationship ung pag-aaral mo para maging successful sa future. At least 50 sample size
Continuation...
- if we want to have a smaller margin of error, magiging mas malaki sample size na
i-su-suggest. the bigger the sample size, it follows the confidence level or vv.
- usually sa mga social studies, may mga nag se-set ng 10%. meaning mas malaki ang
discrepancies ng result ng data nila.
Confidence Interval- (range of values) meaning, may lowest at highest value. Ex: (election
prediction du30) 32-34% pwedeng maging actual result ng election wherein 32 is the lowest
value while 34% is the highest
Margin of error- gives un an idea kung saan pwede maglaro actual data
- if 5% MOE, mas maliit sample size na need kunin pero mas malaki magiging confidence
interval bc maliit sample size magiging less accurate. since less accurate, margin of error mas
malaki and it follows confidence interval mas lalaki
- in social studies, MOE is set to be 5% and it follows from the level of significance
Confidence level- measure of how confident u are in a given MOE. Meaning, ito magsasabi ng
confidence kung gaano kalapit sa actual result ng survey.
Inferential Statistics- we want from the start to get the idea of the whole POPULATION or
insight of the whole population sa isang study. However, due to some constraints, hindi
makukuha lahat ng population. So what we do is conduct a study thru sampling techniques para
makakuha ng % o part ng population and we'll use the sample size that we have gather to get
an idea w/ regards to population. Since sample & part lang, 'di perfect. Hindi nito exact
makukuha ang mean if want to compare the mean of popula w/ sample mean.
- In a normally distributed distribution, the closer we are to the mean, the better
Level of Significance- % kung ilang % magiging error
Confidence Level & Level of Significance are two related, Kung ilan natira sa Confidence
level, 'yon 'yong level of significance. Ex: [20:06] 95% is the CL while natitirang 5% is the LOS o
% na magkakaro'n ng error o magkakaro'n ng sample mean sa labas
HYPOTHESIS TESTING
- main goal of inferential stats, to draw inferences, have an idea of the population and to do that,
we need to gather data from the sample and we will treat that sample para makuha insight ng
population. Parang bridge ang sample para makuha pulso ng population and we do that by
hypothesis testing
- haka-haka or assumptions
Statistical Hypothesis- a conjecture about population parameters. This conjecture may or may
not be true. [ Conjecture is like an idea that has not been proven but obvious na true ]
2 TYPES OF HYPOTHESIS
- states that there is no difference between the data that we gather from our respondents vs
the general population.
[ Parameters are the values/measures that we get from our population like popula mean,
popula stand dev, p median and p mode are parameters. The Population Mean is the
parameter. The sample mean is the statistics- lahat ng values na nakukuha galing sa sample
ay statistics like sample mean, sample median, sample mode and stand dev.
1. Two-tailed test- used when the hypotheses are non-directional. Meaning, the only focus is
to determine if there is a DIFFERENCE OR NOT. Increase and decrease is not the primary
concern the important is if there is a difference on the result ; 2 boundaries and 2 critical region ;
not concern if we increase or nah ; two-tailed, hahatiin sa 2 (CL *??*) ; - always ang CV ng isa
positive at isa ay negative
2. One-tailed test - used when the hypothesis are directional. Meaning, it matters kung sinong
mas malaki o maliit. Mas maliit ba samples mean compared to population mean? ; isisiksik mga
.. of errors sa isang part. Mas malaki chance na mag commit ng mistake sa part na 'yon bc naka
one-tailed lang. ; concern if may increase or decrease ; 1 lang ang confidence level.
Example:
u=/= 82bpm
Example: electrician
- z and t tests, ANOVA, Chi-square(non-para), Scheffe's and Tukey tests. t test for the
correlation coefficient, and many more
- usually, mga test na ginagamitan ng table ay parametric tests bc they are based on the table
and table are based on normal distribution
- probability na mag fail ang test value ; % or chance na 'yong test value na makukuha ay
mapupunta sa critical region. Critical region na nasa extremes na maghahati sa areas. Mga
nasa gitna ay non-critical region. meaning pag test value ay makukuha sa gitna or between
boundaries
to minimize the error, by setting the level of significance smaller or bigger (?bigger?)
Type 2 Error - occurs if we accept the null hypothesis when it is actually false
- If totoo ang haka-haka, ifi-fail to reject. Pag may pinagkaiba, ire-reject o 'di tatanggapin ang Ho
- Ang correct decision ay may pagkakaiba, and lumabas sa study mo na wala nmn difference.
Kaya ginawa mo finail to reject mo ang alternative hypo. Sinabi na wala silang pinagkaiba pero
past studies shown na meron difference. Like u contradict past studies and this is type 2 error
- there is no perfect statistical test but we can minimize the error na pwede ma commit that is by
setting the level of significance. The smaller the LOS, the better bc maliit chance na mag
commit ng Type 1 error
Critical Region- boundary kung saan hinahati ang area ng normal distribution ; can be get from
the CONFIDENCE LEVEL (90%, 95%) Meaning, .95 so hahanapin 'yong area na 'yan sa z table
and each statistical test ay may kanya kanyang table
- test value makukuha sa STEP 2 (Identify 1/2 tailed test, and formula to use
06/14/21
TESTING CLAIM ABOUT A PROPORTION
Assumptions:
1. The sample observations are a SIMPLE RANDOM SAMPLING ( bunot lang or fishbowl
2. There are a fixed number of independent trials having constant probabilities, and each trial
has two outcome categories of success or failure. [ 8:50 for sample ]
3. The conditions np > 5 and nq > 5 are both satisfied ( n = sample size , ❎ p = probability that
✅
we'll be having a success , q = prob na mag fail ) [ if multiply probability na mag success X sa
samp size, it should be greater than or at least 5 ang value
n- sample size
p- population proportion
q- prob to fail
STEP 1:
STEP 2:
- for a ONE-tailed test, kapag CL ay 95%, meaning ung boundary na naghahati sa 95% at 5%
ay 1.645
- if 97.5 ang CL, meaning ang alpha ay 2.5% kaya ang CV na gagamitin ay 1.96
- if naka 95% CL, 5% LOS, at TWO-tailed test, and CV ay 1.96
STEP 5:
- compare computed z value sa critical value
STEP 1 :
STEP 5:
- compare computed z value sa critical value
--> kapag 'yong computed value ay nasa loob ng safe zone, mag fa-fail to reject. Kapag
nasa labas, mag re-reject Ho.
--> if computed value is within the non-critical region (safe zone), failed to reject Ho.
[ it states failed to reject than accept Ho bc it's not actually accepting the hypothesis.
Just showing that evidence is not enough to contradict our claim. Need more evidence.]
- determining if the sample proportion is different from the proportion in the population.
Look if may difference sa result na nakuha sa data vs whole population
TESTING ABOUT MEAN: σ population means is known ; Nag conduct study at alam
standard dev ng buong population
ASSUMPTIONS
where:
Example:
STEP 1:
( m=40.0 cm)
x = 0.05
CL = 95%
CV: 1.96
STEP 4:
z=
t,
STEP 5:
Reject Ho. Means, there is enough evidence to conclude that the mean head
circumference of all two month old babies is not 40.0 cm.
- Accepting alternative hypothesis that mean head circumference of all the babies is not equal to
40 cm
There is enough evidence to conclude that the mean head circumference of all two
month old babies is not 40.0 cm.
STEP 1:
STEP 2:
One-tailed test
z test
STEP 3:
x = 0.05
CL= 95%
CV = 1.645
STEP 4:
STEP 5:
Reject Ho
There is enough evidence to conclude that the average test score of the students is
higher than 100
Assumptions:
Example:
Ha: