0% found this document useful (0 votes)
97 views7 pages

Correlation and Regression

This document discusses correlation and regression. Correlation shows the association between two variables, while regression shows the value of one variable based on the other. There are different types of correlation, including positive and negative correlation. Regression analysis establishes a mathematical relationship between variables in order to predict values using regression equations derived through least squares methods. Common regression techniques include direct methods and calculating regression coefficients.

Uploaded by

mr haldar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
97 views7 pages

Correlation and Regression

This document discusses correlation and regression. Correlation shows the association between two variables, while regression shows the value of one variable based on the other. There are different types of correlation, including positive and negative correlation. Regression analysis establishes a mathematical relationship between variables in order to predict values using regression equations derived through least squares methods. Common regression techniques include direct methods and calculating regression coefficients.

Uploaded by

mr haldar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 7

CORRELATION AND REGRESSION

Correlation shows Association or relation between two variables whereas regression shows the value of the variable based
on other Bivariate data.
 Bivariate Data are the data collected for tow variables irrespective of time.
 Can be Marginal and conditional distribution.
 Collected for 2 variables at the same time.
 For distribution p+q, a number of cells are pq.
 Some cells may be 0.
 For p*q, marginal distribution is 2, and conditional are p+q.

Correlation
The change in one variable is Reciprocated by a corresponding change in the other variable either directly or inversely; then
the two variables are known to be associated or correlated. Correlation analysis aim is establishing relation between two
variables and measuring the extent of relation between two variable.
These are of two types of correlation
1) Positive correlation 2) Negative correlation
The value of correlation is between -1 to 1, -1 means perfect negative, 0 to -1 means negatives, 0 means no correlation,
while 0 to 1 means positive and +1 means perfect positive correlation.
Scatter diagram:- this is a simple diagrammatic method to establish a correlation between a pair of variables, and it is used of
linear and non-linear (curvilinear) distribution.
Karl Pearson’s Product moment correlation coefficient
 The coefficient of correlation is a unit free measure.
 The coefficient of correlation is unaffected by change the origin or scale, but it changes its sign with the change of sign of
variables. If the sign of both variables is the same, r remains the same. While if sign differs r sign also changes.
 The coefficient of correlation always is between -1 and 1, including both limiting values.
 Spurious correlation means the correlation between two variables has no causal relation.
 Product moment correlation coefficient is considered for finding the nature of correlation and the the amount of correlation.
 Product moment correlation coefficient may be defined as the ratio of covariance between the variable to the product of their
standard deviations.
r= rxy = Where Cov (x,y) = or Cov (x,y)=
and
Spearman’s rank correlation:-
For finding the correlation between two attributes we consider
rR=
Where rR denotes rank correlation coefficient, and it lies between -1 and 1 inclusive of these two
Values. d= Rx – Ry. The Rx and Ry are Rankings given to x and y series. Ranks are given in descending order. 1 is given to highest
and so on.
In case the same Ranks are received by individuals, then

[∑ ∑
]
3
2 𝑚 −𝑚
1 −6 ⅆ +
12
𝑛 ( 𝑛2 − 1 )
m represents the number of repetitions. (m3 – m) will come under number of times numbers are repeated.
Coefficient of concurrent deviations:-
It is the quickest method to find correlation between two variables.
rc =

If (2c-m)> 0, then we take the positive sign both inside and outside the radical sign, and if (2c – m) < 0, we are to consider the
negative sign both inside and outside the radical sign.
C= no. of positive signs
m= n-1,
n= no. of observations

Regression
In regression analysis, we are concerned with the estimation of one variable for a given value of another variable or establishing
a mathematical relationship between two variables and predicting for deriving the regression equations is knowns as least square
method.
If y= a+bx, a and b are regression parameters, regression equation y on x, b yx methods based on least square. Regression
coefficient also represents the shape of regression equations.

byx= , byx= a= y –bx [where y= mean of y, x= mean of x] for solving a and b


cov ( 𝑥 , 𝑦 )
2
𝑠𝑥
Direct method
byx=
For solving equation bxy replace x with y.
byx=
The regression coefficients remain unchanged due to a shift of origin but change due to a shift of scale.
This property states that if the original pair of variables is (x,y) and if they are changed to the pair (u,v) under:
u= a+bx
v= c+dy
buv= xy
bvu=
 The regression equations are interested in their means.
 The coefficient of correlation between two variables x and y is the simple G.M of the two regression
coefficients. The sign of the correlation coefficient would be the common sign of the two-regression
coefficients.
r= ±
 Correlation coefficient measuring a linear relationship between the two variables indicate the amount of variation of one
variable accounted for by the other variable. A better measure for this purpose is provided by the square of the correlation
coefficient would be the common sign of the two- regression coefficients.
r2= explained variance / total variance
 Coefficient of non-determination = 1-r2
 Two regression lines coincide when r = -1 or 1. And perpendicular if r =0
 Product of regression coefficient must be numerically less than 1.

Probable error:-
 It is a method of obtaining the correlation coefficient of population. It is defined as
P.E = 0.674
S.E = Standard error of correlation coefficient.
S.E =
The limit of correlation coefficient = r ± P.E.
= r > 6 P.E. = presence of correlation is certain.
= r < P.E. = No evidence of correlation

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy