0% found this document useful (0 votes)
43 views30 pages

Correlation and Regression Analysis: Pembe Begul GUNER

This document discusses correlation and regression analysis. Correlation examines the relationship between two or more variables, and can be positive, negative, linear, or non-linear. Regression analysis examines how one variable changes when another variable is altered, and is used to predict unknown values. The coefficient of correlation measures the strength of association between variables from -1 to 1. Regression equations are developed to predict the values of one variable based on the known values of another variable. Examples are provided to illustrate key concepts.

Uploaded by

Muhammad Umair
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
43 views30 pages

Correlation and Regression Analysis: Pembe Begul GUNER

This document discusses correlation and regression analysis. Correlation examines the relationship between two or more variables, and can be positive, negative, linear, or non-linear. Regression analysis examines how one variable changes when another variable is altered, and is used to predict unknown values. The coefficient of correlation measures the strength of association between variables from -1 to 1. Regression equations are developed to predict the values of one variable based on the known values of another variable. Examples are provided to illustrate key concepts.

Uploaded by

Muhammad Umair
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 30

Correlation and Regression

Analysis
Pembe Begul GUNER
Contents
 Introduction …………………………………………………………….3
 Correlation Analysis…………………………………...............4
 Positive and Negative Analysis………………………………….5
 Negative Analysis………………………………………………………8
 Linear and Non-Linear Correlation………………………….11
 The Coefficient of Correlation………………………………… 14
 Regression Analysis…………………………………………………19
 Types of Regression Models…………………………………….20
 Regression Equation………………………………………………..21
 Population Linear Regression………………………………….23
 Linear Regression Assumptions………………………………24
 Population Linear Regression………………………………...25
 Estimated Regression Model…………………………………..26
 Specify the Source…………………………………………………..30
Introduction

 Correlation analysis: Examines between two or more variables the


relationship.

 Regression analysis: Change one variable when a specific volume,


examines how other variables that show a change.
Correlation Analysis

There are two important types of correlation.

(1) Positive and Negative Correlation

(2) Linear and Non – Linear Correlation


Positive and Negative Correlation

If the values of the two variables deviate in the same


direction i.e. if an increase (or decrease) in the values of one
variable results, on an average, in a corresponding increase
(or decrease) in the values of the other variable the
correlation is said to be positive.
Some examples of series of positive
correlation are:
 Heights and weights;
 Household income and expenditure;
 Price and supply of commodities;
 Amount of rainfall and yield of crops.
Negative Correlation

Correlation between two variables is said to be


negative or inverse if the variables deviate in opposite
direction. That is, if the increase in the variables deviate in
opposite direction. That is, if increase (or decrease) in the
values of one variable results on an average, in corresponding
decrease (or increase) in the values of other variable.
Some examples of series of negative
correlation are:
 Volume and pressure of perfect gas;
 Current and resistance [keeping the
voltage constant] (R =V / I) ;
 Price and demand of goods.
Linear and Non – Linear Correlation
The correlation between two variables is said to be linear if
the change of one unit in one variable result in the
corresponding change in the other variable over the entire range
of values.

For Example;

Thus, for a unit change in the value of x, there is a constant


change in the corresponding values of y and the above data can
be expressed by the relation ;
 y = 3x +1
In general ;
 y= a + bx
The relationship between two variables is said to be non
– linear if corresponding to a unit change in one variable,
the other variable does not change at a constant rate but
changes at a fluctuating rate. In such cases, if the data is
plotted on a graph sheet we will not get a straight line curve.
For example, one may have a relation of the form

 y = a + bx + cx2

or more general polynomial.


The Coefficient of Correlation

One of the most widely used statistics is the coefficient of


correlation ‘r’ which measures the degree of association
between the two values of related variables given in the data
set.
• It takes values from + 1 to – 1.
• If two sets or data have r = +1, they are said to be
perfectly correlated positively .
• If r = -1 they are said to be perfectly correlated
negatively; and if r = 0 they are uncorrelated.
For Example: A study was conducted to find whether there is any
relationship between the weight and blood pressure of an individual. The
following set of data was arrived at from a clinical study. Let us determine
the coefficient of correlation for this set of data. The first column represents
the serial number and the second and third columns represent the weight
and blood pressure of each patient.
r = 0,5966
Regression Analysis
Regression analysis, in general sense, means the
estimation or prediction of the unknown value of one
variable from the known value of the other variable. It
is one of the most important statistical tools which is
extensively used in almost all sciences – Natural, Social
and Physical. It is specially used in business and
economics to study the relationship between two or
more variables that are related causally and for the
estimation of demand and supply graphs, cost
functions, production and consumption functions and so
on.
Regression Equation
Suppose we have a sample of size ‘n’ and it has two sets
of measures, denoted by x and y. We can predict the values
of ‘y’ given the values of ‘x’ by using the equation, called the
regression equation.
y* = a + bx
where the coefficients a and b are given by

The symbol y* refers to the predicted value of y from a given


value of x from the regression equation.
Example:
Scores made by students in a statistics class in the mid
- term and final examination are given here. Develop a
regression equation which may be used to predict final
examination scores from the mid – term score.
Solution:
We want to predict the final exam scores from the mid
term scores. So let us designate ‘y’ for the final exam scores
and ‘x’ for the mid – term exam scores. We open the
following table for the calculations.
Specify the Source
 https://medicine.tcd.ie/neuropsychiatric-
genetics/assets/pdf/2009_4_Regression.pdf
 http://www2.sas.com/proceedings/forum2008/364-
2008.pdf
 http://stud.pam.szczecin.pl/edu/eng/Chapter-5.pdf
 http://www.surgicalcriticalcare.net/Statistics/correlation.pd
f
 http://www.personal.kent.edu/~mshanker/personal/Classe
s/f06/ch13_F06.pdf
 http://pages.intnet.mu/cueboy/education/notes/statistics/
pearsoncorrel.pdf

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy