Financial Modelling
Financial Modelling
Financial modeling is the task of building an abstract representation (a model) of a real world financial
situation. This is a mathematical model designed to represent (a simplified version of) the performance of
a financial asset or portfolio of a business, project, or any other investment. Financial modeling is a general
term that means different things to different users; the reference usually relates either to accounting and
corporate finance applications, or to quantitative finance applications. While there has been some debate in
the industry as to the nature of financial modeling—whether it is a tradecraft, such as welding, or a
science—the task of financial modeling has been gaining acceptance and rigor over the years. Typically,
financial modeling is understood to mean an exercise in either asset pricing or corporate finance, of a
quantitative nature. In other words, financial modelling is about translating a set of hypotheses about the
behavior of markets or agents into numerical predictions; for example, a firm's decisions about investments
(the firm will invest 20% of assets), or investment returns (returns on "stock A" will, on average, be 10%
higher than the market's returns).
Financial planning
Financial planning is a continuous process of directing and allocating financial resources to meet
strategic goals and objectives.
This can be also be viewed as a single process that encompasses both operations and financing.
The operating people focuses on sales and production while financial planners are interested on
how to finance the operations.
The output from financial planning takes the form of budgets. The most widely used form of
budgets is Pro Forma or Budgeted Financial Statements. The foundation for Budgeted Financial
Statements is Detail Budgets. Detail Budgets include sales forecasts, production forecasts, and
other estimates in support of the Financial Plan. Collectively, all of these budgets are referred to as
the Master Budget.
Steps in financial planning.
There are six steps to the process of doing a financial plan.
1) Determining objectives for the plan. This is done by:
Quantifying specific amount goals within definite time frames and clarify any financial
goals within those parameters;
Rank objectives according to priorities.
2) The second step of the financial planning process is gathering data (both Qualitative and
Quantitative).
Qualitative provides general information concerning a family’s goals and objectives,
lifestyle, health, and investment-risk tolerance level.
Quantitative provide basic but specific identifying information concerning details of
family’s financial status. Examples include info about investments, cash flow, insurance
coverage's, and present liabilities or other obligations.
3) Processing and analyzing the information gathered. Analyze the information to determine the
strengths and weaknesses. Evaluate the objectives in view of available resources, and economic
conditions as they relate to future resources and cash flow for the firm. It is the planning role to
examine the viable options for achieving the determined objectives.
Page 1 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
4) The fourth step is the actual recommendation of a comprehensive financial plan for the firm. This
is a time for the firm explore about each strategy or product as it relates to solutions for achieving
their goals and dreams.
5) Implementation of the plan.
6) Monitoring the plan. Periodically review of implemented plans to evaluate the significance of any
changes in taxation, economic conditions, and available investment techniques.
Benefits of financial planning
The benefits of financial planning for the organization are
a) Identifies advance actions to be taken in various areas.
b) Seeks to develop number of options in various areas that can be exercised under different
conditions.
c) Facilitates a systematic exploration of interaction between investment and financing
decisions.
d) Clarifies the links between present and future decisions.
e) Forecasts what is likely to happen in future and hence helps in avoiding surprises.
f) Ensures that the strategic plan of the firm is financially viable.
g) Provides benchmarks against which future performance may be measured.
Financial forecasting
A financial forecast is an estimate of future financial outcomes for a company or country (for futures
and currency markets). Using historical internal accounting and sales data, in addition to external
market and economic indicators.
Financial Forecasting describes the process by which firms think about and prepare for the future.
The forecasting process provides the means for a firm to express its goals and priorities and to
ensure that they are internally consistent. It also assists the firm in identifying the asset requirements
and needs for external financing.
Unlike a financial plan or a budget a financial forecast doesn't have to be used as a planning
document. Outside analysts can use a financial forecast to estimate a company's success in the
coming year
Objectives of forecasting
To reduce cost of responding to emergencies by anticipating the future occurrences.
Prepare to take advantage of future opportunities.
Prepare contingency and emergency plans.
Prepare to deal with possible outcomes
a) Facilitate financial planning i.e. determination of cash surplus or deficit that are likely to occur in
future.
b) Facilitate control of expenditure. This will minimize wastage of financial resources in order to
achieve financial targets.
Page 2 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
c) It avoids surprise to the managers’ e.g. any cash deficit is known well in advance thus the firm can
plan for sources of short term funds such as bank drafts or short term loans.
d) Motivation to the employees – Financial forecasting using budgets and targets will enhance unity
of purpose and objectives among employees who are determined to achieve the set target.
Page 3 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
2. Regression Analysis
This is a statistical method which involves identification of dependent and independent variable to form a
regression equation *y = a + bx) on which forecasting will be based.
i) Identify various balance sheet items that are directly with sales this items include:
a) Net fixed asset – If the current production capacity of the firm is full an increase in sales will require
acquisition of new assets e.g. machinery to increase production.
b) Current Asset – An increase in sales due to increased production will lead to increase in stock of
raw materials, finished goods and work in progress. Increased credit sales will increase debtors
while more cash will be required to buy more raw materials in cash.
c) Current liabilities – Increased sales will lead to purchase of more raw materials
d) Retained earnings – This will increase with sales if and only if, the firm is operating profitability
and all net profits are not paid out as dividend.
Note
The increase in sales does not require an increase in ordinary share capital, preference share capital and
debentures since long term capital is used to finance long term project.
ii) Express the various balance sheet items varying with sales as percentage of sales e.g. assume for
year 2002 stock and net fixed assets amount to Sh.12M and 18M respectively sales amount to
Sh.40M. Therefore stock as percentage of sales”
12M
Stock = x10030%
40M
Fixed asset = 18M x10045%
40M
iii) Determine the increase in total asset as a result of increase in sales e.g suppose sales increases from
Sh.40 M to Sh.60 M during year 2003. The additional stock and net fixed asset required would be
determined as follows:
Page 4 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
= 45 %( 60 – 40) = Sh.9 M
iv) Determine the total increase in assets which will be financed by:
Note
Generally Net profit margin is called after tax return on sales.
Out of the total assets that are required as a result of increase in sales, the financing will come from the
two sources identified. Any amount that cannot be met from the two sources will be borrowed
externally on short term basis which will be a current liability.
1. The firm is operating at full or 100% capacity. Therefore the increase in production will require
acquisition of new fixed assets.
2. The firm will not issue new ordinary shares or debenture or preference shares thus this capital will
remain constant during the forecasting period.
3. The relationship between balance sheet item and sales i.e balance sheet items as % of sales will be
maintained during forecasting period.
4. The after tax, profit on sale or net profit margin will be achieved and shall remain constant during
the forecasting period.
Page 5 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
Illustration
The following is the balance sheet of XYZ Ltd as at 31st December 2002:
Sh.’000’
Net fixed asset 300
Current assets 100
400
Financed by:
Ordinary share capital 100
Retained earnings 70
10% debentures 150
Trade creditors 50
Accrued expenses 30
400
Additional Information
1. The sales for year 2002 amounted to Sh.500,000. The sales will increase by 15% during year 2003
and 10% during year 2004.
2. The after tax return on sales is 12% which shall be maintained in future.
3. The company’s dividend payout ratio is 80%. This will be maintained during forecasting period.
4. Any additional financing from external sources will be affected through the issue of commercial
paper by company.
Required
a) Determine the amount of external finance for 2 years upto 31st December 2004.
b) Prepare a proforma balance as at 31 December 2004
Solution
Identify various items in balance sheet directly with sales:
• Fixed Asset
• Current Asset
• Trade creditors
• Accrued expenses
Page 6 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
d) Compute the amount of external requirement of the firm over the 2 years of forecasting period.
Interpretation
For the company to earn increase in sales of 132.5M it will have to acquire additional assets costing 106M.
Sh.’000’
Additional investment/asset required 106,000
Less: Spontaneous source of finance
Increase in creditors = % of sales x increase in sales
= 132,500 x 10% (13,250)
Increase in accrued expenses = % of sales x increase in sales
= 132,500 x 6% (7,950)
Less: Retained earnings during 2 years of operation (initial sources)
Net profit for 2003 = Net profit margin x sales of 2003
= 12% of 575,000 = 69,000
Less: Dividend payable 80% of 69,000 = 55,200
Net profit for 2004 = Net profit margin x sales of 2004 (13,800)
= 12% of 632,500 = 75,900
Less: dividend payable 80% of 75,900 = 60,720
(15,180)
External financial needs (commercial paper)
55,820
Page 7 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
This refers to the projected balance sheet at the end of forecasting period. The items in the proforma balance
which vary with sales would be determined in any of the following two ways:
Page 8 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
In general, there are four types of components in time series analysis: Seasonality, Trend, Cycling and
Irregularity (random)
General Form: Y = T * C * S ± ε,
Where
T = Trend - long term movement of mean
C = (Business) Cycle - an upturn or downturn not caused by seasonal variation; effect of the economy
S = Seasonal Variation - repetitive pattern observed over a specific time period
ε = Error (random variation)
Page 9 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
Naïve forecasts are the most cost-effective forecasting model, and provide a benchmark against which more
sophisticated models can be compared. This forecasting method is only suitable for time series data. Using
the naïve approach, forecasts are produced that are equal to the last observed value. This method works
quite well for economic and financial time series, which often have patterns that are difficult to reliably and
accurately predict. If the time series is believed to have seasonality, seasonal naïve approach may be more
appropriate where the forecasts are equal to the value from last season.
Features
i. Whatever happened last period will happen again this time
ii. The model is simple and flexible
iii. Provides a baseline to measure other models
iv. Attempts to capture seasonal factors at the expense of ignoring trend
Wallace Garden Supply
Forecasting
Storage Shed Sales
( Forecast Error Yt Ft )
Bias - The arithmetic sum of the errors
MAD - Mean Absolute Deviation
MAPE – Mean Absolute Percentage Error
Mean Square Error (MSE) - Similar to simple sample variance
Standard Error - Standard deviation of the sampling distribution (the square root of the MSE)
Bias, MAD, and MAPE – typically used for time series
T
Bias (forecast error) /T
T
MSE | forecast error |2 /T
t 1 t 1
T
(Yt Ft ) / T
T
t 1
(Y
t 1
t Ft ) 2 / T
T
MAPE 100 [|Yt Ft | / Yt ] / T
T T
MAD | forecast error | /T |Yt Ft | / T
t 1 t 1 t 1
Page 10 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
Example:
The table below represents company sales, calculate 3 and 6 monthly moving averages, for the
data
Months Sales
January 1200
February 1280
March 1310
April 1270
May 1190
June 1290
July 1410
August 1360
September 1430
October 1280
November 1410
December 1390
Solution.
These are calculated as follows
Jan + Feb + Mar 1200 +1280 +1310
April’s forecast = =
3 3
Feb + Mar + Apr 1280 +1310 +1270
May’s forecast = =
3 3
Page 11 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
And so on…
Jan + Feb + Mar + Apr + May + Jun 1200 +1280 +1310 +1270 +1190 +1290
July forecast = =
6 6
And so on…
3 months moving average 6 months moving average
April 1263
May 1287
June 1257
July 1250 1257
August 1297 1292
September 1353 1305
October 1400 1325
November 1357 1327
December 1373 1363
Note:
When plotting moving average on graphs the points are plotted as the midpoint of the period of the average,
e.g. in our example the forecast for April (1263) is plotted on mid Feb.
Characteristics of moving average
1) The more the number of periods in the moving average, the greater the smoothing effect.
2) Different moving averages produce different forecasts.
3) The more the randomness of data with underlying trend being constant then the more the
periods should be involved in the moving averages.
Page 12 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
Example
Actual
Month Sales 3-Month Weighted Moving Average
January 10
February 12
March 13
April 16 (3x13) + (2x12) + (1x10) /6 =12.17
May 19 (3x16) + (2x13) + (1x12) /6 =14.33
June 23 (3x19) + (2x16) + (1x13) /6 =17
July 26 (3x23) + (2x19) + (1x16) /6 =20.50
August 30 (3x26) + (2x23) + (1x19) /6 =23.83
September 28 (3x30) + (2x26) + (1x23) /6 =27.50
October 18 (3x28) + (2x30) + (1x26) /6 =28.33
November 16 (3x18) + (2x28) + (1x30) /6 =23.33
December 14 (3x16) + (2x18) + (1x28) /6 =18.67
Both simple and weighted moving averages are effective in smoothing out sudden fluctuations in the
demand pattern to provide stable estimates. Moving averages do, however, present three problems:
a) Increasing the size of n (the number of periods averaged) does smooth out fluctuations better, but
it makes the method less sensitive to real changes in the data.
b) Moving averages cannot pick up trends very well. Because they are averages, they will always stay
within past levels and will not predict changes to either higher or lower levels. That is, they lag the
actual values.
c) Moving averages require extensive records of past data.
3. Exponential smoothing
This is a weighted moving average technique, it is given by:
New forecast = Old forecast + (Latest Observation – Old forecast)
Where = Smoothing constant
This method involves automatic weighing of past data with weights that decrease exponentially with time.
Example
Using the previous example and smoothing constant 0.3 generate monthly forecasts
Months Sales Forecasts: = 0.3
January 1200
February 1280 1200
March 1310 1224
April 1270 1250
May 1190 1256
June 1290 1233
July 1410 1250
August 1360 1283
Page 13 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
Solution
Since there were no forecasts before January we take Jan to be the forecast for February.
Feb – 1200
For March;
March forecast = Feb forecast + 0.3 ( Feb sales – Feb forecast)
= 1200 + 0.3 (1280 – 1200)
=1224
Note:
The value lies between 0 and 1.
The higher the value, the more the forecast is sensitive to the current status.
For accurate forecasts these aspects are qualified separately (i.e. T, C, S and R) from data. This is known
as time decomposition or time series analysis
The separate elements are then combined to produce a forecast.
Page 14 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
Additive Model
Time series value = T +S +C +R
Where S, C and R are expressed in absolute value.
This model is best suited where the component factors are independent e.g. where the seasonal variation is
unaffected by trend.
Multiplicative Model:
Time series value = T × S× C × R
Where S, C and are expressed as percentage or proportions.
This model is best applied where characteristics interact e.g. where high trends increase seasonal variations.
Multiplicative model is more commonly used in practice.
Of the four elements of time series the most important are trend and seasonal variation. The following
illustration shows how the trend (T) and seasonal variation (S) are separated out from a time series and how
the calculated T and S values are used to prepare forecast. The process of separating out the trend and
seasonal variation is known as deseasonalising the data.
There are two approaches to this process: one is based on regression through the actual data points and the
other calculates the regression line through moving average trend points. The method using the actual data
is demonstrated first followed by the moving average method.
1. Time series analysis: trend and seasonal variation using regression on the data
The following data will be used to illustrate how the trend and seasonal variation are calculated.
Example 1
It will be apparent that there is a strong seasonal element in the above data (low in Quarter 1 and high in
Quarter 3) and there is a generally upward trend.
Step 1: Calculate the trend in the data using the least squares method.
Step 2: Estimate the sales for each quarter using the regression formula established in
step 1.
Page 15 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
Step 3: Calculate the percentage variation of each quarter’s actual sales from the
estimates, obtained in step 2.
Step 4: Average the percentage variations from step 3. This establishes the average
seasonal variations.
Solution
Step 1
Calculate the trend in the data by calculating the linear regression line y = a + bx.
x (quarters) x (sales) xy x2
1 20 20 1
2 32 64 4
Year 1 3 62 186 9
4 29 116 16
5 21 105 25
6 42 252 36
Year 2 7 75 525 49
8 31 248 64
9 23 207 81
10 39 390 100
Year 3 11 77 847 121
12 38 576 144
13 27 351 169
14 39 546 196
Year 4 15 92 1380 225
16 53 848 256
x=136 y= 710 xy= 6661 x2 =1496
y = an + bx
626 = 340b
Page 16 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
a = 28.74
Steps 2 and 3
Use the trend line to calculate the estimated sales for each quarter.
The actual value of sales is then expressed as a percentage of this estimate. For example, actual sales in the
first quarter were 20 so the seasonal variation is
Actual sales 20
% 65%
Estimate 30.58
5 21 37.94 55
6 42 39.78 106
Year 2 7 75 41.62 180
8 31 43.46 71
9 23 45.30 51
10 39 47.14 83
Year 3 11 77 48.98 157
12 48 50.82 94
13 27 52.66 51
14 39 54.50 72
Year 4 15 92 56.34 163
16 53 58.18 91
Step 4
Page 17 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
51 83 157 94
51 72 163 91
222 360 681 336
4= 56% 90% 170% 84%
These then are the average variations expected from the trend for each of the quarters; for example, on
average the first quarter of each year will be 56% of the value of the trend. Because the variations have
been averaged, the amounts over 100% (Q3 in this example). This can be checked by adding the average
and verifying that they total 400% thus:
On occasions, roundings in the calculations will make slight adjustments necessary to the average
variations.
Step 5
Prepare final forecasts based on the trend line estimates from “trend estimates and percentages variation
table” (i.e. 30.58, 32.42, etc) and the averaged seasonal variations from the table above. (i.e. 56%, 90%,
170% and 84%)
5 21 21.24
Year 2 6 42 35.80
7 75 70.75
8 31 36.51
9 23 25.37
Year 3 10 39 42.43
11 77 83.27
12 48 42.69
13 27 29.49
Year 4 14 39 49.05
15 92 95.78
16 53 48.87
Page 18 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
The forecasts are compared with the actual data to get some idea of how good extrapolated forecasts might
be. With further analysis they enable us to quantify the residual variations.
The absolute variations would have been averaged in the normal way to find the average absolute variation,
whether + or -, and these values would have been used to make the final seasonally adjusted forecasts.
Example 1 is reworked below using this method and, because there are many similarities to the earlier
method, only the key stages are shown.
Page 19 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
32 62 29
= 41, and so on
3
The regression line y = a + bx of the moving average values is calculated in the normal manner and results
in the following:
y = 33.06 + 1.32x
The percentage variations are averaged as previously shown, resulting in the following values:
Q1 Q2 Q3 Q4
Average seasonal variation % 54 89 170 86
The trend line and the average seasonal variations are then used in a similar manner to that
previously described.
For example, to extrapolate future sales for the next year (i.e. quarters 17, 18, 19 and 20) is as follows:
Page 20 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
Quarter 17
Forecast sales = (33.06 + 1.32(17)) × 0.54 = 29.97
Quarter 18 = 50.57
19 = 98.84
20 = 51.13
Forecast errors
Differences between actual results and predictions may arise from many reasons. They may arise from
random influences, normal sampling errors, choice of the wrong forecasting system or alpha value or simply
that the future conditions turn out to be radically different from the past. Whatever the cause(s) management
wish to know the extent of the forecast errors and various methods exist to calculate these errors.
A commonly used technique, appropriate to time series, is to calculate the mean squared error of the
deviations between forecast and actual values then choose the forecasting system and/or parameters which
gives the lowest value of mean squared errors, i.e. akin to the ‘least squares’ method of establishing a
regression line.
Page 21 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
(a) Coefficient of correlation denoted by r, this provides a measure of the strength of association
between two variables one the dependent variable the other the independent variable r can
range between +1 and – 1 for perfect positive correlation and perfect negative correlation
respectively with zero indicating no relation i.e. for perfect positive correlation y increase
linearly with x increments.
(b) Rank correlation coefficient denoted by R is used to measure association between two sets of
ranked or ordered data. R can also vary from +1, perfect positive rank correlation and -1
perfect negative rank correlation where O or any number near zero representing no
correlation.
SCATTER GRAPHS
- A scatter graph is a graph which comprises of points which have been plotted but are not
joined by line segments
- The pattern of the points will definitely reveal the types of relationship existing between
variables
- The following sketch graphs will greatly assist in the interpretation of scatter graphs.
Page 22 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
x
Independent variable
NB: For the above pattern, it is referred to as perfect because the points may easily be represented by
a single line graph e.g. when measuring relationship between volumes of sales and profits in a
company, the more the company sales the higher the profits.
10 20 Price X
This example considers volume of sale in relation to the price, the cheaper the goods the bigger the
sale.
Page 23 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
independent variable x
600 x x x x x
x x x
400 x x x x x
x x x x
200 x x x x x
x x x x
0
10 20 30 40 50 x
Spurious Correlations
- in some rare situations when plotting the data for x and y we may have a group showing either
positive correlation or –ve correlation but when you analyze the data for x and y in normal life
there may be no convincing evidence that there is such a relationship. This implies therefore
that the relationship only exists in theory and hence it is referred to as spurious or non-sense
e.g. when high pass rates of student show high relation with increased accidents.
Page 24 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
Correlation coefficient
- These are numerical measures of the correlations existing between the dependent and the
independent variables
- These are better measures of correlation than scatter graphs (diagrams)
- The range for correlation coefficients lies between +ve 1 and –ve 1. A correlation coefficient
of +1 implies that there is perfect positive correlation. A value of –ve shows that there is
perfect negative correlation. A value of 0 implies no correlation at all
- The following chart will be found useful in interpreting correlation coefficients
There are usually two types of correlation coefficients normally used namely;-
Note that this formula can be rearranged to have different outlooks but the resultant is always the
same.
Example
The following data was observed and it is required to establish if there exists a relationship between
the two.
X 15 24 25 30 35 40 45 65 70 75
Y 60 45 50 35 42 46 28 20 22 15
Page 25 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
Solution
Compute the product moment coefficient of correlation (r)
X Y X2 Y2 XY
15 60 225 3,600 900
24 45 576 2,025 1,080
25 50 625 2,500 1,250
30 35 900 1,225 1,050
35 42 1,225 1,764 1,470
40 46 1,600 2,116 1,840
45 28 2,025 784 1,260
65 20 4,225 400 1,300
70 22 4,900 484 1,540
75 15 5,625 225 1,125
X 424 Y 363 X 21,926 Y 2 15,123
2
XY 12,815
n xy x y
r=
n x 2 x n y 2 y
2 2
25, 762
= 0.93
39, 484 19, 461
The correlation coefficient thus indicates a strong negative linear association between the two
variables.
NOTE:
A high value of r (+0.9 or – 0.9) only shows a strong association between the two variables but
doesn’t imply that there is a causal relationship i.e. change in one variable causes change in the
other it is possible to find two variables which produce a high calculated r yet they don’t have a
causal relationship. This is known as spurious or nonsense correlation e.g. high pass rates in QT
in Kenya and increased inflation in Asian countries.
Also note that a low correlation coefficient doesn’t imply lack of relation between variables but
lack of linear relationship between the variables i.e. there could exist a curvilinear relation.
A further problem in interpretation arises from the fact that the r value here measures the
relationship between a single independent variable and dependent variable, where as a particular
variable may be dependent on several independent variables (e.g. crop yield may be dependent on
fertilizer used, soil exhaustion, soil acidity level, season of the year, type of seed etc.) in which case
multiple correlation should be used instead.
Page 26 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
Example
A group of 8 accountancy students are tested in Quantitative Techniques and Law II. Their rankings
in the two tests were.
Student Q. T. ranking Law II ranking d d2
A 2 3 -1 1
B 7 6 1 1
C 6 4 2 4
D 1 2 -1 1
E 4 5 -1 1
F 3 1 2 4
G 5 8 -3 9
H 8 7 1 1
d 22
2
= 0.74
Thus we conclude that there is a reasonable agreement between student’s performances in the two
types of tests.
NOTE: in this example, if we are given the actual marks then we find r. R varies between +1
and -1.
Tied Rankings
A slight adjustment to the formula is made if some students tie and have the same ranking the
adjustment is
t3 t
where t = number of tied rankings the adjusted formula becomes
12
R=1-
6 d
2 t 3 t
12
n n 1
2
Example
Assume that in our previous example student E & F achieved equal marks in Q. T. and were given
joint 3rd place.
Page 27 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
Solution
Student Q. T. ranking Law II ranking d d2
A 2 3 -1 1
B 7 6 1 1
C 6 4 2 4
D 1 2 -1 1
E 3½ 5 -1 ½ 2¼
F 3½ 1 2½ 6¼
G 5 8 -3 9
H 8 7 1 1
d 2 26 1 2
R = 1-
6 d
2 t 3 t
12
= 1-
6 26 1 2 212 2
3
since t 2
n n 1
2
8 8 1
2
= 0.68
NOTE: It is conventional to show the shared rankings as above, i.e. E, & F take up the 3 rd and
4 rank which are shared between the two as 3½ each.
th
Page 28 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
REQUIRED
Calculate the rank correlation coefficient and hence comment briefly on the value obtained
d d2
A 6 5 1 1
B 1 3 -2 4
C 3 4 -1 1
D 7 6 1 1
E 8 7 1 1
F 2 1 1 1
G 4 8 -4 16
H 5 2 3 9
J 10 9 +1 1
K 9 10 -1 1
Σd2 = 36
6 d 2
R=1-
n n 2 1
6 36
=1-
10 10 2 1
216
=1-
990
= 1 – 0.22
= 0.78
Comment: since the correlation is 0.78 it implies that there is high positive correlation between the
ranks awarded to the contestants. 0.78 > 0 and 0.78 > 0.5
Example (Rank Correlation Coefficient)
Sometimes numerical data which refers to the quantifiable variables may be given after which a rank
correlation coefficient may be worked out.
Is such a situation, the rank correlation coefficient will be determined after the given variables have
been converted into ranks. See the following example;
Page 29 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
Σd2 = 43.5
6 d 2
∴ Rank correlation r = 1-
n n 2 1
6 43.5 261
=1- =1–
10 10 1
2
990
= 0.74 (High positive correlation between mathematics
Marks and accounts)
Example
(Product moment correlation)
The following data was obtained during a social survey conducted in a given urban area regarding the
annual income of given families and the corresponding expenditures.
Workings:
4020 3550
X = = 402 Y 355
10 10
= 0.89
Comment: The value obtained 0.89 suggests that the correlation between annual income and annual
expenditure is high and positive. This implies that the more one earns the more one spends.
Page 30 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
- This is a concept, which refers to the changes which occur in the dependent variable as a result
of changes occurring on the independent variable.
- Knowledge of regression is particularly very useful in business statistics and forecasting where
it is necessary to consider the corresponding changes on dependant variables whenever
independent variables change
- It should be noted that most business activities involve a dependant variable and either one or
more independent variable. Therefore knowledge of regression will enable a business
statistician to predict or estimate the expenditure value of a dependant variable when given an
independent variable e.g. consider the above example for annual incomes and annual
expenditures. Using the regression techniques one can be able to determine the estimated
expenditure of a given family if the annual income is known and vice versa
- The general equation used in simple regression analysis is as follows
y = a + bx
Where y = Dependant variable
a= Interception y axis (constant)
b = Slope on the y axis
x = Independent variable
i. The determination of the regression equation such as given above is normally done by
using a technique known as “the method of least squares’.
x
The following sets of equations normally known as normal equation are used to determine the
equation of the above regression line when given a set of data.
Σy = an + bΣx
Σxy = aΣx + bΣx2
Where Σy = Sum of y values
Page 31 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
NB: The above regression line is normally used in one way only i.e. it is used to estimate the y values
when the x values are given.
Regression line of x on y i.e. x = a + by
- The fact that regression lines can only be used in one way leads to what is known as a
regression paradox
- This means that the regression lines are not ordinary mathematical line graphs which may be
used to estimate the x and y simultaneously
- Therefore one has to be careful when using regression lines as it becomes necessary to develop
an equation for x and y before doing the estimation.
The following example will illustrate how regression lines are used
Example
An investment company advertised the sale of pieces of land at different prices. The following table
shows the pieces of land their acreage and costs
Required
Determine the regression equations of
i. y on x and hence estimate the cost of a piece of land with 4.5 hectares
ii. Estimate the expected average if the piece of land costs £ 900,000
Σy = an + bΣxy
Σxy = a∑x + bΣx2
Page 32 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
intercept a =
y b x
n
n xy x y
Slope b =
n x 2 x
2
Example
The calculations for our sample size n = 10 are given below. The linear regression model is
y = a + bx
Table
Page 33 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
= 2.66
= 5.91
We now insert these values in the linear model giving
y = 5.91 + 2.66x
or
Delivery time (mins) = 5.91 + 2.66 (delivery distance in miles)
The slope of the regression line is the estimated number of minutes per mile needed for a delivery.
The intercept is the estimated time to prepare for the journey and to deliver the goods, that is the time
needed for each journey other than the actual traveling time.
It is sensible to use all the identified factors to predict department costs. Scatter diagram will not give
the relationship between the various factors and total costs. The linear model for multiple linear
regression if of the type; (which is the line of best fit).
y = α + b1x1 +b2x2 +………… + bnxn
We assume that errors or residuals are negligible.
Page 34 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
In order to choose between the models we examine the values of the multiple correlation coefficient
r and the standard deviation of the residuals α. A model which describes well the relationship between
y and x’s has multiple correlation coefficient r close to ±1 and the value of α which is small.
Example
Odino chemicals limited are aware that its power costs are semi variable cost and over the last six
months these costs have shown the following relationship with a standard measure of output.
Solution
a)
Output (x) Power costs (y) x2 y2 xy
12 6.2 144 38.44 74.40
18 8.0 324 64.00 144.00
19 8.6 361 73.96 163.40
20 10.4 400 108.16 208.00
24 10.2 576 104.04 244.80
30 12.4 900 153.76 372.00
Σx = 123 Σy = 55.8 Σx = 2705
2
Σy = 542.36 Σxy= 1,206.60
2
n xy x y
b=
n x 2 x
2
61206.612355.8
62705123
= 2
Page 35 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
376.2
= = 0.342
1101
1
a = (Σy – bΣx)
n
1
= (55.8 – 0.342) 123
6
= 2.29
(Power costs) = 2.29 + 0.342 (output)
b. For linear regression calculated above, the coefficient of correlation r is
r=
6 1206.6 123 55.8
6 2705 123 123 6 542.36 55.8 55.8
376.2
=
1101 140.52
= 0.96
This show a strong correlation between power cost and output. The multiple correlation when both
output and time are considered at the same time is 0.976.
We observe that there has been very little increase in r which means that inclusion of time variable
does not improve the correlation significantly
The value for time variable is only 0.60 which is insignificant as compared with a t value of 2.64 for
the output variable
In fact, if we work out correlation between output and time, there will be a high correlation. Hence
there is no necessity of taking both the variables. Inclusion of time does improve the correlation
coefficient but by a very small amount.
If we use the linear regression analysis and attempt to find the linear relationship between output and
time i.e.
Month Output
1 12
2 18
3 19
4 20
5 24
6 30
The value of b and a will turn out to be 3.11 and 9.6 i.e. relationship will be of the form
Output = 9.6 + 3.11 × month
For this equation forecast for 7th month will be
Output = 9.6 + 3.11 × 7
= 9.6 + 21.77
= 31.37 units
Page 36 of 37
HCOB 2411: Financial Modelling & Forecasting The Cooperative University
Page 37 of 37