0% found this document useful (0 votes)
43 views145 pages

CL 1.4 Quantitative Methods For Business

Uploaded by

nushath
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
43 views145 pages

CL 1.4 Quantitative Methods For Business

Uploaded by

nushath
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 145

CL 1.

4
QUANTITATIVE
METHODS FOR BUSINESS

CMA Professional Program


2023-2027
CMA SRI LANKA OFFICIAL STUDY TEXT
CMA Professional Programme 2023 – 2027
Certificate Level

CL 1.4:
Quantitative Methods for Business

Institute of Certified Management Accountants of Sri Lanka (CMA)


ISBN 978-955-0926-48-0

The Governing Council of CMA reserves the right to make any amendments itdeems necessary during the
period covered herein.

©Copyright reserved.
No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form or by
means, electronic, mechanical, photocopying, recording or otherwise without the prior written permission of
the Institute of Certified Management Accountants of Sri Lanka.

1st edition – August 2023

Published by:
Institute of Certified Management Accountants of Sri Lanka
29/24, Visakha Private Road,
Colombo 04, Sri Lanka.
A Word to the Students

This text book is designed as a guide to lead Certificate Level students in the study of CL 1.4
Quantitative Methods for Business (QMB) examination paper. It is carefully prepared to
cover the syllabus content, giving comprehensive explanations in each section including
exposure in answering questions.

Note that examination success will depend not only on your knowledge, but also on your
ability to present what you have learnt, in response to the given questions, within the
specified time period.

You may refer to the ‘Examination Guide’ published for each level and be familiar with the
CMA Examination Policies and Guidelines for Computer Base Examinations; Examination
paper structure, Hierarchy of Taxonomy (Actions verbs) and Pilot papers.

Make every effort to understand the subject and develop the skills to apply your knowledge.
Knowledge is the theoretical and practical understanding of a subject.

Application is the ability to use knowledge in a given relevant situation. This is the ability
to select the appropriate principles and/or techniques and apply them to relevant
information from a range of data.

We wish you success at the examination!

CMA Sri Lanka


Table of Contents

CHAPTER 1: BASIC MATHEMATICS FOR BUSINESS 5

CHAPTER 2: FREQUENCY DISTRIBUTIONS AND PRESENTATION OF DATA 42

CHAPTER 3: PROBABILITY AND SAMPLING THEORY 69

CHAPTER 4: RELATIONSHIPS AND FORECASTING TECHNIQUES 96

CHAPTER 5: INTRODUCTION TO FINANCIAL MATHEMATICS 129


CHAPTER 1 Page 5

Basic Mathematics for Business

Learning Outcomes

After successful completion of this chapter, you should be able to:

• Demonstrate the knowledge of set of numbers, laws of algebra, and their


applications such as simplifying and factoring
• Describe the properties of fractions and basic laws of exponents and radicals

• Demonstrate the graphical representation of the linear functions


• Demonstrate the ability to resolve problems of linear inequalities
• Demonstrate the ability to resolve problems of logarithmic and exponential functions
• Demonstrate the knowledge of progressions of arithmetic and geometric
• Describe the concept of the derivative and demonstrate basic rules for differentiation
• Describe the behaviour of a function using the first derivative
• Describe basic theories of indefinite integration

1.1 Classification of numbers

Classification of numbers is very important for identifying the solutions in various kinds of
applications of mathematics in various fields including business and economics. Therefore,
in this chapter we will provide a basic idea about the classification of numbers and some of
the properties of their operations.

The set of Natural Numbers denoted by 𝑁 is the set of counting numbers. The set of Natural
Numbersis also called positive integers.

𝑁 = {1, 2, 3, …}

The set of natural numbers and zero is called the set of Whole Numbers.

𝑊 = {0,1, 2, 3, …}
CHAPTER 1 Page 6
The natural numbers together with 0 and negative values, -1, -2, -3… form the set of integers
denotedby 𝑍.

𝑍 = {…, −3, −2, −1, 0, 1, 3, …}

The set of rational numbers consists of numbers such as 1/3,7/5, which can be written as
a ratio (quotient) of two integers. That is, a rational number is one that can be written as 𝑝⁄𝑞
, where 𝑝 and q are integers and 𝑞 ≠ 0. Thus, the set of rational numbers can be represented
as,
𝑝
𝑄 = {𝑥|𝑥 = , 𝑝, 𝑞 ∈ 𝑍 and 𝑞 ≠ 0}
𝑞

All rational numbers can be represented by terminating decimals such as 3/2 = 1.5 or by
non- terminating repeating decimals such as 4/11 = 0.363636… Numbers represented by
non-terminating and non-repeating decimals are called the irrational numbers. An
irrational number cannot be writtenas an integer divided by integer.

Examples

1 ) 𝜋 = the ratio of the circumference of a circle to its diameter. 𝜋 = 3.14159265 …


2 ) √2 = 1.4142
3) √3 = 1.73205

The union of the set of rational numbers and irrational numbers will form the set of real
numbers denoted by 𝑅.

1.2 Basics of Algebra

Algebra is a branch of mathematics that deals with mathematical symbols and the
manipulation of these symbols to represent relationships and solve equations. Here are
some basics of algebra:

• Variables: Algebra uses variables, represented by letters like x, y, or z, to represent


unknown quantities or values that can vary. Variables are used to express relationships
and solve equations.

• Expressions: Algebraic expressions are combinations of variables, constants (fixed


numbers), and mathematical operations such as addition, subtraction, multiplication, and
division. For example, 3x + 2y is an algebraic expression.
CHAPTER 1 Page 7
• Equations: Equations are statements that assert that two expressions are equal. Equations
typically involve an unknown variable that needs to be solved for. Solving an equation
means finding the value(s) of the variable that make the equation true. For example, x + 5
= 10 is an equation.

• Operations: Algebra involves performing operations on algebraic expressions and


equations. These operations include simplification, factoring, expanding, and solving for
variables.

• Linear Equation: Linear equation is an algebraic equation in which the highest power of
the variable is 1. The general form of a linear equation is 𝑎𝑥 + 𝑏 = 0, where 𝑎 and 𝑏 are
constant terms.

• Quadratic Equation: Quadratic equation is an equation in which the highest power of the
variable is 2. The general form of a quadratic equation is 𝑎𝑥 2 + 𝑏𝑥 + 𝑐 = 0, where 𝑎, 𝑏 and
𝑐 are constant terms.

• Systems of Equations: Systems of equations involve multiple equations with multiple


variables. The goal is to find values for the variables that satisfy all the equations
simultaneously.

• Graphing: Graphing is a visual representation of algebraic equations on a coordinate plane.


It allows for the visualisation of relationships between variables and helps in
understanding solutions.

Algebra provides a foundation for solving mathematical problems, modelling real-world


situations, and advancing to more complex mathematical concepts. It is widely used in
various fields, including science, engineering, economics, and computer science.

1.3 Indices

In mathematics, indices (also known as exponents or powers) are used to represent repeated
multiplication of a number or variable by itself. An index is a small number written placed to
the upper right to a base number. It indicates the number of times the base number is
multiplied by itself.

For example, in the expression 23 , the base number is 2, and the index (or exponent) is 3.
This means that 2 is multiplied by itself three times: 2 × 2 × 2 = 8.

1.3.1 Rules of indices


Indices follow certain rules and properties:
• Product Rule: When multiplying two numbers with the same base, add the indices. For
example, 𝑎2 × 𝑎3 = 𝑎5 .
CHAPTER 1 Page 8
• Quotient Rule: When dividing two numbers with the same base, subtract the indices. For
𝑎4
example, 𝑎2 = 𝑎4−2 = 𝑎2 .

• Power Rule: When raising a number with an index to another index, multiply the indices.
For example, (𝑎3 )2 = 𝑎3×2 = 𝑎6 .

• Zero Rule: Any non-zero number or variable raised to the power of zero equals 1. For
example, 𝑎0 = 1.

• Negative Rule: A negative index is equivalent to the reciprocal of the number or variable
1
raised to the positive index. For example, 𝑎−2 = 𝑎2 .

Illustration 1.1

• 23 × 24 = 23+4 = 27 = 128
23 1 1 1
• = = =
25 25−3 22 4
1
• 3 = √3
2

• √4 = ±2 There are two numbers of t such that 𝑡 2 = 4, namely 𝑡 = 2 and 𝑡 =


−2. The principal square root is 𝑡 = 2
3
• 42 = (√4)3 = 23 = 8

3
√27 = 3 since 33 = 27
• √−1 does not exist as a real number.

The rules of exponents have many uses in the simplifications of algebraic expression.

Illustration 1.2

(2𝑥 3 )(4𝑥 2 ) = (2 × 4)(𝑥 3 × 𝑥 2 ) = 8𝑥 5


2𝑥 3 1 3−2 1
= 𝑥 = 𝑥
4𝑥 2 2 2
1
(𝑦 −2 )4 = 𝑦 −8 = 8
𝑦
1 1 1
(5𝑥 3 )−2 = 5−2 𝑥 −6 = 2 × 6 =
5 𝑥 25𝑥 6

1.4 Linear Equations

An equation which can be written in the form 𝑎𝑥 + 𝑏 = 𝑐 (or ax + b∗ = 0, b∗ = 𝑏 − 𝑐 ) ,


where 𝑎, 𝑏 and 𝑐 are real numbers and the letter 𝑥 denotes the variable of the linear equation.
CHAPTER 1 Page 9
Illustration 1.3

1) 3𝑥 − 9 = 12 , Find 𝑥 2) 2𝑥 + 3 = 5𝑥 − 4 , Find 𝑥
3𝑥 = 12 + 9 2𝑥 − 5𝑥 = −4 − 3
3𝑥 = 21 −3𝑥 = −7
𝑥=7 𝑥 = 7⁄3

1.4.1 Simultaneous Linear Equations

When two or more equations are satisfied by the same values of the unknowns, they are
called simultaneous equations. We can solve two simultaneous linear equations by the
method of elimination or the method of substitution.

Illustration 1.4
3𝑥 + 7𝑦 = 27 → (1)
5𝑥 + 2𝑦 = 16 → (2)

To eliminate 𝑥 we multiply (1) by 5 and (2) by 3 so as to make the coefficients of 𝑥 in


both equations equal.

(1) × 5 → 15𝑥 + 35𝑦 = 135 → (3)


(2) × 3 → 15𝑥 + 6𝑦 = 48 → (4)

Subtracting (4) from (3) , we have

29𝑦 = 87
∴𝑦=3

To find 𝑥 substitute the value of 𝑦 = 3 into either equations (1) or (2).

Thus from (1), 3𝑥 + 21 = 27


𝑥=2
Therefore, the complete solution is 𝑥 = 2, 𝑦 = 3
Solve by substitution.

2𝑥 − 5𝑦 = 1 → (1)
7𝑥 + 3𝑦 = 24 → (2)
From (1),
1+5𝑦
𝑥= 2
Now we eliminate 𝑥 in (2) by its value obtained from (1). Then we have
7(1+5𝑦)
+ 3𝑦 = 24
2

∴ 7 + 35𝑦 + 6𝑦 = 48
∴ 41𝑦 = 48 − 7 = 41
𝑦=1
1+5(1)
𝑥 = 2 = 6⁄2 ∴ 𝑥 = 3
Therefore, the complete solution is 𝑥 = 3, 𝑦 − 1
CHAPTER 1 Page 10
1.4.2 Linear inequity

A linear inequality in one variable is an inequality that can be written as 𝑎𝑥 + 𝑏 < 𝑐, 𝑎𝑥 + 𝑏 ≤


𝑐, 𝑎𝑥 + 𝑏 > 𝑐, 𝑎𝑥 + 𝑏 ≥ 𝑐 where 𝑎, 𝑏, and 𝑐 are real numbers and 𝑎 ≠ 0.

Important rules of Inequalities

For any real number 𝑎, 𝑏, and 𝑐:

• If 𝑎 < 𝑏 and 𝑐 > 0, then 𝑎𝑐 < 𝑏𝑐 and 𝑎/𝑐 < 𝑏/𝑐.


• If 𝑎 < 𝑏 and 𝑐 < 0, then 𝑎𝑐 > 𝑏𝑐 and 𝑎/𝑐 > 𝑏/𝑐.

Illustration 1.5

1) 2𝑥 + 3 < 5 2) 5 ≥ 7𝑥 − 9
2𝑥 < 5 − 3 −7𝑥 ≥ −9 − 5
𝑥<1 𝑥 ≤ (−14)⁄(−7)
𝑥≤2

1.5 Quadratic Equations

A quadratic equation is an equation of the form of 𝑎𝑥2 + 𝑏𝑥 + 𝑐 = 0, where 𝑎, 𝑏 and 𝑐 are the
real numbers. Quadratic equations can often be solved by factoring the quadratic polynomial
and setting each factor equal to zero.

Illustration 1.6

1) 𝑥2 − 4 = 0 2) 𝑥2 + 2𝑥 − 3 = 0
(𝑥 − 2) (𝑥 + 2) = 0 𝑥2 + 3𝑥 − 𝑥 − 3 = 0
Hence, either 𝑥 − 2 = 0 or 𝑥 + 2 = 0 𝑥 (𝑥 + 3) − 1(𝑥 + 3) = 0
∴ 𝑥 = 2 or 𝑥 = −2 (𝑥 + 3) (𝑥 − 1) = 0
Hence,
either 𝑥 + 3 = 0 or 𝑥 − 1 = 0
∴ 𝑥 = −3 or 𝑥 = 1
3) 𝑥2 − 6𝑥 + 9 = 0 4) 𝑥2 + 7 = 0
𝑥2 − 3𝑥 − 3𝑥 + 9 = 0 𝑥2 = −7
𝑥 (𝑥 − 3) − 3(𝑥 − 3) = 0 The equation does not have real

(𝑥 – 3) (𝑥 − 3) = 0 numbered solutions.

∴𝑥−3=0
𝑥=3
CHAPTER 1 Page 11
Not all quadratic equations have real number solutions, and even those that do have are often
difficult to solve by factoring. It is possible to tell whether or not a quadratic equation has
real number of solutions, and to find them when they exist, by means of the quadratic
formula.

1.5.1 Quadratic Formula


The solutions of the quadratic equation 𝑎𝑥 2 + 𝑏𝑥 + 𝑐 = 0 are given by the quadratic formula.
−𝑏 ± √𝑏 2 − 4𝑎𝑐
𝑥=
2𝑎

The 𝑏 2 − 4𝑎𝑐 is called the discriminant of the quadratic equation.

• If 𝑏 2 − 4𝑎𝑐 > 0, the equation has two distinct real number solutions.

• If 𝑏 2 − 4𝑎𝑐 = 0, the equation has one real number solution.

• If 𝑏 2 − 4𝑎𝑐 < 0, the equation has no real number solutions.

Illustration 1.7

1) 𝑥 2 − 5𝑥 + 6 = 0 2) 𝑥 2 + 8𝑥 + 16 = 0
We see that a =1, b= -5, and c=6. We see that a =1, b= 8, and c=16.
−(−5)±√(−5)2 −4(1)(6) −(8)±√(8)2 −4(1)(16)
Hence, 𝑥 = Hence, 𝑥 =
2(1) 2(1)

−8+
− √64 − 64 −8 ± 0
5+ − 24 5±1 𝑥= =
− √25 2 2
𝑥= =
2 2
∴ 𝑥 = −4
∴ 𝑥 = 3 or 𝑥 = 2
3) 2𝑥 2 + 5𝑥 − 1 = 0 4) 𝑥 2 + 4𝑥 + 5 = 0
We see that a =2, b= 5, and c=-1. We see that a =1, b= 4, and c = 5.
−(5)±√(5)2 −4(2)(−1) Here,
Hence, 𝑥 =
2(−1)
𝑏 2 − 4𝑎𝑐 = 42 − 4(1)(5) = −4 < 0
−5+
− √25 +8 −5 ± √33
𝑥= = Therefore, this equation does not have
4 4
real number solution.
−5 ± 5.745
𝑥=
4
∴ x = 0.186 or x = −2.686
CHAPTER 1 Page 12
1.6 Graphing Functions

1.6.1 Graphing Linear Functions

A function having defining equation of the form 𝑦 = 𝑚𝑥 + 𝑏, where 𝑚 and 𝑏 are real numbers,
is called a linear function. The graph of a linear function is a straight line. Since two points
determine a straight line, we need only to find two points that lie on the line.
If the graph of a function intersects the 𝑦 - axis at the point (0, 𝑏), we say that 𝑏 is the 𝒚 –
intercept, if the graph intersects the 𝑥 - axis at the point (𝑎, 0), we say that a is the 𝒙 -
intercept. If the equation of a straight line is 𝑦 = 𝑚𝑥 + 𝑏, we say that 𝑚 is the slope of the
line.

Note: Two lines are parallel if they have the same slope.

Illustration 1.8

1) Find the slope and intercepts of


the line.
4𝑥 + 3 𝑦 = 1 2 .
Convert the equation into 𝑦 = 𝑚𝑥 + 𝑐 form.
Then we have,
4 4
𝑦 = −3𝑥 + 4 ∴ 𝑚 = −3

If 𝑥 = 0, 𝑦 = 4 then 𝑦 - intercept = 4.

If 𝑦 = 0, 𝑥 = 3 then 𝑥 - intercept = 3.

2) Find the slope and intercepts of the y


(0,5)
line −5𝑥 + 2𝑦 = 10.
−5𝑥 + 2𝑦 = 10
5 (-2,0) X
𝑦 =− 𝑥+2
2
If 𝑥 = 0, 𝑦 = 5 then 𝑦 - intercept= 5
If 𝑦 = 0, 𝑥 = −2 then x - intercept = −2

1.6.2 Graphing a Quadratic Functions

A quadratic function has the equation of the form 𝑦 = 𝑎𝑥 2 + 𝑏𝑥 + 𝑐 where 𝑎, 𝑏 and 𝑐 are
real numbers and the graph of a quadratic function is a parabola. The parabola has a single
vertex, that is, a lowest or highest point of the parabola. Each parabola is symmetric with
respect to a vertical line through its vertex.
CHAPTER 1 Page 13
The graph of the quadratic function 𝑦 = 𝑎𝑥2 + 𝑏𝑥 + 𝑐,

I. is concave up if 𝑎 > 0 and concave down if 𝑎 < 0.


𝑏 𝑏2
II. has its vertex at (− 2𝑎 , 𝑐 − 4𝑎)
III. has 𝑦 – intercepts at (0, 𝑐)
IV. has 𝑥 – intercepts, if the parabola intersects the x-axis at the solutions of the quadratic
equation 𝑎𝑥2 + 𝑏𝑥 + 𝑐 = 0

Illustration 1.9

1) 𝑦 = −𝑥 2 + 4𝑥 + 12

This is a parabola that is concave down since 𝑎 = −1 < 0.


Here, 𝑎 = −1, 𝑏 = 4, and 𝑐 = 12, we have

−𝑏 −4 𝑏2 42
= = 2 𝑎𝑛𝑑 𝑐 − = 12 − = 16
2𝑎 2(−1) 4𝑎 4(−1)

Hence the parabola has its vertex at (2,16). The 𝑦 − 𝑖𝑛𝑡𝑒𝑟𝑐𝑒𝑝𝑡 is (0,12).
To find 𝑥 − 𝑖𝑛𝑡𝑒𝑟𝑐𝑒𝑝𝑡𝑠 we solve −𝑥 2 + 4𝑥 + 12 = 0.
Thus, we have (𝑥 − 6)(𝑥 + 2) = 0;

∴ 𝑥 = 6 or 𝑥 = −2.
Hence, 𝑥 − 𝑖𝑛𝑡𝑒𝑟𝑐𝑒𝑝𝑡𝑠 are (6,0) and (-2,0).

2) 𝑦 = 4𝑥 2 − 20𝑥 + 25

Here, 𝑎 = 4, 𝑏 = −20, and 𝑐 = 25. The parabola is concave up since 𝑎 = 4 > 0, has its
vertex at
−𝑏 𝑏2 5
( 2𝑎 , 𝑐 − ) = (2 , 0) and has y – intercept at (0,25). Using quadratic formula to solve
4𝑎
the equation 4𝑥 2 − 20𝑥 + 20 = 0, we have,

−(−20) ± √(−20)2 − 4(4)(25) 5


𝑥= =
2(4) 2
CHAPTER 1 Page 14
5
The only x – intercept is (2 , 0). We need another point to draw the graph. Arbitrarily
choosing 𝑥 = 4 and substituting this value for x in the equation 𝑦 = 4𝑥 2 − 20𝑥 + 25,
we obtain 𝑦 = 9. Thus, the point (4,9) lies on the parabola.

3) 𝑦 = −3𝑥 2 + 6𝑥 − 4

The parabola is concave down, has its vertex at (1, −1) and has y-intercept at (0,4).
Using the quadratic formula to solve the equation−3𝑥 2 + 6𝑥 − 4 = 0, we obtain,

−6 ± √62 − 4(−3)(−4) −6 ± √−12


𝑥= =
2(−3) −6

Thus, the equation −3𝑥 2 + 6𝑥 − 4 = 0 does not have real number solutions because
the discriminant 𝑏 2 − 4𝑎𝑐 = −12 < 0. Therefore, the parabola does not have x –
intercepts. Choosing 𝑥 = 2 arbitrarily we find 𝑦 = −4. Hence the point (2,4) lies on
the parabola.
CHAPTER 1 Page 15
1.7 Exponential and Logarithmic Functions

1.7.1 Exponential Function with Base 𝒃

Let 𝑏 be any positive real number except 1. A function whose defining equation is of
the form 𝑦 = 𝑐𝑏𝑟𝑥, where 𝑐 and 𝑟 are real numbers with 𝑐 ≠ 0 and 𝑟 ≠ 0 is called an
exponential function with base 𝑏.

Illustration 1.10
𝑦 = 2𝑥

𝑥 -3 -2 -1 0 1 2 3
𝑦 1/8 1/4 1/2 1 2 4 8

The important properties of the exponential function 𝑦 = 2𝑥 are

1. 𝑦 is positive for all real numbers 𝑥.


2. As 𝑥 increases, 𝑦 increases greater and at a greater rate.
3. As 𝑥 decreases, 𝑦 decreases getting closer and closer to 0.
4. The 𝑦 − 𝑖𝑛𝑡𝑒𝑟𝑐𝑒𝑝𝑡 is the point (0,1).

2) 𝑦 = 2−𝑥

𝑥 -2 -1 0 1 2
𝑦 4 2 1 1/2 1/4
CHAPTER 1 Page 16

Base “e” Exponential Functions


A special number, which is denoted by 𝑒, is very important in the study of exponential
functions. The number 𝑒 is an irrational number and its value to five significant digits is given
by 𝑒 = 2.7183 …
Any function whose defining equation is of the form 𝑦 = 𝑐𝑒 𝑟𝑥
Where 𝑐 and 𝑟 are non-zero real numbers, is thus an exponential function with base 𝑒.

Using either a calculator or a table, we find that 𝑒 −2 = 0.1353, 𝑒 −1 = 0.3679, 𝑒 0.5 =


1.6487, 𝑒 2 = 7.3891, etc. The graph of 𝑦 = 𝑒 𝑥 is shown in the figure below.

a) Exponential Growth and Decline

Suppose a company's sales were Rs. 100,000 in 2000 and have been doubling every 5 years.
This is a situation of exponential growth. If the sales have been becoming one-half every 5
years, the situation is an exponential decline. Exponential growth functions are of the form
𝑦 = 𝑐𝑒 𝑟𝑥 and exponential decline functions are of the form 𝑦 = 𝑐𝑒 −𝑟𝑥 where 𝑐 > 0 and 𝑟 >
0.
CHAPTER 1 Page 17

Illustration 1.11

If inflation persists, the purchasing power of the rupee declines exponentially. Let y
denote the purchasing power of Rs. 1, in today's terms, 𝑡 years from now, and suppose
that inflation persists at an annual rate of 15%.

(i) Determine the exponential decline function.

Let y=𝑐𝑒 −𝑟𝑥 be the exponential decline function. Since 𝑦 = 𝑅𝑠. 1/− when 𝑡 = 0 years,
we have 1 = 𝑐𝑒 0 ∴ 𝑐 = 1.

Thus, 𝑦 = 𝑒 −𝑟𝑡
When t = 1 year, the rupee will be worth 15% less than it is now, that is Rs.0.85.
∴ 0.85 = 𝑒 −𝑟 , Solving for r, we have 𝑟 = 0.16 (logarithma transformation can be used
to solve, 𝑟 = −𝑙𝑛(0.85)
∴ 𝑦 = 𝑒 −0.16𝑡 .
(ii) Find the purchasing power of the rupee after 9.3 years.
When t = 9.3, we have 𝑦 = 𝑒 −0.16(9.3) = 0.23.
∴ Purchasing power declines by 77% in 9.3 years.

1.7.2 Logarithms

If 𝑏 is a real number and suppose 𝑏 > 1 such that 𝑥 = 𝑏𝑦. The real number 𝑦 is called the
logarithm to the base 𝑏 of 𝑥 and is denoted by 𝒚 = 𝒍𝒐𝒈𝒃 𝒙 with 𝑥 > 0.

1) log 2 8 = 3 because 23 = 8
1 1
2) log 3 27 = -3 because 3−3 = 27
3) log10 10 = 1 because 101 = 10
4) log e 1 = 1 because 𝑒 0 = 1
CHAPTER 1 Page 18
a) Rules of Logarithms

Let 𝑏 be real number such that 𝑏 > 1, and 𝐴 and 𝐵 denote positive numbers. Then

1. 𝑙𝑜𝑔𝑏 𝐴𝐵 = 𝑙𝑜𝑔𝑏 𝐴 + 𝑙𝑜𝑔𝑏 𝐵


𝐴
2. 𝑙𝑜𝑔𝑏 𝐵 = 𝑙𝑜𝑔𝑏 𝐴 − 𝑙𝑜𝑔𝑏 𝐵
3. 𝑙𝑜𝑔𝑏 𝐴𝑟 = 𝑟𝑙𝑜𝑔𝑏 𝐴 for any real number 𝑟

Note:
The logarithm to the base 10 is called the common logarithm and is written 𝑙𝑜𝑔. The
logarithm to the base 𝑒 is called the natural logarithm and is written 𝑙𝑛. Common and
natural logarithms can be evaluated using a calculator or a table.
For example, 𝑙𝑜𝑔 2 = 0.3010 𝑙𝑛 2 = 0.6391

Illustration 1.12

Given that log 2 = 0.3010, log 3 = 0.4771, and log 5 = 0.6990 use the rules of logarithm
to find the following.

1) 𝑙𝑜𝑔 6 = 𝑙𝑜𝑔(2 × 3) = 𝑙𝑜𝑔 2 + 𝑙𝑜𝑔 3 = 0.3010 + 0.4771 = 0.7781

2) 𝑙𝑜𝑔 8 = 𝑙𝑜𝑔(23 ) = 3𝑙𝑜𝑔 2 = 3(0.3010) = 0.9030

3) 𝑙𝑜𝑔 45 = 𝑙𝑜𝑔(32 × 5) = 2𝑙𝑜𝑔 3 + 𝑙𝑜𝑔 5 = 2(0.4771) + 0.6990 = 1.6532

2
4) 𝑙𝑜𝑔 (9) 𝑙𝑜𝑔 2 − 𝑙𝑜𝑔 9 = 𝑙𝑜𝑔 2 − 2𝑙𝑜𝑔3 = 0.3010 − 2(0.4771) = −0.6532

Solve the equations for x,


5) 2𝑥+1=5
Taking logarithm to base 10,
𝑙𝑜𝑔(2𝑥+1 ) = log 5
(𝑥 + 1)𝑙𝑜𝑔2 = 𝑙𝑜𝑔5
𝑙𝑜𝑔5
𝑥+1= = 2.3219
𝑙𝑜𝑔2
𝑥 = 1.3219

b) Change the Base Formula

Logarithm to a number to any base other than 10 or 𝑒, can be calculated using the formula,

𝑙𝑜𝑔 𝑥
𝑙𝑜𝑔𝑏 𝑥= 𝑙𝑜𝑔𝑎𝑏 where a > 1, b > 1 and x>0
𝑎
CHAPTER 1 Page 19
in particular,
log 𝑥 ln 𝑥
𝑙𝑜𝑔𝑏 𝑥 = =
log 𝑏 ln 𝑏

Example
log 5 ln 5
𝑙𝑜𝑔3 5 = = = 1.46497
log 3 ln 3

1.7.3 Graphs of Logarithmic Functions

Sketch the graph of 𝑦 = log2 𝑥

𝑦 = 𝑙𝑜𝑔2 𝑥 => 𝑥 = 2𝑦

𝑦 -3 -2 -1 0 1 2 3
𝑥 1/8 1/4 1/2 1 2 4 8

Note the following important properties of the graph 𝑦 = 𝑙𝑜𝑔2 𝑥.

1. 𝑥 > 0 for all 𝑦


2. 𝑦 < 0 for 0 < 𝑥 < 1
3. 𝑦 > 0 for 𝑥 > 1
4. 𝑦 = 0 when 𝑥 = 1
5. As 𝑥 increases, 𝑦 increases, but at slower rate.

1.8 Arithmetic Progressions

Arithmetic progression (AP) is a sequence of numbers in which the difference between any
two consecutive terms is constant. This constant value is known as the common difference
and denoted by “𝑑”. In an AP, the common difference ensures a consistent increment or
decrement between consecutive terms, leading to a predictable pattern.

The standard from of an AP is,

𝑎, (𝑎 + 𝑑), (𝑎 + 2𝑑), (𝑎 + 3𝑑), …


CHAPTER 1 Page 20
in which the first term is a.

First term = 𝑇1 = 𝑎 + (1 − 1)𝑑


Second term = 𝑇2 = 𝑎 + (2 − 1)𝑑
Third term = 𝑇3 = 𝑎 + (3 − 1)𝑑

From these results, it can be seen that the 𝑛𝑡ℎ term is,

𝑇𝑛 = 𝑎 + (𝑛 − 1)𝑑

Examples:

1) 14, 11, 8.... a=14, d = -3

2) -15, -11, -7, -3.... a=-15, d = 4

3) 10, 5, 0, -5, ... a=10, d = -5

4) -22, -25, -28, -31, ... a=-22, d=-3

Illustration 1.13

Find the 18th term of the arithmetic progression whose first term is 15 and the common
difference is 5.

Answer.

a = 15, d = 5, n = 18  T18 =?

Substituting in, Tn = a + (n -1) d

T18 = 15 + (18 -1) (5)

T25 = 15 + 85

T25 = 100.

Illustration 1.14

Find the first term of an AP if the 15th term is 35 and the common differences – 3.

Answer:

Tn = a + (n-1) d

Substituting in, Tn = 35, n = 15 and d = -3

35 = a + (15-1) (-3)

35 = a - 42

a = 77
CHAPTER 1 Page 21
Illustration 1.15

In an arithmetic progression with a first term of -81 and a common difference of 3,


what is the position of 22?
Answer:

Tn = a + (n-1) d

Substituting in, Tn =22, a = -81 and d = 3

22 = -81 + (n - 1) (3)

22 = -81 + 3n - 3

n = 32

Illustration 1.16

The first term of an arithmetic progression is 52 and its 30th terms is - 35, find the
common difference of the progression.

Tn = a + (n - 1) d

Substituting, a = 52, n = 30, Tn = T30 = -35

- 35 = 52 + (30-1) d

- 35 - 52 = 29 d

d = -3

Illustration 1.17

Introduce a series whose nth term is 3n - 2.

Answer.

𝑇𝑛 = 3𝑛 − 2

Substituting, n = 1 T1 = 3 (1) - 2 = 1

Substituting, n = 2 T2 = 3 (2) -2 = 4

Substituting, n = 3 T3 = 3 (3) - 2 = 7

T2 - T1 = 4 -1 = 3

T3 - T2 = 7 - 4 = 3

Hence, the above series is an arithmetic progression whose first term is 1 and the value
of common difference is 3.
CHAPTER 1 Page 22
Illustration 1.18
The sum of 4th and 10th terms of an arithmetic progression is 20 and the difference of
20th and 10th terms is 50. Find the first term, common difference, and the value of the
40th term.

Answer.

T4 + T10 = 20  a + 3d + a + 9d = 20

2 a + 12 d = 20

a + 6d = 10

(1)

T20 – T10 = 50  a + 19d - (a+9d) = 50

 10d = 50 d=5

Substituting in, (1) a + 6 (5) = 10

a = - 20

 First term = -20


Common difference = 5

T40 = a + 39d = -20 + 39 (5) = 175

Illustration 1.19

The terms -8, x2, and 17x are the first three terms of an arithmetic progression, find the
value of 'x’.

x2 - (-8) = 17x - x 2

( Then a common difference should be)

x2 + 8 = 17 x - x2

2x2 - 17x + 8 = 0

2x2 - 17x+ 8 = 0

(2x- 1) (x-8) = 0
1
𝑥= 𝑜𝑟 𝑥 = 8
2
CHAPTER 1 Page 23
1.8.1 Sum of the First n Terms of an Arithmetic Progression
Let 𝑆𝑛 denote the sum of n terms of an arithmetic progression, and it can be shown that,
𝑛
𝑆𝑛 = 2 (𝑎 + 𝑙).

This formula can be used to compute the sum of AP if the last term and the first term of AP
are known.

If the last term of AP is unknown but the first term and the common difference are known,
the formula below can be used to compute the sum of first n terms.
𝑛
𝑆𝑛 = {2𝑎 + (𝑛 − 1)𝑑}
2
Illustration 1.20

An arithmetic progression whose first terms is 4 and the last term is 124 and it has 13
numbers of terms. Find the sum of this series.

Answer
a = 4, l = 124, n = 13
𝑛
Substituting in, 𝑆𝑛 = 2 (𝑎 + 𝑙).
13
𝑆13 = (4 + 124).
2

𝑆13 = 832.

Illustration 1.21

Find the sum of the first 25 terms of the AP if 𝑎 = −1 𝑎𝑛𝑑 𝑑 = 10.


𝑛
Substituting in, 𝑆𝑛 = 2 {2𝑎 + (𝑛 − 1)𝑑}

25
𝑆25 = {2(−1) + (25 − 1)(10)}
2
𝑆25 = 2,975
CHAPTER 1 Page 24
Illustration 1.22

In the arithmetic progression 48, 45, 42, ..., what is the minimum number of terms
that need to be taken to achieve a sum of 405?

Answer.
a = 48, d = -3, Sn = 405, n =?
𝑛
𝑆𝑛 = {2𝑎 + (𝑛 − 1)𝑑}
2
𝑛
405 = {2(48) + (𝑛 − 1)(−3)}
2
810 = 𝑛(99 − 3𝑛)

3𝑛2 − 99𝑛 + 810 = 0

𝑛2 − 3𝑛 + 270 = 0
(𝑛 − 18)(𝑛 − 15) = 0
𝑛 = 18 𝑜𝑟 𝑛 = 15

1.9 Geometric Progression

The geometric progression (GP) is a sequence of terms that increase or decrease in a


constant ratio. This constant ratio is known as the common ratio of the sequence and is
usually denoted by 𝑟.

The standard form of a GP is 𝑎, 𝑎𝑟, 𝑎𝑟2, 𝑎𝑟3, …

In which the first term is 𝑎 and the common ratio is 𝑟,


the first term, 𝑇1 = 𝑎𝑟 1−1 = 𝑎
the second term, 𝑇2 = 𝑎𝑟 2−1 = 𝑎𝑟
the third term, 𝑇3 = 𝑎𝑟 3−1 = 𝑎𝑟 2.

Therefore, in general, the nth term of a geometric progression can be represented as

𝑇𝑛 = 𝑎𝑟 𝑛−1

Examples:

3, 6, 12, 24, … 𝑎 = 3, 𝑟 = 2

-2, -6, -18, -54, … 𝑎 = −2, 𝑟 = 3


1
16, -8, 4, -2, … 𝑎 = 16, 𝑟 = − 2
1 1 1 1 1
− 16 , 8 , − 4 , 2 , … 𝑎 = − 16 , 𝑟 = −2
CHAPTER 1 Page 25
Illustration 1.23

Find the 7th term of the geometric progression of 2,6,18, …

Answer:
6
𝑎 = 2, 𝑟 = 2 = 3, 𝑛 = 7 ⇒ 𝑇7 =?

Substitution, 𝑇𝑛 = 𝑎𝑟 𝑛−1

𝑇7 = 𝑎𝑟 6 = 2(3)6 = 1,458

Illustration 1.24

The first term of a geometric progression is 40. Fourth term is −5. Find the 11th term
and nth term.

Answer:

𝑎 = 40, 𝑇4 = −5 ⇒ 𝑇11 =? And 𝑇𝑛

𝑇4 = −5

𝑎𝑟 3 = −5
1 1
40𝑟 3 = −5 ⇒ 𝑟 3 = − ⇒ 𝑟 = −
8 2
1 10 1 5
𝑇11 = 𝑎𝑟 10 = 40 (− ) = 40 × =
2 1024 128
1 𝑛−1
𝑇𝑛 = 𝑎𝑟 𝑛−1 𝑇𝑛 = 40 × (− )
2
CHAPTER 1 Page 26
Illustration 1.25

The second term of a geometric progression which has a positive common ratio is 4 and
4th term is 8. In this series obtain

(a) Value of the common ratio


(b) Value of the first term
(c) Value of the 10th term
Answer:

(a) Let’s consider for the first term is 𝑎, the common ratio is 𝑟 of a geometric progression
which fulfills the given requirements.
𝑇2 = 4 ⇒ 𝑎𝑟 1 = 4 (1)
3
𝑇4 = 8 ⇒ 𝑎𝑟 = 8 (2)

𝑎𝑟 3 8
(2) ÷ (1) = 4 ⇒ 𝑟2 = 2
𝑎𝑟 1

𝑟 = ±2

But 𝑟 should be positive 𝑟 = √2


We can consider common ratio is √2

(b) Substituting 𝑟 = √2 of (1)


4 2 × √2 × √2
𝑎(√2) = 4 ⇒ 𝑎 = = = 𝑎 = 2√2
√2 √2
(c) 𝑇10 = 𝑎𝑟 9
9 10
𝑇10 = 2√2(√2) = 2 × (√2) = 2 × 25
∴ 𝑇10 = 64

Illustration 1.26

3, 𝑥 and (𝑥 + 6) are first three terms of a geometric progression with positive of all
terms.

(a) What is value taken for 𝑥?


(b) What is the value of 10th term of this geometric progression?

Answer:

(a) 3, 𝑥, (𝑥 + 6), …
𝑥 (𝑥+6)
Since this geometric progression, the common ratios, 3 and should be the same,
𝑥

𝑥 (𝑥 + 6)
∴ =
3 𝑥
CHAPTER 1 Page 27
𝑥 2 = 3(𝑥 + 6)

𝑥 2 − 3𝑥 − 18 = 0
(𝑥 − 6)(𝑥 + 3) = 0

𝑥 = 6 𝑜𝑟 𝑥 = −3

But, when 𝑥 = −3, all terms are not positive. Hence, 𝑥 = 6.

(b) When 𝑥 = 6, the geometric progression becomes 3, 6, 12, ......


6
Here, 𝑎 = 3 and 𝑟 = = 2
3
9
𝑇10 = 𝑎𝑟
𝑇10 = 3(2)9 = 1536

Illustration 1.27

Dulanjith put a sum of A on an investment cam an interest at 4% per annum. The value
of investment alter 5 years will be Rs. 100,000. Compute the value of investment after
10 years into approximate rupee,

Answer:

Since interest rate is 4%, the common ratio is 1.04.


100 → 104
[ 104 ]
1→ = 1.04
100
Value of investment at the end oldie first year = 𝐴(1.04)

Value of investment at the end of the second year = 𝐴(1.04)2

Accordingly, value of investment at the end of the 5th year = 𝐴(1.04)5

𝐴(1.04)5 = 100000
100000
𝐴= (1.04)5

Value at the end of 10th year = 𝐴(1.04)10


100000
= (1.04)5
× (1.04)10

= 𝑅𝑠. 121,665.29

= 𝑅𝑠. 121,665 (Approx. Rupee)


CHAPTER 1 Page 28
Illustration 1.28

Fish in one area of an Indian Ocean will be reduced by 6% per annum owing to the
reason of fishing heavily. How many times will take the number of fish live in that area
to reduce exactly by an half of the numbers existing currently?

Answer:
𝑥
Suppose the existing number of fish is 𝑥. Then exactly half equals to 2.Since reducing by
6% per annum, the number of fish are in a geometric progression and the common ratio
of it is 0.94.
100 → 94
[ 94 ]
1→ = 0.94
100

The number of fish at the end of the first year = 𝑥 (0.94)1

The number of fish at the end of the second year = 𝑥(0.94)2

If time taken to be a half is n, the number of fish at the end of that year = 𝑥(0.94) n
𝑥
∴ = 𝑥(0.94)𝑛
2
1
= (0.94)𝑛
2
By adopting logarithms to both sides.

log(0.94)𝑛 = log(0.5)

n log 0.94 = log 0.5


log 0.5
𝑛=
log 0.94

𝑛 = 11.2 𝑦𝑒𝑎𝑟𝑠
CHAPTER 1 Page 29
Illustration 1.29

The sum of the first and second terms of a geometric progression is 9. The sum of the
first, second and third terms is 21. Find the first term and the common ratio of this
series and write the first three terms.

Answer:

𝑇1 + 𝑇2 = 9 and 𝑇1 + 𝑇2 + 𝑇3 = 21

𝑎 + 𝑎𝑟 = 9 ⇒ 𝑎(1 + 𝑟) = 9 (1)

𝑎 + 𝑎𝑟 + 𝑎𝑟 2 = 21 ⇒ 𝑎(1 + 𝑟 + 𝑟 2 ) = 21 (2)
𝑎(1+𝑟+𝑟 2 ) 21
(2) ÷ (1) => =
𝑎(1+𝑟) 9

Since 𝑎 ≠ 0
(1+𝑟+𝑟 2 ) 21
(1+𝑟)
= 9

9(1 + 𝑟 + 𝑟 2 ) = 21(1 + 𝑟)

9 + 9𝑟 + 9𝑟 2 = 21 + 21𝑟

9𝑟 2 − 12𝑟 − 12 = 0

3𝑟 2 − 4𝑟 − 4 = 0
(3𝑟 + 2)(𝑟 − 2) = 0
2
𝑟 = − 3 𝑜𝑟 𝑟 = 2

2 2
When, 𝑟 = − 3 𝑏𝑦 (1), 𝑎 (1 − 3) = 9
1
𝑎 × 3 = 9 ⇒ 𝑎 = 27

When, 𝑟 = 2 𝑏𝑦 (1), 𝑎(1 + 2) = 9

𝑎=3

Accordingly, two geometric progressions are received:

when 𝑎 = 3 and 𝑟 = 2 the first three terms of geometric series are 3, 6, 12, ....
2
when 𝑎 = 27 and 𝑟 = − 3 the first three terms of geometric series are 27, -18, 12, ....
CHAPTER 1 Page 30
1.9.1 A Sum of the First 𝒏 Terms of a Geometric Progression

Let 𝑆𝑛 is the sum of first 𝑛 terms of a geometric series,

𝑎(𝑟 𝑛 − 1) 𝑎(1 − 𝑟 𝑛 )
𝑆𝑛 = { = , 𝑤ℎ𝑒𝑛 𝑟 ≠ 1
𝑟−1 1−𝑟
𝑛𝑎, 𝑤ℎ𝑒𝑛 𝑟 = 1
𝑎(𝑟 𝑛 −1)
Note: Calculation will be easy with the formula, 𝑆𝑛 = when 𝑟 > 1 and the formula
(𝑟−1)
𝑎(1−𝑟 𝑛 )
𝑆𝑛 = when 𝑟 < 1.
(1−𝑟)

Illustration 1.30

Find the sum of first 10 terms of a geometric progression of 1, 2, 4, 8, ….

Answer:

1,2,4,8, ….
2
𝑎 = 1, 𝑟 = = 2, 𝑛 = 10
1
𝑎(𝑟 𝑛 −1)
Since 𝑟 > 1, let us use the formula; 𝑆𝑛 = 𝑟−1

1(210 − 1)
∴ 𝑆𝑛 = = 1023
2−1

Illustration 1.31

Find the sum of the first 8 terms of a geometric progression of 4, -12, 36, -108, ...

Answer:

4, -12, 36, -108, ...


12
𝑎 = 4, 𝑟 = − = −3, 𝑛 = 8
4
𝑎(1−𝑟 𝑛 )
Since 𝑟 < 1, let us use the formula; 𝑆𝑛 = 1−𝑟

4{1 − (−3)8 }
∴ 𝑆𝑛 = = 1 − 6551 = −6550
1 − (−3)
CHAPTER 1 Page 31
Illustration 1.32
3
The first term of a geometric progression is 16 and the common ratio is 2 . How many
terms should be taken from this progression if the sum of these terms is 211?

Answer:
3
𝑎 = 16, 𝑟 = , 𝑆𝑛 = 211, 𝑛 =?
2
3 𝑎(𝑟 𝑛 −1)
Let use, 𝑟 = 2 > 1 ∴ 𝑆𝑛 = 𝑟−1

3 𝑛
16{( ) −1}
2
211 = 3
−1
2

3 𝑛
211 = 32 {( ) − 1}
2
𝑛
211 3
=( ) −1
32 2
211 3 𝑛
+1=( )
32 2
3 𝑛 243 35
( ) = = 5
2 32 2
𝑛 5
3 3
( ) =( )
2 2
𝑛 = 5.

1.10 Derivative of a Function

The concept of the derivative is the basis of calculus. Differentiation is a method used to find
the slope of a function at any point. Although this is a useful tool in itself, it also forms the
basis for some very powerful techniques for solving optimization problems. The process of
finding the derivative is called differentiation. To denote the derivative of 𝑦 = 𝑓(𝑥) at,
followings can be used.
𝑑𝑦 𝑑
𝑓 ′ (𝑥), , (𝑥), and 𝑦 ′
𝑑𝑥 𝑑𝑥

1.10.1 Basic Rules for Differentiation

In this section, we shall look at some of the most important rules of differentiation. These
rules are involved in completely mechanical and efficient procedures for differentiation.

Rule 1 – The Constant Rule

If 𝑓(𝑥) = 𝑐, where 𝑐 is a constant, then𝑓 ′ (𝑥) = 0

Eg: 𝑓(𝑥) = 5, 𝑓 ′ (𝑥) = 0


CHAPTER 1 Page 32
Rule 2 – The Power Rule

If 𝑓(𝑥) = 𝑥 𝑛 , where n is any real number then 𝑓 ′ (𝑥) = 𝑛𝑥 𝑛−1

Eg: 𝑓(𝑥) = 𝑥 5 , 𝑓 ′ (𝑥) = 5𝑥 4

Rule 3-The constant –Multiple Rule

If 𝑔(𝑥) = 𝑐𝑓(𝑥) = 𝑥 5 and 𝑓 ′ (𝑥) exists, then 𝑔′ (𝑥) = 𝑐𝑓 ′ (𝑥)

Eg: 𝑓(𝑥) = 5𝑥 4 𝑓 ′ (𝑥) = 5 × 4𝑥 3 = 20𝑥 3

Rule 4 –The Summation Rule

If 𝑓(𝑥) = 𝑔(𝑥) ± ℎ(𝑥) and if 𝑓 ′ (𝑥) and 𝑔′ (𝑥) exist, then 𝑓 ′ (𝑥) = 𝑔′ (𝑥) ± ℎ′ (𝑥).

Eg: 𝑓(𝑥) = 𝑥 5 + 𝑥 4 𝑓 ′ (𝑥) = 5𝑥 4 + 4𝑥 3

Rule 4 can be extended to the derivative of any finite number of sums and differences of
functions.

Rule 5 - General Power Rule

If 𝑦 = {𝑔(𝑥)}𝑛 = 𝑢𝑛 , where n is any real number and 𝑔′ (𝑥) exists, then,


𝑑𝑦 𝑑𝑦 𝑑𝑢
= 𝑛{𝑔(𝑥)}𝑛−1 𝑔′ (𝑥) that is, = 𝑛𝑢𝑛−1 . 𝑑𝑥
𝑑𝑥 𝑑𝑥

Eg: 𝑓(𝑥) = (𝑥 3 + 1)2 𝑓 ′(𝑥) = 2(𝑥 3 + 1)(3𝑥 2 ) = 6𝑥 2 (𝑥 3 + 1)

Rule 6 –Product Rule

If ℎ(𝑥) = 𝑓(𝑥)𝑔(𝑥) and 𝑓 ′ (𝑥) and 𝑔′ (𝑥) exist, then ℎ′ (𝑥) = 𝑓(𝑥)𝑔′ (𝑥) + 𝑔(𝑥)𝑓′(𝑥).

That is, the derivative of the product of two functions is the first function times the derivative
of the second, plus the second function times the derivative of the first.

Eg: 𝑓(𝑥) = 𝑥 4 (𝑥 3 + 1)5 𝑓 ′ (𝑥) = 𝑥 4 . {5(𝑥 3 + 1)4 3𝑥 2 } + (𝑥 3 + 1)5 {4𝑥 3 }

Rule 7- The Quotient Rule

𝑓(𝑥)
Let ℎ(𝑥) = such that 𝑔(𝑥) ≠ 0. If 𝑓 ′ (𝑥)and 𝑔′ (𝑥) exist, then
𝑔(𝑥)
𝑔(𝑥)𝑓′(𝑥) − 𝑓(𝑥)𝑔′(𝑥)
ℎ′ (𝑥) =
{𝑔(𝑥)}2
CHAPTER 1 Page 33
2𝑥+1
Eg: 𝑓(𝑥) = 𝑥−1

(𝑥 − 1)(2) − (2𝑥 + 1)(1) 2𝑥 − 2 − 2𝑥 − 1 −3


𝑓 ′ (𝑥) = = =
(𝑥 − 1)2 (𝑥 − 1)2 (𝑥 − 1)2

1.10.2 The Sign of the Derivative and the Nature of a Function

To analyse the behaviour of a function, we must be able to draw the graph of that function
accuratelyenough to show, where the function is increasing and where it is decreasing. In this
section, we discuss the use of the first derivative of a function to determine this.

If 𝑓′(𝑥) > 0 for all 𝑥 in an interval (𝑐, 𝑑), then 𝑓(𝑥) is increasing over the interval.

If 𝑓 ′ (𝑥) < 0 for all 𝑥 in an interval (𝑐, 𝑑), then 𝑓(𝑥) is decreasing over the interval.

1.10.3 Relative Maximum, Relative Minimum, and Inflexion Points of a Function

Let 𝑓(𝑥) be a function. The point (𝑎, 𝑓(𝑎)) is a relative maximum point for 𝑓(𝑥) if (𝑥) ≤ (𝑎)
for all values of 𝑥 near either side of 𝑎. The point (𝑏, 𝑓(𝑏)) is a relative minimum point for
𝑓(𝑥) if 𝑓(𝑥) ≥ 𝑓(𝑏) for all values of 𝑥 near 𝑏 either side of 𝑏. The function whose graph is
shown in the following figure has relative maxima at the points (4, 8) and (9, 7) relative
minima at the points (3, 2) and (7, 4). The point (5,6) is known as a point of inflexion.
CHAPTER 1 Page 34
The relative maximum or relative minimum points of a function together are also called
relative extreme or turning points. If the function 𝑓(𝑥) has a relative extreme at 𝑥 = 𝑎 , then
𝑓 ′ (𝑎) = 0 or 𝑓 ′ (𝑥) does not exist at 𝑥 = 𝑎. All the values of 𝑥 for which 𝑓′(𝑎) = 0 and also all
the values of 𝑥 for which 𝑓 ′ (𝑥) is not defined, are called critical values of the function 𝑓(𝑥).
If 𝑎 is a critical value (𝑎, 𝑓(𝑎)) is called a critical point.

To find the relative maxima, relative minima, and inflexion points of a function 𝑓(𝑥), two
commonly used tests: First Derivative Test and Second Derivative test are as follows.

1.10.4 The First Derivative Test

1. Find the critical values of 𝑓(𝑥).


2. Divide the domain of the function into intervals using critical values.
3. Examine the sign of 𝑓′(𝑥) on each of the intervals obtained in step 2.
4. As 𝑥 increases through the critical value,
o If 𝑓′(𝑥) changes sign from (+) to (-), then the critical point is a relative maxima
o If 𝑓′(𝑥) changes sign from (-) to (+), then the critical point is a relative minima
o If 𝑓′(𝑥) does not change sign, then critical point is an inflexion point.

1.10.5 Successive Differentiation

Since the derivative of a function is itself a function, it too may be differentiated. When this
is done, the result may also be differentiated. Continuing in this way, we obtain higher-order
derivatives. If 𝑦 = 𝑓(𝑥) then 𝑓′(𝑥) is called the first order derivative of 𝑓(𝑥) with respect to
𝑥. The derivatives of 𝑓′(𝑥) denoted by 𝑓′′(𝑥) is called the second –order derivative of 𝑓(𝑥)
with respect to x etc.

If 𝑦 = 𝑓(𝑥)
𝑑𝑦
First derivative 𝑓 ′ (𝑥) = 𝑑𝑥

𝑑𝑦 𝑑𝑦 𝑑2 𝑦
Second derivative 𝑓′′(𝑥) = ( )=
𝑑𝑥 𝑑𝑥 𝑑𝑥 2

𝑑𝑦 𝑑2 𝑦 𝑑3 𝑦
Third derivative 𝑓 ′′′ (𝑥) = 𝑑𝑥 . (𝑑𝑥 2 ) = 𝑑𝑥 3

𝑑𝑦 𝑑3 𝑦 𝑑4 𝑦
Forth derivative 𝑓 𝑖𝑣 (𝑥) = .( )=
𝑑𝑥 𝑑𝑥 3 𝑑𝑥 4
CHAPTER 1 Page 35
The second derivative Test

Suppose 𝑓 ′ (𝑎) = 0

If 𝑓 ′′ (𝑎) < 0 then 𝑓(𝑥) has a relative maximum at 𝑥 = 𝑎

If 𝑓 ′′ (𝑎) > 0 then 𝑓(𝑥) has a relative maxium at 𝑥 = 𝑎

If f ′′(𝑎) = 0 , the test gives no information, that is (𝑎, 𝑓(𝑎)) may be a relative minimum point,
a relative maximum point, or an inflexion point (use the first derivative test).

Illustration 1.33

Use the second derivative test to examine the function 𝑓(𝑥) = 6𝑥 4 − 8𝑥 3 + 1 for
relative maxima and minima.

𝑓(𝑥) = 6𝑥 4 − 8𝑥 3 + 1

𝑓′(𝑥) = 24𝑥 3 − 24𝑥 2

𝑓′′(𝑥) = 72𝑥 2 − 48𝑥


For critical values:

𝑓′(𝑥) = 24𝑥 3 − 24𝑥 2 = 0

24𝑥 2 (𝑥 − 1) = 0

∴ 𝑥 = 0 or 𝑥 = 1 are critical values.

Second derivative test at x = 0

𝑓′′(0) = 72(02 ) − 48(0) = 0 ∴ the test fails at 𝑥 = 0

To perform the first derivative test at 𝑥 = 0, let’s select two arbitrary values around
𝑥 = 0 such that 𝑥 = −0.5 and 𝑥 = 0.5.

At 𝑥 = −0.5, 𝑓 ′ (−0.5) = 24(−0.5)3 − 24(−0.5)2 = −9 < 0

At 𝑥 = 0.5, 𝑓 ′ (0.5) = 24(0.5)3 − 24(0.5)2 = −3 < 0

∴ 𝑓 ′ (𝑥) does not change sign, then the critical point at 𝑥 = 0 is an inflexion point.

Second derivative test at x=1

𝑓′′(1) = 72(12 ) − 48(1) = 24 > 0 ∴ there is a relative minimum at 𝑥 = 1


CHAPTER 1 Page 36
1.11 Applications of Differentiation

1.11.1 Marginal Revenue and Total Revenue

What differentiation does is, it looks at the effect of an infinitely small change in the
independent variable 𝑥 on the dependent variable 𝑦 in a function 𝑦 = 𝑓(𝑥). In Economics,
marginal revenue (MR) is sometimes defined as the increase in total revenue (TR) received
from sales caused by an increase in output by 𝑞 = 1 unit. This is not a precise definition
though. It only gives an approximate value for marginal revenue, and it will vary if the units
that output is measured in, are changed.

A more precise definition of marginal revenue is that it is the rate of change of total revenue
relative to increases in output. We know that the slope of a function can be found by
differentiation and so it must be the case that,

𝑑𝑇𝑅
𝑀𝑅 =
𝑑𝑄

Given that 𝑇𝑅 = 80𝑞 − 2𝑞 2 , derive a function for MR.


𝑑𝑇𝑅 𝑑
𝑀𝑅 = = 𝑑𝑞 (80𝑞 − 2𝑞 2 ) = 80 – 4q
𝑑𝑞

1.11.2 Marginal cost and total cost

Marginal cost (MC) is the rate of change of the total cost (TC) function. In fact, in nearly all
situations where one is dealing with the concept of a marginal increase, the marginal function
is equal to the rate of change of the original function i.e., to derive the marginal function one
just differentiates the original function.

𝑑𝑇𝐶
𝑀𝐶 =
𝑑𝑞
CHAPTER 1 Page 37
Illustration 1.34

Given 𝑇𝐶 = 6 + 4𝑞 2 derive the MC function.


𝑑𝑇𝐶 𝑑
𝑀𝐶 = = 𝑑𝑞 (6 + 4𝑞 2 ) = 8𝑞
𝑑𝑞

1.11.3 Profit Maximisation

We are now ready to see how calculus can help a firm to find the quantity that maximises
profit. At this stage, we shall just use the MC = MR rule for profit maximisation.

Illustration 1.35
A monopoly faces the demand function 𝑝 = 460 − 2𝑞 and the cost function 𝑇𝐶 = 20 +
0.5𝑞 2 . How much should it sell to maximise profit and what will this maximum profit
be?
𝑑𝑇𝐶
Given 𝑇𝐶 = 20 + 0.5𝑞 2 𝑀𝐶 = =𝑞
𝑑𝑞
Since 𝑝 = 460 − 2𝑞,

𝑇𝑅 = 𝑝. 𝑐 = (460 − 2𝑞)𝑞 = 460𝑞 − 2𝑞 2

𝑑𝑇𝑅
Then, 𝑀𝑅 = = 460 − 4𝑞
𝑑𝑞

To maximise profit MR = MC. Therefore,

460 − 4𝑞 = 𝑞

460 = 5𝑞

∴ 𝑞 = 92

The actual maximum profit when the output is 92 will be,

𝑇𝑅 − 𝑇𝐶 = (460𝑞 − 2𝑞 2 ) − (20 + 0.5𝑞 2 )


= 460𝑞 − 2𝑞 2 − 20 − 0.5𝑞 2
= 460(92) − 2.5(92)2 − 20
= 42320 − 21160 − 20 = 21140
CHAPTER 1 Page 38
1.12 Integration

Integral calculus involves the process of finding the function itself when its derivative is
known. This is the inverse process to finding the derivative of a function. The process of
finding an integral of a function is called integration. There are two kinds of integration,
indefinite and definite. We will discuss the indefinite type of integration in this section.

1.12.1 Indefinite Integration

If 𝐹(𝑥) is an integral with respect to 𝑥 of the function 𝑓(𝑥), the relationship between 𝐹(𝑥)
and 𝑓(𝑥) is expressed as follows:

∫ 𝑓(𝑥)𝑑𝑥 = 𝐹(𝑥) + 𝐶

Where the left-hand part is read as “integral of 𝒇(𝒙) with respect to x”. The symbol ∫ 𝑑𝑥

is the integral sign, 𝑓(𝑥) is the integrand, 𝐹(𝑥) is a particular integral, 𝐶 is constant of
integration, and 𝐹(𝑥) + 𝐶 is the indefinite integral.

1.12.2 Rules of integration

Rule 1
𝑑𝑘𝑥
For any constant 𝑘, =𝑘 Hence ∫ 𝑘 𝑑𝑥 = 𝑘𝑥 + 𝐶
𝑑𝑥

Illustration 1.36

∫ 3 𝑑𝑥 = 3𝑥 + 𝐶

𝑑(3𝑥+5)
Here C is used because the derivative of a constant is 0. For example, = 3𝑥
𝑑𝑥

𝑑(3𝑥−2)
as well as = 3𝑥 .
𝑑𝑥

Therefore
∫ 3 𝑑𝑥 = 3𝑥 + 𝐶 is valid for any constant C.

Rule 2
𝑥𝑛+1
𝑑( ) 𝑥 𝑛+1
𝑛+1
For any number 𝑛 ≠ 1, = 𝑥 𝑛 , hence ∫ 𝑥 𝑛 𝑑𝑥 = +𝐶
𝑑𝑥 𝑛+1

Illustration 1.37
3
𝑥 3+1 𝑥4
∫ 𝑥 𝑑𝑥 = +𝐶 = +𝐶
𝑛+1 4
CHAPTER 1 Page 39
Rule 3
𝑑(ln 𝑥) 1 1
= , hence, ∫ 𝑥 𝑑𝑥 = ln 𝑥 + 𝐶
𝑑𝑥 𝑥

𝑑{ln 𝑓(𝑥)} 𝑓 ′ (𝑥)


More generally, =
𝑑𝑥 𝑓(𝑥)

𝑓 ′ (𝑥)
Hence, ∫ 𝑑𝑥 = ln 𝑓(𝑥) + 𝐶
𝑓(𝑥)

Illustration 1.38

2𝑥
∫ 𝑥 2 +1 𝑑𝑥 = ln(𝑥 2 + 1) + 𝐶

Chapter 1 - Reinforcement Problems

1) Simplify and express with positive exponents.


−2
3
27𝑥 3 b) √𝑎−2 𝑏 x √𝑎𝑏 −3
a) 3𝑦 −2
𝑥 2 −𝑥 𝑥 2 +3𝑥 𝑎 𝑎𝑦+3𝑥
c) × d) 3 - +
2𝑥 3 +6𝑥 2 𝑥 2 −1 2𝑥 2𝑥𝑦

2) Factor the followings.


a) 𝑥2 − 𝑥 − 71 b) 9𝑥2 − 4𝑥𝑦2
c) 8𝑥3 − 343 d) 6𝑥2 − 5𝑥3 + 𝑥4

3) Solve the following equations.


(a) (3x + 2) = 5

𝑥 5𝑥+8 2𝑥−9
(b) 4 − =
6 3

(c) 3𝑥 = 7 + 𝑦
5𝑥 − 9𝑦 = 41

4) Solve the following inequalities.


(a) 6 − 4𝑥 > −8 + 3𝑥
1 3
(b) 2 𝑥 + 2 < 𝑥–4
2

5) The supply function for a commodity is, 𝑠 = 300𝑝, and the demand function is,
𝑑 = −200𝑝 + 150,000, where 𝑝 is the sales price per unit.

(a) Graph these functions on the same set of axes.


(b) What is the supply and demand for gadgets if they sell at a piece rate of Rs. 200 each?
CHAPTER 1 Page 40
6) A firm sells each unit it produces for Rs. 28. (Hint, Revenue = Price x Quantity)
(a) Find the firm’s revenue function and graph it.
(b) What is the firm’s revenue if it sells 100 units?
(c) How many units must the firm sell in order to receive a revenue of Rs. 19, 152?

7) Express as a single logarithm.

(a) 5𝑙𝑜𝑔4 7 + 2𝑙𝑜𝑔4 3 − 𝑙𝑜𝑔4 243

(b) 2 log 2 + 2 log 3 + 1

8) Solve equations without using the calculator.


(a) 𝑙𝑜𝑔𝑎 (5𝑥 − 6) + 𝑙𝑜𝑔𝑎 (2𝑥 + 3) = 𝑙𝑜𝑔𝑎 (10𝑥 2 − 3𝑥 − 5)
(b) 2 ln(𝑥 − 2) = ln (2𝑥 − 5)

(c) 4𝑥 − 10(2𝑥 ) + 16 = 10

(d) 32𝑥−1 − 103𝑥 + 3 = 0

9) Solve the equation using the calculator.


(a) 0.2𝑥 = 0.8

(b) 2𝑥 × 5𝑥−1 = 82𝑥+1

10)The population of the world was 3.78 billion in 1972 and 4.43 billion in 1980. Assuming
that population growth is described by an exponential function of 𝑓(𝑡) = 𝑎𝑒 𝑟𝑡 ,
(a) Find the population growth function.
(b) Find the population of the world in the year 2050.
(c) Approximately when will the population of the world reach 50 billion?

11)Differentiate the following functions.


32 4
a) 𝑓(𝑥) = (17)3
b) 𝑓(𝑥) = (4𝑥)3
c) 𝑓(𝑡) = 14𝑡 3 − 6𝑡 2 + 7𝑡
7𝑥 3 +𝑥
d) 𝑦 = 2√𝑥
4𝑞 3 +7𝑞−4
e) 𝑔(𝑞) = 𝑞
f) 𝑓(𝑥) = 𝑥 (3 − 𝑥)2
3

g) 𝑓(𝑥) = 𝑥 2 (𝑥 − 2)(𝑥 + 4)
h) 𝑔(𝑥) = 𝑥 2 (3 + 𝑥)3
𝑥
i) ℎ(𝑥) = 𝑥−1
−23
j) 𝑦 = √12𝑡 2
−3𝑡+2
1+√𝑥
k) 𝑓(𝑥) = 1−√𝑥
l) 𝑓(𝑥) = (𝑥 + √1 + 𝑥 2 )4
CHAPTER 1 Page 41
12)Find the indicated derivative.

𝑑3 𝑦 3𝑥−1
(a) 𝑦 = 4𝑥 3 − 5𝑥 2 + 7𝑥 − 17 ; (b) 𝑓(𝑥) = ; 𝑓′′′(𝑥)
𝑑𝑥 3 𝑥+2

13)Use the first derivative test to find all relative maximum, relative minimum, and inflexion
points for the following functions.

(a) 𝑓(𝑥) = 2𝑥 3 + 9𝑥 2 ± 108𝑥 + 20 (b) 𝑓(𝑥) = 3𝑥 4 − 4𝑥 3 − 6𝑥 2 + 12𝑥

14)Use the second derivative test to find all relative maxima and minima for given functions.

(a) 𝑓(𝑥) = 2𝑥3 − 15𝑥2 − 216𝑥 + 25 (b) 𝑓(𝑥) = 𝑥5 − 15𝑥3

80−𝑞
15)The demand equation for a manufacturers’ product is = , where 𝑞 is the number of
𝑥+4
units and 𝑝 is price per unit. At what value of 𝑞 there is there maximum revenue?

16)A box with an open top is to have a square base and a volume of 500 cubic centimetres. If
the material for the box cost is Rs. 5 per square centimetre, find the dimensions of the box
that minimise its cost.

17)find the following indefinite integrals.


1
(a) ∫(4𝑥 + )𝑑𝑥
𝑥

(b) ∫(1 − 𝑥)𝑥 2 𝑑𝑥


1
(c) ∫[−𝑥(1 − 𝑥 2 )−2 ]𝑑𝑥
3𝑥 2 −7𝑥
(d) ∫ 𝑑𝑥
2𝑥

(e) ∫ √𝑥(3𝑥 + 4)𝑑𝑥


𝑥−2
(f) ∫ 2𝑥+5 𝑑𝑥
4
(g) ∫ 𝑑𝑥
√2+3𝑥

(h) ∫ (2𝑥 + 1)1/2 𝑑𝑥


CHAPTER 2 Page 42

Frequency Distributions and


Presentation of Data

Learning Outcomes

After successful completion of this chapter, you should be able to:

• Describe different types of data


• Explain the concepts related to frequencies
• Apply the appropriate technique in different business situations
• Interpret the features of distributions using measures of central tendency and
dispersion
• Compare different data presentation techniques
• Illustrate the use of data presentation techniques
• Apply the appropriate data presentation techniques

2.0 Introduction

Business statistics is a field of study that involves the collection, analysis, interpretation, and
presentation of data in order to make informed business decisions. It encompasses the
application of statistical methods and techniques to address various business-related
problems and situations. Business statistics helps businesses in a variety of ways, including:

▪ Descriptive Analysis: Summarising and presenting data in a meaningful way using


measures like central tendency (mean, median, mode), dispersion (range, variance,
standard deviation), percentages, and graphical representations. This helps in
understanding the current state of affairs and identifying trends.

▪ Inferential Analysis: Drawing conclusions and making predictions about a population


based on a sample, using techniques such as hypothesis testing and confidence intervals.
This allows businesses to make inferences and generalisations about a larger population
based on limited data.
CHAPTER 2 Page 43
▪ Forecasting: Using historical data to predict future trends and patterns in business
operations, demand for products/services, and market behaviour. This helps in planning
and resource allocation.

▪ Decision Making: Utilising statistical models and techniques to support decision-making


processes, including risk analysis, optimisation, and simulation. Statistical tools assist in
evaluating different options and selecting the best course of action.

▪ Quality Control: Assessing and monitoring the quality of products or processes through
statistical methods, such as control charts and statistical process control. This ensures
consistent quality and helps in identifying and rectifying issues.

Business statistics provide valuable insights and enable businesses to make data-driven
decisions, optimise processes, identify opportunities, manage risks, and evaluate
performance.

There are two major divisions of statistics such as:


• Descriptive statistics and
• Inferential statistics.

2.1 Descriptive statistics

Descriptive statistics involves summarising and describing the main characteristics of a


dataset. It provides a concise and meaningful way to understand and interpret the data. Here
are some key concepts and measures used in descriptive statistics:

• Measures of Central Tendency: These measures describe the central or average value of
a dataset. The most commonly used measures are the mean (average), median (middle
value), and mode (most frequent value).

• Measures of Dispersion: These measures indicate the spread or variability of the data.
Common measures include the range (difference between the maximum and minimum
values), variance, and standard deviation.

• Frequency Distribution: It represents the pattern of occurrence of different values in a


dataset, often displayed in the form of a histogram or a frequency table.

• Percentiles: Percentiles divide a dataset into equal-sized parts, providing information on


how values are distributed across the data. The median is the 50th percentile, quartiles
divide the data into quarters (25th, 50th, and 75th percentiles), and deciles divide the data
into tenths.
• Skewness and Kurtosis: Skewness measures the asymmetry of a distribution, indicating
whether it is skewed to the left or right. Kurtosis measures the degree of onward or
flatness of a distribution.
CHAPTER 2 Page 44
Descriptive statistics allow researchers and analysts to summarise and communicate key
characteristics of a dataset, understand its distribution, identify outliers, and gain initial
insights before further analysis.

2.2 Inferential statistics

Inferential statistics involves making inferences and drawing conclusions about a population
based on sample data. It extends beyond the descriptive analysis of a dataset and enables us
to generalise and make predictions about a larger population. Here are some key concepts
and techniques used in inferential statistics:

• Sampling: Inferential statistics relies on the selection of a representative sample from a


population. The sample should be chosen using appropriate sampling methods to ensure it
accurately represents the population of interest.

• Estimation: Estimation involves using sample data to estimate population parameters.


Point estimation provides a single value estimate of a parameter. For example, sample
mean is a point estimator for the population mean. It provides specific value as an estimate
without indicating the uncertainty associated with it. On the other hand, interval estimation
provides a range within which the parameter is likely to fall with a given level of confidence.
This range is known as confidence interval.

• Hypothesis Testing: Hypothesis testing is used to make decisions about the population
based on sample data. It involves formulating a null hypothesis and an alternative
hypothesis, collecting sample data, and using statistical tests to determine whether the
evidence supports rejecting or failing to reject the null hypothesis.

• Regression Analysis: Regression analysis examines the relationship between dependent


variable and independent variables and helps predict the value of the dependent variable
based on the values of other independent variables. It allows for the identification of
significant predictors and the estimation of the strength and direction of relationships.

In this chapter we are focusing solely on descriptive statistics. Firstly, let's identify simple
statistical definitions.

2.3 Data

Data refers to factual information or raw facts that can be collected, measured, or observed.
It can exist in various forms such as numbers, text, images, or audio. Data serves as the
foundation for generating knowledge and making informed decisions. It can be analysed,
organised, and interpreted to uncover patterns, trends, and insights. Data plays a crucial role
in research, business operations, and everyday life, enabling us to make informed choices
based on evidence.
CHAPTER 2 Page 45
Data is classified into two broad categories:
• Quantitative data
• Qualitative data

2.3.1 Quantitative data

Quantitative data refers to information that is expressed in numerical form and can be
measured or counted. It deals with quantities and numerical values, allowing for
mathematical and statistical analysis. Examples of quantitative data include numerical
measurements, counts, percentages, and ratings.

Continuous data refers to a type of quantitative data that can take on any value within a
specific range or interval. It is characterised by having an infinite number of possible values
within that range. Continuous data is typically measured on a continuous scale, such as time,
temperature, weight, height, or distance. Continuous data can be infinitely divided into
smaller and smaller units, according to the degree of accuracy. For example, temperature can
be measured with decimal precision, allowing for values like 25.5 degrees Celsius or 73.8
degrees Fahrenheit.

Discrete data refers to a type of quantitative data that can only take on specific, distinct
values. These values are typically whole numbers or categories and cannot be subdivided
further. Discrete data is often the result of counting or categorising items or events. Examples
of discrete data include the number of students in a classroom, the number of cars sold in a
month, or the types of animals in a zoo.

Interval data represents quantities where the difference between the values is meaningful
and consistent. It does not have true zero point but can have positive and negative values.
Examples include temperature measured in Celsius or Fahrenheit.

Ratio data has all the properties of interval data, but with a true zero point. Ratios between
values can be calculated, and the absence of certain values is represented by zero. Examples
include income, height, or profit.

Example 2.1 - Interval and ratio scales

Numerical Variable Level of Measurement


Temperature (in degrees Celsius or Fahrenheit) Interval
Standardised exam score (e.g., Z-score) Interval
Time to download a file (in seconds) Ratio
Age (in years or days) Ratio
Cost of a personal computer system (Rs.) Ratio
CHAPTER 2 Page 46
Grouped data refers to a data where the values are categorized or grouped into intervals or
classes. Instead of individual values, the data is presented in ranges or categories along with
their corresponding frequencies or counts. This grouping is often done to simplify and
summarize data when dealing with large data sets or continuous variables.

Exam score (X) Frequency (f)


30 – 39 4
40 – 49 7
50 – 59 9
60 – 69 15
70 – 79 10
80 – 89 3
90 – 99 2
Total 50
Table 2.1: grouped data with frequency distribution

Ungrouped data refers to a data where each individual value is listed separately without
any categorization or grouping. It presents the raw or individual values as they are collected
or observed.

Number of television sets (X) Frequency (f)


0 2
1 13
2 18
3 0
4 10
5 2
Total 45
Table 2.2: ungrouped data with frequency distribution

2.3.2. Qualitative Data

Qualitative data refers to non-numerical information that describes qualities, characteristics,


or attributes. It is typically obtained through methods such as interviews, observations, or
open-ended survey responses. Examples of qualitative data include gender, perception about
a certain product or service, or job satisfaction.
Qualitative data can be further divided into two groups: Nominal data and ordinal data:

Nominal data represents categories or labels with no inherent order or ranking. Examples
of nominal data include gender (male, female), marital status (single, married, divorced), or
product categories (electronics, clothing, furniture)
CHAPTER 2 Page 47
Example 2.2 – Nominal data
• Do you currently have a Facebook profile? ❑Yes ❑No

• Types of investments ❑ Stocks ❑ Bonds ❑ Other ❑ None

• Internet email ❑ Gmail ❑ Outlook ❑ Yahoo ❑ Other


provider

Ordinal data represents categories with a natural order or ranking. It involves qualitative
characteristics that can be ranked or ordered based on a specific criterion. Examples of
ordinal data include customer satisfaction ratings (poor, satisfactory, good, excellent),
ratings for a product (low, medium, high), or education level (high school, bachelor’s degree,
master’s degree).

Example 2.3 – Ordinal data


• Product satisfaction
1. Very unsatisfied
2. Fairly unsatisfied
3. Neutral
4. Fairly satisfied
5. Very satisfied
• Student grades
1. A: Best
2. B: Above- average
3. C: Satisfactory
4. D: Below-average
5. F: Poor

2.3.3 Data Sources


Statistical data are classified under two categories: primary data and secondary data.
Primary data refers to original data collected firsthand by researchers specifically for their
own study or investigation. It is gathered directly from original sources, such as surveys,
experiments, observations, or interviews to address specific research questions or objective.
Here, researchers have control over the data collection process, study design, and the type of
information they want to obtain. Examples of primary data include survey responses from
questionnaires, experimental data from controlled experiments, field observations, or
participant interviews.

Secondary data refers to existing data that has been collected by others or for a different
purpose than the current study. It is readily available through sources like research articles,
books, government reports, databases, or online repositories. Researchers utilise secondary
data to supplement their own research, saving time and resources compared to collecting
primary data. It can provide historical, comparative, or contextual information for research
purposes.
CHAPTER 2 Page 48
Examples of secondary data include census data, market research reports, historical records,
academic studies, and publicly available datasets. However, it is important to critically
evaluate the quality, relevance, and limitations of secondary data before using it in a research
study.

2.4 Presentation of Qualitative Data

2.4.1 Frequency Distributions of Qualitative Data

A frequency distribution for qualitative data lists all categories and the number of elements
that belong to each of the categories.

Illustration 2.1 – Frequency distribution


A sample of a r r e s t s i n a small city p r o v i d e d the following set of offenses with
which individuals were charged:

rape robbery burglary Arson murder robbery Rape manslaughter

arson theft arson Burglary theft robbery theft theft

theft burglary murder Murder theft theft theft manslaughter

Create a frequency distribution for the above data.

The variable, “Offense type”, has classified into the following categories: rape, robbery,
burglary, arson, murder, theft, and manslaughter. These seven categories are listed under
the column labelled “Offense type”, and the number of occurrences of each category is
represented by tally marks. The number of tallies for each offense is counted and displayed
in the column label “Frequency”. Sometimes, the term “Absolute frequency” is used
interchangeably with “Frequency”.

Offense type Tally marks Frequency


Rape // 2
Robbery /// 3
Burglary /// 3
Arson /// 3
Murder /// 3
Theft //// /// 8
Manslaughter /// 3
CHAPTER 2 Page 49
2.4.2. Bar graph

A bar graph is a graphical representation of bars, where the heights of the bars represent the
frequencies of different categories. Bars can be represented either vertically or horizontally
in a bar graph. It visually displays the same information about the qualitative data that a
frequency distribution presents in a tabular format.

2.4.3. Pie chart

A pie chart is also used to graphically display qualitative data. It involves dividing a circle into
segments that represent the relative frequencies or percentages of different categories. The
angle of each segment is determined by multiplying 3600 by the corresponding percentage.
For instance, referring to the Illustration 2.1, the angle of the “theft segment” can be
8
calculated as 360 ∗ = 1150
25

2.5 Presentation of Quantitative Data

2.5.1 Frequency Distribution for Quantitative Data

There are several similarities between frequency distributions for qualitative data and
frequency distributions for quantitative data. Firstly, let’s discuss the terminology used for
frequency distributions of quantitative data. Afterwards, we will provide examples to
illustrate the construction of frequency distributions for quantitative data.
CHAPTER 2 Page 50
Class limits, class boundaries, class marks, and class width

Class limits: Each class in a frequency distribution has two class limits. The lower-class limit
is the smallest value that can be included in the class, while the upper-class limit is the largest
value that can be included in the class.

Class boundaries: The class boundaries are the values that separate one class from another
and those help define the intervals within which data falls. Class boundaries are calculated
by subtracting half a unit of measure to the lower-class limit of a class and adding half a unit
of measure from the upper-class limit of a class. Or, we can add the lower-class limit of a class
to the upper-class limit of the preceding class and divide the sum by 2. This calculation
provides the upper boundary for the preceding class and the lower boundary for the current
class.

Class mark: The class mark represents the centre of the class. It is calculated by adding the
lower-class limit to the upper-class limit of a class and dividing the sum by 2. It is also known
as class midpoint.

Class width: The class width refers to the difference between the upper and lower
boundaries of a class. It represents the range or interval of values covered by each class in
the frequency distribution.

Illustration 2.2 – Classes, class boundary, class mark, class width


The following grouped data gives a frequency distribution of the IQ scores for 75
adults.

IQ Score Frequency
80 - 94 8
95 - 109 14
110 - 124 24
125 - 139 16
140 - 154 13

The above frequency distribution table consists of five classes: 80 - 94, 95 – 109, 110 -124,
125 – 139, and 140 - 154.
Each class is defined by a lower-class limit and an upper-class limit. For this distribution,
lower class limits are 80, 95, 110, 125, and 140, while the upper-class limits are 94, 109,
124, 139, and 154.
The lower-class boundaries are 79.5, 94.5, 109.5, 124.5, and 139.5 while the upper-class
boundaries are 94.5, 109.5, 124.5, 139.5, and 154.5.

The class marks are (80+94)/2=87, (95+109)/2=102, (110+124)/2=117,


(125+139)/2=132, and (140+154)/2=147.
CHAPTER 2 Page 51
Class width is 15 (for example 94.5 – 79.5 = 15)

Class Limits Class Boundaries Class Width Class Marks


80 – 94 79.5 - 94.5 15 87
95 – 109 94.5 - 109.5 15 102
110 – 124 109.5 - 124.5 15 117
125 – 139 124.5 - 139.5 15 132
140 – 154 139.5 - 154.5 15 147

Illustration 2.3

The price for 500 aspirin tablets were determined for each of twenty randomly selected
stores as partof a larger consumer study. The prices are as follows:

2.50 2.95 2.65 3.10 3.15 3.05 3.05 2.60 2.70 2.75
2.80 2.80 2.85 2.80 3.00 3.00 2.90 2.90 2.85 2.85

Suppose we want to group these data into seven classes. Since the maximum price is 3.15
and the minimum price is 2.50, the price range is 0.65. Each class should then have a width
equal to approximately 1/7 of 0.65, which is approximately 0.093. There is flexibility in
choosing the classes while following the guidelines. The following table shows the results
if a class width of 0.10 is selected and the first class begins at the minimum price.

Price (class interval) Class boundaries Tally mark Frequency


2.50 - 2.59 2.495-2.595 / 1
2.60 - 2.69 2.595 - 2.695 // 2
2.70 - 2.79 2. 695 - 2.795 // 2
2.80 - 2.89 2. 795 - 2.895 //// / 6
2.90 - 2.99 2. 895 - 2.995 /// 3
3.00 - 3.09 3. 995 - 3.095 //// 4
3.10 - 3.19 3. 095 - 3.195 // 2

2.5.2 Histograms

A histogram is a graph that displays the class boundaries on the horizontal axis and their
corresponding frequencies on the vertical axis. Each class is represented by a vertical bar,
and the height of the bar corresponds to the frequency of that class. While similar to bar
graph, a histogram differs in that it uses class boundaries along with frequencies and it has
no gaps between the bars. On the other hand, a bar graph uses categories and frequencies.
To illustrate, let’s construct a histogram for the example of aspirin prices mentioned earlier.
CHAPTER 2 Page 52

NOTE: Label the X axis

2.5.3 Cumulative Frequency Distributions

There are two types of cumulative distributions.

1. Less than type cumulative frequency distribution


2. More than type cumulative frequency distribution

Less than type cumulative frequency distribution, also known as cumulative frequency
distribution, shows the cumulative frequencies that are less than or equal to the upper
boundary of each class interval.

More than type cumulative frequency distribution, also known as greater than
cumulative frequency distribution, shows the cumulative frequencies that are greater than
or equal to the lower boundary of each class interval.

Illustration 2.4 – Less than cumulative frequency and more than cumulative
frequency.

Less than More than


Price Class boundary Frequency
Cumulative Cumulative
Frequency Frequency
2.50 - 2.59 2.495 - 2.595 1 1 20
2.60 - 2.69 2.595 - 2.695 2 3 19
2.70 - 2.79 2.695 - 2.795 2 5 17
2.80 - 2.89 2.795 - 2.895 6 11 15
2.90 - 2.99 2.895 - 2.995 3 14 9
3.00 - 3.09 3.995 - 3.095 4 18 6
3.10 - 3.19 3.095 - 3.195 2 20 2
CHAPTER 2 Page 53
2.5.4. Cumulative Frequency Graph – Ogive

An ogive is a graphical representation of a cumulative frequency distribution. An ogive graph


plots cumulative frequency on the y-axis and the corresponding upper-class boundaries
along the x-axis. It allows you to estimate percentiles and understand the cumulative
behaviour of the data. Ogives can also be constructed for a cumulative relative frequency
distribution as well as a cumulative percentage distribution.

2.6 Descriptive Summary Measures

Descriptive summary measures, also known as descriptive statistics, are used to summarise
and describe the main characteristics of a dataset. These measures provide insights into the
central tendency, dispersion, and shape of the data. If the measures are computed for data from
a sample, they are referred to as sample statistics.

If they are computed for data from a population, they are referred to as population
parameters. Descriptive summary measures provide a comprehensive overview of the data,
allowing for comparisons, identification of patterns, and understanding the data
distributions.

2.6.1 Measures of Central Tendency

Mean: The arithmetic average of the values in the dataset. Suppose x1 , x2 , ⋯ xn are the
measurements obtained for the variable X of randomly selected n individuals. Then the
sample mean of X, denoted by, x̅, is the sum of the values in the sample divided by the
number of values in the sample.
∑𝑛𝑖=1 𝑥𝑖
𝑥̅ = .
𝑛
CHAPTER 2 Page 54

Illustration 2.5 – Sample mean

The table below displays the monthly starting salaries of 12 university graduates who are
experts in the field of business. Compute the mean of the salary of selected graduates.

Graduate Monthly starting Graduate No Monthly starting


No salary (Rs.) salary (Rs.)
1 34500 7 34900
2 35500 8 37300
3 36500 9 35400
4 34800 10 39250
5 33550 11 35200
6 33100 12 34800

∑12
i=1 xi 34500+35500+ ⋯ +48000 424800
x̅ = = = = 𝑅𝑠. 35,400
12 12 12

Median: The middle value in the dataset when it is arranged in ascending or descending
order. For an odd number of observations, the median is the middle value and for even
number of observations, the median is the average of the two middle values. While the mean
is often preferred measure of central location, there are situations where median takes
precedence. This is because the mean can be susceptible to the influence of extreme small or
large data values.

Illustration 2.6 - Median

1) Compute the median of the given values: 46, 42, 32, 46, 54.

Arranging the data in ascending order provides the following list,

32 42 46 46 54

The median is the middle value since there are odd number of observations (𝑛 = 5).

Median = 46.

2) Compute the median of the given ordered values:


3310 3355 3450 3480 3480 3490 3520 3540 3550 3650 3730 3925

There are even number of observations here (𝑛 = 12), hence the median is the
average of the middle two values.

3490 + 3520
Median = = 3505
2

Mode: The value(s) that appear most frequently in the data set.
CHAPTER 2 Page 55

Illustration 2.7 - Mode

The systems manager, who is responsible for overseeing the company’s network,
maintains a log of the daily occurrences of server failures. The following data
represents the number of server failures that occurred over a two-week period:

1 3 0 3 26 2 7 4 0 2 3 3 6 3
Find the mode of the number of server failures that occurred over a two-week period.
The ordered array for these data is,

0 0 1 2 2 3 3 3 3 3 4 6 7 26

Mode = 3.

Note: There are situations where a dataset may have multiple modes or no mode at all.

Geometric Mean: A mathematical measure used to calculate the average value of a set of
numbers, by taking the 𝒏th toot of the product of 𝑛 values.

1
𝑥̅𝐺 = (𝑥1 × 𝑥1 × ⋯ × 𝑥𝑛 )𝑛

The geometric mean has various applications, such as calculating the overall growth
rate of investment returns across multiple periods, population growth rate, economic
growth rate, or sales growth rate.

Illustration 2.8 – Geometric mean

Suppose 𝑅1 , 𝑅1 , …, 𝑅𝑛 , are the rates of returns for 𝑛 consecutive years, the geometric mean
rate return per year is given by,
1
𝑅̅ = [(1 + 𝑅1 ) × (1 + 𝑅2 ) × ⋯ × (1 + 𝑅𝑛 )]𝑛 − 1.

The percentage change in the Russell 2000 Index of the stock prices of 2,000 small
companies was -33.79% in 2008 and 27.17% in 2009. Compute the geometric mean rate
of return per year.
1
𝑅̅ = [(1 + 𝑅1 ) × (1 + 𝑅2 )]2 − 1
1
𝑅̅ = [(1 − 0.3379) × (1 + 0.2717)]2 − 1
𝑅̅ = −0.0824

The geometric mean rate of return in the Russell 2000 Index for the two years is -8.24%
per year.
CHAPTER 2 Page 56
2.6.2 Quartiles

After arranging the data from smallest to largest, it is often desirable to divide data into four
parts, with each part containing approximately one- fourth, or 25% of the observations. The
division points are referred to as the quartiles and are defined as
𝑄1 = first quartile 𝑄2 = second quartile (also the median) 𝑄3 = third quartile
𝑛 + 1 𝑡ℎ
𝑄1 = (1 × ) data value
4
𝑛 + 1 𝑡ℎ
𝑄2 = (2 × ) data value
4
𝑛 + 1 𝑡ℎ
𝑄3 = (3 × ) data value
4

Illustration 2.9 - Quartiles


The nutritional data for a sample of seven breakfast cereals reveals the number of calories
per serving as:

100, 80, 130, 100, 200, 190, 110.

Compute the first, second and third quartiles of calories for the cereals.

Ordered dataset:

80, 100, 100, 110, 130, 190, 200

𝑛+1 𝑡ℎ 7+1 𝑡ℎ
𝑄1 = ( ) data value = ( ) data value = 2𝑛𝑑 data value =100
4 4
𝑛+1 𝑡ℎ 7+1 𝑡ℎ
𝑄2 = (2 × ) data value = (2 × ) data value = 4𝑡ℎ data value = 110
4 4

𝑛+1 𝑡ℎ 7+1 𝑡ℎ
𝑄3 = (3 × ) data value = (3 × ) data value = 6𝑡ℎ data value = 190
4 4

Illustration 2.10 – Quartiles


Compute the first, second and third quartile for the following ordered dataset.

29, 31, 35, 39, 39, 40, 43, 44, 44, 52

𝑛+1 𝑡ℎ 10+1 𝑡ℎ
𝑄1 = ( ) data value = ( ) data value = 2.75𝑛𝑑 data value = 2nd data value +
4 4
0.75 × (3𝑟𝑑 data value − 2 𝑛𝑑
data value) =31 + 0.75 × (35 − 31) = 31 + 3 = 34

𝑛+1 𝑡ℎ 10+1 𝑡ℎ
𝑄2 = (2 × ) data value = (2 × ) data value = 5.5𝑡ℎ data value = 5th data value
4 4
+ 0.5 × (6𝑡ℎ data value − 5𝑡ℎ data value)= 39 + 0.5 × (40 − 39) = 39 + 0.5 = 39.5

𝒏+𝟏 𝒕𝒉 𝟏𝟎+𝟏 𝒕𝒉
𝑸𝟑 = (𝟑 × ) 𝐝𝐚𝐭𝐚 𝐯𝐚𝐥𝐮𝐞 = (𝟑 × ) 𝐝𝐚𝐭𝐚 𝐯𝐚𝐥𝐮𝐞 = 𝟖. 𝟐𝟓𝒕𝒉 𝐝𝐚𝐭𝐚 𝐯𝐚𝐥𝐮𝐞 = 8th data
𝟒 𝟒
value + 𝟎. 𝟐𝟓 × (𝟗𝒕𝒉 𝐝𝐚𝐭𝐚 𝐯𝐚𝐥𝐮𝐞 − 𝟖𝒕𝒉 𝐝𝐚𝐭𝐚 𝐯𝐚𝐥𝐮𝐞)= 44 + 𝟎. 𝟐𝟓 × (𝟒𝟒 − 𝟒𝟒) = 44
CHAPTER 2 Page 57
2.6.3 Measures of Dispersion

By comparing multiple datasets, it is possible for the averages to be the same. However, the
variables within these datasets may differ significantly in magnitudes. As a result, the central
tendency calculated from these variables may not be the most typical or representative
measure in many cases. To understand the extent of spread or variation around these averages,
we need to consider other measures, such are dispersion.

The following methods are important for measuring dispersion.


• Range
• Inter-quartile Range
• Variance and Standard Deviation
• Coefficient of variation

Range: The range is the simplest measure of dispersion, defined as the difference between
the minimum and the maximum value in a data distribution. It provides a rough estimate of
dispersion, but it is influenced by the extreme values rather than considering all the items in
the dataset.

Range = Maximum value − Minimum value

Illustration 2.11 - Range

Find the range of the following ordered dataset.

80 100 100 110 130 190 200

Range = Maximum – Minimum = 200 – 80 = 120

Inter-Quartile Range: By eliminating the lowest 25% and the highest 25% of items in a
series, we are left with the central 50%, which are ordinarily free of extreme values. Inter
quartile range (IQR) is computed by deducting the value of first quartile (𝑄1 ) from the value
of third quartile (𝑄3 ).

𝐼𝑄𝑅 = 𝑄3 − 𝑄1

Illustration 2.12 – IQR (Refer illustration 2.9)


Compute the IQR for the following ordered dataset.
29, 31, 35, 39, 39, 40, 43, 44, 44, 52

𝑄1 = 34, 𝑄2 = 39.5 and 𝑄3 = 44

𝐼𝑄𝑅 = 𝑄3 − 𝑄1 = 44 − 34 = 10
CHAPTER 2 Page 58
Variance and standard deviation: Two commonly used measures of dispersion that take
into account all the data values in a distribution are the variance and the standard deviation.
The variance is the average of squared differences between each data point and the
mean of the dataset. It quantifies the overall spread of the data points around the mean. A
higher variance indicates greater variability in the dataset. The standard deviation is the
square root of the variance.

For a sample data, variance and standard deviation are denoted by 𝑆 2 and 𝑆, respectively.

∑𝑛
𝑖=1(𝑥𝑖 −𝑥̅ )
2 ∑𝑛 2
𝑖=1 𝑥𝑖 −𝑛𝑥̅
2
Sample variance =𝑆 2 = =
𝑛−1 𝑛−1
∑𝑛
𝑖=1(𝑥𝑖 −𝑥̅ )
2 ∑𝑛 2
𝑖=1 𝑥𝑖 −𝑛𝑥̅
2
Sample standard deviation = 𝑆 = √𝑆 2 = √ =√
𝑛−1 𝑛−1

Illustration 2.13 – Variance and standard deviation


The monthly revenue (in millions of rupees) for a company over the past 12 months is
given below:

10, 12, 9, 11, 14, 15, 13, 16, 18, 17, 16, 17

Find the variance and standard deviation of the monthly revenue for the given data.
(𝟏𝟎+𝟏𝟐+𝟗+𝟏𝟏+𝟏𝟒+𝟏𝟓+𝟏𝟑+𝟏𝟔+𝟏𝟖+𝟏𝟕+𝟏𝟔+𝟏𝟕) 𝟏𝟔𝟖
Mean revenue = = = 𝐑𝐬. 𝟏𝟒 (𝐢𝐧 𝐦𝐢𝐥𝐥𝐢𝐨𝐧𝐬)
𝟏𝟐 𝟏𝟐

∑𝒏 ̅ )𝟐
𝒊=𝟏(𝒙𝒊 −𝒙 (−𝟒)𝟐 +(−𝟐)𝟐 +(−𝟓)𝟐 +(−𝟑)𝟐 +𝟎+𝟏𝟐 +(−𝟏)𝟐 +𝟐𝟐 +𝟒𝟐 +𝟑𝟐 +𝟐𝟐 +𝟑𝟐 𝟗𝟏
Variance = 𝒏−𝟏
=
𝟏𝟏
=
𝟏𝟏
= 𝟖. 𝟐𝟕

∑𝒏𝒊=𝟏(𝒙𝒊 −𝒙̅ )𝟐
Standard deviation = √ = √𝟖. 𝟐𝟕 = 𝐑𝐬. 𝟐. 𝟖𝟖 (𝐢𝐧 𝐦𝐢𝐥𝐥𝐢𝐨𝐧𝐬)
𝒏−𝟏

Coefficient of variation (CV):

Coefficient of variation measures the relative variability of a dataset compared to its mean.
It is expressed as a percentage and is used to compare the variability of different datasets,
particularly their means are greatly different. The higher coefficient of variation indicates
that the corresponding group of data is more variable or less homogeneous.

𝑆
𝐶𝑉 = × 100%
𝑥̅

Illustration 2.14

Consider the following two investment options:


Option A: A low-risk investment with an average annual return of 5% and a standard
deviation of 2%.
Option B: A high-risk investment with an average annual return of 10% and a standard
deviation of 8%.
CHAPTER 2 Page 59
To determine which option carries more risk relative to its expected return, we can
calculate the coefficient of variation for each option:
0.02 0.08
CV for Option A = 0.05
̅̅̅̅̅̅
× 100% = 40% and CV for Option B = 0.10
̅̅̅̅̅̅
× 100% = 80%

In this case, Option B has a higher CV of 80%, indicating a higher level of relative risk or
volatility compared to Option A with a CV of 40%. This information can assist investors in
assessing the trade-off between potential returns and associated risks when making
investment decisions.

2.7 Measures of Shape

Shape refers to the pattern of the distribution of data values across the entire range of values.
A distribution can be either symmetrical or skewed. In a symmetrical distribution, the values
below the mean are distributed in the same manner as the values above the mean. This
balance between low and high values creates symmetry. On the other hand, in a skewed
distribution, the values are not symmetrical around the mean. This asymmetry leads to an
imbalance either in the lower values or the higher values. In a symmetric distribution, mean,
median and mode are coincident. For a distribution,

• Mean <median: negative, or left-skewed,

• Mean = median: symmetric, or zero skewness,

• Mean >median: positive, or right skewed.

The following Figure depicts three data sets, each with a different shape.

Skewness and kurtosis are two shape-related statistics. The skewness statistic measures the
extent to which a set of data is not symmetric. A symmetric distribution has a skewness value
of zero. A right-skewed distribution has a positive skewness value, and a left-skewed
distribution has a negative skewness value.

The kurtosis statistic measures the relative concentration of values in the centre of the
distribution of a data set, as compared with the normal distribution. That is, data sets with high
kurtosis tend to have heavy tails, or outliers. Data sets with low kurtosis tend to have light tails, or lack of
outliers.
CHAPTER 2 Page 60
Chapter 2 - Reinforcement Problems
1) Classify each of the following as descriptive statistics or inferential statistics.
a) The average filling volume, percent of under-filling, average number of machine
breakdowns, and average number of particles per bottle are measured in a soft drink
bottling process.
b) Ten percent of the boxes of cereal sampled by a quality technician are found to be less
than the labelled weight. Based on this finding, the filling machine is adjusted to
increase the amount of fill.
c) The Daily FT gives several pages of numerical quantities concerning prices and trading
volumesof stocks listed in the Colombo Stock Exchange.
d) Based on a study of 500 graduates by a social researcher, a magazine reports that 25%
of all graduates are employed in the public sector.

2) In a sociological study involving 35 low-income households, the number of children per


household was recorded for each household. What are the variables in this study? How
many observations are in the data set?

3) In a national survey, 2000 schools were mailed questionnaires, and one of the questions
asked for the number of toilets in each school. Out of the 2000 surveys mailed, one
thousand of the surveys were completed and returned. What is the variable in the study
and how large is the data set?

4) Classify the variables in Problems 2 and 3 as continuous or discrete.

5) Classify the following variables as qualitative or quantitative:


a) The colour of automobiles involved in several severe accidents.
b) The length of time required for rats to move through a maze.
c) The classification of police administrations as city, district, or province
d) The rating given to a pizza in a taste test as poor, good, or excellent.
e) The number of times a person has been married.

6) Indicate scale of measurements (ordinal, nominal, interval, or ratio) of the following variables:
racial origin, monthly phone bills, Fahrenheit and centigrade temperature scales, military
ranks, time taken to finish a 100m race, ranking of a personality trait, clinical diagnosis of
breast cancer, and calendar numbering of the years.

7) The following list gives the academic ranks of the 25 female faculty members at a small
liberal arts college: Give frequency and percentage distributions for these data.
CHAPTER 2 Page 61
8) The table below gives the frequency distribution for the cholesterol values of 45 patients
in a cardiac rehabilitation study. Give the lower and upper-class limits and boundaries as
well as the class marks for each class. Also, draw a histogram and an ogive for the frequency
distribution.

Cholesterol value Frequency


170 to 189 3
190 to 209 10
210 to 229 17
230 to 249 13
250 to 269 2

9) Compute (arithmetic) mean, median, mode, quartiles, range, interquartile range, variance,
standard deviation, and coefficient of variance for following datasets.

a) 15, 10, 30, 8, 20, 25, 30, 30, 20, 15, 8, 20, 20
b) 1, 6, 8, 3, 2, 5, 7, 4, 3, 3, 4, 5, 6, 6, 4, 5, 4

10) If a person received a 20% raise after one year of service and 10% raise after the second
year of service, find the average percentage raise per year.

11) The marks obtained by 500 students in a mathematics examination have been grouped
into class intervals, such as 1- 20, 21-40, 41-60, 61-80, and 81-100. The ogive for these
marks is shown below. (CHANGED and NEW GRAPH ADDED)

(a) Complete the following table:

Class interval Class Class mark Less than cumulative Frequency


boundary frequency
1 - 20
21 - 40
41 - 60
61 - 80
81 - 100

(b) Read off the median from the ogive.


(c) Find the inter-quartile range.
CHAPTER 2 Page 62

12) In a tea-bag-filling process, the weight of the tea in individual bags is a quality
characteristic of interest. The label on the tea package indicates an average weight of 5.5
grams per bag. However, achieving the exact amount of tea in each bag is challenging due
to various factors such as temperature and humidity variations within the factory,
differences in tea density, and the high-speed nature of the filling process. Below is a
sample of the weights of 20 tea bags produced within one hour by a single machine.

5.57 5.40 5.53 5.54 5.55 5.62 5.56 5.46 5.44 5.51

5.65 5.44 5.42 5.40 5.53 5.34 5.54 5.45 5.52 5.41

a) Compute the mean, median, first quartile, and third quartile.


b) Compute the range, interquartile range, variance, standard deviation, and
coefficientof variation.
c) Interpret the above computed measures of central tendency and dispersion
within the context of this problem. Why is it important for the company
manufacturing tea bags to consider both central tendency and variation?

13) The following data represents the total fat (in grams) for burgers and pizzas from a
sample of fast-foodsellers.
Burgers 35 19 31 34 43 39
Pizza 22 9 15 7 16 33 18 25

a) Compute mean, median, first quartile, and third quartile for burgers and pizzas
separately.
b) Compute inter-quartile range, standard deviation, and coefficient of variation for
burgers and pizzas separately.
c) Based on the measures obtained in a) and b) what comments can you make about the
variation in fat of burger and pizza.
CHAPTER 2 Page 63
Chapter 2 - Multiple choice questions with Answers

1. One of the distinctions between a population parameter and a sample statistic is,
a) A population parameter is only based on conceptual measurements, but a sample
statistic is based on a combination of real and conceptual measurement.
b) A sample statistic changes each time you try to measure it, but a population
parameter remains fixed.
c) A population parameter changes each time you try to measure it, but a sample
statistic remains fixed.
d) The true value of a sample statistic can never be known but the true value of a
population parameter can be known.

Answer: (b)

2. Fill in the missing words to the quote:


“Statistical methods may be described as methods for drawing conclusions about
…………………. based on ………………………. computed from the …………………...”
a) Statistics, samples, populations
b) Populations, parameters, samples
c) Statistics, parameters, samples
d) Populations, statistics, samples

Answer: (d)

3. Which of the following is NOT correct about constructing histograms?


a) All class intervals should be of equal width.
b) The bars of the histogram are cantered over the class mark (midpoint).
c) The first and last classes should be open-ended to account for extreme points.

Answer: (b)

4. Which of the following would be the MOST suitable for displaying the proportions of a city's
budget spent on different items?
a) Bar chart
b) Line graph
c) Pie chart
d) Histogra

Answer: (c)

5. Which of the following divides the distribution of data into four subgroups?
a) Quartiles
b) Percentiles
c) Standard deviation
d) Median

Answer: (a)
CHAPTER 2 Page 64
6. Which of the following is not a measure of central tendency?
a) Mean
b) Median
c) Standard deviation
d) Mode

Answer: (c)

7. The inter-quartile range is,


a) The range of the middle 50% of the data
b) Second quartile value
c) The difference between any two quartiles
d) The square of the standard deviation

Answer: (a)

8. Pulse rates of five randomly selected female undergraduates are: 70, 64, 80, 74, 92. What
is the median pulse rate?
a) 77

b) 76

c) 80

d) 74

Answer: (d)

9. Which of the following is not influenced by the occurrence of extreme values in a sample,
a) Mean

b) Median

c) Variance
d) Geometric mean
Answer: (b)

Chapter 2 – Summary

• Statistics is a discipline of study dealing with the collection, analysis, interpretation,


and presentation of data.

• The use of graphs, charts, and tables and the calculation of various statistical measures to
organise and summarise information is called descriptive statistics.

• The population is the complete collection of individuals, items, or data under


consideration in astatistical study.

• The portion of the population selected for analysis is called the sample.
CHAPTER 2 Page 65
• A variable is a characteristic of interest concerning the individual elements of a
population or a sample.

• A quantitative variable is determined when the description of the characteristic of


interest results in a numerical value.

• A qualitative variable is determined when the description of the characteristic of


interest results in a non-numeric value.

• Four levels of measurement are usually recognised in data: Nominal, Ordinal, Interval
and Ratio. Numbers of Nominal data are names or labels. The numbers have all the
features of nominal measures and also represent the rank order (1st, 2nd, 3rd etc.) are
ordinal measures.
The numbers have all the features of ordinal measurement and also are separated by the
same interval. In this case, differences between arbitrary pairs of numbers can be
meaningfully compared. The numbers have all the features of interval measurement and
also have meaningful ratios between arbitrary pairs of numbers and ratio measures.

• Primary data are those which are collected for the first time, and they are original in
character. Secondary data are those which are already collected by someone for some
purpose and are available for the present study.

• Raw data is obtained by observing values of a variable.

• A frequency distribution for qualitative data lists all categories and the number of
elements that belong to each of the categories.

• A bar graph is a graph composed of bars whose heights are the frequencies of the
different categories. A pie chart is also used to graphically display qualitative data. To
construct a pie chart, a circle is divided into portions that represent the relative
frequencies or percentages belonging to different categories.

• A histogram is a graph that displays the classes on the horizontal axis and the
frequencies of the classes on the vertical axis.

• A cumulative frequency distribution gives the total number of values that fall below
various class boundaries of a frequency distribution.

• An ogive is a graph in which a point is plotted above each class boundary at a height equal
to the cumulative frequency corresponding to that boundary.

• Summary measures are used to summarise the basic features of the data in a study.
CHAPTER 2 Page 66
• There is always a tendency of individual observations contained in any set of data to
cluster or centre around a specific value, which is the central value. This peculiar
characteristic of the data is referred to as central tendency.

• Central tendency measures:


∑𝑛
𝑖=1 𝑥𝑖
Mean: Average value of a dataset, 𝑥̅ = 𝑛
Median: middle value in a dataset when it is arranged in ascending or descending order.
Mode: Value that appears most frequently in a dataset.
1
Geometric mean: 𝑥̅𝐺 = (𝑥1 × 𝑥1 × ⋯ × 𝑥𝑛 )𝑛

• Quartiles for ungrouped data are defined as,


𝑛+1 𝑡ℎ
𝑄1 = ( ) data value
4
𝑛+1 𝑡ℎ
𝑄2 = (2 × ) data value
4
𝑛 + 1 𝑡ℎ
𝑄3 = (3 × ) data value
4

• Dispersion measures:
Range: Difference between the largest and smallest value in the data
Inter-quartile Range: 𝑄3 − 𝑄1

∑𝑛
𝑖=1(𝑥𝑖 −𝑥̅ )
2 ∑𝑛 2
𝑖=1 𝑥𝑖 −𝑛𝑥̅
2
Sample variance: 𝑆 2 = =
𝑛−1 𝑛−1
∑𝑛
𝑖=1(𝑥𝑖 −𝑥̅ )
2 ∑𝑛 2
𝑖=1 𝑥𝑖 −𝑛𝑥̅
2
Sample standard deviation: 𝑆 = √𝑆 2 = √ =√
𝑛−1 𝑛−1

• Central tendency measures.


CHAPTER 4 Page 67

Probability and Sampling


Theory

Learning Outcomes

After successful completion of this chapter, you should be able to:

• Describe the concept of probability


• Describe the relevant concepts/ laws of probability
• Apply laws and associated techniques of probability
• Explain the concepts of random variables and probability distributions
• Solve simple problems involving the normal distribution
• Describe the t-distribution
• Describe the role of samples in understanding populations
• Explain pitfalls in sampling
• Describe and compare different sampling methods

3.0 Introduction to Probability

Probability is a fundamental concept that enjoys widespread familiarity among individuals,


as it aligns with common expressions such as "chance," "likelihood," "possibility," and
"proportion." In fact, these terms, including probability itself, are regularly employed in
everyday conversations. It is worth noting that many statements encountered in various
business scenarios inherently convey the essence of probability.

• There is a 30% chance that this job will not be finished in time.
• There is likelihood that the business will make a profit next year.
• Nine times out of ten he arrives late for his appointments.
• There is no possibility of delivering the goods before Tuesday.

All the above are expressions indicating a degree of uncertainty. A very important branch of
mathematics called the theory of probability provides a numerical measure of uncertainty.
The probability describes certainty by1, impossibility by 0 and the various grades of
uncertainties by fractions or decimals in between 0 and 1.
For example, probability of drawing any number from 1 to 6 by throwing a die is 1. But the
CHAPTER 4 Page 68
probability that the number 7 appears from the die is 0. If the die is fair the probability of
getting number 4 (or any number from 1 to 6) is 1/6 which falls between 0 and 1.

3.1 Random Experiment

An experiment is defined as any repeatable process that yields an outcome, measurement, or


result. If the Outcomes of the experiment cannot be predicted with certainty, then the
experiment is considered a random experiment.

Illustration 3.1
1. Throwing a die
2. Tossing of a coin
3. Inspection of an item to determine whether it is defective or non-defective

3.1.1 Sample Space

An experiment can result in several possible outcomes. For example,

a) One toss of a coin results in the outcomes (H, T). If the coin is fair, then each outcome
is equallylikely.
b) Two tosses of a coin result in the outcomes (HH, HT, TH, TT). Again, if the coin is fair,
then eachoutcome is equally likely.
c) If a machine produces articles, some of which are defective, the outcomes are
(defective, or not defective). In this case the outcomes may not be equally likely.

Each possible outcome is called a sample point and the set of all possible outcomes is the
sample space, labelled as 𝑆. An event is a specific collection of sample points. For example,
when throwing an ordinary die, the sample space is S = {1,2,3,4,5,6}. Let 𝐸 be the event
"the number is odd", in this case,
E = {1,3,5}.

3.2 Approaches to Probability

In this section, we discuss three approaches to the definition and interpretation of


probability: Classical approach, relative frequency approach, and subjective approach.

3.2.1 Classical Approach


If a random experiment results in mutually exclusive and equally likely outcomes and more
favourable to the occurrence of event 𝐴, then the probability of occurrence of event 𝐴, usually
denoted by P(𝐴)is given by,

Number of favourable cases n(A) m


𝑃(𝐴) = = =
Total number of equally likely cases n(S) n
CHAPTER 4 Page 69
Example
Let A: Getting an even number when rolling a fair die.

Number of favourable cases n(A) 3


(𝐴) = = = = 0.5
Total number of equally likely cases n(S) 6

3.2.2 The Relative Frequency Approach

Suppose that an event 𝐴 occurs 𝑚 times in 𝑛 repetition of a random experiment. Then the
m m
ratio, n gives the relative frequency of the event 𝐴. The n will approach the probability of
occurrence of value event 𝐴 when 𝑛 becomes large.

In practice we can only try to have a close estimate of P(A) based on a large number of
observations. For practical convenience, the estimate of P(A) can be written as if it were
actually P(A) and the relative frequency definition of probability may be expressed as: 𝑃(𝐴)
m
= n The probability obtained by the above relative frequency definition is called a posterior
or empirical probability.

Illustration 3.2

If we flip the coin 100 times and observe heads 65 times, the relative frequency or
empirical probability of getting heads (A) is.

m 65
P(A) = = = 0.65
n 100

3.2.3 Subjective Approach

The Subjective probability is defined, as the probability assigned to an event by an individual


based on whatever evidence is available. Hence such probabilities are based on the
knowledge, experience, beliefs of the person making the probability statements.

Example
A lawyer decides that the chance of winning the case of his customer in courts would be 75%
accordingto the currently available evidence.
CHAPTER 4 Page 70
Illustration 3.3
Calculate the probability of observing at least one head in a toss of two coins.

Solution

Construct the sample space letting H represent a head, T a tail.

Event First Second 𝑷(𝑬)


Coin Coin
𝐸1 H H ¼
𝐸2 H T ¼
𝐸3 T H ¼
𝐸4 T T ¼

Let 𝐴: Observed at least one head.

According to the sample space, there are three outcome events with at least one head. They
are,

E1, E2 and E3 .

Hence,
1 1 1 3
P(A) = P(E1 ) + P(E2 ) + P(E3 ) = + + =
4 4 4 4

The number P(Ei )is called the probability of event 𝐸𝑖 of the sample space if

i. 0 ≤ P(Ei ) ≤ 1
ii. ∑s P(Ei ) = 1

Thus, we require that a probability be greater than or equal to 0 and less than or equal to 1
and that the sum of the probabilities over the entire sample space, S, be equal to 1.

3.3 Compound Events

Let 𝐴 and 𝐵 be two events in a sample space. The union of 𝐴 and 𝐵 is defined to be the event
containing all sample points in 𝐴 or 𝐵 or both. We denote the union of 𝐴 and 𝐵 by the symbol
𝐴 𝖴 𝐵.The intersection of 𝐴 and 𝐵 is the event composed of all sample points that are in both
A and B and is denoted by𝐴 ∩ 𝐵. The union and the intersection of 𝐴 and 𝐵 are shown
diagrammatically as shaded areas in the following figures.
CHAPTER 4 Page 71

𝐴𝖴𝐵 𝐴∩𝐵

3.4 The Additive Law of Probability

The probability of a union is equal to: P(A𝖴B) = P(A) + P(B) − P(A ∩ B)

If 𝐴 and 𝐵 are mutually exclusive, P(A ∩ B) = 0. That is, if 𝐴 and 𝐵 are mutually exclusive,
they do not share common elements.
Therefore, P(A𝖴B) = P(A) + P(B).

Illustration 3.4
In a group of 20 adults, 4 out of 7 women and 2 out of 13 men wear glasses. What is the
probability that a person chosen at random from a group is a woman or someone who
wear glasses?
Let 𝐴 be the event "the person chosen is a woman" and 𝐵 be the event "the person
chosen wears glasses".

7 6 4
Then, P(A) = 20 P(B) = 20 P(A ∩ B) = 20
𝐏(𝐀𝗨𝐁) = 𝐏(𝐀) + 𝐏(𝐁) – 𝐏(𝐀 ∩ 𝐁)

𝟕 𝟔 𝟒 𝟗
= + - =
𝟐𝟎 𝟐𝟎 𝟐𝟎 𝟐𝟎

3.5 The Complement of an Event

The complement of an event 𝐴 is the collection of all sample points of the sample space, 𝑆 and
not in 𝐴. The complement of 𝐴 is denoted by𝐴′, 𝐴𝑐, or 𝐴.

Since 𝐴 and 𝐴′ are mutually exclusive P(A ∩ A′ ) = 0 and exhaustive (that is, there are no
elementsoutside (𝐴 𝖴 𝐴′), and P (𝑆) = 1, P (𝐴) + P (𝐴′) = 1.

Therefore, P (A′) = 1 − P (A)

Illustration 3.5
A card is drawn from an ordinary pack of 52 playing cards. Find the probability that the
card is not a seven.
4
Let 𝐴 be the event, “the card is a seven”. Then, P(A) = 52
CHAPTER 4 Page 72

4
P(A) =
52
Therefore, the probability that the card is not a seven,

𝟒 𝟒𝟖
𝐏 (𝐀′ ) = 𝟏 − 𝐏 (𝐀) = 𝟏 – =
𝟓𝟐 𝟓𝟐

3.6 Conditional Probability

𝐴 and 𝐵 are two events. The probability of 𝐴, given that 𝐵 has already occurred is written P
(𝐴|𝐵) and is computed by,

P(A ∩ B)
P(A|B) =
P(B)

Similarly, the probability of 𝐵, given that 𝐴 has already occurred can be computed
by

P(A ∩ B)
P(B|A) =
P(A)

Illustration 3.6
Given that a heart is picked at random from a pack of 52 playing cards, find the probability
that it is a picture card.

Let 𝐴 be the event "a heart is drawn", 𝐵 be the event "a picture card is drawn".

13 3
Then, P(A) = 52 P(A ∩ B) = 52
Given K, Q and J are the picture cards.

Thus
P(A∩B) 3
P(B|A) = = 13
P(A)

3.7 Independent Events

Two events 𝐴 and 𝐵 are said to be independent, if and only if P(A|B) = P(A) or P(B|A) =
P(B). Otherwise, the events are said to be dependent.
CHAPTER 4 Page 73
3.8 The Multiplicative Law of Probability

Given two events 𝐴 and 𝐵, the probability of the intersection is P(A ∩ B) = P(A). P(B)

Illustration 3.7
A die is thrown twice. Find the probability of obtaining a 4 on the first throw and an odd
number on the second throw.

Let 𝐴 be the event "a 4 is obtained on the first throw" and 𝐵 be the event "an odd number
1 3 1
is obtained on the second throw”. Then, P(A) = 6 and P(B) = 6 = 2 . The occurrence of
the event A does not influence in any way the probability of the event 𝐵. Hence, the events
𝐴 and 𝐵 are independent.
1 1 1
P(A ∩ B) = P(A). P(B) = × = .
6 2 12

3.9 Tree Diagrams

A tree diagram can be used to find the elements of the sample space. For example, if we toss
a coin, then the possible outcomes of the experiment are head and tail. This information can
be shown as follows.

This is called a tree diagram. The sample space is 𝑆 = {𝐻, 𝑇}.

Illustration 3.8
Three coins are tossed. What is the probability of obtaining 2 heads? Find it using a
treediagram.

The tree diagram will be as shown below.


CHAPTER 4 Page 74

n(S) = 8

Let 𝐴 be the event that two heads are obtained.

n(A) 3
A = {HHT, HTH, THH} ∴ n(𝐴) = 3 P(A) = =
n(S) 8

Illustration 3.9
A bag contains 5 red marbles, 3 blue marbles and 2 green marbles. One marble is drawn
at random from the bag and replaced, and then a second marble is drawn from the bag.
Find theprobability of obtaining the following events using a tree diagram.

i. Both marbles are red.


ii. Two marbles are of different colours.
iii. First marble is blue.
iv. Second marble is green.
v. One marble is red.
vi. Second marble is red, given that the first marble is green.

The tree diagram will be as shown below.


CHAPTER 4 Page 75

n(S) = 9

Since the first marble is replaced before the second is drawn, the events are independent.

5 3 2
Hence P (R) = 10 P(B) = 10 P (G) = 10 for any drawing.
5 5 25 1
i. P (R, R) = 10 × = 100 = 4
10
ii. P (Marbles are of different from colours) = 1 – P (Marbles are of same colour)
= 1- [P (R, R) + P (B, B) + P (G, G)]
5 5 3 3 2 2 62 31
= 1- (10 × 10) – (10 × 10) - (10 × 10) = 100 = 50

iii. P (First Marble is blue) = P (B, R) + P (B, B) + (B, G)


3 5 3 3 3 2 30 3
= (10 × 10) + (10 × 10) + (10 × 10) = 100 = 10

iv. P (Second Marble is green) = P (R, G) + P (B, G) + P (G, G)


5 2 3 2 2 2 20 1
= (10 × 10) + (10 × 10) + (10 × 10) = 100 = 5

v. P (One marble is red) = P (R, R) + P (R, B) + P (R, G) + P (B, R) + P (G, R)


5 5 5 3 5 2 5 5 2 5 75 3
=(10 × 10) + (10 × 10) + (10 × 10) + (10 × 10) + (10 × 10) = 100 = 4

vi. P (Second marble is red first marble is green) = P(R|G)


5 1
Since events are independent, P(R|G) = P(R) = 10 = 2
CHAPTER 4 Page 76
3.10 Baye's Theorem

Suppose 𝐴1, 𝐴2, …, 𝐴𝑛 are mutually exclusive events of the sample space 𝑆 (that is 𝐴1 𝖴
𝐴2 𝖴 … 𝖴 𝐴𝑛 = 𝑆) and 𝐵 is an arbitrary event of 𝑆, then

P(B|Ai )P(Ai )
P(Ai |B) = n
∑i=1 P(B|Ai ). P(Ai )

This rule is called the Baye's rule for the probability. If n = 2, the Baye's reduces to the
expression,

P(B|Ai )P(Ai )
P(Ai |B) =
P(B|A1 ). P(A1 ) + P(B|A2 ). P(A2 )

where i = 1,2

Illustration 3.10
In recent years, much has been written about the possible link between cigarette smoking
and lung cancer. Suppose that in a large medical centre, of all the smokers who were
suspected of having lungcancer, 90% of them did, while only 5%of the non-smokers who
were also suspected of having lung cancer actually did. If the proportion (probability) of
smokers is 0.45, what is the probability that a lung cancer patient who is selected by
chance is a smoker?

Let 𝐴1 be the event "a patient is a smoker" 𝐴2 be the event "a patient is a nonsmoker" and
𝐵 be the event "a patient has lung cancer".
P(𝐴1) = 0.45 P(𝐴2) = 0.55 P(𝐵|𝐴1) = 0.9 P(𝐵|𝐴2) = 0.05

P(B|A1 )P(A1 ) (0.9)(0.45)


P(A1 |B) = = = 0.9364
P(B|A1 ). P(A1 ) + P(B|A2 ). P(A2 ) (0.9)(0.45) + (0.05)(0.55)

Illustration 3.11
Two companies (X and Y) produce RAM cards. An organization wishes to buy 150 RAM
cards and there is a 60% chance to place the order to company Y. RAM cards
manufactured by X and Y companies have 4% and 6% defects respectively. Given that a
defective RAM card has been found, what is the probability that it has come from Company
X?

Let 𝐴x be the event " selecting the RAM card from company X" 𝐴y be the event "selecting
the RAM card from company Y" and B be the event " RAM card is defective".
P(𝐴x) = 0.4 P(𝐴y) = 0.6 P(𝐵|𝐴x) = 0.04 P(𝐵|𝐴y) = 0.06

𝑃(𝐵|𝐴𝑥 )𝑃(𝐴𝑥 ) (0.04)(0.4)


𝑃(𝐴𝑥 |𝐵) = = = 0.3077
𝑃(𝐵|𝐴𝑥 )𝑃(𝐴𝑥 ) + 𝑃(𝐵|𝐴𝑦 ). 𝑃(𝐴𝑦 ) (0.04)(0.4) + (0.06)(0.6)
CHAPTER 4 Page 77
3.11 Random Variables

The numerical outcomes of experiments vary from one experiment to another and therefore
represent observations on a variable which we will denote by the symbol 𝑥. A variable X is a
random variable if the values (𝑥) that X assumes, corresponding to the various outcomes of
an experiment, are chance or random events. Random variables are classified as one of two
types: discrete or continuous. A discrete random variable is one that can assume a
countable number of values. Typical examples of discrete random variables are,

a) The number of defective bolts in a sample of ten drawn from industrial production.
b) The number of people in a waiting line for banking services.

A continuous random variable is one that can assume the infinitely large number of values
corresponding to the points on a line interval. Typical examples of continuous random
variables are,
a) The amount of gasoline produced per day in a refinery.
b) The waiting time before service at a supermarket counter.

3.12 Probability Distributions for Discrete Random Variables

The probability distribution for a discrete random variable is a formula, table or graph that
provides the probability associated with each value of the random variable. It is interesting
to note that the events cannot overlap because one and only one value of X is assigned to each
sample point; therefore, the values of X represent mutually exclusive numerical events. We
may therefore state two requirements for a probability distribution,
1. 0 ≤ p(x) ≤ 1
2. ∑all x p(x) = 1 for all x

where p(x) = P(X = x) is the probability that the random variable X takes the value x.

Illustration 3.12
Consider an experiment that consists of tossing two coins and let X be the number of
heads observed. Find the tabulated probability distribution for X.

Event Coin 1 Coin 2 𝑷(𝑬) x


𝐸1 H H ¼ 2
𝐸2 H T ¼ 1
𝐸3 T H ¼ 1
𝐸4 T T ¼ 0

The sample point 𝐸1 is associated with the sample event "observe a head on coin 1 and a
head on coin 2", then the value of X is 2, similarly, for the event 𝐸2, the value of X is 1, etc.
The sample space of X is, X = {0,1,2}
CHAPTER 4 Page 78

The values of X with respective probabilities are given in the following table.

X p(x) = P(X = x)
0 1⁄
4
1 2⁄
4
2 1⁄
4

Observe that the probability distribution of X in the above table can be represented
graphically as follows.

If you were to draw a sample from this population that is, if you were to throw two balanced
coins, say 100 times, and each time record the number of heads observed, then construct a
histogram using the 100 measurements. You would find the histogram of your sample would
appear very similar to that for p(x).

Illustration 3.13

The probability distribution of a random variable X is given by the formula p(x) =


cx 2 , for 𝑥 = 0, 1, 2, 3, 4. Given that 𝑐 is a constant, find the value of 𝑐 and the probability
function for X .

Since, p(x) = cx 2 , 𝑥 = 0, 1, 2, 3, 4, we have,

p(0) = P(X = 0) = 0
p(1) = P(X = 1) = c
p(2) = P(X = 2) = 4c
p(3) = P(X = 3) = 9c
p(4) = P(X = 4) = 16c.
We know that ∑all x p(x) = 1

Thus, 0 + c + 4c + 9c + 16c = 1
1
30c = 1, c = 30
1
Therefore, the probability distribution function is p(x) = x 2 for x = 0,1,2,3,4.
30
CHAPTER 4 Page 79

The tabulated probability distribution of X can be presented as follows.

X p(x) = P(X = x)
0 0
1 1⁄
30
2 4⁄
30
3 9⁄
30
4 16⁄
30

The probability diagram of X can be presented as follows.

3.13 Probability Distributions for Continuous Random Variables

We need to take a different approach to find the probability distribution for a continuous
random variable. The relative frequency associated with a particular class in the population
is the fraction of measurements in the population falling in that interval; it is also the
probability of drawing a measurement in that class. If the total area under the relative
frequency histogram were adjusted to be equal to 1, then the area under the frequency curve
would correspond to probabilities. Assume that the random variable, X, may take values on
a real line. The density of probability (but not the value of the probability), which varies with
the values of X may be represented by a mathematical expression fX (x) or f(x) , called the
probability density function (pdf) for X. The probability density function f(x) is defined so that
the total area under the curve is equal to 1 and therefore, the area lying above a given interval
of values will be equal to the probability that f(x) will fall in that interval. Therefore, the
probability that a < X < b is equal to the shaded area under the density function, f(x)
between the two points 𝑎 and 𝑏.
CHAPTER 4 Page 80

3.13.1 The Normal Distribution


The normal distribution is the most important and most widely used continuous
probability distribution. It is the cornerstone of the application of statistical inference in
analysis of data. This distribution has also been used to study random phenomena ranging
from the applied sciences and engineering to business and economics. The normal
probability distribution is the bell-shaped curve as shown in the following figure. The
distribution is symmetric about the mean.

Since the area lying over any particular point, say X = a, is 0, it follows that P(X = a) = 0.
This means that P(X ≤ a) = P(X < a) and P(X ≥ a) = P(X > a). This is, of course, not true
for a discrete random variable.

Because of the symmetry, we can simplify our areas by means of the transformation,
X−μ
Z= σ
CHAPTER 4 Page 81
Now, if the random variable X has a normal distribution with mean 𝜇 and standard deviation
𝜎, then the random variable Z has a standard normal distribution with mean 0 and standard
deviation 1. The area under the normal curve between the mean, Z = 0 and a specified value
of (Z > 0) , say 𝑧0, is the probability, P(0 ≤ Z ≤ z0 ). This area is recorded in Appendix A and
is shown as the shaded area in thefollowing figure.

Illustration 3.14

Find P(0 ≤ Z ≤ 1.63)

This probability corresponds to the area between the mean (Z = 0) and a point. Z = 1.63
standard deviations to the right of the mean. This area is shaded in the following figure.

Since the standard normal table in the Appendix A gives area under the normal curve to
the right of the mean, we need only find the tabulated value corresponding to z = 1.63.
Go down the left-hand column of the table to the row corresponding to z = 1.6. and across
the top of the table to the column marked 03. The intersection for this row and column
combination gives this area to be equal to 0.4484.Therefore P(0 ≤ Z ≤ 1.63) = 0.4484.
CHAPTER 4 Page 82
Illustration 3.15

Find P(−0.5 ≤ Z ≤ 1.0)

This probability corresponds to the area between z = −0.5 and z = 1.0 as shown in the
following figure.

The area required is equal to the sum of 𝐴1 and 𝐴2 as shown in the above figure. From
Appendix A, we read 𝐴2= 0.3413. The area 𝐴1 would equal the corresponding area
between Z = 0 and Z = 0.5. That is, 𝐴1= 0.1915. Thus,

P(−0.5 ≤ Z ≤ 1.0) = A1 + A2 = 0.1915 + 0.3413 = 0.5328

Illustration 3.16

Studies show that gasoline usage for compact cars sold is normally distributed with a
mean usage of 30.5 miles per gallon (mpg) and a standard deviation of 4.5 mpg. What
percentage of compacts obtain 35 or more miles per gallon?

First, we must find the 𝑧 value corresponding to x = 35 mpg . Substituting into the formula
for Z, we obtain,
X−μ 35 −30.5
Z= = = 1. The required area is given by the shaded area in the following figure.
σ 4.5

The area to the right of the mean corresponding to 𝑧 = 1.0 is 0.3413. The entire area to the
right of the mean is 0.5. Thus,
P(X > 35) = P(Z > 1.0) = 0.5 − 0.3413 = 0.1587.
Thus, the percentage exceeding 35 mpg is 15.87%.
CHAPTER 4 Page 83

Illustration 3.17
In the previous example, if the manufacturer wants to develop a compact car which
outperforms 95% of the current compacts in fuel economy, what must be the gasoline
usagefor the new car?

Let X be a normally distributed random variable with mean equal to 30.5 and standard
deviation equalto 4.5. We want to find the value x0 such that P(X < x0 ) = 0.95.
x0 − μ x0 − 30.5
P(X < x0 ) = P (Z < ) = P (Z < ) = 0.95
σ 4.5
x0 −30.5
Let = z0 such that P(Z < z0 ) = 0.95
4.5
Therefore, z0 = 1.645 which covers an area of 0.45 to the right side of the mean in Table
A.
x0 −30.5
Then = 1.645, x0 = 37.9.
4.5

3.14 The Student’s t-Distribution

The 𝒕-distribution is very similar in appearance to the standardized normal distribution.


Both distributions are symmetrical and bell-shaped, with the mean and the median equal to
zero. However, the 𝒕-distribution has more area in the tails and less in the centre than does
the standard normaldistribution. This is due to the fact that sample standard deviation 𝑺 is
used to estimate the unknown population standard deviation, σ, the values of 𝒕 are more
variable than those for 𝒁.

The shape of the t-distribution is characterized by a parameter namely, degrees of freedom.


Larger the degrees of freedom, closer the 𝑡-distribution into the standard normal
distribution. The 𝑡-distribution is mainly used in statistical inference methods of estimating
and testing population mean when population standard deviation is unknown.

3.15 Mathematical Expectation

The probability distribution provides a model for the theoretical frequency distribution of a
random variable and hence must possess a mean, variance, standard deviation, and other
descriptive measuresassociated with the theoretical population that it represents.
CHAPTER 4 Page 84
We shall confine our attention to the problem of calculating the mean value of a random
variable defined over a theoretical population. This mean (𝜇) is called the expected
value, E(X), of the random variable X. Let X be a discrete random variable with probability
distribution p(x) and let E(X) represent the expected value of X. Then,

E(X) = ∑ x . p(x) = ∑ x ∙ P(X = x)


all x all x

Illustration 3.18
Consider the random variable X representing the number of heads on the toss of two coins.
Find the expected value of X .

𝒙 p(x) x. p(x)
0 1⁄ 0
4
1 2⁄ 2⁄
4 4
2 1⁄ 2⁄
4 4

2 2
μ = E(X) = ∑ x ∙ p(x) = 0 + + =1
4 4
all x

Hence, if you repeat the experiment an infinite number of times, the average number of
heads on tossing the coin twice, would be one.

Illustration 3.19
Eight thousand tickets are to be sold at Rs. 20.00 each in a lottery conducted to benefit the
local school. The prize is a Rs. 52,000.00 valued mobile phone. If you purchase two tickets,
what is your expected gain?

Here, either you will lose Rs. 40.00 or will win Rs. 51,960.00 (= 52,000 - 40) with
probabilities 7998/8000 and 2/8000 respectively. The expected gain will be,

7998 2
E(x) = (−40) 8000 + (51960) 8000 = -Rs.27

3.16 Variance

Let X be a discrete random variable with probability distribution p(x); then the variance, 𝜎2
of X is,
σ2 = E(X − μ)2 = ∑(x − μ)2 . p(x) = ∑ x 2 . p(x)
all x all x
CHAPTER 4 Page 85

Illustration 3.20

Let X be a random variable with probability distribution given by the following table. Find
𝜇, 𝜎2, and 𝜎 of X.

𝑥 -1 0 1 2 3 4 5
p(x) 0.05 0.1 0.4 0.2 0.1 0.1 0.05
x. p(x) -0.05 0 0.4 0.4 0.3 0.4 0.25
x 2 . p(x) 0.05 0 0.4 0.8 0.9 1.6 1.25

μ = E(X) = ∑ x ∙ p(x) = −0.05 + 0 + 0.4 + 0.4 + 0.3 + 0.4 + 0.25 = 1.7


all x

∑ x 2 ∙ p(x) = −0.05 + 0 + 0.4 + 0.8 + 0.9 + 1.6 + 1.25 = 5


allx

σ2 = ∑ x 2 ∙ p(x) − μ2 = 52 − 1.72 = 22.11


allx

σ = √σ2 = √22.11 = 4.7

3.17 Sampling Theory

3.17.1 Census & Sampling Investigations

The need for adequate and reliable data is ever increasing for taking policy decisions in
different fieldsof human activity. There are two ways in which the required information may
be obtained: Census and Sampling.

a) Census
Under complete enumeration survey method, data are collected for each and every unit
(person, household, field, shop, factory etc.) of the population or universe, which is the
complete set of items(sampling frame), which are of interest in any particular situation. The
advantage of this type of survey will be that no unit is left out and hence greater accuracy may
be ensured. However, the effort, money and time required for carrying out complete
enumeration will generally be extremely large and in many cases cost may be so prohibitive
that the very idea of collecting information may be dropped. Hence, in modern times very
little use is made of complete enumeration survey.

Example
The government of Sri Lanka conducts a census once in ten years where every household is
surveyed.
CHAPTER 4 Page 86
b) Sampling

Very often our attitudes, our knowledge and our actions are based on samples. It applies
equally to everyday life and to scientific research. A person's opinion of a bank, or a shop, or
an institution is generally based on one or two encounters, which he had with it in the course
of several years of working with the former. A visitor's opinion about a country after
spending a few days in it will be determined by his experiences of a few places he has seen
and a few persons he has met. Perhaps our visitor is less likely to be aware of the extent of
his ignorance. Generally, a vaguely formulated understanding of sampling is part of what is
called common sense and is characteristic of everyday approach.

Examples
i. A trader examining a handful of grains from the bag.

ii. A tea taster trying different brands of tea.

iii. A technician making a blood test.

All of them are employing the method of sampling. Their confidence in their judgment’s rests
on thefact that the material they are sampling is so well mixed or homogeneous that the few
grains of wheat, a drop of blood, a few leaves of tea or a few grains of rice do adequately
represent the whole.

3.18 Reasons for Sampling


The following reasons generally cause for using a sample rather than the entire population.

• The destructive nature of certain tests

To test the quality of light bulbs, manufacturers test the life time of bulbs by lightening them
until a bulb is burned out. To ensure that the production meets the minimum standard, a
relatively small sample of bulbs is selected.

• The physical impossibility of checking all items in the population


The populations of fish, birds, snakes, mosquitoes, and the like are large and are constantly
moving, being born, and dying.

• The cost of studying all the items in a population is often prohibitive


• Greater Speed

The data can be collected and summarized more quickly with a sample than with a complete
count. This is as a vital consideration when the information is urgently needed.

• Greater Scope
In certain types of inquiry highly trained personnel or specialized equipment limited in
availability, must be used to obtain the data. A complete census is impracticable.
CHAPTER 4 Page 87
The choice lies between obtaining the information by sampling or not at all. Thus, surveys
that rely on sampling have more scope and flexibility regarding the types of information
that can be obtained.

• Greater Accuracy

In sampling, high-quality personnel can be employed and intensive training can be given to
them. Careful supervision of the fieldwork can also be done. Accurate processing of results
is also possible when the volume of work is reduced. Thus, a sample may produce more
accurate results than a census.

3.19 Sampling and Non-Sampling Errors

The error arising due to drawing inferences about the population on the basis of few
observations (sampling) is termed sampling error. Clearly the sampling error in this sense
is non-existent in a complete enumeration survey since the whole population is surveyed.
However, the errors mainly arising at the stages of ascertainment and processing of data,
which are termed non-sampling errors, are common both in complete enumeration and
sample surveys.

3.19.1 Sampling Errors


Even if utmost care has been taken in selecting a sample, the results derived from a sample
study may not be exactly equal to the true value in the population. The reason is that an
estimate is based on a part and not on the whole and samples are seldom, if ever, perfect
miniatures of the population. Hence sampling gives rise to certain errors known as sampling
errors (or sampling fluctuations). These errors would not be present in a complete
enumeration survey. However, these errors can be controlled. The modern sampling theory
helps in designing the survey in such a manner that the sampling errors can be made small.

3.19.2 Method of Reducing Sampling Errors


Once the absence of bias has been ensured, attention should be given to the random sampling
errors.Such errors must be reduced to the minimum so as to attain the desired accuracy. Apart
from reducing errors of bias, the simplest way of increasing the accuracy of results from a
sample is to increase its size. The sampling error usually decreases with the increase in
sample size (number of units selected in the samples) and in fact in many situations the
decrease is inversely proportional to the square root of the sample.

3.19.3 Non-Sampling Errors


When a complete enumeration of units in the universe is made, one would expect that it
would give rise to data free from errors. However, in practice it is not so. For example, it is
difficult to completely avoid errors of observation or ascertainment. So also, in the
processing of data tabulation errors may be committed affecting the final results. Errors
arising in this manner are termed non-sampling errors, as they are due to factors other than
the inductive process of inferring about the population from a sample.
CHAPTER 4 Page 88
Thus, the data obtained in an investigation by complete enumeration, although free from
sampling error, would still be subject to non-sampling errors, whereas the results of a
sample survey would be subject to sampling errors as well as non-sampling errors. Non-
sampling errors can occur atevery stage of planning and execution of the census or survey.
Such errors can arise due to a numberof causes such as defective methods of data collection
and tabulation, faulty definition, incomplete coverage of the population or sample, etc. More
specifically, non-sampling errors may arise from one or more of the following factors.

• Inaccurate or inappropriate statistical units, methods of interview, observation, or


measurement with inadequate or ambiguous schedules, definitions, or instructions.
• Errors in data processing operations such as coding, punching, verification, tabulation,
etc.
• Errors committed during presentation and printing of tabulated results.

These sources are not exhaustive, but are given to, indicate some of the possible sources of
error. In a sample survey, non-sampling errors may also arise due to a defective frame and
faulty selection of sampling units.

3.20 Methods of Sampling

There are many methods of sampling. The choice of method will be determined by the
purpose of sampling. The various methods can be categorized under two groups.

• Probability Sampling (Random Sampling)


• Non-Probability Sampling (Non-random Sampling)

3.20.1 Probability Sampling Methods


A sample selected in such a way that each item or person in the population being studied has
a known probability of being included in the sample. The sampling error also can be
measured here. In this chapter, four probability-sampling methods are presented.

a) Simple Random Sampling


A sample is selected here so that each item or person in the population has the same chance
of beingselected, and also, selecting n units out of the 𝑁 such that every
𝑛
one of the 𝐶𝑁 distinct
samples has an equal chance of being drawn. In practice, a simple random sample is drawn
unit by unit. The units in the population are numbered from 1 to 𝑁. A series of random
numbers between 1 and 𝑁 is then selected, either by means of a table of random numbers or
by means of a computer program that produces such numbers or by means of lottery method.

b) Stratified Random Sampling


When the population is heterogeneous with respect to the variable or characteristic under
study, the method of simple random sampling may not be appropriate. In such situations,
first the population is divided into a number of subgroups called strata.
CHAPTER 4 Page 89
These strata are internally homogeneous. Then a simple random sample is drawn from each
stratum. Some of the commonly used stratifying factors are age, sex, educational or income
groups, geographical area, economic status and so on. The following factors can be
considered to determine the number of sampling units from each stratum.

• Relative Importance
• Relative Variability
• Unit sampling cost
• Relative size

When first three factors are equal among each stratum, the method of proportional allocation
is used to select a sample.

c) Systematic Sampling
When a systematic sample is selected, it is also known as quasi-random sampling. When a
complete list of the population is available, this method is used. We arrange the items in
numerical, alphabetical, and geographical or any other order. If we want to select a sample
of 10 students from 120 students, under this method kth item is picked up from the sample
frame and 𝑘 is the sample interval.

𝑘 = 𝑁/𝑛 𝑘 = Sampling interval 𝑁 = Population size 𝑛 = sample size

In the above example, 𝑘 = 120/10 = 12. 12 is the sampling interval. First student will be
selected at randomly from the first 12 names and every 12th student will be taken as sample,
i.e., if first student will be number 2, then (2 + 12)th , (14 + 12)th, (26 + 12)th, and so on.

d) Cluster Sampling /Multistage Sampling


Cluster sampling is another method of sampling with widespread applications in survey
designs. This method involves, first the separation of the population into groups called
clusters and then the selection of a sample of these clusters can occur. After that, the
observations or measurements are made on each and every unit in the selected clusters.
Even though cluster sampling needs relatively low operational costs, it requires a larger
sample size than that of simple random sampling to provide estimates with a specified level
of precision.

Since it seems uneconomical to measure all the sampling units in the selected clusters,
instead of enumerating all the units, one can obtain a sample of units by resorting to sub
sampling within any chosen cluster. This technique is called two-stage sampling, clusters
being termed as primary units and the units within clusters as secondary units. Multi-stage
sampling is the generalization of the above technique.
CHAPTER 4 Page 90
Example
It is decided to take a sample of 5,000 households from the Western province.

First stage:
The province may be divided into a number of districts and a few districts selected at
random.

Second stage:
Each district may be sub-divided into Grama Niladhari (GN) divisions and a sample of GN
divisions may be taken at random.

Third stage:
A number of households may be selected from each of the GN divisions selected at the second
stage. In this way, at each stage the sample size becomes smaller and smaller.

3.20.2 Non-Probability Sampling


There are situations where it is not possible, or it is very inconvenient to design
methodologies that maintain equal chances for every member in the population of being in
the selected sample. Sample methods designed to handle such situations are referred to as
non-probability sampling methodologies.

a) Judgment Sampling

In judgment sampling the choice of sample items depends exclusively on the discretion of
the investigator. In other words, the investigator exercises his judgment in the choice and
includes those items in the sample, which he thinks are most typical of the population with
regard to the characteristics under investigation.

b) Convenience Sampling
The method of convenience sampling is also called the chunk. A chunk refers to that fraction
of the population being investigated which is selected neither by probability nor judgment
but by convenience. A sample obtained from readily available lists such as automobile
registration, telephone directories, etc. is a convenience sample and not a random sample
even if the sample is drawn at random from the lists.

c) Quota Sampling
Quota sampling is a non-probability sampling technique wherein the assembled sample has
the same proportions of individuals as the entire population with respect to known
characteristics, traits or focused phenomenon. The researcher must make sure that the
composition of the final sample to be used in the study meets the research's quota criteria.
The main reason why researchers choose quota samples is that it allows the researchers to
sample a subgroup that is of great interest to the study.
CHAPTER 4 Page 91
Chapter 3 - Reinforcement Problems

1) Consider the following experiment involving two urns. Urn A contains 2 white balls and 1
blackball. Urn B contains 1 white ball. A ball is drawn from urn A and placed in urn B.
Then a ball is drawn from urn B. What is the probability that the ball drawn from urn B
will be white?

2) A coin is tossed, and a die is thrown. What is the probability of obtaining a head on the
coin and an even number on the die?

3) The two events 𝐴 and 𝐵 are such that P(𝐴) = 0.6, P(𝐵) = 0.2, P(𝐴|𝐵) = 0.1. Calculate the
probabilities that,
a) Both of the events occur.
b) At least one of the events occurs.
c) 𝐵 occurs, given that 𝐴 has occurred.

4) Two coins are tossed. Using a tree diagram find the probability of obtaining:
a) 2 tails
b) Exactly one head
c) At least one head

5) Two coins are tossed. One coin is unbiased, and the other is biased so that a head is twice
as likely as a tail. Find the probability of obtaining,
a) Two heads
b) A head and a tail
c) Two tails, using a tree diagram.

6) A box contains 5 white, 4 blue, and 5 red marbles. One marble is drawn at random and
returned to the box; a second marble is then drawn. Using a tree diagram find the
probabilityof drawing a blue and a red marble in that order.

7) A has a probability of ¾ of winning a set against B. A match is won by the player who first
wins two sets. Use a tree diagram to find the probability that A wins the match.

8) As items come to the end of a production line, an inspector chooses which items are to go
through a complete inspection. 10% of all items produced are defective. 60% of all
defective items go through a complete inspection, and 20% of all good items go through
a complete inspection. Given that an item is completely inspected, what is the probability
it is defective?

9) The random variable 𝑋 is normally distributed with mean 300 and standard deviation 5,
find,
a) 𝑃 (𝑋 > 294) b) 𝑃 (𝑋 < 302) c) 𝑃 (𝑋 < 312)
CHAPTER 4 Page 92

10) The random variable 𝑋 is normally distributed with mean 45 and standard
deviation 4, find thevalue of 𝑥0 if,
a) (𝑋 < 𝑥0) = 0.0317 b) (𝑋 < 𝑥0) = 0.8950

11) Packages from a packing machine have a mass which is normally distributed
with mean 200 g and standard deviation 2 g. Find the probability that a package from the
machine weighs,
a) Less than 197 g b) More than 200.5 g c) Between 119.5 g and 198.5 g

12) The monthly demand for product A is normally distributed with mean 200 units
and standard deviation 40 units. The demand for another product, B, is also normally
distributed with mean 500 units and standard deviation 80 units. If a seller of these
products stocks 280 units of A and 650 units of B at the beginning of a month, what is the
probability that the seller will experience a stock out for both products during the month?

13) If the sampling frame available from the registrar’s files is a listing of the names
of all 4000 registered full-time students compiled from eight separate alphabetical lists,
based on the gender and academic year breakdowns, what type of sample should you
take? Discuss.

14) Suppose that each of the 5000 registered full-time students lived in one of the
10 university hostels. Each hostel accommodates 400 students. It is university policy to
fully integrate students by gender and academic year in each hostel. If the registrar is able
to compile a listingof all students by hostel, explain how you could take a cluster sample.

Chapter 3 - Multiple Choice Questions with Answers

1) What is the probability of getting an odd number when rolling a die?


a) 1 / 6 b) 4 / 11 c) 1 / 2 d) 1 / 4

Answer: (c)

2) The probability that the Kelani River will flood in any given year has been estimated from
200 years of historical data to be one in four. This means:
(a) The Kelani River will flood every four years.

(b) In the next 100 years, the Kelani River will flood exactly 25 times.

(c) In the last 100 years, the Kelani River flooded exactly 25 times.

(d) In the next 100 years, the Kelani River will flood about 25 times.

Answer: (d)
CHAPTER 4 Page 93

3) The joint probability of events 𝐴 and 𝐵 is 32 percent with the probability of event 𝐴 being
60 percent and the probability of event 𝐵 being 50 percent. Based on this information,
the conditional probability of event A given event B has occurred is closest to:
a) 0.30 b) 0.64 c) 0.53 d) 0.46

Answer: (b)

4) A surprise quiz contains three multiple choice questions; question 1 has 3 suggested
answers,question 2 has four, and question 3 has two. A completely unprepared student
decides to choose the answers at random. If 𝑋 is the number of questions the student
answers correctly,find the probability of at least one correct answer.
a) 0.75 b) 0.87 c) 0.65 d) 0.43

Answer: (a)

5) A physical fitness association is including the mile run in its secondary- school fitness test.
The time for this event for boys in secondary school is known to possess a normal
distribution with a mean of 450 seconds and a standard deviation of 50 seconds. The
fitness association wants to recognize the fastest 10% of the boys with certificates of
recognition. What time would the boys need to beat in order to earn a certificate of
recognition from the fitness association? (Correct to 2 decimal places)
a) 532.25 sec b) 367.75 sec c) 385.92 sec d) 514 sec

Answer: (c)

6) Which of the following random variables would be considered as continuous?


a) the number of brothers that a randomly chosen person has.
b) the time it takes for a randomly chosen woman to run 100 yards.
c) the number of cars owned by a randomly chosen adult male.
d) number of money orders received by a post office in a randomly chosen week.
Answer: (b)

7) Consider the random variable X representing the number of heads on the toss of two
coins.Find the expected value of X .

𝒙 p(x)
0 0.05
1 0.25
CHAPTER 4 Page 94
2 0.50
3 0.20

The expected value of X is


a) 1.58 b) 5.18 c) 4.05 d) 1.85

Answer: (d)

8) The variance of X is
a) 0.6725 b) 0.7921 c) 0.6275 d) 0.7291

Answer: (c)

Chapter 3 – Summary

Probability is a measure of uncertainty.

An experiment is any repeatable process from which an outcome, measurement or result is


obtained. When the outcomes cannot be predicted with certainty, then the experiment is a
random experiment.

The set of all possible outcomes of a random experiment is called the sample space. The
elements of sample space are called sample points.
An Event is any collection of outcomes of an experiment: that is, an event is any subset of the
samplespace.

The intersection of two events 𝐴 and 𝐵 results in another event containing all such elements
that are common to 𝐴 and 𝐵.

The union of two events 𝐴 and 𝐵 results in another event containing all elements, which
belong to either 𝐴 or 𝐵.

Compliment of an event 𝐴 denoted by A′ , the event containing all the elements of 𝑆 that
are notelements of 𝐴.

Events are said to be mutually exclusive or disjoint if two or more of them cannot occur
simultaneously in a single trial of an experiment.

Let 𝐴 and 𝐵 are two events. The probability of 𝑨, given that 𝑩 has already occurred is written
P(A|B)
P(A ∩ B)
and P(A|B) = P(B)

The Additive Law of Probability for two events: 𝐴 and 𝐵 is as P(A 𝖴 B) = P + P(B) −
CHAPTER 4 Page 95
P(A ∩ B).

The Multiplicative Law of Probability for two events A and B, the probability of the
intersection,is P(A ∩ B) = P. = P. P(A).
If events 𝐴 and 𝐵 are independent, P(A ∩ B) = P(A). P(B)
A variable X is a random variable if the values that X assumes, corresponding to the various
outcomes of an experiment, are chance or random events.

A discrete random variable is one that can assume a countable number of values.

A continuous random variable is one that can assume the infinitely large number of
values corresponding to the points on a line interval.

The probability distribution for a discrete random variable is a formula, table or graph that
providesthe probability associated with each value of the random variable.

The density of probability, which varies with 𝑥 may be represented by a mathematical


expression.
f(x), called the probability density function for X.

Census is a complete enumeration survey method, data are collected for each and every unit
(person,household, field, shop, factory etc.) of the population.

In most of the cases, sampling is done rather than studying the entire population.

The error arising due to drawing inferences about the population on the basis of few
observations (sampling) is termed sampling error.

The error mainly arising at the stages of ascertainment and processing of data, which are
termed non- sampling errors, are common both in complete enumeration and sample
surveys.

The various sampling methods can be categorized under two groups: probability sampling
and non-probability sampling.

In probability sampling, a sample is selected in such a way that each item or person in the
population being studied has a known probability of being included in the sample. Simple
Random Sampling, Stratified Random Sampling, Systematic Sampling and Cluster Sampling
are the probability sampling methods discussed in this chapter.

Under Non-probability Sampling, Judgment Sampling and Convenience Sampling have


been discussed.
CHAPTER 4 Page 96

Relationships and forecasting


techniques

Learning Outcomes

After successful completion of this chapter, you should be able to:

• Understanding the difference between independent variable and dependent


variable
• Drawing a scatter diagram and understand the underlying pattern
• Understanding the linear and non-linear pattern
• Calculating Person’s correlation coefficient and interpret it.
• Calculating Spearman’s correlation coefficient and interpret it
• Determining the simple linear regression and use it for predictions
• Understanding the interpolation and extrapolation
• Forecasting thorough time series analysis

4.0 Introduction

Managerial decisions often rely on the analysis of the relationship between multiple
variables. For instance, the marketing manager may examine the relationship between
advertising expenditure and sales in order to forecast sales for a specific advertising budget.
Similarly, a public utility may utilise the relationship between the daily temperature and
electricity demand to predict the future electricity usage on anticipated temperature for the
upcoming month. Sometimes, managers will rely on intuition to judge how two variables are
related.

4.1 Independent Variables and Dependent variables

4.1.1 Independent variable

This is the variable that remains unaffected by changes in other variables. In an experimental
study, the independent variable is the variable that researchers manipulate, control, or vary
to investigate its effects, is referred as “independent” because it is not influenced by any
other variables within the study. Independent variables are also known as explanatory
CHAPTER 4 Page 97
variables, predictor variables, input variables, regressors or exogenous variables.

4.1.2 Dependent variable

A dependent variable is the variable that changes in response to the manipulation of the
independent variable. It represents the outcome of interest that you wish to measure, and
its changes are influenced by the independent variable.
Dependent variables are also known as response variable, outcome variable, target variable
and endogenous variable. To illustrate, let’s consider the variables of advertising
expenditure and total revenue for a specific company. In this case, the sales revenue is
influenced by the amount spent on advertising. In other words, higher advertising
expenditure leads to higher sales revenue. Hence, advertising expenditure is the
independent variable, while sales revenue is the dependent variable.

Following table shows some further examples:

Scenario Independent Dependent


Profit and total cost of production Total cost Profit
Cost of capital and net present value (NPV) of a Cost of capital NPV
project
Age of a vehicle and its annual maintenance Age Maintenance cost
cost
Age of an asset and its net book value Age Net book value
Years of experience and annual salary Experience Annual salary
Speed of a vehicle and travel time to the Speed Travel time
destination
Table 4.1: Independent variables and Dependent Variables

4.1.3 Scatter plot (Scatter diagram)


Scatter plot is a graphical representation of the relationship between two quantitative
variables. The horizontal axis (x-axis) represents one variable (independent variable), while
the vertical axis (y-axis) represents the other variable (dependent variable). Each data point
on the scatter plot corresponds to a specific value pair of the variables.

In a scatter plot, if the points are closely clustered around a straight line, it suggests a strong
linear relationship between two variables.
CHAPTER 4 Page 98

Figure 4.1: Positive scatter plot

Illustration 4.1 – Scatter diagram

The data for the output at a factory each week for the last ten weeks and the
corresponding production cost of the output are given below.

Week Output (units) Cost (Rs.)


1 100 4,200
2 120 4,400
3 100 3,800
4 80 3,400
5 90 3,800
6 110 4,300
7 70 3,000
8 120 4,700
9 90 3,700
10 140 5,000

The scatter plot of output and production cost


CHAPTER 4 Page 99

• The production cost depends on the volume of output. Therefore, the volume of output
is the independent variable plotted on the x-axis, while the production cost is the
dependent variable plotted on the y-axis.
• Upon observing the graph, it becomes apparent that the scattered data points roughly
align along an upward-treading line. This suggests that as the output volume increases,
the production cost also tends to rise.

Note: A scatter diagram can reveal a range of different types and degrees of correlation.

4.2 Correlation

Correlation is a bivariate analysis that quantifies the strength of linear relationship between
two variables and the direction of the relationship. It assesses how changes in one variable
are associated with changes in another variable. In this chapter, we discuss two types of
correlation.

1. Pearson’s correlation coefficient


2. Spearman’s rank correlation coefficient.

4.2.1 Linear and non-linear relationships

The distinction between linear and non-linear relationships depends on the nature of the
relationship between two variables. Linear relationship refers to a situation where the
relationship between two variables can be adequately represented by a straight line. In this
case, as one variable increases, the other variable changes at a constant rate, resulting in a
constant ratio of change. Linear relationship is captured by the Pearson’s correlation
coefficient.
CHAPTER 4 Page 100
Non-linear relationship, also known as curvilinear relationship, describes a relationship
between two variables that cannot be represented by a straight line. In such cases, the rate
of changes of the variables is not constant, leading to varying ratios of change. The
relationship may follow a curved pattern, such as quadratic, exponential, logarithmic, or
other non-linear functions.

4.3 Correlation Coefficient

4.3.1 Pearson’s Correlation Coefficient


Pearson’s correlation coefficient describes the strength of the linear relationship between
two sets of intervals scaled or ratio scaled variables.
The sample coefficient is denoted by ‘r’ and the population coefficient is denoted by the ‘ρ’.

The line of best fit can be determined through regression analysis, a topic that will be
discussed in detail later in this chapter.

The following formula is used to calculate the Pearson’s Correlation Coefficient (r).
CHAPTER 4 Page 101

𝑛∑𝑥𝑦 − ∑𝑥∑𝑦
𝑟=
√(𝑛∑𝑥 2 − (∑𝑥)2 )x (𝑛∑𝑦 2 − (∑𝑦)2 )
Where,
r = Pearson’s correlation coefficient
n = Number of the paired observations
∑ y = Sum of the y values
∑ x =Sum of the x variable
∑ x2 =Sum of the x squared values
∑ y2 =Sum of the y squared values
∑ xy = Sum of the product of x and y

The correlation coefficient, 𝑟, varies between +1 and -1. A value of +1 indicates perfect
positive linear relationship between two variables, while a value of -1 indicates perfect
negative linear relationship. As the value 𝑟 approaches 0, the strength of the linear
relationship between the variables weakens.

The sign of 𝑟 indicates the direction of the relationship. A positive sign (+) suggests a
positive relationship, meaning that as one variable increases, another variable tend to
increase as well. Conversely, a negative sign (-), indicates a negative relationship, where an
increase in one variable is associated with a decrease in the other variable.

Figure 4.2: Types of Correlation and Degree of Correlation


CHAPTER 4 Page 102

Figure 4.3: Degree of correlation

Types of correlation
pPerfect positive Strong positive correlation Moderate positive
correlation correlation

Perfect negative Strong negative Moderate negative


correlation correlation correlation

No Correaltion
CHAPTER 4 Page 103

Illustration 4.2 – Calculating Pearson’s correlation coefficient

Evaluate Pearson’s correlation coefficient for the data on sales and advertising spend
in the table below.

Advertising expenditure in Total sales in the following


month (Rs. Mil) month (Rs. Mil)
1.3 151.6
0.9 100.1
1.8 199.3
2.1 221.2
1.5 170.0

Solution:
With calculations involving summations, we facilitate the calculations by setting out
in columns.
x y x2 y2 xy
1.3 151.6 1.69 22,982.56 197.08
0.9 100.1 0.81 10,020.01 90.09
1.8 199.3 3.24 39,720.49 358.74
2.1 221.2 4.41 48,929.44 464.52
1.5 170.0 2.25 28,900.00 255.00
7.6 842.2 12.40 150,552.50 1,365.43

𝑛∑𝑥𝑦− ∑𝑥∑𝑦
r=
√(𝑛∑𝑥2−(∑𝑥)2)x(𝑛∑𝑦2−(∑𝑦)2)

(5𝑥1,365..43)−(7.6𝑥842.2)
r=
√[(5𝑥12.4)−7.62)x(5𝑥150,552.5−842.22)]

r = 0.993

Interpretation:
The Pearson’s correlation coefficient of 0.996 indicates a very strong positive linear
relationship between advertising expenditure and sales.
CHAPTER 4 Page 104

Illustration 4.3 – Calculating and interpret Pearson’s correlation coefficient


Calculate the Pearson’s coefficient of correlation for the following data and interpret it.

x 39 43 21 64 57 47 28 75 34 52
y 65 78 52 82 92 89 73 98 56 75

x y x2 y2 xy
39 65 1,521 4,225 2,535
43 78 1,849 6,084 3,354
21 52 441 2,704 1,092
64 82 4,096 6,724 5,248
57 92 3,249 8,464 5,244
47 89 2,209 7,921 4,185
28 73 784 5,329 2,044
75 98 5,625 9,604 7,350
34 56 1,156 3,136 1,904
52 75 2,704 5,625 3,900

∑ 𝑥 = 460 ∑ 𝑦 = 760 ∑ 𝑥 2 = 23,634 ∑ 𝑥 2 = 59,816 ∑𝑥𝑦 =36,854

𝑛∑𝑥𝑦− ∑𝑥∑𝑦
r=
√[𝑛 ∑ 𝑥 2 −(∑ 𝑥)2 ][𝑛 ∑ 𝑦 2 −(∑ 𝑦)2 ]

r = 0.84
The correlation value 0.84 indicates that the linear relationship between 𝑥 and 𝑦 is strong
and positive.

Test yourself 4.1

A company wants to identify the relationship between the number sold of a new car
model and the amount of money spent on advertising. The information on the cost
incurred on advertising (x) (in Rs. million) and the number of cars sold (y) for 15
years are summarised below:
∑x = 177 ∑y = 679 ∑x2 = 2,576 ∑ y2 = 39,771 ∑xy = 9,915 n = 15
Based on the above data, The correlation coefficient between (x) and (y) is:
1) 0.91
2) -0.91
3) 0.19
4) -0.19
CHAPTER 4 Page 105

Test yourself 4.2

The following summary statistics has been obtained in relation to x and y:

∑x = 70 ∑y = 30.6 ∑xy = 310.5 ∑x2 = 952 ∑ y2 = 134.13 n = 7

The correlation coefficient between x and y would be:


1) 0.6479
2) –0.6479
3) –0.4697
4) 0.4697

Test yourself 4.3

State whether each of the following statements is True or False.

• The value of correlation coefficient between a pair of variables is always a positive


figure
• If the correlation coefficient between two variables is -1, there is a weak negative
relationship
• Positive correlation is a relationship between two variables in which both variables
move in the same direction

Test yourself – 4.4

The following table shows the advertising expenses and sales value for last 6 months
of a company:

Advertising Expenses
(Rs’000) (x) 44 29 74 12 9 50
Sales Value (Rs.’000)
(y) 550 480 630 230 240 610

You are required to:


Calculate and interpret Pearson’s correlation coefficient.

4.4 Spearman’s Rank Correlation Coefficient

Spearman’s rank correlation coefficient (Spearman’s rho) is the most common alternative to
Pearson’s correlation coefficient. It is used when the variables are measured on an ordinal
CHAPTER 4 Page 106
or ranked scale. It is a rank correlation coefficient because it uses the rankings of data from
each variable (e.g., from lowest to highest) rather than the raw data itself.

Spearman’s rank correlation coefficient is particularly useful when dealing with variables do
not have a linear relationship or when the data is not normally distributed. It is robust to
outliers. While the Pearson correlation coefficient measures the linearity of relationships,
the Spearman correlation coefficient measures the monotonic relationship between the
variables.

Interpreting the values of Spearman’s rank correlation is similar to interpreting the


Pearson’s correlation coefficient. A coefficient closer to -1 or +1 suggest a strong monotonic
relationship, while a coefficient close to 0 suggests a weak or no monotonic relationship. The
sign of the coefficient indicates the direction of the monotonic relationship, with a positive
sign indicating increasing ranks in both variables and a negative sign indicating decreasing
ranks in both variables.
Rank correlation is denoted by R.

6∑d2
R=1−
𝑛(n2 −1)

Where,
n = number of pairs of data
d = the difference between the ranks in each set of data

Illustration 4.5 – Rank correlation

The placements of seven students in two spot tests are as follows:

Student Test 01 placing Test 02 placing


A 2 1
B 1 3
C 4 7
D 6 5
E 5 6
F 3 2
G 7 4

Required
Determine whether the placement of the student in Test 01 correlates with their
placement in Test 02.

Solution
Correlation must be measured by Spearman’s rank coefficient because we are given
the placement of students rather than their actual marks.
CHAPTER 4 Page 107
6∑d2
R = 1 − 𝑛(n2 −1)

Where, d is the difference between the rank in Test 01 and the rank in Test 02 for
each student.

Student Rank - Test 01 Rank - Test 02 d d2

A 2 1 1 1
B 1 3 -2 4
C 4 7 -3 9
D 6 5 1 1
E 5 6 -1 1
F 3 2 1 1
G 7 4 3 9
26
6∑d2
R = 1 − 𝑛(n2 −1)

6𝑋26
R = 1 − 7(49−1)

156
R = 1 − 336 = 0.536

The correlation is positive, 0.536, but the correlation is not strong.

4.4.1 Handling tied ranks.


If some items in a question tie for a particular ranking, it is necessary to assign them an
average place before calculating the Spearman’s rank correlation coefficient. Here is an
example.

Student Position in examination New rank Reason


(Rank)
A 1 1.5 Average of 1 and 2
B 1 1.5 Average of 1 and 2
C 3 3 -
D 4 4 -
E 5 6 Average of 5,6 and 7
F 5 6 Average of 5,6 and 7
G 5 6 Average of 5,6 and 7
H 8 8 -

If in a question some of the items, tie for a particular ranking these must be given an average
place before the coefficient of rank correlation is calculated.
CHAPTER 4 Page 108

Here is an example.

Illustration 4.6 – Rank correlation, tied ranks

In a reality music show, five artists were placed in order of merit by two different
judges as follows.

Artist Judge Adam Rank Judge Smith Rank


A 1 4=
B 2= 1
C 4 3
D 5 2
E 2= 4=

Required
Assess how the two sets of rankings are correlated.

Solution
Artist Rank – Adam Rank – Smith d d2

A 1.0 4.5 -3.5 12.25


B 2.5 1.0 1.5 2.25
C 4.0 3.0 1.0 1.0
D 5.0 2.0 3.0 9.0
E 2.5 4.5 -2 4.0
28.5
6∑d2
R = 1 − 𝑛(n2−1)

6𝑥28.5
R = 1 − 5(25−1) = -0.425

There is a slight negative correlation between the rankings.

Limitations of Correlation Coefficient

• Correlation does not imply causation. Even if two variables have strong correlation, it does
not necessarily mean that one variable directly causes the other to change.

• Correlation coefficients, such as the Pearson correlation coefficient, measure linear


relationships between variables. If the relationship is nonlinear, the correlation
coefficient may not accurately capture the strength or nature of the association.
CHAPTER 4 Page 109
• Outliers can have a significant impact on correlation coefficients. A single extreme value
can distort the correlation and lead to misleading interpretations.
• Spurious correlation refers to coincidental relationships between variables that have no
meaningful connection. It is important to exercise caution and investigate potential
confounding variables or alternative explanations before drawing conclusions based
solely on correlation.

Test yourself – 4.5

Two wine tasters ranked eight bottles of wine as follows.

Wine A B C D E F G H
Taster X 3 7 1 8 5 2 4 6
Taster Y 3 8 2 7 4 1 5 6

Find Spearman’s rank correlation coefficient for this data, giving your answer to
three decimal places.

Test yourself – 4.6

Determine the ranks of the following data, with the smallest being ranked 1.

Values 3.21 3.49 3.99 4.05 3.49 4.49 4.99 4.05 3.49

4.5 Linear regression analysis

Regression is a statistical method used in various fields, including finance, and investing, to
analyse and understand the relationship between a dependent variable (Y) and a set of
independent variables (X’s). The primary goal of regression analysis is to determine the
strength, direction, and the nature of the relationship between the variables involved. For
an example, a manufacturer aims to predict the sales revenue of refrigerators (Y) by
analysing the key factors such as advertising expenditure (X1), average customer rating
(X2), price of the product (X3), and competitors’ advertising expenditure (X4). When a
single independent variable, X, is used to predict the dependent variable, it is referred to as
simple linear regression. This section discusses simple linear regression.

Here are a few examples where simple linear regression is used to establish a linear
relationship between an independent variable and a dependent variable.

1. Predicting the sales for a building materials store based on the size of the store.
2. Predicting the monthly rent of an apartment based on its size.
3. Predicting the monthly sales of a product in a supermarket based on the amount of shelf
space devoted to the product.
CHAPTER 4 Page 110

Simple Linear Regression Model

The equation of a straight line is y = β0 + β1x where β0 is the y intercept and β1, the slope of
the line. The linear model y = β0 + β1x is said to be a deterministic mathematical model
because when a value of x is substituted into the equation, the value of y is determined, and
no allowance is made for error. The simple linear regression model is given by,

y = β0 + β1x + ε

Where ε is assumed to be a random error variable with expected value equal to zero and
variance equal to σ2. Furthermore, we assume that the distribution of errors about the line
will be identically the same, regardless of the value of x, and that any error terms are
independent of one another.

In general, we will assume, that we are given 𝑛 pairs of observations regarding two variables,
independent and dependent, (𝑥𝑖 , 𝑦𝑖 ), 𝑖 = 1,2, … , 𝑛. Data values 𝑥𝑖 are called observations on
the independent variable and the data values 𝑦𝑖 are called observations on the dependent
variable.
The simple linear regression model for the data values (𝑥𝑖 , 𝑦𝑖 ), 𝑖 = 1,2, … , 𝑛, is,

𝑦𝑖 = 𝛽0 + 𝛽1 𝑥𝑖 + 𝜀𝑖 , 𝑖 = 1,2, … , 𝑛.

For notational convenience, suppressing 𝑖, we write the above model as, y= β0 + β1x + ε.

The Least-Squares Method.

The statistical procedure for finding the "best fitting" straight line for a set of points would
seem a formalisation of the procedure used to fit a line by the eye. If we denote the predicted
value of y obtained from the fitted line as 𝑦̂, the prediction equation will be

𝑦̂ = 𝑎 + 𝑏𝑥

where 𝑎 and 𝑏 represent estimators of the parameters β0 and β1.

The most common approach to finding 𝑎 and 𝑏 is using the least-squares method. This
method minimises the sum of the squared differences between the actual values, y, and the
predicted values 𝑦̂. This sum of squared differences is equal to ∑(𝑦 − 𝑦̂)2 = ∑(𝑦 − 𝑎 − 𝑏𝑥)2.
The least squares method determines the values of a and b that minimises the
Σ(𝑦 − 𝑎 − 𝑏𝑥)2 around the prediction line. The estimates a and b can be obtained using the
following mathematical formulas.

𝑛 ∑ 𝑥𝑦 − ∑ 𝑥 ∑ 𝑦 ∑ 𝑥𝑦 − 𝑛𝑥̅ 𝑦̅
𝑏= = and
𝑛 ∑ 𝑥 2 − (∑ 𝑥)2 ∑ 𝑥 2 − 𝑛𝑥̅ 2

𝑎 = 𝑦̅ − 𝑏𝑥̅
CHAPTER 4 Page 111

where 𝑥̅ and 𝑦̅ are the averages of 𝑥 and 𝑦, respectively.

Some helpful hints 1

• The value of 𝑏 must be calculated first as it is needed to calculate 𝑎


• Remember that X is the independent variable and Y is the dependent variable
• Set your working out in a table to find the figures to put into the formulae

Measures of Variation
When using the least-squares method to determine the regression coefficients for a set of
data, you need to compute three measures of variation. The first measure, the total sum of
squares (SST), is a measure of variation of the values, y around their mean 𝑦̅. The total
variation, or total sum of squares, (SST) is subdivided into explained variation and
unexplained variation.
The explained variation, or regression sum of squares (SSR), represents variation that is
explained by the relationship between x and y, and the unexplained variation, or error sum of
squares (SSE), represents variation due to factorsother than the relationship between x and
y.
The total sum of squares is equal to the regression sum of squares plus the error sum of
squares as follows:
𝑆𝑆𝑇 = 𝑆𝑆𝑅 + 𝑆𝑆𝐸,

Where,

𝑆𝑆𝑇 = ∑(𝑦 − 𝑦̅)2 = ∑ 𝑦 2 − 𝑛𝑦̅ 2

𝑆𝑆𝑅 = ∑(𝑦̂ − 𝑦̅)2 = 𝑏 2 (∑ 𝑥 2 − 𝑛𝑥̅ 2 )

𝑆𝑆𝐸 = 𝑆𝑆𝑇 − 𝑆𝑆𝑅

The coefficient of Determination (𝑹𝟐 )

The coefficient of determination, denoted as 𝑅 2 , is used to assess the goodness-of-fit of a


regression model. That is, it measures how well the regression line fits the observed data
points. It is computed by taking the ratio of SSR to SST. This ratio measures the proportion
of variation in 𝑦 that is explained by the independent variable 𝑥 in the regression model.
𝑆𝑆𝑅 𝑆𝑆𝐸
𝑅2 = = 1− .
𝑆𝑆𝑇 𝑆𝑆𝑇

𝑅 2 varies in between 0 and 1. The higher the value of 𝑅 2 the higher the explanatory power
of the regression model.
CHAPTER 4 Page 112
For example, an 𝑅 2 value of 0.8 means that 80% of the variation in the dependent variable is
explained by the independent variable in the model, while the remaining 20% is attributed
to other factors or random error.

Illustration 4.7 – Regression

A company has the following data on its sales during the last year in each of its
regions and the corresponding number of salespersons employed during this time.

Region Sales (Units) Salespersons


A 236 11
B 234 12
C 298 18
D 250 15
E 246 13
F 202 10

Obtain a simple linear regression model to forecast sales for each region.

The dependent variable, y, is the sales and the independent variable, x, is the number
of salespersons.

The next step is to evaluate the parameters a and b.


Number of Sales (y) 𝑥2 xy
salespersons (x)
11 236 121 2,596
12 234 144 2,808
18 298 324 5,364
15 250 225 3,750
13 246 169 3,198
10 202 100 2,020
∑ 𝒙 = 𝟕𝟗 ∑ 𝒚 = 𝟏, 𝟒𝟔𝟔 ∑ 𝒙2 = 𝟏, 𝟎𝟖𝟑 ∑ 𝒙 𝒚 = 𝟏𝟗, 𝟕𝟑𝟔

Thus,
(6×19,736)−(79×1,466) 2,602
𝑏= (6×1,083)−792
= = 10.12
257

79
𝑥̅ = = 13.17
6
1,466
𝑦̅ = = 244.33
6
Hence,
a = 244.33 – (10.12 x 13.17) = 111.05

thus, the least squares regression line is,


CHAPTER 4 Page 113
y = 111.05 + 10.12x

Interpretation of a and b

The b value of 10.12 tells us that each extra salesperson generates an extra 10.12
sales (on average), while the a-value of 111.05 means that 111.05 units will be sold
if no sales people are used.

Based on the fitted regression model, forecast the average number of sales that
would be expected next year in a region that employed
(a) 14 salespersons
(b) 25 salespersons

Solution
(a) The regression line is y = 111.05 + 10.12x. the y when 𝑥 = 14 is,
𝑦 = 111.05 + 10.12 × 14 = 252.73
Rounding this into whole number, it is expected that, on average, 253 units will
be sold in a region employing 14 salespersons.
(b) Similarly, when 𝑥 = 25, we have,
𝑦 = 111.05 + 10.12 × 25 = 364.05
When there are 25 salespersons, it is expected that, on average, there will be
sales of 364 units in a region.

Test yourself – 4.7


A company has the following data on its profits and advertising expenditure over the
last 6 years.
Profits (Rs. mil) Advertising expenditure (Rs.mil)
11.3 0.52

12.1 0.61

14.1 0.63

14.6 0.70

15.1 0.70

15.2 0.75

Fit a simple linear regression model for the above data. Based on the fitted model,
forecast the profit for next year if an advertising budget of Rs. 800,000 is allocated.
CHAPTER 4 Page 114
Test yourself – 4.8

The following data describes fuel consumption and flying hours for single-engine,
piston-driven general aviation aircraft from 2000 through 2006. Fuel consumption is
in millions of gallons, flying time is in millions of hours.

2000 2001 2002 2003 2004 2005 2006


Fuel
consumption 276.5 251.2 235.1 248.8 149.4 242.0 239.8
Flying time 27.9 25.2 23.8 23.4 23.4 22.2 22.0

(a) Determine the regression line for predicting fuel consumption on the basis of flying
time.
(b) If there were 24.0 million flying hours during a given year what would be the
prediction for the amount of fuel consumed.

Test yourself – 4.9

∑𝑥 = 79, ∑𝑦 = 1,466, ∑𝑥 2 = 1,083, ∑𝑦 2 = 363,076, ∑𝑥𝑦 = 19,736 and n = 6, then the


value of b, the gradient, to two decimal places, is:
(a) 10.12
(b) 111.03
(c) 13.62
(d) -8.53

Test yourself – 4.10


In a forecasting model based on y = a + bx, the intercept is Rs. 262. If the value of y is
Rs. 503 and x is 23, then the value of the gradient, rounded to two decimal places is:
(a) -20.96
(b) -10.48
(c) 10.48
(d) 20.96

4.6 Time Series

A time series refers to a compilation of systematically gathered data points, representing


well-defined entities, which are observed repeatedly over a duration of time. To illustrate,
let’s consider the monthly sales figures of a specific company. Such sales data forms a time
series because the sales can be clearly defined and consistently measured at regular
intervals. Typically, these observations are taken at fixed and evenly spaced time intervals,
such as days, weeks, months, quarters, or years.
It is important to note that data collected sporadically or only once does not qualify as time
series. This concept is a specialized instance of the correlation analysis, where one variable
consistently represents time, while the other variable is analysed over a specific time span.
CHAPTER 4 Page 115
Some situations in which time series occur naturally.

Stock Market Analysis: Time series analysis is commonly used to analyse stock market data,
such as daily closing prices or trading volumes over time. It helps investors identify trends,
patterns, and make informed decisions.

Climate and Weather Patterns: Time series analysis is crucial in studying climate and
weather patterns. Meteorologists analyse historical data to understand long-term trends,
seasonal variations, and predict future weather conditions.

Economic Forecasting: Time series data plays a vital role in economic forecasting.
Economists analyse past data on indicators such as GDP, inflation rates, unemployment rates,
and interest rates to predict future economic trends and make policy recommendations.

Epidemiology and Disease Outbreaks: Time series analysis is employed to study the
spread of diseases, track outbreaks, and monitor the effectiveness of interventions. By
analysing historical data, epidemiologists can make predictions and implement appropriate
measures to control and prevent the spread of diseases.

Energy Demand and Consumption: Time series analysis is used to study energy demand
and consumption patterns. Utilities and energy companies analyse historical data to forecast
future demand, optimize energy production and distribution, and plan for infrastructure
development.

Website Traffic and User Behaviour: Online businesses use time series analysis to
understand website traffic patterns, user behaviour, and engagement metrics over time. This
helps them optimize website performance, marketing campaigns, and user experience.

Sensor Data Analysis: Time series data from sensors is used in various fields like
manufacturing, transportation, and environmental monitoring. It helps detect anomalies,
predict equipment failure, optimize maintenance schedules, and improve operational
efficiency.

Financial Forecasting: Time series analysis is utilized in financial forecasting to predict


future market trends, interest rates, exchange rates, and asset prices. This aids financial
institutions, investors, and traders in making informed investment decisions.

Demand Forecasting: Time series analysis is used in demand forecasting for industries like
retail, supply chain management, and inventory planning. By analysing historical sales data,
businesses can predict future demand, optimize inventory levels, and improve operational
efficiency.

Social Media Analytics: Time series analysis is applied to social media data to analyse
trends, sentiment analysis, and predict user behaviour. This helps businesses understand
customer preferences, tailor marketing strategies, and improve brand reputation.
CHAPTER 4 Page 116
Examples:
• The supermarket's daily revenue recorded over a consecutive four-week timeframe
• The annual unemployment rate observed over the course of the past ten years
• The quarterly quantity of rice production, measured in tons
• The monthly production volume of a soft drink manufacturer, documented for a span of
48 months
• The count of taxi journeys performed per day during a three-week period
• The annual count of houses constructed in the Colombo city between the years 2015 and
2023
• The weekly sales figures of a company
• The yearly number of tourists arriving in Sri Lanka during the last five years
• The firm's annual revenue generated continuously for a period of ten years

4.6.1 Components and models of time series


In time series analysis, variations can be categorized into four fundamental components:
Trend (T)
Seasonal (S)
Cyclical (C)
Residual or Irregular (R)

The trend component represents the long-term direction of the time series, reflecting its
overall growth or decline. It captures persistent changes that extend beyond short-term
fluctuations.
The seasonal component pertains to systematic and short-term movements in the time series
that recur within specific calendar periods. These patterns can be linked to regular seasonal
factors, such as holidays or weather conditions, which influence the data.

The cyclical component encompasses medium-term fluctuations that occur over a span of
several years, typically recurring in a cyclical pattern. These fluctuations are not tied to
specific calendar periods and may be driven by economic cycles, business cycles, or other
underlying factors affecting the data.

The irregular component accounts for unsystematic and random fluctuations in the time
series that cannot be attributed to any identifiable trend, seasonality, or cyclical pattern.
These irregularities can arise from unpredictable events, measurement errors, or other
sources of noise in the data.

The four components of variation are assumed to combine to produce the variable in one of
two ways: thus, we have two mathematical models of the variable. In the first case there is
the additive model, in which the components are assumed to add together to give the
variable, Y:

Y=T+S+C+R

The second, multiplicative model, considers the components as multiplying to give Y:

Y=TxSxCxR
CHAPTER 4 Page 117
4.6.2 Trend

This is the general overall movement persisting over a long period of time. There are all sorts
of trends; some series increase slowly and some increase fast, others decrease at varying
rates, and some remain relatively constant for long periods of time. The various types of
trends are divided under two headings.
• Linear trend
• Non-linear trend

a) Linear trend
A linear trend is one in which the variable is basically changing at a steady rate. If the trend
in a time series is linear, it could be estimated using the least squares method.

4.6.3 Forecasting linear trends


In order to obtain a trend line for a time series, we can use the regression approach discussed
in Section 4.5. The procedure is briefly given blow:
• Take the time points as values of the independent variable x. It could be coded as 1, 2, 3, …
• Take the outcome variable values as values of the dependent variable Y
• Estimate the least square regression line of Y on X as y = a + bx
• Translate the regression line as T = a + bx where 𝑥 is the time point and T is the
corresponding trend value for the period

Illustration 4.8
The following table gives the quarterly sales figures of a small company over the last
three years.

Required:
Calculate the forecast for the next four values of the trend in the series.

Time period Sales (Rs. millions)

2012 Q1 (t = 1) 42
Q2 (t = 2) 41
Q3 (t = 3) 52
Q4 (t = 4) 39

2013 Q1 (t = 5) 45
Q2 (t = 6) 48
Q3 (t = 7) 61
Q4 (t = 8) 46

2014 Q1 (t = 9) 52
Q2 (t = 10) 51
Q3 (t = 11) 68
Q4 (t = 12) 48
CHAPTER 4 Page 118
The given points are plotted on a graph which is given below. If we look at the diagram,
we can see that although the figures are fluctuating quarter by quarter two things can
immediately be observed. Firstly, there is a general upward trend in the sales as a whole.

It is not an exceptional rise. Secondly, although the figures fluctuate, there is a visible
pattern. The sales amount is always the highest in the third quarter of the year, and
always the lowest in the fourth quarter. This is known as seasonal variation which will
be dealt with later.

Figure
80

70

60

50

40

30

20

10

0
Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4

Let X and Y be the time period and the Sales (in Rs. million) respectively. Then the
following totals could be obtained manually or using a scientific calculator as mentioned
in chapter 6.

Σ 𝑥 = 78 Σ 𝑦 = 593 Σ 𝑥 2 = 650 Σ 𝑥𝑦 = 4,046

The regression equation could be obtained as y = 40.712 + 1.339x using a scientific


calculator.

This equation would then be translated as where T = 40.712 + 1.339x where T is the
trend for a given time period x.

Forecast for the quarters of 2015


Trend forecast (40.712+ 1.339 x) in Rs million

2015 Q1 40.712 + 1.339 × 13 = 58.119


Q2 40.712 + 1.339 × 14 = 59.458
Q3 40.712 + 1.339 × 15 = 60.797
Q4 40.712 + 1.339 × 16 = 62.136
CHAPTER 4 Page 119
4.6.4 Non-linear trend

In numerous instances, it is not feasible to rely on a straight-line pattern to gauge the trend.
This is particularly evident when a time series exhibits a slower (or faster) rate of growth in
its early stages and a faster (or slower) rate of growth in more recent times. In such cases, a
straight-line trend is unsuitable. The "method of moving averages" is the most frequently
employed approach for quantifying non-linear trends.

a) Method of moving averages

When using the method of moving averages to determine a trend, we gather data for a
specific number of years (quarters, months, weeks, or days). We then calculate the average
of these values, which represents the trend value for the time unit in the middle of the period
covered by the calculation. These calculated averages are referred to as "moving averages"
because each average is determined by shifting from one overlapping set of values to the
next. The number of values in each set is consistent and referred to as the "period" of the
moving average.

If the time series is considered to be as a, b, c, d, e, f,… and so on then the 3-season moving
𝑎+𝑏+𝑐 𝑏+𝑐 +𝑑 𝑐+ 𝑑 +𝑒
average would be , , , … and so on. Similarly, the 5-season moving
3 3 3
𝑎+𝑏+𝑐+𝑑+𝑒 𝑏+𝑐 +𝑑+𝑒+𝑓 𝑐+ 𝑑 +𝑒+𝑓+𝑔
average would be , , , … and so on.
5 5 5

Illustration 4.9

The power consumption of a factory over a period of one year is shown below. Draw the
time series graph and calculate the trend in power consumption over the period of 12
months using 3-month moving average.

Month Jan Feb Mar Apr May June July Aug Sept Oct Nov Dec
power 332 316 327 338 346 288 314 340 336 350 370 363
consumed
(units)
CHAPTER 4 Page 120
Solution
Figure – 2

390

370

350

330

310

290

270

250
Jan Feb March April May June July August Sept Oct Nov Dec

Month Power consumption 3 moths moving Trend


(In units) average total
Jan 332 - -
Feb 316 975 325
Mar 327 981 327
Apr 338 1,011 337
May 346 972 324
June 288 948 316
July 314 942 314
Aug 340 990 330
Sept 336 1,026 342
Oct 350 1,056 352
Nov 370 1,083 361
Dec 363 - -

If we notice the trend in power consumption, we can see that it increases up to the month of
April and thereafter decreases up to July and then again increases.

The main disadvantage of moving averages method is that no trend values are obtained for
the beginning and end time points of a series. With 3-season moving average, the first and
the last moving totals would be missing. Similarly with 5-season moving average, the first
two and the last two moving totals would be missing, and so on.
CHAPTER 4 Page 121
Note
When dealing with time series data in a business context, quarterly or monthly data is often
used. In the case of quarterly data, the appropriate moving average is the four-quarter
moving average. However, there is an issue when using an even number of seasons for the
moving average.

To address this problem, a technique called "cantering" is used. Cantering involves taking the
average of each four-quarter moving total with the subsequent moving total and placing it
between the two moving totals. These averaged values are referred to as "cantered moving
totals." Moving averages are then calculated by dividing the cantered moving totals by four.

By employing cantering, we can overcome the challenge of relating trend figures to specific
quarters and ensure accurate analysis of the time series data.

Illustration 4.10
The table below shows the number of visitors (in hundreds) to a hotel during a period
of three years. Calculate the trend forecast for the four quarters in 2015 based on the
data provided using four-quarter moving averages method.

Year Quarters
Q1 Q2 Q3 Q4
2012 36 18 22 44
2013 40 20 24 46
2014 48 20 26 56

Solution:

Year Quarter No. of Moving total Cantered Tread


visitors moving total
2012 Q1 36 - - -

Q2 18 - -
120
Q3 22 122 30.5
124
Q4 44 125 31.25
126
2013 Q1 40 127 31.75
128
Q2 20 129 32.25
130
Q3 24 134 33.5
138
Q4 46 138 34.5
CHAPTER 4 Page 122
138
2014 Q1 48 139 34.75
140
Q2 20 146 36.5
152
Q3 26 - - -

Q4 58 - - -

If we notice the growth in trend between 2012 Q3 and 2014 Q2 is 6.00 over a period of
eight quarters and so the average growth would be 0.75 per quarter.

Hence, the trend forecast would be,


Actual number of visitors expected
2015 Q1 38.75 3,875
Q2 39.50 3,950
Q3 40.25 4,025
Q4 41.00 4,100

Test yourself - 4.11


(a) List the four (04) components of variation in time series.
(b) The quarterly share prices of a company over the past four years are given below:

Year Quarter Share Price (Rs.)


1 1 88
2019 2 2 69
3 3 75
4 4 105
2020 1 5 45
2 6 75
3 7 65
4 8 98
2021 1 9 56
2 10 91
3 11 66
4 12 99
2022 1 13 75
2 14 96
3 15 78
4 16 110

Compute the following:


(i) Moving total values
(ii) Cantered moving total values
(iii) Quarterly Cantered moving average
(iv) Trend values
CHAPTER 4 Page 123
4.6.5 Seasonal Variations

Seasonal variations refer to the recurrent, short-term cyclic fluctuations in business activity
that transpire consistently on an annual basis. Given their predicable recurrence over a span
of twelve months, these variations can be reasonably anticipated with a fair degree of
accuracy.

Examples:
• During festival seasons, there is typically a surge in garment sales as people tend to make
more purchases for special occasions and celebrations.

• Rainy seasons witness increased sales of raincoats and umbrellas as individuals seek
protection from the inclement weather.

• On rainy days, the demand for ice cream tends to be quite low as people are less inclined
to consume cold treats in such weather conditions.

• Supermarket sales tend to be higher towards the end of the week compared to the
beginning, possibly due to factors such as people stocking up for the weekend or
increased shopping activities before the start of the weekend.

The factors that cause seasonal variations are


• Climate and weather conditions
During winter there is greater demand for woollen clothes and hot drinks.

• Customs, traditions, and habits


On certain occasions like Sinhala & Tamil New Year, Ramzan or Christmas there is a big
demand for sweets & cakes and the bank withdrawals go up because people want money
for shopping.

The seasonal variations show on average, by how much a particular season will tend to
increase or decrease the value from the trend.

4.6.6 Forecasting seasonal components

If a time series (Y) values are given together with the trend (T) values, then the procedure
for calculating the seasonal variations would be as follows:

With additive model


For each time point, the value of Y- T has been calculated.
(The difference between the original value and the trend value)
The average of all the Y-T values has been calculated for each seasonal time point (eq:
monthly or quarterly). If the total of the averages differs from zero, adjust one or more of
them so that their total is zero.
CHAPTER 4 Page 124
Illustration 4.11
Consider the example above:
Calculate the seasonal variations using additive time series model for the example
above.

Solution:
With additive model, the series value would be given as Y = T + C + S + R. If we assume
that there are no cyclical and random variations, then C = 0 and R = 0
Thus, we obtain that Y = T + S. This implies that S = Y - T

Year Quarter No. of visitors Tread Y-T


2012 Q1 36 - -

Q2 18 - -

Q3 22 30.5 -8.5

Q4 44 31.25 12.75

2013 Q1 40 31.75 8.25

Q2 20 32.25 -12.25

Q3 24 33.5 -9.50

Q4 46 34.5 11.50

2014 Q1 48 34.75 13.25

Q2 20 36.5 -16.50

Q3 26 - -

Q4 58 - -

Year Quarters
Q1 Q2 Q3 Q4
2012 - - -8.50 12.75
2013 8.25 -12.25 -9.50 11.50
2014 13.25 -16.50 - -
Total adjustments 21.50 -28.75 -18.00 24.25 = -1
Adjustment +0.25 +0.25 +0.25 +0.25 = +1
Adjusted seasonal variation 21.75 -28.50 -17.75 24.50 = 0
CHAPTER 4 Page 125
Projections for the year- 2015

Year 2015 Predicted Seasonal Forecast No. of visitors


trend variation predicted
Q1 38.75 21.75 60.50 60,500
Q2 39.50 -28.50 11.00 11,000
Q3 40.25 -17.75 22.50 22,500
Q4 41.00 24.50 16.50 16,500

4.6.7 Computations involving the multiplicative model

The seasonal component (S) is given as the arithmetic mean of Y/T for each quarter, where
Y denotes the actual sales and T the trend obtained by using the regression equation. Adjust
your average seasonal variations so that they add to four.

Illustration 4.11

The calculation of average seasonal variations using multiplicative model is shown


below through an example. Assume that the actual sales (Y) and the trend values (T)
for a time series are as follows.

With Multiplicative model, the series (Y) value can be given as the product of the
components of the time series.

Y=TxSxCxR
If we assume that there are no cyclical and random variations, then c = 1 and R = 1
Y=TxS

S=Y/T
Year Quarter t T Sales, Y Y/T
2011 1 1 30.87 24.8 0.8034
2 2 33.19 36.3 1.0937
3 3 35.51 38.20 1.0729
4 4 37.84 47.50 1.2553

2012 1 5 40.16 31.25 0.7769


2 6 42.49 42.0 0.9885
3 7 44.81 43.4 0.9685
4 8 47.14 55.9 1.1859

2013 1 9 49.46 40.0 0.8087


2 10 57.79 48.80 0.9423
3 11 54.11 54.0 0.9980
4 12 56.44 69.1 1.2243
CHAPTER 4 Page 126
2014 1 13 58.76 54.7 0.9309
2 14 61.08 57.8 0.9463
3 15 63.41 60.3 0.9510
4 16 65.73 68.9 1.0482

Year Q1 Q2 Q3 Q4
2011 0.8034 1.0937 1.0729 1.2553
2012 0.7769 0.9885 0.9685 1.1859
2013 0.8087 0.9423 0.9980 1.2243
2014 0.9309 0.9463 0.9510 1.0482
Total 3.3199 3.9708 3.9904 4.7137

Average 0.8300 0.9927 0.9976 1.1784 3.9987


Adjustment 0.0013 0.0013
Adjusted 0.8313 0.9927 0.9976 1.184 4.000
value

Seasonal factors Seasonal variations


Q1 0.8313 {0.8313 - 1.0000} x 100 = -16.87%
Q2 09927 {0.9927 - 1.0000} × 100 = -0.73%
Q3 0.9976 {0.9976 - 1.0000) x 100 = -0.24%
Q4 1.1784 {1.1784 - 1.0000} x 100 = +17.84%
Total = Nil

Test yourself - 4.12


The Air ticket prices in euro (Є) from Europe to Sri Lanka from 2011 to 2015 are
given below:
Year Quarter Ticket Four Cantered Y/T
Price (Є) quarterly moving
cantered average
Moving Trend
Average
1 1 900

2 2 960
2019 1,030.0
3 3 1,010 1,022.5 0.99
1,015.0
4 4 1,250 997.5 1.25
980.0
1 5 840 997.5 0.86
962.5
2 6 820 930.0 0.88
CHAPTER 4 Page 127
2020 897.5
3 7 940 917.5 1.02
937.5
4 8 990 917.5 1.03
976.3
1 9 1,000 917.5 1.00
976.3
2 10 975 1,051.3 0.93
2021 1,086.3
3 11 1,100 1,051.3 1.00
1,105.0
4 12 1,270 1,119.4 1.13
1,133.8
1 13 1,075 1,153.8 0.93
1,173.8
2 14 1,090 1,183.8 0.92
2022 1,193.8
3 15 1,260 1,203.1 1.05
1,212.5
4 16 1,350 1,228.8 1.10
1,245.0
1 17 1,150 1,253.8 0.92
1,245.0
2 18 1,220 1,275.0 0.96
2023 1,245.0
3 19 1,330

4 20 1,450

You are required to:


Compute the seasonal indices using multiplicative model.

4.6.8 Cyclical Variations

These variations can be classified as medium-term fluctuations. In business-related time


series, it is common to observe cyclical variations. A business cycle represents the repeated
pattern of alternating periods of growth and decline in business activity, distinct from the
long-term trend. The business cycle can be divided into four distinct phases or periods.

Namely:
- prosperity (or boom)
- decline
- depression
- recovery (or improvement)
CHAPTER 4 Page 128
4.6.9 Random Variations

These are disturbances due to everyday unpredictable influences such as weather


conditions, machine break downs, transport break downs, strike and so on.

The evaluation of cyclical and random components is not required for the purpose of
examination. Only a basic understanding of these two variations and examples for them are
expected.

4.6.10 Seasonally adjusted data

A seasonal effect refers to a consistent and calendar-related impact on data. Seasonal


adjustment involves estimating and removing this effect from a time series. After removing
the seasonal variations, the resulting data is called "deseasonalized" or "seasonally adjusted"
data. Seasonal adjustment is necessary because seasonal effects can hide the actual
underlying patterns in the series and certain non-seasonal characteristics.

With additive model, seasonally adjusted data would be Y - S whereas with multiplicative
model it would be Y / S

4.6.11 Uses of Time Series

• By analysing time series data, we can gain insights into past trends and patterns, enabling
us to better understand how things have behaved over time.

• Time series analysis aids in making informed decisions for future operations. By studying
historical data, we can identify patterns and make predictions, helping us plan and
prepare for what lies ahead.

• Time series analysis allows us to evaluate current fluctuations and understand the factors
driving them. By examining the data in real-time, we can identify and respond to any
deviations or changes in the patterns.

• Time series analysis enables meaningful comparisons between different datasets. By


comparing multiple time series, we can identify correlations, trends, and draw important
conclusions that contribute to our overall understanding of the data.
CHAPTER 5 Page 129

Introduction to Financial
Mathematics

Learning Outcomes

After successful completion of this chapter, you should be able to:

• Calculating the present value of a future cash flow


• Calculating the present value of an annuity and a perpetuity
• Calculating the net present value and decide the viability of a project
• Understanding the loan repayment process and preparing the amortisation
schedule
• Understanding the difference between nominal interest rate and effective annual
rate
• Understanding the impact of the inflation

5.0 Time value of money

The concept of time value of money (TVM) suggests that the present value of a sum of money
is higher than its future value due to the potential returns it could generate over time. TVM
is a fundamental principle in finance, asserting that a specific amount of money holds greater
value if received in the present rather than in the future.

5.1 Factors influencing the TVM

The TVM serves as a valuable tool for making sound financial decisions. Numerous factors
influence the TVM.

5.1.1 Consumption preference


When individuals experience an equal level of contentment, they often choose immediate
consumption over delayed consumption. However, many people are willing to forgo present
consumption in order to have the opportunity to consume more in the future.

5.1.2 Future uncertainty


Since future events are uncertain and cannot be guaranteed, it is generally more desirable to
engage in consumption now rather than waiting for the future.
CHAPTER 5 Page 130
5.1.3 Inflation
Inflation and the purchasing power of money are closely linked, as the value of money
generally erodes over time, resulting a decrease in its purchasing power.

5.1.4 Investments opportunities


If an investment generates a consistent stream of cash flow, the periodic returns can be
reinvested to potentially achieve even higher returns.

Recap – Compound interest


Under compound interest method, the interest is calculated on the principal amount and
the accumulated interest of previous periods.
Formula
𝑟 𝑛𝑡
𝑉 = 𝑃 (1 + )
𝑛
Where,
V = The amount of money accumulated after ‘t’ years, including interest
P = Principal amount
r× 100 = Annual rate of interest
n = Number of times the interest is compounded per year
t = The number of years (time period in years)

Alternative formula for compounding interest.


In some test book this formula is presented as follows
𝑉 = 𝑃(1 + 𝑟)𝑛

Illustration 5.1 – Compound interest


Ahamed deposited Rs. 100,000 in an account which offers 10% interest per annum. Find
the value of money after 05 years of time, if interest is compounded semi-annually.

P = Rs. 100,000
r = 0.1
n=2
t = 5 years

Solution
𝑟 𝑛𝑡
𝑉 = 𝑃 (1 + )
𝑛

0.1 2 ×5
𝑉 = 100, 000 (1 + )
2
V = Rs. 162,889.46
CHAPTER 5 Page 131
5.2 The concept of discounting

5.2.1 Present value


The concept of present value can be explained in two ways.
• It is the current value of an amount to be received some time in the future (It represent
the current value of an amount that will be due at a future point in time)
• It is the amount that needs to be invested today in order to generate a specific amount at
a future date

The term ‘present value’ simply means the amount of money that must be invested now, for
a duration of n years at an interest rate of r%, in order to accumulate a predetermined future
sum of money at the time it becomes due.

5.2.2 The basic principles of discounting

Discounting is the reverse process of compounding. Take a look at the rearrangement of the
formula below.
𝑉 = 𝑃(1 + 𝑟)𝑛
𝑉
𝑃=
(1 + 𝑟)𝑛
1
𝑃=𝑉 ×
(1 + 𝑟)𝑛

Let’s substitute PV for present value instead of P and FV for future value instead of V. Then
the formula becomes as follows:
1
𝑃𝑉 = 𝐹𝑉 ×
(1 + 𝑟)𝑛
1
Here, we define the term as the discounting factor (DCF)
(1+𝑟)𝑛
Therefore,
𝑃𝑉 = 𝐹𝑉 × 𝐷𝐶𝐹
Hence, the present value can be obtained by multiplying the future value with the discounting
factor.

5.2.3 How to find discounting factor.


There are two methods that can be used for calculating the discounting factor.

a) Method 1: Calculator method

Substitute the given values into the formula and perform necessary calculations using the
calculator to obtain the discounting factor. Following Table illustrates the calculation of
discounting factor for different values of r ×100 (interest rate) and n (number of
compounding periods)
CHAPTER 5 Page 132
Illustration 5.2 – Calculation of discounting factors

r n 𝟏 DCF
( 𝟏 + 𝐫 )𝐧
10% 0 1 1
( 1 + 0.1 )0
10% 1 1 0.9091
( 1 + 0.1 )1
10% 2 1 0.8264
( 1 + 0.1 )2
10% 5 1 0.6209
( 1 + 0.1 )5
15% 0 1 1
( 1 + 0.15 )0
15% 1 1 0.8696
( 1 + 0.15 )1
15% 2 1 0.7561
( 1 + 0.15 )2
15% 5 1 0.4972
( 1 + 0.15 )5
20% 0 1 1
( 1 + 0.2 )0
20% 1 year 6 months 1 0.7607
( 1 + 0.2 )1.5
20% 5 years 3 months 1 0.384
( 1 + 0.2 )5.25
12.5% 3 1 0.7023
( 1 + 0.125 )3
18.75% 8 1 0.2529
( 1 + 0.1875 )8

Note: It was assumed that the interest was compounded annually.


CHAPTER 5 Page 133

b) Using the present value table.

How to read the table.


▪ The first column indicates the time periods, which can be in years, quarters, months as
based on the question
▪ Starting from the second column, the interest rates range from 1% to 20%.
▪ The discounting factors are presented with three decimal places.
▪ For example, if you need the discounting factor for the interest rate of 10 % over the
period of 3 years, refer to the column for 10 % and find the value in the 3rd row, which is
0.751.

Illustration 5.3 – Present value

Find the present value of


(a) Rs. 400,000.00 payable in 2 years’ time, assuming an investment rate of 8 per cent
per annum compounded annually.

(b) Rs. 750,000.00 to be received in 3 years’ time, assuming an annually compounded


investment rate of 15% per annum.
CHAPTER 5 Page 134

Solution
Using the present value table:
(a) Using the table, you can find the present value factor (or discount factor) of 0.857
for n = 2 and r=8%. This means that the present value of Rs.1/= for n = 2 and r=8%
is 0.857; hence the present value (PV) of Rs. 400,000.00 is:

400,000 x 0.857 = 342,800.00

Using the calculator method:


1
𝑃𝑉 = 𝐹𝑉 ×
(1 + 𝑟)𝑛

1
𝑃𝑉 = 400,000 × = 𝑅𝑠. 342,935.53
(1 + 0.08)2

Using the present value table:

(b) Similarly, the PV factor (discount factor) for n=3 and r=15% is 0.658. Hence the PV
of 750,000 is:
750,000 x 0.658 = 493,500

Using the calculator method:


1
𝑃𝑉 = 750,000 × = 𝑅𝑠. 493,137.17
(1 + 0.15)3

Note: It is observed that utilising tables may result in a loss of some accuracy due to the
rounding errors of the discounting factor. Nevertheless, when numerous calculations
are involved, their usage proves to be significantly faster, making tables a preferred
choice. However, it is important to note that tables may not always be applicable due to
certain “gaps” present in them. For instance, combinations such as n = 2.5 years and r =
4.5% may not be appear in the tables, requiring the application of first principles
(calculator method) in examples involving such values.

Test yourself – 5.1

Calculate the present values of the following amounts, using PV tables whenever
possible, and resorting to first principles otherwise.
(a) Rs. 400,000 payables in 8 years’ time at a rate of 12%
(b) Rs. 125,000 payables in 3 years’ time at a rate of 7%
(c) Rs. 250,000 payables in 5 years’ and 9 months’ time at a rate of 10%
(d) Rs. 500,000 payables in 8 years’ time at a rate of 7.35%
CHAPTER 5 Page 135

5.3 Net present value

Net present value (NPV) is a financial metric used to determine the profitability of an
investment project. It represents the difference between the present value of cash inflows
and the present value of cash outflows over a given time period. NPV takes into account the
time value of money by discounting future cash flows back to their present value.

A positive NPV indicates that the investment is expected to generate more value than the
initial cost, making it potentially worthwhile. Conversely, a negative NPV suggests that the
investment may result in a loss or not meet the required return.

Illustration 5.4 – Net Present Value


A company has the option to purchase a machine for Rs. 850,000. The finance division has
estimated that the machine will generate the following cash flows over the next five years.
Year 01: Rs. 200,000
Year 02: Rs. 220,000
Year 03: Rs. 275,000
Year 04: Rs. 185,000
Year 05: Rs. 120,000

Additionally, at the end of five-year period, the machine can be scrapped for Rs. 55,000.
Assuming an interest rate of 10% per annum, find the NPV of the machine.
(Assume all the above cash inflows occur at the end of the year.)

Solution
A simple but simplified tabular form can be used.

Year Cash inflow at the Discounting factor at an Present value (Rs.)


end of each year interest rate of 10%
1 280,000 0.909 254,520
2 220,000 0.826 181,720
3 275,000 0.751 206,525
4 185,000 0.683 126,355
5 120,000 + 55,000 0.621 108,675
Total 877,795
NPV = 877, 795 – 850,000 27,795

The positive NPV indicates that the investment is profitable.


CHAPTER 5 Page 136

Test yourself – 5.2


Banana co. is evaluating the potential investment of Rs. 50,000 in a piece of equipment.
The project is expected to generate cash profits, (the excess of income over cash
expenditure), of Rs. 30,000 in the first year and Rs. 40,000 in the second year. However,
the company has a policy of only investing a project that offer a return in excess of 15%
per annum.
Required
Assess whether the investment is worthwhile or viable.

Test yourself – 5.3

A company is wondering whether to spend Rs. 180,000 on an item of equipment, in order


to obtain cash profits as follows.

Year Rs.
1 60,000
2 80,000
3 50,000
4 10,000

The company requires a return of 10% per annum.


Required
Assess whether the project is viable.

Test yourself – 5.4


Compare the following three potential investments, A, B, and C, given that the investor
has a maximum of Rs. 150,000 to invest and the prevailing rate of interest is 11%:

Investment A Investment B Investment C


(Rs.000) (Rs.000) (Rs.000)
Initial outlay 140 140 120
Inflow at the end of
Year 1 0 70 100
Year 2 60 70 80
Year 3 80 70 50
Year 4 100 70 50
Year 5 100 0 50
Year 6 100 70 50

None of the investments brings any income after year 6.


CHAPTER 5 Page 137

5.4 Annuities

An annuity is a financial product that involves a series of regular payments or receipts over
a specified number of periods. In the context of investment, an annuity represents a series of
equal payments made at equal intervals. Examples of annuities include regular deposits into
a savings account, monthly home mortgage payments, monthly insurance payments,
retirement planning and pension payments. Annuities can be categorised based on the
frequency of payment dates.

There are mainly two types of annuities according to the time the transactions occur. When
it happens at the beginning of each period is called an ordinary annuity. At the end it is said
to be an annuity due.

When comparing two or more annuities, it is important to consider that they may cover
different time periods and their net present values (NPV) become significant in making the
ultimate decision.

The present value of an annuity of amount of Rs. 1 paid for n years at the rate of r% is given
by,
1 1
[1 − (1+𝑟)𝑛]. This is called as annuity factor (or discount factor).
𝑟

This annuity factor represents the present value of an annuity of Rs. 1 period for the specified
number of periods and interest rate.
The present value of an annuity of an amount of Rs. C paid for n years at the rate of r% is
given by
1 1
𝐶 × [1 − ] = 𝐶 × Annuity factor.
𝑟 (1 + 𝑟)𝑛

Note: This interest rate is called as discount rate since it is used to discount a stream of
future cash flows to their present value.

Illustration 5.5 – Annuity factor

Interest rate Periods (n) 𝟏 𝟏 Cumulative discount


(𝟏 − )
or discount 𝒓 (𝟏 + 𝒓)𝒏 factor for n periods
rate (r)
5% 3 1 1
(1 − (1+0.05)3) 2.7232
0.05
5% 5 1 1
(1 − (1+0.05)5) 4.3295
0.05
10% 2 1 1
(1 − (1+0.1 )2) 1.7355
0.1
10% 5 1 1
(1 − (1+0.1 )5) 3.7908
0.1

The cumulative present value table also can be used.


CHAPTER 5 Page 138

5.4.1 Annuity tables (or cumulative present value table)

An annuity table is a financial tool used to calculate the present value of equal cash flows over
a specific period of time. An annuity table provides the annuity factor based on different
interest rates (discount rates) and time periods. To calculate the present value of an annuity,
you multiply the annual cash flows by relevant annuity factor obtained from the table. A
portion of the table is provided below for your convenience. For example, find the present
value of an annuity that provides a payment of Rs. 10,000 per year for a duration of 5 years,
assuming an expected interest rate of 10%. First using the annuity table, locate the
corresponding annuity factor, which is 3.791. Next multiply this by the given cash flow per
year (Rs. 10,000) to calculate the present value. Hence the answer is Rs. 10,000 × 3.791 = Rs.
37,910.

Illustration 5.6 – Annuities


What is the present value of Rs. 70,000 per annum for 4 years, using a discount rate of
10% per annum?

Answer
Method 1: Using the annuity formula:

C = Rs. 75,000, r = 0.1 and n = 4


1 1
PV = 775,000 × 0.1 (1 − (1+0.1 )4) = 𝑅𝑠. 237,739.91

Method 2: Using the annuity table:


Annuity factor for r = 0.1 and n = 4 is 3.170, C = Rs. 75,000
PV = 75,000 x 3.170 = Rs. 237,750
CHAPTER 5 Page 139

5.4.2 Use of annuity tables to calculate a required annuity

We know the relationship between the present value (PV) of an annuity, annuity (C) and the
annuity factor. Using this relationship, we can easily form an equation to find the annuity for
a given present value.

𝑃𝑉 = 𝐴𝑛𝑛𝑢𝑖𝑡𝑦 (𝐶) × Annuity factor

𝑃𝑉
𝐴𝑛𝑛𝑢𝑖𝑡𝑦 = Annuity factor

Illustration 5.7 – annuity amount


A lending company grants a loan of Rs. 125,000 at an interest rate of 12% per annum.
The borrower is required to repay the loan in 12 equal annual instalments. What is the
amount that must be paid each year to repay the loan?

Answer
The logic of the question can be depicted as follows.
Year Year Year Year Year …………………………………….. Year
01 02 03 04 05 12

Rs. 125,000
Since the company pays out the loan money now, the present value (PV) of the loan is Rs.
125,000. The annual repayment on the loan can be thought of as an annuity. We can
therefore use the annuity formula for the calculation of annuity.

𝑃𝑟𝑒𝑠𝑒𝑛𝑡 𝑣𝑎𝑙𝑢𝑒 𝑜𝑓 𝑎𝑛 𝑎𝑛𝑛𝑢𝑖𝑡𝑦


Annuity =
𝐴𝑛𝑛𝑢𝑖𝑡𝑦 𝑓𝑎𝑐𝑡𝑜𝑟

The annuity factor can be found using the annuity table is found by looking in the
cumulative present value tables under n=12 and r=12%, which is. The corresponding
factor is 6.194.

125,000
Therefore, annuity = = Rs. 20,180.82
6.194

The borrower should pay Rs. 20,180.82 each year for 12 years to repay the loan.

Test yourself – 5.5


Rohan Perera is considering two annuities, both of which will involve the same purchase
price.
▪ Annuity X pays Rs. 500,000 each year for 20 years.
▪ Annuity Y pays Rs. 550,000 each year for 15 years.
Both annuities start payment 1 year after purchase and neither is affected by the death
of the investor.
CHAPTER 5 Page 140
Assuming a constant interest rate of 8% per annum, which of the two annuities is better?

Test yourself – 5.6


An annuity is structured such that you need to pay Rs.14,000/- at the end of each year for
the next 5 years. If the interest rate is 9% per annum, what would be the present value of
the annuity (to the nearest integer)?

Test yourself – 5.7


Nisal has borrowed a loan of Rs.600,000/- from a bank for a duration of 5 years at the
interest rate of 10% per annum. It is to be repaid in equal annual instalments. Calculate
the value of an annual instalment of the loan.

5.5 Perpetuity

A perpetuity is an annuity in which the periodic payments begin on a fixed date and
continue indefinitely. It is sometimes referred to as a perpetual annuity. Fixed coupon
payments on permanently invested (irredeemable) sums of money are prime examples of
perpetuities.

Perpetuity is similar to an annuity, but with the key distinction: the payments continue
indefinitely, without a predetermined end date. This makes perpetuities particularly
attractive to individuals or organisations seeking to ensure ongoing payments to their
descendants or support a long-term cause.
Since this is a type of annuity, the same formula used above to compute the present value of
annuity of an amount of Rs. C paid for n years at the rate of r% can be used.
The formula for calculating the present value of perpetuity is:
1 1 𝐶
lim 𝐶 × [1 − (1+𝑟)𝑛 ] =
𝑛→∞ 𝑟 𝑟

Where C is the cash flow received each period and r is the interest rate.

Illustration 5.8 – Perpetuity


How much should be invested now (to nearest Rs.) to receive Rs. 65,000 per annum in
perpetuity if the annual rate of interest is 12%?
Answer
𝐶
𝑃𝑣 = Where,
𝑟
c = Rs. 65,000 and r = 0.12
65,000
𝑃𝑉 =
0.12
PV = Rs. 541,667
The amount of Rs. 541,667 should be invested now in order to get Rs. 65,000 perpetuity.

Test yourself – 5.8


Alex has won a lottery that pays him Rs. 25,000 per annum for life. Using an interest rate
CHAPTER 5 Page 141
of 10%, calculate the present value of the income, assuming he lives forever.

5.6 Amortisation

Amortisation is an accounting technique used to periodically reduce the book value of


a loan or an intangible asset over a specified period. In the context of a loan, amortisation
involves the gradual repayment of the loan amount through regular payments spread out
over time. Similarly, when applied to an asset, such as intellectual property or patents,
amortisation functions similar to depreciation.

Amortisation is commonly associated with long-term loans such as mortgages, car loans, or
personal loans, where borrowers make regular payments over an extended period until the
loan is fully repaid.

5.6.1 Amortisation schedules

Amortisation schedules are used by lenders, such as financial institutions, to outline the
repayment schedule of a loan based on a specific maturity date. These schedules are
commonly used for amortising loans, which involve consistent payment amount throughout
the loan term, but with varying proportions of interest and principal in each payment. A
typical example of an amortising loan is a traditional mortgage.

A loan amortisation schedule provides a comprehensive table that illustrates the


periodic loan payments, showing the amount of principal and interest that comprise each
level payment until the loan is fully repaid by the end of its term. Initially, the majority of
each payment is applied to interest, while as the schedule progresses, an increasing portion
of each payment goes towards reducing the remaining principal balance of the loan.

5.6.2 Steps for Amortisation schedules

Step 01: Find the annuity value.

Step 02: Prepare a table with six columns showing years, loan at beginning, interest, total
to be paid, period payment and period-end balance.

Step 03: Complete the table with the values until the balance amount becomes zero (or
approximately zero)

Illustration 5.9 - Amortisation schedule

A loan of Rs.700,000/- was obtained by Amal for 5 years at the interest rate of 8% per
annum. The loan is to be settled in 5 equal annual instalments.

(a) Calculate the annual instalment of the loan.


(b) Prepare the amortisation schedule to illustrate the re-payment of the loan.
CHAPTER 5 Page 142

Solution
a) 𝑃𝑉 = 𝐴𝑛𝑛𝑢𝑖𝑡𝑦 (𝐶) × Annuity factor
𝑃𝑉
𝐴𝑛𝑛𝑢𝑖𝑡𝑦 = Annuity factor
700,000
Annuity amount = = Rs. 175,319.52
3.9927

b) Amortisation schedule
Year Loan at the Interest @ 8% Total to be Annual Balance at
beginning (A) B = (𝐴 × 0.08) paid. payment the end
A+B (C) A+B - C
01 700,000 56,000 756,000 175,319.52 580,680.48
02 580,680.48 46,454.44 627,134.92 175,319.52 451,815.40
03 451,815.40 36,145.23 487,960.63 175,319.52 312,641.11
04 312,641.11 25,011.29 337,652.40 175,319.52 162,332.88
05 162,332.88 12,986.63 175,319.51 175,319.52 0

Test yourself – 5.9


A loan of Rs.120,000/- was obtained by a person at the interest rate of 8% per annum
which is to be paid in 3 equal annual instalments.

(a) Calculate the value of an annual instalment of the loan.


(b) Prepare the amortisation table to illustrate the repayments of the loan.

5.7 Nominal and effective rates of interest

The nominal interest rate is the stated or quoted rate of interest on a financial instrument
without considering the effect of compounding.

The effective interest rate is the interest rate that takes into account the effect of
compounding, providing a more accurate measure of the cost of borrowing or the return on
investment.

The effective interest rate is typically higher than the nominal interest rate due to the
compounding effect.

Illustration 5.10 – Effective rates of interest


Suppose the rate of interest on a loan is stated as 12% per annum with payments made
every 06 months. This means that 6% would be paid every 06 months.

To find the effective annual rate of interest, we need to consider the impact of two 6%
increases on an initial value of Rs. 1.
CHAPTER 5 Page 143
The value at the end of 1 year = 1 x 1.06 x 1.06 = Rs. 1.1236

If we deduct the initial Rs. 1 from the final value we get Rs. 0.1236. This represents the
interest earned on Rs. 1 over one year.

Expressed as a percentage, this is 12.36% per annum, which is higher than the stated
nominal rate of 12%

Therefore, the nominal rate is 12% per annum, while the effective rate of interest is
12.36% per annum.

Test yourself 5.10


Find the effective annual rates of interest corresponding to the following,
(a) 3% per every 06 months
(b) 4% per quarter
(c) 1.5% per month

5.8 Impact of Inflation

Inflation is a sustained increase in the general price level of goods and services in an economy
over a period of time. When the price level rises, each unit of currency buys fewer goods and
services. Consequently, inflation reflects a reduction in the purchasing power per unit of
money – a loss of real value in the medium of exchange and unit of account within the
economy. A chief measure of price inflation is the inflation rate, the annualised percentage
change in the general price index. The opposite of inflation is deflation.
Institute of Certified Management Accountants of Sri Lanka
29/24, Visakha Private Road, Colombo 04, Sri Lanka
Tel : +94 (0)11 2506391, 2507087, 4641701-3 , Fax : Ext 118
www.cma-sri lanka.org
secretariat@cma-srilanka.org

ISBN 978-955-0926-43-5

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy