Question Bank Part II With Answers
Question Bank Part II With Answers
Ans: The p.d.f. (probability density function) of the standard normal distribution 𝑁~𝑍 0,1 is:
𝑧2
1
𝑓 𝑧 = 2𝜋
𝑒− 2 , −∞ < 𝑧 < +∞
2- Find the value of maximum ordinate for a normal distribution with mean 25 and variance 16.
1 1
Variance = 16, S.D. = 4,𝑓 𝑥 = 𝜎 2𝜋
= 22
=0.0997
4 2
7
3- If 𝑍~𝑁 0,1 , then find the value of “a” such that 𝑃 𝑍 > 𝑎 = 0.025
P(z<a) = 1-0.025=0.975, ф(a)=0.975, a=ф-1(0.975) = 1.96 (using table 10a)
5- In a normal probability distribution, what are the first four moments about mean?
𝜇1= 0 , 𝜇2= 𝜎 2 , 𝜇3= 0 , 𝜇4= 3𝜎 4
6- In a normal distribution 𝑄1 = 8 , 𝑄3 = 17. Find the value of mean and mode.
13- Find the standard deviation, if Q.D. = 3.3725 for a normal distribution.
Q.D.= 0.6745 𝜎, then 𝜎= Q.D. /0.6745 = 3.3725/0.6745 = 5
14- What is the relationship between mean, median and mode in a normal distribution?
The normal probability distribution is a symmetrical distribution. Thus the mean, median and mode
coincide at 𝜇.
Mean = Median = Mode = 𝜇
15- In a normal distribution 𝜇 = 103 and 𝑄3 = 171.094, find the standard deviation.
Q3 =𝜇 + 0.6745 𝜎
171.094 = 103 + 0.6745𝜎,
171.094 − 103
𝜎= = 100.95
0.6745
Chapter # 11
1- Define sample and sampling?
Sample is a part of population which is selected with the expectation that it will represent the
characteristics of the population.
Sampling is a procedure of selecting a representative sample from a given population.
6- What is population?
It is the totality of the observations made on all the objects under consideration, possessing some
common characteristics.
8 5
10- Given 𝑁1 = 3, 𝑛1 = 2, 𝑎𝑛𝑑 𝑁2 = 4, 𝑛2 = 2. If 𝜎1 2 = 3 and 𝜎2 2 = 4. Find 𝑣𝑎𝑟 𝑋1 − 𝑋2 when sampling
is done without replacement?
𝜎12 𝑁1 − 𝑛1 𝜎22 𝑁2 − 𝑛2
3−2 5 84−2 13
𝑉𝑎𝑟 𝑋1 − 𝑋2 = + = 3 ( ) + 4( )=
𝑛1 𝑁1 − 1 𝑛2 𝑁2 − 1 2 3−1 2 4−1 12
4
It is a process in which the sample is selected in such a way that every element of population has a
known non-zero probability of being included in the sample.
31- For finite population of size N = 4, find σx if μ = 6, σ = 5 and n = 2.
𝜎 𝑁−𝑛
𝜎𝑥 = 𝑛 𝑁−1
= 2.887
32- A population consists of values 0, 3, 6, 9. How many possible samples should be drawn without
replacement of size 3?
n
Cr = 4C3= 4 or n
Pr = 4P3 = 24
6
Chapter # 12
1- Explain what is meant by statistical estimation.
It is a procedure of making judgment about the unknown value of a population parameter by using
sample observations.
2- Given 𝑛 = 4 , 𝑋 = 120 , 𝑋 − 𝑋 2 = 303 , compute the best unbiased estimates of the population
mean 𝜇 and of variance 𝜎 2 .
𝑋 𝑋−𝑋 2
𝜇= 𝑛
= 30, 𝑆2 = = 101
𝑛−1
3- Define interval estimation.
It is a procedure of constructing an interval from a random sample; it has a high specified probability
of including the unknown true value of a parameter.
4- Differentiate between point estimate and point estimator.
A point estimate is a specific value which we obtain for an estimator is called point estimate or simply
an estimator.
A statistic used to estimate a population parameter is called a point estimator or simply an estimator.
5- What is meant by point estimation?
The object of point estimation is to obtain a single number from the sample which will represent the
unknown value of the parameter.
6- Write down only the name of the properties of good estimator.
Unbiasedness, consistency and efficiency.
7- Explain the statistical inference.
It is an art of drawing conclusions or inferences about the population from the limited information
contained in the sample.
8- What is meant by unbiasedness?
It refers to the desirability of the sampling distribution of an estimator being centered at the
parameter to be estimated.
An estimator is said to be unbiased estimator if its expected value equals the corresponding
parameter.
9- If 𝑋 = 100 , 𝜎 = 8 𝑎𝑛𝑑 𝑛 = 64 , set up a 95% confidence interval for 𝜇?
𝜎
𝑋 𝑍(1− 𝛼 ) 𝑛
, 1 - α = 95%, α/2 = .025, using table 10a, 𝑍(1− 𝛼 ) = 1.96
2 2
8
100±1.96 => (98.04, 101.96)
64
Chapter # 13
1- Distinguish between null hypothesis and alternative hypothesis.
A null hypothesis, denoted by “𝐻0 ”, is that hypothesis which is to be tested for possible rejection
under the assumption that it is true.
Any hypothesis that is different from the null hypothesis and is set up in parallel to the null hypothesis,
is called alternative hypothesis, denoted by 𝐻1 or 𝐻𝑎 .
2- Define simple hypothesis.
If a hypothesis completely specifies the form of the distribution as well as values of its parameters is
called simple hypothesis.
3- If 𝛼 = 0.05, what will be the value of 𝑍𝛼 ?
2
𝛼/2 = 0.025, using table 10a we get 𝑍𝛼 = -1.96
2
4- What are the assumptions of student’s t-statistics?
The population should be normal, whose variance is unknown and the sample size is small (n≤30).
5- Define level of significance.
The probability of making type-I error is called level of significance, denoted by “α”.
6- Given 𝑋 = 28 , 𝜇0 = 28. Find the value of z-score.
𝑋−𝜇
As Z = 𝜎 =0
𝑛
7- Write a short note on critical region.
A rejection region or critical region specifies a set of values of the test statistic for which the null
hypothesis is rejected.
8- Define acceptance region.
Acceptance region specifies a set of values of the test statistic for which the null hypothesis is not
rejected.
9- Define a Type – I error.
A Type-I error is made by rejecting 𝐻0 when it is actually; true.
10- What is meant by critical value?
The values of the test statistic which separate the rejection and nonrejection region for the test are
called critical values.
11- Explain simple and composite hypothesis.
If a hypothesis completely specifies the form of the distribution as well as values of its parameters is
called simple hypothesis. e.g. 𝜇 = 30
A hypothesis is said to be composite if it does not completely specify the form of the distribution as
well as values of its parameters is called composite hypothesis. e.g. 𝜇 ≤ 30
12- Write down the steps in testing hypothesis of population mean 4, when the sample size is large.
i. 𝐻0 : 𝜇 = 4
ii. 𝐻1 : 𝜇 ≠ 4
iii. Specify level of significance α
𝑋−𝜇
iv. Test Statistic: Z = 𝜎
𝑛
v. Calculation of the value of test statistic Z.
vi. Critical region: Z < 𝑍𝛼 or Z > 𝑍1− 𝛼
2 2
vii. Conclusion: Reject 𝐻0 , if value of test statistics Z falls in the region of rejection.
Acceptance region specifies a set of values of the test statistic for which the null hypothesis is not
rejected.
9
Chapter # 14
1- Express two properties of regression line.
(i) The sum of the residuals is zero, that is 𝑒𝑖 = 0 where 𝑒𝑖 = 𝑦𝑖 − 𝑦i
(ii) The regression line always passes through the point of means ( 𝑋 − 𝑌 , the centre of gravity of
the observed data.
2- What is meant by intercept?
It is the expected mean value of dependent variable when all the values of independent variable are 0.
Or A point where the regression line intersects the axis.
3- Differentiate between regressor and regressand?
The variable, that forms the basis of estimation or prediction, is called regressor or predictor or
independent variable.
The variable, whose value depends upon the selected value of the independent variable, is called
regressand or response or dependent variable.
4- Write any two formulae of correlation co-efficient.
𝑋−𝑋 𝑌−𝑌 𝑆𝑥𝑦
r= , r=
𝑆𝑥 𝑆𝑦
(𝑋−𝑋)2 (𝑌−𝑌)2
Correlation coefficients are used in statistics to measure how strong a relationship is between two
variables.
14- Interpret the meaning when r = 0.
Both the variables are independent.
15- Define negative correlation and positive correlation.
Correlation is said to be negative if both the variables tend to move in opposite direction.
Correlation is said to be positive if both the variables tend to move in same direction.
16- Define regression analysis.
It is a process by which we estimate one of the variables on the basis of other variable.
17- The regression equations of 𝑥 𝑜𝑛 𝑦 is 𝑥 = 40.7 − 0.587𝑦 and of 𝑦 𝑜𝑛 𝑥 is 𝑦 = 20.8 − 0.912𝑥, find 𝑟𝑥𝑦 .
𝑦 = 𝑎 + 𝑏𝑦𝑥 𝑥; 𝑏𝑦𝑥 = −0.912
𝑥 = 𝑎 + 𝑏𝑥𝑦 𝑦; 𝑏𝑥𝑦 = −0.587
𝑟= 𝑏𝑦𝑥 . 𝑏𝑥𝑦 = - 0.732
18- Given that 𝑛 = 15, 𝑆𝑦 = 16.627, 𝑆𝑥 = 7.933 and 𝑋−𝑋 𝑌 − 𝑌 = 148, find 𝑏𝑦𝑥 =? 𝑎𝑛𝑑 𝑏𝑥𝑦 =?
(𝑥−𝑥)(𝑦−𝑦)
𝑏𝑦𝑥 = = 2.35,
𝑆2𝑥
(𝑥−𝑥)(𝑦−𝑦)
𝑏𝑦𝑥 = = 0.535
𝑆2𝑥
19- Explain the difference between fixed variable and random variable.
Independent variable is called fixed variable whereas dependent variable is called random variable.
20- Given 𝑋 = 40, 𝑌 = 180 and 𝑏 = 2, find the value of intercept “a”.
𝑎 = 𝑦 − 𝑏𝑦𝑥 𝑥 = 100
21- If 𝑏𝑦𝑥 = 1.6 𝑎𝑛𝑑 𝑏𝑥𝑦 = 0.4, find the value of 𝑟𝑥𝑦 .
𝑟 = 𝑏𝑦𝑥 . 𝑏𝑥𝑦 = 0.8
22- Given 𝑏𝑥𝑦 = −0.86, 𝑏𝑦𝑥 = −0.85, find 𝑟𝑥𝑦
𝑟 = 𝑏𝑦𝑥 . 𝑏𝑥𝑦 = - 0.855
23- Define simple linear regression.
If the simple regression describes the dependence of the expected value of the dependent random
variable Y as a linear function of the independent variable X, then the regression is called simple linear
regression.
24- Find the slope and intercept of the line whose equation is 3X – 5𝑦 = 20.
−20 + 3 𝑋
𝑦= = - 4 + 0.6 X; Slope = 0.6, Intercept = -4
5
2 2
25- Given 𝑋 − 𝑋 𝑌−𝑌 =0, 𝑋−𝑋 = 10 , 𝑌−𝑌 = 10 and 𝑛 = 5, find the coefficient of
correlation.
𝑋−𝑋 𝑌−𝑌
r= =0
(𝑋−𝑋)2 (𝑌−𝑌)2
26- Explain the method of least square.
The least squares method is a statistical procedure to find the best fit for a set of data points by
minimizing the sum of the squares of residuals of points from the plotted curve.
27- What is the range of the correlation coefficient “r”.
correlation coefficient, r, can take a range of values from +1 to -1.
28- If 𝑟 = 0.48, 𝑠𝑥𝑦 = 36, 𝑠𝑥2 = 16, find the value of 𝑠𝑦 .
𝑆𝑥𝑦
Using r = , Sy= 18.75
𝑆𝑥 𝑆𝑦
29- Explain scatter diagram.
The scatter diagram is a set of points in a rectangular plane (where x measured horizontally and y
measured vertically), where each pair represents an observed pair of values.
30- What are the parameters of the simple linear regression model?
α: intercept; β: Slope
31- Explain the term residual.
11
In regression analysis, the difference between the observed value of the dependent variable (y) and
the predicted value (ŷ) is called the residual (e). where e = y - ŷ
12
Chapter # 15
1- Define contingency table.
A table consisting of ‘r’ rows and ‘c’ columns in which the data is classified according to two attributes
A and B is called (r × c) contingency table.
2- Define rank correlation.
The correlation between ranks of individuals for both the variables x and y is called rank correlation
3- Give formulae for spearman’s rank correlation.
6 𝑑2
𝑟 =1−
𝑛(𝑛2 − 1)
4- Give formulae for yule’s co-efficient of association.
𝐴𝐵 𝛼𝛽 − (𝐴𝛽)(𝛼𝐵)
𝑄=
𝐴𝐵 𝛼𝛽 + (𝐴𝛽)(𝛼𝐵)
5- Define class and class frequency in contingency table.
A class is a set of objects which are sharing a given characteristic.
A class frequency is the number of objects which are distributed in a class.
6- Define the term dichotomy for attributes.
Process of dividing the object into two mutually exclusive classes is called dichotomy.
7- What is positive and negative association?
𝐴 (𝐵)
Two attributes A and B are said to be positively associated, if 𝐴𝐵 > 𝑛
.
𝐴 (𝐵)
Two attributes A and B are said to be negatively associated, if 𝐴𝐵 < .
𝑛
8- What is a contrary class?
The classes Aβ, αB, etc. represented by both positive as well as negative attributes are called contrary
classes.
9- Define independence of attributes.
Two attributes A and B are said to be independent if there is no relationship between them.
(𝐴)(𝐵)
i.e. 𝐴𝐵 = 𝑛
10- Whether the two attributes are attributes are independent or associated for the given data:
N = 1024, (A) = 640, (B) = 384 and (AB) = 54
(𝐴)(𝐵)
If 𝐴𝐵 = 𝑛
, here 54 ≠ 240, A and B are associated.
11- Given that 𝑥 = 20.178, if d.f. = 4 ,𝛼 = 0.01. find table value of 𝑥 2 𝑐ℎ𝑖 𝑠𝑞𝑢𝑎𝑟𝑒 would be.
2
Chapter # 16
1- Define analysis of time series.
The analysis of time series is the decomposition (isolating) of time series into its components for their
separate study.
2- Write down main components of time series.
A time series has the four types of movements or variations are called components of time series.
Secular Trend, Cyclical movements, Seasonal Movements, Irregular Movements.
3- Write two merits of moving average method.
It is easy and simple.
The moving averages of an appropriate period eliminate the periodic fluctuations, so it may be used to
eliminate cyclical and seasonal fluctuations.
4- Define seasonal variations.
The seasonal movements are short term movements that represent the regularly recurring changes.
These variations indicate a repeated pattern of identical changes in the data within one year or less.
5- Define principle of least square.
It says that “the sum of squares of the deviations of the observed values from the corresponding
expected values should be least.”
6- Write down two properties of least square line.
2
The sum of squares of residuals is minimum. 𝑒𝑖 2 = (𝑦𝑖 − 𝑦i)
The least squares line always passes through the point of means ( 𝑋 − 𝑌 , the centre of gravity of the
observed data.
7- Enlist the different methods of measuring secular trend.
Free-Hand curve, Semi-average, Moving average, least squares
8- Define time series in short.
Arrangement of data by successive time periods is called a time series.
9- Write down four phases of a business cycle.
Trough (depression), Expansion (recovery), Peak (boom or prosperity), Recession(contraction)
10- Given that 𝑋 = 0, 𝑌 = 245, 𝑋 2 = 28, 𝑋𝑌 = 66 and n = 7. Fit linear trend.
𝑦 = 𝑎 + 𝑏𝑥
𝑥𝑦 𝑦
𝑥 = 0, 𝑏 = 𝑥2
= 2.36 and 𝑎 = 𝑛
= 35
𝑦 = 35 + 2.36𝑥
11- Explain irregular trend.
The irregular movements refer to fluctuations of irregular nature caused by chance such as wars,
floods etc.
12- Give two examples of seasonal trend.
An increase in consumption of electricity in summer.
An increase in sales of cold drinks during summer.
13- Define signal.
It is a systematic component of variation in time series.
14- Define stationary time series.
It is one whose statistical properties such as mean, variance, correlation, etc. are all constant over
time.
15- Define the term secular trend.
A secular trend is a long-term movement of time series in one direction. This movement is smooth,
steady and regular in nature.
16- Define noise.
The noise is an irregular component of variation in a time series.
17- Write down two advantages of the semi-average method.
It is simple, easy and quick.
It smoothes out seasonal variations.
18- What is historigram?
Graph of time series is historigram.
14
Chapter # 17
1- Differentiate between hardware and software.
The physical parts of the computer are called hardware. Computer hardware consists of CPU,
monitor, mouse, keyboard etc. Whereas, the set of instructions given to the computer to solve the
problem or to control the operations of the computer is called software. e.g. Microsoft windows,
Microsoft office etc.
2- What is secondary storage? Explain with examples.
Secondary storage refers to storage devices and media that are not constantly accessible by a
computer system. Examples include external hard drives, portable flash drives, CDs, and DVDs. These
devices and media must be either plugged in or inserted into a computer in order to be accessed by
the system.
3- What is computer software?
Computer software, also called software, is a set of instructions and its associated documentations
that tells a computer what to do or how to perform a task. Software includes all different software
programs on a computer, such as applications and the operating system.
4- What is compiler?
A compiler is a translator that converts a program written in a high level language into machine code
and operates the whole program at a time.
5- Define central processing unit (CPU).
The component of a computer system that performs the basic operations (such as processing data) of
the system, that exchanges data with the system's memory or peripherals, and that manages the
system's other components.
6- What is a super computer?
A supercomputer is a computer with a high level of performance compared to a general-purpose
computer. These systems are built to process huge amount of data. The performance of a
supercomputer is commonly measured in floating-point operations per second (FLOPS) instead of
million instructions per second (MIPS).
7- What do you understand by ALU?
The arithmetic logic unit is a part of the processor in which all arithmetic and logical operations on the
data are performed
8- What is mean by programming?
Programming is the process of creating a set of instructions that tell a computer how to perform a
task. Programming can be done using a variety of computer ‘languages,’ such as SQL, Java, Python,
C++, etc.
9- What is minicomputer?
The mini computers have large memory and faster input/output devices. They are more expensive and
have more processing speed than microcomputers.
10- What do you know about DOS?
The term DOS can refer to any operating system, but it is most often used as a shorthand for MS-DOS
(Microsoft disk operating system). Originally developed by Microsoft for IBM, MS-DOS was the
standard operating system for IBM-compatible personal computers.
11- Write down the names of different computers.
The four basic types of computers are as under:
i. Supercomputer.
ii. Mainframe Computer.
iii. Minicomputer.
iv. Microcomputer.