Assignment Answers Decision Science
Assignment Answers Decision Science
Q1. Bad gums may mean a bad mood. Researchers discovered that 85% of people who have
suffered a bad mood had periodontal disease, an inflammation of the gums. Only 29% of healthy
people have this disease. Suppose that in a certain community bad moods are quite rare, occurring
with only 10% probability. If someone has periodontal disease, what is the probability that he or
she will have a bad mood?
Ans.
85% of people who have suffered a bad mood had periodontal disease.
29% of healthy people have periodontal disease.
In a certain community, 10% people have bad mood.
To Find:
The probability of having bad mood in presence of periodontal disease in a community of 10%
population having bad mood.
Solution:
Hence, the probability of someone with periodontal disease will have a bad mood is .247
Q2. Using MS-EXCEL show the Regression model, consider ‘Instagram followers’ as dependent
variable and ‘no of post per day’ as an independent variable. Write the interpretation of EXCEL
Tables. Write the conclusion on the fitting of your model also.
Ans.
Introduction
Regression analysis is the part of statistical modelling which use to determine the relationship
between two or more variable. In MS excel, it is very easy to identify the regression relationship.
Relationship between two variables is called as Linear regression analysis and relationship of
more than 2 variables is known as multi regression analysis.
Regression analysis is an analysis that show relationship between dependent [variable which we
tried to understand and predict] and Independent variable [factor help to predict the value of
dependent variable] and produce an equation.
MS-Excel provide the inbuilt method to calculate the regression. Regression feature will be
available in Data tab of option bar of MS-Excel. If regression analysis option is not visible on
menu under Data tab then first install the same by following below steps.
Go to File excel menu bar.
Click on Options option.
Further click on Add-ins.
Select Analysis ToolPak and click on Go button.
Now go to Data tab of menu and Data Analysis option will be visible at the end.
Steps to follow regression analysis on excel
In first step, we will copy the data of both the variables [given in the question] in some sheet of
excel sheet.
Click on Data tab of Excel menu.
Click on Data Analysis option.
Select the list of values for Y axis [Dependent variable].
Select the list of values for X axis [Independent Variable].
Chose option of Labels.
Click on Ok button.
SUMMARY
OUTPUT
Regression
Statistics
Multiple R 0.110590242
R Square 0.012230202
Adjusted R Square -0.077567053
Standard Error 63.02825921
Observations 13
ANOVA
Significance
df SS MS F F
541.054712
Regression 1 541.0547129 9 0.136197946 0.719095592
Residual 11 43698.17606 3972.56146
Total 12 44239.23077
Standard
Coefficients Error t Stat P-value Lower 95% Upper 95%
9.03583858
Intercept 365.0211268 40.39703934 4 2.01618E-06 276.1078427 453.9344109
-
2 4.063380282 11.01037698 0.36905006 0.719095592 20.17029607 28.29705663
There are three components of the regression analysis data generated from MS-Excel.
1) Summary Output: This section provides the basic statistics about the regression model,
include the multiple correlation coefficient (R), adjusted R square, the standard error of
the estimate, and the number of observations. These statistics provide an overall view of
how well the regression model fits the data.
2) ANOVA: This section provides the analysis of variance. This table display the regression
model as a whole, and the significance of independent variable.
Df refers to degree of freedom.
SS refers to sum of squares.
MS is mean square.
F is f statistics that test the overall significance of the model.
Significance F that is the P- value of F.
The most essential component after the ANOVA table is coefficients. This allows to create the
linear regression equation in Excel.
y=bx+a
where, b = coefficient
a = intercept
y = 4.063 x + 365.02
This is the prediction number of “no of follower” increase when “when no of post is 2”.
Conclusion:
Regression is very useful statistical method. By using this, we can compare the different variables
relationship and impact on each other. This tool play a vital role in finance, forecasting etc.
The slope 4.063 represent the estimate change in Instagram follower for every increase in no of
post.
Q 3A): 1000 light bulbs with a mean life of 120 days are installed in a new factory and their
length of life is normally distributed with standard deviation of 20 days.
If it is decided to replace all the bulbs together, what interval should be allowed between
replacements if not more than 10% should expire before replacement?
Ans.
Mean life of light bulb (μ) = 120
Standard deviation (σ) = 20 days
Number of light bulbs (n) = 1000
Percentage of bulbs that should not expire before replacement = 90% = 0.9
Need to find the interval between replacement such that not more than 10% of the bulb expire
before replacement.
It is known that length of the life of the bulb is normal. So we can use standard normal
distribution value of x (length of life of bulb) for the given z-value.
z = (x - μ)/ σ
Ans.
First calculate total of male and female migrant as per the data given in question,
Now, we can calculate the total weighted age for each gender by multiply the median of age with
migrated number of that gender and age group.
Average age for female migrate= Weighted age of female / total of female migrated
Interpretation:
As per the above calculation of average of male and female, it is observed that average age of male
migrant is less than age of female migrant. It means, more young age males are migrating and for
female more aged woman are migrating.
Impact on migration of male and female could be different, policies can be modifying according to
the data interpreted.