Stata Assignment - Bryson Shelist
Stata Assignment - Bryson Shelist
A. There are 72 observations and 19 variables. The describe command appears on stata as
shown below. The variables being measured are flu vaccinations, high school graduation,
and alcohol impaired driving.
B. Histogram of median household income. The most common one we see is between
50,000 and 60,000.
C. Mean 55833.25, Standard deviation 9428.312, 95th percentile 76551
D. The 90% confidence interval for the median household income is 53,981.43, 57,685.07
a. These results show us that there is a 90% confidence that the median household
income in any country will fall in that area.
b. The standard error is 1111.137
i. The standard error of the mean is a measure of the dispersion of sample
means around all the population means
E. Using the ttest command for both to test whether the population mean of Median
Household Income is equal to $50,000 using a 10% size of test and redoing the test again
to test whether the population mean of Median Household Income is equal to $55000
using a 5% size of test.
a. Our results show that in the case of the 10% size, our men can only be less than
$50,000
b. Our results show that in the case of the 5% size, with income $55,000 there was a
0.7721 chance that the mean was less that $55,000 and 0.4558 chance that the
mean would actually equal $55,000 and a 0.2279 chance that the mean was larger
than the income of the $55,000 at a 5% size.
F. The correlation coefficient between median household income and high school
graduation rate is shown in the screenshot below from stata which equals to 0.3030
a. This is a very weak correlation but still positive
b. The rate is 0.3030
G. Use a scatter command to graph the scatter plot between high school graduation rate and
median household income. The most populated area of dots is around 90% and 95%
graduation rate and between 50,000 and 60,000 median household income.
a. The correlation coefficient 0.3030 which is considered a weak correlation
H. The screenshot provided shows us how the independent variable is related to the constant.
It shows that the median household income increases around 730 dollars when there is a
1$ increase in highschool graduation rates. This is statistically significant because the
0.010 is less than the significant level of 0.050. For this test, the significance level is at
%5 so we test the 95% interval for the t statistics
a. T statistics: 2.66 and -0.43
b. P statistics
i. The P values allows us to reject the null hypothesis
c. To understand the r-squared statistic we need to check the r-squared values that
are 0.0918
i. The r-squared value tells us that the variance in the dependent variable can
be explained by the independent variable.
d. The adjusted value of 0.0789 to clearly judge the fitment of the r-squared values
e. In conclusion, high school graduation rate does affect median household income