This document discusses analyzing and describing random variable data. It contains the results of 6 questions:
1) A normality test on two random variables, NormX and Norm2, found NormX to be more normally distributed.
2) Most observations for NormX and Norm2 fell within expected ranges, with outliers at the ends of the distribution.
3) A uniform distribution plot did not match previous normal distribution plots as different distributions were used.
4) Summary statistics were reported for a data set of 500 values, including mean, median, standard deviation, minimum, maximum, and quartiles. The data was left-skewed.
5) Autocorrelation analysis found lags 3
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
36 views2 pages
Le Hoang Phuc An - 2152367 - Report - CC01 - 2023
This document discusses analyzing and describing random variable data. It contains the results of 6 questions:
1) A normality test on two random variables, NormX and Norm2, found NormX to be more normally distributed.
2) Most observations for NormX and Norm2 fell within expected ranges, with outliers at the ends of the distribution.
3) A uniform distribution plot did not match previous normal distribution plots as different distributions were used.
4) Summary statistics were reported for a data set of 500 values, including mean, median, standard deviation, minimum, maximum, and quartiles. The data was left-skewed.
5) Autocorrelation analysis found lags 3
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2
1.
Minitab and Describing data
MSSV: 2152367 JX=367 JY=4.5 2. Question 1: X= JX-(the third digit of your student ID)=367-5=362 So: Z=(X-365.211)/3.4=-0,94 Question 2: We have: Z=-0.94 So p-value of Z: P(Z) approximately to P(z>=-0.94) is -0.173609
2. Working with random variables
Question 1: From the histogram of NormX we can see that the most frequently values are around 368 which near to the mean of the data, 367. And also, the histogram of NormX shows that it roughly resembles a bell-shaped curve, which is characteristic of a normal distribution. The red curve fitted to the histogram also closely follows this shape. So that, this variable is look like it was drawn from a normal distribution. Question 2: The Anderson-Darling test for the NormX plot is 0.389, which is smaller than the Norm2 value of 0.411. Additionally, NormX's p-value is 0.383 as opposed to Norm2's p-value of 0.341, which is greater. As a result, the NormX plot has a greater p-value than Norm2 for the data to be normally distributed, but for the Anderson-Darling test, Norm Most observations fall in line with the straight line in the ranges of 0.7394% to 97.6619% and 1.13909% to 96.0631% according to the plot of both NormX and Norm2, while the majority of the outliers are found at the head and tail of the red line. Question 3: The plot doesn’t match with the plots I drew in the previous questions because I use Uniform distribution for this plot while in the previous I used normal distribution. Question 4: N: there are 500 value in the data Mean: 367.1 is the average value of the distribution. Median: 367.30 is the central value of the distribution. Standard Deviation: 4.24 is let us know how the values distribute, near or far from mean. In this case it show that most of values are near the mean value. Minimum: 353.90 is the smallest value in the data Maximum: 380 is the largest value in the data Q1: 364,13 is the value where 25% other values smaller than and 75% larger than. Q3: 370.05 is the value where 25% other value larger than and 25% smaller than. Skewness: left-skewed because Mean<Median<Mode (367.1<367.3<368) Outlier: 380.003 and 353.898. Question 5: In the results we see that the lag-3,14,40 predictors are significant at the 0.05 level and other lags are very small. And it doesn’t have any specific pattern. The data in this series has a weak autocorrelation due to the lack of significant spikes during the time lags. So the data is random. Question 6: When compare to Chi-square table we can see that the value LBQ of lag time 3,4 larger than so this are the only 2 values suggesting that data is not random. Hypothesis test for autocorrelations equal to zero (Ljung-Box Q-Test) Null hypothesis test ( H 0 ¿: μ1=μ2 =μ 3 …=0 due to most of value is below the limit significant which make them meaningless in the probability and become 0. Thefore, the autocorrelations up to the highest lag are equal to zero, indicating that there is no autocorrelation in the data.