Rio 094551
Rio 094551
DEPARTMENT OF STATISTICS
ENROLLMENT NO 509
SUBMITTED TO MS KAINAT
1
Question no 1
Discuss the difference between time series and cross sectional data with
the help of suitable example.
Answer
Definition: Time series data tracks the same variable(s) over a period of time. It’s
like a timeline where each data point is associated with a specific time stamp.
Example: Daily temperature readings, monthly sales data, or annual GDP growth.
2. Cross-Sectional Data
Now that we’ve established the difference, let’s apply these concepts to cricket to
make it more engaging and relatable.
2
Time series data in cricket is like watching a player’s career unfold over the years.
It tracks performance metrics over time, revealing trends, peaks, and slumps.
Think of it as the storyline of a player’s journey.
Imagine we’re analyzing Virat Kohli’s batting average in ODIs from 2010 to 2023.
Your dataset might look like this:
YEARS Average
2010 35.00
2012 42.50
2013 42.80
…….. …….
2023 50.00
Now, imagine we’re analyzing the batting averages of the top 5 batsmen in the
2023 Cricket World Cup. Your dataset might look like this:
3
Babar Azam 60.00
David Warner 50.00
ANSWR NO 2
1. Data Collection:
2. Data Visualization:
Plot the data (e.g., line graph) to visualize trends, seasonality, and patterns.
3. Decomposition:
4
o Irregular (Noise): Random fluctuations.
4. Stationarity Check:
5. Model Selection:
6. Model Fitting:
7. Model Validation:
o Training and Testing Sets: Split the data into training (e.g., 80%) and
testing (e.g., 20%).
ANSWER NO 3
5
In time series analysis, stationarity is a key concept that determines whether the
statistical properties of a series remain constant over time. There are two types of
stationarity: weak stationarity and strong stationarity. Here's an explanation of
both and their differences:
1. Constant Mean: The mean of the series does not change over time.
6
Aspect Weak Stationarity Strong Stationarity
autocovariance. distribution.
Moments Only first and second moments All moments (mean, variance,
Considered (mean, variance). skewness, etc.).
ANSWER NO 4
Properties:
PART B
7
1. Simplifies Modeling:
Stationarity ensures that statistical tests and confidence intervals are valid.
ANSWER 5
A time series {Yt}{Yt} is called a Random Walk if it satisfies the following equation:
Yt=Yt−1+ϵtYt=Yt−1+ϵt
where:
ϵtϵt: Random noise (usually white noise with mean 0 and constant var