STAT 3301 - Dataset and Data Summary Report
Jim Higginson
Table of Contents
Data Summary
Cleaning Methodology
Validation Process
References
Data Collection Description
The data for our econometrics research cover Apple’s quarterly stock price and
revenue. We chose Yahoo Finance as the source for Apple’s stock price data because of its
reputation as a reliable platform for accessing historical stock price information. Yahoo Finance
retrieves this data from stock exchanges and financial markets worldwide, which gives it broad
coverage. This choice is relevant to our research topic because we aim to analyze the relationship
between Apple’s stock performance and its revenue. For revenue data, we relied on
Apple’s financial reports, which are the most direct and reliable source for
Apple’s revenue figures. The reports are publicly available through Apple’s investor
relations website and regulatory filings, and they provide detailed insight into the company’s
financial performance. By using both Yahoo Finance for stock price data and Apple’s financial reports
for revenue data, we ensure the accuracy and comprehensiveness necessary for conducting
Data Summary
For our dataset, we calculated the mean, standard deviation, minimum, and maximum
values of the dependent variable, revenue, and of our independent variables,
stockmean, change, and growth. Stockmean is the three-month (quarterly) mean of the
monthly stock price data that we compiled, and it appears in the charts below. We have included our
regression chart below to give a clearer view of the relationship between the dependent
and independent variables. The dataset contains 68 observations on
four variables, all of which are time series variables. A time series
variable tracks observations on one or more variables over time;
in our dataset, these observations are quarterly. As Hayes (2022) notes, it is a common
practice to monitor the price movement of a security across time. This approach proves valuable
in observing the fluctuations of a specific asset, security, or economic variable over a period.
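The summary statistics described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the Stata commands used for this report: the helper name `summarize` and the sample values are hypothetical, and the standard deviation is the sample version (dividing by n - 1).

```python
import statistics


def summarize(values):
    """Return Obs, Mean, Std. dev., Min, and Max for one variable."""
    return {
        "obs": len(values),
        "mean": statistics.mean(values),
        "std_dev": statistics.stdev(values),  # sample standard deviation (n - 1)
        "min": min(values),
        "max": max(values),
    }


# Hypothetical quarterly values, for illustration only (not the report's data).
sample = {
    "revenue": [58.3, 59.7, 64.7, 91.8],
    "stockmean": [50.1, 51.0, 60.2, 72.4],
}

for name, values in sample.items():
    stats = summarize(values)
    print(
        f"{name:<10} {stats['obs']:>3} {stats['mean']:>8.2f} "
        f"{stats['std_dev']:>8.2f} {stats['min']:>8.2f} {stats['max']:>8.2f}"
    )
```

Each printed row follows the column order Variable, Obs, Mean, Std. dev., Min, Max.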
For the first regression, we conducted a regression analysis with stockmean as
the independent variable and revenue as the dependent variable. The resulting regression
The F-value and Prob > F in the output indicate that the independent variables
jointly have significant explanatory power for the dependent variable. Prob > F is
the p-value of the F-test; because it is smaller than the significance level,
alpha = 0.05, we reject the null hypothesis that all slope coefficients are zero.
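For reference, the decision rule behind this comparison can be written out. This is a sketch that assumes an ordinary least squares regression with an intercept, n observations, and k independent variables; the symbols are introduced here for illustration and do not appear in the output.

```latex
% Null hypothesis: all slope coefficients are zero
H_0 : \beta_1 = \beta_2 = \dots = \beta_k = 0

% Overall F-statistic, expressed through R^2
F = \frac{R^2 / k}{(1 - R^2) / (n - k - 1)} \sim F_{k,\; n-k-1} \quad \text{under } H_0

% Decision rule at significance level \alpha = 0.05
\text{Reject } H_0 \text{ if } \text{Prob} > F \;=\; P\!\left(F_{k,\; n-k-1} > F_{\text{obs}}\right) < \alpha
```

With the 68 observations in this dataset and a single independent variable (k = 1), the F-statistic has 1 and 66 degrees of freedom.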
Variable Obs Mean Std. dev. Min Max
From the regression output below, the estimated regression is
When comparing the F and Prob > F values, the independent variables, change and growth, do not
Variable Obs Mean Std. err. Std. dev. [95% conf. interval]
Figure: Revenue (vertical axis, 0 to 150) plotted against Stock Mean (horizontal axis, 0 to 200).
Cleaning Methodology
Step 1: Download Apple Monthly Stock Price from Yahoo Finance and save it as an
Step 2: Add variables "year", "quarter", and "fiscal year" to the Excel file.
Add a variable named year in column H with an IF function (if the month of date > 9,
year + 1, otherwise year). Add a variable named quarter in column I with a nested IF function
(if the month of date < 4, Q2; if the month of date < 7, Q3; if the month of date < 10, Q4;
otherwise Q1). Add a variable named fiscal year in column J that combines year and quarter
with the & (concatenation) operator.
Step 3: Import the Excel file "Apple Stock Price" into Stata.
Step 4: Calculate the mean stock price for each value of the variable "fiscalyear".
Step 5: Export the table of fiscal-quarter mean values as "apple quarterly stock price".
Step 6: Download Apple's quarterly revenue data and save it as an Excel file named "Apple Total
Revenue".
Step 8: Use the Text to Columns function to split columns B (revenue) and C (change) into
Step 9: Combine the mean stock price data into Excel "Apple Total Revenue" by adding a
Step 10: Import the Excel file "Apple Total Revenue" into Stata as a dataset.
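The fiscal-quarter labeling in Step 2 and the averaging in Step 4 can be sketched in Python. This is an illustration under assumptions, not the Excel and Stata workflow itself: the function names `fiscal_label` and `quarterly_means`, the label format (for example "2021Q1"), and the sample prices are hypothetical. The sketch also assumes the fiscal year rolls over in October, so that the year agrees with the quarter rule that places October through December in Q1.

```python
from collections import defaultdict
from datetime import date
from statistics import mean


def fiscal_label(d):
    """Map a calendar date to a fiscal label such as '2021Q1'.

    Assumption: October-December belong to Q1 of the next fiscal year.
    """
    fiscal_year = d.year + 1 if d.month >= 10 else d.year
    if d.month < 4:
        quarter = "Q2"
    elif d.month < 7:
        quarter = "Q3"
    elif d.month < 10:
        quarter = "Q4"
    else:
        quarter = "Q1"
    return f"{fiscal_year}{quarter}"


def quarterly_means(monthly_prices):
    """Average the monthly prices that fall within each fiscal quarter."""
    groups = defaultdict(list)
    for d, price in monthly_prices:
        groups[fiscal_label(d)].append(price)
    return {label: mean(prices) for label, prices in groups.items()}


# Hypothetical monthly prices, for illustration only (not the report's data).
monthly = [
    (date(2020, 10, 1), 110.0),
    (date(2020, 11, 1), 120.0),
    (date(2020, 12, 1), 130.0),
    (date(2021, 1, 1), 132.0),
]
print(quarterly_means(monthly))
```

Grouping by the combined label rather than by quarter alone keeps the same quarter in different fiscal years separate, which is the purpose of the "fiscal year" column in Step 2.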
Validation Process
Step 2: Perform linear regression with "revenue" as the dependent variable and
Step 3: Perform linear regression with "stockmean" as the dependent variable and
References
Hayes, A. (2022, June). What is a time series and how is it used to analyze data? Investopedia.
https://www.investopedia.com/terms/t/timeseries.asp#:~:text=A%20time%20series%20can%20be,of%20a%20security%20over%20time.
Yahoo Finance. (n.d.). Apple Inc. (AAPL) Stock Historical Prices & Data.
https://finance.yahoo.com/quote/AAPL/history/