Lecture 4-2
Lecture 4-2
Chen Tong
▶ When choosing a portfolio today, they consider not only how their
wealth might vary with future state variables tomorrow such as labor
income, the prices of consumption goods, and future stock expected
return, but also expectations about the labor income, consumption,
and investment opportunities available after tomorrow.
S
Erj = rf + βj [E (rm ) − rf ] + ∑ βsj [E (rs ) − rf ]
s=1
where
S = the total number of state variables in the economy;
rs = the return on state s mimicking portfolio (i.e., the portfolio that
has the maximum correlation with state);
βsj = asset j’s beta related to rs
where fjt is the realization of the j-th factor in period t and εit is the
disturbances or random errors.
▶ The K -factor are systematic since they affect all the stock returns.
The εit can be regarded as firm-specific factors or idiosyncratic risks.
E [˜
ri ] = rf + βi1 λ1 + ⋯ + βiK λK
where βik is the beta or risk exposure on the k-th factor, and λk is
the k-th factor risk premium.
▶ The CAPM can then viewed as a special case of the APT with
K = 1, f1,t = rm,t − rf , and λ1 = E (rm ) − rf .
▶ If one asset has only a unit beta risk on the first factor, that is,
βi1 = 1 and zero betas on all other factors, then its expected return
will be
E [ri ] = rf + λ1
This is, the risk-free rate increased by an extra return to compensate
for taking the factor risk on f1 . This is why λ1 is called the factor
risk premium on f1 , or the extra return one earns by taking one unit
beta risk on the factor. Interpretations for other λs follow the same
way.
▶ In the CAPM, all investors are informed about the true means,
variances, and covariances of the asset returns, and they all use
Markowitz portfolio theory to make their optimal investment
decisions. As a result, all investors hold the same market portfolio of
risky assets and differ only, depending on their risk aversions.
▶ Technically, the APT assumes a K -factor model. That is, the asset
returns are influenced by K factors in the economy via linear
regression equations,
E [˜
ri ] = rf + βi1 λ1 + ⋯ + βiK λK
r˜1 = µ1 + 0.8f˜
r˜2 = µ2 + 1.6f˜
where E (f˜) = 0.
z̃ = 2r̃ 1 − r̃ 2 = 2µ1 − µ2
2µ1 − µ2 = rf
▶ Similarly, the same beta pricing relation holds for any asset; that is,
µi = rf + βi λ1
w 1 β1 + w 2 β2 = 0
and the portfolio:z̃ = w1 r˜1 + w2 r˜2 will be riskless. Hence, it has the
risk-free rate of return
w1 µ1 + w2 µ2 = rf
w1 (µ1 − rf ) + w2 (µ2 − rf ) = 0
N
r˜p = ∑ wi r˜i
i=1
N N N N
= ∑ wi µi + (∑ wi βi1 ) f˜1t + ⋯ (∑ wi βiK ) f˜Kt + ∑ wi ε̃it
i=1 i=1 i=1 i=1
N
∑ wi βik = w1 β1k + w2 β2k + ⋯ + wN βNk = 0, k = 1, 2, . . . , K
i=1
N N
r˜p = ∑ wi µi + ∑ wi ε̃it
i=1 i=1
When there are a large number of assets, the weights can be roughly
of the same magnitude of 1/N, and hence σ (˜
2
rp ) should be of
magnitude of:
2 2 2
(1/N) × N × σu = σu /N
which approaches zero as N approaches infinity. This implies that r˜p
is almost risk free. Hence, in the absence of arbitrage, the return on
r˜p is approximately rf , or
E [˜
ri ] = µi = rf + βi1 λ1 + ⋯ + βiK λK
one can run time-series regressions to obtain all the parameters. (it
seems like the first-stage of fama-macbeth regression).
▶ Regressions of this form are run for each asset (or portfolio) in the
sample, so we have n time-series regressions in total.
▶ Note that the "Factors" in Stage 1 could be any time series variable
that is supposed to influence asset/portfolio returns...
Yt = BFt + Zt
where S̃MBt and H̃MLt are the book-market and size factors,
respectively. These factors are supported by studies that suggest
that market capitalization and stocks classified as either growth or
value impact excess returns.
▶ Without imposing what ft are from our likely incorrect belief, we can
statistically estimate the factors based on the factor model and data
(will be detailed in the next part).
▶ Rather than taking the view that there are only observable factors or
only latent factors, we can consider a more general factor model
with both views,
R̃t = α + β f˜t + βg g̃t + ε̃t
where f˜t is a K -vector of latent factors, g̃t is an L-vector of
observable factors, and βg are the betas associated with g̃t .
ût = Rt − β̂g gt
that is, the difference of the asset returns from their fitted values by
using the observed factors for all the time periods.
where ũt is the random differences whose realized values are ût . The
estimation method for this model is the same as estimating a latent
factor model.
▶ First, the factors are not uniquely defined in the model, but all sets
of factors are linear combinations of each other. This is because if f˜t
is a set of factors, then, for any K × K invertible matrix A, we have:
▶ The second property is that we can assume all the factors have zero
mean, that is, E [f˜t ] = 0. This is because if µf = E [ft ], then the
factor model can be written as:
▶ We need to consider only how to estimate the latent factors f˜t from
the K -factor model,
Ỹt = β f˜t + ε̃t
where both the dependent variables and the factors have zero means,
E (f˜t ) = 0, E [Ỹt ] = 0
where each row is the N asset returns after subtracting their sample
means at time t for t = 1, 2, . . . , T .
A = (x1 , y1 ) , B = (x2 , y2 )
and the inner produce is defined by
A ⋅ B = x1 x2 + y1 y2 = ∣A∣∣B∣ cos(α)
⎛ ζ1 ⎞
⎜ ζ2 ⎟
ξ=⎜
⎜ ⎟
⎟
′
⎜ ⎟ , ξξ=1
⎜ ⋮ ⎟
⎝ ζp ⎠
[p×1]
A ⋅ ξ = ξ A = ∣A∣ cos(α)
′
Var (ξ X ) = ξ ΣX ξ
′ ′
max
ξ
′
s.t. ξξ=1
L(ξ) = ξ ΣX ξ + λ (1 − ξ ξ)
′ ′
FOC
ΣX ξ = λξ
So, the λ is the eigenvalue of ΣX , and ξ is the associated
eigenvector!!!
that is
p
′
ΣX = ∑ λk ξk ξk
k=1
▶ Define the factor as fk = ξk′ X with variance var (fk ) = λk , and define
′
⎛ ξ1 ⎞ ⎛ f1 ⎞
⎜ ′
⎟ ⎜ ⎟
B=⎜ ⎟ f2
F =⎜ ⎟
⎜ ξ2 ⎟
⎜
⎜ ⎟
⎟ , ⎜
⎜ ⎟
⎟
⎜ ⋮ ⎟ ⎜ ⋮ ⎟
⎝ ξp
′ ⎠ ⎝ fp ⎠
[p×p] [p×p]
then we have
′
⎛ λ1 0 ⋯ 0 ⎞⎛ ξ1 ⎞
⎜ ⎟ ⎜ ′
⎟
⎟⎜ ⎟
0 λ2 0
ξp ) ⎜ ⎟
⋮ ⎜ ξ2 ⎟
ΣX = B Σ F B = ( ξ 1 ⎜
′
⎟⎜ ⎟
ξ2 ⋯ ⎜ ⎟ ⎜ ⎟
⎜ ⋮ 0 ⋱ 0 ⎜ ⋮ ⎟
⎝ 0 ⋯ 0 λp ⎠⎝ ξp
′ ⎠
k = {1, . . . , p}
′
fk = ξk X ,
Var (fk ) = λk
Cov (fi , fj ) = 0, i ≠j
▶ We also have
p
′
X = B F = ∑ ξk fk
k=1
and
p p
∑ ξk var (fk )ξk
′ ′ ′
ΣX = B ΣF B = = ∑ λk ξk ξk
k=1 k=1
▶ A simple index:
K
∑i=1 λi
R= p ∈ [0, 1]
∑j=1 λj
where K ≤ p.
N +T NT
IC (K ) = log(V (K )) + K ( ) log ( )
NT N +T
where
N T
2
V (K ) = ∑ ∑ (Yit − β̂i1 fˆ1t − β̂i2 fˆ2t − ⋯ − β̂iK fˆKt )
i=1 t=1
Note that V (K ) is the sum of the fitted squared residual errors of the
factor model.
whose sum is more than 99% of the sum of all the eigenvalues.
where i = 1, 2, . . . , 11.
▶ It is interesting that the loadings on the first factor are all positive.
This implies that a positive realization of F1 will have a positive
effect on the returns of all the bonds.
▶ The second factor has a different pattern from the first. A positive
realization of F2 will have a negative effect on short-term bonds, and
a positive effect on the long-term ones. This is equivalent to an
increase in the slope of the yield curve. So, F2 is commonly
identified as a steepness factor.