17 ae2
17 ae2
Chapter 17
May 5, 2025
We model:
Logistic CDF:
exp(z)
G (z) =
1 + exp(z)
Model:
exp(x ′ β)
P(y = 1|x) =
1 + exp(x ′ β)
Log-odds (Logit):
P(y = 1|x)
log = x ′β
1 − P(y = 1|x)
Model:
P(Y = 1 | X ) = Φ(X β) .
E (y |x1 , . . . , xk ) = exp(b0 + b1 x1 + · · · + bk xk )
log E (y |x) = b0 + b1 x1 + · · · + bk xk
exp(−λ)λh
P(y = h|x) = , λ = exp(xb)
h!
Fully determined by mean λ = E (y |x)
exp(−λ)λh
P(y = h|x) = , λ = exp(xb)
h!
Fully determined by mean λ = E (y |x)
Interpretation:
Approximate: 100 · bj ≈ % change in E (y |x) from a one-unit increase
in xj
Exact change for dummy change (0 to 1):
exp(bk ) − 1
Occurs when the data on the dependent variable are missing (not just
censored at a value) in a non-random way, typically due to an
endogenous selection mechanism.
Example: We only observe wages for people who choose to work.
The decision to work (and thus to have wage observed) may depend
on factors related to the wage outcome, causing a non-random
sample of wages.
Model Setup:
Selection equation: Di∗ = Zi γ + vi , Di = 1 if Di∗ > 0 (observation is
selected), otherwise Di = 0.
Outcome equation: yi = Xi β + ui , observed only if Di = 1.
Assume (ui , vi ) are correlated with correlation ρ. If ρ ̸= 0, then
E [ui | Di = 1] ̸= 0 (violating OLS assumption of zero conditional mean
in selected sample).