Econometrics I
Professor William Greene
Stern School of Business, Department of Economics
Part 20: Sample Selection

Heteroscedasticity in Regression

- Generic heteroscedasticity: $y = \beta'x + \varepsilon$, $\mathrm{Var}[\varepsilon_i] = \sigma_i^2$.
- Can we find heteroscedasticity in the data?
  - In the model of the mean? Obviously not.
  - In the variances? Only if it is related to information in the sample/model.
- Heteroscedasticity robust estimators find nothing if the heteroscedasticity is not related to the information set used in fitting the model.
- What does the B-P LM test find? $LM = nR^2$ in the regression of $[e_i^2/s^2 - 1]$ on x or z, etc.

Heteroscedastic Probit

- $y^* = \beta'x + \varepsilon$, $\mathrm{Var}[\varepsilon_i] = \sigma_i^2 = \sigma^2 w_i^2$, but $\sigma^2 = 1$.
- Variance based heterogeneity:
  $y_i^* = \beta'x_i + \varepsilon_i w_i$, $\varepsilon_i \sim N[0,1]$
  $\mathrm{Prob}(y_i=1) = \mathrm{Prob}(\beta'x_i + \varepsilon_i w_i > 0) = \mathrm{Prob}(\beta'x_i/w_i + \varepsilon_i > 0) = \Phi(\beta'x_i/w_i)$
- Mean based heterogeneity (functional form):
  $y_i^* = \beta'x_i/w_i + \varepsilon_i$, $\varepsilon_i \sim N[0,1]$
  $\mathrm{Prob}(y_i=1) = \Phi(\beta'x_i/w_i)$: the same model and the same data.
- Observational equivalence: we cannot distinguish between these two different "models" with observed data.
- http://davegiles.blogspot.ca/2011/05/gripe-of-day.html
- http://davegiles.blogspot.com/2013/05/robust-standard-errors-for-nonlinear.html

Harvey's Model Applied to Probit

$y = 1(\beta'x + \varepsilon > 0)$, $\mathrm{Var}[\varepsilon_i] = [\exp(z_i'\gamma)]^2$
$\mathrm{Prob}(y=1) = \Phi[\beta'x/\exp(z'\gamma)]$

1. MLE of β is inconsistent.
2. "Robust" vc matrix is not helpful.
3. There is no generic test for heteroscedasticity. There are LR, LM, Wald tests for γ = 0.
4. Testing for heteroscedasticity based on 'residuals' is meaningless, because residuals $y - \Phi(\beta'x)$ are meaningless.

An LM Test for Heteroscedastic Probit

$\mathrm{Var}[\varepsilon_i] = [\exp(z_i'\gamma)]^2$

\[ \ln L = \sum_{i=1}^{n} \left\{ y_i \ln F\!\left(\frac{\beta'x_i}{\exp(z_i'\gamma)}\right) + (1-y_i) \ln\!\left[1 - F\!\left(\frac{\beta'x_i}{\exp(z_i'\gamma)}\right)\right] \right\} \]

With $f_i = f\big(\beta'x_i/\exp(z_i'\gamma)\big)$ and $F_i = F\big(\beta'x_i/\exp(z_i'\gamma)\big)$,

\[ \frac{\partial \ln L}{\partial \beta} = \sum_{i=1}^{n} \frac{f_i (y_i - F_i)}{F_i(1-F_i)} \exp(-z_i'\gamma)\, x_i \]

\[ \frac{\partial \ln L}{\partial \gamma} = \sum_{i=1}^{n} \frac{f_i (y_i - F_i)}{F_i(1-F_i)} \exp(-z_i'\gamma)\, z_i\, (-\beta'x_i) \]

If γ = 0,

\[ \frac{\partial \ln L}{\partial(\beta,\gamma)} = \sum_{i=1}^{n} \frac{f_i (y_i - F_i)}{F_i(1-F_i)} \begin{pmatrix} x_i \\ (-\hat\beta'x_i)\, z_i \end{pmatrix} = \sum_{i=1}^{n} g_i = G'i \]

$LM = i'G[G'G]^{-1}G'i = nR^2$, where the (uncentered) $R^2$ is from the regression of a column of ones, i, on the rows $g_i$ of G.

Effect of Productivity on Competitiveness

The problem of endogeneity

There are three potential sources of endogeneity - omitted variables, measurement errors, and simultaneity:

First, omitted variables are likely to be an issue in our analysis, as the factors determining competitiveness are numerous, as outlined in the previous section. Given that we base our analysis on survey data, we are limited to the information provided by the survey. This restriction makes the analysis prone to omitted variables.

Second, measurement errors occur frequently in surveys (Bertrand and Mullainathan 2001). Despite comprising mostly binary questions leaving relatively little room for systematic biases, they might still be present. Both omitted variable biases and measurement errors may bias the OLS result in any direction.

Third, as outlined in detail in the previous section, the majority of the evidence suggests that increasing material productivity has a positive effect on microeconomic competitiveness. However, the causal effect can work in either direction:

[Slide: Endogeneity]

Endogenous RHS Variable

$U^* = \beta'x + \theta h + \varepsilon$, $y = 1[U^* > 0]$
$E[\varepsilon|h] \neq 0$ (h is endogenous)

- Case 1: h is continuous
- Case 2: h is binary = a treatment effect
- Approaches
  - Parametric: Maximum likelihood
  - Semiparametric (not developed here):
    - GMM
    - Various approaches for case 2
    - 2 stage least squares - a good approximation?

Endogenous Continuous Variable

$U^* = \beta'x + \theta h + \varepsilon$, $y = 1[U^* > 0]$
$h = \alpha'z + u$
$E[\varepsilon|h] \neq 0 \Leftrightarrow \mathrm{Cov}[u,\varepsilon] \neq 0$. Correlation = ρ. This is the source of the endogeneity.

Additional assumptions:
- $(u,\varepsilon) \sim N[(0,0), (\sigma_u^2, \rho\sigma_u, 1)]$
- z = a valid set of exogenous variables, uncorrelated with (u, ε)

This is not IV estimation. Z may be uncorrelated with X without problems.

Endogenous Income

- Income responds to Age, Age², Educ, Married, Kids, Gender.
- Healthy = 0 or 1 (0 = Not Healthy, 1 = Healthy) responds to Age, Married, Kids, Gender, Income.
- Determinants of Income (observed and unobserved) also determine health satisfaction.

Estimation by ML (Control Function)

Probit fit of y to x and h will not consistently estimate (β, θ) because of the correlation between h and ε induced by the correlation of u and ε. Using the bivariate normality,

\[ \mathrm{Prob}(y=1|x,h) = \Phi\!\left[\frac{\beta'x + \theta h + (\rho/\sigma_u)u}{\sqrt{1-\rho^2}}\right] \]

Insert $u_i = h_i - \alpha'z_i$ and include $f(h|z)$ to form logL:

\[ \log L = \sum_{i=1}^{N} \left\{ \log \Phi\!\left[(2y_i - 1)\,\frac{\beta'x_i + \theta h_i + \rho\,\dfrac{h_i - \alpha'z_i}{\sigma_u}}{\sqrt{1-\rho^2}}\right] + \log\!\left[\frac{1}{\sigma_u}\,\phi\!\left(\frac{h_i - \alpha'z_i}{\sigma_u}\right)\right] \right\} \]

Two Approaches to ML

(1) Full information ML. Maximize the full log likelihood with respect to $(\beta, \theta, \sigma_u, \rho, \alpha)$. (The built in Stata routine IVPROBIT does this. It is not an instrumental variable estimator; it is a FIML estimator.) Note also, this does not imply replacing h with a prediction from the regression then using probit with $\hat h$ instead of h.

(2) Two step limited information ML. (Control Function)
  (a) Use OLS to estimate α and $\sigma_u$ with a and s.
  (b) Compute $\hat v_i = \hat u_i/s = (h_i - a'z_i)/s$.
  (c) $\log \Phi\!\left[\dfrac{\beta'x_i + \theta h_i + \rho \hat v_i}{\sqrt{1-\rho^2}}\right] = \log \Phi\big(\tilde\beta'x_i + \tilde\theta h_i + \tau \hat v_i\big)$

The second step is to fit a probit model for y to $(x, h, \hat v)$, then solve back for $(\beta, \theta, \rho)$ from $(\tilde\beta, \tilde\theta, \tau)$ and from the previously estimated a and s. Use the delta method to compute standard errors.
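The "solve back" step of the two step control function approach can be sketched as follows. This is only an illustration under stated assumptions: the second step probit is taken to estimate the structural coefficients divided by sqrt(1 - rho^2), with tau = rho/sqrt(1 - rho^2) as the coefficient on the standardized first step residual. The function names and the numerical values are hypothetical and are not taken from the slides.

```python
from math import sqrt
from statistics import NormalDist

_PHI = NormalDist().cdf  # standard normal CDF


def conditional_prob(index, u, rho, sigma_u):
    """Prob(y=1|x,h) = Phi[(beta'x + theta*h + (rho/sigma_u)*u) / sqrt(1-rho^2)].

    `index` is the structural index beta'x + theta*h and `u` is h - alpha'z.
    """
    return _PHI((index + (rho / sigma_u) * u) / sqrt(1.0 - rho ** 2))


def solve_back(beta_tilde, theta_tilde, tau):
    """Recover (beta, theta, rho) from the second step probit coefficients.

    Since tau = rho/sqrt(1-rho^2), rho = tau/sqrt(1+tau^2) and the scale
    factor sqrt(1-rho^2) equals 1/sqrt(1+tau^2).
    """
    scale = 1.0 / sqrt(1.0 + tau ** 2)
    rho = tau * scale
    beta = [b * scale for b in beta_tilde]
    theta = theta_tilde * scale
    return beta, theta, rho


# Hypothetical structural values, used only to show the round trip.
beta, theta, rho = [1.2, -0.02], 0.54, -0.3
s = sqrt(1.0 - rho ** 2)
beta_t, theta_t, tau = [b / s for b in beta], theta / s, rho / s
print(solve_back(beta_t, theta_t, tau))
```

Standard errors for the recovered parameters would still require the delta method, which this sketch does not attempt.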
FIML Estimates

Probit with Endogenous RHS Variable
Dependent variable                HEALTHY
Log likelihood function       -6464.60772
--------+---------------------------------------------------------------
Variable| Coefficient    Standard Error  b/St.Er.  P[|Z|>z]   Mean of X
--------+---------------------------------------------------------------
        |Coefficients in Probit Equation for HEALTHY
Constant|    1.21760***      .06359        19.149    .0000
     AGE|    -.02426***      .00081       -29.864    .0000     43.5257
 MARRIED|    -.02599         .02329        -1.116    .2644      .75862
  HHKIDS|     .06932***      .01890         3.668    .0002      .40273
  FEMALE|    -.14180***      .01583        -8.959    .0000      .47877
  INCOME|     .53778***      .14473         3.716    .0002      .35208
        |Coefficients in Linear Regression for INCOME
Constant|    -.36099***      .01704       -21.180    .0000
     AGE|     .02159***      .00083        26.062    .0000     43.5257
   AGESQ|    -.00025***      .944134D-05  -26.569    .0000     2022.86
    EDUC|     .02064***      .00039        52.729    .0000     11.3206
 MARRIED|     .07783***      .00259        30.080    .0000      .75862
  HHKIDS|    -.03564***      .00232       -15.332    .0000      .40273
  FEMALE|     .00413**       .00203         2.033    .0420      .47877
        |Standard Deviation of Regression Disturbances
Sigma(w)|     .16445***      .00026       644.874    .0000
        |Correlation Between Probit and Regression Disturbances
Rho(e,w)|    -.02630         .02499        -1.052    .2926
--------+---------------------------------------------------------------

Partial Effects: Scaled Coefficients

Conditional mean:
$E[y|x,h] = \Phi(\beta'x + \theta h)$
$h = \alpha'z + u = \alpha'z + \sigma_u v$, where $v \sim N[0,1]$
$E[y|x,z,v] = \Phi[\beta'x + \theta(\alpha'z + \sigma_u v)]$

Partial effects. Assume z = x (just for convenience):

\[ \frac{\partial E[y|x,z,v]}{\partial x} = \phi[\beta'x + \theta(\alpha'z + \sigma_u v)]\,(\beta + \theta\alpha) \]

\[ \frac{\partial E[y|x,z]}{\partial x} = E_v\!\left[\frac{\partial E[y|x,z,v]}{\partial x}\right] = (\beta + \theta\alpha) \int_{-\infty}^{\infty} \phi[\beta'x + \theta(\alpha'z + \sigma_u v)]\,\phi(v)\,dv \]

The integral can be approximated by simulation:

\[ \mathrm{Est.}\ \frac{\partial E[y|x,z]}{\partial x} = (\beta + \theta\alpha)\,\frac{1}{R}\sum_{r=1}^{R} \phi[\beta'x + \theta(\alpha'z + \sigma_u v_r)] \]

For variables only in x, omit $\alpha_k$. For variables only in z, omit $\beta_k$.
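A minimal sketch of the simulated scale factor in the partial effects formula, averaging the normal density over draws of v. The values of theta and sigma_u are the reported FIML estimates for INCOME and Sigma(w); the index value `a` (standing in for beta'x + theta*alpha'z at the means of the data) is a hypothetical number chosen for illustration, as are the function name and the seed.

```python
import random
from statistics import NormalDist

_pdf = NormalDist().pdf  # standard normal density


def simulated_scale_factor(a, b, draws):
    """Average of phi(a + b*v_r) over the supplied standard normal draws.

    a = beta'x + theta*alpha'z (index at the chosen data point),
    b = theta*sigma_u.
    """
    return sum(_pdf(a + b * v) for v in draws) / len(draws)


rng = random.Random(20)  # fixed seed so the draws are reproducible
draws = [rng.gauss(0.0, 1.0) for _ in range(35000)]

theta, sigma_u = 0.53778, 0.16445  # reported FIML estimates
a = 0.25                           # hypothetical index value, for illustration
scale = simulated_scale_factor(a, theta * sigma_u, draws)
print(round(scale, 4))
```

Multiplying `scale` by (beta_k + theta*alpha_k) would give the estimated partial effect for variable k under the z = x convention used above.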
Partial Effects

θ = 0.53778

The scale factor is computed using the model coefficients, means of the variables and 35,000 draws from the standard normal population.

[Slide: Two Stage Least Squares]

Endogenous Binary Variable

$U^* = \beta'x + \theta h + \varepsilon$, $y = 1[U^* > 0]$
$h^* = \alpha'z + u$, $h = 1[h^* > 0]$
$E[\varepsilon|h^*] \neq 0 \Leftrightarrow \mathrm{Cov}[u,\varepsilon] \neq 0$. Correlation = ρ. This is the source of the endogeneity.

Additional assumptions:
- $(u,\varepsilon) \sim N[(0,0), (\sigma_u^2, \rho\sigma_u, 1)]$
- z = a valid set of exogenous variables, uncorrelated with (u, ε)

This is not IV estimation. Z may be uncorrelated with X without problems.

A Recursive Bivariate Probit Model

Treatment effects: recursive simultaneous equations model

$y_1^* = \alpha'z + \varepsilon_1$, $y_1 = 1(y_1^* > 0)$
$y_2^* = \beta'x + \theta y_1 + \varepsilon_2$, $y_2 = 1(y_2^* > 0)$

\[ \begin{pmatrix} \varepsilon_1 \\ \varepsilon_2 \end{pmatrix} \sim N\!\left[\begin{pmatrix} 0 \\ 0 \end{pmatrix}, \begin{pmatrix} 1 & \rho \\ \rho & 1 \end{pmatrix}\right] \]

This model is identified. It can be consistently and efficiently estimated by full information maximum likelihood. Treated as a bivariate probit model. The simultaneity is accounted for by the log likelihood formulation.

Log Likelihood for the RBP Model

$h^* = \alpha'z + u$, $h = 1(h^* > 0)$
$y^* = \beta'x + \theta h + \varepsilon$, $y = 1(y^* > 0)$

\[ \begin{pmatrix} \varepsilon \\ u \end{pmatrix} \sim N_2\!\left[\begin{pmatrix} 0 \\ 0 \end{pmatrix}, \begin{pmatrix} 1 & \rho \\ \rho & 1 \end{pmatrix}\right] \]

\[ \begin{aligned} \log L = {} & \sum_{i|y=1,h=1} \ln \Phi_2(\alpha'z_i,\ \beta'x_i + \theta,\ \rho) + \sum_{i|y=1,h=0} \ln \Phi_2(-\alpha'z_i,\ \beta'x_i,\ -\rho) \\ & + \sum_{i|y=0,h=1} \ln \Phi_2(\alpha'z_i,\ -(\beta'x_i + \theta),\ -\rho) + \sum_{i|y=0,h=0} \ln \Phi_2(-\alpha'z_i,\ -\beta'x_i,\ \rho) \end{aligned} \]

FIML - Recursive Bivariate Probit Model
Dependent variable               PUBDOC
Log likelihood function    -25671.32339
Estimation based on N = 27326, K = 14
Inf.Cr.AIC = 51370.6   AIC/N = 1.880
--------+--------------------------------------------------------------------
  PUBLIC|                  Standard            Prob.     95% Confidence
  DOCTOR| Coefficient       Error        z    |z|>Z*        Interval
--------+--------------------------------------------------------------------
        |Index equation for PUBLIC
Constant|    3.55056***     .07446     47.68   .0000    3.40462    3.69650
     AGE|     .00067        .00115       .58   .5626    -.00159     .00293
    EDUC|    -.16835***     .00416    -40.48   .0000    -.17650    -.16020
  INCOME|    -.98735***     .05172    -19.09   .0000   -1.08872    -.88598
 MARRIED|    -.00997        .02922      -.34   .7329    -.06724     .04729
  HHKIDS|    -.08094***     .02510     -3.22   .0013    -.13014    -.03174
  FEMALE|     .12140***     .02231      5.44   .0000     .07768     .16512
        |Index equation for DOCTOR
Constant|     .58983***     .14474      4.08   .0000     .30615     .87351
     AGE|    -.05740***     .00601     -9.56   .0000    -.06917    -.04563
   AGESQ|     .00082***     .6817D-04  12.10   .0000     .00069     .00096
  INCOME|     .08900*       .05097      1.75   .0808    -.01091     .18890
  FEMALE|     .34580***     .01629     21.22   .0000     .31386     .37773
  PUBLIC|     .43595***     .07358      5.92   .0000     .29174     .58016
        |Disturbance correlation
RHO(1,2)|    -.17317***     .04075     -4.25   .0000    -.25303    -.09330
--------+--------------------------------------------------------------------

Treatment Effects

$y_1$ is a "treatment". Treatment effect of $y_1$ on $y_2$:

$\mathrm{Prob}(y_2=1)|_{y_1=1} - \mathrm{Prob}(y_2=1)|_{y_1=0} = \Phi(\beta'x + \theta) - \Phi(\beta'x)$

Treatment effect on the treated involves an unobserved counterfactual. Compare being treated to being untreated for someone who was actually treated:

$\mathrm{Prob}(y_2=1|y_1=1)|_{y_1=1} - \mathrm{Prob}(y_2=1|y_1=1)|_{y_1=0}$

Treatment Effect on the Treated

\[ TET = \frac{\Phi_2(\alpha'z,\ \beta'x + \theta,\ \rho) - \Phi_2(\alpha'z,\ \beta'x,\ \rho)}{\Phi(\alpha'z)} \]

Average treatment effect on the treated estimated by

\[ \widehat{TET} = \frac{1}{N_1} \sum_{i:\,y_{1i}=1} \frac{\Phi_2(\hat\alpha'z_i,\ \hat\beta'x_i + \hat\theta,\ \hat\rho) - \Phi_2(\hat\alpha'z_i,\ \hat\beta'x_i,\ \hat\rho)}{\Phi(\hat\alpha'z_i)} \]

Treatment Effects

Partial Effects Analysis for RcrsvBvProb: ATE of PUBLIC on DOCTOR
Effects on function with respect to PUBLIC
Results are computed by average over sample observations
Partial effects for binary var PUBLIC computed by first difference
---------------------------------------------------------------------
df/dPUBLIC        Partial   Standard
(Delta Method)     Effect      Error    |t|   95% Confidence Interval
---------------------------------------------------------------------
APE. Function      .16446     .02820   5.83     .10920     .21973
---------------------------------------------------------------------

Partial Effects Analysis for RcrsvBvProb: ATET of PUBLIC on DOCTOR
Effects on function with respect to PUBLIC
Results are computed by average over sample observations
Partial effects for binary var PUBLIC computed by first difference
---------------------------------------------------------------------
df/dPUBLIC        Partial   Standard
(Delta Method)     Effect      Error    |t|   95% Confidence Interval
---------------------------------------------------------------------
APE. Function      .15417     .02482   6.21     .10553     .20282
---------------------------------------------------------------------

[Slide: recursive]

Causal Inference

The authors used

\[ \frac{\partial \Phi(\alpha_1 + \beta_1 X_{ij} + \beta_{PIP} PIP_{ij})}{\partial PIP_{ij}} \]

instead of

\[ \Phi(\alpha_1 + \beta_1 X_{ij} + \beta_{PIP}) - \Phi(\alpha_1 + \beta_1 X_{ij}). \]

It is not clear why they could not use the delta method for this.
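The two treatment effect formulas can be sketched numerically as below. This is an illustration under assumptions, not the routine that produced the reported output: the bivariate normal CDF is approximated with a simple trapezoid rule (a hypothetical helper, `bvn_cdf`), theta and rho are the reported RBP estimates, and the index values are invented for the example.

```python
from math import sqrt
from statistics import NormalDist

_N = NormalDist()  # standard normal distribution


def bvn_cdf(a, b, rho, steps=4000):
    """Phi_2(a, b, rho) = int_{-inf}^{a} phi(t) Phi((b - rho*t)/sqrt(1-rho^2)) dt,
    approximated by the trapezoid rule on [-8, a]."""
    lo = -8.0
    if a <= lo:
        return 0.0
    s = sqrt(1.0 - rho ** 2)
    h = (a - lo) / steps

    def f(t):
        return _N.pdf(t) * _N.cdf((b - rho * t) / s)

    total = 0.5 * (f(lo) + f(a)) + sum(f(lo + k * h) for k in range(1, steps))
    return total * h


def ate(xb, theta):
    """Phi(beta'x + theta) - Phi(beta'x)."""
    return _N.cdf(xb + theta) - _N.cdf(xb)


def tet(za, xb, theta, rho):
    """[Phi_2(a'z, b'x + theta, rho) - Phi_2(a'z, b'x, rho)] / Phi(a'z)."""
    return (bvn_cdf(za, xb + theta, rho) - bvn_cdf(za, xb, rho)) / _N.cdf(za)


theta, rho = 0.43595, -0.17317   # reported RBP estimates
za, xb = 1.2, 0.3                # hypothetical index values for one person
print(round(ate(xb, theta), 4), round(tet(za, xb, theta, rho), 4))
```

Averaging `ate` over all observations and `tet` over the treated observations would give sample analogues of the ATE and ATET; the index values used here are made up, so the printed numbers are not comparable to the reported estimates.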
Econometrics I
Part 20 - Sample Selection

"The delivery was fine. But the book itself is the worst Econometric Analysis book I have ever come across. No examples. Only a continuous list of theorems. I would not recommend anyone this book."

Dueling Selection Biases - From two emails, same day.

"I am trying to find methods which can deal with data that is non-randomised and suffers from selection bias."

"I explain the probability of answering questions using, among other independent variables, a variable which measures knowledge breadth. Knowledge breadth can be constructed only for those individuals that fill in a skill description in the company intranet. This is where the selection bias comes from."

Received Sunday, April 27, 2014

I have a paper regarding strategic alliances between firms, and their impact on firm risk. While observing how a firm's strategic alliance formation impacts its risk, I need to correct for two types of selection biases. The reviews at Journal of Marketing asked us to correct for the propensity of firms to enter into alliances, and also the propensity to select a specific partner, before we examine how the partnership itself impacts risk.

Our approach involved conducting a probit of alliance formation propensity, take the inverse mills and include it in the second selection equation which is also a probit of partner selection. Then, we include inverse mills from the second selection into the main model. The review team states that this is not correct, and we need an MLE estimation in order to correctly model the set of three equations. The Associate Editor's point is given below. Can you please provide any guidance on whether this is a valid criticism of our approach.
Is there a procedure in LIMDEP that can handle this set of three equations with two selection probit models?

AE's comment: "Please note that the procedure of using an inverse mills ratio is only consistent when the main equation where the ratio is being used is linear. In non-linear cases (like the second probit used by the authors), this is not correct. Please see any standard econometric treatment like Greene or Wooldridge. A MLE estimator is needed which will be far from trivial to specify and estimate given error correlations between all three equations."

Samples and Populations

- Consistent estimation
  - The sample is randomly drawn from the population
  - Sample statistics converge to their population counterparts
- A presumption: The 'population' is the population of interest.
- Implication: If the sample is randomly drawn from a specific subpopulation, statistics converge to the characteristics of that subpopulation.

Nonrandom Sampling

- Simple nonrandom samples: Average incomes of airport travelers → mean income in the population as a whole?
- Survivorship: Time series of returns on business performance. Mutual fund performance. (Past performance is no guarantee of future success.)
- Attrition: Drug trials. Effect of erythropoietin on quality of life survey.
- Self-selection:
  - Labor supply models
  - Shere Hite's (1976) "The Hite Report" 'survey' of sexual habits of Americans.
    "While her books are ground-breaking and important, they are based on flawed statistical methods and one must view their results with skepticism."

[Slide: The NYU No Action Letter]

The Crucial Element

- Selection on the unobservables
  - Selection into the sample is based on both observables and unobservables
  - All the observables are accounted for
  - Unobservables in the selection rule also appear in the model of interest (or are correlated with unobservables in the model of interest)
- "Selection Bias" = the bias due to not accounting for the unobservables that link the equations.

Heckman's Canonical Model

A behavioral model:
Offered wage = $o^* = \beta'x + v$ (x = age, experience, educ...)
Reservation wage = $r^* = \delta'z + u$ (z = age, kids, family stuff)
Labor force participation: LFP = 1 if $o^* \geq r^*$, 0 otherwise
$\mathrm{Prob}(LFP=1) = \Phi\!\left[(\beta'x - \delta'z)/\sqrt{\sigma_v^2 + \sigma_u^2}\right]$
Desired hours = $H^* = \gamma'w + \varepsilon$
Actual hours = $H^*$ if LFP = 1, unobserved if LFP = 0
ε and u are correlated. ε and v might be correlated.
What is $E[H^* | w, LFP = 1]$? Not $\gamma'w$.

Standard Sample Selection Model

$d_i^* = \gamma'z_i + u_i$, $d_i = 1(d_i^* > 0)$
$y_i^* = \beta'x_i + \varepsilon_i$
$y_i = y_i^*$ when $d_i = 1$, unobserved otherwise
$(u_i, \varepsilon_i) \sim$ Bivariate Normal$[(0,0), (1, \rho\sigma, \sigma^2)]$

\[ \begin{aligned} E[y_i | y_i \text{ is observed}] &= E[y_i | d_i = 1] \\ &= \beta'x_i + E[\varepsilon_i | d_i = 1] \\ &= \beta'x_i + E[\varepsilon_i | u_i > -\gamma'z_i] \\ &= \beta'x_i + (\rho\sigma)\,\frac{\phi(\gamma'z_i)}{\Phi(\gamma'z_i)} \\ &= \beta'x_i + \theta\lambda_i \end{aligned} \]

[Slide: Incidental Truncation, u1,u2 ~ N[(0,0),(1,.71,1)]]

Selection as a Specification Error

$E[y_i | x_i, y_i \text{ observed}] = \beta'x_i + \theta\lambda_i$

- Regression of $y_i$ on $x_i$ omits $\lambda_i$.
  - $\lambda_i$ will generally be correlated with $x_i$ if $z_i$ is.
  - $z_i$ and $x_i$ often have variables in common.
  - There is no specification error if θ = 0 <=> ρ = 0.
- "Selection Bias" is plim (b - β)
- What is "selection bias…"

Control Function

Labor force participation: $d^* = \gamma'z + u$
What is u? Unmeasured factors that motivate LFP, u = (m, a)
Desired hours: $H^* = \beta'x + \varepsilon$
What is ε? Unmeasured factors that motivate H*, ε = (m, c)
$\varepsilon = \theta u + w$. ε and u share factors, m.
$H^* = \beta'x + \theta u + w$
Regression of H* on x omits u. λ is the prediction of u.
Note, the problem goes away if θ = 0.

Estimation of the Selection Model

- Two step least squares
  - Inefficient
  - Simple - exists in current software
  - Simple to understand and widely used
- Full information maximum likelihood
  - Efficient
  - Simple - exists in current software
  - Not so simple to understand - widely misunderstood

Estimation

Heckman's two step procedure
(1) Estimate the probit model and compute $\lambda_i$ for each observation using the estimated parameters.
(2) a. Linearly regress $y_i$ on $x_i$ and $\lambda_i$ using the observed data.
    b. Correct the estimated asymptotic covariance matrix for the use of the estimated $\lambda_i$. (An application of Murphy and Topel (1984) - Heckman was 1979.) See text, pp. 876-877.

Application - Labor Supply

MROZ labor supply data. Cross section, 753 observations.
Use LFP for binary choice, KIDS for count models.

LFP      = labor force participation, 0 if no, 1 if yes
WHRS     = wife's hours worked, 0 if LFP=0
KL6      = number of kids less than 6
K618     = kids 6 to 18
WA       = wife's age
WE       = wife's education
WW       = wife's wage, 0 if LFP=0
RPWG     = wife's reported wage at the time of the interview
HHRS     = husband's hours
HA       = husband's age
HE       = husband's education
HW       = husband's wage
FAMINC   = family income
MTR      = marginal tax rate
WMED     = wife's mother's education
WFED     = wife's father's education
UN       = unemployment rate in county of residence
CIT      = dummy for urban residence
AX       = actual years of wife's previous labor market experience
AGE      = age
AGESQ    = age squared
EARNINGS = WW * WHRS
LOGE     = log of EARNINGS
KIDS     = 1 if kids < 18 in the home
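The first step of Heckman's procedure produces the inverse Mills ratio, and the conditional mean of the selected observations is beta'x + rho*sigma*lambda. A minimal sketch follows, with a small simulation to check the formula. All parameter values, the seed, and the function names are hypothetical and chosen only for illustration; they are not estimates from the MROZ data.

```python
import random
from statistics import NormalDist

_N = NormalDist()  # standard normal distribution


def inverse_mills(index):
    """lambda = phi(gamma'z) / Phi(gamma'z)."""
    return _N.pdf(index) / _N.cdf(index)


def selected_mean(xb, zg, rho, sigma):
    """E[y | d = 1] = beta'x + rho*sigma*lambda(gamma'z)."""
    return xb + rho * sigma * inverse_mills(zg)


# Simulation check with hypothetical parameter values.
rng = random.Random(1979)
xb, zg, rho, sigma = 1.0, 0.3, -0.5, 2.0
observed = []
for _ in range(100000):
    u = rng.gauss(0.0, 1.0)
    # eps has variance sigma^2 and covariance rho*sigma with u
    eps = sigma * (rho * u + (1.0 - rho ** 2) ** 0.5 * rng.gauss(0.0, 1.0))
    if zg + u > 0:                 # d = 1: y is observed
        observed.append(xb + eps)

sample_mean = sum(observed) / len(observed)
print(round(sample_mean, 3), round(selected_mean(xb, zg, rho, sigma), 3))
```

The sample mean of the selected observations should be close to the formula and different from beta'x = 1.0, which is the sense in which the regression on the selected sample omits a variable.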
Labor Supply Model

    NAMELIST ; Z = One,KL6,K618,WA,WE,HA,HE $
    NAMELIST ; X = One,KL6,K618,Age,Agesq,WE,Faminc $
    PROBIT   ; Lhs = LFP ; Rhs = Z ; Hold(IMR=Lambda) $
    SELECT   ; Lhs = WHRS ; Rhs = X $
    REGRESS  ; Lhs = WHRS ; Rhs = X,Lambda $
    REJECT   ; LFP = 0 $
    REGRESS  ; Lhs = WHRS ; Rhs = X $

Participation Equation

+---------------------------------------------+
| Binomial Probit Model                       |
| Dependent variable                  LFP     |
| Weighting variable                 None     |
| Number of observations              753     |
+---------------------------------------------+
+---------+--------------+----------------+--------+---------+----------+
|Variable | Coefficient  | Standard Error |b/St.Er.|P[|Z|>z] | Mean of X|
+---------+--------------+----------------+--------+---------+----------+
 Index function for probability
 Constant    1.00264501      .49994379      2.006    .0449
 KL6         -.90399802      .11434394     -7.906    .0000    .23771580
 K618        -.05452607      .04021041     -1.356    .1751   1.35325365
 WA          -.02602427      .01332588     -1.953    .0508   42.5378486
 WE           .16038929      .02773622      5.783    .0000   12.2868526
 HA          -.01642514      .01329110     -1.236    .2165   45.1208499
 HE          -.05191039      .02040378     -2.544    .0110   12.4913679

Hours Equation

+----------------------------------------------------+
| Sample Selection Model                             |
| Two stage least squares regression                 |
| LHS=WHRS    Mean                 =   1302.930      |
|             Standard deviation   =   776.2744      |
| WTS=none    Number of observs.   =        428      |
| Model size  Parameters           =          8      |
|             Degrees of freedom   =        420      |
| Residuals   Sum of squares       =   .2267214E+09  |
|             Standard error of e  =   734.7195      |
| Correlation of disturbance in regression           |
| and Selection Criterion (Rho)...........  -.84541  |
+----------------------------------------------------+
+---------+--------------+----------------+--------+---------+----------+
|Variable | Coefficient  | Standard Error |b/St.Er.|P[|Z|>z] | Mean of X|
+---------+--------------+----------------+--------+---------+----------+
 Constant   2442.26665     1202.11143       2.032    .0422
 KL6         115.109657     282.008565       .408    .6831    .14018692
 K618       -101.720762      38.2833942    -2.657    .0079   1.35046729
 AGE          14.6359451     53.1916591      .275    .7832   41.9719626
 AGESQ        -.10078602      .61856252     -.163    .8706   1821.12150
 WE         -102.203059      39.4096323    -2.593    .0095   12.6588785
 FAMINC        .01379467      .00345041     3.998    .0001   24130.4229
 LAMBDA     -793.857053     494.541008     -1.605    .1084    .61466207

Selection "Bias"

Two step estimates: as in the Hours Equation table above.

Least squares on the selected sample (LAMBDA omitted):
+---------+--------------+----------------+--------+---------+----------+
|Variable | Coefficient  | Standard Error |t-ratio |P[|T|>t] | Mean of X|
+---------+--------------+----------------+--------+---------+----------+
 Constant   1812.12538     1144.33342       1.584    .1140
 KL6        -299.128041     100.033124     -2.990    .0030    .14018692
 K618       -126.399697      30.8728451    -4.094    .0001   1.35046729
 AGE          11.2795338     53.8442084      .209    .8342   41.9719626
 AGESQ        -.26103541      .62632815     -.417    .6771   1821.12150
 WE          -47.3271780     17.2968137    -2.736    .0065   12.6588785
 FAMINC        .01261889      .00338906     3.723    .0002   24130.4229

Maximum Likelihood Estimation

\[ \log L = \sum_{d=0} \log\big[1 - \Phi(\gamma'z_i)\big] + \sum_{d=1} \log\left\{ \frac{\exp\!\big(-\tfrac12 (\varepsilon_i/\sigma)^2\big)}{\sigma\sqrt{2\pi}}\; \Phi\!\left[\frac{\rho(\varepsilon_i/\sigma) + \gamma'z_i}{\sqrt{1-\rho^2}}\right] \right\} \]

Reparameterize this: let $q_i = \gamma'z_i$.
(1) $\theta = 1/\sigma$
(2) $\delta = \beta/\sigma$ (Olsen transformation)
(3) $\tau = \rho/\sqrt{1-\rho^2}$
(4) Constrain ρ to be in (-1,1) by using $\eta = \tfrac12 \ln\dfrac{1+\rho}{1-\rho} = \mathrm{atanh}\,\rho$, so $\rho = \mathrm{atanh}^{-1}(\eta) = \dfrac{\exp(2\eta) - 1}{\exp(2\eta) + 1}$.

\[ \log L = \sum_{d=0} \log \Phi(-q_i) + \sum_{d=1} \Big\{ \log\theta - \tfrac12 \log 2\pi - \tfrac12 (\theta y_i - \delta'x_i)^2 + \log \Phi\big[\tau(\theta y_i - \delta'x_i) + q_i\sqrt{1+\tau^2}\big] \Big\} \]

MLE

+---------------------------------------------+
| ML Estimates of Selection Model             |
| Maximum Likelihood Estimates                |
| Number of observations              753     |
| Iterations completed                 47     |
| Log likelihood function       -3894.471     |
| Number of parameters                 16     |
| FIRST 7 estimates are probit equation.      |
+---------------------------------------------+
+---------+--------------+----------------+--------+---------+
|Variable | Coefficient  | Standard Error |b/St.Er.|P[|Z|>z] |
+---------+--------------+----------------+--------+---------+
 Selection (probit) equation for LFP
 Constant    1.01350651      .54823177      1.849    .0645
 KL6         -.90129694      .11081111     -8.134    .0000
 K618        -.05292375      .04137216     -1.279    .2008
 WA          -.02491779      .01428642     -1.744    .0811
 WE           .16396194      .02911763      5.631    .0000
 HA          -.01763340      .01431873     -1.231    .2181
 HE          -.05596671      .02133647     -2.623    .0087
 Corrected regression, Regime 1
 Constant   1946.84517     1167.56008       1.667    .0954
 KL6        -209.024866     222.027462      -.941    .3465
 K618       -120.969192      35.4425577    -3.413    .0006
 AGE          12.0375636     51.9850307      .232    .8169
 AGESQ        -.22652298      .59912775     -.378    .7054
 WE          -59.2166488     33.3802882    -1.774    .0761
 FAMINC        .01289491      .00332219     3.881    .0001
 SIGMA(1)    748.131644      59.7508375    12.521    .0000
 RHO(1,2)     -.22965163      .50082203     -.459    .6466

MLE vs. Two Step

 Two Step
 Variable   Coefficient    Standard Error  b/St.Er.  P[|Z|>z]
 Constant   2442.26665     1202.11143       2.032    .0422
 KL6         115.109657     282.008565       .408    .6831
 K618       -101.720762      38.2833942    -2.657    .0079
 AGE          14.6359451     53.1916591      .275    .7832
 AGESQ        -.10078602      .61856252     -.163    .8706
 WE         -102.203059      39.4096323    -2.593    .0095
 FAMINC        .01379467      .00345041     3.998    .0001
 LAMBDA     -793.857053     494.541008     -1.605    .1084
 Standard error of e = 734.7195
 Correlation of disturbance in regression and Selection Criterion (Rho) = -.84541

 MLE
 Variable   Coefficient    Standard Error  b/St.Er.  P[|Z|>z]
 Constant   1946.84517     1167.56008       1.667    .0954
 KL6        -209.024866     222.027462      -.941    .3465
 K618       -120.969192      35.4425577    -3.413    .0006
 AGE          12.0375636     51.9850307      .232    .8169
 AGESQ        -.22652298      .59912775     -.378    .7054
 WE          -59.2166488     33.3802882    -1.774    .0761
 FAMINC        .01289491      .00332219     3.881    .0001
 SIGMA(1)    748.131644      59.7508375    12.521    .0000
 RHO(1,2)     -.22965163      .50082203     -.459    .6466

2 Step Maximum Likelihood Estimation

\[ \log L = \sum_{d=0} \log\big[1 - \Phi(\gamma'z_i)\big] + \sum_{d=1} \log\left\{ \frac{\exp\!\big(-\tfrac12 (\varepsilon_i/\sigma)^2\big)}{\sigma\sqrt{2\pi}}\; \Phi\!\left[\frac{\rho(\varepsilon_i/\sigma) + \gamma'z_i}{\sqrt{1-\rho^2}}\right] \right\} \]

Step 1: Estimate γ using probit on d.
Step 2: Estimate β, σ, ρ using the reduced log likelihood

\[ \log L = \sum_{d=1} \log\left\{ \frac{\exp\!\big(-\tfrac12 (\varepsilon_i/\sigma)^2\big)}{\sigma\sqrt{2\pi}}\; \Phi\!\left[\frac{\rho(\varepsilon_i/\sigma) + \hat\gamma'z_i}{\sqrt{1-\rho^2}}\right] \right\} \]

Reparameterize this: let $\hat q_i = \hat\gamma'z_i$.
(1) $\theta = 1/\sigma$
(2) $\delta = \beta/\sigma$ (Olsen transformation)
(3) $\tau = \rho/\sqrt{1-\rho^2}$
(4) Constrain ρ to be in (-1,1) by using $\eta = \tfrac12 \ln\dfrac{1+\rho}{1-\rho} = \mathrm{atanh}\,\rho$, so $\rho = \mathrm{atanh}^{-1}(\eta) = \dfrac{\exp(2\eta) - 1}{\exp(2\eta) + 1}$ and $\tau = \rho/\sqrt{1-\rho^2}$.

\[ \log L = \sum_{d=1} \Big\{ \log\theta - \tfrac12 \log 2\pi - \tfrac12 (\theta y_i - \delta'x_i)^2 + \log \Phi\big[\tau(\theta y_i - \delta'x_i) + \hat q_i\sqrt{1+\tau^2}\big] \Big\} \]

How to Handle Selectivity

- The 'Mills Ratio' approach - just add a 'lambda' to whatever model is being estimated?
  - The Heckman model applies to a probit model with a linear regression.
  - The conditional mean in a nonlinear model is not something "+lambda".
- The model can sometimes be built up from first principles.

Dear Professor Greene,

I am very sorry to bother you considering this is my first time emailing you.
I am ****************, lecturer in Finance at &&&&&&&&&&& University (Scotland). I am doing a project investigating the impact of hedge fund manager's coinvestment on the survival probability of the fund. As fund managers' coinvestment decision is self-selection which might cause endogeneity issue, I jointly estimate the co-investment decision (Probit model) and the survival probability (Hazard model) to account for endogeneity of co-investment decision. I received one comment saying that I should use Heckman's two step procedure to correct for endogeneity. My understanding is the Heckman's approach applies to a Probit and a LINEAR model. Since hazard model is nonlinear, simply adding inverse Mills ratio in the hazard model is wrong. What I am asking is if my understanding of this is correct? If so, why can we not simply add Mills ratio in a nonlinear model?

A Bivariate Probit Model

Labor force participation equation:
$d^* = \gamma'z + u$, $d = 1(d^* > 0)$
Full time or part time?
$f^* = \beta'x + \varepsilon$, $f = 1(f^* > 0)$

Probability model:
- Nonparticipant: $\mathrm{Prob}[d=0] = \Phi(-\gamma'z)$
- Participant and full time: $\mathrm{Prob}[f=1,d=1] = \mathrm{Prob}[f=1|d=1]\,\mathrm{Prob}[d=1] = \Phi_2(\beta'x,\ \gamma'z,\ \rho)$ (bivariate normal)
- Participant and part time: $\mathrm{Prob}[f=0,d=1] = \mathrm{Prob}[f=0|d=1]\,\mathrm{Prob}[d=1] = \Phi_2(-\beta'x,\ \gamma'z,\ -\rho)$ (bivariate normal)

Sample Selection Model: Estimation

\[ f(y_1,y_2) = \begin{cases} \mathrm{Prob}[y_1 = 1 | y_2 = 1] \cdot \mathrm{Prob}[y_2 = 1] & (y_1 = 1,\ y_2 = 1) \\ \mathrm{Prob}[y_1 = 0 | y_2 = 1] \cdot \mathrm{Prob}[y_2 = 1] & (y_1 = 0,\ y_2 = 1) \\ \mathrm{Prob}[y_2 = 0] & (y_2 = 0) \end{cases} \]

Terms in the log likelihood:
- $(y_1 = 1, y_2 = 1)$: $\Phi_2(\beta_1'x_{i1},\ \beta_2'x_{i2},\ \rho)$ (bivariate normal)
- $(y_1 = 0, y_2 = 1)$: $\Phi_2(-\beta_1'x_{i1},\ \beta_2'x_{i2},\ -\rho)$ (bivariate normal)
- $(y_2 = 0)$: $\Phi(-\beta_2'x_{i2})$ (univariate normal)

Estimation is by full information maximum likelihood. There is no "lambda" variable.
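The three terms in the log likelihood can be sketched as a single function, under the assumption that the bivariate normal CDF is approximated numerically (here with a hypothetical trapezoid rule helper, `bvn_cdf`). The index values and the correlation below are invented for illustration. A useful sanity check is that the three probabilities sum to one for any observation.

```python
from math import log, sqrt
from statistics import NormalDist

_N = NormalDist()  # standard normal distribution


def bvn_cdf(a, b, rho, steps=4000):
    """Phi_2(a, b, rho), integrating phi(t)*Phi((b - rho*t)/sqrt(1-rho^2))
    over t in [-8, a] with the trapezoid rule."""
    lo = -8.0
    if a <= lo:
        return 0.0
    s = sqrt(1.0 - rho ** 2)
    h = (a - lo) / steps

    def f(t):
        return _N.pdf(t) * _N.cdf((b - rho * t) / s)

    total = 0.5 * (f(lo) + f(a)) + sum(f(lo + k * h) for k in range(1, steps))
    return total * h


def loglik_term(y1, y2, xb1, xb2, rho):
    """Log likelihood contribution; y1 is observed only when y2 = 1."""
    if y2 == 0:
        return log(_N.cdf(-xb2))                 # univariate normal
    if y1 == 1:
        return log(bvn_cdf(xb1, xb2, rho))       # (y1 = 1, y2 = 1)
    return log(bvn_cdf(-xb1, xb2, -rho))         # (y1 = 0, y2 = 1)


# Hypothetical index values beta1'x1, beta2'x2 and correlation.
xb1, xb2, rho = 0.4, 0.9, -0.6
for y1, y2 in [(1, 1), (0, 1), (None, 0)]:
    print(y1, y2, round(loglik_term(y1, y2, xb1, xb2, rho), 4))
```

A full estimator would sum these terms over the sample and maximize over the coefficients and rho; that step is not shown here.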
FT/PT Selection Model

+---------------------------------------------+
| FIML Estimates of Bivariate Probit Model    |
| Dependent variable               FULLFP     |
| Weighting variable                 None     |
| Number of observations              753     |
| Log likelihood function       -723.9798     |
| Number of parameters                 16     |
| Selection model based on LFP                |
+---------------------------------------------+
+---------+--------------+----------------+--------+---------+----------+
|Variable | Coefficient  | Standard Error |b/St.Er.|P[|Z|>z] | Mean of X|
+---------+--------------+----------------+--------+---------+----------+
 Index equation for FULLTIME
 Constant     .94532822     1.61674948       .585    .5587
 WW          -.02764944      .01941006     -1.424    .1543   4.17768154
 KL6          .04098432      .26250878       .156    .8759    .14018692
 K618        -.13640024      .05930081     -2.300    .0214   1.35046729
 AGE          .03543435      .07530788       .471    .6380   41.9719626
 AGESQ       -.00043848      .00088406      -.496    .6199   1821.12150
 WE          -.08622974      .02808185     -3.071    .0021   12.6588785
 FAMINC       .210971D-04    .503746D-05    4.188    .0000   24130.4229
 Index equation for LFP
 Constant     .98337341      .50679582      1.940    .0523
 KL6         -.88485756      .11251971     -7.864    .0000    .23771580
 K618        -.04101187      .04020437     -1.020    .3077   1.35325365
 WA          -.02462108      .01308154     -1.882    .0598   42.5378486
 WE           .16636047      .02738447      6.075    .0000   12.2868526
 HA          -.01652335      .01287662     -1.283    .1994   45.1208499
 HE          -.06276470      .01912877     -3.281    .0010   12.4913679
 Disturbance correlation
 RHO(1,2)    -.84102682      .25122229     -3.348    .0008

Full Time = Hours > 1000

Application: Credit Scoring

American Express: 1992
- N = 13,444 applications
  - Observed application data
  - Observed acceptance/rejection of application
- N1 = 10,499 cardholders
  - Observed demographics and economic data
  - Observed default or not in first 12 months
- Full sample is in AmEx.lpj; description shows when imported.
Building a Likelihood for a Poisson Regression Model with Selection

Poisson probability functions:
$P(y_i | x_i) = \exp(-\lambda_i)\,\lambda_i^{y_i} / y_i!$

Covariates and unobserved heterogeneity:
$\lambda(x_i, \varepsilon_i) = \exp(x_i'\beta + \varepsilon_i)$

Conditional contribution to the log likelihood:
$\log L_i | \varepsilon_i = -\lambda(x_i, \varepsilon_i) + y_i \log \lambda(x_i, \varepsilon_i) - \log y_i!$

Probit selection mechanism:
$d_i^* = z_i'\gamma + u_i$, $d_i = 1[d_i^* > 0]$

\[ [\varepsilon_i, u_i] \sim BVN\!\left[\begin{pmatrix} 0 \\ 0 \end{pmatrix}, \begin{pmatrix} \sigma^2 & \rho\sigma \\ \rho\sigma & 1 \end{pmatrix}\right] \]

$y_i, x_i$ observed only when $d_i = 1$.

Building the Likelihood

The conditional probit probability:
$u_i | \varepsilon_i \sim N[(\rho/\sigma)\varepsilon_i,\ (1 - \rho^2)]$

\[ \mathrm{Prob}[d_i = 1 | z_i, \varepsilon_i] = \Phi\!\left[\frac{z_i'\gamma + (\rho/\sigma)\varepsilon_i}{\sqrt{1-\rho^2}}\right] \]

\[ \mathrm{Prob}[d_i = 0 | z_i, \varepsilon_i] = \Phi\!\left[\frac{-z_i'\gamma - (\rho/\sigma)\varepsilon_i}{\sqrt{1-\rho^2}}\right] \]

Conditional contribution to the likelihood:
$L_i(y_i, d_i = 1) | \varepsilon_i = f(y_i | x_i, \varepsilon_i, d_i = 1)\,\mathrm{Prob}[d_i = 1 | z_i, \varepsilon_i]$
$L_i(d_i = 0) | \varepsilon_i = \mathrm{Prob}[d_i = 0 | z_i, \varepsilon_i]$

Conditional Likelihood

Conditional density (not the log):
$f(y_i, d_i = 1 | \varepsilon_i) = [f(y_i | \varepsilon_i, d_i = 1)]\,\mathrm{Prob}[d_i = 1 | \varepsilon_i]$
$f(y_i, d_i = 0 | \varepsilon_i) = \mathrm{Prob}[d_i = 0 | \varepsilon_i]$

Unconditional densities:

\[ f(y_i, d_i = 1) = \int_{-\infty}^{\infty} [f(y_i | \varepsilon_i, d_i = 1)]\,\mathrm{Prob}[d_i = 1 | \varepsilon_i]\; \frac{1}{\sigma}\phi\!\left(\frac{\varepsilon_i}{\sigma}\right) d\varepsilon_i \]

\[ f(y_i, d_i = 0) = \int_{-\infty}^{\infty} \mathrm{Prob}[d_i = 0 | \varepsilon_i]\; \frac{1}{\sigma}\phi\!\left(\frac{\varepsilon_i}{\sigma}\right) d\varepsilon_i \]

Log likelihoods:
$\log L_i = \log f(y_i, d_i)$

Poisson Model with Selection

- Strategy:
  - Hermite quadrature or maximum simulated likelihood.
  - Not by throwing a 'lambda' into the unconditional likelihood.
- Could this be done without joint normality?
  - How robust is the model?
  - Is there any other approach available?
  - Not easily. The subject of ongoing research.

Nonnormality Issue

- How robust is the Heckman model to nonnormality of the unobserved effects?
- Are there other techniques?
  - Parametric: Copula methods
  - Semiparametric: Klein/Spady and series methods
- Other forms of the selection equation - e.g., multinomial logit
- Other forms of the primary model: e.g., as above.
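Of the two strategies named for the Poisson model with selection, the simulation version is the easier one to sketch with the standard library alone. The block below averages the conditional likelihood over draws of the heterogeneity term, eps_r = sigma*v_r. It is an illustration under assumptions: the function names, the seed, the number of draws, and all parameter values are hypothetical, and a real estimator would sum the logs of these terms over observations and maximize over the parameters.

```python
import random
from math import exp, lgamma, log, sqrt
from statistics import NormalDist

_cdf = NormalDist().cdf  # standard normal CDF


def poisson_pmf(y, lam):
    """exp(-lam) * lam**y / y!, computed on the log scale."""
    return exp(-lam + y * log(lam) - lgamma(y + 1))


def simulated_likelihood(y, d, xb, zg, sigma, rho, draws):
    """Average the conditional likelihood over eps_r = sigma * v_r.

    d = 1: f(y | eps) * Prob[d = 1 | eps];  d = 0: Prob[d = 0 | eps].
    """
    s = sqrt(1.0 - rho ** 2)
    total = 0.0
    for v in draws:
        eps = sigma * v
        arg = (zg + (rho / sigma) * eps) / s
        if d == 1:
            total += poisson_pmf(y, exp(xb + eps)) * _cdf(arg)
        else:
            total += _cdf(-arg)
    return total / len(draws)


rng = random.Random(70)                       # fixed seed for reproducibility
draws = [rng.gauss(0.0, 1.0) for _ in range(2000)]

# Hypothetical index values x'beta, z'gamma and parameters sigma, rho.
xb, zg, sigma, rho = 0.5, 0.2, 0.5, 0.4
print(log(simulated_likelihood(3, 1, xb, zg, sigma, rho, draws)))
print(log(simulated_likelihood(None, 0, xb, zg, sigma, rho, draws)))
```

Reusing the same draws for every observation and every parameter value keeps the simulated log likelihood smooth in the parameters, which matters for the optimizer.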
Application: Health Care Usage

German Health Care Usage Data, 7,293 Individuals, Varying Numbers of Periods
This is an unbalanced panel with 7,293 individuals. There are altogether 27,326 observations. The number of observations per individual ranges from 1 to 7. (Frequencies are: 1=1525, 2=2158, 3=825, 4=926, 5=1051, 6=1000, 7=987.) (Downloaded from the JAE Archive.)

Variables in the file are:
DOCTOR = 1(number of doctor visits > 0)
HOSPITAL = 1(number of hospital visits > 0)
HSAT = health satisfaction, coded 0 (low) - 10 (high)
DOCVIS = number of doctor visits in last three months
HOSPVIS = number of hospital visits in last calendar year
PUBLIC = insured in public health insurance = 1; otherwise = 0
ADDON = insured by add-on insurance = 1; otherwise = 0
HHNINC = household nominal monthly net income in German marks / 10000 (4 observations with income = 0 were dropped)
HHKIDS = children under age 16 in the household = 1; otherwise = 0
EDUC = years of schooling
AGE = age in years
MARRIED = marital status
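The indicator and scaling definitions in the variable list can be made concrete with a short sketch. The records below are made-up toy values, not rows from the German health care data, and the field `income_dm` (monthly income in marks before scaling) and the helper `add_derived` are hypothetical names introduced only for this illustration.

```python
# Toy records with invented values; not taken from the actual data file.
rows = [
    {"DOCVIS": 0, "HOSPVIS": 0, "income_dm": 3500.0},
    {"DOCVIS": 4, "HOSPVIS": 0, "income_dm": 2200.0},
    {"DOCVIS": 1, "HOSPVIS": 2, "income_dm": 5100.0},
]


def add_derived(row):
    """Return a copy of the record with the derived variables attached."""
    out = dict(row)
    out["DOCTOR"] = 1 if row["DOCVIS"] > 0 else 0      # 1(number of doctor visits > 0)
    out["HOSPITAL"] = 1 if row["HOSPVIS"] > 0 else 0   # 1(number of hospital visits > 0)
    out["HHNINC"] = row["income_dm"] / 10000.0         # income in marks / 10000
    return out


derived = [add_derived(r) for r in rows]
```

DOCTOR and HOSPITAL are therefore binary transformations of the count variables DOCVIS and HOSPVIS rather than separately measured quantities.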