A COMPANION TO Theoretical Econometrics
Simulation Methods
In binary QRM there is little basis to choose between the logit and probit models because of the similarity of the cumulative normal and the logistic distributions. In the multinomial situation this is not the case. The multinomial logit (MNL) model has a closed form representation and is computationally tractable but lacks flexibility due to the IIA property. The multinomial probit (MNP) model gives flexibility but is computationally burdensome because of the need to evaluate multidimensional integrals. Another problem which involves high dimensional integrals is the probit (or tobit) model with serially correlated errors.
Until a few years ago, only 3 or 4 dimensional integrals could be evaluated. However, developments during the last decade on simulation based estimation allow us to estimate otherwise intractable models by approximating highdimensional integrals. A simple exposition of the econometric literature on simulation methods can be found in Stern (1997).
The generic problem for simulation is to evaluate expressions like E[ g(x)] = / g(x)f (x)dx where x is a random variable with density f(x) and g(x) is a function of x. The simulation method consists of drawing samples x1, x2,..., xN from f(x) and computing g(x;). Then, E[g(x)] = N £N1 g(x.) is an unbiased estimator of E[g(x)], and its variance is var(g(x;))/N. As N ^ ™, the variance of the simulator goes to zero. f (x) can be a discrete distribution.
Different applications of the simulation method depend on refinements in the way the sampling of f(x) is done. These refinements in probability simulators fall into the following categories: importance sampling, GHK simulator, Gibbs sampling, and antithetic variables. Importance sampling involves oversampling some "important" parts of f(x) from a well-chosen distribution g(x) from which it is easy to sample. The GHK simulator algorithm decomposes the joint density
of the errors into a product of conditional densities. It has been found to perform very well for simulating multinomial probit probabilities. Gibbs sampling is an iterative sampling method from conditional densities. Finally, the method of antithetic variables can be used on the above probability simulators to reduce sampling costs and the variance of the simulator through the sampling of pairs of random variables which are negatively correlated.8
The estimation methods commonly used are the method of simulated moments (MSM), the method of simulated likelihood (MSL), and the method of simulated scores (MSS). A detailed review of the above estimation methods can be found in Hajivassiliou and Ruud (1994) and in Geweke, Houser, and Keane, Chapter 22, in this companion.
The MSM is based on the GMM (generalized method of moments) method of estimation. The least squares method is a simple example of the method of moments. The GMM method depends on orthogonality conditions. For example, Avery, Hansen, and Hotz (1983) suggest how to estimate the multiperiod probit model by a set of within period orthogonality conditions that allows consistent estimation of the parameters and their standard errors under serial correlation. In many problems, the orthogonality conditions cannot be evaluated analytically. This is where simulation methods help. The resulting estimator is the MSM estimator. Let us illustrate this method with the MNP model. The loglikelihood function for a sample of size n with m alternatives is given by (17.9). The MSM consists of simulating P(j/x, P) using the multivariate normal distribution and substituting it in the moment equation:
XX - P(j/x, P)j = 0, (17.20)
i=1 j=1
for some weighting function w (k is an index taking on the values 1,..., K where K is the dimension of P). The estimator SMSM is obtained by solving (17.20).
The MSL is based on the maximum likelihood method of estimation and is useful when the likelihood function is not easily computed by analytical methods. In the context of multinomial choice models, MSL consists in simulating the actual choice probabilities given by Pjx, P) in (17.9). SMSL is obtained by maximizing (17.9) once the simulated choice probabilities are substituted.
The MSS is based on the idea that the score statistic (the derivative of the likelihood function) should have an expected value of zero at the true value of p. The potential advantage of the MSS relative to MSM is that it uses the efficiency properties of ML, but it is computationally more difficult than the MSM since the weight function itself has to be simulated, while it is analytically computed on the MSM. To see this, note that the MSS implies simulating P( j/x, P) in the following expression:
which makes clear that the weight function itself has to be simulated.
There are several papers using simulation methods but there are very few that compare the different methods. Geweke, Keane, and Runkle (1994) compare different methods in the estimation of the multinomial probit model, based on two Monte Carlo experiments for a seven choice model. They compare the MSL estimator using the GHK recursive probability simulator, the MSM method using the GHK recursive probability simulator and kernel-smoothed frequency simulators, and Bayesian methods based on Gibbs sampling. Overall, the Gibbs sampling algorithm had a slight edge, while the relative performance of the MSM and MSL (based on the GHK simulator) was difficult to evaluate. The MSM with the kernel-smoothed frequency simulator was clearly inferior.
In another paper, Geweke, Keane, and Runkle (1997) again compare the Bayesian methods based on Gibbs sampling and the MSM and MSL methods based on the GHK simulator. They do Monte Carlo studies with AR(1) errors in the multinomial multiperiod probit model, finding that the Gibbs sampling algorithm performs better than the MSM and MSL, especially under strong serial correlation in the disturbances (e. g. an AR(1) parameter of 0.8). They also find that to have root mean squared errors (RMSEs) for MSL and MSM within 10 percent of the RMSEs by the Gibbs sampling method one needs samples of 160 and 80 draws respectively (much higher than the 20 draws normally used). Thus, with serially correlated errors, the performance ranking is Gibbs sampling first, MSM second, and MSL last.
Keane (1993) discusses in detail the MSM estimator of limited dependent variable models with general patterns of serial correlation. He uses the recursive GHK algorithm. He argues that the equicorrelation model (implicit in the random effects probit model), for which the Butler and Moffitt algorithm works, is not empirically valid.
Notes
* We would like to thank three anonymous referees, the editor, Stephen Cosslett, and Kajal Lahiri for helpful comments. Remaining errors are our own. A. Flores-Lagunes gratefully acknowledges financial support from the National Council for Science and Technology of Mexico (CONACYT).
1 In (17.11) we need some normalization, like setting the first element of P to 1.
2 In the case of the binary choice models (logit and probit), Davidson and MacKinnon (1989) show that the score test statistic based on the exact information matrix can be computed easily using a particular artificial regression. See also Davidson and MacKinnon Chapter 1 in this companion.
3 We draw here from Gourieroux (1989).
4 Note that the approximating density is positive for all u.
5 GLL suggest, based on simulations, that using K = 3 allows for considerable flexibility in the distribution.
6 The asymptotic bias of the estimator will become negligible as K increases. GLL suggest starting with K = 3 (see previous fn.) and use score tests to determine whether such a value of K is appropriate.
7 The corresponding theorems can be found in Gallant and Nychka (1987).
8 References on probability simulators are Geweke, Keane, and Runkle (1994, 1997), Stern (1997), and Tanner (1996). For the reader interested in applying these methods,
Vassilis Hajivassiliou offers GAUSS and FORTRAN routines for the GHK simulator and other simulators at the following electronic address: http://econ. lse. ac. uk/~vassilis/ pub/simulation/gauss/
References
Amemiya, T. (1981). Qualitative response models: A survey. Journal of Economic Literature 19, 483-536.
Avery, R., L. Hansen, and V. J. Hotz (1983). Multiperiod probit models and orthogonality condition estimation. International Economic Review 24, 21-35.
Baltagi, B. H. (1999). Specification tests in panel data models using artificial regressions. Annales d’Economie et de Statistique 55-6, 277-98.
Butler, I., and R. Moffitt (1982). A computationally efficient procedure for the one factor multinomial probit model. Econometrica 50, 761-4.
Chamberlain, G. (1980). Analysis of covariance with qualitative data. Review of Economic Studies, 47, 225-38.
Chamberlain, G. (1984). Panel data. In Z. Griliches and M. D. Intrilligator (eds.) Handbook of Econometrics, Volume 2. Amsterdam and New York: North-Holland, ch. 22.
Chamberlain, G. (1985). Heterogeneity, omitted variable bias and duration dependence. In J. J. Heckman and B. Singer (eds.) Longitudinal Analysis of Labor Market Data. Cambridge: Cambridge University Press.
Cosslett, S. R. (1983). Distribution-free maximum likelihood estimator of the binary choice model. Econometrica 51, 765-82.
Davidson, R., and J. G. MacKinnon (1989). Testing for consistency using artificial regressions. Econometric Theory 50, 363-84.
Debreu, G. (1960). Review of R. D. Luce "Individual Choice Behavior". American Economic Review 50, 186-8.
Gabler, S., F. Laisney, and M. Lechner (1993). Seminonparametric estimation of binary - choice models with application to labor-force participation. Journal of Business and Economic Statistics 11, 61-80.
Gallant, R., and D. W. Nychka (1987). Seminonparametric maximum likelihood estimation.
Econometrica 55, 363-90.
Geweke, J. F., M. P. Keane, and D. E. Runkle (1994). Alternative computational approaches to inference in the multinomial probit model. Review of Economics and Statistics 76, 609-32.
Geweke, J. F., M. P. Keane, and D. E. Runkle (1997). Statistical inference in the multinomial multiperiod probit model. Journal of Econometrics 80, 125-65.
Gourieroux, C. (1989). Econometrie des Variables Qualitatives, 2nd edn. Paris: Economica.
Gourieroux, C., and A. Monfort (1995). Statistics and Econometrics Models, Volume 1. Cambridge: Cambridge University Press.
Gourieroux, C., A. Monfort, and A. Trognon (1987). Generalised residuals. Journal of Econometrics 34, 5-32.
Hajivassiliou, V. A., and P. A. Ruud (1994). Classical estimation methods for LDV models using simulation. In R. F. Engle and D. L. McFadden (eds.) Handbook of Econometrics, Volume 4. Amsterdam and New York: North-Holland, ch. 40.
Hausman, J. (1978). Specification tests in econometrics. Econometrica 46, 1251-71.
Heckman, J. J. (1981). Statistical models for discrete panel data. In C. F. Manski and D. L. McFadden (eds.) Structural Analysis of Discrete Data with Econometric Applications. Cambridge, MA: MIT Press.
Honore, B. E., and E. Kyriazidou (2000). Panel data discrete choice models with lagged dependent variables. Econometrica 68, 839-74.
Horowitz, J. L. (1993). Semiparametric and nonparametric estimation of quantal response models. In G. S. Maddala, C. R. Rao, and H. D. Vinod (eds.) Handbook of Statistics, Volume 11. Amsterdam and New York: North-Holland, ch. 2.
Johnson, N. L. and S. Kotz (1972). Continuous Multivariate Distributions. New York: Wiley.
Keane, M. P. (1993). Simulation estimation for panel data models with limited dependent variables. In G. S. Maddala, C. R. Rao, and H. D. Vinod (eds.) Handbook of Statistics, Volume 11. Amsterdam and New York: North-Holland, ch. 20.
Kim, J., and D. Pollard (1990). Cube root asymptotics. The Annals of Statistics 18, 191-219.
Klein, R. W., and R. H. Spady (1993). An efficient semiparametric estimator for discrete choice models. Econometrica 61, 387-421.
Lee, L. F. (1994). Semiparametric instrumental variables estimation of simultaneous equation sample selection models. Journal of Econometrics 63, 341-88.
Lee, L. F. (1997). Some common structures of simulated specification tests in multinormal discrete and limited dependent variable models. Working Paper 97/04, Hong Kong University of Science and Technology.
Maddala, G. S. (1983). Limited Dependent and Qualitative Variables in Econometrics. Cambridge: Cambridge University Press.
Maddala, G. S. (1987). Limited dependent variable models using panel data. Journal of Human Resources 22, 307-38.
Maddala, G. S. (1995). Specification tests in limited dependent variable models. In G. S. Maddala, P. C.B. Phillips and T. N. Srinivasan (eds.) Advances in Econometrics and Quantitative Economics: Essays in Honor of C. R. Rao. pp. 1-49. Oxford: Blackwell.
Manski, C. F. (1975). Maximum score estimation of the stochastic utility model of choice. Journal of Econometrics 3, 205-28.
Manski, C. F. (1985). Semiparametric analysis of discrete response: Asymptotic properties of the maximum score estimator. Journal of Econometrics 27, 313-33.
Manski, C. F., and S. Thompson (1986). Operational characteristics of maximum score estimation. Journal of Econometrics 32, 85-108.
McFadden, D. L. (1974). Conditional logit analysis of qualitative choice behavior. In P. Zarembka (ed.) Frontiers in Econometrics. New York: Academic Press.
McFadden, D. L. (1981). Econometric models of probabilistic choice. In C. F. Manski and D. L. McFadden (eds.) Structural Analysis of Discrete Data with Econometric Applications. Cambridge, MA: MIT Press.
McFadden, D. L. (1984). Econometric analysis of qualitative response models. In Z. Griliches and M. D. Intrilligator (eds.) Handbook of Econometrics, Volume 2. Amsterdam and New York: North-Holland, ch. 24.
Newey, W. K. (1985). Maximum likelihood specification testing and conditional moment tests. Econometrica 53, 1047-70.
Newey, W. K. (1990). Semiparametric efficiency bounds. Journal of Applied Econometrics 5, 99-135.
Orme, C. (1990). The small-sample performance of the information matrix test. Journal of Econometrics 46, 309-31.
Pagan, A. R. and F. Vella (1989). Diagnostic tests for models based on unit record data: A survey. Journal of Applied Econometrics 4, 529-59.
Powell, J. (1994). Estimation of semiparametric models. In R. F. Engle and D. L. McFadden (eds.) Handbook of Econometrics, Volume 4. Amsterdam and New York: North-Holland, ch. 41.
Stern, S. (1997). Simulation-based estimation. Journal of Economic Literature 35, 2006-39.
Tanner, M. A. (1996). Tools for Statistical Inference: Methods for the Exploration of Posterior Distributions and Likelihood Functions. New York: Springer-Verlag.
Tauchen, G. (1985). Diagnostic testing and evaluation of maximum likelihood models. Journal of Econometrics 30, 415-43.
Taylor, L. (1991). Testing exclusion restrictions for misspecified tobit model. Economics Letters 37, 411-16.
Thompson, T. S. (1989a). Identification of semiparametric discrete choice models. Discussion Paper 249, Center for Economic Research, University of Minnesota.
Thompson, T. S. (1989b). Least squares estimation of semiparametric discrete choice models.
Manuscript, Department of Economics, University of Minnesota.
White, H. (1982). Maximum likelihood estimation of misspecified models. Econometrica 50, 1-25.
White, H. (1983). Corrigendum. Econometrica 51, 513.