You are on page 1of 5

Economics Letters North-Holland

38 (1992) 269-273

269

A nested Tobit analysis for a sequentially censored regression model *


Byung-Joo
Unkersity

Lee
Boulder CO, USA

of Colorado,

Received Accepted

15 November 1991 17 January 1992

This paper proposes a nested Tobit analysis for a sequentially censored and analysis of multiple-step contingent valuation surveys. A consistent proper log-likelihood function is derived.

regression maximum

model. This model is useful for the design likelihood estimation is proposed, and the

1. Introduction

Limited dependent variable analysis has made rigorous econometric applications possible for a wide variety of survey designs. Survey data often consist in part of qualitative binary choice variables. In contingent valuation surveys, a subset of respondents will also have quantitative values. In such cases, various versions of Tobit model may be appropriate. There has been extensive research on most of those limited dependent variable models. McFadden (1973) and Amemiya (1973) led the early pioneering work, followed by the more elaborate simultaneous equation models of Nelson and Olson (19781, Heckman (19741, Amemiya (1974) and Lee (1978). Recent research interests shifted toward the semi-parametric or the non-parametric analysis, e.g., by Manski (1975), Powell (19861, Stoker (1986) and Ichimura (1987). However, for a certain type of survey structure, the existing models are not readily applicable. For example, consider a situation in which a policy maker is interested in providing several different levels of benefit with an additional tax payers burden. An appropriate survey would then consist of two parts: first, whether or not the respondent is willing to accept the benefit with an additional tax payment; and secondly, if accepting the proposal, how much they would be willing to pay more to get the benefit. For those who responded yes, the same questions are repeated for the next level of benefit. For this type of data, the existing models do not properly utilize the

Correspondence to: Byung-Joo Lee, Department of Economics, University of Colorado, Boulder, CO 80309-0256, USA. * The proposed model was first brought to my attention by my colleague, Charles Howe who used a sequentially censored sample in a contingent valuation survey. I thank him for calling to my attention the importance of this type of model in designing and analyzing surveys. 01651765/92/$05.00 0 1992 - Elsevier Science Publishers B.V. All rights reserved

270

B.-J. Lee /A nested Tobit analysis

additional information. A nested logit model is not appropriate because it has a quantitative response. A simple Tobit model is also inappropriate because there are sequentially censored responses from various levels of benefit. This paper proposes an appropriate estimation model which deals with sequentially censored quantitative responses. The proposed model is called a nested Tobit model which is a hybrid of the Tobit and the nested logit model. The next section develops the nested Tobit model and derives a log-likelihood function. Section 3 provides implications of this model and expands the scope of its applicability.

2. The nested Tobit model

This paper deals with only two-step sequential responses. Even though the multiple-step sequential responses can be modeled theoretically, the computational burden may not warrant its empirical application. Consider the following two-equation model:

where matrix

ui = (u,~, uZ,) has an i.i.d. bivariate

normal

distribution,

with mean

zero

and covariance

,=

l I
(T1 u12 u2 2 . u21

The censored variable y ,, is observed if x iip, + u ,; > 0, and 0 otherwise. Once y ,; is observed, yzI is sequentially observed if xiip2 + u2i > 0, and 0 otherwise. x,, and x1; are (k, x 1) and (k2 x 1) observable vectors, and they could be the same set of explanatory variables. This model is similar to Type-2 or Type-3 Tobit models of Amemiya (1984). However, neither of those Tobit models handle the sequential nature of the nested Tobit model properly. The proposed model is summarized as follows: yl*; =y,; if x;,p, + u,! > 0, if if xi,& + uZi > 0 50

y2,*

=yzi

i = 0 if x;,p, + u,, 5 0.

ZZ0

x;,&+u~~

(2)

Since we are dealing with the normal multiple sequential responses.

density

function,

a multivariate

integral

evaluation

is needed

for each step of the

B.-J. Lee /A

nested Tobit analysis

271

This model can be estimated consistently by maximum likelihood estimation. Define two index variables LI,~ = 1(~;~/3r + uli > 0) and d,, = l(&p2 + ~2~> 01, where the indicator function l(A) is one if A is true, and zero otherwise. Using these index variables, the likelihood function can be written as follows:
L=dIlopr(YE-<o)

I,

d,,= 1 [

FI

d,,=O

LI pr(Y,*,soJ

Yli)dIl,f(Y2,~

Yli)
1

(3)

2,

where n stands for the product over d,i and dzi. This likelihood function is a combination of the probability function and the density function. For a specific form of the likelihood function, consider each term of eq. (3) separately. Since ui has a bivariate normal distribution, the first term can be written as
Pr( yl*, 2 0) = Pr(

x;,p, + u,j

IO) = 1 - @ i

x;,p,,
@I

(4)

where @p( .) is the cumulative normal density function. The second term is

The joint normal density function f(yg, yIi) can be decomposed into the product of the conditional normal density function and the marginal normal density function. Then, eq. (5) can be expressed as

In eq. (61, f(yu) is the univariate normal density function with mean x;,p, and variance CT:, and f(yz I yr,) is the (conditional) univariate normal density given yz =yri, with mean x&p, + -(y,, -x;,p,> and variance (aI - u~~u;~ ). Therefore, the second term of eq. (3) is U12Ul

where c$(.> is the normal density function. The last term of eq. (3) is a bivariate normal density function for yzi and Y,~ such that

f(Y2;,

Y,i)

= &I

cI-1/2

exp[-f(Y~-xlP)-C-(Y,-xllj)],

(8)

where

272

B.-J. Lee / A nested Tohit analysis

and exp(.> is the exponential function. Collecting all these results, hood function to estimate 0 and 2 for the nested Tobit model:

we have the following

log-likeli-

+&

log &I
(

2 I -l/Z

eXp[

- +( yi -Xip)Z-(

Yi -Xi@)])

I)
(9)

From this log-likelihood function, each parameter of p and 2 is identified separately. These parameters are estimated iteratively using Newton-Raphson method or method of scoring. 2 This log-likelihood function has a univariate normal cumulative density function, which involves a single integral calculation. However, for the multiple-step response model, the log-likelihood function contains a multivariate integral evaluation. This can be quite costly even with high-speed computers.

3. Implications

and conclusion

This model is particularly important in a situation where a policy maker would like to know the marginal response rate for different levels of benefit. If we consider y,* = <yc, y,*,> as indirect utility functions with the relevant socio-economic variables xl, then a change from y: to yc. is a marginal indirect utility, and /3 can be interpreted as a marginal response rate of the socio-economic variables to the different levels of benefit. The same model can also be used to determine the optimal fee structure for the different levels of service for different groups. For various options of service, differential service fees can be assigned for different socio-economic groups using the estimated p. This paper proposes a nested Tobit model for sequentially censored samples. A nested Tobit model is similar to the nested logit model for the sequential choice problem. However, when we have sequential quantitative responses, the existing models do not properly utilize this additional information. A typical approach for the sequentially censored response model is to pool the quantitative responses as if each response were an independent observation. This procedure
* I have attempted to derive the analytic complicated because of the conditional log-likelihood function numerically. expression for the information matrix of eq. (9). However, normal density function. Computer evaluates the second it is unnecessarily derivatives of the

B.-J. Lee /A

nesfed Tobit analysis

273

incorrectly assumes that each response is independent, estimates. The proposed maximum likelihood function tently, considering the sequential nature of responses.

and it produces inconsistent parameter estimates the model parameters consis-

References
Amemiya, T., 1973, Regression analysis when the dependent variable is truncated normal, Econometrica 41, 997-1016. Amemiya, T., 1974, Multivariate regression and simultaneous equation models when the dependent variables are truncated normal, Econometrica 42, 999-1012. Amemiya, T., 1984, Tobit models: A survey, Journal of Econometrics 24, 3-61. Heckman, J., 1974, Shadow prices, market wages, and labor supply, Econometrica 42, 679-694. Ichimura, H., 1987, Estimation of single index models. Ph.D. thesis (MIT, Cambridge, MA). Lee, L.F., 1978, Unionism and wage rates: A simultaneous equation model with qualitative and limited dependent variables, International Economic Review 19, 415-433. McFadden, D., 1973, Conditional logit analysis of qualitative choice behaviour, in: P. Zarembka, ed., Frontiers in econometrics (Academic Press, New York). Manski, C., 1975, Maximum score estimation of the stochastic utility model of choice, Journal of Econometrics 3, 205-228. Nelson, F.D. and L. Olson, 1978, Specification and estimation of a simultaneous equation model with limited dependent variables, International Economic Review 19, 695-709. Powell, J., 1986, Symmetrically trimmed least squares estimation for Tobit models, Econometrica 54, 1435-1460. Stoker, T., 1986, Consistent estimation of scaled coefficients, Econometrica 54, 1461-1481.

You might also like