Professional Documents
Culture Documents
Modeling
Modeling
Multiple regression is used widely as a tool for estimating tourism demand functions.
Problems of heteroscedasticity, multicollinearity, and autocorrelation are well recognized
by modelers, but questions ofmodel specification are less widely understood. Misspecifi-
cation of the model, such as failure to include an important explanatory variable or a
wrongfunctionalform, can have significant impacts on the model estimated. The evidence
to date is that more complex or general functional forms give better models than more
simple functions such as the commonly used log-linear model. It is reasonable to expect
that tourism demand is a dynamicfactor, so the lag structure oftourism demand should be
considered in any model. A quadratic functionalform for this structure is derived.
Forecasts of tourism demand are needed for planning, encountered. Econometric texts such as Judge et al. (1982)
policymaking, and budgeting purposes by tourism operators, discuss these matters at the necessary depth. There are also
investors, and government bodies concerned with tourism. model specification difficulties (see Judge et al. [ 1982] for an
Nearly all forecasting methods generally available can be introduction to this complex area). For example, misspecifi-
applied to tourism demand, as is demonstrated by Uysal and cation of the model structure, such as failure or inability to
Crompton (1985) and by Calantone, di Benedetto, and include an important explanatory variable in the model or use
Bojanic (1987). Time series methods (such as moving aver- of an inadequate functional form for the model, sometimes
ages, Holt-Winters exponential smoothing, decomposition, manifests itself as significant evidence of heteroscedasticity.
and Box-Jenkins methods) rely on the continuance of past Multiple regression is used to fit the model form specified by
time relation patterns into the future, while causal or econo- the modeler with those variables chosen for inclusion by
metric methods model the demand as related to other, explana- modeler. The necessary and important decisions as to model
tory variables. The assumption that past trends and patterns form and explanatory variable set are made subjectively,
will continue into the future is an uncertain one for tourism perhaps expertly, by the modeler and not by the methodology.
demand; also, tourism planners consider that demand is influ- The use of methods such as &dquo;best subsets&dquo; regression or
enced by other, exogenous variables such as exchange rates, stepwise regression cannot negate the importance of the
fares, and incomes. Because of these factors, causal methods modeler’s own input.
which estimate a specified demand model in terms of chosen In a study of the suitability of regression analysis for
explanatory variables using multiple regression techniques to estimating tourism demand, Summary( 1987) concludes that
&dquo;
estimate the model parameters are often preferred. Econo- ’typical’ multivariate demand functions estimated by ordi-
metric models have the advantages of enabling investigation nary least squares regression may not represent the optimal
of causal effects of changes in the explanatory variables on the technique to use in all tourism demand studies.&dquo; This cau-
demand variable and of &dquo;what if&dquo; simulations (Witt and tiously worded but negative conclusion is critical of the
Martin 1987; Uysal and Crompton 1985). There is also the common use of regression for demand modeling, which con-
option of using more complex models which attempt to siders a limited set of independent variables to &dquo;explain&dquo;
combine the time series and causal methods, such as econo- demand and minimal theoretical or practical consideration of
metric models which incorporate time series structures and the model form. It is unlikely, however, that the widespread
Box-Jenkins methods, with transfer functions. Such models, use of multiple regression for estimating tourism demand
along with the incorporation of tourism into large, multiple functions will diminish, given its widely perceived advantages
equation econometric models, pose special problems in anal- over other ways of forecasting demand. The demand mea-
ysis and interpretation. sured by a regression model is a theoretical construct involv-
With the widespread availability of computer packages ing the functional form and the variables included in the
which can perform the necessary calculations, multiple re- model. Using it as a demand measure involves assuming that
gression is now reasonably easy to use. Less widespread is an most of the effect of determinants of demand is captured by
understanding of some of the subtle difficulties that may be the explanatory variables included in the model, is adequately
present in any use of the technique, such as those arising from modeled by the functional form of the model, and is pre-
the error distribution and from model misspecification. dominantly systematic. If these assumptions are valid, then
Heteroscedasticity, multicollinearity, and autocorrelation the error structure, which is made up of the differences be-
problems may be discovered in error distributions, and tech- tween the modeled demand and the actual numbers of tourist
nical methods are available for addressing such problems. visitors, will have values which are, for practical purposes,
However, the available methods cannot handle all of these random and allow the effects of the determinants to be reliably
problems, nor do these problems exhaust those that may be estimated. In this article the functional form of the demand
model, the variables used in the model, and the inter-temporal
structure are discussed to suggest ways the typical demand
Clive Morley is at the Graduate School of Management,
Royal Melbourne Institute of Technology, Victoria, Australia. modeling of tourism using the powerful tool of multivariate
regression analysis can be improved.
Downloaded from jtr.sagepub.com at PENNSYLVANIA STATE UNIV on March 6, 2016
40
FUNCTIONAL FORM depend on the functional form, and that more complex and
general forms perform best, notably the translog form ahead
Many admissible functional forms for travel and tourism of the Box-Cox transformation and then the log-linear form,
demand models exist, and it is not practical to consider all the with the linear and logit forms further behind. Gaudry and
possibilities every time a model is to be estimated. The set of Wills (1978) use the Box-Tukey transformation (Tukey
forms to be considered can be reduced to manageable propor- 1957; Box and Cox 1964) as a general functional form,
tions by deriving forms theoretically from basic principles estimating the parameters of the transformation from the data
inherent in the structure of the demand being modeled. Alter- using maximum likelihood. The Box-Cox transformation,
natively, a fairly general form can be used and the specific and hence the linear and log-linear functional forms, is a
form derived by estimation from the data. Less soundly based special case of the Box-Tukey transformation. Gaudry and
is to assume a specific form and ignore other possibilities. Wills conclude that this flexible functional form is superior to
Many studies assume a multiplicative form of model made commonly used forms such as the log-linear form for total
linear by a logarithmic transformation of the variables-the demand and the logit model for modal shares. They argue that
log-linear form (Arbel and Ravid 1985; Poole 1988; Strong as the best functional form (the one with maximum likelihood
and Redman 1982; Summary 1987; Witt and Martin 1987). estimated parameter values) has the most convincing signs
Other functional forms that have been used to model tourism and values for elasticities then the practice of trying a few
demand include the Almost Ideal Demand System (O’Hagan forms and using the one that yields the best results has some
and Harrison 1984; White 1985), logit regression (Sheldon rationale and can improve on the use of standard model forms.
and Mak 1987), a Box-Cox transformation of the variables While these studies are not conclusive, they do indicate that
(Fujii and Mak 1981), and a specifically built form (Witt the choice of functional form for a model can have a signifi-
1983). The functional forms that can be used range from cant impact on the results and that conclusions, such as on the
simple linear regression models to complex forms derived importance of variables in affecting demand, derived from a
from statistical or econometric theory, which are more dif- particular form are not necessarily applicable to another
ficult to estimate. form. They also suggest that the more complex, general or
Misspecification of the structure of the model can be tested flexible functional forms are likely to repay the extra effort
for with tests such as the RESET test of Ramsey (1969, involved in their estimation with more reliable results.
1974) and Ramsey and Schmidt ( 1976). Godfrey, McAleer, Parameter estimates are also sensitive to different error
and McKenzie (1988) conclude an assessment of this and distribution assumptions. Coursey and Nyquist ( 1988) pro-
other such tests with recommendation of the RESET test for duce empirical evidence to show that the effect of assuming
its simplicity of use, robustness to non-normal error distribu- different error distributions, and hence varying the appropriate
tions, and good power. Gaudry and Wills (1978) have estimation technique, is of a similar magnitude to structural
shown that whether an explanatory variable is significant or assumptions of functional form or explanatory variables. The
not, or even has the correct sign, can depend on the functional demand equation specification should therefore consider the
form of the model. So an error in specification of the model possibilities of error distributions other than the usual assump-
form can result in the incorrect inclusion or exclusion of tion of a normal distribution of errors, and robust estimation
explanatory variables and false estimates of the values of techniques should be considered as well as least squares
parameters measuring the impact of variables on demand, techniques.
such as elasticities. Further, the choice of functional form
affects the distribution of the residuals and bias of the test VARIABLES
statistics. While the theory of mathematical statistics leads us
to expect such impacts, Gaudry and Wills show that they are The literature on the estimation of variously specified
also important in practice in the estimation of travel demand international tourist demand equations, examples of which
models. Oum (1989) finds similar effects in freight transport have been cited previously, suggests a large number of pos-
models. Oum compared five functional forms for estimating sible variables for inclusion in a model of tourism demand.
freight transport demand models. The forms are: Schulmeister (1979) lists the following variables as being
the most important exogenous variables in an economic
1. the linear model - a straightforward and easily applied model of tourism demand:
model, but the assumption of linear effects of all variables is Disposable income of private households
usually considered too simplistic to be realistic for tourism Gross National Product (to capture the effect of both
demand modeling and the model is not often used;
2. the log-linear model - the logarithm of demand is specified
private households’ incomes and total production)
Private consumption
as a linear function of the logarithms of the explanatory
Prices of consumer goods
variables; a commonly used form in transport and tour-
ism studies, it assumes constant elasticities and is easily Tourism prices
estimated; Transport costs
3. the logit model - applied to demand market shares, it can be
Incomes
derived from a random utility function model in a discrete Consumer economic expectations
choice context, which provides a convincing theoretical basis Special factors (for example, the Olympic Games or
for the model form; it is also not difficult to estimate and has social unrest in the destination country)
been known to yield reasonable-seeming results; Short-term factors (e.g., school holidays, weather)
4. the translog model - a flexible functional form giving a Long-term factors (stock of private cars, length of holi-
quadratic approximation to the unknown true form of the days, population by age group, degree of urbanization)
utility function, with variation in own and cross-elasticities Relative prices between domestic and foreign tourism
allowed for, it is more difficult to estimate; Relative prices between destinations
5. the Box-Cox transformation - as a generalization of the Other econometric demand studies use various subsets of
first two model forms. these variables, sometimes supplemented by the variables
Oum replicates Gaudry and Wills’ finding that the values listed below (with examples of studies which have incorpo-
of elasticities and the variables found significant in the model rated them):
(1982), O’Hagan and Harrison (1984), Rosensweig multicollinearity among the explanatory variables was identi-
(1986), Summary (1987), Witt and Martin (1987). fied as a concern. These two examples illustrate some of the
Variables such as price, income, and consumption are problems with the variables that arise. In particular, they
sometimes lagged, and the lagged dependent variable has show how small is the set of explanatory variables actually
been included as an explanatory variable reflecting habit considered in many models in comparison with the lists of
build-up, word-of-mouth reporting, and rigidity of supply potential variables given above. Theoretical models of tour-
ism demand and the practice of demand modeling are far from
(Witt and Martin 1987). Faulkner (1988) discusses possible
factors which impinge on the number of tourist arrivals in integrated. The presence of serial autocorrelation in models
Australia from overseas in the light of recent rapid increases and time-related misspecification, which may contribute to
in the numbers. He identifies some permanent and transient multicollinearity problems, suggests that the dynamics and
changes to which he ascribes the observed effect on tourist lag structure of tourism demand may need closer analysis
numbers. The variables are than they have previously received.
particular destination if it were in their search field and n( T) is estimated, but a and b cannot be uniquely estimated from the
the cumulative total number of time T of people at the source one data set; this degeneracy requires independent data on the
who are past tourists to the destination. Denote the flow of parameters for its resolution.
tourists to the destination as a result of a search for informa-
tion by nl’ then if it is assumed that potential tourists find CONCLUSIONS
information about the destination independently and with
equal probability the flow of new tourists (that is, those with Tourism demand is typically modeled using multiple re-
no previous experience of tourism to the destination) resulting
gression to fit a linear or log-linear model with a small set of
from a search for information is proportional to N-n(T). So
explanatory variables. This methodology has the advantage
nl’ a(N-n(T) ) for some a, 0 <_ a ~1.
=
of simplicity, but it ignores much of the current state of know-
Some new tourists will obtain information about the desti- ledge with respect to estimation problems in multiple regres-
nation not as a result of a deliberate search but by a chance sion, the effects of misspecification of model structures, and
different error distributions. The results obtained from demand
encounter with the information. If information is dissemi-
nated at random in terms of its potential reception, then the models, such as estimates of elasticities and the significance
of explanatory variables, can be highly reliant on the func-
number of information messages received in this way will be
tional form of the model, with the consequence that policy
some fraction of the potential N-n(T). If the amount of infor-
decisions and demand forecasts can be made which are as
mation sent out is constant, then-assuming independent and
much an outcome of the model form chosen by the modeler as
equally likely exposure to the information and a constant they are of the data. Evidence in the econometric, statistical,
probability of choosing the destination for those with the and transportation economics literatures indicates that more
information-the flow of new tourists to the destination owing
to receipt of information randomly is N2’ = b(N-n(T) ). It general, flexible functional forms give better results than the
can be hypothesized that the parameter b may depend on simple forms often used.
The lag structure of tourism demand has received little
variables such as the level of advertising. The third possible
method of information flow is word of mouth through the explicit analysis, despite indications that it is likely to be an
conjunction of a past tourist and a potential tourist. The flow important aspect of a well specified model. From considera-
tion of information flows to potential tourists there is con-
of new tourists due to this information flow, n3’, is propor-
tional to both the number of past tourists and the number of structed a dynamic structural model for tourism demand as a
potential new tourists, analogous to the spread of a new quadratic function of the total prior visitors to a destination.
product in a market (Deaton and Muellbauer 1980, p. 371).
REFERENCES
Thus
Arbel, A., and S. A. Ravid(1985). "On Recreation Demand: A Time-Series
Approach." Applied Economics, 17: 979-90
Box, G. E. P., and D. R Cox (1964). "An Analysis of Transformations."
Journal of the Royal Statistical Society Series B, 26: 211-43.
There will also be a flow of repeat business, n4’, by tourists Calantone, R J., C. A. di Benedetto, and D. Bjanic (1987). "A Compre-
who choose a destination using information about the desti- hensive Review of the Tourism Forecasting Literature." Journal of
Travel Research, 26: 28-39.
nation from a past visit. Assuming a constant proportion of
Coursey, D., and H. Nyquist (1988). "Applications of Robust Estimation
past visitors choose to return, Techniques in Demand Analysis." Applied Economics, 20: 595-610.
Deaton, A., andJ. Muellbauer( 1980). Economics and ConsumerBehavior.
n4’ = d.n(T). Cambridge University Press.
Faulkner, B. (1988). Patterns of Tourism Demand in Australia: An Analysis
of Recent Trends. BTR Occasional Paper No 4. Canberra: Bureau of
Thus if n’ is the total flow of tourists due to these means of Tourism Research.
information transmission, Fujii, E. T. and J. Mak (1981). "Forecasting Tourism Reward: Some
Methodological Issues." Annals of Regional Science, 15: 72-82,
Gaudry, M. J. I., and M. J. Wills (1978). "Estimating the Functional Form of
Travel Demand Models." Transportation Research, 12: 257-89.
Godfrey, L. G., M. McAleer, and C. R McKenzie (1988). "Variable Addi-
tion and Lagrange Multiplier Tests for Linear and Logarithmic Regres-
sion Models." Review ofEconomics and Statistics, 70: 492-503.
Greenwood, M. J. (1985). "Human Migration: Theory, Models and Empiri-
cal Studies." Journal of Regional Science, 25: 521-44.
Re-arranging gives Hollander, G. (1982). Determinants of Demand for Travel To and From
Australia. Working Paper No. 26. Canberra: Bureau of Industry
Economics.