Professional Documents
Culture Documents
net/publication/316845827
CITATIONS READS
0 416
2 authors:
Some of the authors of this publication are also working on these related projects:
Multidimensional Item Response Theory models for practical application in large tests designed to measure multiple constructs View project
Development and implementation of a multifaceteditem response modelforanalysis of teacherevaluation at theNationalUniversity of Colombia-Bogotá View project
All content following this page was uploaded by Diana Carolina Carolina Gamboa Pinilla on 10 May 2017.
1 Introduction
Latent traits, are random variables which cannot be observed directly, Baker
and Kim [1]. Some known examples of latent traits are intelligence, feelings and
preferences. In this paper we work with happiness. To measure a latent trait,
a test is designed to obtain observable data which are related with it. Each
question in those tests is called an item. The item response theory (IRT) models
have been developed for estimating continuous latent traits from categorial
data. IRT models are really factor analysis of discretized variables, Takane and
Leeuw [13]. In the general case, latent traits are multidimensional, Reckase [10].
However, when a latent trait is approximately unidimensional, unidimensional
(UIRT) models may be used, Montenegro and Cepeda [7]. In this case, a
unidimensional scale is adopted for the latent trait. The key idea when we use
UIRT models is that the categories of the items and the latent traits are in
the same scale. Consequently, if the items in a test are ordinal, say by using a
Likert scale, the categories and the latent traits are sorted in the real line such
that people with low latent trait have high probability of choosing low values
of Likert scale and vice versa.
Two general types of IRT models are studied separately, even though one of
them is a subclass of the the other. To the first class belong the dichotomous or
dichotomized models. In this case, all the variables in a test a dichotomic. To
the second class belong the models where the items are categorial, and where
each item may have more than two categories. This kind of models are more
adequate for using in surveys. The members of the second class are called
polytomous IRT models. In this paper, we focus on the graded response model
393
Proceedings, 15th Applied Stochastic Models and Data Analysis (ASMDA2013)
International Conference, Mataró (Barcelona), Spain 25 - 28 June 2013
Since the main objective of UIRT models is to state a scale and sort the
people according with such scale, the estimated latent traits are synthetic in-
dices, Montenegro and Cepeda [7]. Some times, principal component analysis
(PCA) or multiple correspondence analysis (MCA), Lebart [4], are used to ob-
tain synthetic indices. The results are relatively similar, because all of these
are factorial methods. In this paper we use this fact, and apply MCA as an
exploratory tool in order to help in the interpretation of the index.
In surveys, the most common type of variables are categorical, and more
specifically ordinal. In these cases, Likert scales are used. The data used
along this paper are from the European Social Survey (the ESS). The ESS,
is a biennial multi-country survey covering over 30 nations. The first round
was fielded in 2002/2003, the fifth in 2010/2011. The project is funded jointly
by the European Commission, the European Science Foundation and academic
funding bodies in each participating country. For details about the EES go to
http://www.europeansocialsurvey.org/.
For the construction of the index we used the human values section of the
Spanish sample. Some of the statements of the survey that we used were: I am
always optimistic about my future; my life is exactly as I want it to be; I like
to plan and prepare for the future; I feel I can decide for myself how to live;
All the time I feel that I will fail. The common response categories for this
kind of statements in the ESS are: strongly disagree, disagree, neither agree
nor disagree, agree, strongly agree. The complete data and the R procedures
used in this work can be asked to the authors, via e-mail.
Where ai is called the discrimination parameter of item i and big is the bound-
ary location parameter which corresponds to the boundary between categories
g and g − 1 associated with item i, Ostini and Nering[8].
394
Proceedings, 15th Applied Stochastic Models and Data Analysis (ASMDA2013)
International Conference, Mataró (Barcelona), Spain 25 - 28 June 2013
∗ ∗
Pig = Pig − Pi(g+1) (2)
To completely define the model in terms of the available boundaries, two
additional definitions are required.
On the other hand, the category response function is given by Pig (θ), i.e.,
Pig (θ) is the probability of choosing category i to item g as a function of θ.
The figure 2 shows the graphs of the response functions of an item.
3 Data description
The data belongs to the ESS, applied in 2006. The test includes a wide range
of social variables, social and public trust; social exclusion, national, ethnic
and religious allegiances; well-being; health and security; human values; de-
mographics and socio-economics. For the construction of the index we used
the human values section of the Spanish sample. The sample size was N=1876
and the test size was K=27. In the selected subtest the items 5 to 19 have 4
categories. On the other hand, the other items have 5 categories.
There were 218 missing data in the test. This is not a problem with IRT
models. For calibration, only the non missing responses are used in the likeli-
hood function. However, for using MCA it is necessary to complete the data.
Thus, we first fitted the data using the graded Samejima’s model. Item cali-
bration was used to compute the latent trait estimation. Once the latent trait
estimations were obtained, the missing data was estimated item to item by
categories with highest probability.
There exist different procedures for assessing the unidimensionality of the data.
We use a recent procedure based on the Cronbach’s α (alpha) coefficient. This
coefficient is commonly used to measure the internal consistency or reliability
of a test. The coefficient is defined as
P 2
k S
α= 1− 2 i ,
k−1 Stotal
where k is the number of items, Si2 is the variability of item i and Stotal
2
is
the total variability of the test. The coefficient may be interpreted as follows.
395
Proceedings, 15th Applied Stochastic Models and Data Analysis (ASMDA2013)
International Conference, Mataró (Barcelona), Spain 25 - 28 June 2013
To fit the data we use the marginal likelihood method implemented in the
package ltm of R. To define a scale, it is assumed that the latent traits θj of
the people are a sample from a random variable Θ ∼ N (0, 1). In this method,
the likehood is integrated with respect to the latent trait to calibrate the item
parameters. The scoring for the latent traits is running using the posterior
expectation of the latent traits. For details, see Rizopoulos [11].
396
Proceedings, 15th Applied Stochastic Models and Data Analysis (ASMDA2013)
International Conference, Mataró (Barcelona), Spain 25 - 28 June 2013
between -2.5 and -0.8 have higher probability for selecting the second category
response; finally people with trait level greater than -0.8 will have high chance
to answer the last category.
Once the item parameters were calibrated, we estimated the latent traits
θj for the people. We named this latent trait as people’s happiness.
6 Goodness of fit
In this work, we used a goodness of fit test, to identify problematic items
(ambiguous or flawed) and to detect people with responses that not follow
397
Proceedings, 15th Applied Stochastic Models and Data Analysis (ASMDA2013)
International Conference, Mataró (Barcelona), Spain 25 - 28 June 2013
the general pattern responses.[14]. Given a person j with latent trait θj , the
probability to obtain the xij score is denoted by Pijk . The expected value and
variance of xij are,
mi
X
Eij ≡ E(xij ) = kPijk (3)
k=0
and
mi
X
Wij ≡ V (xij ) = (k − Eij )2 Pijk (4)
k=0
The standardized differences between score of the person j and its expected
response for the item i is given by
(xij − Eij )
zij = p (5)
Wij
The standardized residuals were computed based on the equations 3, 4 y 5. We
detected that the number of residuals such |zij | > 2 was 1556, that correspond
to 3.9% of the responses of the responses. As this percent was less than 5%,
there is no reason to worry.
A multiple correspondence analysis was done with the test responses as active
variables. The latent trait was projected as an illustrative variable. Figure
3 shows the first factorial map. We observed a parabolic structure (Guttman
Effect). This effect revealed that the first factor summarizes the order structure
of modalities; remaining factors are functions of the first factor, Lebart [4].
The first factor shows opposition among extreme categories. The second factor
shows opposition among extreme categories and average categories. The latent
trait variable has a high correlation with the first factorial axis (0.96) therefore
both variables measure essentially the same.
8 Clustering
A clustering analysis was ran. Four clusters were obtained. Cluster 1 had the
lowest happiness index. To this cluster belongs 7.26 % of the sample. The
cluster was characterized by people that the most of the time felt depressed,
sad, anxious and bore; they never felt energetic or enjoyed their life and they
never felt peaceful or calm; they always felt really tired when they woke up
in the morning; they felt that their life was far from how they would like it
to be. They had bad health, they were widowed or divorced; they usually did
housework, they were looking after children or other people almost all the time
and they rarely met with their friends.
398
Proceedings, 15th Applied Stochastic Models and Data Analysis (ASMDA2013)
International Conference, Mataró (Barcelona), Spain 25 - 28 June 2013
9 Conclusion
In this paper, it was proposed a methodology to build a happiness index by
using a polytomous item response theory. This is a synthetic index based on
qualitative information. In contrast to the common methods to construct in-
dexes, the happiness index is not a linear combination of other indexes. Instead,
the happiness index is a latent trait. We illustrated, how polytomous IRT mod-
els may be useful for analyzing data from surveys. We show the main issues
that the researchers most consider before using the models. In other areas like
marketing, the ideas are useful if the researcher understand the models well
and the assumptions about them.
399
Proceedings, 15th Applied Stochastic Models and Data Analysis (ASMDA2013)
International Conference, Mataró (Barcelona), Spain 25 - 28 June 2013
References
1.Baker, F. B. and Kim Seok-Ho (2004), Item Response Theory, second edition, Mar-
cel Decker Inc.
2.Cameletti, M & CAviezel, V. (2012), The cronbach-mesbah curve for assessing the
unidimensionality of an item set: The r package cmc.
3.Cortina, J. (1993), ’What is Coefficient Alpha? An examination of Theory and
Applications’, Journal of Applied Psychology 78 (1), 98-104.
4.Lebart, L., M. A. & Piron, M. (1995), Statistique Exploratoire Multidimensionnelle,
Dunod.
5.Masters, G. (1982), ’A Rasch model for partial credit scoring’, Applied Measurement
in Education, I, 279- 297.
6.Mesbah, M. (2010), ’Statistical Quality of Life’, In N. Balakrishnan (ed.), Method
and Applications of Statistics in the Life and Health Sciences, Wiley, 839-864.
7.Montenegro, A. and Cepeda, E., ’Synthesizing the Ability in Multidimensional Item
Response Theory Models’, Revista Colombiana de Estadstica 33, 127-147.
8.Ostini, R. & Nering, M. (2006), polytomous item response models, SAGE publica-
tions.
9.R Development Core Team (2012), R: A Language and Environment for Statistical
Computing, R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-
900051-07-0. URL: http://www.R-project.org
10.Reckase, M., (2009), Multidimensional Item Response Theory, Springer
11.Rizopoulos, D. (2006), ’ltm: An R package for latent variable modeling and item
response theory analyses’, Journal of Statistical Software 17, 1-25.
12.Samejima, F (1969), Estimation of latent ability using a response pattern of graded
scores.Psychometrika, Monograph Supplement No. 17.
13.Takane, Y. and Leeuw, J. (1987), ’On the relationship between item response
theory and factor analysis of discretized variables’, Psychometrika 52(3), 393-
408.
14.Van der Linden, W. & Hambleton, R. (1998), Handbook of modern item response
theory, Springer.
400