Professional Documents
Culture Documents
Abstract: Despite efforts in recent years, the construction industry remains one of the top contributors for workplace fatalities in many
Downloaded from ascelibrary.org by Tokyo Univ Seisan Gijutsu on 05/14/15. Copyright ASCE. For personal use only; all rights reserved.
countries. One of the key concerns in the industry is the management of workers’ safety behavior. This paper aims to explore the cognitive
factors influencing the unsafe behavior of not anchoring a safety harness when working at height. In addition, multiple stepwise linear
regression, artificial neural network, and decision tree techniques were applied in the study to assess their usefulness in evaluating survey
data of safety cognitive factors. The theory of planned behavior (TPB) was adopted to model the cognitive factors influencing the unsafe
behavior of scaffolders. The TPB postulates that attitude, perceived behavioral control, and subjective norms affect the intention of workers,
which ultimately affects intentional behavior. The unsafe act of not anchoring harnesses while working on a scaffold was selected as the focal
behavior based on observations and interviews with safety supervisors. Supervisors also provided their opinions on the underlying reasons for
the unsafe act. A questionnaire was then developed based on the site observations, interviews, and literature review. Subsequently, 40 migrant
workers from Bangladesh, India, and China were surveyed. Stepwise multiple linear regression, neural network, and decision tree analyses
were implemented. The analyses revealed that subjective norm was the key variable influencing a worker’s decision to anchor the safety
harness. The significance of subjective norm was probably affected by the national culture of the migrant workers. In addition, the analyses
showed that the relationships between the variables were probably nonlinear, thus neural network and decision tree are suitable techniques.
The exploratory study provides the basis for design of an in-depth study on the cognitive factors influencing safety behavior and it expands
the choice of analyses techniques. DOI: 10.1061/(ASCE)CO.1943-7862.0000972. © 2015 American Society of Civil Engineers.
Author keywords: Cognitive analysis; Safety behavior; Theory of planned behavior; Construction safety; Neural network; Decision tree;
Data mining; Migrant worker; Working at height; Labor and personnel issues.
Summation Layer
Neural Network
Neural network is an established artificial intelligence technique
that mimics biological nervous systems, which have adaptive learn- Pattern Layer
ing properties. Each neural network is a mathematical model made
Input
up of nodes (or neurons) that are connected through a network.
A learning algorithm is implemented to adjust the weights of
the connections between the neurons so that the network is able Fig. 2. Structure of a generalized regression neural network with two
to minimize prediction errors. Many construction management inputs
>=3
Contract sum
>=3
<3
Competency
Low Risk of supervisor
<4 >=4
and competency of supervisor is very high (≥ 4), then the risk of the the migrant workers are used to the lower level of safety standards
project is low. The algorithm has only selected three attributes to in their home countries. The supervisors opined that scaffolders
classify the data set. The selected attributes are more likely to be tend to be focused on getting the task done and frequently neglect
more influential in determining the risk level of projects. However, safety measures such as anchoring their safety harnesses. In gen-
most decision trees can only predict nominal class. eral, the supervisors felt poor safety attitude of individual scaffold-
ers was the main reason for the failure to anchor safety harnesses.
11 I always hook safety harness onto suitable anchors when working on scaffolding Behavior
(0.197). The model implies that among the TPB variables, subjective
norm was the only significant predictor of intention. The residual ver-
sus fit plot in Fig. 4 seems to suggest that the assumption of linearity is
acceptable. However, a closer examination shows that the points are
more dispersed below the zero line and do not spread randomly across
the chart. This points to possible nonlinearity issues.
In accordance with the TPB, a stepwise multiple regression was
used to assess the importance of intention and PBC in influencing
the scaffolders’ decision to anchor their safety harness. With refer-
ence to Table 6, Intention was included in the regression model
(behavior ¼ 2.554 þ 0.4 × intention) and PBC was excluded. As
can be observed from Table 6, the R2 was also low. Table 7 shows
the results for the regression analysis to predict behavior with
attitude, SN, and PBC as the independent variables. The final
model (Step 2), behavior ¼ −0.238 þ 0.56 · SN þ 0.469 · attitude
(Table 7) shows that SN and attitude were the significant variables
influencing behavior of scaffolders and PBC was excluded. The
final model has a relatively higher R2 (0.371). In addition, both
Figs. 5 and 6 show that the relationships of the regression analyses Fig. 6. Residual versus fit plot (behavior as dependent variable and SN,
are probably nonlinear. attitude, and PBC as independent variables)
Thus as a whole, the three regression analyses indicated that
among the TPB variables, PBC is not significant. On the other
hand, SN is a relatively influential variable. Furthermore, the useful for small data sets, such as herein. In the traditional parti-
residual analyses suggested that the relationship between the var- tioning or holdout approach, a fixed percentage of the training set is
iables was probably nonlinear. used for testing. However, in small data sets, the small number of
cases for testing is very limited and the testing results will not be
credible. In contrast, the cross-validation method employs all the
Neural Network Analysis
available data for testing.
The neural network analyses were conducted using NeuralTools 6 NeuralTools (Palisade 2014) produces an indicator called var-
(Palisade 2014). Because the variables are numeric, the GRNN iable impact to measure the relative importance of each indepen-
(Specht 1991) was adopted. Unlike the multilayer feed-forward dent variable in predicting the dependent variable. Even though
network (MLFN), the GRNN does not require arbitrary assignment variable impact is not meant to give a firm assessment of the im-
of the number of hidden layers and the number of neurons on each portance of each variable, the indicator does provide useful guid-
layer [e.g., Goh and Chua (2013)]. The 10-fold cross-validation ance on relative importance. Based on Palisade (2013), variable
technique (Samarasinghe 2007; Witten 2011) was used to test the impact is calculated by focusing on each independent variable,
credibility of the network. In the 10-fold cross validation, the train- X i sequentially, where i ¼ 1; : : : ; m and m is the number of inde-
ing set was randomly split into 10 subsets and within each subset, pendent variables. The algorithm then steps through each training
10% of the training set were randomly assigned for testing and case j, where j ¼ 1; : : : ; n and n is the number of training cases.
withdrawn from the training process. Each subset of testing cases For each training case j, the value of the ith independent variable
was then used to assess the root-mean-square error of the corre- X ij , is varied, but the other independent variables are kept un-
sponding network. The process was repeated 10 times and the aver- changed. Each specific value of X ij within the training set is used
age of the root-mean-square error for the 10 testing sets was used to to derive the corresponding dependent variable Y ij derived from the
estimate the root-mean-square error of the network trained using neural network. The difference between the maximum and mini-
the full training set. The cross-validation technique is especially mum value of the dependent variable Y ij , Δij , is then calculated
the Δij for all training cases, the mean of Δij (Δ̄i ) for independent dition, because all scaffolders feel that they always anchor their
variable i is then calculated. The process is repeated for other in- harnesses, this forms peer pressure on the other scaffolders, espe-
dependent variables (2; : : : ; m) to obtain the mean difference for cially if the level of individualism is low. Furthermore, because
each independent variable. Thus, the variable impact for indepen- individualism is likely to be low in this group of workers, the effect
dent variable i is of an individual’s attitude and perceived behavioral control, as re-
flected in the analysis, is likely to be low. Thus, the national cul-
Δ̄i
VI i ¼ ð1Þ tures of the migrant workers and the location of work could have
Δ̄1 þ : : : þ Δ̄m played a part in making subjective norm the key variable influenc-
ing assessment of consistency of safety behavior. However, this
As in the case of regression, three networks were trained and
exploratory study did not collect information on the site and organi-
tested (Table 8). Besides variable impact, Table 8 shows the RMS
zational culture in which the scaffolders worked. As shown in past
error for each of the networks trained. Because the Likert scale used
studies (Zohar 2010), safety climate at group and organizational
in the survey is an integer scale of 1 to 5, a RMS error of 0.5 or less
levels are important and these factors should be considered in future
was deemed to be acceptable. It can be observed that Network 3 has
studies.
the lowest RMS error. In alignment with the regression analysis, the
Based on the residual versus fit plots of the stepwise multiple
GRNN analyses indicate subjective norm as the independent var-
linear regression models, the relationships between the variables
iable with the greatest influence on behavior and intention.
were probably nonlinear. In addition, regression analysis requires
more stringent checks on the statistical characteristics of the pop-
Decision Tree Analysis ulation to assure that the assumptions are valid. Artificial neural
To facilitate the application of the decision tree technique, the de- network and decision tree are better suited to handle nonlinear
pendent variables, intention and behavior had to be converted into relationships and they do not require any assumptions about the
nominal data type. Thus, for both intention and behavior, a value of statistical characteristics of the population being investigated. How-
5 is classified as A and a value less than 5 is classified as B. The ever, a neural network is very much a black box and it is hard to
analyses used the J48 classifier (Witten 2011), which is a modifi- understand the details of the network. In contrast, a decision tree
cation of the widely implemented C4.5 (Quinlan 1993). In accor- usually provides a simple and straightforward solution. In this
dance with the previous analyses, three trees were developed study, all the decision trees only have one node. This could be be-
(Table 9) and all the trees were tested using 10-fold cross valida- cause of the small number of variables and sample size. Further-
tion. All three trees only have one node. Tree 1 has a high percent more, a decision tree analysis require a nominal class to act as the
bad prediction (percentage of predictions that were wrong), but dependent variable. Arbitrary classification of the dependent vari-
Trees 2 and 3 had acceptable prediction accuracy. In addition, it able may affect the results of the decision tree. Nevertheless, both
was noted that as in previous analyses, subjective norm surfaced as data mining approaches have shown that they are useful approaches
the most influential independent variable. for understanding cognitive factors of safety-related behaviors.
During the interviews, the supervisors opined that scaffolders
fail to anchor their harnesses due to poor safety attitude. However,
Discussions the study showed that workers were more influenced by subjective
norm than attitude. The supervisors underappreciate their own in-
The analyses consistently identified SN as the most influential var- fluence on the workers and the importance of group norms. Further
iable in determining supervisors’ ratings of migrant scaffolders’ studies of the contribution of supervisory actions to unsafe behavior
consistency in anchoring their safety harnesses. It is not unaccept- should be conducted.
able that only one variable (SN) is identified as significant because Future studies can use this exploratory study as a basis to gen-
the TPB only stipulates possible relationships between the variables erate hypotheses and select research methods. Studies on cognitive
and does not demand inclusion of all the variables for all situations factors should take into account nationality of the subjects being
(Ajzen 1991). Different behavior and population can yield differ- studied. For migrant workers from Asia, particularly southern Asia,
ent results. In the context of this study, the respondents are from it is necessary to measure and probe the factors influencing sub-
Bangladesh, China, and India. In addition, they are working in jective norm in more detail. A wider range of behaviors can also
Singapore. All four countries are known to have high scores (77–80) be studied to assess if the type of behavior played a significant role
for power distance dimension (degree of acceptance of unequal in influencing the relative importance of the underlying cognitive
distribution of power) and low or moderate scores (20–48) for indi- factors. Because worker behavior is affected by organization struc-
vidualism dimension [degree of preference for a “loosely-knit ture, processes, and culture (Guldenmund 2007), a wider range of
social framework in which individuals are expected to take care factors can also be studied. In addition, many safety-related behav-
of only themselves and their immediate families” (The Hofstede iors are decided on the spot depending on the decision environment