Professional Documents
Culture Documents
by
Francoise Kubierske
MINI DISSERTATION
submitted in partial fulfillment of the requirements for the degree of
in the
FACULTY OF HUMANITIES
at
Supervisor: Dr N. Coetzee
2008
Acknowledgements
I would like to express my sincere gratitude to the following people for their
ongoing contribution and support:
Anneli Harding and Statkon. Thank you for the ongoing effort and work in
support of this research.
The schools, teachers and pupils who participated with enthusiasm. Thank
you for the opportunity to conduct the research with your help, and for all
the hard work.
ii
TABLE OF CONTENTS
Page no
ABSTRACT.......................................................................................... xi
CHAPTER 1
INTRODUCTION TO RESEARCH
1.5 Conclusion................................................................................... 7
CHAPTER 2
LITERATURE REVIEW: EMOTIONAL DISTURBANCE
iii
Status...................................................................................... 10
2.2.2 Indicators of Emotional Disturbance ....................... 14
2.2.3 The relationship between Emotional Disturbance and
Emotional Disorders ........................................................... 17
CHAPTER 3
LITERATURE REVIEW: DRAW-A-PERSON
iv
3.4 Conclusion ................................................................................ 48
CHAPTER 4
RESEARCH METHODOLOGY
v
4.7 Scoring of Instruments ............................................................... 66
4.7.1 Scoring of the DAP: SPED ............................................ 67
4.7.2 Scoring of the BYI-II ...................................................... 73
4.7.3 Scoring of the QQOBC ................................................. 73
CHAPTER 5
RESULTS
5.1 Introduction................................................................................... 77
vi
5.2.4 Results of the Mann-Whitney U Test ……………………. 82
CHAPTER 6
DISCUSSION OF FINDINGS, LIMITATIONS AND RECOMMENDATIONS
vii
6.4 Credibility of Inter-rater Scores ............................................... 111
viii
LIST OF TABLES
Page no
ix
Table 5.5 Mann-Whitney U Test for Present and Absent
Emotional Disturbance (ED) Groups ............................................ 83
x
LIST OF FIGURES
Page no
xi
LIST OF APPENDICES
Page no
APPENDIX A
Qualitative Quesionnaire for Observed Behaviour in the
Classroom (QQOBC) ..................................................................... 134
APPENDIX B
Permission to Partake in Research ............................................. 136
APPENDIX C
Feedback for Participants Regarding the Presence of
Emotional Disturbance ................................................................ 138
APPENDIX D
Feedback for Children Excluded from the Sample ................... 139
APPENDIX E
Example of Qualitative Analysis Spreadsheet ………………… 141
xii
ABSTRACT
South Africa is perceived as a country with many economic and social problems.
These are stressors that put individuals at risk of developing emotional
disturbance. Children, as the most vulnerable sector of society, have the least
access to resources such as psychological intervention. A measurement
instrument that can identify emotional disturbance within this high-risk group; that
minimises the effects of formal learning, cultural and language barriers; and
utilises the minimum of resources is needed. The Draw A Person: Screening
Procedure for Emotional Disturbance (DAP: SPED) (Naglieri, McNeish & Bardos,
1991) is a screening test for emotional disturbance that seems to fulfil these
criteria.
The present study aimed to explore how useful the DAP: SPED is to identify
children with emotional disturbance in South Africa. This involved comparing the
scores of the DAP: SPED with those of an independent measure for emotional
disturbance, the Beck Youth Inventory (BYI-II). To further explore the usefulness
of the DAP: SPED, internal reliability and inter-rater reliability were examined.
The DAP: SPED and the BYI-II were administered to a group of English-literate
Grade 4 learners from two mainstream schools.
The emotional status of the children was unknown, and scores from the BYI-II
were used to provide this information. As the BYI-II was used as a control
measure, the internal reliability coefficients of its subscales were investigated
within the context of the present research. Internal reliability coefficients were
good (r=.823 to r=.929, p<.01). A third measure, the Qualitative Questionnaire for
Observed Behaviour in the Classroom (QQOBC) was completed by the teachers
of participating learners. This measure was used to provide depth to the results
as well as independently rate the children’s behaviour.
xiii
Results showed that the DAP: SPED achieved low correlations with the BYI-II
(rho=.024 to rho= .76, p<.01), thereby supporting the null hypothesis. The DAP:
SPED was also unable to distinguish between emotional disturbance Present
and Absent groups, created post facto according to scores on the BYI-II. A
statistical analysis of the internal reliability showed that the DAP: SPED had low
internal reliability (α=.448), and that the omission of the measurement items
increased the internal reliability (α=.525). Inter-rater reliability was significantly
high (rho=.687 to rho=. 816, p<.01), thereby rejecting the null hypothesis. Despite
the good inter-rater reliability, certain scoring problems were encountered that
showed that a margin for scoring bias could influence the usefulness of the
measure.
xiv
CHAPTER 1
INTRODUCTION TO RESEARCH
1.1 Introduction
South Africa faces many social and economic problems (Shell, 2000). These
are pervasive and the most vulnerable sectors of society, namely children and
the poor, are the worst affected (Strydom & Strydom, 2006). Their
vulnerability is heightened by the fact that they have limited access to
resources such as health care. Children are particularly vulnerable because
they depend on adults to connect them to necessary resources, such as
psychological care (Dawes & Donald, 1994). The South African Government
has recognised the dire need for the resources mentioned, but service
delivery is difficult in the face of vast geographical areas and the limited
availability of health care workers (White Paper for the Transformation of the
Health System in South Africa, 1997). The result is that the most vulnerable
sector of society has the least access to resources and consequently
becomes increasingly exposed (Shell, 2000).
1
1989). Language forms a barrier in psychological service delivery especially
when a client and psychologist cannot communicate in each other’s
language. Language also forms a barrier when psychologists work with
children. Even if the child and psychologist do communicate in the same
language and thus understand each other, children are still developing their
verbal vocabulary and cannot always successfully express themselves
(Siegler, Deloache & Eisenberg, 2003). A non-verbal measure eliminates the
need for spoken language as a means of communication and is consequently
beneficial amongst a diverse group of clients (Foxcroft & Roodt, 2004).
Over the years, many different versions of how to study human drawings
have emerged. Each version differed slightly in relation to the emphasis
placed on administration and the purpose thereof. The most recent version of
using a human figure as an assessment tool is the Draw-A-Person: Screening
Procedure for Emotional Disturbance (DAP: SPED) (Naglieri, McNeish &
Bardos, 1991).
The DAP: SPED assures a quick form of assessment to determine the need
for further therapeutic intervention (Naglieri, McNeish & Bardos, 1991). The
non-verbal nature of the screening measure minimises the effects of formal
learning and language barriers and is therefore more suitable for cross-
cultural use than static verbal measures (Foxcroft & Roodt, 2004). Research
2
has shown that this measure is useful across cultures within the American
context and that it can accurately identify children with emotional problems
(Matto, 2002; Naglieri, McNeish & Bardos, 1991). The effectiveness of this
measure for children of various backgrounds in America suggests that it
would be valuable in the South African context.
Although the DAP: SPED has been available for more than a decade, no
literature revealed its local application. The necessity to determine the utility
of the DAP: SPED in the South African context originates from different
factors. Although the DAP: SPED has been reported valid within an American
context, it must be kept in mind that South Africa is a developing country that
faces different social and economical problems.
One of the areas where South Africa differs form America is the high
incidence of violent crime. Violence in South Africa has been likened to war-
torn countries such as Palestine (Neser, 2006; Ward, Martin, Theron &
Distiller, 2007). Violent crime has an escalating negative impact on the
emotional well-being of individuals and as a result, the need for psychological
services will increase (Harris & Radaelli, 2007). Masango (2004) noted that
prolonged exposure to violent crime has detrimental effects on cognitive and
emotional functioning. Children in particular become increasingly vulnerable
to the psychological impact of violence, since they are still developing as
individuals and have not yet developed a wide range of coping skills (Wicks-
Nelson & Israel, 2003).
Within the limited scope of the current research, it was decided not to
replicate previous validation studies. Instead the usefulness of the DAP:
SPED as a screening measure was investigated within a small sample of
South African learners. Usefulness is an aspect of validity. The usefulness of
a measure relates to how effective the measure is within a given application
(Murphy & Davidshofer, 2001). It is specifically concerned with functionality in
terms of how well it serves its intended purpose (Cronbach, 1988).
3
In order to ascertain how well the DAP: SPED may screen for emotional
disturbance in South African children, it was compared with an independent
measure. This is different from the original standardisation process or
subsequent validation studies, where no equivalent measure was employed
(Naglieri, McNeish & Bardos, 1991). Another reason why the current research
did not replicate previous studies, is that information was needed regarding
the DAP: SPED’s ability to distinguish between emotionally disturbed and
non-disturbed children when their emotional status was unknown.
4
As explained in 1.1, South African children are particularly vulnerable to the
social problems facing the country. Due to a shortcoming in the provision of
psychological services, children’s limited range of coping skills and their
frequent inability to communicate their needs, children are at risk of
developing emotional disturbance (Wicks-Nelson & Israel, 200). The DAP:
SPED has the potential to identify emotionally disturbed children who might
otherwise go unnoticed. This would be the first step in connecting affected
children with appropriate help. Professional caregivers can then help schools
by instituting relevant programs to assist children to cope with adversity.
The DAP: SPED, however, has not yet been used in the South African
context, and it is unclear if it will be useful as a screening measure for
emotional disturbance in children. The usefulness of the DAP: SPED
therefore needs to be examined within the envisioned application thereof.
The primary aim of this study is to determine whether the DAP: SPED could
be a useful instrument for use with South African children to identify those
with emotional disturbance.
From the primary aim, the following secondary aims were identified:
5
H o : There is no correlation between the scores of the DAP: SPED and
the BYI-II in a sample of Grade 4 South African learners.
H 1 : There is a significant correlation between the scores of the DAP:
SPED and the BYI-II in a sample of Grade 4 South African
learners.
The results of the attainment of these aims and the measurement of these
hypotheses will be presented during the course of this study. Before the
results are reported, however, certain issues will be discussed to demarcate
the study field. The study will therefore be presented according to a specific
structure, which is as follows:
6
1.4 Proposed structure of the study
1.4 Conclusion
The high levels of crime and the nature of violence in the South African
society potentially negatively affect the emotional and cognitive development
of children (Masango, 2004; Shell, 2000; Williams, Xintolo & Nombaca, 2001).
Widespread poverty and reduced access to resources makes help-seeking
behaviour difficult. This is particularly true for children who are dependent on
adults for information about services and access to those services. As a
result, the effects of poverty and violence are not easily mediated (Shell,
2000). Furthermore, South Africa has great diversity in terms of culture and
language, which further complicates psychological service provision. It could
be beneficial for both professionals and help-seeking children if a measure
were available that is minimally influenced by cultural and language issues
and is widely available and easy to administer. The DAP: SPED has the
potential to fill this void, but its usefulness in the South African context should
be examined.
7
CHAPTER 2
LITERATURE REVIEW: EMOTIONAL DISTURBANCE
2.1 Introduction
In Chapter 1, a synopsis was presented of the need for a measure like the
Draw-A-Person: Screening Procedure for Emotional Disturbance (DAP:
SPED) in the South African context. The primary objective of the research
was to establish whether this measure could be used locally. The next two
chapters will focus on a detailed look at important concepts such as emotional
disturbance, specifically in the South African context, as well as the
development of the Human Figure Drawings (HFD) as a screening tool for
emotional disturbance, and the theory upon which this measure is based .
8
2.2 Emotional Disturbance
When using this definition of emotional disturbance, it becomes clear that the
concept relates to other concepts such as emotional distress. Pert (1997)
described emotional distress in terms of a system overload - when emotions
are experienced too fast and too intensely in the face of overwhelming
circumstances, and to such an extent that it leads to impaired functioning. It is
apparent that this definition is analogous with that of Murphy’s (1963), and it
is thus postulated that these terms are interchangeable. Because emotional
disturbance (distress) is closely associated with emotional well-being
(Brannon & Feist, 2004; Murphy, 1963), it is important to explore the
relationship between the two concepts.
9
experience difficult circumstances as it drives personal development. He
suggested that the successful negotiation of such crises could actually
enhance a feeling of emotional well-being (Murphy, 1963).
Disturbance Well-being
The same can be said about children. However, when working with children it
is important to remember that their level of development will influence their
ability to cope with, and learn from, stressful situations. New coping skills are
only learnt when a child successfully adapts to the situation. In turn,
10
adaptation and coping lead to the building of resilience, which means that the
child is better equipped when confronted with other stressful situations
(Wicks-Nelson & Israel, 2003). Conversely, when the child is overwhelmed by
the intensity or duration of the stress, the risk for developing emotional
disturbance increases (Wicks-Nelson & Israel, 2003).
11
Table 2.1 Milestones in emotional development
Sources: Louw (1990), Sadock and Sadock (2003), and Senior (2002).
Toddlerhood • By the age of 18 months, tender affection and shame are observable.
• By 24 months, the child can experience pride.
• Toddlers will look to their parents and other people for cues about how
to respond emotionally to an event and this is referred to as social
referencing. For example, when a toddler falls he might look at his
parent’s reaction before crying or laughing about it.
• Toddlers are increasingly able to organise their demonstrations of love
by combining several behaviours to express an emotion. For example,
a toddler might smilingly run to a caregiver to hug and kiss that person.
• It is also during this time that children seem to be more comfortable
with familiar adults, while being more apprehensive with strangers.
12
Pre-school • The ability to be less egocentric emerges in sharing, and in empathic
and cooperative behaviours.
(continued)
• It is also at this age that children are able to turn parental restriction into
self-guidance.
• Towards the end of this period the conscience becomes established.
School- • Children prefer to interact with the same sex, as there is shyness about
the opposite sex.
going
• Once children start school, and the environment becomes more
Age
structured and demanding, fear is likely to take the form of a social fear
or a fear of academic failure.
• A toddler might use aggression to get a toy, but by school going age, a
child might use aggression for the purpose of hurting someone else.
• As cognition develops, it influences the expression and use of
emotions.
Table 2.1 clearly shows that emotional development involves coping with
increasingly complex emotions as the child grows older. Emotional
development seems to involve a process of monitoring, evaluating and then
modifying the intensity and timing of emotions (Wicks-Nelson & Israel, 2003).
This is a gradual process, and is more easily mastered by some. The mastery
of emotional development is dependent on factors such as temperament and
circumstance (Wicks-Nelson & Israel, 2003). The successful outcome of
emotional development would contribute to a subjective feeling of emotional
well-being, as the child grows into an individual who is able to cope with
stressful situations and who can negotiate the expression of emotion within a
social context (Goleman, 1996).
Another aspect that comes to the fore when studying Table 2.1 is the fact that
emotions, such as anger, potentially facilitate communication when the infant
can speak his or her discomfort to others. Responsiveness by the caretaker
could contribute to a feeling of well-being in the child. Over time, such
responsiveness could also lead to the development of sympathy for others
13
and the development of a conscience (Wicks-Nelson & Israel, 2003).
However, if the caretaker is uncomfortable with the expression of anger then
this basic emotion becomes confusing and the child might then restrict the
display of such negative emotions, or internalise his or her feelings.
Externalising behaviour might also take shape in the form of acting out in
tantrum-like behaviour (Wicks-Nelson & Israel, 2003).
Wicks-Nelson and Israel (2003) state that emotions play a role in most
behavioural difficulties, and children displaying strong emotions associated
with poor coping skills that endure over time, require professional assistance.
From the literature, three clusters of behaviour are identifiable, and
classification is based upon where this behaviour is directed. The three
clusters are known as internalising behaviour, externalising behaviour and
mixed symptoms (Wicks-Nelson & Israel, 2003). These clusters are further
related to eight syndromes commonly used to identify whether a child
experiences emotional disturbance. A summary of the behavioural clusters
and related syndromes is presented in Table 2.2.
14
Table 2.2 Eight Syndromes common to childhood measures of
emotional disturbance
Source: Wicks-Nelson and Israel (2003)
Stomachaches Impulsive
15
changes such as weight loss- or gain are also observed in children
experiencing emotional disturbance (Lewis, 1999).
16
2.2.3 The Relationship between Emotional Disturbance and Emotional
Disorders
The DSM IV-TR (2000) addresses the matter of intensity from two directions.
When making a diagnosis on the fifth axis of the DSM IV-TR, it is necessary
to rate the individual’s functioning on the Global Assessment of Functioning
scale (GAF). The GAF is a sliding scale from zero to one hundred. A score of
zero indicates severely impaired functioning to the point of being life
threatening; and a score of one hundred signifies that the person has superior
coping skills without any symptoms (Sadock & Sadock, 2003). The second
requirement in making a diagnosis with the DSM IV-TR (2000) is that the
disorder should be classified as being mild, moderate or severe. A DSM IV-
TR classification can therefore be made on varying levels of emotional
disturbance, with the most severe being an emotional disorder. Concerning
intensity, it therefore seems that emotional disorders can be viewed as
severe forms of emotional disturbance. Severe emotional disturbance is
present in only a small percentage of the population (Davidson & Neale,
2001).
The purpose of the DAP: SPED, as a screening measure, would require the
identification of a spectrum of emotional disturbance, including clinically
diagnosable features of emotional disorders. Although the DAP: SPED should
be able to identify most cases of emotional disturbance (including the
disorders), it should not necessarily be able to classify the intensity of
17
emotional disturbance. This would be the function of a diagnostic tool
(Murphy & Davidshofer, 2001). A diagnostic tool, such as the Beck Youth
Inventory (BYI-II), is based on the criteria of the DSM IV-TR (2000) and
measures the intensity of emotional disturbance.
Emotional disorders
↓
X----------------------------------------------------------------------X
Emotional Disturbance Emotional Well-being
18
2.3 Contributing Factors to Emotional Disturbance in the South African
Context
Lewis (1999) described three different kinds of stressful events that could
potentially lead to emotional disturbance. The first is stressful situations.
Stressful situations tend to be short lived and include exams, being fetched
late from school or pressure to perform well. Children are generally able to
cope with stressful events in small measures, but this becomes increasingly
difficult when the stress is continuous and intense. Crises, the second type of
stressful event, are more intense and more difficult to cope with, although
these are not necessarily negative experiences. This type of event often
offers new opportunities for growth and development. Crises include events
like starting school, parental divorce or even the transition into adolescence.
Lewis (1999) stated that because every child is different, what would be a
crisis for one could merely be a stressor for another.
The third type of stressful event is trauma. Trauma differs from the other two
types of stressful events in that it is experienced as horrifying and unexpected
(Lewis, 1999). Trauma victims might also experience intense feelings of being
out of control as well as a fear for personal safety or the safety of a loved one.
Examples of trauma include natural disasters, human-caused disasters as
well as intentional or unintentional violence (Lewis, 1999). Although trauma
19
places severe stress on the coping skills of individuals, continuous or multiple
crises could have the same effect and lead to emotional disturbance.
At present, the three types of stressful events are occurring with varying
frequency and intensity all over the world. In South Africa, however, the social
and economic problems currently experienced are of such a nature that
increasingly strains adults and children. Add to this the alarming increase in
violent crime and its associated trauma, and the inference is that South
Africans are at high risk of experiencing emotional disturbance.
2.3.1 Poverty
20
Another way in which poverty contributes to emotional disturbance is that it
negatively affects families’ ability to sustain their basic needs like food and
shelter. Malnutrition and hunger influence children’s ability to concentrate in
school and this in turn impacts on school performance (Phillips, 2002).
Malnutrition has the long-term effect of making children more prone to illness,
which weakens cognitive functioning as well as impairs physical, social and
psychological development (Phillips, 2002).
From the discussion, it is clear that children are especially vulnerable to the
effects of poverty. Living below the breadline places continuous stress on
members of the household. Children feel the effects of not having adequate
food and shelter, and in addition are often exposed to the worst of parental
stress. Furthermore, circumstances might compel some children to leave
21
school in order to supplement the family’s income. Poverty therefore
contributes to emotional disturbance in children who are not educated and
become progressively more disadvantaged (Shell, 2000). For example,
uneducated children are unable to compete in the job market with educated
applicants (Richter, Griesel, Durrheim, Wilson, Surendorff & Asafo-Agyei,
1988). In addition, some schools also offer resources such as counselling and
sometimes run a feeding program for the poorest children. Children who
leave school to help the family will not have access to these resources.
22
children however, still have the responsibility of taking care of the household
(Conradie, 2003). To provide for the family, children are consequently driven
into a life of crime. This is very dangerous for children who are then at
increased risk of becoming victims of crime (Conradie, 2003).
23
distinction between direct and indirect exposure to violent crimes (Rudenberg,
Jansen & Fridjohn, 2001).
Indirect exposure, on the other hand, would include any encounter through
conversations of friends and family or watching the news on television (Lewis,
1999). Indirect exposure to violence is insidious as it increases stress levels
and could lead to long-term negative changes in perception and attitude. This
in turn affects emotional well-being (Lewis, 1999). Aggressive play,
avoidance, phobias and intrusive thoughts are linked to exposure to violence
(Barbarin & Richter, 2001).
Ward et al. (2007) found that both direct and indirect exposure to violence
lead to internalising behaviours, such as anxiety and depression, but that
direct exposure additionally leads to conduct problems. They reported that
anxiety and depression relate to concentration problems, while conduct
problems, including aggression and substance abuse, put children at further
risk of being exposed to violence. Furthermore, relentless exposure to
violence intensifies this effect (Ward et al., 2007).
24
disturbance was present. Further confirmation of emotional disturbance is
evident in the study by Kleintjes, Flischer, Fick, Railoun, Lund, Molteno and
Robertson (2006) in areas of the Western Cape where gang related activity is
rife. They found that the most common mental health disorders in their
sample of children and teenagers were generalised anxiety disorder (11%),
posttraumatic stress disorder (8%) and depressive disorders (8%).
Oppositional Defiant Disorder (6%) and Conduct Disorders (4%) were also
common.
It could therefore be said that both direct and indirect exposure to violence
impact negatively on the personal well-being of children. Indirect exposure
might be considered a constant stressor in the child’s life, while direct
exposure might be experienced as a crisis or trauma that potentially puts the
child at higher risk of developing emotional disturbance (Strydom & Strydom,
2006).
25
The discussion has thus far focused on the impact that exposure to violence
has on children, mainly in relation to adult perpetrators. However, children are
also constantly exposed to violence in schools. South African children’s
exposure to violence in schools mostly relates to criminal incidences of
intimidation and victimisation, as well as sexual assault. In this regard, Neser
(2006) found that:
There is a clear link between violent crime and emotional disturbance, and
the high incidence of such violence implies that increased levels of emotional
disturbance are probably present in South African children. Adults’
experiences of both crime and poverty contribute in varying degrees to
emotional disturbance, and in turn arguably influence the quality of parenting.
The additional stress of illness, such as AIDS, further affects the capacity of
parenting and increases the risk of emotional disturbance in children.
26
2.3.3 AIDS
In the long term, children who inherit the additional responsibility of being the
breadwinner and emotional caregiver for siblings and sick parents are at an
increased risk for emotional disturbance. School-aged children who become
the head of the household often have to leave school (Desmond, Michael &
Gow, 2000). Children who are unable to cope with such responsibility, who
lack parental nurturing similar to neglect and who have to contend with
eventual death of their parents are likely to be emotionally underdeveloped
(Whiteside & Sunter, 2000). Shell (2000) further states that the impact this
27
has on society is that children are left to fend for themselves, and become
vulnerable to sexual exploitation and factors such as crime, homelessness,
poverty and malnourishment, which in turn affects health and well-being
(Desmond, Michael & Gow, 2000; Shell, 2000; Whiteside & Sunter, 2000).
The double stress of losing parents and becoming a parentified child plunges
children into multiple crises that challenge their coping skills and therefore
puts them at risk of developing emotional disturbance.
People living with AIDS are more prone to developing depression and
experience feelings of loneliness (Shell, 2000). This could be partially due to
the nature of the illness being terminal, but also to the stigma attached to
AIDS and HIV. By association, family members might be stigmatised and
thereby become social outcasts. The sense of isolation is similar to that of
those with the illness. Panic attacks, disordered thoughts, denial, aggression
and depression are all observed reactions to the reality of being HIV positive
or having AIDS. The burden that caring for people living with AIDS puts on
caregivers includes dealing with illnesses such as tuberculosis, and perhaps
later with dementia (Shell, 2000).
28
AIDS, poverty and violent crimes are factors that contribute to the
development of emotional disturbance in South African children. Children
further become the victims of their parents’ inability to manage stresses
caused by these factors, while simultaneously having to cope with their
environment. There is a complex relationship of circular causality between
these factors. There also seems to be a synergistic effect when these factors
are combined so that multiple stressors, crises or traumas result in substantial
damaging effects on the emotional well-being of South Africans. Children
experience symptoms of emotional disturbance and this is evident in
internalising, externalising and mixed indicators such as those described in
Table 2.2 and section 2.2.2.
2.4 Conclusion
29
The symptoms of emotional disturbance are discernible in many South
African children. Some studies have shown that South African children are at
higher risk for developing emotional disturbance due to the distinctive set of
risk factors that compound stress, such as poverty, exposure to violence and
crime as well as the high prevalence of AIDS. Multiple incidences of trauma
and crises become overwhelming and children might experience the
subjective feelings of hopelessness, fear or exasperation. The process of
negotiation between environment, in terms of coping and socialising, and the
management of overwhelming feelings becomes unbearable for children and
negatively affects their ability to cope with daily demands.
The children exposed to these circumstances in South Africa are mostly also
the children with the least access to resources, such as health care. A
measure that can quickly and inexpensively assess emotional disturbance in
children, and that is administrable by a wide variety of professionals could be
useful to address one component of this problem. One version of the Draw-A-
Person has had some success in identifying emotionally disturbed children in
South Africa, and indicates that a measure such as the DAP: SPED could be
useful in the south African context (Williams, 2001). Chapter 3 will focus on
the potential that the DAP: SPED has for identifying South African children
with emotional disturbance.
30
CHAPTER 3
LITERATURE REVIEW: DRAW-A-PERSON
3.1 Introduction
The previous chapter clarified the term emotional disturbance, and showed
how South African children might be particularly vulnerable to developing
emotional problems. In the light of the effects of economical and social
problems on psychological well-being, as well as restricted access to
resources, a large part of the South African population could benefit from
quick psychological interventions. This chapter examines the Draw-A-Person
(DAP) as a potential instrument to address this need. A brief look at the
history and development of the DAP will show how it was a forerunner to the
development of the Draw-A-Person: Screening Procedure for Emotional
Disturbance (DAP: SPED).
31
split and this resulted in quantitative and qualitative scoring methods. The
highlights in development of the DAP are represented in Figure 3.1.
32
Cooke & Ricci recognise link between cognitive development and
drawing skill in the 1800’s
Cognitive
Branch
Draw-A-Man
Projective Branch
(DAM)
Goodenough Humans are the most popular
Qualitative
Quantitative
Harris revises to create
the Goodenough-Harris
Drawing Test (GHDT) Machover
1963 Uses
Human Figure
GHDT
Drawing (HFD)
(1949)
Koppitz
Draw-A-Person: Quantitative Scoring 1968
System (DAP: QSS) Other
interpreters
Naglieri 1988
including
Hammer
(1960) and
Jolles
33
3.2.1 The DAP as a Cognitive Measure
The origins of the DAP dates back to the 1800s when Cooke and Ricci
respectively observed that children's drawings change as they develop
(Naglieri, 1988). The DAP in its earliest form was developed in 1926 by
Goodenough, who realised that the increased sophistication in children’s
drawings was related to cognitive ability rather than to chronological age
(Goodenough, 1926). Based on this observation she developed the Draw-A-
Man test to quantitatively measure cognitive development (Harris, 1963). In
1963, Harris revised the test to produce the Goodenough-Harris Drawing Test
(GHDT) (Harris, 1963). Her motivation for revising the test was to include
norms for a wider age range. Two additional drawings of a woman and the
self allowed for greater accuracy in measurement. Recently Naglieri (1988)
updated the GHDT, which resulted in the DAP: A Quantitative Scoring
System (DAP: QSS). Children between the ages of six and seventeen years
were included from different cultures and socio-economic status. The norms
were updated and the categories were refined into half-year and quarter-year
intervals. This resulted in 21 age categories to keep up with rapid
development at certain ages.
The rationale behind the DAP as a cognitive measure relates to the belief that
developmental changes in children’s drawings are concurrent with cognitive
development and that changes in children’s drawings confirm this
development (Jacobs & van der Merwe, 1992). Developmental stages that
are universally recognisable in children’s drawings encapsulate this
development (Golomb, 2002). It is also believed that these changes remain
stable over generations (Di Leo, 1970).
34
Table 3.1 Developmental Stages in Drawing
Note. Table compiled by author from the following sources: Arnheim (1971),
Gardner (1980), Goodnow (1977), Kellogg (1979), Koppitz (1968), and
Lowenfeld and Brittain (1969).
Stage Description
• Six months after the onset of basic scribbling a new phase emerges, namely
2. Controlled controlled scribbling.
The • Often shapes are practiced in small before they are incorporated into the
drawing. Detail is added as the shapes become familiar.
Preschematic
• Idiosyncratic symbols emerge that are not easily recognisable to the adult.
Stage:
• These symbols are not representative of reality, but reflection an inner
4 to 5 years representation of something.
• The human figure is the first recognisable symbol regardless of the child’s
nationality or background.
35
Preschematic • The human figure is often drawn in five points, namely the head and four
limbs.
stage
• The human figure and particularly the face have emotive value.
(continued)
The Symbolic • During this stage only relevant detail is added, for example the arm might be
omitted in one drawing, but when the figure is throwing a ball the arm is
Stage:
included.
5 to 6 years
• Detail might be omitted when space is limited.
• Children do not like crossing lines in drawings, and would rather omit a
detail. For example, if emphasis is placed on long hair, then there might be
no space for arms.
• This is indicative of the child’s inability to plan or think ahead at this age,
which only develops by eight years.
• Sixty percent (60%) of children draw stiff outstretched arms in this stage.
• Straight lines often symbolise elongated objects such as limbs.
• Up to the age of 5 years, drawings become increasingly conventional and
stereotypical. For example, a dog might be representative of all animals.
• Children are quite sure of their lines because of the stereotypical drawings.
• An eraser is seldom used. A child would rather restart or talk a mistake
away.
• Gender is often not depicted. If emphasis is placed on gender they will be
distinguished by the inclusion of hair or a dress.
• Figures are often slanted or upside down. If the figure starts off skew this is
not corrected.
• The bottom of the page is not seen as the ground.
• An upside down figure would only be significant from six years and older.
The Schematic • Cognitive development allows for schema to form. These are fixed ideas
about objects that are categorised.
Stage:
• These schema become automated or stereotyped, and then allow the child
6 to 9 years
to start experimenting with variations, for example, movement.
• Often exaggerations, omissions and additions are found at this stage.
• The drawings are purposeful, and the visual result is important.
• Sometimes disproportion occurs if particular emphasis is placed on certain
detail.
• Multiple perspectives might appear due to the stereotypical nature of the
36
object.
Schematic
• By 6 years understanding of direction develops, so that the ground is
stage
represented by a baseline. By 8 to 9 years, a baseline is present in 90% of
(continued) drawings. A skyline is also often present.
• The ground line is taken as the nearest point of reference (e.g. a person
climbing a hill is drawn at 90º to the hill rather than the ground).
• X-ray vision or transparencies might occur due to poor planning ability.
• Small figures might be a result of the development of the hand muscles.
• From the age of seven years, spontaneous drawings decline and humans
are drawn less frequently.
The Emergence • Developmentally there is a growing awareness of the child’s place within
society.
Of Realism:
• Previous schemas are no longer adequate to express these new
9 to 11 years
relationships.
• The baseline in drawings acquires a new dimension as the child realises that
detail underground can be filled in.
• Multiple baselines might appear.
• The skyline tends to disappear.
• Some overlapping might occur (e.g. a tree might overlap with the sky). This
implies that multiple relationships between objects is understood.
• Exaggeration occurs less frequently.
• Children want to draw correctly at this age, and can be pedantic and literal.
• X-ray drawings are considered unnatural and therefore disappear in this
stage.
• Children still do not draw what is seen, but draw their experience of reality.
However, they do become aware of techniques that can be used to create an
effect.
37
rate of cognitive development. Simultaneous representation from consecutive
stages might co-occur as the child develops, emphasising the fluidity of the
categories (Di Leo, 1970).
Table 3.1 showed that development in drawing reflects cognitive and physical
development and that this forms the basis of cognitive measures like the
Goodenough-Harris DAP and the DAP: QSS. Another important aspect that is
highlighted in Table 3.1 is that certain inclusions, omissions or peculiarities
might be characteristic of drawings at a certain level of development.
38
assessment when the assessor had no knowledge of what is ordinary or
unusual in drawings. Therefore, the benefit of having such guidelines as
those described in Table 3.1, is that children’s drawings can be judged within
the context of normative development. Further discussion about this issue
follows in section 3.2.2.
As described above, the early history of the DAP reflected the idea that
development in children’s drawings was related to cognitive ability. Research
showed that as their cognition developed and their experience as well as their
knowledge increased, children's concepts of people were reflected in their
drawings (Malchiodi, 1990). There were two main schools of thought in regard
to the interpretation of these concepts of people in drawings, namely as a
cognitive maturity measure or as a projective measure (see Figure 3.1). The
main promoters of the projective measure at the time were Machover, Levy,
Hammer and Jolles (Koppitz, 1968). One of the projective uses of the DAP
was for the assessment of emotional disturbance. A succinct look at the
projective use of the DAP will give some insight into the rationale for
developing a quantitative scoring measure for emotional disturbance and how
this is related to developmental changes in children’s drawings.
The rationale behind the projective use of drawings is based on the belief that
information about underlying attitudes and feelings is communicated non-
verbally. Trevisan (1996) stated that the feelings, beliefs and current mental
state of the drawer would influence the outcome of what is drawn, and as a
result, one could say that the child projects into the picture. La Roche (1994)
goes a step further and refers to the projective nature of drawings as the
creation of a ‘visual vocabulary’. Both authors seemingly agree that
communication is taking place through projection in drawings. Gardner (1980)
furthermore identified a group of children who consciously seem to tell a story
through their pictures, and named this group Dramatists (see Table 3.1).
39
The specific use of the Human Figure as the projective subject is embedded
in the widespread belief that because children across the world so commonly
draw the human, it is a familiar and non-threatening subject (Golomb, 2000).
The familiarity allows for projection, as the child is not focused on
implementing cognitive constructs to an unfamiliar subject. Both Machover
and Koppitz (1968) believed that drawing of a person would always be a
reflection of the child’s inner representation of the self as that is what the child
knows best. This inner representation is subjective and not meant to reflect
reality. As Jacobs and van der Merwe (1992) explain, it is rather a perception
of reality that is reflected in drawings (also see Table 3.1).
Di Leo (1973) echoed this sentiment. He believed that drawings are not
supposed to be representations of anything, but rather say something about
what is drawn. Although Di Leo (1973) proposed that drawing a person alone
(and not in relation to the family, for example) relies on a concept or schema
of the body image, he also thought that cognitive representations would
change. The schema or stereotype would differ in expression, depending on
where the child’s attention was focused. Di Leo (1973, p. 36) stated that:
40
small figures, and self-deprecating drawings and statements (Malchiodi,
1990).
While authors place different nuances on the meaning of the Human Figure,
there seems to be consensus that drawings serve as a tool for
communication. The manner in which this tool is utilised reflects each author’s
underlying belief. The following two sections will focus on the two main
differences that have emerged over history, namely the qualitative and
quantitative employment of the DAP (also refer to Figure 3.1).
41
with thousands of clinical patients, as well as the psychoanalytical approach
that informs her work, formed the basis of her interpretation of these symbols.
Though Machover’s (1980) main aim was to find personality types, much of
her work with the GHDT identified symbols that pointed to emotional
disturbance. These specific symbols are known as emotional indicators. It is
possible that Machover recognised underlying emotional issues because she
worked with an adult clinical sample with mild to severe levels of emotional
disturbance. Furthermore, the psychoanalytic theory that underpins her work
views personality in terms of habitual defences for emotional pain
(Rudenberg, Jansen & Fridjohn, 1998). This theory is simultaneously a
personality theory, as well as a theory that explains the aetiology of emotional
disturbance (Sadock & Sadock, 2003).
42
• Abnormally long arms could indicate excessive demands for affection.
• Fingers drawn without hands tend to indicate infantile aggression, while
mitten type hands could also indicate sporadic bursts of aggression when
there are evasive tendencies.
43
An added concern is that the qualitative interpretation of drawings is
particularly susceptible to disregarding the age and developmental stage of
the artist. This is the unease that Kellogg (1970) was communicating when
she said that interpreters generally do not have enough experience with
normative drawings of children to differentiate between an emotional indicator
and a developmentally determined factor in a drawing. This is a very
important issue, as emotional indicators are sometimes similar to normal
developmental occurrences in drawings. For example, transparencies, or X-
ray vision, might be common in drawings of children between six and nine
years. This is because the child has inadequate planning skills, and draws the
body first and then dresses it (see Table 3.1). However, transparency as an
emotional indicator means that the drawer has lost touch with reality (Blau,
1992). The inclusion of a baseline is also normal for children at this age, as
their perception of relationships between objects is emerging. Yet, a baseline
as an emotional indicator signifies feelings of insecurity or a need to be
grounded (Blau, 1992).
In defence of Machover, her work was based on an adult clinical sample, and
she aimed to find personality traits in adults (Machover, 1980). Therefore,
developmental stages in drawing would be less relevant. When an interpreter
chooses to use Machover’s indicators to measure children’s drawings, it
would constitute misuse of her work. One could therefore say that indicators
of emotional disturbance in drawings are only relevant once the drawer has
passed a certain stage of drawing development.
44
3.2.2.2 Quantitative Interpretation of the DAP as a Projective Technique
The rationale behind the quantitative scoring of the DAP is based upon the
belief that the occurrence and quantity of emotional indicators sufficiently
illustrate whether emotional disturbance is present. This eliminates the need
for linearly interpreting emotional indicators while still acknowledging their
presence. Another important aspect of a quantitative measure is that it is
scored normatively. In her measure, the Human Figure Drawing (HFD),
Koppitz (1968) standardised the scoring of emotional indicators by
establishing the frequency with which certain indicators could be expected at
certain stages in development.
45
example, repeatedly draw certain parts, but never actually improve on the
drawing. Other indicators of overall quality are disorganised features and
include confusing or hard to identify body parts or the drawing of a
disembodied head (where the body is omitted) (Malchiodi, 1990). Di Leo
(1973) also found that scattering of body parts, scribbling over the drawing,
rigid figures, excessive shading, small figures and figures drawn at the bottom
of the page as though for support, related to the overall quality of the drawing
and therefore were indicators of emotional disturbance. According to
Malchiodi (1990), disorganised art is particularly alarming as it could be
indicative of overwhelming emotions expressed in a vulnerable state.
The second and third categories include specific unusual features and
omitted features, such as too small or large head, cut off arms or hands, teeth
or clinging arms (Oster & Gould, 1987). These indicators are similar to the
qualitative indicators discussed in 3.2.2.1.
Various studies on the use of the HFD include children with learning
disabilities (Eno, Elliot & Woehlke, 2001) and sexual abuse or maltreatment
(La Roche, 1994; Webster, 2000). La Roche (1994) compared various
authors’ findings and found much evidence in support of the HFD. He found
that the drawings could be invaluable for early detection of emotional distress
(La Roche, 1994). Based on the early success and popularity of the HFD,
Naglieri, McNeish and Bardos (1991) were inspired to develop a new
measure for emotional disturbance.
It has been shown that the DAP has evolved over more than a century, firstly
as a cognitive measure, and then as a projective measure. Similarly, the
development of the DAP: SPED is founded on a predecessor which is a
cognitive measure. Following the development of the DAP: QSS, Naglieri,
McNeish and Bardos (1991) used the same data to develop a measure for
emotional indicators. (The rationale for using the same data is unknown,
46
although it was presumably economical to use existing data.) They wanted to
create a standardised measure for the emotional screening of children such
as Koppitz’s (1968). Because Koppitz’s measure was developed three
decades previously, Naglieri, McNeish and Bardos (1991) wanted to construct
a measure for emotional disturbance that was more relevant for modern day
use and had updated norms. This resulted in the development of the DAP:
SPED. McNeish & Naglieri (1993) reported that, compared with previous
research findings, the DAP: SPED showed a significant improvement of the
Koppitz (1968) qualitative scoring system.
From the literature reviewed in this chapter, it seems that the DAP: SPED will
be beneficial for use in the following ways:
47
• The fact that the measure is standardised eliminates the need for the
assessor to have in-depth knowledge of normative drawing development.
• Standardised scoring improves the reliability.
• Standardised scoring allows the DAP: SPED to be administered by a wide
range of trained professionals (Naglieri, McNeish & Bardos, 1991).
3.4 Conclusion
Over the greater part of the last century the DAP has been identified as an
accurate and fast assessment tool for intelligence (Jacobs & van der Merwe,
1992). A quantitative scoring procedure resulted in a user-friendly measure,
and had the benefit of showing inter-rater reliability (Harris, 1963). To date
this measure has also been popular for assessing emotional problems, and
authors like Machover (1980) and Koppitz (1968) researched the occurrences
and meanings of emotional indicators in drawings. While Machover used an
interpretive system for adults, Koppitz preferred a quantified system for
children. Recently Naglieri, McNeish and Bardos (1991) created a
standardised quantitative scoring system for emotional indicators in drawings
with several benefits: A standardised measure has the benefit of providing
results with reduced influence of the assessor’s frame of reference in
interpretation and are therefore more objective. Quantitative scoring also
allows for faster and more accurately obtained results. Furthermore, different
norms for different age groups allow for developmental differences in
children’s drawings, which are not always familiar to the therapist (see
3.2.1.1).
A screening measure with good validity and reliability such as the DAP: SPED
is invaluable as a tool for people in the helping professions, especially in
South Africa. As discussed above, the non-verbal nature of the measure
48
allows for children from any language to successfully use this test. The
universality of the human figure and the frequency with which it is drawn
allows for projection on a familiar and emotive subject. Because the human
figure is drawn so frequently, it has been possible to establish norms for norm
referenced evaluation. These benefits suggest that the DAP: SPED would be
useful for assessing emotional disturbance in South African children.
In order to ascertain whether the DAP: SPED might be useful in the South
African context, certain statistical procedures need to be performed and then
reported on. Chapter 4 will focus on the procedures of this research process.
49
CHAPTER 4
RESEARCH METHODOLOGY
4.1 Introduction
In 1.3, the purpose of the present research was stated to ascertain whether
the DAP: SPED is useful as a non-verbal screening measure for emotional
disturbance in South African children. The DAP: SPED was found to be valid
within the American context across cultures, in varying degrees of
urbanisation and rural areas, as well as across economic strata (Naglieri,
McNeish & Bardos, 1991). No literature revealed the applicability of the DAP:
SPED in South Africa, and thus the current study set out to examine the
usefulness of the DAP: SPED locally.
To attain this primary aim, secondary aims were identified, and these
included:
50
measure such as the BYI-II, which has been statistically proven to
measure emotional disturbance, would provide information about how
effective the DAP: SPED is in identifying children with emotional
disturbance. The following hypothesis was proposed to verify that this
secondary aim is met:
H o : There is no correlation between the scores of the DAP: SPED and the
BYI-II in a sample of Grade 4 South African learners.
H 1 : There is a significant correlation between the scores of the DAP:
SPED and the BYI-II in a sample of Grade 4 South African learners.
51
disturbance. Current levels of emotional disturbance of the sample group
could therefore be recorded and described.
4.4 Sampling
52
more representative sample might therefore not have had access to gratis
psychological services.
Considering that the intended use of the DAP: SPED in the local context
would be as a group screening measure in schools, the target group was a
sample of children from local schools. Therefore, even though the sample
might not be representative of the population at large, the DAP: SPED was
applied in terms of its intended use. One could thus argue that the results
obtained in the current research will reflect how effective the measure might
be in identifying learners at any given Model C school. Furthermore, despite
using a sample of convenience, participants had to meet certain criteria to be
included in the sample. This was necessary since the measures used
imposed certain limitations.
On the one hand the DAP in general has shown to be more accurate on
younger test-takers (Matto, 2002), while on the other hand the BYI-II requires
older children with English reading proficiency (Beck, Beck, Jolly & Steer,
2005). This inferred that the children included in the sample should be neither
too young, nor too old to comply with the best circumstances for both
measures. The result was that Grade 4 learners were selected for fulfilling
these prerequisites.
Two model C schools provided the pool of Grade four learners for the sample
selection. One inclusion criterion for participating schools was the medium of
education, namely English. Despite the fact that English was not the mother
tongue of most of the learners, the fact that these children were schooled in
English indicated that they should have mastered a basic level of competence
in English, especially at a Grade 4 level. In this regard, Foxcroft and Roodt
(2004) are of the opinion that testing is often more advantageous in the
schooling language rather than the mother tongue, especially when test items
resemble school tasks. This is due to the different degrees of westernisation
influencing the linguistic purity of the mother tongue. The school principals
53
assured the researcher that the participating learners would be sufficiently
proficient in English to understand and respond appropriately to the
requirements of the testing situation.
Other inclusion criteria that guided the selection of the schools were the
proximity to the researcher, as well as their willingness to participate. The
schools are located in suburban areas in the Ekhurhuleni Metropolitan
Council, near the researcher’s home and place of work. These areas are
broadly lower middle class, although many of the children come from a low
socio-economic background. Both the schools were Model C schools,
meaning that they are open to the public and integrate children from all races
and socio-economic status. These schools are likely to be similar to other
schools in suburban areas across the country.
The original sample size of the research was 129 Grade 4 learners, of which
66 attended school A, and 60 attended school B. Four learners’ scores were
eliminated from the data analysis, due to reasons discussed in 4.7. The
realised sample was therefore 125 learners. Figure 4.1 shows the gender
composition of the realised sample.
54
Gender Composition of Sample
70
68
66
Amount of learners
64
62
Boys
60
Girls
58
56
54
52
50
Gender
The ages of the participants varied between nine and twelve years. There is
an age difference of three years and one month between the eldest and
youngest respondents. The average age of respondents was ten years and
two months, and the median age was slightly less at ten years and one
month. Most respondents were between the ages of nine years and seven
months and ten years and eight months. Figure 4.2 shows the ages of the
respondents in the sample.
80
70
60
Amount of learne
9yrs
50
10yrs
40
11yrs
30
12yrs
20
10
0
Age
55
The learners were all educated in English, even though for most this was not
their mother tongue. Home languages varied from Zulu, Sotho and Xhosa to
Portuguese. The majority of learners resided in the immediate area around
the school, although some travelled from townships around Gauteng. A large
majority of learners’ parents in school B were domestic workers in the area.
This supports previous notions that many of these learners are from a low
socio-economic background.
Measurement instruments that were applicable to achieving the goals set for
the study and testing the hypotheses were implemented. The first measure
was the one being investigated, namely the DAP: SPED. The second
measure, namely the Beck Youth Inventory (BYI-II), was employed as it is
believed to be a reliable measure for identifying children with emotional
disturbance. As a result the DAP: SPED could be compared to this measure
to establish the latter’s efficacy. A third measure, the Qualitative
Questionnaire for Observed Behaviour in the Classroom (QQOBC), was
developed and employed to triangulate the data and to give depth to the data
obtained from the two quantitative measures. A detailed discussion of the
benefits and limitations of these measures follows.
The DAP: SPED is based on three drawings, namely that of Man, Woman and
Self. These drawings are quick to administer (5 minutes per drawing). The
measure has been standardised in America for children aged six to seventeen
years. The DAP: SPED is suitable for group administration, which allows for
multiple concurrent assessments.
The standardisation sample yielded the following results for the psychometric
properties of the DAP: SPED:
56
• Internal consistency was measured with Cronbach’s alpha coefficient and
values between .67 to .78 (p<.001) were reported for the three age groups
(Naglieri, McNeish & Bardos, 1991).
• Inter-rater reliability was measured with Pearson's product-moment
correlation coefficient and it was found that r >.90 (Naglieri, McNeish &
Bardos, 1991).
• Intra-rater reliability was measured with Pearson’s product moment
coefficient and it was established that r >.90 (Trevisan, 1996).
• The DAP: SPED was able to distinguish between individuals who belonged
to emotionally disturbed and non-disturbed groups. A significant chi-square
(5.01, p < .05) was obtained, which indicated that the DAP: SPED
classifications were correlated with group membership (McNeish & Naglieri,
1993).
On studying the literature regarding the DAP: SPED, the following limitations
were noted:
• Information regarding observed behaviour during group assessment is
lost. For example, it is difficult to know whether a child might have
hesitated before drawing, or if they drew carefully or carelessly. Koppitz
(1968) believed that this information is as valuable as the finished product.
• The official answering booklet does not lend itself to group administration,
as the scoring criteria are listed on the booklet. The scoring criteria are
therefore readily accessible to the test-takers and could possibly influence
the drawing.
• All previous validation studies were conducted by individuals who are
apparently affiliated with Naglieri, as is evident from co-authored articles.
Limited independent research has not been able to replicate these
promising findings (see 6.2 and 6.3). The lack of additional independent
research is a possible limitation in the validation studies of the DAP: SPED
thus far.
57
4.5.2 The Beck Youth Inventories of Emotional and Social Impairment
(second edition) (BYI-II)
58
aged seven to eighteen years in America and can be administered in group
settings (Beck et al., 2005).
Although the BYI-II has not been standardised for South African use, Jooste
(2004) stated that it is a reliable tool for research purposes. Moreover, the
criteria for diagnosing childhood disorders in the DSM IV-TR (2000) provided
the criteria for the items of the BYI-II subscales. It follows then that as the
DSM IV-TR (2000) is widely used as a diagnostic tool in South Africa, that the
BYI-II would be equally usable in South Africa. In order to ascertain how
reliably the BYI-II could be used in South Africa, some of the psychometric
properties were studied from the data obtained in the sample group. The
following results were found:
59
4.5.2.1 Results of the BYI-II in a South African context
Internal reliability for the BYI-II was calculated within the five individual
subtests, as well as between them. Because the data of the BYI-II was
normally distributed, the internal reliability for each of the five subscales was
determined using Pearson’s reliability coefficient. Values close to 1.0 show a
high correlation between these subtests, while values close to -1.0 indicate
that there is a strong negative correlation. Values around 0.0 show that no
correlation is present. The results are charted in Table 5.5.
These reliability coefficients are high and indicate that there is good internal
reliability for the individual BYI-II subtests.
60
Table 4.2 Pearson’s Correlation Coefficients between Subscales of the
BYI-II
Pearson
BSCI BAI BDI BANI BDBI
Correlation
From Table 4.2 above, it is clear that the subscales of the BYI-II are
significantly correlated. The four subscales of the BYI-II that are related to
syndromes of emotional disturbance (i.e. the BAI, BDI, BANI and BDBI) are
positively correlated. This means that high scores on one of these four
subscales would likely coincide with high scores on one of the other four
subscales for emotional disturbance. The strongest correlation seems to exist
between the depression (BDI) and anger (BANI) subscales (r = 0.689 at p>
0.01) (N= 125). The weakest relationship between the emotional disturbance
subscales exists between the anxiety (BAI) and behavioural (BDBI) scales (r =
0.345 at p> 0.01) (N=125).
A negative correlation exists between the BSCI and the other four subscales.
Negative correlations suggest that an increase in value from one variable
would signify a decrease in value for the other variable (Field, 2005).
Therefore, a high score on the BSCI would correlate with a low score on any
of the other four subscales of the BYI-II, and vice versa. In practical terms,
this indicates that the higher a learner’s self-concept, the less likely that
learner is to show signs of emotional disturbance. The correlation is stronger
61
between the self-concept scale (BSCI) and depression (BDI), anger (BANI)
and disruptive behaviour (BDBI) subscales than between the BSCI and the
anxiety subscale (BAI).
The reliability scores for the BYI-II supports Jooste’s (2004) claim that the
measure can be used reliably in the South African context. All of the
subscales demonstrated that they have good internal consistency.
Furthermore, the good correlation between the subscales illustrate that all of
the concepts measured are closely related to the same construct, namely
emotional disturbance.
Within the limited scope of the present study, these positive results suggest
that the BYI-II is a good independent measure of emotional disturbance within
the South African sample. It could therefore be used with relative confidence
for comparison with the DAP: SPED.
Despite the good reliability and validity on the BYI-II, some limitations were
noted. These limitations affected the sampling criteria, as mentioned in 4.4, as
well as the administration procedure.
• The BYI-II is limited in its use to English-speaking test takers. It requires
the respondent to be able to read English at a minimum of Grade 2 level
and comprehend what has been read, as well as understand the format in
which to answer (Beck et al., 2005). In a group setting the BYI-II can
therefore only reasonably be used with confidence with children who are
literate and who are able to follow instructions.
• The five scales take a combined time of up to 50 minutes to complete,
depending on reading proficiency. It therefore requires sustained
concentration from children.
• There is no social desirability scale incorporated in the measure. As
discussed in 2.2.1, disclosure of emotional disturbance is often
discouraged by caregivers and society. This might inhibit full disclosure of
62
emotional status (fake good). A respondent might for example deny
stealing behaviour because it is socially unacceptable, even if the
behaviour is present. On the other hand, a respondent might also be
motivated to exaggerate symptoms in order to get attention (fake bad).
Jooste (2004) stated that extremely low scores are probably exaggerations
and can be linked to histrionic or borderline personality disorders.
Although Jooste (2004) was referring to the Beck Depression Inventory for
adults, the same might be true of the BYI-II. Interpreting the answering
style qualitatively and focusing on specific answers might provide valuable
information regarding these tendencies. For example, if a respondent has
a tendency to downplay most of the negative symptoms listed, but has a
low self-esteem; one might expect that some form of social desirability is
present. However, this is not an accurate gauge, and caution in its
application is advisable.
63
authors recommend that careful observations be made (Foxcroft & Roodt,
2004; Riethmiller & Handler, 1997). This is true in any assessment situation,
and especially valuable for non-verbal measures.
The measure was constructed exclusively for the purpose of the present
study. As it was not central to ascertaining the usefulness of the DAP: SPED,
no psychometric properties were ascertained beforehand. The questionnaire
was kept as short as possible to ease the workload on the teachers, as well
as to keep the data analysis uncomplicated. The content of the questionnaire
was based on the five scales of the BYI-II to provide easy comparisons
between the two verbal measures.
One could argue that psychological terms on the measure would be too
technical in nature, and that individual understanding of states such as anxiety
could differ dramatically between raters. To minimise the effect of technical
jargon, psychological terms were described by using other terms such as
nervous, fearful and worry.
Data collection happened in two phases. The first phase entailed testing the
participating learners at the two schools respectively. The second phase
involved distributing the QQOBC to relevant teachers and subsequently
collecting the measure. The following procedure was followed during Phase 1:
64
1. The school principles were approached and permission was obtained to
conduct the research at their schools.
2. Letters were sent out to the parents informing them of the research and
requesting permission to include their children in the research (see
Appendix B).
3. Research assistants were appointed. One was an intern psychologist and
the other was a psychometrist. Training consisted of working through the
manual of the DAP: SPED, as per recommendation of Naglieri, McNeish
and Bardos (1991).
4. Testing started at approximately 08h30 and continued for one hour.
5. The DAP: SPED was administered first. According to Jooste (2004) the
non-verbal nature and the familiarity of the task helps to put test-takers at
ease.
6. Learners were given five minutes for completing each drawing, in
accordance with the administration procedure (Naglieri, McNeish &
Bardos, 1991).
7. The BYI-II was administered next. The instructions and individual items
were read aloud to account for different levels of reading proficiency.
8. Debriefing for participating children took place by allowing individual time
with the administrators at the end of each testing session. Children were
given the opportunity to ask questions, and the administrators advised the
children appropriately (see 4.9.2).
65
types of behaviours. Learners were therefore given the opportunity to
change answers once the questionnaires were completed.
• Although only five minutes were allocated for each drawing, during the
collection, some learners continued to draw until the last possible moment.
With more resources, interesting information regarding these learners could
have been obtained. For example, it would have been good to know
whether these learners had higher levels of anxiety than other learners who
completed their pictures on time.
Phase 2 involved distributing the QQOBC to the relevant class teachers and
then collecting the data from them once the questionnaires were completed.
Another participant was excluded from the sample post facto, as there were
significantly more omissions (about two items per page) on the BYI-II than the
other learners showed. The instructions clearly require participants to answer
every question, but two omissions per page are permissible (Beck et al.,
66
2005). As seven omissions for 100 questions were recorded for this learner,
compared with an average of two per 100 questions for other learners who
had omissions, this respondent’s data was excluded. The participant in
question identified herself and informed the researcher that she had a vision
deficit that might explain the high frequency of omissions.
The last participant whose responses were excluded was a boy who did not
understand how to respond to a Likert scale, and therefore placed all answers
in the margin of the booklet. His answering style therefore invalidated his BYI-
II. The boy’s behaviour suggested that mild mental retardation might be
present. The elimination of these four participants resulted in a final sample
size of 125 learners. (The data from these four learners were excluded from
Figures 4.1 and 4.2.)
During the scoring of the DAP: SPED, it became apparent that within the
standardised scoring system of the DAP: SPED subjectivity was a more
prominent issue than anticipated.
The two trained research assistants that helped with administration of the
measures, helped with the scoring of the DAP: SPED. The research
assistants were trained in accordance with the DAP: SPED manual (Naglieri,
McNeish & Bardos, 1991). This involved working through the manual and
doing practice examples. This was the best training available in the absence
of training courses in South Africa, and was sufficient according to Naglieri,
McNeish and Bardos (1991).
Both scorers marked all of the drawings. As they had no stake in the outcome
of the present research, they were considered to be objective scorers. The
research assistants received monetary compensation after the fact for all their
help.
67
During the scoring process, it became evident that the scoring guidelines
stipulated by the manual of the DAP: SPED were broad in order to be
inclusive. The result was that some accuracy was lost, and scoring criteria
were insufficient for clear decisions with some drawings. As explained in
3.2.2.1, objectivity in rating drawings is always in danger when there is room
for subjectivity. Therefore, the manual was consulted carefully again so as
uphold the standardised scoring system. After close but unproductive
consultation of the manual, the researcher contacted Naglieri (J.A. Naglieri,
personal communication, April 8, 2008). This communication offered little
insight into the specific scoring difficulties and how to deal with these
discrepancies, as Naglieri referred the researcher back to the manual. The
lack of clarity regarding scoring necessitated decision-making based on
logical reasoning and past literature on the DAP and Human Figure Drawings.
Therefore a subset of scoring rules was created for the South African sample
so that scoring would be standardised. Table 4.3 provides an outline of
ambiguous scoring criteria and the scoring decisions applied to the data in the
current study.
68
Table 4.3 Discrepancies in the Scoring System
12. Lettering and Scored when any letters or numbers appear outside of Learners were asked to supply an Names on the pages were scored
Numbering the figure on the page identification number on the front of as additional lettering.
the page. They often included their
names, even though the instructions
clearly indicated not to do so.
16. Failed “Failed Integration is scored if any of the following are These instructions are unclear. This item was scored when:
Integration present but not attached: (a) body parts are attached in the
a. Head is attached to neck or top of torso. incorrect place (e.g. arms too low
b. Two arms (one if in profile) are attached to the top half of down on the torso, or arms
the torso (above the halfway mark in the vertical sprouting from the head) or (b) if it
measurement of the torso or bottom of the dress). The seemed that the body parts were
torso extends from the top of the trunk, where it meets the not securely attached.
head or neck, to the bottom, where it meets the legs or
crotch.
c. Two legs (one if in profile) are attached at the bottom of
the torso (below the halfway mark in the vertical
measurement of the torso or bottom of a dress). “
69
Table 4.3 Discrepancies in the scoring system (continued)
18. Restart Any attempt at restart should be Item 29: Crotch Erasure. Refers to erasure and Restart was only scored for erasures
scored, including erasures and could be confused with restart. Erasures below above the torso.
abandoning a first attempt. the torso are scored as crotch erasure.
20. Hair omitted Any attempt at drawing hair, including Some drawings showed no hair on the head, as This item was scored when no facial hair,
facial hair is recognised and therefore the head was covered with a hat. However, it is including eyebrows and eyelashes were
not scored. possible that hair was inferred by the drawer, but visible.
that no space was available to draw hair.
24. Torso omitted When any of these body parts are In the event of only a head being drawn, every The items were scored according to the
25. Arms omitted omitted, the respective items are one of these items should be scored. The instruction, although hands cut off were
scored. premise of scoring hands cut off when there are excluded when there were no arms.
26. Fingers omitted
no arms seems incorrect.
45. Hands cut off
30. Crotch shading An area is shaded when additional Pockets on pants and zip detail, as well as brand Any additional lines were scored. This
32. Feet shading lines are present such as reinforced names on shoes are debatably shading. could possibly lead to an
pencil strokes or patterning on overrepresentation of these items.
material.
70
Table 4.3 Discrepancies in the scoring system (continued)
34. Vacant eyes Empty circles for eyes are Eyes with an iris but without a pupil look vacant, Vacant Eyes was only scored when there was
scored. although they technically were not. no iris or pupil drawn in.
49. Aggressive Any aggressive symbols What constitutes as an aggressive symbol is open The presence of muscles was not scored.
symbols are scored. This includes to interpretation. For example, the “Bad Boy” logo
profanity and weapons has angry looking eyes. In other pictures muscles
such as guns. were drawn and clothing torn to reveal these
muscles. Not all muscles seemed to depict
aggression, as some apparently represented
masculinity.
52. Monster Non-human figures and Some figures were human, but gave the impression Only drawings that were clearly non-human,
monsters should be of being scary. such as a figure drawn like the cartoon
scored. character “Spongebob Squarepants”, were
scored.
71
Table 4.3 Discrepancies in the scoring system (continued)
55. Uniformed Any depiction of a uniform Girls often drew brides. These were usually only Pictures that were clearly recognisable in
figure such as a policeman or a recognisable by the word “bride” written somewhere uniform were scored. When blazers or shirts
cowboy should be scored. on the page. Other figures looked like they had suits and ties were drawn, they were only scored
on. This could argualbly be a work uniform. when an emblem was also included.
*Scoring criteria as set out in the manual of the DAP: SPED by Naglieri, McNeish and Bardos (1991, pp 27-29)
72
Once the assistants returned the drawings and scoring sheets to the
researcher, the raw scores were converted to norm (t) scores. During this
process, it became clear that the data had to be rechecked for irregularities.
The researcher rechecked all the drawings, and some adjustments were made
to the scores according to the criteria in Table 4.3. Data was therefore cross-
checked and re-examined by the researcher before being captured. The
adjusted scores from the assistants were used as the raw data for the current
research. In order to determine the credibility of the rating system, the two
independent raters’ original scores, as well as the researcher’s scores were
used in the calculations of inter-rater reliability.
The researcher scored and processed all of the BYI-II answering sheets. There
is no room for interpretation in this measure, as scoring simply involves adding
pre-established values as chosen by the participants on the Likert scale. In the
event that the respondent selected more than one option, the one with the
highest value was scored, as recommended by Beck et al. (2005). The scoring
procedure for the BYI-II was relatively simple and quick. Overall, the
impression was that the learners understood the instructions well and were
able to respond appropriately. The process of scoring and finding the norms
was relatively quick, and the only margins for error in terms of scoring lie with
incorrect adding and incorrectly converting the raw scores to norm scores. The
researcher controlled for these errors by randomly rechecking some BYI-II
answering sheets.
73
4.8 Statistical analysis
Raw scores were calculated and transformed into norm scores. Then the norm
scores for the DAP: SPED and BYI-II were subjected to statistical techniques.
SPSS computer software was utilised for statistical analysis of the data. Basic
descriptive statistics were employed to summarise the data. Inferential
statistical techniques were employed to investigate the relationships between
the scores of the two instruments. The statistical techniques and their results
will be discussed in more detail in the next chapter.
The following ethical considerations were relevant to the present study, and
ethical protocol was observed as follows:
Both the parents and the schools gave permission for the learners to
participate in this study. A letter (see Appendix B) informed the parents of the
nature and purpose of the study. The letter notified parents of the voluntary
nature of participation, that withdrawal was permitted at any time, as well as
explaining the procedures that ensured confidentiality.
4.9.2 Debriefing
74
to ask questions. Learners were informed that if the results pointed to
emotional distress, their parents would be informed. During the question and
answer time, after completion of the assessment, learners had the opportunity
to talk to the administrators about personal problems. This allowed an
opportunity to address emotions that might have surfaced during assessment.
Learners were encouraged to contact their school counsellors or speak with
their parents, depending on the presenting problem. Furthermore, at the time
of approaching the schools in connection with participation, the psychological
services of the researcher’s place of work were offered to participants at the
schools if they chose to make use of them. Parents received relevant contact
numbers on the permission slip, as well as on the feedback letter.
The decision to use information from the QQOBC for all participants,
regardless of the outcome of their DAP: SPED and BYI-II, prevented oversight
of any learner by the present study. A previous study by Matto (2002) showed
that although the DAP: SPED was able to identify emotionally disturbed
children correctly, the measure was unable to identify all cases of emotional
disturbance within that target group. A similar pattern of identification was
present with the BYI-II (Beck et al., 2005). The process of gathering data from
multiple sources therefore allowed for more accurate identification rates and
consequently, more accurate recommendations. This was essential as parents
who permitted their children to participate would most likely be interested in
75
their children’s emotional status. Inaccurate or incomplete information would
give the wrong impression and could lead to decision-making that is not in the
best interests of the children.
4.10 Conclusion
Extensive literature reviews have set the boundaries of this study and
explained relevant concepts for investigating the DAP: SPED. In order to
establish whether the DAP: SPED could be useful in the South African context,
it is necessary to explore the psychometric properties of this measure, as well
as determine how successful it is in distinguishing emotionally disturbed
learners from those who are not. Another measure, the BYI-II served as an
equivalent measure and served as a control for the DAP: SPED. In-depth
qualitative information was obtained in the form of the QQOBC. This chapter
explained the data gathering and -analysis procedures, along with the
challenges encountered along the way. Chapter 5 reports the results of the
statistical calculations of the data obtained during the processes described in
the present chapter.
76
CHAPTER 5
RESULTS
5.1 Introduction
Chapter 4 contextualised the aims of the present study and described the
research processes implemented to achieve these goals. In-depth discussions
were presented on the challenges encountered in administration and scoring,
and how the raters contended with these issues. This chapter will report on the
findings of the research process implemented to realise the aims presented in
4.2.
In order to compare the success of the DAP: SPED against that of the BYI-II,
a correlational study was conducted. Correlations between two sets of data
provide information about the strength of their relationship (Murphy &
Davidshofer, 2001). A good relationship between the DAP: SPED and the
BYI-II would be evident in correlations close to 1.0, which would suggest that
the DAP: SPED measures emotional disturbance well.
77
5.2.1.1 Descriptive Statistics for DAP: SPED
The descriptive statistics in Table 5.1 shows the central tendencies, variation
and range for the DAP: SPED.
Mean 59.63
Median 60.00
Mode 60.00
Minimum 39.00
Maximum 80.00
Table 5.2 provides a summary of the data from the sample group for the BYI-
II.
78
Table 5.2 Descriptive statistics for the BYI-II
Mode♦ 54 55 49 53 44
The measures of central tendency provide information about the data set as a
whole. However, no inferences regarding the aims of the study are possible
without further statistical calculations. The following section will report on
inferential statistical techniques that relate to the aims described in 4.2.
5.2.2 Results of the Correlational Analysis between the DAP: SPED and
the BYI-II
Table 5.3 shows the results of the correlational analysis between the DAP:
SPED and the BYI-II.
79
Table 5.3 Spearman’s Rho Correlation between the DAP: SPED and the
BYI-II Subtests
None of the correlations was significant at either the .05 or .01 level, therefore
rejecting the hypothesis that significant correlations would be found between
scores of the DAP: SPED and the BYI-II. Because the DAP: SPED achieved
such low correlation coefficients with the BYI-II, an investigation into the DAP:
SPED’s ability to distinguish between the presence or absence of emotional
disturbance was launched. This involved further correlational studies, as well
as other inferential statistical techniques, such as the Mann-Whitney U-test.
5.2.3 Results of the Correlational Analysis between the DAP: SPED and
the BYI-II for Emotional Disturbance Present and Absent Groups
Since the scores of the BYI-II proved to be reliable in the context of the
present study (see 4.5.2), they were used to divide the respondents into
emotional disturbance Present and Absent groups. As both measures work
with t-scores with a maximum of 100 and a mean of 50, the following
procedure was followed: Respondents with scores above 55 on the BYI-II (the
cut-off score for emotional disturbance on the DAP: SPED) were categorised
as emotional disturbance present (Present), while scores under 55 would
result in an emotional disturbance absent (Absent) categorisation.
The ability of the DAP: SPED to distinguish between the presence and
absence of emotional disturbance was then examined by calculating
80
correlation coefficients with the subscales of the BYI-II. High correlations
between the DAP: SPED and subscales of the BYI-II for the two conditions
(Present or Absent) would indicate that the DAP: SPED is able to discriminate
between the emotional disturbance Present and Absent groups. Spearman’s
rho correlation coefficient was used to determine these correlations.
Table 5.4 below shows the correlation coefficients for the emotional
disturbance Present and Absent groups. The specific focus was on the four
subscales of the BYI-II that pertain to emotional disturbance, as described in
2.2.2. These include the anxiety scale (BAI), the depression scale (BDI), the
anger scale (BANI) and the disruptive behaviour scale (BDBI).
Table 5.4 Spearman’s Rho Correlations between the DAP: SPED and the
BYI-II for emotional disturbance Present and Absent Groups
Emotional DAP:
Status SPED BAI BDI BANI BDBI
From Table 5.4 it is evident that the DAP: SPED was unable to yield
significant correlations with the Present and Absent emotional disturbance
groups. The only exception was a significant relationship between the
81
disruptive behavioural scale (BDBI) and Absent emotional disturbance (r s =
.241; p< .05).
These results do not support the tentative assumption that the DAP: SPED
would be functional in the South African context (see 1.2). Further exploration
into the DAP: SPED’s ability to distinguish between Present and Absent
emotional disturbance groups was therefore undertaken. This was done by
conducting a Mann-Whitney U test.
In the present study, the Mann-Whitney was used to examine the extent to
which scores on the DAP: SPED differed for the emotional disturbance
Present and Absent groups. If the DAP: SPED is useful to identify the
presence of emotional disturbance in the group of children in the present
study, then the emotional disturbance Present group would obtain
significantly higher scores than the emotional disturbance Absent group.
Table 5.5 shows the results of the Mann-Whitney U test.
82
Table 5.5 Mann-Whitney U Test for Present and Absent Emotional
Disturbance (ED) Groups
ASYMP.
MANN-
DAP: MEAN SUM OF SIG.
WHITNEY
SPED BAI N RANK RANKS (2-
U
TAILED)
Asymp.
Mean Sum of Mann-
Sig.
BDI N Rank Ranks Whitney U
(2-Tailed)
Asymp.
Mean Sum of Mann-
Sig.
BANI N Rank Ranks Whitney U
(2-Tailed)
Asymp.
Mean Sum of Mann-
Sig.
BDBI N Rank Ranks Whitney U
(2-Tailed)
83
The following is evident from Table 5.5:
• The emotional disturbance Present group (Mean Rank 61.13) did not differ
significantly in Anxiety (BAI) levels from the emotional disturbance Absent
group (Mean Rank 67.29), U = 1490.000, ns, r = -0.079; > .05.
• The emotional disturbance Present group (Mean Rank 67.57) did not differ
significantly in Depression (BDI) levels from the emotional disturbance
Absent group (Mean Rank 60.60), U = 1566.500, ns, r = -0.092; > .05.
• The emotional disturbance Present group (Mean Rank 67.90) did not differ
significantly in Anger (BANI) levels from the emotional disturbance Absent
group (Mean Rank 59.51), U = 1643.000, ns, r = -0.115; > .05.
• The emotional disturbance Present group (Mean Rank 72.82) did not differ
significantly in Disruptive Behaviour (BDBI) levels from the emotional
disturbance Absent group (Mean Rank 59.48), U = 1194.000, ns, r = -0.163;
> .05.
The Mann-Whitney statistical procedure showed that the DAP: SPED was
unable to yield significantly different results between Present and Absent
groups of emotional disturbance.
The first aim of the present study pertained to the usefulness of the
DAP: SPED in the South African sample. The Spearman’s rho
correlation coefficient showed that no significant correlations were
present between scores of the DAP: SPED and the BYI-II, thereby
accepting the null hypothesis. It is thus concluded that there is no
correlation between the scores of the DAP: SPED and the BYI-II in a
sample of Grade 4 South African learners.
84
The alternative hypothesis (H 1 ) was rejected, indicating that the DAP: SPED
is unable to successfully distinguish between emotionally disturbed and non-
disturbed children in a sample of Grade 4 South African learners.
The second aim of the current research was to determine the reliability scores
for the measurement of emotional disturbance by the DAP: SPED. From the
preceding section, it was clear that the DAP: SPED was unable to distinguish
between Present and Absent emotional disturbance. The reliability with which
the DAP: SPED measures emotional disturbance was consequently
questioned, and it was necessary to investigate the reliability coefficients for
the DAP: SPED within the South African context.
85
items. To illustrate how the omission of these nine items influenced the
reliability coefficient, both coefficients are reported in Table 5.6.
Cronbach’s alpha
DAP: SPED Sample Size
coefficient
DAP: SPED
125 .448
(55 Items)
DAP: SPED
125 .525
(46 Items)
The results of the reliability coefficients were generally low. This prompted a
re-examination of the DAP: SPED manual. In the manual, reliability
coefficients were reported for different ages (Naglieri, McNeish & Bardos,
1991). These same age categories were then utilised to calculate appropriate
reliability coefficients for the South African sample. Table 5.7 shows the
Cronbach’s alpha coefficients for the different age groups.
86
Table 5.7 shows that internal reliability increases when the nine measurement
items are omitted.
Although the internal reliability coefficients were not reported for the three
respective drawings in the manual of the DAP: SPED, a decision was made
to investigate them. Table 5.8 shows these reliability coefficients for the
measure with all the items included, as well as for the measure with the nine-
item omission.
Table 5.8 Internal Reliability for the Three Drawings of the DAP: SPED
Table 5.8 shows extremely low internal reliability for each of the three
drawings. These rates are lower than the internal reliability for the measure as
a whole (Table 5.6). The low internal reliability of the three separate drawings
indicates that internally the items of the drawings lack coherence, but when
the three drawings are together the internal reliability of the measure is
increased (Table 5.6).
87
The relationship between the three drawings was examined next, even
though Naglieri, McNeish and Bardos (1991) did not report on this
relationship. This decision was made for two reasons. Firstly, it was curious
that the internal reliability for the three individual drawings was low, but when
put together they increased the reliability of the measure. Secondly, the
investigation of the BYI-II included a correlational study between the
subscales (see 4.5.2), and therefore, it seemed relevant to do the same with
the DAP: SPED. The non-parametric Spearman’s rho correlation coefficient
was used for the DAP: SPED correlational study. As the DAP: SPED showed
higher internal reliability when the first nine items were omitted, the 46 item
total was used for this calculation.
Table 5.9 shows that although the three drawings are significantly correlated
with one another, these correlations are low. Interestingly, each drawing
correlates highly with the total score of the DAP: SPED.
The second aim of the current research was to investigate the reliability
of scores of the DAP: SPED concerning emotional disturbance. The
reliability scores transpired to be moderately low for the measure as a
whole, and even lower for individual parts of the measure.
88
5.4 The Credibility of Inter-rater Scores
The third aim of the present study was to investigate the credibility of the
DAP: SPED’s scoring system. In order to study the scoring system, the inter-
rater reliability coefficient was calculated for the scores allocated by the three
raters. This form of reliability is used to determine the extent to which different
raters achieve similar results on the same measure (Smit, 1991). Two or
more raters score one set of answers (data) to determine the degree of
correlation between them. A standardised measure should have well-defined
scoring criteria that minimises the possibility of rater bias. Under these
conditions, different raters should record similar results (Smit, 1991). Similar
results are reflected in high values for the correlation coefficients.
The original scores by the two independent raters were utilised for the
calculation, as well as the scores of the moderator, namely the researcher
(see 4.7). The inter-rater reliability scores were determined by using the non-
parametric Spearman’s rho correlation coefficient. The inter-rater reliability
measured in the current research on the DAP: SPED in South Africa, is
shown in Table 5.10.
Raters A B C
A 1 .816** .687**
B .816** 1 .811**
C .687** .811** 1
** Correlations significant at the .01 level
Table 5.10 shows that correlations were significantly high between the three
raters.
89
The third aim of the current research pertained to the scoring system of
the DAP: SPED. The level of agreement between raters was examined by
finding correlations between the raters. Inter-rater reliability between the
three raters varied within acceptable ranges. Significant correlations
between raters’ scores indicate that the alternative hypothesis is
accepted. It is thus concluded that there is a significant correlation
between the scores given by different raters the DAP: SPED when the
DAP: SPED is scored.
The fourth aim of the current research was to explore the occurrence of
emotional disturbance in the South African target group. Although some
insight regarding this matter could be obtained by studying the descriptive
statistics (see 5.2.1), more detailed information is presented in the following
frequency distributions for the DAP: SPED en BYI-II. Frequency distributions
make it easy to understand large amounts of data by showing how many
times each score occurred (Field & Hole, 2003). These scores are shown
within certain classes or categories. Next, the results of the Qualitative
Questionnaire of Observed Behaviour in the Classroom (QQOBC) will be
reported.
90
Table 5.11 Frequency Table of Emotional Disturbance scores measured
by the DAP: SPED
Cumulative Cumulative
T-Scores Frequency Percentage
Frequency Percentage
0-9 0 0 0 0
10-19 0 0 0 0
20-29 0 0 0 0
From Table 5.11 above, it is evident that 67 learners achieved scores above
60 on the DAP: SPED. Although not directly discernable from Table 5.11, it
should be noted that 18 learners achieved scores above 55. This brings the
total amount of learners identified with emotional disturbance by the DAP:
SPED to 85 or 68%.
A frequency distribution was compiled for the BYI-II subscales. The four
subscales of emotional disturbance are reported individually, and total scores
are also shown.
91
Table 5.12 Frequency distribution for emotional disturbance measured
by the four subscales of the BYI-II
Table 5.12 shows that the majority of learners obtained t scores between 40
and 59. Scores above 60 varied between the subscales, and anxiety levels
were particularly high.
92
• Learners with scores above 55 on the BAI and/ or BDI scales were placed
in the Internalising category.
• Learners with scores above 55 on the BANI and/ or BDBI scales were
categorised as Externalising.
• Learners with high scores on BAI and/ or BDI, as well as high scores on
BANI and/ or BDBI were categorised as Mixed.
Table 5.13 shows how the clusters of emotional disturbance manifested in the
respondents in the current research.
When the above criteria are used, the nature of emotional disturbance in the
South African sample manifests mainly in the form of mixed behavioural
symptoms. Internalising behaviour is also common in respondents.
Externalising or acting out behaviour mostly co-occurred with depression or
anxiety.
93
5.5.3 Results of the Questionnaire for Observed Behaviour in the
Classroom (QQOBC)
Percentage Of Learners
Teachers’ Disruptive
Anxiety Depression Anger
Ratings Behaviour
Never 73 71 87 59
Sometimes 20 19 14 24
Often 6 10 8 13
Always 1 0 1 4
Total 100 100 100 100
The answering style between the four teachers varied considerably, with
some supplying minimal information about the learners in question, while
others gave repetitive and stereotypical answers. The following observations
were made regarding the answering styles of the teachers:
94
• Teachers tended to recognise externalising behaviour, and often
described such learners as “disruptive” or “naughty”. These children were
often rated as being very “confident”.
• Overall, however, the teachers’ ratings had a low agreement with the
DAP: SPED (53% of the ratings did not agree with scores on the DAP:
SPED).
95
5.6 Conclusion
This chapter has reported on the results of the data as measured by the three
measures used in the study, namely the BYI-II, the DAP: SPED and the
QQOBC. A discussion of these results, as well as recommendations for
further research follows in Chapter 6.
96
CHAPTER 6
DISCUSSION OF FINDINGS, LIMITATIONS, RECOMMENDATIONS AND
CONCLUSION OF THE STUDY
6.1 Introduction
The current research has aimed to establish how useful the Draw-A-Person:
Screening Procedure for Emotional Disturbance (DAP: SPED) would be for
identifying emotional distress in a group of South African children. Literature
indicated that the non-verbal nature of the DAP: SPED may make it ideal for
the multilingual population, while the standardisation of administration and
scoring could make it quick and simple to use. This measure was thought to
be particularly useful within the South African context as it was postulated that
high levels of emotional disturbance would be present amongst children.
Literature seemed to indicate that the DAP: SPED would be a good match to
fulfil this need. Despite the good potential for the DAP: SPED in the South
African context, the results of the present study were not that promising. A
discussion of these results will follow.
At the onset of the present study, it was theorised that the DAP: SPED would
be useful in the South African context if it could screen a group of learners for
the presence of emotional disturbance. The current study was embarked
upon to test this theory. It involved the DAP: SPED being administered to a
group of learners whose emotional status was unknown, to see if and how
well it could identify emotional disturbance.
97
measure by means of a correlational analysis. The independent measure that
was selected was the Beck Youth Inventory (BYI-II). This instrument proved
to be a reliable measure of emotional disturbance.
The following results were evident in studying the usefulness of the DAP:
SPED within the context of the present study:
• Two groups were created post facto, namely the emotional disturbance
Present and -Absent groups. The scores of the DAP: SPED achieved
low correlation coefficients with the emotional disturbance Present
group, while better coefficients for the emotional disturbance Absent
group were present, but also low (Table 5.4). If high scores on the
98
DAP: SPED correlate better with low emotional disturbance scores on
the BYI-II, there is an inverse relationship between the measures.
These findings confirm that the DAP: SPED did not measure emotional
disturbance well, and that a confounding variable might be a source of
error. A possible confounding variable is cognitive ability. From the
literature study presented in Chapter 3, it was clear that cognitive
developmental levels are reflected in children’s drawings, and thus
cognitive ability could influence the quality of a drawing. There is also
some evidence that artistic ability might be a confounding variable.
These factors will be discussed shortly.
99
The results of this informal analysis were turned into percentages and
it was found that the DAP: SPED scores contained 26.4% false
negatives and 16.8% false positives.
100
The results of the current research contradict previous findings by Naglieri,
McNeish and Bardos (1991) and Matto (2002), where researchers have
reported that the DAP: SPED is useful to identify emotional disturbance in
children. In 4.5.1, the results of these studies were summarised and showed
that good psychometric qualities for the DAP: SPED were reported. These
researchers also concluded that moderate identification rates were
acceptable for a screening measure (Matto, 2002; McNeish & Naglieri, 1993;
Naglieri, McNeish & Bardos, 1991). The current research, however, has found
that the DAP: SPED has low identification rates of emotional disturbance in a
group of Grade 4 learners. A closer look at other studies might shed light on
the contrary findings of the present study.
101
emotionally disturbed. The assumption therefore is that no control measure
was present, either in the form of a control group or in a control instrument at
that stage of the development of the DAP: SPED. This lack of control at this
early stage of development is likely to have incorporated a systematic error
into the measure, affecting its usefulness (Field, 2005; Murphy & Davidshofer,
2001).
The method used by Naglieri, McNeish and Bardos (1991) for selecting
representative items was an actuarial process. As discussed in 3.3, the
actuarial approach involves a process of determining the similarity of
individual responses in comparison with the group (Murphy & Davidshofer,
2001). Unusual responses are identified during the actuarial process. The
unusual responses would apparently be indicative of deviance in emotional
status, i.e. emotional disturbance.
102
place to differentiate between emotionally disturbed and emotionally well
children.
The selection of emotional indicators for the DAP: SPED seems to be flawed.
Emotional indicators were selected based on atypical presence, and the
unusualness of their presence was used to validate that they were indicators
of emotional disturbance. This tautological reasoning is a source of error
(Neuman, 2000) and questions the usefulness of these indicators, and in turn
the usefulness of the measure.
Despite the good results, the four validation studies reported in the manual
should not be accepted without question. Firstly, it is important to note in this
103
regard that no information regarding the scoring process was provided, apart
from the training of the raters (Naglieri, McNeish & Bardos, 1991). It is
therefore possible that the scorers had knowledge of the respondent’s
emotional status (in terms of group membership). A critical discussion
regarding scoring follows in section 6.4.
104
important to note that 41% emotionally disturbed learners remained
unidentified, while 45% non-disturbed learners were identified incorrectly.
This is also contradictory to other findings that stated that although the DAP:
SPED did not identify all learners, all the identified learners were emotionally
disturbed (Matto, 2002).
As noted in 6.2 above, the decisions based on the measure determine how
accurate the measure should be. In this regard, Saklofske and Wrightson
(2000) argued that one would expect a screening measure to over-identify
cases of emotional disturbance, rather than under-identify. Furthermore, it
should also correlate highly with other instruments measuring the same
construct (Saklofske & Wrightson, 2000). The lack of independent measures
of emotional disturbance is therefore a major criticism of standardisation and
validation studies up to this point.
105
significant correlations seem to indicate that the DAP: SPED had moderate
equivalence in measuring internalising behaviour (SAC), as well as hostile
behaviour (CAAP). However, Matto (2002) found that the DAP: SPED was
not significantly able to predict externalising behaviours and could not explain
variation in externalising behaviour beyond the Hostility scale of the CAAP.
Matto (2002) compared the results of the DAP: SPED with those of other
measures of emotional disturbance, and as a result provided information
about how well the DAP: SPED was able to measure the emotional
disturbance construct. Although Matto (2002) found a significant relationship
between internalising behaviour and the DAP: SPED, the correlation was
moderate and limited to one scale. The implication is that the DAP: SPED
does not measure the whole range of experiences of the construct of
emotional disturbance described in Chapter 2. Another problem is that the
part of emotional disturbance that is measured is measured only moderately
well.
106
Another study confirming that the DAP: SPED is not as useful as originally
thought, is one very similar to the present study. In this study by Williams and
Alfred (1997), the DAP: SPED did not correlate well with either the
independent self-report measure used, namely The Revised Child Manifest
Anxiety Scale and The Reynolds Child Depression Scale. It also did not
correlate well with informant observed ratings based on the Behavior
Assessment System for Children (Williams & Alfred, 1997). Williams and
Alfred (1997) concluded that the DAP: SPED Total score was not useful for
identifying internalising disorders.
The results of the abovementioned studies show that the DAP: SPED at best
measures only part of the construct emotional disturbance. However, in the
light of the flawed standardisation process there is a reasonable chance that
the DAP: SPED in fact measures a different, although related construct. Two
different sets of research support this argument.
Wessel, Carter, Jones, Hatch, Kasten, Noll, Schofield and Surges (1993)
studied the use of the DAP: SPED in learning disabled-, emotionally
disturbed- and cognitively disabled children. Results showed that the DAP:
SPED was inaccurate in identifying large percentages of learners, and both
false positive (30%) and false negative (47%) classification errors were
present. The DAP: SPED in particular was unable to distinguish between the
cognitively disabled group and the emotionally disturbed group. These
findings suggest that the DAP: SPED is inaccurate as a screening measure
and that cognitive ability is a confounding variable in the measurement of
emotional disturbance.
Naglieri, McNeish & Bardos (1991) addressed this issue in the manual, and a
brief investigation reported that cognitive ability had no effect on the scores of
the DAP: SPED. Yet, past independent research has shown that emotional
indicators were confounded with indicators of conceptual maturity, a factor
related to cognitive ability (Cox & Catte, 2000). From Table 3.1 in Chapter 3, it
was clear that conceptual maturity manifested in children’s drawings, and that
the drawings reflect cognitive ability. The visual representation of cognitive
107
development in children’s drawings motivated the development of the DAP as
a measure of cognitive maturity.
Cox and Catte (2000) further argued that drawing ability influences the quality
of the drawing, and consequently the perception of disturbance is
contaminated. According to them, drawing ability better accounts for
differences between drawings than does emotional disturbance (Cox & Catte,
2000). One could therefore argue that confounding variables such as drawing
ability negatively influence the efficacy of the DAP in measuring emotional
disturbance. These confounding variables suggest that the DAP in general
might not be useful as an accurate measure of emotional disturbance.
From the above independent studies, the usefulness of the DAP: SPED is
disputable. On the one hand, Naglieri and associates have focused on limited
successes of the DAP: SPED while downplaying large margins of error
(Matto, 2002; McNeish & Naglieri, 1993; Naglieri, McNeish & Bardos, 1991;
Naglieri & Pfeiffer, 1992). On the other hand, independent researchers have
found that the DAP: SPED is unable to measure the general construct of
emotional disturbance (Saklofske & Wrightson, 2000; Wessel, et al., 1993), or
that a confounding variable interferes with the accuracy of the scores (Cox &
Catte, 2000; Wessel, et al., 1993). The latter results correspond with the
findings of the present research.
Now that it has been established that results from the DAP: SPED have been
equivocal in different applied contexts, it is also important to note that the
108
usefulness of a measure is influenced by various other factors as well, such
as the properties of the measure and the characteristics of the target group
(Murphy & Davidshofer, 2001). Although doubt has been cast over the
properties of the measure, certain differences in respondents might also
account for the lower efficacy of the DAP: SPED locally. Cultural differences
between the South African and American populations should be kept in mind
when evaluating the usefulness of the DAP: SPED within the context of the
present study.
These results seem to confirm that the DAP: SPED is not influenced by
cultural differences, although criticisms against the methods used in the
standardisation sample exist and have already been extensively discussed. If
one, however, agrees with the results of the research by Naglieri and his
colleagues, and accepts that the measure is culture free within the American
context, the question still needs to be asked if emotional indicators are as
universal as the developmental milestones in children’s drawings are.
109
African cultures have different values that could influence their responses. It
is therefore possible that the emotional indicators are not representative of
emotional disturbance for African people.
If these indicators are not valid for an African population, then they might also
be invalid for other non-European cultures. Although not much information is
available about the usefulness of the DAP: SPED outside of America,
research findings by Kwan Sau-kuen (1998) indicated that within the Chinese
population, the DAP: SPED was not useful to identify emotional disturbance.
Such findings cast further doubt on the usefulness of the DAP: SPED outside
of the American context. As was previously mentioned, further research is
needed on the subject in order to make conclusive findings.
The study into the usefulness of the DAP SPED showed that it was unable to
identify emotionally disturbed children accurately, and that it could not
successfully distinguish between high and low emotional disturbance in a
group of South African children. A closer investigation into the internal
reliability of scores of the DAP: SPED confirmed these findings.
Internal reliability examines how coherent the different parts of a measure are
(Murphy & Davidshofer, 2001). According to McIntire and Miller (2007), the
more homogenous the test is in terms of the constructs that are included, the
higher the reliability coefficients would be. Another way of understanding
internal reliability is that any individual item in the scale should generate
results that are consistent with the overall measure (Field, 2005). Internal
reliability is examined using the statistical technique of Cronbach’s alpha
coefficient, which involves finding the average of several correlation
coefficients when the measure is repeatedly and varyingly divided into two
halves (Bryman & Duncan, 2001; Field, 2005).
110
The moderately low internal reliability coefficient of the DAP: SPED in the
present study, compared with reliability coefficients reported by Naglieri,
McNeish and Bardos (1991) and Matto (2002), prompted an investigation into
the internal reliability of each of the three drawings. The results showed that
the individual drawings had lower internal reliability than the measure as a
whole, and that negative values were obtained in some instances (see Table
5.8). Negative reliability coefficients generally indicate that as the value of one
variable increases the value of the other decreases (Field, 2005; Murphy &
Davidshofer, 2001). Within a measure one would therefore expect that if
items were inversely correlated, that the scale would make use of negative
scoring. This is however, not the case with the DAP: SPED.
The relationship between the three drawings was also examined, and low
correlations were evident (r s = . 426, p<. 01 to r s = .488, p< .01). What was
peculiar however, is that while the three drawings attained low correlations
with one another, each individual drawing’s score attained significantly high
correlations with the total score of the DAP: SPED (Table 5.9). This implies
that each of the three drawings contributes significantly to the total score but
at the same time show low relationships with one another. Due to the
peculiarity of the matter, it is recommended that this state of events should be
further researched.
Matto (2002) also reported the results of the relationships between drawings,
although her focus was on the relationship between the DAP: SPED as a
whole and the SAC and CAAP respectively (see 6.2). Correlation coefficients
111
between the drawings were significant for hypothesis testing and ranged
between r = .180 and r = .425 (p< .01). The highest correlation between a
single drawing and the measure as a whole, was between the DAP: SPED
Total score and DAP: SPED Woman score (r = .497, p< .01). These
correlation coefficients were generally lower than for the South African
sample.
Although Matto reported good results for the overall reliability of the DAP:
SPED, correlations between the drawings were much lower. Matto (2002) did
not discuss these findings. Like with the current research, these findings
suggest that the results from the three drawings are so dissimilar that there is
a weak relationship between them.
The overall results show that the scores for the DAP: SPED are
insufficiently reliable for the type of decision-making it is intended for in
the South African context. Because of this, and the findings related in
6.2, the usefulness of the DAP: SPED for identifying emotional
disturbance in South African children is doubtful.
As was the case with establishing the usefulness of the DAP: SPED, two
reasons may be offered for the moderately low reliability coefficient:
112
6.4 Credibility of Inter-rater Scores
The credibility of the scoring system of the DAP: SPED was investigated by
calculating the correlation coefficient of scores assigned by the raters
involved in the study. The degree to which raters’ scores agree reflects the
efficacy of the standardisation process, and therefore on the reliability of the
measure. A standardised measure should have well-defined scoring criteria
that minimises the possibility of rater bias (Smit, 1991). Under these
conditions, different raters should record similar results. Inter-rater reliability is
used to determine the extent to which different raters achieve similar results
on the same measure (Smit, 1991).
The results from of the inter-rater reliability scores were presented in Table
5.10. The correlation coefficients between scores of the three raters were
calculated using the Spearman’s rho correlation coefficient. The scores of the
two independent raters correlated moderately well (r s = .687, p<.01), and all
correlations were significant at p< .01. The significant correlations show that
the scoring system is sufficiently standardised, and therefore rejects the null
hypothesis which stated: There is no correlation between the scores given by
different raters when the DAP: SPED is scored.
In this regard, Smit (1991) stated that two important factors influence the
scores on a measure, namely the effectiveness of the training of scorers, and
the accuracy in recording the scores (Smit, 1991). To ensure that the
standardisation conditions are upheld, it is imperative that the administrative
guidelines are followed as closely as possible. This would ensure that similar
results are attained with each measurement. When raters have undergone
sufficient training in the use of a particular measure similar scores could be
expected, and therefore yield a high correlation coefficient for inter-rater
113
reliability. However, even when administration and scoring is standardised,
error in measurement is possible when administrative mistakes, such as
faulty computations, take place. Therefore, raters need to be well trained and
conscientious about the application of the measure in use.
Concerning these two factors, the following measures were put in place:
Raters were trained in accordance with the manual of the DAP: SPED. As no
training is available in South Africa on the DAP: SPED (see 4.5.1), the
authors of the measure recommend that the manual be followed carefully
(Naglieri, McNeish & Bardos, 1991). Due to the lack of formal training, it is
therefore possible that training was inadequate to ensure a standardised
scoring procedure. Calculations were minimal for the DAP: SPED and
through the process of moderation all scores were checked for computational
mistakes.
A third factor in scoring accuracy has become evident during the scoring
process. Subjectivity in scoring, or rater bias, refers to the decision of whether
to score an item or not (Smit, 1991). The choice of whether to score a
particular item could depend on many different factors, such as motivation
and perception (Smit, 1991). If, for example, the rater has the attitude that all
possible incidents of emotional disturbance should be identified, the rater’s
scoring style would be over-inclusive. On the other hand, a rater who is not
very motivated to score a large amount of response sets might have a
careless approach and miss some of the items.
Rater bias might also be present in the decisions whether to score items
when the criteria are unclear. Table 4.5 in Chapter 4 summarised the items
that were unclear in the manual, and clearly showed that certain decisions
were made to standardise the scoring procedure for this set of data. Even this
secondary standardisation process was subject to interpretation and bias as
the decision whether to score might rest on subjective feelings or past
experiences (Lilienfeld, Wood & Garb, 2000). Subjective reasoning on how to
score items could result in inaccurate measurement and therefore affect both
inter-rater reliability and the validity of the measurement.
114
Bruening, Wagner and Johnson (1997) conducted research on the possibility
of rater bias with the DAP: SPED. Their study focused specifically on the
impact of rater knowledge in sexually abused girls. Twenty abused and 20
non-abused respondents were randomly placed into one of two groups,
namely Pretend and Actual. Raters were fed details about each respondent
(either true or false information) before scoring the drawings. No significant
effects were found to indicate that scoring bias would be a confounding
variable. Results therefore showed that the scoring system of the DAP: SPED
was sufficiently objective to dismiss the idea that rater bias might influence
the scores (Bruening, Wagner & Johnson, 1997).
According to the researchers, one of the limitations of the study was the small
sample size (Bruening, Wagner & Johnson, 1997). Another limitation was
that the raters were suspicious of the purpose of the study. This could have
resulted in the raters being overly careful in the experimental circumstances
compared with everyday test administration (Bruening, Wagner & Johnson,
1997). Another consideration in this research is that all the participants were
clinic patients. It is thus assumed that some form of emotional disturbance
was likely present in all respondents. This would in turn result in higher DAP:
SPED scores on emotional disturbance for the group as a whole. As a result,
a small margin of difference would occur on emotional disturbance scores
between the sexually abused and non-abused groups, which would lead to
unjustified conclusions regarding the objectivity of the scoring system.
Despite the moderately high correlations for inter-rater reliability in the present
research, it was evident that there was a margin for error in the scoring
process. This margin of error has also not been excluded from other studies,
such as the one conducted by Bruening, Wagner and Johnson (1997).
Furthermore, the lack of information regarding the scoring process on the
validation studies by Naglieri, McNeish and Bardos (1991) prevents critical
examination of the said procedures. There is, for example, no mention of
whether respondents’ drawings were scored within the pre-existing groups or
115
not. The extent to which knowledge of group membership may have biased
the decision to score items could therefore not be established.
Riethmiller and Handler (1997) made another salient point. They argued that
attempts to make the scoring of drawings completely mechanical would
inevitably be unsuccessful, as drawings should be studied within context. For
example, when a picture contains a lot of detail, this detail should be viewed
in relation to other indicators in the drawing. A sparsely drawn human figure
that contains a lot of detail concentrated in one area would be more indicative
of pathology such as hypervigilance or obsessive compulsive behaviour than
an artistically well-drawn human that generally contains a lot of detail.
Riethmiller and Handler (1997) subsequently suggested that the most
effective way of using the DAP as a measurement instrument involves a
phenomenological approach where empathising with the drawing would give
the interpreter a sense of what is being communicated. This implies that
research should also involve qualitative interpretations of the drawings. In this
regard, the authors stated that observation of the drawing process as well as
interaction with the drawer concerning the picture will be helpful (Riethmiller &
Handler, 1997).
116
margin for error, where the rater has to rely on previous experience and
knowledge. Although the margin for rater bias was not big enough to
affect the significance of inter-rater reliability correlation coefficients in
the present study, subjective decisions of whether to score an item
could have in turn affected the usefulness of the DAP: SPED. This
variability between scorers could have resulted in unreliable scores of
emotional disturbance as measured by the DAP: SPED.
The fourth aim of the present study was to observe the occurrence of
emotional disturbance in the South African context. Two standardised
instruments, namely the BYI-II and the DAP: SPED, measured the presence
of emotional disturbance in a group of Grade 4 learners (see 5.7). Both
instruments yielded similar results in terms of the amount of learners that
were identified as emotionally disturbed (68% on the DAP: SPED and 76% on
the BYI-II). An in-depth investigation to identify the learners revealed that the
instruments did not identify the same learners. This could explain the
extremely low correlations between the DAP: SPED and BYI-II scores.
Anxiety levels measured by the BAI seemed abnormally high within the target
group. Different levels of anxiety were reported within this group. Severe
anxiety was present in 33 cases (26.4%). According to Sadock and Sadock
(2003), generalised anxiety disorder is estimated to occur in only three
percent of school-aged children. Other anxiety related issues, such as
separation anxiety, nail biting, and bed-wetting, are likely to increase the
prevalence rate. However, the extremely high rate of the severest intensity of
anxiety in the local study indicates that emotional disturbance is elevated in
the children who participated in the study.
The anxiety levels might have been artificially high due to the testing
conditions. The learners might have been apprehensive about being ‘tested’,
and this could have evoked similar feelings experienced during exam times.
117
Furthermore, learners who were afraid of what the results might reveal about
them, and how this information would be used, would also experience
increased levels of anxiety. As was evident in 4.9.2, children had many
questions regarding the test results during the debriefing stage of the data
collection process.
The occurrence of the most severe levels of anger (BANI) and disordered
behaviour (BDBI) in the target group was low (7.2% and 5.6% respectively for
t scores above 70). The total number of learners with disruptive behavioural
problems, as identified by scores above 60 on the BYI-II, came to 16.8% (21
learners, see Table 5.11). When one looks at the prevalence rates of
emotional disturbance within the standardisation sample of the BYI-II (8.41%
for t scores above 70), it is apparent that South African children have similar
rates of disruptive behaviour. Although Sadock and Sadock (2003) reported
that prevalence rates for the disruptive behaviour disorders are between 16 to
22% of school-aged children, it could be assumed that this includes moderate
as well as severe forms of disruptive behaviour. These figures are similar to
the prevalence rates within the target group when scores are inclusive of
moderate levels of disturbance.
118
good). According to teachers’ reports on the QQOBC, 48% of the learners
were considered disruptive. It is therefore likely that due to social desirability
learners were not forthcoming on their true emotional status. The differences
between scores on the BYI-II and ratings by teachers on the QQOBC indicate
that social desirability plays a significant role in verbal self-report inventories,
such as the BYI-II. However, the extent to which social desirability influenced
learners’ scores on the BANI and BDBI is unknown. Initial findings show that
when emotional disturbance is assessed in children, it is important to include
information from various sources to control for errors in measurement.
Further research in this matter could provide information regarding the level of
motivational distortion and the extent to which these levels are similar to
those of American children.
The results of the QQOBC showed that although many children had
symptoms of emotional disturbance from time to time, these symptoms
occurred intermittently. Ratings from teachers indicated that only a few
children were highly emotionally disturbed. According to the teachers’ ratings
on the QQOBC, consistent levels of anxiety were only present in seven
percent of the learners, while depression only occurred in ten percent of the
learners (see Table 5.14). Severe behavioural problems were identified in
16.8% of learners. These findings suggest that externalising behaviour is
more readily identified than internalising behaviour. This corroborates findings
from the literature, which suggest that caregivers observe emotional
disturbance in the form of externalising behaviour more easily. Caregivers are
more likely to respond to externalising behaviour, as it is more uncomfortable
to bear when children are disruptive (Wicks-Nelson & Israel, 2003).
The literature review in 2.3 indicated that South African children would
have comparatively higher rates of emotional disturbance due to factors
such as poverty, violence and crime, as well as the high prevalence of
AIDS. The results of the present research confirmed these initial
suppositions, indicating that identifying emotionally disturbed children
is important. Furthermore, it indicates that intervention plans are both
necessary and urgent.
119
6.6 Limitations of the Study and Recommendations for Future Research
The four aims of the present study related to how useful the DAP: SPED
would be in the South African context. The research focused on comparing
the results of the DAP: SPED with an independent self-report measure, the
BYI-II. Additional information was gained from a third measure, the QQOBC,
where teachers were asked to rate the behaviour of each child participating in
the study. The DAP: SPED was applied within a context where it had to
screen for emotionally disturbed learners from a group of children with
unspecified emotional status. Previous studies focused mainly on the efficacy
of the DAP: SPED between pre-existing groups or on comparing the DAP:
SPED with adults’ ratings of respondents’ behaviour. Further investigation
into the efficacy of the DAP: SPED in comparison with other measures is
recommended.
One of the limitations of the current study was the small sample size obtained
through convenience sampling. This sample was not representative of the
population at large and results should therefore be generalised with caution.
Further research on the DAP: SPED within the South African context should
thus focus on a larger and a more representative sample.
Another limitation of the study is that it did not initially take into account the
impact of cultural diversity on the scores obtained on the DAP: SPED. Since it
was noted that cultural diversity might have an impact on the usefulness of
the DAP: SPED in a South African context, it is recommended that future
research should determine whether cultural differences impact on how
emotional disturbance is defined. Such research should also focus on what,
within the South African culture, will constitute indicators of emotional
disturbance in children’s drawings.
A third limitation of the study concerns the efficacy of the BYI-II in a South
African context. Within the boundaries of the current study, the BYI-II was
found to be a reliable instrument and offered valuable information regarding
the nature of emotional disturbance. Not only was it successful at pointing out
120
emotionally disturbed children, but it also identified internalising behaviour
that is easily overlooked by parents and teachers. The problem with the BYI-II
is that it is limited to an English speaking and literate population, and
therefore would not be applicable within the wider multi-cultural South African
context. The search for an instrument that measures emotional disturbance
and can be used on individuals from different backgrounds should thus
continue.
Another problem identified when the BYI-II was used, was that it did not
control for socially desirable responses. Socially desirable responses could
have had a negative impact on correlations between the DAP: SPED and the
BYI-II. The results of the present study showed low correlations between the
DAP: SPED and externalising behaviours, as measured by the BDBI and
BANI subscales of the BYI-II. Due to the limited scope of the present
research, no statistical analysis was undertaken to examine the extent to
which social desirability influenced correlations between the DAP SPED and
the BYI-II. Such an examination could include finding correlations between
the results of the DAP: SPED and the QQOBC. Further research should
therefore include a data analysis between the scores of the DAP: SPED and
independent observers’ ratings. Standardised measures for observers such
as those used by Matto (2002) and Saklofske and Wrightson (2000) might be
more helpful in further research than the QQOBC.
121
hand, structured interviews with either teachers or the parents would perhaps
be more helpful than questionnaires.
6.7 Conclusion
Initially it was theorised that the DAP: SPED would have some potential in the
South African context due to the many potential benefits of the measure. The
positive results reported by Naglieri and his colleagues for the DAP: SPED
supported this notion. However, results of the present study showed that
these findings could not be corroborated in the context of the present study.
The DAP: SPED was unable to distinguish between the presence or absence
of emotional disturbance within the group of Grade 4 learners who
participated in the study. The DAP: SPED also presented with low internal
reliability. Statistical analysis showed that the standardised scoring system
was good. However, the application of the scoring criteria presented problems
that might have influenced the reliability of the measure as a whole.
Had the results of the DAP: SPED matched the moderate success reported
by Naglieri and colleagues, this success rate would still be insufficient for use
in South Africa. Too many false positive and negative results would lead to a
high degree of faulty decision-making. This in turn could result in wasted
resources and emotionally disturbed children that remain untreated. The
prevalence of emotional disturbance within the group of South African
122
children was high, and indicates that effective intervention is urgently needed.
The quest for a measure that could successfully identify emotional
disturbance within the diverse South African population should thus continue.
123
REFERENCE LIST
Barbarin, O.A. & Richter, L. (2001). Economic status, community danger and
psychological problems among South African children. Childhood, 8 (1), 115-
133. Retrieved January 28, 2008, from
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=19934552
Beck, J.S., Beck, A.T. & Jolly, J.B. (2001). Beck Youth Inventories of
Emotional and Social Impairment. New York: The Psychological Corporation.
Bruening, C.C., Wagner, W.G. & Johnson, J.T., (1997). Impact of rater
knowledge on sexually abused and non-abused girls’ scores on the Draw A
Person: Screening Procedure for Emotional Disturbance (DAP: SPED).
Journal of Personality Assessment, 68(3), 665-677.
124
Bryman, A. and Duncan, C. (2001). Qualitative data analysis with SPSS
release 10 for windows: a guide for social scientists. London: Routledge.
Burger, J. (2007). Time to take action: The 200/ 2006 crime statistics. SA
Crime Quarterly. (21), 1-7.
Conradie, H. (2003). Are we failing to deliver the best interest of the child?
Retrieved January 28, 2008, from
www.crisa.org.za/fighting_crimes.pdf.hd
Davidson, G.C. & Neale, J.M. (2001). Abnormal Psychology (8th ed.). New
York: John Wiley.
Department of Health (1997, April). White Paper for the Transformation of the
Health System in South Africa [Announcement]. Retrieved March 8, 2004,
from http://www.gov.za/whitepaper/1997/health.htm
Desmond, C., Michael, K. & Gow, J. (2000). The hidden battle: HIV/Aids in
the household and community. South African Journal of International Affairs.
7(2), 39-58.
Di Leo, J.H. (1970). Young children and their drawings. New York: Brunner/
Mazel.
125
Di Leo, J.H. (1973). Children's drawings as diagnostic aids. New York:
Bruner/Mazel.
Eno, L., Elliot, C. & Woehlke, P. (1981). Koppitz emotional indicators in the
Human Figure Drawings of children with problems. The Journal of Special
Education. 15(4), 459-470.
Field, A. (2005). Discovering statistics using SPSS (2nd ed). London: Sage.
Field, A. & Hole, G. (2003). How to design and report experiments. London:
Sage.
Goleman, D. (1996). Emotional Intelligence: Why it can matter more that IQ.
London: Bloomsbury.
126
Harris, D.B. (1963). Children’s drawings as Measures of Intellectual Maturity:
A revision and extension of the Goodenough Draw A Man Test. New York:
Harcourt, Brace & World.
Jacobs, L.J. & van der Merwe, S.E. (1992). Die Interpretasie van
kindertekeninge. Pretoria: Academia.
Kleintjes, S., Flisher, A.J., Fick, M., Railoun, A., Lund, C., Malteno, C. &
Robertson, B.A. (2006). The prevalence of mental disorders among children,
adolescents and adults in the Western Cape, South Africa. South African
Psychiatry Review (9), 157-160.
127
Lillenfield, S.O., Wood, J.M. &Garb, H.N. (2000). The scientific status of
projective techniques. Psychological Science in the Public Interest, 1 (2), 27-
66.
Lowenfeld, V. and Brittain, W.L. (1969). Creative and Mental Growth (4th ed.).
New York: Macmillan.
Masango, M.J. (2004). Aggression, anger and violence in South Africa. HTS:
Theological Studies, 60(3), 993-1006.
Matto, H.C. & Naglieri, J.A. (2005). Race and ethnic differences and Human
Figure Drawings: Clinical utility of the DAP: SPED. Journal of Child and
Adolescent Psychology, 34 (4), 706-711.
128
Messick, S. (1988). The once and future issues of validity: Assessing the
meaning and consequences of measurement. In Wainer, H. & Braun, H.I.
(1988). Test validity. Hillsdale: Lawrence Erlbaum.
Naglieri, J.A., McNeish, T.J & Bardos, A.N. (1991). DAP: SPED Draw-A-
Person: Screening Procedure for Emotional Disturbance Examiner’s Manual.
Austin, Texas: Pro-ed.
129
Pert, C.B. (1997). Molecules of emotion: Why you feel the way you feel.
London: Simon & Schuster.
Phillips, D. (2002, November, 4). The lifelong impact of poverty. BBC News
World Edition. Retrieved January 28, 2008, from
http://news. bbc.co.uk/2/hi/uk_news/scotland/2394193.stm
Richter, L. M., Griesel, R.D., & Wortley, M.E. (1989). The Draw-A-Man test: A
50-year perspective on drawings done by black South African children. South
African Journal of Psychology. 19 (1), 1-5.
Richter, L. M., Griesel, R.D., Durrheim, K., Wilson, M., Surendorff, N. &
Asafo-Agyei, L. (1998). Employment Opportunities for Graduates in South
Africa: A Contemporary Analysis. South African Journal for Psychology. 28
(1), 1-7.
Rudenberg, S.L., Jansen, P. & Fridjhon, P. (2001). Living and Coping with
ongoing violence in South Africa: A cross-national analysis of children’s
drawings using structured rating indices. Childhood. 8 (1), 31-55.
130
Sadock, B.J. & Sadock, V.A. (2003). Kaplan & Sadock’s Synopsis of
Psychiatry: Behavioral Sciences/ Clinical Psychiatry. (9th ed.). Philadelphia:
Lippencott, Williams & Wilkins.
Saklofske, D.H. & Wrightson, L. (2000). Validity and reliability of the Draw A
Person: Screening Procedure for Emotional Disturbance with adolescent
students. Canadian Journal of School Psychology, 16 (1), 95-102.
Siegler, R.S, Deloache, J.S., & Eisenberg, N. (2003). How children develop.
New York: Palgrave Mcmillan.
131
Veltman, M.W.M. & Browne, K.D., (2002). The assessment of drawings from
children who have been maltreated: a systematic review. Child Abuse
Review. 11(1), 19-37.
Ward, C.L., Martin, E. Theron, C. & Distiller, G.P. (2007). Factors affecting
resilience in children exposed to violence. South African Journal of
Psychology. 37 (1), 165-187.
Wessel, J., Carter, D., Jones, S., Hatch, K., Kasten, H., Noll, T., Schofield, D.
& Surges, K. (1993, April). Use of the DAP: SPED with a sample of students
enrolled in ED, CD, and LD public school programs. Paper presented at the
Annual Convention of the National Association of School Psychologists,
Washington, DC.
132
Williams, J.A., Xintolo, N., & Nombaca, B. (2001). Emotional distress reflected
in drawings of children from Alexandra Township. Southern African Journal of
Child and Adolescent Mental Health. 14 (2), 44-46.
133
APPENDIX A
Qualitative Questionnaire for Observed Behaviour in the Classroom
(QQOBC)
Dear teacher,
Please take the time to complete this questionnaire on behaviour you might have
observed of ___(insert name of participant) . This will help to conclude the
research and to further identify which children are in need of emotional support.
Please note that each parent has already granted permission for your feedback.
134
6. Does this learner get into trouble?
Never Sometimes Often Always
What for?
___________________________________________________________________
_____________________________________________________________
7. Are you aware of any emotional or social problems for this child?
Yes No
Comments:
___________________________________________________________________
___________________________________________________________________
___________________________________________________________________
___________________________________________________________________
___________________________________________________________________
135
APPENDIX B
Permission to partake in Research
Dear Parents
For me to conduct this research, I need to assess children in Grade 4. This involves
filling out a questionnaire as well as completing three drawings. It would also be
necessary to obtain collateral information from you as the parent, or from the class
teacher. The assessment will take place one morning during school hours, and would
take approximately two hours.
Participation is voluntary. If you do not want your child to participate, the school will
make alternative arrangements for that period. A total of 120 children need to be
assessed for the research, and they will be chosen randomly from those for who
consent have been obtained.
All information will be strictly confidential. This means that the school will receive
global results. If any individual child needs further help, the parents will be contacted
and appropriate recommendations will be made.
Any further queries can be directed to Francoise Kubierske on 082 563 9278
Please fill out the form below to indicate whether you would like your child to
participate.
Many thanks
________________________________
Francoise Kubierske
MA (Psych) Intern
University of Johannesburg
136
Please tear along the line and return to school by Tuesday 30 October 2007
137
APPENDIX C
Feedback for participants Regarding the Presence of Emotional
Disturbance
Dear Parent,
The results of the screening procedure for emotional troubles indicate that
your child’s level of emotional distress at the time of testing was:
Please seek further assistance for moderate and high scores. Your
headmaster or school counsellor can guide you in this regard. Alternatively,
my office contact details are as follows:
Francoise Kubierske
Mindsense Counseling and Learning Centre
Suite 10
123 Rietfontein Rd
Primrose
Tel: (011) 822 2959
Kind regards
Francoise
____________________________
138
APPENDIX D
Dear Parent,
Thank you for giving permission for your child to participate. As indicated, a
limited amount of learners could be accommodated in the research due to the
restricted availability of testing material. I would therefore like to inform you
that your child was not included in the sample, and no emotional evaluation
was carried out.
If you have any concerns in this regard, please feel free to contact your
headmaster or school counselor. Alternatively, my office contact details are as
follows:
Francoise Kubierske
Mindsense Counseling and Learning Centre
Suite 10
123 Rietfontein Rd
Primrose
Tel: (011) 822 2959
Kind regards,
Francoise Kubierske
____________________________
139
APPENDIX E
Example of Qualitative Analysis Spreadsheet
SN4 50 61 47 52 47 66
QQOBC low ave never never never s/times Quiet, reserved. Trouble for not doing homework
SN5 51 68 59 59 48 56
high
QQOBC never never never s/times Intelligent, talkative, racial incidents, wrong friends. Hostile to high achievers
ave
SN6 34 56 61 55 53 59
QQOBC good never never never s/times Friendly, chatty
SN7 38 64 49 58 59 60
No boundaries, intelligent, no morals or respect. Racial remarks, bad language.
QQOBC low ave never never s/times often Good home, hates high achievers
SN8 37 53 49 49 55 50
QQOBC low ave never never never often Disruptive
SN9 55 65 44 41 48 41
QQOBC good never never never s/times Friendly, talkative
SN10 57 72 49 44 51 60
QQOBC good s/times s/times s/times often High achiever, poor morals, racial remarks (stealing)
141
SN11 60 48 38 39 40 59
QQOBC good never never never never Intelligent, considerate.
SN12 54 61 41 47 45 45
QQOBC low ave s/times s/times never never Slow learner, frustrated with academics, doest ask for help
SN13 54 54 54 64 55 48
QQOBC good never never never s/times Friendly, overenthusiastic, over-familiar
SN14 53 54 44 50 88 47
QQOBC good never never s/times s/times Intelligent, naughty. Good communicator. Language, noisy, backchats
SN15 59 37 37 35 48 43
QQOBC good never never never never Intelligent, hardworking, neat, caring, independent
SN16 48 62 46 56 80 39
Forgetful, needs attention to do work, fear of losing stuff so puts lock on bag.
QQOBC low ave s/times often s/times s/times Homework incomplete. Father died, mother overprotective.
SN17 39 80 88 62 75 61
QQOBC low ave often often never often Naughty, unkempt, tearful, withdrawn when reprimanded. Homework not done.
SN18 47 63 44 52 53 54
QQOBC good never never never never Quiet, reserved
SN19 45 70 46 63 51 54
QQOBC good never never never s/times Talkative, friendly. Disruptive in class.
142