Professional Documents
Culture Documents
ABSTRACT
This paper reports on the preliminary findings of an on-going study on the washback effect
of the newly introduced school-based assessment (SBA) at the lower-secondary level
in Malaysia. This study specifically investigates how the school-based assessment has
affected the perceptions of students in relation to learning English as a second language.
In addition, the study attempts to explore the students’ responses to a call for change from
a purely testing culture into a learning culture at the beginning of its implementation. The
objectives of the study are therefore twofold: to gauge the washback effect on students’
overall perceptions of SBA and external examinations and challenges of implementing
School-based Assessment (SBA). Drawing on the data collected by means of questionnaires,
it was found that the sampled students were equally pessimistic about external examinations
and SBA. In addition, some barriers in implementing SBA as perceived by the students in
the given context are reported. It is hoped that the findings of this small-scale study which
was carried out after two years into the implementation of SBA, would contribute to a better
understanding of the complex phenomenon of “washback” in relation to SBA in Malaysia.
E-mail addresses:
allabaksh@usm.my (Alla Baksh, M. A.),
salleh@ukm.edu.my (Mohd Sallehhudin, A. A.),
yahyaameen73@yahoo.com (Tayeb, Y. A.),
haslinda.hassan@ppinang.uitm.edu.my (Norhaslinda, H.)
* Corresponding author
1070 Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016)
Washback Effect of SBA on Students’ Perceptions
Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016) 1071
Alla Baksh, M. A., Mohd Sallehhudin, A. A., Tayeb, Y. A. and Norhaslinda, H.
identified in the approaches of assessments agencies is minimised but the teachers’ role
employed in the 20th and addressing them in as assessors is increased.
21st century. “While the former pursued the On the other hand, Malaysia has
evidence at the end of the learning process actively been participating in international
(summative), the international agenda for assessments like PISA (Program for
twenty first century is the recognition of International Student Assessment) and
using assessment for learning purposes TIMSS (Trends in International Mathematics
(formative)” (Berry, 2011). and Science Study). A below-average
In line with this global shift in assessment performance by 15 year old Malaysian
practices, the Malaysian government students in such international assessments
introduced standards-referenced school- was another factor that led the government to
based assessment into its education system consider and embark on assessment reforms.
at the primary and lower secondary levels in It was discovered by means of recent PISA
2011 and 2012 respectively. The rationales results that the Malaysian students were
and stages of implementation were discussed not able to deal with items which required
in the Blueprint (2013). higher-order thinking skills. Therefore, the
Shohamy (1991) argued that the use of Malaysian government has set a target of
tests for power and control is a very common being ranked top-third in such international
practice in countries which the educational assessments by 2025 (MES, 2014).
systems are centralised: the curriculum is Comparatively, implementing an
controlled by central agencies. Malaysia entirely school-based assessment system
is one of these countries which controls with a minimal involvement of central
its curriculum, teaching and learning, and agencies in a high-stakes test like SPM
assessment through MES and MEC. The may not be accepted by society due to the
government’s intention of implementing impact the tests have on students’ future:
SBA is to promote real learning of the scholarships and other perks are awarded
subject matters among the students instead based on the test results. However, in its
of rote-learning and memorisation (MES, effort to promote assessment for learning at
2014). However, given the stakes attached to this level, the government has implemented
assessments at different levels along with the school-based oral assessment (SBOA)
society’s (macro-level stakeholders) faith in since 2002. Thus, considering the stakes/
teachers grading their own students without consequences of assessments on students’
fear and favour, the Malaysian government lives along with the formative stage of
had to choose the lower levels of education learning of the students, SBA began with
namely primary and lower-secondary levels the primary and lower-secondary levels of
in implementing an entirely school-based education (low-stakes) (MES, 2014).
assessment in which the role of central
1072 Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016)
Washback Effect of SBA on Students’ Perceptions
The assessment reform undertaken The school assessment on the other hand
by Malaysia is a synergistic school-based is a combination of formative and summative
assessment at the lower-secondary level components. Three aspects are contained
(MES, 2014) in which four assessment within the school assessment: assessment
components are contained within the new for learning, assessment of learning and
Form 3 assessment system. assessment as learning. Researchers (Black
They are: et al., 2003, as cited in Yu, 2010) have opined
that while raising students’ achievement is
i. Form 3 (central) assessment
the first priority of assessment for learning,
ii. School assessment it also involves teachers in multiple formal
iii. Physical Activities, sports and co- and informal assessment methods such
curriculum, and; as unit tests, quizzes, oral presentations,
listening activities and homework to judge
iv. Psychometric assessment
the quality of their students’ learning against
The first two assessments are categorised a set criteria or standards. In this regard, the
under the academic component whereas the MES has provided the teachers with a band
other two are non-academic ones. At the end scale of 1 to 6 in which 1 indicates weak
of the lower-secondary level, students now and 6 indicates advanced learners. Students
are provided with four different forms of are required to achieve the highest bands
results representing each component of the possible over the year. Black et al. (2003,
broad school-based assessment. The non- as cited in Yu, 2010) also highlighted that
academic component of the school-based an assessment activity should, ideally, be
assessment is, however, beyond the scope able to provide feedback to both teachers
of this paper. and students which may assist them in
The Form 3 (central) assessment is a assessing each other and also adapting
summative paper-and-pencil test which their teaching and learning activities. The
involves an evaluation of all four language school assessment as stated in the official
skills. Hence, it is assessment of learning documents of the ministry (moe.gov.
as it comes at the end of the term. The my), requires the teachers to identify their
MES provides the schools nationwide with students’ actual performance and their
sets of question papers to choose from desired performance as required by the
(comparative standards). Teachers grade ministry (intended washback) and provide
the exam scripts of their own students by their students with necessary feedback
strictly adhering to the guidelines provided with an aim to bridge the gap. Besides,
by the ministry. It is, however, worth noting students also have the opportunity to be
here that the previous assessment (PMR) assessed by their peers (peer-assessment)
focused mainly on the reading and writing and themselves (self-assessment).
skills only.
Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016) 1073
Alla Baksh, M. A., Mohd Sallehhudin, A. A., Tayeb, Y. A. and Norhaslinda, H.
1074 Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016)
Washback Effect of SBA on Students’ Perceptions
Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016) 1075
Alla Baksh, M. A., Mohd Sallehhudin, A. A., Tayeb, Y. A. and Norhaslinda, H.
1076 Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016)
Washback Effect of SBA on Students’ Perceptions
and what is happening in reality within of Penang and the district of Seberang
classrooms warrants an investigation into Perai Tengah were chosen for convenience
the washback effect of the newly introduced purposes. The sampled school had requested
school-based assessment on learners at the anonymity. The researchers were able to
beginning of its implementation. sample a balanced number of both genders,
As the previous studies on school-based 17 males and 17 females (n=34) from this
assessment in Malaysia have been centred school. Schools in Malaysia are ranked by
on concerns among teachers with regards to bands: band 1 being the highest and band 6
its implementation (Faizah, 2011; Baidzawi the lowest. As this study was conducted at
& Abu, 2013; Nair et al., 2014), it is deemed the very beginning of the implementation of
significant and timely to carry out a study SBA in Malaysia, the researchers employed
on the washback effect of school-based purposive sampling by identifying a middle-
assessment on students’ perceptions as they band (band 4) school.
are one of the direct stakeholders of any
Choosing a school of middle
assessment reforms.
banding to some extent minimises
Shih (2009) in her study on ‘how
students’ ability as a factor in
tests change teaching?’ highlighted that
making reference of the school’s
research on washback to date has been
experiences to other contexts, as
centred on washback of tests on teaching
compared to the other two scenarios
or they investigated the impact of teachers’
of either choosing a high-banding or
educational backgrounds or beliefs on their
a low-banding school. Experiences
teaching. However, scant attention has been
of a middle-banding school allow a
paid to the role played by student factors
wider range of readers to recognise
in affecting teaching within the washback
similarities of issues in their own
mechanism, and the researchers believe this
context (Yu, 2010, p.79).
is particularly so in Malaysia. According
to Shih (2009), some of the potential areas
of washback in relation to student factors Instruments
are like ‘how do students’ feedback affect This study looks into the washback effect
teaching?’ and ‘how do students’ learning of a newly introduced language assessment.
motivation influence teaching?’ Therefore, the researchers had to rely
on official documents (booklets, press
METHOD release, etc.) issued by the ministry to
Participants gauge the positive/intended washback. After
reviewing all the available documents, the
The participants of this study were from
researchers then triangulated the students’
the lower-secondary level i.e., PT3 of a
responses by means of their self-reported
co-education national secondary school in
questionnaires. Therefore, this study
Seberang Perai Tengah, Penang. The state
Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016) 1077
Alla Baksh, M. A., Mohd Sallehhudin, A. A., Tayeb, Y. A. and Norhaslinda, H.
1078 Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016)
Washback Effect of SBA on Students’ Perceptions
Table 1
Group Statistics
Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016) 1079
Table 2
1080
Levene's Test for Equality of Variances
is large, with eta squared=0.172. As for A p=0.091. This may imply that both female
student’s score on external examination is a and male students felt that they work equally
good indication of how well he/she will be hard to achieve their best in SBA
able to apply what has been learned, there is
a significant difference, i.e., male (M:1.24), II. Challenges of Implementing School-
female (M:1.65) with a t-value of 2.578 and based Assessment (SBA)
p=0.015. Female students here see their Measure of central tendency and standard
score in their external examination as a good deviation were computed to summarise data
indication of how well they will be able to for students’ perceived challenges of SBA.
apply what has been learned compared with As shown in Table 3, the mean value
the male students. The magnitude of the represents the average score of how the
differences in the means is large, with eta respondents perceived the challenges of
squared=0.172. The last item which had a implementing SBA in their school whereas
significant difference is A student’s score on the mode indicates the number of frequently
SBA is a good indication of how well he/she chosen responses by the students. The five
will be able to apply what has been learned, items (challenges) as shown in Table 3
i.e., male (M:1.00), female (M:1.59) with a are ranked from 1 (strongly disagree) to 4
t-value of 4.781 and p=0.000. This indicates (strongly agree).
that the female students see their score in The mean value for “Our school does not
SBA as a good indication of how well they have sufficient books on SBA” is 2.94, which
will be able to apply what has been learned indicates that students on average agreed
compared with the male students. The that the school does not have sufficient
magnitude of the differences in the means materials on SBA which seem to hinder
is large, with eta squared=0.417. the implementation of SBA in their school.
Two items from the tables above had On the other hand, the mean value for “We
insignificant differences. For the item, All find it difficult to understand the content
students work hard to achieve their best of the SBA books” is 3.24, which signifies
in external examinations, no significant that the students on average think that it is
difference is found in the students’ self- very difficult to understand the content of
reported responses with a t-value of 1.796 the SBA materials. The mean value for the
and p=0.082. This may imply that both item “Teachers have the knowledge and
female and male students felt that they work skills to implement SBA” is 1.56, which
equally hard to achieve their best in external shows that the students on average slightly
examinations. Finally, the last item which disagreed that their teachers have the
had insignificant difference is All students knowledge and skills to carry out SBA tasks.
work hard to achieve their best in SBA. The mean value for the item, “Students
There is no significant difference in the may not trust teachers’ assessment in SBA”
students’ views with a t-value of 1.741 and is 3.06, which shows that on average,
Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016) 1081
Alla Baksh, M. A., Mohd Sallehhudin, A. A., Tayeb, Y. A. and Norhaslinda, H.
respondents agreed that they may not trust the knowledge and skills to implement
the grades given by their own teachers in SBA” is 0.894, standard deviation for
relation to SBA. The mode for this item item “Students and teachers may not trust
is 4, which indicates that majority of the teachers’ assessment in SBA” is 0.919 and
respondents strongly agreed with this standard deviation for item “We do not have
statement. adequate class time for carrying out SBA
The mean value for item “We do not tasks” is 0.666.
have adequate class time for carrying
out SBA tasks” is 3.26, which shows that DISCUSSION
respondents agreed and strongly agreed The analysis of items for the first construct
that they do not have adequate class time revealed that overall, ambivalent attitude
for carrying out SBA tasks. The mode was evident among male and female
value is 3, which indicates that majority of students in the sampled school. They
the respondents agreed that the class time were uncertain of which assessment
allocated for SBA tasks is insufficient. method was more valuable for learning
As shown in Table 3, the standard purposes. Moreover, some of them provided
deviation of the item “Our school does not contradictory responses when asked about
have sufficient books on SBA” is 1.071, the two different assessment methods
which means data points are spread out over namely external examination and school-
a wider range of values. Since the mean is based assessment. Given such a reaction
2.94 and the standard deviation is 1.071, among this group of students and the MOE’s
it is estimated that approximately 95% of predilections for a ‘synergistic assessment
the scores will fall in the range of 2.94- system’ which combines both school
(2*1.071) to 2.94+ (2*1.071), or between assessment (formative and summative)
0.798 and 5.082. The standard deviation and central assessment (summative), the
of item “We find it difficult to understand question of which particular component has
the content of the SBA books” is 0.955, drawn serious attention among the student
standard deviation for item “Teachers have population in other schools and by extension,
Table 3
Challenges of Implementing School-based Assessment in our school
1082 Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016)
Washback Effect of SBA on Students’ Perceptions
states, then arises. Ideally, both components there were barriers (materials, teachers’
should be equally favoured by teachers and assessment literacy and time-constraint) in
students alike as they complement each other implementing SBA at the national schools
(intended washback). Also, do the ministry around the country.
officials (macro-level stakeholders) and the Overall, the students’ ambivalence
students (micro-level stakeholders) think towards SBA has led the researchers to
along the same lines is one big question that conclude that respondents of the study
has yet to be answered. By means of the might not be very clear of the purposes,
quantitative data analysis of this study, it is requirements and the potential benefits
safe to assume that the intended washback of of the newly introduced school-based
the SBA on students was not fully achieved assessment. Moreover, the background of
as there was a mismatch between what the respondents may have had an impact
was required by the policy makers and the on their perceptions. Different language
concept of SBA being operationalised by medium and academic achievement of
this group of students who participated in students may also have impacted the way
this study. In this circumstance, it is rather of learning and students’ perceptions on
difficult to say that the SBA has negatively assessment. Hence, the intended washback at
affected their knowledge and readiness as this point in time was still at the surface level
the researchers could not tell if the students due to many uncertainties. The overarching
were indifferent to the implementation of examination-orientedness could inevitably
SBA or other factors contributed to their be one of them.
actions.
As can be seen in Table 3, students at the CONCLUSION
sampled school provided negative responses For accurate interpretations of the results
in relation to the items under the second derived from this study, it is obligatory
construct. These students’ account indicate on the researchers’ part to acknowledge
that their schools were devoid of resources its limitations. The first limitation of this
(materials) and those at their disposal were study lies in the instrumentation employed
not helpful. One interesting dimension of to collect data from the participants. When
the new assessment system is the teachers this study was carried out, the teachers and
are now empowered to teach and assess other administrative officials in schools
their own students. However, judging the were not approachable due to various
students’ responses in this school, they uncertainties with regards to the newly-
appear to be quite pessimistic about this introduced school-based assessment. Hence,
idea. Also, time constraint as a barrier to students, who are one of the mostly affected
carry out SBA activities was perceived by stakeholders, were approached instead.
these students. The students’ responses for The researchers could only employ
the second construct overall indicate that quantitative method i.e., the questionnaire
Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016) 1083
Alla Baksh, M. A., Mohd Sallehhudin, A. A., Tayeb, Y. A. and Norhaslinda, H.
one of the 14 states in Malaysia. Hence, Bailey, K. M. (1996). Working for washback: A review
generalising the findings to other contexts of the washback concept in language testing.
should be done with caution. Language testing, 13(3), 257-279.
Malaysia. However, considering the fact that Baidzawi, A. M., & Abu. A. G. (2013). A comparative
the sampled population of this study was analysis of primary and secondary school
the first batch of students to experience this teachers in the implementation of school-based
assessment. Malaysian Journal of Research,
new assessment and their perceptions and
1(1), 28-36.
attitudes being shaped by other stakeholders
at micro level (teachers and peers) and Berry, R. (2011). Assessment Reforms Around the
World. In Berry, R. & Adamson, B. (Eds.),
macro level (parents, policymakers and other
Assessment Reform in Education (Vol. 14, pp.
stakeholders), the SBA has only managed 89-102): Springer Netherlands.
1084 Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016)
Washback Effect of SBA on Students’ Perceptions
Black, P., Harrison, C., & Lee, C. (2003). Assessment Messick, S. (1996). Validity and washback in language
for learning: Putting it into practice. McGraw- testing. Language Testing, 13(3) 241-256.
Hill Education: the United Kingdom.
MES (2014). Information on School-based
Blueprint, M. E. Blueprint 2013-2025. (2013). Assessment: Introduction to the school-based
Preliminary Report. Preschool to Post-Secondary assessment. Malaysia: Malaysian Examinations
Education. Ministry of Education Malaysia. Syndicate, Ministry of Education.
Chan, Y. F., Gurnam, K.S., & Rizal, Y., (2009). Nair, G. K. S., Setia, R., Samad, N. Z. A., Zahri, R. N.
School-Based Assessment Enhancing Knowledge H. B. R., Luqman, A., Vadeveloo, T., & Ngah, H.
and Best Practices. Shah Alam: University C. (2014). Teachers’ Knowledge and Issues in the
Publication Centre (UPENA). Implementation of School-Based Assessment:
A Case of Schools in Terengganu. Asian Social
Cheng, L., Watanabe, Y. & Curtis, A. (2004).
Science, 10(3), 186.
Washback in language testing: Research contexts
and methods. New Jersey: Lawrence Erlbaum Popham, W. J. (1987). The merits of measurement-
Associates. driven instruction. Phi Delta Kappan, 68(9),
679-682.
Cohen, J. (1988). The t test for means. Statistical
power analysis for the behavioral sciences, Qi, L. (2005). Stakeholders’ conflicting aims
pp.19-74. undermine the washback function of a high
stakes test. Language Testing, 22(2), 142–173.
Faizah, A. M. (2011). School-Based Assessment in
Malaysian Schools: The Concerns of the English Saw, L. O. (2010). Assessment profile of Malaysia:
Teachers. Online Submission. high-stakes external examinations dominate.
Assessment in Education: Principles, Policy and
Fredriksen, J., & Collins, A. (1989). A system
Practice, 17(1), 91-103.
approach to educational testing. Educational
Researcher, 18(9), 27-32. Shih, C-M. (2009). How Tests Change Teaching: A
Model for Reference. English Teaching: Practice
Hamp-Lyons, L. (2000). Social, Professional and
and Critique, 8(2), 188-206.
Individual Responsibility in Language Testing.
System, 28(4), 579-591. Shohamy, E. (1991). International Perspectives of
Foreign Language Testing Systems and Policy.
Hughes, A. (1993). Backwash and TOEFL 2000.
In Ervin, G. (Ed.), International perspectives
(Unpublished manuscript). University of
on Foreign Language Teaching, pp. 91-107.
Reading.
Lincolnwood, Ill.: National Textbook Company.
Khaniya, T. R. (1990). Examinations as instruments
Shohamy, E., Donitsa-Schmidt, S. & Ferman, I.
for educational change: Investigating the
(1996). Test impact revisited: Washback effect
washback effect of the Nepalese English exams.
over time. Language Testing, 13(3), 298–317.
(Unpublished doctoral dissertation). University
of Edinburgh. Stoneman, B. W. H. (2006). The impact of an exit
English test on Hong Kong undergraduates:
McNamara, T. (2010). Language Testing, Oxford:
a study investigating the effects of test status
Oxford University Press.
on students’ test preparation behaviours.
Morrow, K. (1986). The evaluation of tests on (Unpublished Doctoral dissertation). The Hong
communicative performance. Portal (Ed). Kong Polytechnic University.
Windsor, Berkshire: NFER/Nelson.
Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016) 1085
Alla Baksh, M. A., Mohd Sallehhudin, A. A., Tayeb, Y. A. and Norhaslinda, H.
Wall, D. (1996). Introducing new tests into traditional Yu, Ying. (2010). The washback effects of school-
systems: Insights from general education and based assessment on teaching and learning: A
from innovation theory. Language Testing, 13(3), case study. (Unpublished PhD dissertation). The
334-354. University of Hong Kong.
1086 Pertanika J. Soc. Sci. & Hum. 24 (3): 1069 - 1086 (2016)