Professional Documents
Culture Documents
Educational Evaluation in East Asia Emerging Issues and Challenges
Educational Evaluation in East Asia Emerging Issues and Challenges
No part of this digital document may be reproduced, stored in a retrieval system or transmitted in any form or
by any means. The publisher has taken reasonable care in the preparation of this digital document, but makes no
expressed or implied warranty of any kind and assumes no responsibility for any errors or omissions. No
liability is assumed for incidental or consequential damages in connection with or arising out of information
contained herein. This digital document is sold with the clear understanding that the publisher is not engaged in
rendering legal, medical or any other professional services.
EDUCATIONAL EVALUATION
IN EAST ASIA: EMERGING ISSUES
AND CHALLENGES
SAMUEL S. PENG
AND
JOHN CHI-KIN LEE
EDITORS
All rights reserved. No part of this book may be reproduced, stored in a retrieval system or
transmitted in any form or by any means: electronic, electrostatic, magnetic, tape, mechanical
photocopying, recording or otherwise without the written permission of the Publisher.
For permission to use material from this book please contact us:
Telephone 631-231-7269; Fax 631-231-8175
Web Site: http://www.novapublishers.com
The Publisher has taken reasonable care in the preparation of this book, but makes no expressed or
implied warranty of any kind and assumes no responsibility for any errors or omissions. No
liability is assumed for incidental or consequential damages in connection with or arising out of
information contained in this book. The Publisher shall not be liable for any special,
consequential, or exemplary damages resulting, in whole or in part, from the readers’ use of, or
reliance upon, this material. Any parts of this book based on government reports are so indicated
and copyright is claimed for those parts to the extent applicable to compilations of such works.
Independent verification should be sought for any data, advice or recommendations contained in
this book. In addition, no responsibility is assumed by the publisher for any injury and/or damage
to persons or property arising from any methods, products, instructions, ideas or otherwise
contained in this publication.
This publication is designed to provide accurate and authoritative information with regard to the
subject matter covered herein. It is sold with the clear understanding that the Publisher is not
engaged in rendering legal or any other professional services. If legal or any other expert
assistance is required, the services of a competent person should be sought. FROM A
DECLARATION OF PARTICIPANTS JOINTLY ADOPTED BY A COMMITTEE OF THE
AMERICAN BAR ASSOCIATION AND A COMMITTEE OF PUBLISHERS.
Introduction 1
Samuel S. Peng and John C. K. Lee
Part I. Practices and Challenges 11
Chapter 1 Education Evaluation and Assessment in Taiwan:
Status, Innovations and Challenges 13
Mu-lin Lu
Chapter 2 Educational Evaluation in Singapore:
Current Status and Future Challenges 23
R. Subramaniam
Chapter 3 Educational Evaluation in South Korea:
Current Issues and Future Prospects 35
Sun-Geun Baek
Chapter 4 Development of Educational Evaluation and Challenges
of Practicing Educational Evaluation in Thailand 45
Sirichai Kanjanawasee
Chapter 5 Educational Evaluation in Hong Kong:
Status and Challenges 61
John Chi-Kin Lee
Part II. Variations and Evoluation 73
Chapter 6 Educational Evaluation and Quality Assurance:
The Case of Hong Kong 75
Nicholas Sun-keung Pang
Chapter 7 School-Based Assessments (SBA):
Implementation and Challenges in the Malaysian Context 93
Suan Yoong
Chapter 8 Current Issues Regarding the School Evaluation System in Japan 111
Yasuhiko Washiyama
vi Contents
INTRODUCTION
BACKGROUND
Educational evaluation plays an important role in the pursuit of educational excellence
(Kennedy and Lee, 2008). At one level, it has been used in the evaluation of students'
individual differences and performance.
At another level, it has been used to contribute to an understanding of the overall
effectiveness and efficiency of institutions and academic programs, as well as such matters as
the specific qualities of school management, curriculum, teaching staff, pedagogical practices
and educational policies.
Educational evaluation is not a simple look in the mirror, a vain exercise; it is an effective
way to provide evidence for all decision-making pertaining to resource allocation, program
improvement, personnel hiring and promotion, selection of candidates, and educational
policy-making. Evaluation is, in fact, an integral part of the multi-faceted process of
enhancing quality and attaining excellence.
Educational evaluation, as concept and as a set of practices, is not new in East Asia.
However, in recent years it has received much greater attention from the general public as
well as the educational community because of changes in the political and economic
conditions in society.
First, as the demand for educational resources increases, both the government and the
schools as a whole need to know how well their resources have been effectively allocated and
used; they also need to know the extent to which their objectives have been achieved.
Evaluation is thus a necessary mechanism for monitoring the educational quality of
institutions and programs.
Secondly, as education in general and postsecondary education in particular have been
expanded in response to global competition and the generally rapid rate of economic
2 Samuel S. Peng and John C. K. Lee
development in society, there are concerns about the equity and quality of education on the
part of all stakeholders, including students, parents, teachers, school administrators and
policy-makers.
Subsequently, there have been strong demands for an effective evaluation system to
ensure equity and quality, on the one hand, and proficiency and productivity on the other.
Moreover, constant reforms of the educational system increase the demand for input, process
and outcome evaluation. The great attention given to accountability and quality assurance
makes the practice of evaluation more complex and innovative, with ever more diversified
paradigms, methods and approaches.
Thirdly, globalization increases awareness of the importance of a country’s international
competitiveness and stimulates interest in international comparisons of student ability and
competence. For example, the International Association for the Evaluation of Educational
Achievement (IEA) and the Center for Educational Research and Innovation of the
Organization for Economic and Development (CERI/OECD) have been carrying out
international surveys of student achievement in science, mathematics, language and civic
education.
The results of their surveys have motivated the participating countries to consider how
best to evaluate and improve their educational systems to ensure that their students are well
prepared and competent. Consequently, the need for effective educational evaluation systems
and methods is an important issue, one that is frequently studied and discussed in East Asian
countries.
However, the development of an effective evaluation system is not a simple task. Many
factors need to be considered.
Aside from measurement techniques and methodological perspectives, an effective
system needs to be cognizant of the cultural, political, economic and social contexts in which
an evaluation system is implemented. Consideration of such contexts is critical in East Asia
for one simple reason – education is highly valued in our society due to its traditional
Confucian culture.
All high-stakes educational evaluations, such as senior high school entrance
examinations, college entrance examinations and teacher qualification examinations, as well
as school performance evaluations, often receive a great deal of public attention and scrutiny.
Any official evaluation system, in terms of its contents, standards, and implementation
procedures, may set the guidelines for teaching and student learning; schools tend to operate
on the basis of, teachers tend to teach and students are likely to learn mainly what is to be
assessed and evaluated.
What is not tested or otherwise judged or assessed for “official” purposes usually gets
less attention or is simply ignored. Thus, implementing a national evaluation system that is
objective and fair without having any negative impact on schooling is a major challenge in
most places in East Asia. Scholars in the field of educational evaluation are actively engaged
in the search for an Asian model that will work well within all East Asian social and cultural
contexts.
East Asian countries serve as an ideal testing ground for judging the effectiveness of
various evaluation models and programs because they share many things in common. Aside
from the high value placed on positive attitudes toward education, educational governance at
the systems level tends to be centralized and controlled by the central government. Given the
similar cultural and social contexts, educators may be able to identify theories and practices
Introduction 3
that are most suitable for their students and educational systems by comparing the different
evaluation systems or programs developed by other countries or special administrative
regions.
It was for this reason that the “Forum for Educational Evaluation in East Asia: Emerging
Issues and Challenges” was initiated and held on October 2007 at National Taiwan Normal
University.
It brought together a group of distinguished scholars from Hong Kong SAR (China),
Japan, Macao SAR (China), Malaysia, Singapore, South Korea, Taiwan and Thailand for
scholarly discussion of the problem of developing, implementing and improving educational
evaluation systems that are sensitive to the local cultures and social contexts. The breadth of
issues discussed, free and open expression of ideas and opinions, and general sense of the
success of this forum prompted the idea of publishing the presented papers in order to share
the participants’ knowledge, ideas and experience with those who did not have an opportunity
to participate directly in the forum.
This book was therefore developed with the intention of providing a foundation for
further discussion of educational evaluation models within the context of East Asian cultural
and social conditions. It aims not only at identifying challenges of and strategies for
educational evaluation but also at facilitating advanced research on educational evaluation in
East Asia.
Through an examination of the current principles, policies and practices of educational
evaluation in this region, researchers will hopefully be able to further construct and refine
evaluation theories and models rooted in the traditional cultures and educational values of
East Asia.
Of course, it is hoped that such theories and models will also be applicable to other
regions of the world.
Topical area M.L. R.S. S.B. S.K. C.L. S.P. S.Y. W.Y. P.S. K.C.
Evaluation policies ©
Evaluation of educational © © © © ©
progress
Evaluation of basic education © ©
Evaluation of the learning of © ©
reading, science and
mathematics
Evaluation of K-12 teachers © © ©
Evaluation of K-12 schools © © © © © ©
Evaluation of higher ©
educational institutions
Evaluation of academic © ©
programs
Evaluation of college © ©
candidates
Evaluation methods © ©
Note: M.L. -- Mu-lin Lu; R.S. -- Ramanathan Subramaniam; S.B. -- Sun-Geun Baek; S.K. -- Sirichai
Kanjanawasee; C.L. -- Chi-Kin John Lee; S.P. – Nicholas Sun-Keung Pang; S.Y. – Suan Yoong;
W.Y. – Washiyama Yasuhiko; P.S. – Pruet Siribanpitak; K.C. – Kwok-cheung Cheung.
CHAPTER SYNOPSIS
1. Evaluation of Higher Education
Mu-lin Lu, in the first of the case reports, focuses on the connection between higher
education evaluation systems and the trend towards globalization. In Taiwan, under the
revised University Act promulgated in 2005, the Minister of Education (MOE) took
responsibility for organizing an evaluation committee and formulating evaluation policies and
guidelines. Subsequently, in December 2005, the MOE and Taiwan’s universities and
colleges jointly established the Higher Education Evaluation and Accreditation Council of
Taiwan (HEEACT). Subsidized annually by the MOE, HEEACT is responsible for the
general operation and administration of university evaluation, and is expected to safeguard
the objectivity and accuracy of evaluations: the government’s financial support and the
enrollment size of each institution are directly dependent on these evaluations. The MOE also
expects the implementation of a systematic evaluation mechanism to lead universities toward
greater autonomy and management efficiency. However, as a newly-founded public
organization HEEACT confronts several technical problems, including the shortage of
professional evaluators, the dependence on governmental funding, and the lack of consensus
regarding evaluation indicators and methods and procedures for implementing evaluation.
These challenges have made the goal of achieving fairness and objectivity in evaluation more
difficult to accomplish. Dr. Lu argues that setting up a long-term evaluation mechanism to
Introduction 5
monitor the performance of universities is needed in order to assure teaching and learning
quality, but also that universities still need time to accommodate and assimilate the “culture
of evaluation.”
Sun-Geun Baek describes the extent to which educational evaluation has been used as
one of the main tools for the development and improvement of primary, secondary and higher
education in South Korea. He refers to five educational evaluation programs in South Korea:
Baek points out that there have been ongoing conflicts among stakeholders on
educational evaluation policies in Korea. For example, one of the teacher unions is strongly
against the implementation of TCDE. Some teacher educators and institutes are against the
implementation of TEIAS; furthermore, some teachers and professors disagree with changes
in the educational evaluation system, and they also disagree with the implementation of the
competition system. Nevertheless, Baek asserts that the future prospects of education in South
Korea will be bright because Koreans not only acknowledge the importance of education but
also are making substantial investments in education in order to build a powerful state.
6 Samuel S. Peng and John C. K. Lee
John Chi-Kin Lee emphasizes the problem of keeping the right balance between
decentralization and accountability of educational evaluation, and invites readers to think of
evaluation as being essentially an “accompaniment” rather than a “monitoring” device. Lee’s
chapter provides a brief overview of current educational evaluation measures and issues in
Hong Kong. With regard to such quality assurance mechanisms as school self-evaluation
(SSE), external school review (ESR) and Territory-wide System Assessment (TSA), he leaves
three questions open for discussion. First, how can we maximize the coherence, in both
rhetoric and practice, between and among educational reform, educational supervision and
evaluation policies? Second, how can we promote a systematic linkage between centralized
and localized systems so as to drive effective school improvement and teacher development?
Third, how can teachers’ workloads be reduced and teachers be relieved of their anxieties
regarding educational evaluation, and how can we foster a harmonious educational-evaluation
“community” of government agencies and teachers?
6. School Self-Evaluation
Part II of the volume contains five studies of recent innovations in educational evaluation
in East Asia. Nicholas Sun-Keung Pang introduces the details of a series of school-university
partnership projects on school self-evaluation in Hong Kong SAR. He also reports on an
ongoing investigation into the effectiveness of implementing self-evaluation in Hong Kong
schools and the factors that have hindered and /or facilitated this self-evaluation movement.
Pang suggests that there are two approaches to assuring the quality of education in Hong
Kong: an external mechanism via inspection and an internal framework via self-evaluation.
He concludes that, in order to facilitate change in schools, administrators should have the
power and leadership capacity to unify a given school’s goals and specify the technology for
achieving them, to promote consensus and the sharing of values among all members, and to
Introduction 7
focus attention by carefully selecting targets, controlling resources, and acting forcefully.
Pang further suggests that effective leaders are those who can be flexible in adopting
approaches to change that can cope with the challenges created by the ever-changing external
environment, and in leading their organizations toward excellence.
Suan Yoong provides a critical review of the student assessment system currently used in
Malaysia. Professor Yoong outlines the effects of the national assessment system, provides an
overview of the strengths and weakness of an over-obsession with examinations, and explains
the dominant forces shaping educational assessments in Malaysia, carefully taking into
account the cultural contexts. In presenting the challenges faced by the implementation of
school-based assessment (SBA) in Malaysia, Yoong points out that the examination-oriented
tradition imposes a great pressure on schools as well as students to excel on examinations.
Such pressure has led schools to attempt to control what students are to study, narrowing the
school curriculum to include only those subjects and topics that are expected to be tested on
exams and adopting policies that are geared solely toward helping students pass the high-
stakes examinations. He introduces the serious problems that have arisen from this narrow-
minded approach, including the failure to measure higher cognitive skills, the de-
professionalization of teachers and the taking of all the fun out of going to school. Yet Yoong
also expresses concern that the Ministry of Education’s latest effort in examination system
reform might find it difficult to overcome Malaysia’s deeply entrenched, examination-
oriented culture, a culture that places the gaining of high test scores as the top priority of
teaching and learning.
The chapter by Yasuhiko Washiyama deals with the notion of “autonomous school
administration” in Japan and that country’s subsequent evaluation practices in response to the
neoliberal ideas of deregulation and decentralization. Initially, Washiyama provides an
historical overview of the school evaluation system in Japan. In the contexts of deregulation
and decentralization, two evaluation-related policies have been developed: the Regulative
Standards for School Establishment, revised in 2002, include new provisions on self-
inspection and self-assessment; the Basic Policy of Economic Financial Administration and
Structural Reform of 2005 mandates that all schools providing compulsory education be
subjected to external evaluation. Professor Washiyama then introduces the Evaluation
Guidelines for Schools of Compulsory Education. The purpose of these guidelines is to
establish a Plan-Do-Check- Action management cycle for evaluation, to improve the quality
of education, to provide for self-evaluation and external evaluation, and to create a more open
school atmosphere. These guidelines cover the transition from the traditional quality control
system to a system focusing on outcomes, various criteria for evaluation, more focused
evaluations, and the establishment of a third-party evaluation organization. He also raises
concerns about school evaluation, including the problem of a too-heavy evaluation workload
for teachers and the fact that evaluation may become a form of routine paper work.
8 Samuel S. Peng and John C. K. Lee
Washiyama argues that these problems could be solved by the implementation of shadowing,
the transition from school evaluation to school diagnosis, and the facilitation of collaboration
among teachers. He further emphasizes that evaluation results must be communicated to
teachers, students, parents and community members as favorable forces for improvement.
Pruet Siribapitak first describes the number and qualifications of teachers in Thailand. He
then discusses current issues in teacher evaluation, and explores the future prospects of
teacher evaluation in that country’s public schools. Siribapitak discusses the various types of
teacher evaluation for public school teachers, including induction program evaluation, annual
performance evaluation, and academic status evaluation. He also points out that the Teacher
Civil Service Commission of the Ministry of Education had set up a taskforce to review the
teacher evaluation criteria in terms of three new dimensions: teacher discipline, morality and
ethical conduct; the quality of teaching and student development; and the results of teaching
and student development. Professor Siribapitak concludes that it might be more productive to
design systems, not for their formative or summative orientations but in terms of the decisions
that need to be made at different levels, e.g. by teacher, the principal, the school district, and
the MOE.
• Learning environments that can facilitate the life-long learning goals of 15-year-olds.
Macao’s PISA then exemplifies a unique and effective international evaluation
program
In short, the authors of the essays in this book provide comprehensive information about
their countries’ evaluation systems, programs and policies at various educational levels, and
show how these are responding—or trying to respond—to the unique needs of their specific
educational contexts. They also point out that there are still many problems to be solved.
First of all, the evaluation of student achievement for the purpose of allowing entrance to
higher levels of education, including senior high schools and colleges and universities,
remains a challenge. Unlike the United States, most East Asian countries have a centralized
system that requires students entering senior high schools and colleges and universities to
take a national examination. Admissions are solely based on students' performance on a test
which covers only selected subjects. “As a result, students tend to pay less attention to
learning in classrooms and concentrate more on private tutorials in particular subjects tested
in the entrance examination” (Kanjanawasee, p. 75). Such a system often hampers the normal
developmental process of a pre-college education. Many individuals’ true potential cannot be
fully expressed and developed because of the lack of opportunities and appropriate evaluation
mechanisms. Thus, how to screen and identify studetns’ talents and abilities, and how to
assure the high quality of education deserved by these students, remain serious questions.
Thailand’s new program, which requires the use of high school grades and students’
performance outside of school, offers an alternative model for college admission practices.
However, its success remains to be tested.
The second challenge is how to evaluate teacher quality and performance in the context
of fundamental educational reform. In particular, it remains hard to determine which
characteristics of teachers are critical for effective teaching and other professional duties.
From the perspective of East Asia’s Confucian heritage, a teacher should be able to fulfill
three missions: cultivating a virtuous person, teaching life skills, and removing students’
doubts. This implies that teachers will not only need professional skills for effective teaching:
they must also be able to serve as good role models for their students. In this age of
specialization and professionalism, the traditional image of a teacher-saint has gradually
faded away. Rather, there is a call for teacher-scholars who, as in China and Japan, are
expected to do original research as well as teach effectively. Things have become more
complicated in an era when more and more educators seem to view their students as mere
“consumers.” Is it possible to talk about professional ethics when the pedagogical relationship
has become a pedagogical contract that places more emphasis on responsibilities and rights
than on the potential development of individuals? There may be two ways to approach the
problem of teacher evaluation in our current society. One is normative, and seeks to define a
qualified, professional teacher. The other is technical, and seeks to evaluate teachers to see if
they are qualified and professional. Of course, we must have the definition or the standard
before we can have evaluation. Thailand’s teacher evaluation program includes the evaluation
10 Samuel S. Peng and John C. K. Lee
of teachers’ ethics and moral behavior, and this approach is worthy of the consideration of
other East Asian educators.
Moreover, a national evaluation program needs to evaluate both for accountability and
for innovation (summative and formative evaluation). Yet can we really rank schools and
colleges only for the sake of accountability? And how can we lower the pressure placed on
students competing for the best schools and colleges based solely on achievement test scores?
And should qualitative evaluation be used, in order to improve both instruction and learning,
rather than or together with the giving of merely quantitative ranks and grades? These are
some of the several key issues that still need to be resolved.
Finally, as shown in some of the essays in this book, successful implementation of
educational evaluation measures necessitates supportive school leadership (Chapter 6),
reduction of teacher anxieties and workloads (Chapter 5), enhanced communication among
stakeholders (Chapter 8), and the fostering of shared decision-making across all levels,
including government, school, parents and the community (Chapters 5 and 9). But to achieve
these prerequisite goals will take the collective wisdom of scholars in the field of educational
evaluation.
In conclusion, evaluation is an integral part of the educational reform and innovation
process. Assessment has the potential to act as a driving force for achieving educational
excellence, but it must be carefully designed and implemented. Inappropriately designed
educational evaluation programs or systems—those which cannot fit their contexts—may
have a negative impact that could be very harmful to education. For example, in many East
Asian countries where high-stakes examinations are prevalent, most students cram for exams
and are good at taking tests but they have little time to pursue their own personal
development, explore their talents, or reflect on the meaning of life. In other words, the
pursuit of high scores on tests becomes in itself the central theme or purpose of their life. Yet
as Lu in Chapter 1 points out, effective educational evaluation—with the ultimate goal of
reforming or improving the educational system—needs the cooperation of the whole
community: the parents (and larger public), students, teachers, school administrators and
government officials. It will be above all this common or communal will that can take us to
where we want to be.
REFERENCES
De Grauwe, A. and Naidoo, J. P. (eds.). (2004). School evaluation for quality improvement:
An ANTRIEP report. Paris: International Institute for Educational Planning, UNESCO.
Kennedy, K. J. and Lee, J. C. K. (2008). The changing roles of schools in Asian societies:
Schools for the knowledge society. London and New York: Routledge.
PART I. PRACTICES AND CHALLENGES
In: Educational Evaluation in East Asia ISBN 978-1-60692-887-5
Editor: Samuel S. Peng and John Ch-Kin Lee © 2009 Nova Science Publishers, Inc.
Chapter 1
Mu-lin Lu
Ministry of Education, Taiwan, ROC
I. PREFACE
Today, in the 21st century - the age of knowledge economy - a country’s competitiveness
is measured by its ability to apply knowledge towards economic progress and the pursuit of
excellence. In this light, higher education is regarded as the means to cultivating a nation’s
high caliber workforce. As such, governments worldwide are performing evaluations of
universities to ensure high quality within institutions of higher education, as well as to
measure administrative performance and to determine how to allocate funding and resources.
The number of Taiwan’s higher education institutions has expanded rapidly over the past
decade, more than 70% of Taiwan’s high school and vocational school graduates advance to
college or university. Therefore, the importance of controlling and safeguarding the quality of
higher education continues to grow. Establishing and implementing an evaluation system is
essential for improving the quality of university education. To this end, the Ministry of
Education (MOE) authorized the independent Taiwan Assessment and Evaluation
Association in 2004 to conduct evaluations of all universities in Taiwan. These evaluations
focused primarily on university administration and secondly on academic disciplines, with the
universities evaluated in groups. The evaluation results were announced as ratings. This
round of evaluations motivated universities and colleges to make a greater effort toward self-
improvement, and it also triggered new thinking regarding the development of university
evaluations in Taiwan. Based on the experiences of advanced countries, the MOE established
the Higher Education Evaluation and Accreditation Council of Taiwan (HEEACT) in
December 2005 to serve as an independent, dedicated agency handling all matters relating to
the evaluation and assessment of higher education institutions. The goal of establishing of
HEEACT aimed to make it easier to share evaluation experiences, accumulate data, develop
14 Mu-lin Lu
professional and objective evaluation criteria, train professional evaluators, and build a talent
pool and database. In the following, we explain the historical development of Taiwan’s
university evaluation system, its future development and current challenges.
II. STATUS
A. Legal Basis – The University Act: The MOE’s Responsiblities
in Evaluating Universities
The revised University Act was promulgated in December 2005. Article 5, Paragraph 1,
of the Act states that “Universities shall regularly carry out self-evaluations of teaching,
research, services, academic affairs, administration, student participation and other
proceedings; regulations for evaluations shall be formulated by the universities.” Article 5,
Paragraph 2 stipulates that “In order to promote the development of universities, the Ministry
of Education shall organize an Evaluation Committee, commission academic organizations or
professional evaluators to carry out regular evaluation on the universities, and publish the
results as a reference for educational subsidies from the government and the scale of
adjustment and development of universities; the methods of evaluation shall be formulated by
the Ministry of Education.” These two passages clearly stipulate that carrying out university
evaluations is the joint responsibility of the MOE and the universities. We can only attain the
goal of improving the quality of higher education if the two parties work together in a
concerted effort to plan and promote university evaluations with a sense of direction.
Plan of 2004, the University and College Faculty Development Center Evaluation Planning
and Implementation Plan of 2005, the Evaluation of Universities of Science and Technology
of 2005 and 2006, and the University Administration Follow-up Evaluation Plan of 2006.
In addition, the MOE and Taiwan’s universities and colleges came together in December
2005 to jointly establish the Higher Education Evaluation and Accreditation Council of
Taiwan (HEEACT), a public organization with the goal of serving in a capacity similar to the
evaluation agencies of Britain and Australia, that are known for their “independence from
official agencies.” However, as a sound evaluation institution cannot be created overnight, the
MOE is currently supervising and assisting HEEACT to ensure that its development proceeds
smoothly and efficiently. Presently, the MOE subsidizes funding that HEEACT requires for
planning and evaluations, as it gradually works to establish a systematized, institutionalized
dedicated agency for university evaluation.
2. Stage-by-Stage Establishment of a Higher Education Evaluation System
a. Present Goals. Present goals for HEEACT include: to have HEEACT plan and
implement all types of evaluation work so as to accumulate practical experience; to compile
and study information on higher education evaluation in advanced countries, produce
objective, professional evaluation criteria, and provide courses for the training of university
evaluators; to develop the council into an evaluation planning institution; and to set up a
diversified evaluation mechanism.
b. Medium-to-Long Term Goals. Medium to long-term goals for HEEACT include: to
gradually develop the Council into a dedicated, professional evaluation institution with the
task of researching and planning the evaluation of Taiwan’s universities; to set up databases
and talent pools; and to pass on these evaluations to other specialized evaluation institutions.
3. Present University Evaluations Conduct
a. Putting into Practice University Self Evaluation. The University Act stipulates that
universities shall regularly conduct self evaluations. Moreover for the sake of sustainable
development, universities shall safeguard the quality of their administrative affairs and
assume responsibility for administrative performance and improving their image and
reputation.
b. Assisting TEEAC in Implementing University Evaluation Work. Current Major Tasks:
c. Implementing the University Department and Graduate School Evaluation Plan (2006-
2010) based on a Five-Year Cycle
Accreditations for university departments and graduate schools that are put “on hold”
mainly for the following reasons: insufficient number of full-time teachers; insufficient
learning space and resources; unclear goals; or course design does not reflect the goals of the
university department/graduate school and lacks self improvement mechanisms.
Accreditations are denied if the above-mentioned problems are severe enough that student
learning and teaching quality are seriously affected. University departments and graduate
schools whose accreditations have been granted shall submit a self improvement plan and
Education Evaluation and Assessment in Taiwan 17
implementation objectives based on the suggestions in the evaluation result report. They will
not have to undergo another evaluation for the following five years. For those whose
accreditations have been put on hold, HEEACT will conduct a follow-up evaluation regarding
their shortcomings. Meanwhile, schools ‘on hold’ may not expand student recruitment or
apply for the establishment of new graduate schools. For those whose accreditations have
been denied, HEEACT will carry out a reevaluation of all items and the MOE will consider a
reduction of student quotas. Should accreditation be denied after a follow-up evaluation or a
reevaluation, student recruitment will be stopped.
4. Continuing with follow-up evaluations of university administrations
5. Encouraging universities to participate in international accreditation
As Taiwan’s universities seek to develop unique characteristics and reach global
standards, we cannot only use evaluations and assessments to strengthen the quality and
development direction of academic disciplines. Universities and colleges are being
encouraged to actively participate in international accreditation schemes as a way of obtaining
international recognition. University departments and graduate schools that have already
obtained international accreditation will be exempt from further evaluations during the
accreditation period. Participation in international accreditation schemes will raise the quality
of higher education and speed up the internationalization of universities in Taiwan, while also
winning them the recognition in international academia. Universities and colleges in Taiwan
currently participate in the following international accreditation schemes:
Since medicine is a specialized field, medicine department evaluations are planned and
conducted by the Taiwan Medical Accreditation Council (TMAC). In 2002, TMAC’s
evaluation system was approved by the National Committee on Foreign Medical Education
and Accreditation in the United States.
The business management departments of Fu Jen Catholic University and National Sun
Yat-sen University gained AACSB accreditation in April 2005, while National Chengchi
University followed suit in December 2006. The business management departments of other
universities including Asia University, Feng Chia University, National Cheng Kung
University, National Chiao Tung University, National Chung Cheng University, National
18 Mu-lin Lu
Taiwan University, Tamkang University, and Yuan Ze University have applied for AACSB
accreditation.
In Britain, higher education evaluation covers the two areas: teaching and research. The
Quality Assurance Agency for Higher Education (QAA) is in charge of institutional and
teaching evaluations, while the Research Assessment Exercise (RAE) is mainly in charge of
evaluating research. Both agencies submit their evaluation results to the Higher Education
Education Evaluation and Assessment in Taiwan 19
Funding Council for England (HEFCE), which distributes grants and funding to higher
education institutions based on these results. While the establishment of a systematized
university evaluation system in Taiwan is still in the initial stages, we hope that the
implementation of university evaluations will compel universities to attach greater
importance to administrative performance. Moreover, the MOE will refer to evaluation results
when deciding on matters relating to university development including funding, grants, and
subsidies, the increases or changes to colleges, departments, graduate schools, and student
quotas. The MOE’s overall aim in this regard is to achieve more efficient use of higher
education resources, as well as their proper allocation.
IV. CHALLENGES
In comparison with many advanced countries, the implementation of a university
evaluation system in Taiwan is still in the initial stage. Numerous systemic or technical
problems still need to be solved. Eight major challenges facing Taiwan’s university
evaluation are listed below:
Evaluators presently selected by the dedicated evaluation agency can be divided into
three categories based on the type of evaluation: 1. Veteran professors with research or
practical experience in higher education administration; 2. Business or industry
representatives familiar with specialized academic fields or university affairs; 3. Experts and
scholars specializing in certain specialized fields. TEEAC’s evaluators hold an abundance of
experience and knowledge in university administration or teaching in a specialized field.
However, as university evaluation is a highly specialized undertaking, we need to address
how the evaluation expertise of the evaluators can be raised through relevant courses,
seminars and workshops.
One example of the effort to design evaluation indicators and methods are the evaluations
currently used for Taiwan’s university administrations, departments and graduate schools and
colleges of medicine, where evaluation results come in the form of suggestions for
improvement submitted by evaluators. This presents a new challenge to the schools and the
public, who are accustomed to thinking in quantitative terms and are likely to note that
evaluators evaluating different universities, departments and graduate schools are not the
same. Our next step is to convince the schools and the public of the completeness of these
indicators, the scientific soundness of the evaluation methods and the truth of the evaluation
results.
Evaluation results can be presented as rankings, ratings, or the selected best. In the case
of university administrations, and departments and graduate schools in Taiwan, the evaluation
results are announced in the form of ratings, which complies with the spirit of the
accreditation system. Furthermore, the criteria for deciding in favor or against accreditation
include whether relevant measures taken by the institutions match their stated goals and
whether they have implemented self improvement mechanisms. Feedback from the public
shows that people ask themselves why the MOE does not announce university rankings to
enable citizens to immediately clearly understand the strengths and weaknesses of each
university and to help students make their choices. In actuality, the announcement of the
evaluation results pertains to the goals and spirit of university evaluation. Therefore, there are
certain differences in the way evaluation results are announced, depending on the type of
evaluation and the evaluation goal. In the future, the MOE will hold more in-depth
discussions with the evaluating agency regarding the form in which evaluation results will be
announced.
The focus of evaluations is not based on grade or rank, but rather to gain an
understanding of teaching quality and students’ learning performance in a systematic,
scientific, and objective way. Thus, a long-term evaluation mechanism must be established so
that the results of previous evaluations can serve as reference for monitoring a university’s
performance. Only then will universities be able to keep improving their shortcomings and
move toward excellence.
22 Mu-lin Lu
CONCLUSION
In light of the trend towards globalization and internationalization, Taiwan’s higher
education must be able to improve its quality, ensure performance, find its niche and develop
unique strengths to gain a sure footing on the international stage and move towards
excellence. University evaluation work plays a major role in this process. Our higher
education evaluation work will succeed only if it is systematically and scientifically planned,
implemented and reviewed. Moreover, we need to keep studying effective approaches used
by advanced nations to serve as lessons for us. We hope that Taiwan’s higher education
evaluation - with the assistance of an attentive public, active university participation, efficient
government action and a well-rounded evaluation system-will be able to help our universities
to achieve the vision of quality assurance and assume genuine responsibility for their
performance.
In: Educational Evaluation in East Asia ISBN 978-1-60692-887-5
Editor: Samuel S. Peng and John Ch-Kin Lee © 2009 Nova Science Publishers, Inc.
Chapter 2
R. Subramaniam
National Institute of Education, Nanyang Technological University, Singapore
ABSTRACT
As an island state, which is bereft of any natural resources, Singapore places
tremendous emphasis on the development of its human resources through education. It
has consistently ensured that its curricula are not only internationally competitive but its
examination system is also benchmarked against international standards. Educational
evaluation is a critical aspect of this quality control. This article focuses on educational
evaluation in two areas: learning of science and mathematics at the primary, secondary
and pre-university levels, and institutional evaluation of schools. It is shown that the use
of external evaluation for assessing learning: centralized testing at the primary leaving
level by an independent authority, a partnership model of this independent authority with
an international examinations syndicate for testing at the secondary and pre-university
leaving levels, and regular testing of science and mathematics achievement of students at
the primary and secondary levels by an international evaluation association, have been
factors which explain why the science and mathematics curricula in Singapore continue
to be of a high standard. The use of academic attainment of students to band secondary
schools and junior colleges in separate achievement tables, plus the inclusion of other
academic and non-academic factors in these achievement tables is a key aspect of
institutional evaluation, and explains why stakeholders in the country have been able to
derive satisfaction from the government’s heavy investments in education. Some
implications of these modes of educational evaluation are discussed.
INTRODUCTION
Education is probably the most important lever to empower a person in life as well as
stimulate socio-economic development in a country. High levels of education of the citizenry
have traditionally been correlated with enhanced developmental activities in a country.
The challenges posed by globalization, new developments in science and technology,
diffusion of information and communication technology in society, outsourcing, off-shoring
and corporate downsizing have impacted tremendously on countries in the past ten years.
Jobs have taken flight to countries which can provide best value for money. Since these
developments affect the competitiveness of nations, they therefore impact not only on the
employment of their workers but also on their education system since this prepares students
for life in the workforce. Educational reform has been an important way in which many
countries have been addressing these issues to varying extents, so that their students are better
able to meet the challenge of the new economic landscape.
Science and technology are potent agents for stimulating socio-economic development.
The land-labor-capital model of economic development is a yesteryear paradigm that is
increasingly giving way to a model based on science- and technology- driven economic
development. The experiences of Western countries show that this development model has
been very helpful in providing their people with a good standard of living. A number of
countries in Asia have also successfully experimented with this model of economic
development for nation building, for example, Japan and Singapore (Tan and Subramaniam,
1998). One of the key aspects of this model is that a country’s science and mathematics
curricula need to be internationally competitive. Regular benchmarking of standards will help
to ensure that educational reform is proceeding in the right direction and that the education
system is well positioned for achieving the desired outcomes.
Educational evaluation is a powerful tool to assess various aspects of the education
process as it can produce performance data for the use of stakeholders. The origins of
educational evaluation can be traced to the desire for quality control in the education process
so that standards and accountability can be promoted. The large number of schools in a
country, the diversity in their curricula and differences in assessment procedures at various
levels pose issues of a systemic nature which, if not addressed properly, can affect standards.
Educational evaluation affords directions in this regard as it can help to benchmark schools
and subject offerings at the various levels, thus ensuring that they measure up to the desired
norms. The epistemological foundations of educational evaluation have matured well over the
past 40 years, and several evaluation instruments and methodologies are available to assess
the efficacy of various aspects of the education process. Its importance has led to the
formation of a number of professional bodies - for example, the International Association for
the Evaluation of Educational Achievement (IEA), which has done yeoman service since its
inception in 1959. The findings of this association have appeared in international journals and
the media, and are highly regarded by the political establishment in several countries.
The principal objective of this paper is to share some insights into the process of
educational evaluation of science and mathematics learning at the primary, secondary and
pre-university levels in Singapore. An additional objective is to describe an evaluation
approach that has been used to appraise schools in the country. Whilst traditional modes of
educational evaluation are deeply embedded in the education system in Singapore, the use of
Educational Evaluation in Singapore 25
administer their own examinations in all the subjects. Needless to say, there is bound to be
some variability in the standards and difficulty levels for the various subject examinations. It
is only at the Primary School Leaving Examinations (PSLE), which is held towards the end of
primary six, that all students sit for the same examination in all the subjects and know where
they stand at the national level. The PSLE is administered by the Singapore Examinations and
Assessment Board (SEAB), the national independent agency for assessment and
examinations.
The PSLE is of a high standard for the following reasons:
(a) The range of questions set in the subject examinations encompasses the different
levels of Bloom’s taxonomy, with significant emphasis being placed on the higher
levels. This is in view of the pronounced weightage given in the curricula on
promoting thinking skills and problem-solving skills among students.
(b) Even though the PSLE examination papers are not available in the public domain (as
students are not allowed to take back the papers), anecdotal evidence based on the
feedback provided by teachers and students as well as the attention occasionally
given to these in the media indicate that the questions are intellectually challenging.
(c) It is set by an independent authority, and there are rigorous quality checks on what
appears in the test papers.
Additional evidence that the science and mathematics standards at the primary level are
internationally competitive can be drawn from the results of the Trends in Mathematics and
Science Study (TIMSS), an examination set by the International Association for the
Evaluation of Educational Achievement (IEA) every four years. These examinations are taken
by students of many countries and afford a rigorous comparative benchmark of the science
and mathematics curricula of a country. The findings from the TIMSS study reveal the
following observations in relation to Singapore (Martin et al. 1997, Mullis et al. 1997; Toh et
al. 2006):
• In TIMSS 1995, primary three students from Singapore were ranked 2nd in
mathematics, while primary four students were ranked 1st in mathematics and 7th in
science
• In TIMSS 2003, primary four students from Singapore were ranked 1st in
mathematics and 1st in science.
Juxtaposing the observations made on the PSLE with the results of the TIMSS support
the assertion that the curricula and assessment practices for gauging learning in science and
mathematics at the primary level in Singapore are of a high standard.
individual schools and junior colleges on a semester basis. There is bound to be some
variability in the standards and difficulty levels of these examinations. However, the
milestone examinations taken at the end of secondary and pre-university education are
national examinations and, in effect, provide an evaluation of how well schools have prepared
their students in the various subjects. What is significant is that these leaving level
examinations are conducted jointly by two independent authorities: the Singapore
Examinations and Assessment Board (SEAB) and the University of Cambridge Local
Examinations Syndicate (UCLES), an international examinations syndicate. The external
checks and validity provided in this manner constitute a reliable endorsement of the standards
of the syllabus and curricula in the sciences and mathematics.
The involvement of UCLES in educational evaluation of students’ learning, based on the
syllabus prescribed by the Ministry of Education, offers a number of advantages:
Additional support for the international competitiveness of the science and mathematics
curricula at the secondary level is provided by data from the TIMSS studies done in 1995,
1997 and 2003. The findings are as follows for the Secondary two level:
• In TIMSS 1995, Singapore students were ranked 1st in science and 1st mathematics.
• In TIMSS 1999, Singapore students were ranked 2nd in science and 1st in
mathematics.
• In TIMSS 2003, Singapore students were ranked 1st in science and 1st in
mathematics.
Against the backdrop of the foregoing, it is clear that evaluation of science and
mathematics learning at the secondary and pre-university levels are rigorous from an
international standpoint.
• Academic Value-added Performance: this is a measure of how well the schools are
expected to perform in the GCE ‘O’ Level examinations based on the PSLE results
of their students and how they actually performed.
• Physical and Aesthetic Achievement: the data for physical fitness is weighted on
three scores – percentage of students who pass the National Physical Fitness Award
test, percentage of overweight students, and fitness index.
• Best Practices: this focuses on effective practices and systems that lead to desirable
outcomes in education – for example, organizational effectiveness, students’ all-
round development, staff well-being, teaching and learning.
• Character Development: this is to reiterate the point that values education and life
skills are an integral aspect of students’ holistic education.
• National Education: this is to ensure that students are cognizant of Singapore’s
history and of how the country has achieved much despite the absence of natural
resources and the presence of other constraints.
them to improve their standings. The rankings done by the Ministry of Education have the
advantages that the evaluation criteria are transparent and rigorous.
DISCUSSION
Educational evaluation affords a systematic way of appraising the various processes in
education. If carried out by a competent authority, the information it provides can offer an
impartial perspective that would be of important value to stakeholders and the public. It
would, in effect, contribute towards certification of the quality of education.
In this chapter, we have focused on two aspects of educational evaluation: assessment of
science and mathematics learning at the primary, secondary and pre-university levels, and
evaluation of how schools meet their overall objectives.
In a country with numerous schools and where there are bound to be variations in the
standards and the difficulty levels of the examination papers set by these schools, a common
leaving examination provides the best mode of evaluating how schools have prepared their
students based on the syllabus prescribed by its Ministry of Education. This process of
centralized testing evens out any differences in standards, difficulty levels and anomalies in
content coverage that are inherently present in school-based examinations. This does not
mean that school-based examinations have their drawbacks, it is just that the common
denominator afforded by the same examination taken by all students provides more rigorous
benchmarking of learning. More importantly, it is of interest to stakeholders to know how
effective schools have been in their mission at a national level - a common mode of
referencing provided by centralized testing offers validity and reliability in this regard.
At the primary level, the examinations set by the SEAB, an independent agency, provides
not only quality assurance but also third party evaluation of how schools have prepared their
students for the examinations. Whist this mode of appraisal based only on the leaving level
(primary six) to evaluate learning may be criticized for the absence of consideration of the
results achieved by students at the other levels (primary one to five), it has to be borne in
mind that the PSLE is not just an examination to assess what students have learnt in primary
six but generally at the upper primary level (primary four to six) as well. That the
examinations set at the primary leaving level are of a high standard can also be gauged from
the fact that the national agency has been requested to provide testing services at the primary
leaving level for other countries in the region such as Thailand, Philippines and Indonesia. On
an international basis, the TIMSS results provided by the International Association for the
Study of Educational Evaluation provide additional evidence that the science and
mathematics curricula in Singapore are internationally competitive.
At the secondary level, again a similar situation prevails – school-based examinations at
all stages except towards the end of secondary education, where a nationwide examination for
all subjects is taken. The examinations are administered jointly by SEAB and UCLES. The
involvement of a renowned international examinations syndicate in national examinations
provides strong support that international third party evaluation has its merits as it can ensure
that high standards are maintained and that the examination results are internationally
recognized.
30 R. Subramaniam
Additional support for the international competitiveness of the science and mathematics
curricula at the primary and secondary levels is provided by the results of the TIMSS
examinations administered by the International Association for the Study of Educational
Evaluation.
The situation at the pre-university level is also similar. Again, with the examinations
jointly administered by the Singapore Examinations and Assessment Board and the
University of Cambridge Local Examinations Syndicate, educational institutions know where
exactly they stand at the national level in terms of the academic achievements of their
students.
A major reason why many countries participate in TIMSS is that it provides data for the
evaluation of the effectiveness of their education system in science and mathematics, two
subjects that count significantly in the science- and technology- driven world of today, with
respect to those of other countries (Johnson, 1999). The testing is expensive and logistics-
intensive but the data they provide is impartial and can catalyze much needed educational
reform. Purves (1991, p34) has summarized this succinctly as follows: The world could be
conceived as a huge educational laboratory where different national practices lend
themselves to comparisons that could yield new insights into determinants of educational
outcomes. The significance given to IEA surveys of science and mathematics achievement
has led to a situation where rankings of countries have become a de-facto endorsement of
educational indices of these countries (Johnson, 1999), not only in the international research
literature but also in the popular media. Thus, the TIMSS results provide a solid endorsement
of the educational standards in science and mathematics at the primary and secondary levels
in Singapore.
The system of streaming in schools, in Singapore has been criticized as being elitist and
that it puts a stigma on those in the less desirable stream. It is a unique practice not found
elsewhere. Critics of streaming tend to ignore the fact that streaming ensures that students are
channeled into classes where they can study at a pace which suits their academic
predisposition. Teachers are then in a position to provide better attention and support for their
learning needs.
One of the criticisms leveled against the ranking system is that it places too much
emphasis on academic attainment. The process of education is more than just studying, it
needs to develop students holistically through other means and these ought to be given
weightage in the rankings – so say the critics! Also, as the premier schools get the better
students (Tan and Gopinathan, 2000), the ranking is skewed in their favor. Moreover, with
the excessive emphasis on examination results, schools may focus on producing examination-
smart students and not explore innovative approaches in their teaching. (Poh, 1999; Cheah,
1998). These concerns have led the Ministry of Education in 2004 to tweak the evaluation
system by expanding the number of dimensions on which assessment is based. Instead of
ranking lists, there are now School Achievement Tables, one for the Special / Express stream
and another for the Normal stream. In the process, schools are now banded together – those
with similar academic results are grouped together and are listed alphabetically, and it is not
possible for the public to know the exact position of the schools in the tables. In the new
evaluation system, predominant weightage is still given to the performance of the schools in
the leaving level examinations as this is central to the mission of education but additionally
other academic and non-academic factors are included in the achievement tables. This ensures
that schools prepare students holistically, not just academically. In the new landscape,
Educational Evaluation in Singapore 31
significant emphasis is given to the promotion of creativity, innovation and enterprise among
students (Tan and Subramaniam, 2002).
The high stakes examination culture has put additional pressure on teachers to deliver and
students to perform. Anecdotal evidence suggests that these may have implications on family
life – for example, teachers putting in longer working hours and students clocking lengthier
hours of study. There is, however, recognition that Singapore is unique – it is a small country
with no natural resources and its survival is very much dependent on its linkages to the global
economy and attracting of investments; education thus provides the best opportunity for its
people to aspire for a better life and compete in the global economy. In such a scenario, it is
inevitable that sacrifices have to be made. To a significant extent, the high salaries that
teachers draw help to mitigate any such concerns they may have. Another aspect of the high
stakes examination culture is the concern that late developers as well as those who are not
academically inclined but who excel in pursuits such as the arts and sports are at a
disadvantage. To address the former, multiple routes for entry to tertiary institutes are in place
– for example, a student who has been in EM 3 stream in primary school and normal
technical stream in secondary school can enter university after a stint in a technical institute
and then at a polytechnic, provided that admission criteria to the university are met. To help
meet the aspirations of the latter group of students, a School of the Arts as well as the
Singapore Sports School has been established. In fact, the arts and sports scene in Singapore
has undergone a renaissance in recent times, and job opportunities abound.
The Singapore experience offers the following pointers to other East Asian countries
which have less centralized system of education:
FUTURE SCENARIOS
The 21st century presents new challenges to countries in various areas. No where is this
more pronounced than in the area of education as it has the potential to influence
developments in other sectors of the economy. Countries whose education systems are well
positioned to address the new challenges can ensure that their students would be able to
compete effectively in the new economy. Educational evaluation will thus continue to exert
importance as it affords scope for enhancing institutional practice so that schools can position
themselves for the future whilst evolving organically to the next level.
In Singapore, the present system of educational evaluation of science and mathematics
learning – use of national leaving examinations at the primary, secondary and pre-university
32 R. Subramaniam
levels, is not likely to change as it has served her well over the years. These are high stakes
testing that have proved their utility value over the years. The examinations set by an
independent authority at the primary leaving level affords third party validation of the quality
and rigor of the educational standards prescribed by the Ministry of Education for schools.
The links with UCLES at the post-primary levels are likely to be maintained as these have
served Singapore well over the past few decades.
In the author’s view, one aspect of testing that may be introduced in future examinations
to gauge learning is the use of 2-tier multiple choice questions. Currently, multiple choice
questions (MCQs) appear as part of most subject examinations at all leaving levels. The use
of MCQs has the advantage that it can test a good number of concepts across the curricula in
a limited time and also provides ease of administration and marking for teachers. One of the
disadvantages of MCQs is the probabilistic dimension inherent in its format. In an MCQ with
four options, the probability of getting the correct answer purely by guess work is 0.25. Also,
by a process of elimination of options by intelligent guessing, it is possible to narrow down
the answer to the correct option. A 2-tier MCQ addresses this significantly by asking students
to justify their answers from another choice of options (Treagust, 1988). An example of a 2-
tier MCQ is given here:
Question
Peter and Jane stand 300 meters apart in an open area where there are no buildings or
trees. They shouted ‘Hello’ to each other at the same time. Given that Jane’s voice has a
lower pitch compared to Peter’s and that there is no wind blowing, which one of the
following statements is true:
Answer
(a) Peter and Jane hear the word ‘Hello’ at the same time.
(b) Peter hears Jane’s voice first
(c) Jane hears Peter’s voice first
(d) None of the above
Reason
(i) Since Peter’s voice has a higher pitch, it travels faster and Jane thus hears him first.
(ii) A lower pitch sound carries less energy, and thus travels slower.
(iii) The speed of sound is not affected by the pitch, hence Peter and Jane will hear at the
same time.
(iv) ____________________________________________________________
(Write your reason here if you do not agree with any of the above reasons.)
Use of 2-tier MCQs would thus provide better indication of students’ understanding of a
concept tested in a question. The use of 2-tier MCQs, however, is not without their
Educational Evaluation in Singapore 33
CONCLUSION
Educational evaluation is a powerful approach to appraise various processes in the
education system. It can provide important information to stakeholders.
In this chapter, we have seen that the use of primary school leaving examinations
conducted by an independent agency that is not connected to schools or the Ministry of
Education provides the best form of educational evaluation of how schools prepare their
students. In the case of leaving level examinations at the secondary and pre-univeristy levels,
again the use of an international examinations syndicate in conjunction with an independent
local agency provides valuable information on how effective schools are in preparing their
students. Additionally, in the case of primary and secondary levels, the participation of
students in the TIMSS examinations provides supporting evidence that the curricula in the
science and mathematics in Singapore are internationally competitive. The high standing of
Singapore students in science and mathematics in the TIMSS examinations in 1995, 1999 and
2003 shows that students are prepared to international standards. Moreover, the system of
banding (previously ranking) of schools ensures that they are cognizant of the concepts of
accountability and stakeholder value.
34 R. Subramaniam
Singapore has consistently sought international benchmarks for its education system as it
provides the best assurance that the curricula and assessment are not only current but also
follow international norms
REFERENCES
Banfield, G. and Cayago-Gicain, S. (2006). Qualitative approaches to educational evaluation:
A regional conference workshop, International Journal of Education, 7(4), 510-513.
Caleon, I and Subramaniam, R. (2005). The impact of a cryogenics-based enrichment
programme on attitude towards science and learning of science concepts, International
Journal of Science Education, 27(6), 679-704.
Cheah, Y. M. (1998). The examination culture and its impact on literary innovation: The case
of Singapore, Language and Learning, 12, 192-209.
Johnson, S. (1999). International Association for the evaluation of educational achievement:
Science assessment in developing countries, Assessment in Education, 6(1), 57-73.
Lim, E. P. Y. and Tan, A. (1999). Educational assessment in Singapore, Assessment in
Education, 6(3), 391-404.
Martin, M. O., Mullis, I. V. S., Beaton, A. E., Gonzales, E. J., Kelly, D. C. and Smith, T. A.
(1999). Science achievement in the primary school years: IEA’s third international
mathematics and science study (TIMSS). Chestnut Hill, MA: Boston College.
Mullis, I. V. S., Martin, M. O., Beaton, A. E., Gonzales, E. J., Smith, T. A. and Kelly, D. C
(1997). Mathematics Achievement in the Primary School Years: IEA’s third international
mathematics and science study (TIMSS). Chestnut Hill, MA: Boston College.
Poh, S. H. (1999). Assessment issues in Singapore, Educational Evaluation: Issues and
Practice, 18, 31-32.
Purves, A. C. (1991). Brief history of International Evaluation Association. In W. A. Hayes
(Ed.), Activities Institutions and People, IEA Guidebook 1991, pp 34-48. The Hague:
IEA.
Tan, J. and Gopinathan, S. (2000). Education reform in Singapore: Towards greater creativity
and innovation, NIRA Review, 7, 5-10.
Tan, W. H. L. and Subramaniam, R. (1998). Developing countries need to popularise science,
New Scientist, 2139, 52.
Tan, W. H. L. and Subramaniam, R. (2002). Science and the student entrepreneur. Science,
298, 1556.
Toh K. A., Riley J. P., Lourdusamy A., Subramaniam R. (2006). School and science
achievement in Japan and Singapore: A tale of two cities. Educational Research for
Policy and Practice, 5, 1–13.
Treagust, D. F. (1988). Development and use of diagnostic tests to evaluate students’
misconceptions in science, International Journal of Science Education, 10, 159-170.
In: Educational Evaluation in East Asia ISBN 978-1-60692-887-5
Editor: Samuel S. Peng and John Ch-Kin Lee © 2009 Nova Science Publishers, Inc.
Chapter 3
Sun-Geun Baek
Department of Education, Seoul National University, Republic of Korea
ABSTRACT
This paper describes five significant features of educational evaluation in South
Korea. The first feature is ‘the School Activities Records (SAR)’, which is designed to
evaluate primary, middle, and high school students based on academic achievements, as
well as the development of social behaviors. The second feature is ‘the National
Assessment of Educational Achievement (NAEA)’, which is to control the quality of the
nationwide educational system by providing information on the overall educational
achievement and where improvement might be needed. The third feature is ‘the College
Scholastic Ability Test (CSAT)’, which is administered at the national level to screen the
eligible candidates for higher education. The fourth feature is ‘the Teacher Competence
Development Evaluation (TCDE)’, which is designed to evaluate primary, middle, and
high school teachers based on their teaching skills and management abilities. The final
feature is ‘the Teacher Education Institute Accreditation System (TEIAS)’, which is
designed to evaluate and accredit each teacher education program and institute. In
conclusion, current issues and future prospects of educational evaluation in South Korea
will be discussed.
I. INTRODUCTION
Located in the eastern part of the Asian Continent, South Korea is a democratic state and
a constitutional republic that holds a five-thousand year history. South Korea has 16
administrative units: The metropolis of Seoul, six metropolitan cities, and nine provinces.
South Korea has a population of around 48 million.
36 Sun-Geun Baek
Table 1. The Number of Schools, Teachers and Students in South Korea (2006)
More than 10 million people live in Seoul, which is the country’s capital. For your
reference, the number of schools, teachers and students in South Korea are indicated in Table
1. In addition, the enrollment rates for primary, middle, and high schools have reached about
100 percent since 1994. The ratio of high school students, who go on to a higher stage of
education, reached 82.8 percent in 2007, which was the one of the highest ratios in the world
(KEDI, 2007).
In this paper, five significant features of educational evaluation in South Korea will be
described. In addition, current issues and future prospects of educational evaluation in South
Korea will be discussed at the end.
(1) Personal Information: Student's name, gender, identification number, address, parent
names and their occupation, and special comments.
(2) Educational Background: Student's schooling (entrance, transfer, and graduation)
records, and special comments.
Educational Evaluation in South Korea 37
(3) School Attendance: Number of instructional days per year, number of absent days,
the reasons for absences (sickness, accident, and others), number of late days, the
reasons for lateness (sickness, accident, and others), number of absent classes, the
reasons for absences (sickness, accident, and others), and special comments.
(4) Awards: Prize name, level of prize, award date, and provider.
(5) Certificates: Type of certificate, registration number of certificate, registered date,
and provider.
(6) Career Guidance: Student's special talents and skills, interests, student's prospective
occupation, parent expectations, and special comments.
(7) School Optional Creative Courses: Areas of specific activities for optional creative
courses and comments.
(8) Extracurricular Activities: Class activities, school activities, club activities, group
activities, and general comments for each sub-category.
(9) Volunteer Work and Special Experiences: Contents, frequencies, places, and periods
of time for both volunteer work and special experiences.
(10) Academic Achievement:
− For primary school students: Special comments for each subject per year without
grade and rank, and special talents and interests.
− For middle school students: Raw score, five scale grade and rank for
each subject per semester, and special talents and interests.
− For high school students: Raw score and stanine score for each subject per
semester, and special talents and interests.
Each student's information on the SAR has been used to improve individual teaching-
learning processes. In addition, high schools have used each student’s information to select
their new students among middle school graduates. Colleges and universities have also used
each student’s information in SAR to select their new students among high school graduates.
Recently, the portion of each student’s information within total entrance examination scores
for higher education is gradually increased.
(1) To diagnose the educational achievements of elementary, middle, and high school
students and to check systematically the trends of their achievements.
38 Sun-Geun Baek
(2) To provide basic reference data for improving the national curriculum by analyzing
the students' achievements of the goals of the national curriculum and checking the
problems of the national curriculum implementation at the school and classroom
levels.
(3) To improve teaching and learning methods and to produce data to set up a learning
encouragement policy by analyzing the test items and the relationship between the
students’ achievements and background variables.
(4) To improve student assessment methods in the schools by developing and utilizing
new and appropriate assessment tools.
(5) To explore new research designs and methods, in order to attain the purposes of the
study of the NAEA.
Until 1988, several educational research centers, such as the Korean Educational
Development Institute (KEDI) or the Central Educational Research Center had intermittent
conduct with the NAEA.
However, the National Board of Educational Evaluation had regular conduct with the
NAEA from 1989 to 1997. Since 1998, the Korea Institute of Curriculum and Evaluation
(KICE), which was established as a government funded educational research and testing
center on January 1 in 1998, has had annual conduct with the NAEA. The general procedures
for the NAEA are as follows (KICE, 2007):
Table 2 shows 2007 NAEA's research subjects, assessment areas, item types, and others,
such as the listening comprehension test or questionnaires.
Specifically, both Korean Language and English tests included listening comprehension
through the Educational Broadcasting Systems (EBS), which was a nationwide broadcasting
system in South Korea. In addition, KICE analyzed about 3% ~ 5%-sampled data from the
total population and reported the results to the public. Additionally, almost all municipal and
provincial educational authorities have administered the NAEA tests to their all students and
analyzed their whole data according to their own purposes.
Educational Evaluation in South Korea 39
Korean
Social Studies
6th Grade Mathematics
Sciences
English - Multiple-choice items and constructed response items for each s
ubject were developed for each subject.
Korean - Constructed response items took 20-40% of the total score depen
Social Studies ding on the features of the subject.
9th Grade Mathematics - Listening comprehension test was included in Korean and Englis
Sciences h.
English - Questionnaires for students, teachers, and principals were develo
ped to investigate the relationship between the background variabl
es and the students’ academic achievements.
Korean - 2007 NAEA implemented on October 16-17, 2007.
Social Studies
10th Grade Mathematics
Sciences
English
In addition, the general procedures for CSAT are as follows (KICE, 2007):
(1) The Ministry of Education and Human Resources Development (MOE and HRD)
has set up the basic policy for CSAT.
(2) KICE works out the basic action plan.
(3) KICE develops the test items.
(4) Municipal and provincial educational authorities administer these tests to their
students.
(5) KICE analyzes and reports the test results.
About 580,000 applicants (12th grade students, high school graduates, and those having
equivalent certificates) took the CSAT on November 15, 2007. The stanine score for each
content area appeared exclusively on the student report card of CSAT in 2007. For your
reference, the stanine score is a type of standardized score with nine levels (see Table 3).
40 Sun-Geun Baek
However, the stanine score, percentile score, and one standardized score within two types
(one type’s mean equals 100 with standard deviation equals 20 or another type’s mean equals
50 with standard deviation equals 10) will be appeared on the student report card of CSAT
since 2008.
Stanine
1 2 3 4 5 6 7 8 9
Scores
% 4 7 12 17 20 17 12 7 4
Percentile 100-97 96-90 89-78 77-61 60-41 40-24 23-12 11-5 4-1
Times Maximum
Content Areas Items
(Minutes) Scores
Korean 80 100 50
English 70 100 50
Table 4 shows the 2007 CSAT's content areas, periods of time for tests, maximum raw
scores, the number of items, and item types.
The CSAT score is one of the most important data for getting admission into colleges or
universities.
For your reference, higher educational institutes in South Korea generally use 'the total
entrance examination scores' to select new students among high school graduates or those
who have equivalent certificates.
The total entrance examination scores are consisted differently among higher educational
institutes or even are consisted differently among departments, within the same
institute. Some institutes (or departments) use only (i) CSAT scores, whereas some institutes
use (i) CSAT scores and (ii) the information on SAR, and others use (i) CSAT scores, (ii) the
information on SAR, and (iii) institutes' own entrance examination scores such as interview
scores, essay writing scores, recommendation letters' scores, awards, and so on.
Educational Evaluation in South Korea 41
In order to present more substantial information about the college and university
admission system in South Korea, Table 5 shows the case of the Seoul National University,
which is the best university in South Korea. Please note there are a few differences in the
contents of Table 5 depending on departments or colleges within Seoul National University.
- The residents of
rural or fishing
villages, or disabled
Higher than level 2 (top
students.
Qualification - 11%) in CSAT at least 2
- Higher than level 2
out of 4 areas.
(top 11%) in CSAT
at least 2 out of 4
areas.
<Type A>
Selection Step I
SAR: 100%
(selecting 1.5 - 3
CSAT: 100%
times of the
<Type B>
quorum)
Documents: 100%
<Type 1>
CSAT: 50%
<Type A> SAR: 30%
SAR: 80% Interview: 20%
Documents: 10%
Interview: 10% <Type 2>
SAR: 50%
CSAT: 40%
Selection Step II Essay Writing:
<Type B> SAR: 20%
(final stage) 30%
1) Documents: 50% Interview: 40%
Interview: 20%
Interview: 30%
Essay Writing: 20%
2) Documents: 50%
Interview: 50%
42 Sun-Geun Baek
REFERENCES
Baek, S. G. (2007). Political suggestions for the successful implementation of the
accreditation system on teacher education institutes. Journal of Educational Evaluation,
20(2), 25-49.
Fitzpatrick, J. L., Sanders, J. R., and Worthen, B. R. (2004). Program evaluation: Alternative
approaches and practical guidelines (3rd ed.). NY: Pearson Education, Inc.
Korea Institute of Curriculum and Evaluation. (2007). The implementing plan for the college
scholastic ability test in 2007. KICE homepage: http://www.kice.re.kr/.
Korean Educational Development Institute. (2007). 2006 educational statistics in South
Korea. KEDI homepage: http://www.kedi.re.kr/.
44 Sun-Geun Baek
Ministry of Education and Human Resources Development. (2007a). New teacher’s manual
for school activities records. MOE and HRD homepage: http://www.moe.go.kr/.
Ministry of Education and Human Resources Development. (2007b). New implementation
plan for the teacher competence development evaluation. MOE and HRD homepage:
http://www.moe.go.kr/.
Ministry of Education and Human Resources Development. (2007c). New implementation
plan for the teacher education institute accreditation system. MOE and HRD homepage:
http://www.moe.go.kr/.
Seoul National University. (2007). The selecting plan for new coming students. SNU
homepage: http://www.snu.ac.kr/.
In: Educational Evaluation in East Asia ISBN 978-1-60692-887-5
Editor: Samuel S. Peng and John Ch-Kin Lee © 2009 Nova Science Publishers, Inc.
Chapter 4
Sirichai Kanjanawasee
Center on Educational Testing and Evaluation for Educational
and Professional Development, Faculty of Education,
Chulalongkorn University, Thailand
ABSTRACT
This paper presents a brief overview of current educational evaluation issues and
challenges in Thailand. Thai basic education is divided into 4 levels: 1) Grades 1-3 2)
Grades 4-6 3) Grades 7-9 ,and 4) Grades 10-12. At the final year of each level, students
must pass 4 types of assessments correspondence to each institution’s standards: 1) 8
subject areas assessment, 2) reading, thinking and writing skills’ assessment, 3) desired
learner attributes assessment, and 4) assessment of the participation in a developmental
activities program. Therefore, we need to develop phototypes of effective integrated
measurement tools and a procedure manual for a systematic assessment.
The new university admission system was designed and implemented in 2006, using
integrated multiple indicators to evaluate student achievement and performance. These
included: 1) The overall cumulative grade point average (GPAX) for Grades 10-12, 2) the
grade point average (GPA) in Grades 10-12 in 3 to 5 of the 8 subject groups of the core
curriculum; and 3) results of the Ordinary National Educational Test (O-NET) and/or the
Advanced National Educational Test (A-NET). These designs are facing new challenges
regarding the practice of evaluation techniques that are meaningful and fair for all
students.
INTRODUCTION
This paper provides general information on Thailand’s education system relating to the
basic education, professional standards for teachers and higher education admission system. It
presents an overview on related aspects of academic achievement assessments in basic
education and the evaluation of student achievement and performance for entering a new
higher education admission system. It also addresses the challenges of the evaluation role to
improve educational quality system and provides some policy recommendations.
At present, the framework of education reform in Thailand is based on the 1999 National
Education Act which provides principles and challenging guidelines for the provision and
development of Thai education in order to prepare all Thai people to live in a learning
society. In this regard, Thai people shall attain full development in terms of physical and
spiritual health, intellect, morality and integrity, as well as a desirable way of life that focuses
on living in harmony with other people.
The educational system is classified into 3 types: formal education, non-formal
education; and informal education. The formal education services are provided to these inside
the school system, through both public and private bodies. It is divided into 2 levels: basic
education and higher education. The basic education covers pre-primary, 6 years of primary
education, 3 years of lower secondary education, and 3 years of upper secondary education.
Higher education is provided at universities, institutions, colleges or other types of
institutions. It is divided into two levels: associate degree and degree levels. The non-formal
education services are provided by both public and private bodies to those outside the school
system. Informal education enables learners to learn by themselves according to their
interests, potential, readiness and the opportunities available that are organized by
individuals, families, communities, authorities or other learning networks.
The Office of the Education Council (OEC) is responsible for proposing national
education standards. Consequently, sets of standards were formulated by the Office in
cooperation with the offices responsible for basic, vocational, and higher education as well as
the Office for National Education Standards and Quality Assessment. With approval from the
Council of Ministers, on October 26, 2004, agencies providing education at all levels are
expected to abide by the national education standards, which are comprised of three
categories:
Development of Educational Evaluation and Challenges… 47
(I) Desirable characteristics of the Thai people, as both citizens of the country and
members of the world community, consist of 5 indicators:
The national education standards serve as the basis for setting assessment standards of
internal and external quality assurance mechanisms.
Currently, all agencies concerned have made progress in the development of relevant
educational standards for internal quality assurance as a guideline in the provision of
education by educational institutions under their supervision.
For example, learning standards for basic education have been formulated to respond to
the 2001 curriculum. In addition, the Committee for Development of a System to Evaluate
Higher Education Quality has appointed a sub-Committee to develop and set national
standards for higher education.
48 Sirichai Kanjanawasee
The Office for National Education Standards and Quality Assessment (ONESQA)
conducts external quality assessment at the levels of basic education and higher education
(lower-than-degree and degree levels) in line with relevant educational standards that focus
on assessment of educational institutions in the following respects: (1) educational
achievement (output/outcome); (2) input/process; and (3) efficiency in administration and
leadership.
Internal Quality Assurance, primarily serves as a basis for external assessment. All
educational institutions are required to implement an internal quality assurance system
comprised of self-control, self-audit, and self-assessment toward their educational standards.
The institutions must disseminate their Self-Study Report (SSR) each year.
Development of Educational Evaluation and Challenges… 49
The Office for National Education Standards and Quality Assessment (ONESQA),
established as a public organization in November 2000 to oversee external quality
assessment. The ONESQA, utilizing qualified external assessors under contract to assess both
basic and higher education institutions following standards relating to educational
achievement (output/outcome); input/processes; and efficiency in administration and
leadership. The external quality assessment of all institutions is conducted at least once every
five years. The assessment outcomes will be submitted to the agencies concerned and made
available to the general public.
Besides the education standards, the National Education Act also stipulated that teachers’
Council of Thailand shall be regarded as the Teachers and Educational Personnel Council. It
is incorporated with the principal objective to determine professional standards, issue and
revoke licenses, supervise and monitor the compliance with the professional standards and
code of ethics, including professional development, so that educational professional
practitioners, namely, those who are licensed to practice the teaching profession, educational
institution administrators, educational administrators and other educational personnel
(educational supervisors), shall have knowledge and understanding in the practice of
educational profession which is the licensed profession. The Secretariat Office of the
Teachers’ Council of Thailand therefore established these Educational Professional
Standards.
The Educational Professional Standards, which consist of standards of professional
knowledge and experience, standards of performance and standards of conduct, have been
announced as the Teachers Council of Thailand Regulation on Professional Standards and
Ethics in 2005.
In the educational professional practice, those who enter such profession shall meet the
specified standards of professional knowledge and experience, so as to be eligible to obtain
the license to practice the profession.
50 Sirichai Kanjanawasee
Standards of Performance
Standards of Conduct
• Personal ethics
• Professional ethics
3) Education professional practitioners shall care for and be merciful to, pay
attention to, assist and encourage their students and clients on an equal treatment
basis, in accordance with their roles and duties.
4) Education professional practitioners shall encourage their students and clients to
achieve learning, skills, good and proper conduct, in accordance with their roles
and duties, to their full capability and generosity.
5) Education professional practitioners shall conduct themselves as a good role
model in terms of physical conduct, speech and mental conduct.
6) Education professional practitioners shall not act in any manner, which is against
the physical, intellectual, mental, emotional and social development of their
students and clients.
7) Educational professional practitioners shall provide service honestly and equally
without asking for, accepting or acquiring any interests, which would be
considered abuses of their authority.
• Collegial ethics
• Societal ethics
conservation and development as well as preserve the public interests and adhere
to the democratic regime of government with the King as Head of the State.
According to the teachers and Educational Personnel Act of B.E.2547 (2004), there is a
six-scale teacher classification framework based on academic status for the new salary
structure. The classification ranges from assistant teachers, teachers, experienced teachers,
higher experienced teachers, expert teachers, to specialized teachers. The Teacher Civil
Service Commission sets up the criteria for performance-based teacher evaluation system
including three areas: (1) The discipline, morality and ethical conduct, (2) Performance
quality/core and specific competencies, and (3) The performance results. The evaluation
system tries to promote teachers to higher academic status. Teachers who get promoted to a
higher level will get extra monthly allowance. In addition, all public school teachers will be
evaluated their performance every six months for a salary increase.
BASIC EDUCATION
In accordance with the focus on educational reform, many steps - from national policy to
institutional level – are being taken to develop the basic education curriculum and the
teaching-learning process, to improve the assessment and evaluation of achievement, and to
expand access to formal education.
In 2002, in accordance with the National Education Act, 12 years of free basic education
was made available to students throughout the country, for the first time. The government
later increased that number to 14 years to include two years of pre-primary education. Basic
education covers pre-primary instruction, 6 years of primary, 3 years of lower secondary, and
3 years of upper secondary education, with the 9 years of primary and lower secondary levels
considered compulsory.
A 2001 update of the Basic Education Curriculum called for “core requirements” as
prescribed by the Basic Education Commission, and “specific content”, is directly related to
local needs and contexts, to be developed by the educational institutions themselves.
Technology- Related Education, and Foreign Language. Additionally, activities that focus on
responding to the learner’s specific interests are also included.
In 2005, following nationwide training programmes for administrators, supervisors,
teachers, and personnel in related departments, the 2001 Curriculum for Basic Education
became fully implemented for all grades.
At each level, Grades 3, 6, 9, and 12, students must pass four types of assessments
correspondence to each institution’s standards, they are: 1) Eight subjects areas’ assessment,
2) Reading, critical thinking and writing skills’ assessment, 3) Desired learner attributes’
assessment, and 4) Assessment of participation in developmental activities program.
Since 2001, students studying in Grades 3, 6, and 9, are obliged to take national
educational tests administered by the Office of the Basic Education Commission (OBEC) in 2
to 5 subjects. Beginning in 2005, Grade 12 students wishing to pursue higher education must
take the tests administered by the National Institute of Educational Testing Service (NIETS) as
shown in Table 1.
The national tests for Grade 12 students are of 2 types, the Ordinary National Educational
Tests (O-NET) and the Advanced National Educational Test (A-NET). Both O-NET and A-
NET require tests in 5 subjects; however, A-NET focuses more on critical thinking and
analytical skills. Results from O-NET and A-NET are used as one of the factors in the revised
higher education admission system.
54 Sirichai Kanjanawasee
Subjects Required
Levels of
Mathematics Thai Social Organizers
Education Science English
Language Sciences
Grade 3 x x OBEC
Grade 6 x x x x OBEC
Grade 9 x x x x x OBEC
Grade 12 x x x x x NIETS
Source: Bureau of Educational Testing, OBEC and The National Institute of Educational Testing
Service.
2. Teacher Evaluation
Based on performance-based teacher evaluation systems, applying teachers are evaluated
by authorities from schools and local educational service areas. Many teachers are passed and
get promoted to a higher academic status, whereas students’ achievement remains the same,
and in some schools students’ achievement becomes worse.
The Teacher Civil Service Commission must make clear the goals and roles of the
teacher evaluation. The goals must be to elevate the quality of teaching and learning, while
the roles must involve both formative and summative purposes. Additionally, the evaluation
criteria needs to be revised to focus on classroom practices, student development, and
learning outcomes.
3. The assessments of desired learners’ attributes are based on the subjective judgment
of teachers.
1. The school principal should impose effective teacher training regarding the
assessment of the eight subject areas that focus on higher ability measurement.
2. One key role for school principal is to ask academic staff and assessment experts to
develop prototypes for integrated reading, critical thinking and writing assessment
tools for teachers.
3. The school principal needs to ask academic staff and assessment experts to create a
procedure manual for systematic assessment of desired learners’ attributes.
The previous national admission system, in effect since 1967, it was well-accepted as one
of the most fair educational procedures in Thailand. However it depended solely upon an
entrance examination that evaluated a student’s academic performance only through that
particular test. As a result, students tended to place less importance on learning for knowledge
and skills in their regular classrooms and concentrated their efforts on tutorials for particular
subjects, in order just to be successful on the entrance examination. Consequently,
responsible agencies, such as the Office of the Higher Education Commission, the Council of
University Presidents of Thailand, and the educational institutions themselves called for a
change from the previous approach, with a new Central University Admission System
(CUAS).
56 Sirichai Kanjanawasee
Beginning in 2006, students completing Grade 12 are obliged to take national educational
tests of 2 types: the Ordinary National Educational Test (O-NET) and the Advanced National
Educational Test (A-NET).
Both examinations are administered by the National Institute of Educational Testing
Service to assess student proficiency in Mathematics, Thai language, Science, English
language and Social Studies, Religion and Culture. The A-NET tests focus more on critical
thinking and analytical skills, as well as aptitude in relation to professional disciplines.
The newly-modified university admission system can be summed up as follows:
(1)New Designs
Higher education admission system is always a high stake and crucial issue in Thailand.
It means the future life for students and their families, and there are many stakeholders
involved.
Development of Educational Evaluation and Challenges… 57
Academic Year
Components
2006 2007 2008
1. GPAX 10% 10% 10%
2. GPA (3-5 subjects areas from
20% 30% 40%
8subjects areas)
3. O-NET* 35-70%
60% 50%
4. A-NET* 0-35%
*
Remark Both O-NET and A-NET examinations are administered by the National Institute of
Educational Testing Service (NIETS).
The Office of Higher Education Commission and the Council of University Presidents of
Thailand has concluded the integrated multiple indicators to evaluate student achievement
and performance for entering higher education institutions as shown in Table 2.
• The Basic Education Commission must call for national content standards of core
curriculum in the eight subject areas. The NIETS should develop the O-NET Tests
which cover the national content standards for each subject area.
• The NIETS should develop the A-NET, Tests which focus more on critical thinking
and analytical skills as well as aptitude measures in relation to well-defined groups of
professional disciplines, such as medicine, engineering, education, architecture,
foreign languages, fine arts and music.
• The NIETS should establish testing standards for test construction, test
administration, test scoring and reporting. The NIETS should prepare for
computerized adaptive testing.
CONCLUSION
As specified, in the 1999 National Education Act, educational reform in Thailand must
establish educational standards, revised curriculum, and quality assurance. Currently, there
are three types of standards: national standards, and standards for internal quality assurance
and for external quality assessment. In accordance with the focus on educational reform, the
Basic Education Commission called for curriculum change at the national level (core –
curriculum) and at the institution level (local – curriculum). The proportion of core
curriculum to local content developed by the institution should be approximately 70:30
flexibly applied in compliance with the nature of each subject.
The Basic Education is divided into 4 levels: 1) Grades 1-3, 2) Grades 4-6, 3) Grades 7-9,
and 4) Grades 10-12. At each level, upon leaving grades 3, 6, 9, and 12 students must pass 4
types of assessments correspondence to each institution standards: 8 subject areas’
assessment; reading, thinking and writing skills’ assessment; desired learner attributes
assessment; and assessment of the participation in developmental activities program. The
Thai Basic Education System is facing many assessment barriers. The national quality
assurance policy should link external quality assurance with internal quality assurance to
serve local wisdom and to meet minimum national standards. The new teacher evaluation
approach should focus on classroom practices, student development and learning outcomes.
The system needs effective in-service teacher training on the assessment of learners’ higher
ability. It also needs to develop phototypes of integrated reading, thinking and writing
assessment tools and procedure manual for systematic assessment of desired learners’
attributes.
The previous national admission systems for Higher Education depended solely upon an
entrance examination that evaluated a student’s academic performance only through that
particular test. As a result, students tended to pay less attention to learning in classrooms and
concentrated more on private tutorials, in particular on subjects that counted for the entrance
examination. Consequently, responsible agencies called for a change in the previous approach
to a new Central University Admission System (CUAS). The system was designed and
implemented in 2006, using integrated multiple indicators to evaluate student achievement
and performance. These included: 1) The overall cumulative grade point average (GPAX) for
Grades 10-12, 2) the grade point average (GPA) in Grades 10-12 in 3 to 5 of the 8 subject
groups of the core curriculum; and 3) results of the Ordinary National Educational Tests (O-
Development of Educational Evaluation and Challenges… 59
NET) and/or the Advanced National Educational Tests (A-NET). The admission system is
facing the challenges of developing the O-NET tests that represent content standards for
different school curricula; the A-NET tests that corresponds to well-defined groups of
professional disciplines. The system also needs to develop effective GPA – equating methods
and composite index calibration methods, probably based on IRT, that will be meaningful and
fair for all students.
REFERENCES
Kanjanawasee, S. et al. (2007). The research and development of monitoring and evaluation
system on Thailand education. Bangkok: Office of the Education Council.
Kanjanawasee, S. et al. (2006). The achievement evaluation of Ministry of Education’s
performance project. Bangkok: Faculty of Education, Chulalongkorn University.
Kanjanawasee, S. (2000). Learning assessment : Executive recommendations. Journal of Research
Methodology, 13(1), 75 – 91.
Kanjanawasee, S. (2007). Modern test theories. (3rd ed.). Bangkok: Chulalongkorn University Book
Center.
Kolen, M. J. and Brennan, R. L. (1995). Test equating: Methods and practices. New York: Springer.
Office of the National Education Commission. (1999). The National Education Act of B.E.
2542(1999). Bangkok: Seven Printing Group.
Office of the Education Council. (2006). Education in Thailand. Bangkok: I.C.C. International Public
Co., Ltd.
Office of the Education Council. (2005). National education standards. Bangkok: VTC
communication Ltd. Partnership.
Secretariat office of the teachers’ Council of Thailand. (2005). Education professional
standards. Bangkok: Professional Standards Bureau.
In: Educational Evaluation in East Asia ISBN 978-1-60692-887-5
Editor: Samuel S. Peng and John Ch-Kin Lee © 2009 Nova Science Publishers, Inc.
Chapter 5
ABSTRACT
This paper provides a brief overview of current educational evaluation measures and
related issues in Hong Kong. The foci of measures for discussion are quality assurance
mechanisms especially school self-evaluation (SSE) and external school review (ESR),
and Territory-wide System Assessment (TSA). Three questions are posed for discussion:
(a) How can we promote coherence among educational reform (highlighting students’
whole-person development) and educational supervision and evaluation policies
(stressing academic performances) in both rhetoric and practice? (b) How can we
promote a systematic linkage between centralized systems (ESR, SSE, BCA/TSA) and
localized systems (teacher evaluation) so as to drive school improvement/effectiveness
and teacher development/effectiveness? (c) How can we help relieve teachers’ workload
and anxieties linked with educational evaluation and foster a shared community of
educational evaluation among government agencies, SSB/SMC/IMC and teachers?
I. INTRODUCTION
The term “educational evaluation” has diverse meanings and its scope is wide
encompassing many sub-fields and topics ranging from student evaluation (e.g., external
(public) examinations), personnel evaluation (e.g., teacher and principal evaluation) to
programme/project evaluation, school evaluation and system evaluation (e.g., international
studies of educational achievement, state and school district evaluation) (Kellaghan and
62 John Chi-Kin Lee
Stufflebeam, 2003a, 2003b). In this Forum for Educational Evaluation in East Asia –
Emerging Issues and Challenges, topics of presentations and sessions reveal the diversity
ranging from school evaluation systems, school-based assessments, teacher evaluation,
language proficiency assessment, international assessment (such as PISA), information
technology and psychological applications, to mobile devices and learning assessment, and
institutional evaluation (capacity building of evaluation personnel). While I shall give a very
brief overview of some of these topics, the paper will concentrate on the issues related to
school evaluation and teacher evaluation.
According to a seminar organized by the Asian Network of Training and Research
Institutions in Educational Planning (ANTRIEP) (De Grauwe and Naidoo, 2004, p.22),
school evaluation is “part of the decision-making process in education; it involves judgments
about the performance of schools through systematically collecting and analyzing information
and relating this to explicit objectives, criteria and values. Ideally, school evaluation involves
an (internal and external) assessment that covers all aspects of a school and their impact upon
student learning…From this perspective, school evaluation is not an end in itself, but the first
step in the process of school improvement and quality enhancement.” In that seminar, three
main evaluation tools, (external review or inspection, examinations and school self-evaluation
or internal evaluation) were discussed.
With regard to teacher evaluation, this covers a broad range of issues, such as licensure
testing for beginning teachers, innovations in teacher assessment and teacher tests and
connections between teacher evaluation, accountability and pay (Pearlman and Tannenbaum,
2003).
evaluation, it is useful to understand briefly the main bodies for coordinating and arranging
different evaluation policies and measures in Hong Kong. As shown in Figure 1, most public
examinations and assessment measures (e.g., Basic Competency Assessment and Territory-
wide System Assessement) are under the purview of the Hong Kong Examinations and
Assessment Authority (HKEAA) while the Education Bureau (EDB) (formerly known as the
Education and Manpower Bureau (EMB)) has a Curriculum and Quality Assurance Branch to
implement quality assurance measures such as external school reviews and quality assurance
inspections. On the other hand, the EDB may commission external bodies (e.g., universities)
to conduct evaluation studies or large-scale assessments such as the PISA (Programme for
International Assessment).
With regard to system evaluation, Hong Kong has taken part in a number of large-scale
international studies of educational achievement such as PISA, TIMSS (Trends in
International Mathematics and Science Study) (e.g., Leung, 2005) and PIRLS (Progress in
International Reading Literacy Study) (e.g., Tse and Loh, 2007). Taking PISA, organized by
the OECD (Organization for Economic Co-operation and Development) as an example, the
analysis of findings from PISA revealed that “Hong Kong’s education system has achieved
both excellence and equality of opportunity when compared with other participating
countries…Hong Kong has the highest percentage of immigrant students among the
participating countries/regions, and the achievement gap between the non-native students and
local students is substantial” (Ho, 2005a, p.2). The findings from PISA inform us about other
issues, such as the influence of students’ self-regulated learning on their achievements (Ho,
2004) as well as student performance in various subjects in different medium-of-instruction
schools (e.g., Ho and Man, 2007).
Figure 1. Main educational bodies and selected educational evaluation measures affecting Hong Kong
schools.
64 John Chi-Kin Lee
On the other hand, there are public examinations and territory-wide assessments
organized by the Hong Kong Examinations and Assessment Authority (HKEAA). The former
are exemplified by the Hong Kong Certification of Education Examination (HKCEE) at the
end of secondary five (grade 11) and the Hong Kong Advanced Level Examination (HKALE)
at the end of the sixth form (secondary seven, grade 13). It is worth noting that “standards of
performance on the HKCEE and HKALE have for many years been benchmarked against
standards in comparable subjects on British GCE and A/AS-Level examinations”.
(http://eant01.hkeaa.edu.hk/hkea/switch.asp?p_left=ir_left.aspandp_clickurl=http://www.hkea
a.edu.hk/zh/ir_intro.htm access: 1-11-2007). The latter refers to the Territory-wide System
Assessment (TSA), which will be discussed in the following section. Moreover, the
Education Bureau implemented a policy of language proficiency assessment for teachers
teaching English and Putonghua in which the HKEAA is responsible for arranging the
Language Proficiency Assessment for Teachers (LPAT).
As regards programme evaluation, schools have started to become acquainted with
monitoring or review of projects, especially those submitted to the Quality Education Fund
(QEF), established in 1998 to promote school-based, educational innovations. The QEF
stipulates that leaders of QEF projects need to submit progress reports, involving self-
evaluation of project effectiveness (with regard to attainment of the stated objectives, success
of the activities in the light of the impact on the participants and resources (e.g., people, time
and equipment, etc.) used for the activities). For the final report, the project impact in terms
of a project’s effects on learning effectiveness, professional development and/or school
development should be addressed and evaluated using an evidence-based approach. In
addition, evaluation of a project needs to include the quality and dissemination value of the
project deliverables (http://qef.org.hk/eng/main.htm?plan/plan01.htm, access: 26-7-2007). On
the other hand, teachers in some schools may have the chance to engage in small-scale action
research (e.g., Lee and Leung, 2006; Lee, 2007; Lee, Yin and Zhou, 2008) or learning studies
(Lo, 2006) through participation in university-school partnership projects (Lee and Williams,
2006).
In 1997, the Quality Assurance Inspectorate (QAI) was established and since 2003 there
has been “School Development and Accountability” (SDA) framework, set up to promote
school improvement through a combination of School Self-evaluation (SSE) and External
School Review (ESR). Such a framework reflects a gradual shift from an emphasis on
external inspection to a school’s own review of its performance as a starting point, to be
validated by ESR and the school as the center for change and improvement (Tang, 2008).
In the handbook Self-evaluation: Background, principles and key learning (MacBeath,
2005), published by the National College for School Leadership, there is a section on “Self-
evaluation and school review in Hong Kong” (pp.39-40). This praised Hong Kong’s model as
“exemplary of the sequential model, one that lies closest to the New Relationship in Schools
in England and one that offers some pointers to both the strengths and potential pitfalls of
Educational Evaluation in Hong Kong: Status and Challenges 65
shorter, sharper inspection centred on the school’s own internal evaluation” (p.39). It also
highlights the following issues (p.39):
• There may be disagreements over the final grade of different aspects of school
quality or a lack of democratic process in arriving at the final grade; and
• Downgrading of grades by the external school review (ESR) team may sometimes
lead to disappointment or resentment but in most cases the school accepts the
professional judgment and opinions of the ESR team
In Hong Kong, some school improvement projects were launched that helped schools to
familiarize themselves with, and implement, self-evaluation. Pang (2004, p.5), for example,
initiated projects adopting a bottom-up approach that aimed to help schools: develop their
own models of school-based management in the spirit of the recommendations of the
Education Commission Report No.7; institutionalize a self-evaluation framework in daily
practices for continuous improvement; and develop their own sets of school-based
performance indicators for use in school self-evaluation.
On the other hand, the Education and Manpower Bureau of the Hong Kong SAR
Government launched a partnership project known as the School Development through
School Self-evaluation Project with twenty-one schools (McGlynn, 2004; QAC, EMB, 2003a,
2003b).
The project proposed three focuses or circles of evaluation: learning at the center or first
level; culture, which refers to the enabling conditions and ethos conducive to student learning,
as the second level; and leadership as the third level (Kennedy and Lee, in press). The SSE
activities chosen by the project schools ranged from reading and/or writing, peer
observation/collaborative teaching, students’ evaluation of teachers’ performance, using a
staff appraisal process to promote school self-evaluation, teachers’ view of the ideal and
actual school, appraising the school’s performance through surveys of stakeholders to
students’ perceptions of the quality of school life and comparison with teachers’ views to
evaluation of the programme on personal and social education, evaluation of the staff
development programme in a new school, evaluation of mathematics teaching and students’
views on their first week in primary school (QAC, EMB, 2003a, p.12).
In addition to the efforts by various universities and the Government, some professional
organizations played a part in promoting school self-evaluation. Under the auspices of the
District Teachers’ Network, the Hong Kong Primary Educational Research Association and
the Education and Manpower Bureau co-organised with the School of Educational
Management of Beijing Normal University a “Sharing and Action Research on School Self-
evaluation in Primary School” project involving nine schools (Hong Kong Primary
Educational Research Association, 2003).
The project aimed to promote a culture of school self-evaluation through workshops on
using schools’ performance indicators, teachers’ and students’ self-evaluation, peer
observation and action learning on teaching practices (Kennedy and Lee, in press; Wu and
Lee, 2005).
MacBeath and Clark, commissioned by the Quality Assurance Division (QAD),
Education and Manpower Bureau (EMB) (now called the Education Bureau), led and
conducted two phases (Phase I and II) of “The Impact Study on the Effectiveness of External
School Review in Enhancing School Improvement through School Self-evaluation in Hong
66 John Chi-Kin Lee
Kong” (QAC, EMB, 2005, 2006). It is noteworthy that “the role and task of the SIT/SSE
team holds the key to embedding self-evaluation in the school (QAC, EMB, 2005, p.11) and
“school leadership is ultimately the single most important ingredient in making SSE work.”
(QAC, EMB, 2006, p.14).
According to a Legislative Council paper submitted by the Education and Manpower
Bureau in June 2007 (LC Paper No. CB(2)2071/06-07(01)), the major achievements and
observations of the SDA framework are (EMB, 2007, pp.4-5):
In the second cycle, scheduled to start in 2008/09, the following guidelines will be used
(based on p.7 and p.8):
In Hong Kong, it is basically up to the individual School Sponsoring Body (SSB) and/or
School Management Committee (SMC)/Incorporated Management Committee (IMC) to
decide upon their own teacher appraisal system (Lee, Lam and Li, 2003). These observations
are pertinent:
• While staff appraisal systems and the peer observation in schools are more popular,
they are still in various stages of development depending on individual SSB, SMC or
IMC. As staff appraisal systems are not usually and directly linked with incentives,
the assessment does not tend to be very rigorous (meeting the threshold and having a
central tendency in the case of having different raters). Most teachers do not feel
threatened by in-house evaluation.
• Teacher evaluation places more emphasis on overall professional duties than on
actual classroom teaching practices and/or student learning effectiveness/quality.
• There is also an increasing tendency to involve more than one evaluator in the
process of school-based teacher evaluation and having principal evaluation
conducted by teachers.
• There is little linkage or loose coupling between teacher evaluation and school
improvement.
ACKNOWLEDGMENT
The author would like to thank EDB’s colleagues, Mr Tony Tang, PAS(QA) for his
considerable support in providing useful information and Ms Po-ling Wu, PEO(QA) for her
help in interviews conducted for this study. Sincere thanks are extended to Professor Michael
Williams for his advice.
REFERENCES
Advisory Committee on School-based Management (2000). Transforming schools into
dynamic and accountable professional learning communities: School-based management
consultation document. Hong Kong: Printing Department.
De Grauwe, A., and Naidoo, J. P. (2004). Seminar report: School evaluation for quality
improvement: issues and challenges. In A. De Grauwe and J. P. Naidoo. (Eds.), School
evaluation for quality improvement: An ANTRIEP report (pp.15-40). UNESCO:
International Institute for Educational Planning.
Education and Manpower Bureau (EMB) (2007). Report on the implementation of the school
development and accountability and the planning of the second cycle. LC Paper No.
CB(2)2071/06-07(01).
Kennedy, K., and Lee, J. C. K. (in press). The changing role of schools in Asian societies:
Schools for the knowledge society. London: Routledge.
Ho. E. S. C. (2004). The nature and impact of self-regulated learning on student’s
achievement: What we have learned from the first cycle of PISA. Hong Kong: Faculty of
Education and Hong Kong Institute of Educational Research, The Chinese University of
Hong Kong (in Chinese).
Ho, E. S. C. (2005). Can basic education system in Hong Kong be equal and excellent:
Results from PISA2000+. Hong Kong: Faculty of Education and Hong Kong Institute of
Educational Research.
Ho, E. S. C. and Man, E. Y. F. (2007). Student performance in Chinese medium-of-
instruction (CMI) and English medium-of-instruction (EMI) schools: What we learned
from the PISA study. Hong Kong: Faculty of Education and Hong Kong Institute of
Educational Research.
Hong Kong Primary Educational Research Association (Ed.). (2003). Action research and
sharing of primary schools Self-evaluation. Hong Kong: Hong Kong Primary Educational
Research Association. (in Chinese).
Hong Kong Primary Educational Research Association (HKPERA) and Education
Convergence (EC) (2006). A research study on Hong Kong teachers’ stress: Preliminary
analysis. Educational Research Report Series No. 2. Hong Kong: HKERA and EC.
Kellaghan, T. and Stufflebeam, D. L. (Eds.). (2003a). International handbook of educational
evaluation (Part one: Perspectives). Dordrecht: Kluwer Academic Publishers.
Kellaghan, T. and Stufflebeam, D. L. (Eds.). (2003b). International handbook of educational
evaluation (Part two: Practice). Dordrecht: Kluwer Academic Publishers.
Lee, J. C. K. (2005). School effectiveness and improvement in Hong Kong: Review and
prospect. Educational Journal, 33(1-2), 1-23. (in Chinese).
70 John Chi-Kin Lee
Lee, J. C. K. (2006). Commentary on papers presented at the MORE symposium at the ICSEI
Annual Conference, Barcelona, Spain, 2005. Educational Research and Evaluation,
12(6), 595-600.
Lee, J. C. K. (Li Zijian) (2007). Foreword: University and school partnership research: From
4P to 4R. Shanghai Research on Education, 8, 1.
Lee, J. C. K., Lam, W.P. and Li, Y. Y. (2003). Teacher evaluation and effectiveness in Hong
Kong: Issues and challenges. Journal of Personnel Evaluation in Education, 17(1),
pp.41-65.
Lee, J. C. K. and Leung, Y. M. J. (2006). Exploring the core elements of improving
classroom learning and teaching. Hong Kong: Hong Kong Institute of Educational
Research, The Chinese University of Hong Kong.
Lee, J. C. K., and Williams, M. (Eds.) (2006). School improvement: International
perspectives. New York: Nova Science Publishers, Inc.
Lee, J.C.K., Yin, H. and Zhou, Z. (2008). Facilitating teacher professional development
through ‘4-P’ model: Experiences from Hong Kong’s Partnership for Improvement of
Learning and Teaching (PILT) project. Journal of Educational Research and
Development, 4(2), 17-47. (in Chinese)
Leung, F. K. S. (2005). Some characteristics of East Asian mathematics classrooms based on
data from the TIMSS 1999 video study. Educational Studies in Mathematics, 60(2), 199-
215.
Lo, M. L. (2006). Learning study – the Hong Kong version of lesson study: Development,
impact and challenges. In M. Matoba, Crawford, K. A., Mohammad, R. S. Arani (Eds.),
Lesson study: International perspective on policy and practice (pp. 133-157). Beijing:
Educational Science Publishing House.
McGlynn, A. (2004). Self-evaluation or self-delusion: The choice is yours. in N.S.K. Pang, J.
MacBeath and A. McGlynn (Eds), Self-evaluation and school development (pp.17-22).
Hong Kong: Faculty of Education and Hong Kong Institute of Educational Research, The
Chinese University of Hong Kong, Hong Kong.
MacBeath, J. (2005). Self-Evaluation: Background, Principles and Key Learning.
Nottingham: National College of School Leadership.
Pang, N. S. K. (2004). Developing schools through self-evaluation, in N. S. K. Pang, J
MacBeath and A. McGlynn (Eds), Self-evaluation and school development (pp.1-9).
Hong Kong: Faculty of Education and Hong Kong Institute of Educational Research The
Chinese University of Hong Kong.
Pearlman, M., and Tannenbaum, R. (2003). Teacher evaluation practices in the accountability
era. In T. Kellaghan and D. L. Stufflebeam (Eds.), International handbook of educational
evaluation (Part two: Practice) (pp.609-642). Dordrecht: Kluwer Academic Publishers.
Quality Assurance Division, Education and Manpower Bureau (QAC, EMB) (2003a). Is our
school a Hong Kong school of today and tomorrow: A self-evaluating school is the way
to success. Retrieved July 26, 2007, from http://www.edb.gov.hk/FileManager/
Tc/Content_705/SSE_Pub1.pdf.
Quality Assurance Division, Education and Manpower Bureau (QAC, EMB) (2003b). School
self-evaluation: Making it happen in twenty one schools. Retrieved July 26, 2007, from
http://www.edb.gov.hk/FileManager/Tc/Content_705/SSE_Pub2.doc.
Quality Assurance Division, Education and Manpower Bureau (QAC, EMB) (2005). The
impact study on the effectiveness of external school review in enhancing school
Educational Evaluation in Hong Kong: Status and Challenges 71
Chapter 6
ABSTRACT
In recent years, the Education and Manpower Bureau (now Education Bureau) has
implemented a two-pronged approach to assuring the quality of education in Hong Kong:
an external mechanism via inspection and an internal framework via self-evaluation.
However, both of these are mainly top-down and less effective than expected. This essay
argues that quality assurance in collaboration with an external independent agency
through a bottom-up approach is a more effective way to sustain development in schools.
Schools can only improve continuously when they have institutionalized a bottom-up
approach to school self-evaluation. After introducing the details of a series of school-
university partnership projects concerned with school self-evaluation in Hong Kong, this
paper investigates the potential effectiveness of implementing self-evaluation in Hong
Kong schools and the factors that have both hindered and facilitated such a policy.
the establishment, in 1997, of a Quality Assurance Inspectorate (QAI) to which schools were
accountable, the internal quality assurance framework relied on schools’ own ability to
evaluate themselves as part of the process of school improvement. The external quality
assurance mechanism worked by adopting a whole-school approach to inspection by the QAI,
which assessed schools' effectiveness, identified their strengths and weaknesses, made
suggestions on ways of improvement and development in the schools, and released inspection
reports for public reference. In order to continuously improve the quality of school education,
all schools were also expected to engage in cyclical processes of planning, implementation
and evaluation (Pang, MacBeath, and McGlynn, 2004).
Every school was expected to work towards meeting the educational needs of its students
as effectively as it could, and self-evaluation would provide information on which to base
plans for improvement. As for this self-evaluation, all schools were to produce documents
which outlined the long-term goals, prioritized development areas, set out specific targets for
implementation, evaluated progress made in work during the previous school year, and set
improvement or development targets for the coming year (Ministry of Education, 1984;
Scottish Office, 2002).
However, in the initial stage, the two strategies in this two-pronged approach to assuring
educational quality in Hong Kong might not have been as effective as expected. For one
thing, as for the external assurance mechanism, whole-school inspections could only take
place for two- or three-week periods (each time) and the QAI would not re-visit the same
school for at least five to six years, given the QAI teams’ manpower. Thus while whole-
school inspections were implemented in Hong Kong in the 1997/98 school year, between that
year and 2001/02 the number of schools inspected was only about 11.4% of the total.
Secondly, in conducting whole-school inspections in over 200 schools from 1998 to 2002, the
QAI found that most of these schools had not established a self-evaluation framework, and in
fact at that time there were still no appropriate school-based indicators for school self-
evaluation.
Schools can only improve continuously when they have institutionalized a self-evaluation
framework in daily practice and when there is a set of valid, reliable and school-based
performance indicators available for use in self-evaluation. Practicing self-evaluation enables
schools (i) to develop formal procedures for setting school goals; (ii) to have the participation
of teachers, parents and alumni in school management, development, planning, evaluation
and decision-making; (iii) to assess their progress towards goals as well as their own
performance over time; and (iv) to take appropriate steps toward self-improvement. When
school-based indicators are derived from the schools’ own stated aims, they are useful tools
for measuring and monitoring school performance in areas of interest. Self-evaluation with
appropriate school-based indicators provides information to schools, teachers, parents,
students and the community, using for reference the general profiles of schools and the
comparison among schools of similar background or quality. School self-evaluation and
school-based performance indicators are the crucial elements needed for continuous
improvement in and of schools (Cuttance, 1994; Scottish Office, 2002). A summary of the
pros and cons of this two-pronged approach to quality assurance in education is given in
Table 1.
Educational Evaluation and Quality Assurance 77
Table 1. The pros and cons of the two-pronged quality assurance strategy
School self-evaluation (SSE) is a mechanism through which a school can help itself
review the quality of the education it provides, improve continuously and develop itself into a
more effective school. The three major questions usually asked in the process of school self-
evaluation are: (1) How can we know about our school’s present performance? (2) What is
78 Nicholas Sun-keung Pang
our school's present performance? (3) What will we do after knowing this? These seem to be
simple questions, but if we want to have a full picture or thorough understanding of our own
school, we need to go through a systematic and objective self-evaluation.
After conducting the first few cycles of whole-school inspection starting in 1998, Hong
Kong’s QAI (in 2003 renamed the Quality Assurance Division, QAD) found that a self-
evaluation framework had not been generally established in Hong Kong schools and that no
appropriate school-based indicators had been developed for use in school self-evaluation. In
response to these weaknesses the author, with the support of the Hong Kong Centre for the
Development of Educational Leadership (HKCDEL) and the Quality Education Fund (QEF),
launched a series of school improvement projects to help Hong Kong schools institutionalize
a self-evaluation framework in daily practice, acquire the skills and techniques needed for
school self-evaluation, and develop a set of valid, reliable and school-based indicators for use
in self-evaluation.
These projects aimed at helping schools to (i) develop their own models of school-based
management in the spirit of the recommendations of Education Commission Report No. 7;
(ii) acquire the skills and techniques of school self-evaluation (SSE) and institutionalize a
self-evaluation framework in daily practice for continuous improvement; (iii) develop their
own sets of school-based performance indicators for use in school self-evaluation; (iv)
enhance the capability of coping with challenges arising from educational reforms that would
have a great impact on schools in the near future; and (v) keep developing continuously and
fostering a culture of organizational learning among all participating schools. In total, 107
kindergartens, 154 primary schools and 197 secondary schools in Hong Kong have taken part
in these projects in the period 2000-2010 (see Table 2).
Table 2. The school improvement projects organized by the School Development and
Evaluation Team (SDET) in Hong Kong
Table 2. (Continued)
Two concurrent programs were run for SDEC members, one for the primary schools and
the other for the secondary schools. Modules 1-5 were conducted in the first year, and
Modules 6-10 in the second year. The programs aimed at enabling the principals and teachers
to grow and develop and to initiate changes in their own schools. Activities in the form of
workshops were designed to facilitate principals’ and teachers’ reflection on the concepts
introduced in the courses. There were training packages for each module; that is, in total ten
different packages for the program. The training packages were designed and produced for
82 Nicholas Sun-keung Pang
self-learning purposes, with contents that were adaptable with minimal preparation or
background training.
A participatory approach was adopted for conducting the school self-evaluation
exercises. It should be stressed that practicing self-evaluation is the responsibility not only of
the SDEC members but also of the stakeholders in the schools, including school managers,
administrators, teachers, parents and students. Through the processes of self-evaluation a self-
renewal strategy has been institutionalized in the schools’ management structure, and through
participatory school self-evaluation a quality culture has been fostered in these schools.
The projects lasted for either one or two years, depending on each school’s own
preference. Four workshops within two years were offered for all teaching staff from each of
the participating schools. The first whole-school workshop in Year 1 aimed to (i) facilitate
setting up a self-evaluation framework in each of the participating schools; (ii) help teachers
incorporate self-evaluation exercises in their daily work; and (iii) enable teachers to acquire
the skills of self-evaluation and knowledge of how to use performance indicators. The second
whole-school workshop in Year 1 aimed to facilitate a participatory approach, in order to (i)
interpret the results of self-evaluation; and (ii) formulate a school development plan for the
next year. These schools were also provided with consultancy and in-service training in
interpreting self-evaluation reports and formulating school development plans in the
workshops. The first whole-school workshop in Year 2 aimed to (i) reinforce and consolidate
the school’s self-evaluation culture; (ii) facilitate the school’s development of its own ‘quality
culture’; and (iii) foster a culture of organizational learning in the school. The second whole-
school workshop in Year 2 aimed to (i) present the results of school self-evaluation for the
past two years to all staff members of the school; (ii) facilitate a participatory approach to
interpreting the longitudinal self-evaluation data for the purpose of comparison; and (iii)
facilitate a participatory approach to the formulating of a school development plan for the
next few years.
School self-evaluation (SSE) and performance indicators (PIs) are both essential to
schools’ continuous improvement. Schools should have information about the perceptions of
administrators, teachers, parents, students and the community for reference and also for
comparison among schools of similar background. It is difficult to “see” or demonstrate
continuous school improvement, if appropriate performance indicators have not been
developed for use in school self-evaluation.
A theoretical framework for the relationships among performance indicators, school
improvement and educational research is proposed in Figure 1. When we evaluate the
performance of a number of schools against a standardized set of performance indicators, we
are obviously doing one kind of educational research. The group of schools we are looking at
in our research will form a quality circle for comparison. Findings from such research will
inform us about the best schooling practices and tell us what we can learn from the experience
of failure or poor performance.
Educational Evaluation and Quality Assurance 83
PI 10
PI 9 School Improvement
PI 8
PI 7
PI 6
PI 5 Educational Research
PI 4
PI 3
PI 2
PI 1
Sch 1 Sch 2 Sch 3 Sch 4 Sch 5 Sch 6 Sch 7 Sch 8 Sch 9 Sch 10
Figure 1. The Relations among Performance Indicators, School Improvement, and Educational
Research.
On the other hand, when a school’s performance is evaluated against different sets of
indicators, the results of this evaluation will show the school’s performance in different areas
of concern, and thus it will become clear which areas are weakest and most in need of
improvement for a particular school.
In light of the above considerations, it will be clear that the development of sets of
performance indicators (PIs) for use in self-evaluation is also an important task within the
larger project of school self-improvement. Two kinds of PIs were employed in these school
improvement projects:
(1) a standardized set of PIs for comparing the performance of the participating schools,
and
(2) sets of school-based PIs (SBPIs) developed by teachers for their own use in self-
evaluation in different schools. A theoretical framework for educational evaluation is
proposed in Figure 2.
This framework allows the School Development and Evaluation Committee (SDEC)
members to consider the kinds of indicators that will suit their needs, the areas of evaluation
that they are interested in, and the sources of data that they will need in order to design a set
of school-based performance indicators for their own use in self-evaluation. The probable
areas to be evaluated within the schools include administration and management, teaching
and learning, quality of school life for teachers and students, staff development and
organizational learning, school ethos and culture, home-school-community cooperation, and
so forth.
84 Nicholas Sun-keung Pang
For the sake of comparison and forming a quality circle, all schools participating in the
projects have used these sets of PIs for evaluating school performance. These PI sets were
mainly developed by the School Development and Evaluation Team (SDET) of the Chinese
University of Hong Kong. These sets are theoretically grounded and have good validity and
reliability. Furthermore, the areas for evaluation are different in the two-year school
improvement projects. The areas to be evaluated in the participating schools for Year 1 are
summarized in Table 3, for Year 2 in Table 4.
Generally, most principals and teachers opined that school self-evaluation had not been
the normal practice in schools and it was, to them, a new and innovative concept. They
thought that since the implementation of SSE involved a paradigm shift in school
management and change in normal school life for all teachers, external support including
financial resources, staff development programs and in-house and consultancy services
should be provided. In addition, most principals and teachers would like SSE to be
implemented phase by phase, thus giving them more time and “space” to acquire the relevant
new knowledge and skills. They reported that in the present turbulent school environment,
where there were already many school reforms and innovations, the further introduction of
new concepts such as school self-evaluation would inevitably lead to resistance. There were
also many specific factors that hindered the implementation of SSE in Hong Kong schools,
and these are summarized and classified below at the system level and the school
organizational level.
2. A too-ambitious plan. There are approximately 1,000 schools, including primary and
secondary, in the Hong Kong education system. Conducting whole-school territory-
wide inspections or external school reviews within a few short years was an
unrealistic plan. Expecting most schools to be self-reliant in conducting self-
evaluations, and to become more open to change within a year or so, was again
impractical and unattainable.
3. Too many existing reforms. There have already been many new reform proposals for
the educational system in Hong Kong in the 21st century, in addition to those left
over from the last decade. Furthermore, the implementation of these reforms and
policies has mainly been attempted without good planning and co-ordination.
Schools have been suffering from the great confusion and other burdens arising from
these reforms. Any introduction of further new reforms and programs in schools
would cause, at best, indifference and perhaps resistance because of the tremendous
pressure and workloads already existing in schools.
4. School self-evaluation is a complex process. The implementation of school self-
evaluation involves a change of school culture and a change of general practices in
school life. Such changes cannot be achieved only through directives issued by the
education authorities, but need a well-planned, bottom-up strategy of initiation and
introduction which means extra resources and support from external sources.
5. Lack of resources. Effective implementation of new reforms or initiatives needs extra
resources and support. At a time of economic recession in the early years of the new
millennium in Hong Kong, the shortage of financial and human resources has created
more difficulties for the implementation of school self-evaluation throughout the
territory.
1. The plurality of categories of stakeholders and the diversity of views and opinions in
schools may have led to many excellent reforms being opposed.
2. Past failures in the implementation of educational policy caused schools to take a
passive and conservative approach to educational reforms.
3. Schools are inevitably political arenas and power struggles are common. These
created resistance to educational policy change in the schools.
4. School leaders and teachers in some schools might have been embroiled in conflicts
which caused tension, fear, and low morale among teachers, not conducive to self-
reflection and self-evaluation.
5. The communication breakdown between teachers and administrators found in some
schools meant a very weak basis for professional collaboration and commitment.
6. Most teachers and principals had little knowledge of or skill in school self-
evaluation, and misconceptions about SSE were common.
88 Nicholas Sun-keung Pang
7. There were no guidelines and other criteria for measuring success that were
commonly agreed upon for use in evaluation.
8. There were no formal, systematic, in-depth, well-designed professional training
programs to train people in the implementation of SSE.
9. Most schools were either passive or actively opposed to change, and their
organizational learning culture was not well-developed.
10. Most schools lacked a long-term vision of, or ability to plan for, school development
and improvement.
Though there were a predominance of factors that seem to have hindered the effective
implementation of self-evaluation in Hong Kong schools, there were also a few factors, at
both the system level and the school level, that facilitated self-evaluation in the school
context.
aspects of schoolwork and student performance for SSE and ESR, and also
established territory-wide norms against which school performance could be
measured and assessed. Stakeholders’ questionnaires for teachers, pupils and parents,
and other tools in the social and affective domains for students, were developed for
use in SSE, and schools were allowed to choose the relevant indicators for their own
needs and uses.
The above are then the findings of the author’s qualitative research into the effectiveness
of implementing self-evaluation in schools and the factors that both hindered and facilitated
organizational change. These findings shed light on how school administrators can lead and
manage the kind of organizational change that will really promote school development and
improvement.
CONCLUSION
Evidence-based organizational change has become a very recent trend in the school
reform and improvement movement, a movement in which school self-evaluation plays an
important role. School self-evaluation provides a framework which allows school leaders to
institutionalize a self-renewal strategy in daily managerial practices as well as to lead and
manage change (Pang, 2006c). Due to various hindrances at both the educational system level
and the school organizational level, many Hong Kong schools have not been effectively
establishing a self-evaluation framework, and many still have a relatively weak culture of
self-renewal. Nevertheless, a normative re-education strategy may be effective in helping
schools to surmount the resistance and the hindrances found at both system and
organizational levels. The school improvement projects launched in this study are a case in
point, showing that organizational change can be initiated through the putting into practice of
school self-evaluation. The practical experience gained from even a few successful schools in
the project can shed light on the problem of how to help other schools transform into learning
organizations through this implementation of SSE.
In order to facilitate change in schools, administrators need to demonstrate strong
leadership that can unify their school’s goals and make clear what kind of technology is
needed for achieving them. Such leadership is also necessary in order to promote the sharing
of values among all members and their agreement on preferences. It is necessary for the
focusing of leaders’ attention by carefully selecting targets, for controlling resources, and of
course for acting forcefully (Gamage and Pang, 2006). A good team spirit, high staff morale
and strong sense of professionalism do not only form the crucial basis for change, they also
help reduce resistance to change (Pang, 2006b). Effective leaders are those who can adopt
these approaches in order to be flexible in coping with the challenges created by an ever-
changing external environment, and in continuing to lead their organizations towards
excellence.
REFERENCES
Cuttance, P. F. (1994). Monitoring educational quality through performance indicators for
school practice. School Effectiveness and School Improvement, 5(2), 101-126.
Education and Manpower Bureau. (1998). Review of the Education Department. Hong Kong:
The Government Printer.
Education and Manpower Bureau. (2003). Enhancing school development and accountability
through school self-evaluation and external school review. EMB Circular No. 23/2003.
June 12, 2003.
Educational Evaluation and Quality Assurance 91
Chapter 7
Suan Yoong
Educational Testing Center, Universiti Pendidikan Sultan Idris, Malaysia
ABSTRACT
National examinations are important annual events in Malaysia. This paper briefly
provides a critical review of the assessment system currently used in Malaysia, outlines
the effects of over-obsession with examinations, provides an overview of the Ministry of
Education’s efforts to dilute the obsession with examinations, reviews the proposed
implementation of school-based assessment, and explains the dominant forces shaping
educational assessments in Malaysia, above all taking into account the cultural contexts.
1. INTRODUCTION
National examinations are important annual events in Asia, especially when the stakes
are high in determining entry into premier universities. Come examination time, it has
become a common sight in many East Asian cities to see large crowds of parents or relatives
waiting in the vicinity of the examination halls, giving moral and material support to their
loved ones who are taking the examination. In fact, around examination time it is not
uncommon to see parents readying their children for the last dash through the “dreaded and
stressful” hurdles of examinations. They may do this by preparing “dietary tonics and
fortifying medicines” (e.g., jinseng) and other food essences to supplement the examinee’s
diet, or by extending this “fortifying culture” in the form of supplementary tutoring by
ferrying students to and from private tuition centers. Indeed, the glaring presence and
94 Suan Yoong
vigorous growth of these cram schools, designed purely for examination preparation in Japan,
South Korea, Mainland China, Taiwan, Hong Kong, Macao, Singapore and Malaysia, is
further testimony to the presence of East Asia’s examination-oriented culture (Bray, 2003;
Cheng, 1990; 1995; Chew and Leong, 1995; Foondun, 1992; 1998; Harnisch, 1994; Hussein,
1987; Kim, 2000; Kwok, 2004; Rohlen, 1980; Sorensen, 1994; Zeng, 1999). The
internationally renowned “boulevard" of tuition centers on Nanjing Street, near the Taipei
Train Station in Taiwan, is perhaps the most vivid example of this culture (Chen, 1996).
Come D-day and the announcement of examination results, the local media rush to flash news
reports that may highlight the most successful high schools, glorify outstanding individual
performances, and present ranking tables that compare school performances.
Of course, learning cultures in other parts of the world are also to some degree
examination-oriented. However, in East Asia this phenomenon and its side-effects are more
pronounced, and thus here especially they are felt to warrant a closer examination. The
extraordinary emphasis placed by East Asian students and parents on examinations is also one
of the pre-eminent characteristics of the educational system in Malaysia. While the
examination orientation has been a force driving many students to excel in their studies, this
obsession has also created both educational and social problems. Many Malaysian educators,
practitioners, curriculum developers and policy makers have long recognized the negative
impacts and detrimental effects of over-obsession with high-stake assessments, and have
criticized such practices on the grounds that they distort the basic objectives of education.
Like a record playing over and over again, these concerns have been publicly voiced year in
and year out, especially after public examination results are splashed in the print media.
This paper briefly provides a critical review of the assessment system currently used in
Malaysia, outlines the effects of the national assessment system, provides an overview of the
advantages and disadvantages of an over-obsession with examinations, reviews the
implementation of the school-based assessment (SBA) system in Malaysia, and explains the
dominant forces shaping educational assessments in Malaysia, with a particular emphasis on
the cultural contexts.
of instruction, offer 3 years of lower secondary and 2 years of upper secondary education.
Students are streamed into academic or technical and vocational tracks at upper secondary
levels. Pre-university education (sixth form or matriculation classes) is offered in selected
secondary schools for 2 more years for students wishing to enter government-sponsored
public universities.
Students completing primary education must sit for the UPSR (Primary School
Assessment) Examination. After completing their lower secondary education, they must take
the PMR (Lower Secondary Assessment) Examination. Upon completing their upper
secondary education, students in the academic and technical tracks will sit for the SPM
(Malaysian Certificate of Education) Examination (equivalent to the British O-level
examination, now referred to as the GCSE), while students in the vocational track will sit for
the SPMV (Malaysian Certificate of Education -Vocational) Examination. Students at pre-
university levels must sit for either the STPM (Higher School Certificate) Examination
(equivalent to the British A-level examination) or for matriculation examinations accredited
by the local universities.
All public examinations at the end of the various levels, namely primary, lower
secondary, upper secondary, and pre-university/matriculation, are controlled and administered
by the Malaysian Examination Syndicate (MES) set up by the Ministry of Education.
Although these public examinations were initially designed for selection purposes, student
96 Suan Yoong
promotions through primary to upper secondary grades were made automatic when Malaysia
instituted 11-year free universal education. However, SPM results will determine whether or
not students can further their education in government pre-university or matriculation classes,
while the STPM matriculation exams remain the major selection examinations for entry into
local government universities.
Although promotion of students from grade to grade is automatic until the secondary
level, the obsession with public examinations remains very great in Malaysia. Education as a
social activity carries with it, after all, the social values specific to particular societies and
cultures. Yet while in many societies having a prestigious career requires attending an elite
university, in Malaysia (and countries like it) attending an elite university means passing
examinations at the end of every tier, and ultimately passing the university entrance
examination with a high enough grade. Furthermore, in order to pass the latter with a high
enough grade to enter one of the best schools, students must begin preparing at an early age
(Feiler, 1991). The final prize of an elite higher education being so attractive, the socio-
cultural value of assessment results over-ride their academic value. For example, though the
UPSR and PMR have since been designed as diagnostic examinations, their results are still
used primarily as a basis for channeling students into premier secondary schools or streaming
them into or onto academic or technical-vocational tracks at the upper secondary level
(Ministry of Education, 2004). Thus the SPM and STPM are high-stakes exams that
determine entry into national universities and/or local private universities and colleges with
overseas university collaboration programs. Through a policy of rewards and other forms of
recognition dating back to the colonial era, education has, in other words, become the most
significant instrument in determining one's future job (career) and consequent social status.
Students therefore need to overcome a series of examination hurdles in their pursuit of the
final prize, and this “scramble” to gain entry into premier elitist schools starts from an early
age. It is only natural that examinations gained real legitimacy in Malaysia as a reliable
means of facilitating and regulating upward social mobility.
Apart from the public secondary schools, there also exist 60 privately-managed Chinese
independent high schools where most subjects are taught in Chinese. About 20% of the
students who completed primary education in public Chinese primary schools continue their
education in Chinese independent high schools. Chinese independent high schools offer 3
years of Junior Middle and 3 years of Senior Middle education. Upon completing each tier of
high school education, students must sit for the UEC (Unified Examination Certificate for
Chinese Schools) Examination conducted by a private Examination Board that is itself set up
by a united Chinese school community organization, the Dong Jiao Zong. The Senior Middle
UEC is recognized by over 300 universities and colleges around the world. With the UEC
results, independent high school students can further their studies in overseas universities,
especially in Taiwan, China, Japan, the United States, Canada, the United Kingdom, Australia
and New Zealand, or enter local private colleges which have twinning programs with
overseas universities. Some Chinese independent high schools offer dual programs, teaching
in both Malay and Chinese, so that the students can sit for the PMR, SPM, STPM and/or
UEC.
School-Based Assessments (SBA) 97
The UEC is an equally high-stakes exam as far as the students and parents of Chinese
independent high schools are concerned, since it offers the opportunity for students to gain
entry into overseas universities in the United States, Canada, the United Kingdom, Australia
and New Zealand. Moreover, the Chinese independent high schools are also more stringent in
their automatic promotion policy, retaining students longer at each grade level if they fail to
perform adequately at that level. A significant number of premier Chinese independent high
schools also control their student intake by conducting entrance exams and only admitting the
top students from Chinese public primary schools. Thus an extraordinary emphasis is placed
on education in Malaysia and especially by Chinese students and parents, necessitating a
high-stakes examination system.
preparatory texts and past test papers is a distinct instructional style that can be seen at all
levels in most Malaysian schools.
The obsession with teaching-to-the-test has also led to the phenomenon of curriculum
"narrowing" on the part of teachers (William, 2001; quoted in Gregory and Clarke, 2003),
that is, de-emphasizing or neglecting subject areas not tested or emphasized on exams, and
precluding any supplementary curricula that encourage students’ all-round development.
Classroom instructional and assessment strategies that mirror the content and format of the
examination tend to be emphasized, thus leaving significant portions of the “non-examinable”
content untaught or quickly skimmed through. Concerned educators worry that subject areas
not tested (for instance, fine arts, physical education, music, and other “low-stakes” subjects)
are accorded less class time or set aside altogether. Malaysian schools in particular are also
devoting non-critical subject times to examination subjects, even in the early grades.
which in turn increase the children’s stress and fatigue. The demand for tutoring is spawned
by the exam-oriented learning culture, that is, by the increasingly high levels of competition
for social mobility (Bray, 2003), but also by the multi-faceted relationships between and
among supply and demand patterns in the economy and other factors leading to the
desirability of tutoring jobs (Kwok, 2004). Small wonder then that private tutoring that is
parasitic on daytime schooling has become popular and is developing into a thriving
enterprise in many countries, including more recently Malaysia. Moreover, students who
view the passing of exams as an insurmountable barrier may ultimately drop out of school
early. High-visibility examples of security breaches, rampant cheating in examinations and
test-tampering have also been reported; leakage of examination questions has been reported
as occurring sometimes even at the primary school level in Malaysia (Kong, 2007).
Then in 1996 the Ministry of Education introduced the Secondary School Progressive
Assessment or PKSM. This required students who sat for the PMR (lower secondary) and
SPM (upper secondary) examinations to submit coursework output in the form of portfolios
and projects for History, Geography, and Integrated Living Skills and Design. These were
school-based assignments or projects graded by the school teachers following standardized
marking schemes supplied by the Malaysian Examination Syndicate (MES), and the grades
were submitted to the MES to be included in the final stage of PMR or SPM grading. Similar
practices had also been instituted by the Chinese Independent Schools Examination Board for
100 Suan Yoong
the UEC examination. For the PMR examination, oral assessments are also planned for Malay
and English languages, while practical assessment is planned for science. Unfortunately,
grade inflation and cheating in the form of non-authentic or plagiarized work are so
widespread that the school-based assignments or oral practice sessions have been criticized
heavily by the public. Teachers are unhappy because these have also increased their workload
considerably. Moreover, the effect of both practices on reducing examination pressure was
minimal. However, both of them—school-based assignments and oral practice sessions—are
still in use today.
Most of the Ministry’s subsequent efforts seemed to be aimed at circumventing the
system by enhancing the teaching and learning process as a means to reduce the pressure of
final public examinations on students, and to reinforce the importance of consistent learning
throughout the school years. This initiative was promoted through PKBS (School-Based
Progressive Assessments). PKBS was a program of formal assessments in the form of
weekly, monthly or term tests, projects and regular class assignments that were set by the
respective subject teachers and conducted throughout the year, in order to evaluate pupils’
progress at all levels in the cognitive, affective and psychomotor domains. These assessments
were done through classroom observations, orals, and written works. The information from
these non- standardized assessments was used to reinforce the instructional process. However,
teachers were unhappy with this program because their workload was increased considerably.
Moreover, the results of these assessments were neither included in the public examination
grades nor needed for the selection and promotion of students to the next grade. As a result,
the teachers kept reverting to their old ways of teaching to the test.
4.2. The Planned New Role of ICT in Assessment and Smart Schools
Then in 1996, Malaysia began to plan its own Multimedia Super Corridor (MSC) as a
dynamic hub for the global ICT industry, in order to leap forward into the information and
knowledge age. The Multimedia Super Corridor is a 15km by 40km area, encompassing the
Kuala Lumpur City Centre and Kuala Lumpur International Airport, which will be developed
as a high growth centre. MSC Malaysia now hosts more than 1000 multinationals, including
foreign-owned and home-grown Malaysian companies, focused on multimedia and
communications products, solutions, services and research and development (MSC Malaysia,
2008). The Malaysian government has equipped core sectors of MSC Malaysia with high-
capacity global telecommunications and logistics networks. Six innovative flagship
applications have been developed in MSC Malaysia to accelerate its growth, among them the
Smart School flagship application (Malaysia, 1997). The Malaysian Smart School Flagship
was premised on the strong belief that information and communications technology is a key
enabler, that it can stimulate a desire to learn in everyone. Deemed one of the most forward-
looking ICT-mediated learning initiatives in the world, the Smart School Flagship was
established in order to reinvent the core teaching-learning processes by integrating ICT with
the curricula, pedagogy, and assessment practices of Malaysia’s schools. The goal would be
to help students learn more effectively and efficiently while simultaneously gaining a
competitive position in an increasingly globalized, ICT-driven market. Computerized
management software was developed to help the “smart schools” manage more efficiently
and effectively the resources and processes required to support teaching-learning functions,
School-Based Assessments (SBA) 101
thus enabling students to practice self-accessed and self-directed learning, and making the
teaching-learning process easier, more fun and more effective. The Smart School system was
piloted in 87 schools beginning in 1999, and will have been expanded to include 10,000
schools by 2010.
A key element of the Smart School system is a smart assessment system that involves a
significant departure from the “current assessment system which is characterized by a single
delivery (one session only) at a fixed time of the year to students of a given age group”
(Malaysia, 1997). Within this system, and integrated with ICT, criterion-referenced on-going
assessment would be emphasized to provide a more holistic and accurate picture of a
student’s performance. Teachers, students and parents will be able to access on-line
assessment items as well as get a more accurate picture of a student’s achievement, readiness,
progress, aptitude, learning styles, and abilities. Smart School assessment will be flexible and
learner-friendly, while the quality of the assessment information will be assured by using
multiple instruments and approaches. This will lead to a flexible assessment and certification
system that involves using a combination of classroom-based, school-based, and centralized
assessments, using authentic, alternative and performance-based assessment instruments and
approaches. The Smart School assessment system not only attests to a student’s cumulative
accomplishments but will also promote continued improvement on a lifetime basis.
According to the plan, assessment for achievement tests will be implemented first, and
assessment for readiness, progress and aptitude tests will be developed as teachers grow more
capable in the use of the skills and tools that will be made available to them. The Smart
School management system would have the capability to store assessment records that
document students’ accomplishments during their entire educational career, to be identified
by a single ID (Malaysia, 1997).
However, evaluation of the Smart Schools program revealed that although ICT holds
great promise, the reality in the Malaysia Smart schools can hardly match or reflect the state-
of-the-art in technological advances, and the application of ICT to instruction, assessment,
and testing in the Smart Schools is still in its infancy (MDC, 2004; Frost and Sullivan, 2004).
Therefore, despite these proposed innovative efforts, the smart assessment system has yet to
surface in the Malaysian school system, and even in the pilot Smart schools there seems to
have been no noticeable change in the Malaysian examination culture.
More recently, the Ministry of Education unveiled the Blueprint for Educational
Development 2006-2010 in an effort to reform the educational system and "improve the long-
term international competitive edge of Malaysia” (Ministry of Education, 2006). The
Blueprint is specifically for the Ninth Malaysia Plan: 2006-2010, which stresses that the
implementation of the SBA system will be accelerated to enable continuous evaluation of,
and support for, creativity as well as analytical problem-solving skills (Malaysia, 2006). This
Blueprint specifically notes that “presently emphasis has been placed on summative
evaluation that focuses on student achievements and not on identifying strengths and
weaknesses in teaching-learning” (Ministry of Education, 2006). According to the Blueprint,
the assessment system will be modified so as to make schooling less oriented toward
examinations; this will be done by reducing the number of public examination subjects, while
other subjects will be assessed at the school-based level. The Blueprint also charts the way
toward achieving a new examination reform, a new examination policy that will strengthen
SBA because it will be holistic, task-based and criterion-referenced, and will also promote
character-building.
Specific recommendations include changing the emphasis of public examinations: rather
than content-based they will now be skill/ability-based; they will assess ability rather than
achievement. Also, there will be more room for alternative assessments and for the quality
assurance of assessments, and national benchmark standards will be established (Ministry of
Education, 2006).
Dr. Adi nonetheless noted that this will change the whole culture of knowledge
acquisition in Malaysia, for students may begin to feel that learning can actually be fun.
However, he also said that it would take time, because the transition would require several
stages—from getting input from academicians, teachers, parents and students to choosing the
right methods of assessment in the Malaysian context (Koh, 2007a).
Public reactions to the announcement ranged from expressions of full support to those of
having reservations. The following news report is typical:
A parent, V. Vasantha, said she had reservations about teachers being objective when it
came to school-based assessment. Psychometric testing, for example, is something new and
will require training for teachers, or they could come out with "10 conclusions about one
student." And could teachers cope with assessing so many students? There may be adjustment
problems but "We do not want any bias in the grading or assessments, so the standards must
be comparable for schools to want to compete with each other" (Koh, 2007a).
[I’m assuming these news reports were translated by author from Chinese or Malay?] No!
it was actually from the Online news report - Yoong
Many had urged the Education Ministry to be cautious in its implementation of the
system, saying it would really need to depend on teachers’ performance and the guidelines for
all schools when it came to assessment (Chok, 2007; Koh, 2007b, Kong, 2007). "It is good
that it will be more school-based but there should be a guarantee that there is a standard for
all. Some others find hope in the Education Ministry’s plan to reform the assessment system
for students, which will shift the emphasis from examinations to continuous forms of SBA
such as aptitude, personality and diagnostic tests” (Koh, 2007a).
The following excerpt from a local news report records a typical reaction to the recent
proposal to scrap the UPSR and PMR examinations:
“I remember a time when students had to do well enough in Primary school, passing all
the exams before they were allowed to go to Secondary school. And then you had to pass your
SRP before you could move to Form 4. And then our Government decided to make things
easier for students, dumbing down the content of the education, and now we are even thinking
of removing Public Examinations! Do you really trust individual schools to set their own test
and examinations? I don’t. Not that I don’t trust teachers, but it’s more that I don’t trust the
schools. There will be leaks in the papers, and with no proper guidelines for marking and
104 Suan Yoong
grades, what will happen is that on paper the Government will suddenly show how “Smart”
our students are, when in reality our educational system is going down the drain. I think we
should be making our exams harder, getting some quality students out there, not people who
can get an A in English yet have really terrible grammar and not be able to hold a
conversation. Gone are the days when our education meant something, and when students
took exams seriously. Back in the day, you studied for the SRP1 because if you failed, you
would be held back a year!” (Simrit Kaur, 2007)
In Asia, public examinations have always meant an extremely high status for education.
An understanding of the cultural dimensions of East Asian societies, including Malaysia, and
of the role in them of education will prove helpful in understanding the deeply-entrenched
examination-oriented culture in Malaysia, especially when it comes to educational evaluation
or assessment—for this is seen by the community as being very important, as having high
stakes. In Malaysia, different sectors of the community—whether teachers, parents, students,
administrators, businessmen, political leaders, or even professionals—all seem to share a very
similar cultural attitude toward what education should be and how children should be
educated. The fear of losing a competitive edge in the international arena also foster concerns
about the quality of education, the latter being most often cast in terms of education
attainments, competitiveness, world class standards, and academic excellence. Thus, the
Ministry of Education has always focused on enhancing the knowledge, skills and expertise
of future generations: these will be needed to support a knowledge-based economy and
realize Malaysia’s vision of becoming a developed nation by the year 2020.
The centrality of education in East Asian societies with identifiable common goals has
fostered a virtual coalition between parents and teachers in the quest for excellence in
education. As a result, students are required to adapt themselves to the system, and indeed
adaptability is an essential crucial quality in the Asian education process. The dialectics of
work and play, happiness and pain, give and take, tolerance and gain, ease and difficulty, are
deep-rooted in these societies and could explain the rationale for adaptability behind many
educational practices. Cheng (1995) noted that Chinese parents and students prefer to adapt to
the system instead of intervening in the educational process, and thus that they do (like their
children) expect learning to continue beyond school hours. In such a socio-cultural-
educational context, the students have little room for any activities apart from formal
academic learning, and the latter can take place in or out of schools, can extend into private
homes (including those of private tutors) and after-school hours. A clear percentage of
students adapted well and achieved an excellent academic performance. Unfortunately, a
large proportion of those who could not make it were “legitimately” ignored for failing to do
so.
Given the fact of this Malaysian culture with its deeply-entrenched examination-
orientation, it is not surprising that the public generally supports public exams as the best way
to maintain objectivity, higher standards, and stricter accountability in the educational system.
The Malaysian public feels strongly that examinations, while not perfect, are probably the
1
Note: SRP examination was replaced by PMR in the 80s.
School-Based Assessments (SBA) 105
best way to hold all students accountable for learning what they are expected to learn during
their school careers. Given this entrenched attitude, it is clear that the Ministry of Education
will have a hard time reducing (let alone eradicating) the public’s obsession with
examinations as the fairest means of student evaluation.
CONCLUDING REMARKS
One must be clear about the policy goals of examination reform: know the strengths and
weaknesses of all testing instruments; recognize the political, social, and educational trade-
offs involved in using one form of assessment over another; and, most importantly, make no
demand of any testing instrument for which it was not designed. One-shot assessment, no
matter what the format, cannot address multiple needs (Goertz and Duffy, 2003). Public
examinations are designed to meet certain purposes, notably to provide reliable and
comparable scores for individuals as well as groups. They are cost-efficient in terms of
development, scoring, and administration, and sample a broad set of curriculum objectives
within a limited time frame. In general, the competencies tapped are those that lend
themselves to being assessed in these ways, while aspects of learning that cannot be assessed
under such constrained conditions are not addressed. To design new kinds of assessment
methods as alternative approaches to large-scale assessment for capturing the complexity of
cognition and learning will require breaking out of the current paradigm.
It is now a common practice to characterize student achievement in terms of multiple
aspects of proficiency rather than relying on a single score. It is also possible to chart
students' progress over time instead of simply measuring performance at a particular point in
time. Schools should use multiple paths in evaluating and look for alternative forms of valued
performance. Many of these methods are not yet widely used because they are not easily
understood or packaged in accessible ways. Furthermore, having a broad array of models
does not mean the measurement problem has been solved. There is a need to focus, in
psychometric model-building, on the critical features of achievement that we want to assess.
Information and communication technologies are helpful in removing some of the constraints
that have limited the practice of evaluation in the past, and new ICTs are expanding the types
of constructs that can be tapped through assessment. By enriching assessment situations
through the use of multimedia, interactivity, and control over the stimulus display, it is
possible to evaluate a much wider array of constructs than was previously possible. The
Smart School assessment system offers opportunities in this direction. New capabilities
afforded by technology include directly assessing problem-solving skills, making visible the
sequences of actions taken by learners in solving problems, and modeling complex reasoning
tasks (National Research Council, 2002). Technology also makes possible the simplification
of teachers’ tasks. (Chudowsky and Pellegrino, 2003)
Introducing SBA into the public examination system improves the validity of
assessments. Some important outcomes which cannot be evaluated within the context of a
written examination can readily be assessed through SBA (e.g., practical work, theatres and
workshops, research projects, portfolios of creative work, active artistic or design projects).
SBA must be designed so that evaluations become an integral part of teaching and learning
within school subjects and not an ‘add-on’. There should be no significant increase in the
106 Suan Yoong
workload of teachers. In order to carry out SBA in a rigorous and consistent manner, teachers
will need to receive appropriate training and support. They need to develop a wider range of
assessment skills which will enable them to better facilitate the implementation of SBA.
In the implementation of any educational reform, the capacity of the system to support
change in practice must be addressed. Research on SBA programs shows that clear goals and
incentives are necessary but not sufficient to motivate teachers to reach their school's student
achievement goals. Teacher motivation is also influenced by the presence of various capacity-
building conditions, most obviously the opportunity for meaningful professional
development. In addition, teacher knowledge and skills related to improved instruction are
important (Sloane and Kelly, 2003). Moreover, teacher education must be funded along with
the various complementary testing programs. In other words, schools need knowledge, human
and financial resources to successfully implement any reforms.
The Ministry of Education’s latest effort in examination-system reform will involve a
huge effort to overcome a deeply-entrenched exam-taking culture. Cultural reform is very
difficult to achieve; deeply-entrenched beliefs and practices are hard to eradicate, let alone
transform. The public needs to be more broadly educated about the possible range of testing
and test-interpretation formats, if people are ever going to shake off their traditional concept
of assessment and embrace a new assessment culture that is forward-looking,
psychometrically sound, and humanistic. Such a reform must be accompanied, then, by a
structural change in the society and culture as well as in the educational system. The highly
centralized Malaysian system of education must be further decentralized and democratized,
and schools must be appropriately empowered. This depends, once again, on a clearer
understanding of the deeply-entrenched examination-orientation of Chinese and Malaysian
culture, the tendency to see exam-taking as a kind of one-shot, high-stakes “gamble.”
An example showing that progress in this direction is possible is the PTS (Level One
Assessment) aptitude test, introduced to select grade 3 students so that they might skip grade
4 and proceed directly to grade 5. (This was in line with the new education policy which
stipulated that primary education can be completed in five to seven years.) Pupils were tested
on their abilities and potential in verbal, quantitative and thinking skills. Of course, this PTS
inevitably became a high-stakes test in the eyes of teachers and parents as well as students;
fortunately, due to the pressures it generated, it was discontinued two years later. On the other
hand, allowing almost everyone to enter university does not necessarily reduce examination
pressure, as students will still be competing to get into the best universities. Thus, irrespective
of what changes are made in the assessment mode, the newly-introduced measure always
becomes high-stakes unless there is a strong will to dilute and diversify the incentives and
reward systems that bind upward social mobility so tightly with education. We must be able
to deal with the fact that if and when School-Based Assessments are given a more significant
weight in university entrance selection, they, too, will become relatively high-stakes—but
hopefully relatively less high-stakes than winner-take-all exams.
We are still a long way, in Malaysia, from the goal of overcoming a deeply-entrenched
examination-orientation culture. There are no simple solutions. The key point to keep in mind
is that the final goal of standards and assessment systems is not the race for ever-higher
scores, but the race for students’ solid preparation for the workplace or postsecondary
education. Furthermore, even if the SBA program is increasingly implemented, accountability
remains an important problem to be reckoned with, most notably in the context of the
School-Based Assessments (SBA) 107
documented gaps in achievement between students of different economic, ethnic, and racial
backgrounds.
REFERENCES
Adi B. T. (2007). A proposal towards a more holistic education assessment system in
Malaysia. Keynote address at International Conference on Assessment (KLICA) of the
National Educational Assessment System (NEAS): Looking Ahead, organized by the
Examinations Syndicate, Ministry of Education, Malaysia. 7-9 May 2007 at Sunway
Resort Hotel and Spa, Pelaling Jaya, Malaysia.
Bernama. (2007). New Assessment System for Schools To Be Implemented In 2010.
Bernama News Agency, Malaysia. May 17, 2006.
Bray, M. (1999). The shadow education system: Private tutoring and its implications for
planners. Paris: UNESCO / IIEP.
Bray, M. (2003). Adverse effects of private supplementary tutoring. Paris: UNESCO / IIEP.
Chen, J. (1996). Cram Culture - A Fresh Look at the Buxiban. Retrieved July 15, 2007, from
http://home.sina.com/sinorama/1196/article2/index.html.
Cheng, K. M. (1995). Excellence in education: is it culture-free? Keynote address presented
at the 9th annual meeting of the Educational Research Association, November, 22–24,
Singapore.
Cheng, K. M. (1990). The culture of schooling in East Asia. In N. Entwistle (Ed.), Handbook
of education ideas and practice (pp. 163-173). London: Routledge.
Chew, S. B., and Leong, Y. C. (Eds.). (1995). Private tuition in Malaysia and Sri Lanka: A
comparative study. Kuala Lumpur: University of Malaya, Department of Social
Foundations in Education.
Chok S. L. (2007). It takes courage to cut the exam knot. New Straits Times, June 3, 2007.
Chudowsky, N. and Pellegrino, J. W. (2003) Large-scale assessments that support learning:
what will it take? Theory Into Practice, Winter, 2003.
Feiler, B. S. (1991) Learning to bow - Inside the heart of Japan. New York: Ticknor and
Fields.
Foondun, A. R. (1992). Private tuition in Mauritius: The mad race for a place in a ‘five-star’
secondary school. IIEP Research and Studies Programme, Monograph Vol. 8. Paris:
IIEP.
Foondun, A. R. (1998). Private tuition: A comparison of tutoring practices at the primary
level in Mauritius and some South East Asian countries. Bangkok: UNICEF East Asia
and Pacific Regional Office.
Frost and Sullivan. (2004). Benchmarking of the Smart School Integrated Solution. Retrieved
July 15, 2007, from http://www.msc.com.my/smartschool/downloads/benchmarking.pdf.
Goertz, M. and Duffy, M. (2003). Mapping the landscape of high-stakes testing and
accountability programs. Theory into Practice, Winter, 2003.
Gregory, K. and Clarke, M. (2003) High-stakes assessment in England and Singapore. Theory
into Practice, Winter, 2003.
Gunzenhauser, M. G. (2003). High-stakes testing and the default philosophy of education
Theory into Practice, Winter, 2003.
Harnisch, D. L. (1994). Supplemental education in Japan: Juku schooling and its implications.
Journal of Curriculum Studies, 26(3), 323-334.
108 Suan Yoong
Chapter 8
Yasuhiko Washiyama
Tokyo Gakugei University, Japan
Council for Education released its report on “Redesigning Compulsory Education for a New
Era,” which proposed the formulation of guidelines in order to enhance evaluation systems
for schools.
The “Evaluation Guidelines for Compulsory Education Schools” were released in 2006,
marking a new beginning for school evaluation in Japan.
(1) Curriculum and academic guidelines; (2) student behavioral guidance; (3) student
career and further education guidance; (4) safety management; (5) health management; (6)
special needs education; (7) organizational operations; (8) teacher training; (9) collaboration
with guardians and members of the local community; (10) facilities.
Current Issues Regarding the School Evaluation System in Japan 113
part of students. Scholastic ability is thus employed as a principal indicator in the evaluation
process, which highlights the issue of degrees of scholastic achievement.
In April of 2007, a national test of scholastic ability was conducted in Japan for the first
time in 43 years. Students in the sixth year of elementary school and the third year of junior
high school were tested in Japanese and arithmetic/mathematics, with almost all national and
other public school students participating along with 60% of those in private schools.
This test is an important tool for comprehending the true current state of scholastic
achievement in Japan and applying this knowledge in guidance and instruction activities.
However, there is also the potential for this test to be used in ways which are highly
problematic. One such case has already emerged in Adachi Ward, part of the Metropolis of
Tokyo, which has been conducting its own scholastic ability test since before the national test
was instituted. It was revealed that teachers proctoring the test at schools in Adachi Ward had
been communicating answers to students taking the test by means of signals or cues. The
worst-performing area in Tokyo, Adachi Ward authorities had been under pressure to
improve test results. They sought to achieve this goal by making all the schools in their
jurisdiction compete in the test, publishing a ranking of schools based on students’ test
results, reflecting these results in budget allocations to schools, and encouraging families to
refer to the results when choosing schools to attend. The improper conduct during the test on
the part of both teachers and students was a direct outcome of this competition-based
approach.
As this example shows, scholastic ability tests run the risk of being used to exacerbate
competition, generate hierarchies and create disparities between and among schools –
outcomes not envisaged in the original concept of school evaluation. This is an important
problem and one which calls for careful consideration; school administrators must be able to
approach the publication and use of test results in a manner which accords with the spirit of
honest and objective school evaluation.
If asked to identify three more general problems which may arise from the new school
evaluation system, many would name the following: (1) greater disparities between and
among different schools and teachers and more cases of loss of motivation and mental illness
in the teaching profession; (2) encroachment by the evaluation workload on the time teachers
usually spend with their students and on studying and preparing teaching materials; (3) the
treatment of evaluation as a routine practice, carried out as a matter of form, with content
being buried in the demands of paperwork. Measures must be instituted to forestall these
problems.
Evaluation is often thought of as the responsibility of principals, head teachers and other
school administrators, and not the concern of either mid-career or recently-qualified teachers.
However, it is important that evaluation be pursued as a school-wide initiative. Teachers must
look beyond the confines of their own classroom, adopting a broad and multi-layered
perspective which incorporates the overall organizational activities of the school and shares
ideals regarding the school’s direction. Teachers must work together with guardians and
members of the wider community in activities aimed at enhancing not just students’
scholastic ability but also their study habits and lifestyles. The crucial point here is to use the
evaluation process as a means to develop mutual trust and confidence among teachers.
Current Issues Regarding the School Evaluation System in Japan 115
Shadowing refers to a practice whereby evaluators stay close to, or shadow, those being
evaluated, facilitating close observation. Shadowing of a class group, for instance, enables
assessment not only of in-class lessons but also of pupils’ attitudes, movements and everyday
interactions during breaks and meal times. Over a two-day period, the evaluator can thereby
gain a complete picture of the teacher’s and students’ educational conduct, which will
contribute to the envisaging of current conditions and challenges for the school as a whole.
The workings of a school, after all, are the sum of its individual teachers. Viewing each
teacher’s activities in their entirety helps to bring concrete issues into focus, thus enabling
formative evaluation based on dialogue, generating relationships of trust, and making it
possible to conduct evaluation in accordance with each teacher’s individual characteristics
and distinctive qualities. Surely there is no greater force for improvement than this.
CONCLUSION
School evaluation must not be limited to mere questionnaire-based data collection. It is
crucial that external interests and third parties be included in practice-based school diagnosis
activities, the results of which are used in feedback to teachers; here opportunities must be
provided for expression of opinions and criticism of the results, to further enable their
verification. Senior high schools in Tokyo now publish evaluation details produced through
this kind of consensual process, an experiment which appears to have been received
favorably.
The great Chinese thinker Laozi said that “the highest good is like water.” Ideally, the
Plan-Do-Check-Act evaluation cycle should also be like water: fluid and supple, able to move
unhampered through any location, following the current to permeate the lowest spaces. The
flow of this highest good to schoolchildren, to teachers, to guardians and to community
116 Yasuhiko Washiyama
members is the quintessential ideal of all educational evaluation. Ultimately, in the river’s
lower reaches, it is the task of school principals to bring everything together – both problems
and achievements – and use them to put a new cycle into motion.
REFERENCES
Ministry of Education, Culture, Sports, Science and Technology-Japan. (2007). Regulative
Standard for School Establishment, Retrieved September 28, 2007, from
http://www.mext.go.jp/b_menu/houdou/18/03/06032817/003.htm.
In: Educational Evaluation in East Asia ISBN 978-1-60692-887-5
Editor: Samuel S. Peng and John Ch-Kin Lee © 2009 Nova Science Publishers, Inc.
Chapter 9
Pruet Siribanpitak
Faculty of Education, Chulalongkorn University, Thailand
ABSTRACT
The purpose of this paper is to examine the current number and qualifications of
public school teachers, and to explore the current issues in, and future prospects of,
public school teacher evaluation in Thailand. Most secondary school teachers in Thailand
are in fact public school teachers who have completed their undergraduate study; ninety-
two percent of them have completed a bachelor’s degree. Teacher evaluations have
become more important since 1999. There are three types of teacher evaluation for public
school teachers: (1) Induction Program Evaluation, (2) Annual Performance Evaluation,
and (3) Academic Status Evaluation. The importance of teacher evaluation is reflected in
the controversies regarding divergent criteria for performance-based evaluation and
professional judgment in performance assessment. The Teacher Civil Service
Commission-Ministry of Education has set up a taskforce to review the teacher
evaluation criteria. It is likely to accept three new dimensions: (1) teachers’ discipline,
morality and ethical conduct; (2) quality of teaching and student development; and (3)
results of teaching and student development.
1. INTRODUCTION
Teacher Evaluation in Thailand has gained increasing importance since 1999, a fact
which is partly attributed to the public’s demand for accountability in education. The 1999
National Education Act acknowledged the importance of educational standards and quality
assurance, and it is clear that the improvement of teacher evaluations will be beneficial to
educational quality and students as well as to the society at large.
118 Pruet Siribanpitak
2. OBJECTIVES
The purposes of this paper are to examine the quantity and qualifications of teachers and
the current issues in teacher evaluation* and to explore the future prospects of teacher
evaluation in Thailand, emphasizing the development of public school teachers.
The total number of school teachers in basic education** in Thailand gradually increased
during the academic years of 2001 to 2005 from 633,818 teachers in 2001 to 680,272 teachers
in 2005. (See Table 1).
Academic Years
Responsible Agencies
2001 2002 2003 2004 2005
Ministry of Education 588,899 568,283 532,814 597,680 614,481
Ministry of Tourism and Sports 1,128 116 1,131 983
Ministry of Culture 12,538 886 980 1,009
Ministry of Interior 42,923 44,054 57,408 72,142 61,863
- Department of Local 29,407 29,792 43,286 57,720 47,372
Administration
- BMA 13,516 14,262 14,135 14,422 14,491
Ministry of Social Development and 755 779 106
Human Security
Royal Thai Police 1,805 1,669 1,778 1,830
Total 633,818 650,548 593,648 674,490 680,272
Source: Office of the Education Council. (2006). Education in Thailand: 2005–2006. Bangkok: Office
of the Education Council. * School teachers in basic education. ** Basic education includes: Early
Childhood Education, Elementary Education, Secondary Education, and others (except higher
education).
The total number of teachers in higher education increased steadily during the academic
years 2001-2005, from 42,080 teachers in 2001 to 61,395 teachers in 2005. (See Table 2).
Most of the school teachers (in basic education) are public school teachers. In the
academic year 2004, there were 418,880 public school teachers as opposed to only 51,598
private school teachers. (See Table 3).
Teacher Evaluation in Thailand 119
Degrees Earned
Types of School Less than Bachelor’s Master’s Degree Total
Bachelor’s Degree Degree or Higher
5 91 4 100
Total
(23,827) (427,037) (19,632) (470,496)
Public Schools 4 92 4 100
(16,843) (348,358) (17,679) (418,880)
Poverty Enrollment
Low - - - -
High - - - -
Community Type
Urban 4 92 4 100
(6,260) (147,112) (6,942) (160,314)
Suburban and Rural 4 92 4 100
(9,246) (204,754) (9,455) (223,445)
Other 4 92 4 100
(1,337) (32,492) (1,292) (35,121)
Grade Level
Elementary* 8 88 4 100
(24,890) (285,992) (12,606) (323,488)
Secondary 2 65 33 100
(2,579) (80,248) (40,802) (123,629)
Private Schools 13 83 4 100
(6,984) (42,679) (1,935) (51,598)
Grade Level
Elementary 16 81 3 100
(6,253) (31,985) (1,935) (39,289)
Secondary 6 87 7 100
(731) (10,694) (884) (12,309)
Definitions: a. Elementary grade level refers to Kindergarten to Grade 6. b. Secondary grade level
refers to Grades 7-12. c. Private schools are those schools that are not primarily funded nor
administered by the local or central government. d. Community type refers to the whole
community in the case of small provinces with only one school district. * Includes those teachers
who teach lower secondary school classes in elementary schools.
Source: Siribanpitak, P. and Boonyananta, S. (2006). The Qualifications of the Teaching Force in
Thailand. In: A Comparative Study of Teacher Preparation and Qualifications in Six Nations. R.
M. Ingersoll (ed.), p. 89. Philadelphia: Consortium for Policy Research in Education.
120 Pruet Siribanpitak
Concerning the qualifications of school teachers, most Thai elementary and secondary
school teachers completed their undergraduate study. Ninety-one percent completed a
bachelor’s degree, 5% held less than a bachelor’s degree and 4% completed a master’s degree
or higher. There are pronounced differences between public schools and private schools. The
data also shows that students in private schools had less access to qualified teachers. For
example, 92% of all public school teachers completed a bachelor’s degree as opposed to 83%
of private school teachers, and 5% held less than a bachelor’s degree as opposed to 13% of
private school teachers. Moreover, about 33% of all public secondary school teachers held a
master’s degree or higher, as compared with 7% of private secondary-school teachers. (See
Table 3).
The data in Table 3 also reveals that community type (urban or suburban) does not have a
significant impact on teacher qualification levels. In other words, there is virtually no
difference between suburban and urban schools regarding the percentage of teachers with
degrees. For example, about 92% of all teachers in urban, suburban and rural areas held a
bachelor’s degree and about 4% held a master’s degree or higher. This means that the
qualifications of teachers in the 175 educational service areas (school districts) in Thailand
are roughly equivalent.
On the other hand, teachers at different grade levels in public schools do tend,
predictably, to have different qualifications. For example, about 33% of all secondary school
teachers have completed a master’s degree or higher, as opposed to only 4% in the case of
elementary school teachers.
When individuals join the teaching profession, they are placed on a salary scale that
corresponds to their qualifications and prior experience and the civil service salary structure.
The base level monthly salary for teachers with a four-year bachelor’s degree is 7,360 Baht
(190 US$), and 9,320 Baht (233 US$) for teachers with a master’s degree. By comparison,
other professionals such as doctors and engineers typically earn about 30,000 – 50,000 Baht
per month (750-1,250 US$).
The Teachers and Educational Personnel Act of B.E. 2547 (2004) proposed a new salary
structure, using a six-scale teacher classification framework based on academic status. The
classifications, moving upward, are as follows: assistant teachers, teachers, experienced
teachers, highly experienced teachers, expert teachers, specialized teachers. The entry
requirement for assistant teachers is either a five-year or a 4+1 year of post-secondary pre-
service teacher training. An assistant teacher’s salary is 8,360 Baht per month (209 US$), and
a teacher’s salary is 11,470 Baht per month (287 US$). Teachers who get promoted to a
higher level (higher academic status) based on their performance receive an extra monthly
allowance. The scale for the additional monthly allowances is 3,500 Baht (88 US$), 5,600
Baht (140 US$), 9,900 Baht (248 US$), and 13,000 Baht (325 US$) for experienced teachers,
highly experienced teachers, expert teachers, and specialized teachers respectively (See Table
4).
Teacher Evaluation in Thailand 121
With this new framework the teacher will earn a higher monthly salary than with the
previous framework (See Table 5). The aim is not only to attract talented high school students
into pre-service teacher training programs but also to attract talented graduates into the
teaching profession.
According to the Teachers and Educational Personnel Act of B.E.2547 (2004), all new
recruited public school teachers shall be appointed as “assistant teachers” for two years.
During this two-year period, the new teachers must participate in the induction program and
122 Pruet Siribanpitak
must be evaluated by the school committee in order to get promoted to the position of
“teacher.”
According to the Administrative Procedures for Civil Servants Act of B.E. 2538 (1995),
all public school teachers’ performances shall be evaluated every six months to determine
whether a salary increase is justified. There are two categories of salary increase:
According to the Teachers and Educational Personnel Act of B.E. 2547 (2004), all public
school teachers shall be promoted to a higher academic status (rank). There are four titles to
mark academic status: (1) Experienced Teacher, (2) Highly Experienced Teacher, (3) Expert
Teacher, and (4) Specialized Teacher. Teachers who get promoted to a higher academic rank
will get a higher level of monthly extra allowance. (See Table 4)
According to the National Education Act of B.E. 2542 (1999), the Teacher Civil Service
Commission, Ministry of Education (TCSC- MOE) is the central organization responsible for
administering teachers.
The personnel administration will be based on the principle of decentralization to
educational service areas and schools.
A central concern in creating criteria for performance-based teacher assessment,
however, is one that has plagued teacher assessment for decades and has often been described
as intractable: How to define “good teaching” in a way that is appropriate for assessment
purposes and yet remains faithful to the art and science of teaching as it is experienced by
knowledgeable practitioners?
The Teachers’ Civil Service Commission set up the following criteria for performance-
based teacher evaluation. This will include three areas:
Teacher Evaluation in Thailand 123
The Teacher Civil Service Commission also set the minimum scores for promotion to the
next level as follows:
124 Pruet Siribanpitak
(1)
Additional criteria for highly experienced teachers.
(2)
Additional criteria for expert teachers.
(3)
Additional criteria for specialized teachers.
Currently, there is some controversy regarding the assessment criteria for the academic
paper – which is required for highly experienced teachers, expert teachers, and specialized
teachers. The arguments are important because they reflect upon both the quality of teaching
and students’ achievement.
The proposed minimum scores for promotion using the new criteria are as follows:
Minimum Scores
Total Highly
Criteria Experienced Expert Specialized
Score Experienced
Teacher Teacher Teacher
Teacher
1. Discipline, morality, and pass/ pass/ pass/ pass/
-
ethical conduct fail fail fail fail
2. Quality of teaching and
100 60% 70% 80% 90%
student development
3. Results of teaching and
100 60% 70% 80% 90%
student development
126 Pruet Siribanpitak
REFERENCES
Shinkfield, A. J. and Stufflebeam, D. (1995). Teacher evaluation: Guide to effective practice.
London: Kluwer Academic Publishers.
Siribanpitak, P. and Boonyananta, S. (2006). The qualifications of the teaching force in
Thailand. In Ingersoll, R. M. (Ed.) A comparative study of teacher preparation and
qualifications in six nations, Philadelphia: Consortium for Policy Research in Education
(CPRE).
Office of Basic Education Commission. (2004). Statistical data of basic education in
Thailand. Bangkok: Donmuang Printing.
Teacher Evaluation in Thailand 127
Chapter 10
Kwok-cheung Cheung
Faculty of Education, University of Macau, China
ABSTRACT
Macao is a small city situated on the western side of Pearl River estuary. It has a
population of less than half a million and a history of more than four hundred years.
Macao’s educational system is characterized by the fact that the majority of its schools
are privately-operated. Many of these private schools still receive a large annual subsidy
from the government. This is because the policy in Macao is 15 years of free obligatory
education and this is applicable not only to the few public schools but also to private
schools joining the school-nets. In Macao, all private schools are given full administrative
autonomy and complete freedom of instruction, and there is as yet no territory-wide
public examination from kindergarten through high school. While this might seem like a
good thing, it also means that the public has no way to objectively evaluate the
educational achievement of schools, even though the schools may be rather different and
aim to serve different communities. Thus a system of standardized evaluation with
clearly spelled out benchmarks is necessary so that the government can justify to the
public the amount of money it is spending on education.
Therefore, it is heartening to learn that PISA (the Program for International Student
Assessment), a flagship research program conducted by the Educational Testing and
Assessment Research Centre (ETARC) of the University of Macau (UM), now serves as
a catalyst for capacity building of essential research personnel for conducting educational
evaluation in Macao. In particular, the assessment of mathematical, scientific, and
reading literacy in PISA 2003, 2006 and 2009 studies covers pertinent school subject
areas. This allows the public to see for the first time, from both intra-system and inter-
system educational perspectives, how 15-year-old students perform after completing their
obligatory basic education in the Macao school system (see Lo and Cheung, 2005;
Cheung and Sit, 2007).
1. INTRODUCTION
There is no public examination from kindergarten up to the end of the senior year in high
school (i.e. grades 1-12) in Macao. Basically student assessment is in the hands of the
individual schools which conduct school-based evaluations regularly for their students.
Because of this tradition, for the past four hundred years Macao citizens have not had
objective assessment information with which to judge the quality of their schools. Since the
majority of Macao schools are private, they have been legally entitled to enjoy complete
freedom in educational administration and autonomy in classroom instruction. It is a known
weakness of the Macao educational system that the government has no mechanism to arrange
the students sit for publicly administered examinations.
This is in spite of the fact that many of the private schools which join the school-nets are
heavily financed by the government. Given this awkward situation, the introduction of PISA
(the Program for International Student Assessment) was considered by the government to be a
way to begin capacity building of essential research personnel for conducting territory-wide
educational evaluation in Macao. Early in 2002, the DSEJ (i.e. Macao Education and Youth
Affairs Bureau) asked the Hong Kong-PISA Centre to help it launch the PISA 2003
Mathematical Literacy Study. Because this study was mainly executed by Hong Kong’s
educational researchers, many of the test instruments were simply borrowed from and much
of the research infrastructure modeled after Hong Kong. As a result of this special
arrangement, capacity building of essential research personnel was only developed on a very
limited scale at this early stage of study.
The PISA 2006 Scientific Literacy Study provided Macao its first chance to learn how to
conduct a full-scale international assessment project in accordance with established technical
standards set up by PISA. This chapter seeks to document seven benefits Macao gains, with
regard to capacity building of essential research personnel for conducting territory-wide
educational evaluation, by participating in the various cycles or stages of PISA studies. The
seven benefits are:
1. Forge links and promote dialogues within and across the various operational levels
and administrative infrastructure of PISA projects;
2. Acquire a first-hand understanding of perspectives that will enable a literacy
assessment framework for test units design;
3. Gain a first-hand understanding of conceptual frameworks that will make possible a
roadmap for questionnaire design;
4. Acquire cutting-edge methodologies for handling weighting and sampling errors in
complex sampling design;
5. Acquire cutting-edge methodologies for handling scaling and measurement errors in
complex test booklet design;
6. Learn well-planned strategies for reporting PISA results to Macao schools and the
public;
7. Be involved in pioneering research and development in the assessment of electronic
reading literacy.
PISA as a Catalyst for Capacity Building for Educational Evaluation in Macao 131
In the late 1990s, the Organization for Economic Co-operation and Development
(OECD) commissioned the Program for International Student Assessment (PISA) as an
ongoing international project undertaking periodic comparative study of the mathematical,
scientific and reading literacy of 15-year-old students amongst participating
countries/economies (OECD, 1999). Most countries/economies took advantage of this
opportunity to use PISA as a catalyst for capacity building of essential research personnel in
the area of educational evaluation, and Macao is no exception (OECD, 2001). The
infrastructure for PISA projects can be envisaged as operating at a number of levels, as
described concisely below (See Figure 1, a diagram on the MyPISA website which is being
used for the internal management of PISA projects).
Experts
z Mathematics Expert Group
z Reading Expert Group
z Science Expert Group
z Questionnaire Expert Group
z Technical Expert Group
z Others as needed
First, at the highest level, all PISA projects are overseen by the secretariat of the OECD
in Paris. Second, senior educational administrators, delegates and observers from
participating countries/economies form the PISA Governing Board (PGB) – a committee
132 Kwok-cheung Cheung
within the OECD which sets policy priorities through collaborative processes in order to steer
projects.
Third, for each PISA project a National Project Manager (NPM) is appointed to
coordinate all activities of the National Center (NC) of participating countries/economies,
which is responsible for executing the PISA projects. At each NC, the NPM needs to work
closely with the PGB, and forms working teams that take care of project development,
implementation, and the reporting of results to participating schools and the public. Fourth,
there are Consortiums, i.e. groups of international research and academic organizations,
which develop test instruments and questionnaires and are responsible for implementing
PISA projects in accordance with pre-determined time-lines. Fifth, Technical Advisory
Groups and Expert Groups are formed to supervise and to solve problems raised by
Consortiums during the planning and implementation stage of PISA projects. Evidently,
researchers working at the Macau-PISA Center, the NC within ETARC of UM responsible
for undertaking all PISA projects in Macao, are benefited immensely by participating in such
a large-scale international student assessment program. This is because they are able to forge
links and engage in interactive dialogues with superb, internationally-oriented administrative
infrastructures.
The PISA assessments take a literacy perspective. They focus on the extent to which
students can use the knowledge and skills they have gained and practiced at school when
confronted with situations and challenges for which that knowledge may be relevant (OECD,
2005a). That is, PISA assesses the extent to which students can use their scientific knowledge
and skills to understand, interpret and resolve various kinds of scientific situations and
challenges. Additionally, it judges the extent to which students can use their mathematical
knowledge and skills to solve various kinds of mathematical challenges and problems, and
the extent to which they can use their reading skills to understand and interpret various kinds
of written material that they are likely to meet in their daily lives (OECD, 1999 and 2006).
When PISA initiates an international assessment of the three literacy domains, an assessment
framework with sample items will be drafted and released for consultation by the
participating countries/economies. This framework describes the breadth of each of the three
literacy domains, and the sub-areas and skills associated with each (see Figure 2 for the PISA
2006 science assessment framework, see also OECD, 2007d, p.35).
In the assessment framework shown in Figure 2, context refers to engagement with
science in a variety of life situations which are not limited to life in school (i.e. personal,
social and global situations within application areas such as health, natural resources,
environment, hazards, and frontiers of science and technology). These applications require
students to demonstrate three main kinds of scientific competencies (i.e. identify scientific
issues, explain phenomena scientifically, and use scientific evidence). How students do so is
influenced by their knowledge (i.e. knowledge of science and knowledge about science) and
attitudes (i.e. interest in science, support for scientific enquiry, and responsibility toward
resources and environments). Knowledge of science is further classified into four categories:
PISA as a Catalyst for Capacity Building for Educational Evaluation in Macao 133
(1) physical systems, (2) living systems, (3) earth and space systems, and (4) technology
systems, whereas knowledge about science into two categories: (1) scientific enquiry and (2)
scientific explanations. In the test booklets, there is a balance of items assessing the various
components of the scientific assessment framework.
Context
Life situations that
Require people to Competencies
involve science and
technology
Identify scientific issues
Explain phenomena
scientifically
Use scientific evidence
Attitudes
b) How they respond to science issues (interest,
support for scientific enquiry, responsibility).
The PISA survey collects information from students on various aspects of their home,
family and school background, as well as information from schools about various aspects of
their organization and educational programs. This information is collected to facilitate a
detailed study of intra- and international factors associated with varying levels of reading,
mathematical and scientific literacy among the 15-year-old students of participating
countries/economies. Topics of thematic reports need to be determined before the
Questionnaire Expert Group (QEG) knows what questions will need to be asked in the
various questionnaires (i.e. student questionnaire, school questionnaire, parent questionnaire,
134 Kwok-cheung Cheung
Questionnaire Rubrics
1. Student Questionnaire (11 rubrics)
• Basic student characteristics and school career
• Parental education and occupation, immigration status, home resources and family wealth
• Reading engagement outside school as measured in PISA 2000
• Learning strategies and learning preferences
• Generally supportive teaching and learning environments
• Classroom and school climate
• Monitoring and feedback
• Opportunity to learn reading literacy at school
• Extended measures of reading engagement
• Extended measures of school conditions that are expected to support reading engagement
• Extended measures of meta-cognition of learning strategies
Questionnaire Rubrics
4. Educational Career Questionnaire (5 rubrics)
• Student’s past, current and anticipated educational career
• Change of study program
• Setting and ability grouping arrangement
• Participation in additional instruction and courses relevant to reading literacy
• School choice
For example, in the PISA 2009 reading literacy study, a roadmap entitled “From
Conceptual Framework to Questionnaire Content” was drafted by Professor Jaap Scheerens
in liaison with the Reading Expert Group (REG). In this roadmap, four themes are proposed
which cover: (1) system level indicators, cost effectiveness, and equity; (2) school
effectiveness, educational leadership, equity and cost effectiveness; (3) establishing an
effective learning environment in reading literacy at the instructional setting level; and (4)
establishing an effective learning environment in reading literacy at the individual student
level (OECD, 2007b). The Questionnaire Framework for PISA 2009 Reading Literacy Study
sketches data sources, and includes a rationale of the themes chosen, as well as the
antecedents, amenable processes and outcomes entailed.
Table 1 summarizes the rubrics of the five questionnaires used in the PISA 2009 Reading
Literacy Study. Questions are drafted and included in the respective questionnaires, and thus
intended for inclusion in the pilot study and then in the main study.
Sit and Cheung (2008) was the first attempt to report PISA 2006 Study results to
participating schools and the public. Reporting PISA findings and assessment results is a job
needed meticulous attention. This is because schools will decline to join PISA assessment if
schools find that the findings and assessment results released are not useful to them for
purposes of school improvement. Often, below-average positions in the international
assessment league table appear as headlines in local newspapers. Officials in charge of
education are then very nervous if they cannot provide satisfactory explanations for the
adverse results reported, and this is particularly sensitive when there is a decline in literacy
performance from the level reported in the last international assessment administered three
years ago.
For those interested, the official report of the Macao-PISA 2006 Study comprises six
bilingual chapters: (1) Conduct of enquiry; (2) A profile of literacy performance for 15-year-
olds in Macao; (3) Quality science education indicators; (4) Literacy-ESCS relationships for
Macao schools; (5) International comparison of literacy performance; and (6) Thematic
reports and follow-up studies (Cheung and Sit, 2007). During the school briefing session, the
following figure was found to be of considerable interest to the principals because it showed
the slightly non-linear relationship between between-school scientific literacy performance
and the economic, social and cultural status of the home of the students enrolled (see Figure
3; see also Cheung and Sit, 2007, p.57-58). Based on the plot shown in Figure 3, Macao
school administrators on the one hand are able to judge whether their schools are high-
performing, average-performing or low-performing, and on the other hand to determine the
index of the economic, social and cultural status of students’ homes, so as to have a better
understanding of the achievement and social class of the students they serve.
In a nutshell, if we classify the scientific literacy performance of the 43 schools
participating in the PISA 2006 Study into high-performing (13 schools), average-performing
(17 schools), and low-performing (13 schools), the following three observations may be
made:
3. The eleven lowest-performing schools in scientific literacy (excluding the two low-
performing International/Portuguese schools mentioned before) are generally
associated with homes with a lower ESCS (i.e. -1.80 < ESCS < -0.90). However, two
low-performing schools (schools number 15 and 43) have students coming from
homes with a higher ESCS (i.e. ESCS = -0.44 and -0.01).
600 8
27 3
30
20
24
550 6 23
19 33 38 9
42 14
17 21
12
26 5 7
16 22 37 4
500 40
32
39
35
Sc i en ti f i c L i ter ac y
34
13
29
1 18
31
450 2
43
10
25
41 15
400 28 36
350
11
300
-2 -1.5 -1 -0.5 0 0.5 1
Index of Economic, Social and Cultural Status (ESCS)
At the turn of the century, electronic reading literacy emerged as a new form of literacy
which needed to be inculcated during the basic education stage of schooling (OECD, 2007a).
Nowadays, quite a lot of students (especially in the more developed countries) are reading
electronically. Therefore, electronic reading literacy has been emerging as an increasingly
PISA as a Catalyst for Capacity Building for Educational Evaluation in Macao 139
important skill in the educational agenda. However, assessment of this type of competence
demands the appropriate delineation of a sound assessment framework, one which deals with
both print-based and electronic reading texts simultaneously. In response to the digital age,
the PISA 2009 Reading Literacy Study seeks to achieve this goal (OECD, 2007c).
Figure 4. Relationship between fixed reading texts and competences of assessment tasks presented in
the print medium.
Figure 5. Relationship between dynamic reading texts and competences of assessment tasks delivered
in the electronic médium.
140 Kwok-cheung Cheung
Compared with print-based reading literacy, there are three aspects of electronic reading
literacy needed to be addressed if a sound assessment framework is to be developed (see
Figure 4 and 5 for the comparison). The three aspects are: (1) students who read online access
and retrieve information via search engines and embedded hyperlinks; (2) students construct
and integrate texts in accordance with access structures they opt to follow, i.e. by clicking
hyperlinks and searching for usable information in multiple texts until they judge that the
information collected has been adequately and meaningfully synthesized; (3) students need to
reflect on and evaluate critically authorship, as well as the accuracy and credibility of
information retrieved. Again, capacity building of essential research personnel is urgently
needed in this innovative area of reading literacy assessment.
CONCLUSION
Supported by renowned scholars and institutions, PISA studies are able to furnish reliable
and valid data in order that the government may hold schools accountable for the subsidies
rendered to them as part of the Macao’s obligatory universal basic education program.
Because there is no tradition of public examination in Macao, capacity building of essential
research personnel for the conduct of territory-wide educational evaluation is waiting to be
developed. There are five benchmarks at which the outcomes of PISA studies in Macao can
be targeted, and of which capacity building of essential research personnel can be conceived
as successful: (1) Monitoring of quality of Macao’s educational system in the inculcation of
reading, mathematical and scientific literacy for 15-year-olds across successive cycles of
PISA studies; (2) Monitoring of equity in the distribution of learning opportunities as
measured by the impact of the economic, social and cultural status of the home on school’s
literacy performance; (3) Maintenance of consistency of literacy standards amongst different
types of schools, especially between the public and private schools; (4) Revelation and
explanation of differential gender difference patterns in the three domains of literacy, i.e.
reading, mathematical and scientific literacy; (5) Revelation and explanation of teaching and
learning environments facilitative of the life-long learning for 15-year-old students.
In sum, the seven benefits delineated and gained by Macau-PISA Centre via active
participation in PISA 2003, 2006 and 2009 studies are pertinent to answer whether PISA can
serve as a catalyst for capacity building of essential research personnel for educational
evaluation in Macao.
REFERENCES
Cheung, K. C., and Sit, P. S. (2007). Macao PISA 2006 Study Report Number One:
Assessment of Scientific, Mathematical and Reading Literacy Performance of 15-year-
old Students from an International Comparison Perspective. Macao: Educational Testing
and Assessment Research Centre, University of Macau.
Sit, P. S., and Cheung, K. C. (2008). Key Findings of the Macao-China PISA 2006 Scientific
Literacy Study. Invited paper presented at the International Conference on “PISA 2006:
PISA as a Catalyst for Capacity Building for Educational Evaluation in Macao 141
Chinese University of Hong Kong. He has led the development of large-scale school
improvement project including the Accelerated Schools for Quality Education Project
and the Partnership for Improvement of Learning and Teaching Project. He is the Chief
Editor of Journal of Basic Education and Regional Editor (Asia) of Educational Research
and Evaluation (Routledge). He is author and co-author of four academic books,
including Curriculum, Teaching and School Reforms: Educational Development in a
New Century (The Chinese University Press, in Chinese, 326pp.) and (with Kerry
Kennedy) The Changing Role of Schools in Asian Societies: Schools for the Knowledge
Society (Routledge, 228pp.). He is also the co-editor (with Leslie Lo and Allan Walker)
of Partnership and Change for School Development (The Chinese University Press), (co-
edited with Roger Cheng and Leslie Lo) Values Education for Citizens in the New
Century, (co-edited with Michael Williams), School Improvement: International
perspectives (Nova Science Publisher, Inc.) and (co-edited with Michael Williams)
Environmental and Geographical Education for Sustainability: Cultural contexts (Nova
Science Publisher, Inc.).
Mu-Lin Lu is the Political Deputy Minister, for the Ministry of Education, Taiwan. As an
experienced administrative officer in charge of Taiwan’s higher education, international
cultural and educational relations, since 2000, Dr. Lu has implemented important policies
for the facilitation of innovated educational policies in Taiwan. Prior to his current
Ministry position, Dr. Lu was the Administrative Deputy Minister, 2000-2006; the
Secretary General for the Ministry of Interior, 1997-1998; and the Director of the
Department of Cultural and Educational Affairs for the Mainland Affairs Council, 1992-
1997. Dr. Lu received his doctorate degree from the University of Texas at Austin, in the
United States. His research interests include: supervision of instruction, teacher
development and education policy.
Nicholas Sun-Keung Pang is a Professor of Department of Educational Administration and
Policy in the Chinese University of Hong Kong. His research interests include
educational administration, management and leadership, as well as school effectiveness
and improvement. He also serves as Executive for the Hong Kong Centre for the
Development of Educational Leadership, Leader of the School Development and
Evaluation Team (SDET) and Programme Director of the Preparation for Principalship
Course and School Managers Course commissioned by the Hong Kong Education
Bureau. He was elected the Chairman of Hong Kong Educational Research Association
for the year 1999-2001 and served as member of the Advisory Committee on Quality
Assurance Inspectorate and the Working Group on the Study of Effectiveness of Public-
sector Secondary Schools of the then Education and Manpower Bureau. He is an
appointed Sector/Subject Specialist of the Hong Kong Council for Accreditation of
Academic and Vocational Qualification for 2008-2011. He has been publishing widely in
local and international journals. He is Co-author of an academic book entitled Leadership
and Management in Education: Developing Essential Skills and Competencies (322
pages) published in 2003 and the Editor of an academic book entitled Globalization:
Educational Research, Change and Reforms, published in 2006. He was invited to speak
at major international academic meetings in Hong Kong, mainland China, Macau,
Taiwan, Vietnam, India, Australia, Slovenia, and the United States.
Samuel S. Peng is a Chair Professor of the National Taiwan Normal University (NTNU).
Before joining NTNU, he was a Chair Professor of the National Tsing Hua University
Contributors 145
(2003-2006). From 1981 to 2003, he was a program director at the National Center for
Education Statistics, U.S. Department of Education, responsible for several national data
systems. He was a reviewer for professional journals such as American Educational
Research Journal, Educational Researcher, and Educational Evaluation and Policy
Analysis. In addition, he was the Editor in Chief of the New Waves -- Educational
Research and Development, the quarterly journal of the Chinese American Educational
Research and Development Association from 1996 to 2006. Currently he is the Editor in
Chief of the Journal of Higher Education in Taiwan.
Pruet Siribanpitak is an Associate Professor and Dean of Faculty of Education at
Chulalongkorn University Bangkok, Thailand. He got his bachelor’s degree in
Elementary Education, master’s degree in Educational Administration from
Chulalongkorn University, and doctoral degree in International and Development
Education concentration in Economics of Education from University of Pittsburgh,
Pennsylvania. His experiences including a faculty member of Department of Education
Research, Department of Educational Administration and Department of Foundations of
Education, head of Department of Foundations of Education, Associate Dean for
Planning and Development, Dean of Faculty of Education at Chulalongkorn University,
president of Thailand Deans of Education Council, expert member of the Committee on
Education of the House of Representatives and the Research Project on Teacher
Education, the Office of Education Council, Ministry of Education. He also has a variety
of international experiences including resource person and consultant in Vietnam, Nepal,
Lao, and India. He used to serve as a member of advisory group for the UNESCO on
“Reorienting Teacher Education for a Sustainable Future” at Thessaloniki, Greece.
Ramanathan Subramaniam holds the concurrent positions of Associate Dean (Graduate
Academic Programs) in the Graduate Programmes and Research Office, and Associate
Professor in the Natural Sciences and Science Education Academic Group at the National
Institute of Education in Nanyang Technological University. He has diverse research
interests: physical chemistry, physics education, chemistry education, primary science
education, ICT in education, informal science education, teacher education, science
communication, and creativity. He has over 50 research papers to his credit in
international (refereed) journals published in the USA and Europe. He has also over 25
refereed chapters to his credit in books of international publishers in the USA and
Europe. Four of his books were recently published by major international publishers. His
biography appears in the Dictionary of International Biography (Cambridge, UK) and the
Marquis Who’s Who in Asia (New Jersey, USA).
Suan Yoong,B.Sc.,M.Ed. ( Malaya ); M.S.,Ph.D.(Indiana), is a Fellow/Professor of Education
at the Sultan Idris University of Education, Malaysia and Deputy Chair of the
International Organisation for Science Education, IOSTE. His bachelor’s degree was in
Chemistry and Mathematics, but he specialized in science and technology education and
psychometric at the masters and doctoral levels. He was the Chairman of Higher Degree
Program and a Professor of Education at the School of Educational Studies, University of
Science Malaysia. He was also the Project Coordinator of the Malaysian General Science
Project at the Malaysian Curriculum Development Center. While at Indiana University,
he won the Outstanding Graduate Student award and was inducted into Phi Delta Kappa,
the National Honor Society in Education. He was a recipient of the UNESCO Fellowship
Award for the 1992-93 Participation Program for research in Psychometric and Testing -
146 Contributors
Asian, 2, 3, 9, 10, 31, 35, 62, 69, 70, 93, 94, 97, 104,
A 107, 108, 144
Asian countries, 94
academic, 1, 4, 8, 13, 14, 15, 17, 19, 20, 23, 25, 28,
assessment, 6, 7, 13, 24, 26, 29, 30, 33, 34, 38, 45,
30, 33, 35, 36, 39, 46, 47, 49, 50, 51, 52, 54, 55,
46, 47, 48, 49, 52, 53, 55, 58, 59, 62, 63, 64, 66,
58, 61, 68, 78, 88, 95, 96, 102, 104, 112, 118,
67, 68, 88, 93, 94, 96, 98, 99, 100, 101, 102, 103,
120, 121, 122, 124, 125, 132, 144
104, 105, 106, 107, 108, 109, 115, 117, 122, 124,
academic performance, 55, 58, 61, 68, 104, 121
129, 130, 132, 133, 136, 137, 139, 140
accountability, 2, 6, 10, 24, 31, 33, 62, 68, 69, 70,
assessment procedures, 24
71, 88, 90, 104, 106, 107, 117
assessment tools, 6, 38, 55, 58
accreditation, 15, 16, 17, 21, 42, 43, 44
atmosphere, 7, 112
accuracy, 4, 140
attitudes, 77, 115, 133
achievement, 8, 10, 23, 30, 34, 35, 37, 46, 48, 49, 52,
auditing, 18
54, 58, 59, 61, 63, 69, 97, 101, 102, 105, 106,
Australia, 14, 15, 27, 91, 96, 97, 108, 144
123, 124, 129, 137
authority, 23, 25, 26, 29, 32, 51, 88, 111, 113
achievement test, 10, 101
autonomy, 4, 8, 86, 129, 130
action research, 33, 64
awareness, 2
adaptability, 104
adjustment, 14, 47, 103 B
administration, 4, 7, 13, 14, 19, 20, 32, 48, 49, 57,
77, 83, 105, 111, 112, 122, 124, 130, 144 barrier, 99
administrative, 3, 8, 13, 15, 19, 20, 35, 48, 129, 130, barriers, 58
131, 132, 144 BCA, 61, 66, 68
administrative efficiency, 20 beginning teachers, 62
administrators, 2, 6, 10, 47, 48, 49, 53, 79, 82, 85, behavior, 123
87, 90, 104, 114, 115, 132, 137 Beijing, 65, 70
advocacy, 62 beliefs, 106
after-school, 104 benchmark, 24, 26, 102
age, 9, 13, 25, 43, 94, 96, 100, 101, 135, 139 benchmarking, 24, 29, 46, 107
agent, 24, 77 benchmarks, 8, 34, 129, 140
alternative, 9, 101, 102, 105 benefits, 68, 85, 130, 140
antecedents, 135 Best Practice, 28
anxiety, 67, 98 bias, 103
application, 18, 100, 101, 108, 113, 132 bilingual, 137
aptitude, 56, 57, 101, 103, 106 birth, 62
arithmetic, 114 birth rate, 62
artistic, 105 blocks, 136
Asia, 1, 2, 3, 6, 9, 17, 24, 62, 93, 94, 104, 107, 108, BMA, 118
144, 145 Boston, 34
148 Index
bottom-up, 65, 75, 81, 87 compliance, 49, 53, 58, 111, 113
breaches, 99 components, 47, 66, 89, 133
breakdown, 87 comprehension, 38, 39
Britain, 14, 15, 18 compulsory education, 7, 111, 113
broadband, 33 concentration, 145
buildings, 32 conceptualization, 136
business management, 17 concrete, 85, 115
confidence, 78, 85, 114
C confusion, 87
consciousness, 47, 115
caliber, 13
consensus, 4, 6, 20, 85
calibration, 58, 59, 136
conservation, 52
Canada, 96, 97
constraints, 28, 31, 105
candidates, 1, 4, 35, 39
construction, 57
capacity building, 8, 62, 129, 130, 131, 140
consumers, 9
cast, 104
control, 7, 24, 33, 35, 37, 94, 97, 105, 111
catalyst, 8, 129, 131, 140
cost effectiveness, 135
Catholic, 17, 79
costs, 19
Catholic Church, 79
Council of Ministers, 46, 48
certificate, 25, 37, 101
coupling, 67
certification, 29, 101
course design, 16
changing environment, 89
covering, 81
cheating, 66, 99, 100
creativity, 31, 34, 98, 102, 145
childhood, 80
credibility, 140
children, 93, 94, 97, 98, 104
credit, 145
China, 3, 9, 91, 96, 97, 144
critical thinking, 53, 55, 56, 57
citizens, 21, 47, 55, 130
criticism, 115
citizenship, 47
cues, 114
civil service, 120
cultural heritage, 68
classes, 30, 37, 95, 96, 98, 119
culture, 2, 5, 7, 31, 34, 65, 66, 78, 79, 81, 82, 83, 85,
classification, 52, 120
87, 89, 90, 93, 97, 98, 101, 103, 104, 106, 107
classroom, 38, 54, 58, 67, 70, 100, 101, 114, 123,
curriculum, 1, 6, 7, 38, 45, 47, 48, 50, 52, 53, 56, 57,
130
58, 62, 67, 94, 97, 98, 99, 105, 113, 123
classroom practice, 54, 58
curriculum change, 58
classrooms, 9, 55, 58, 70
cycles, 79, 130, 140
clients, 51
cyclical process, 76
closure, 62, 66, 68
clusters, 136 D
cognition, 84, 105, 134
coherence, 6, 61, 68 data analysis, 78
collaboration, 5, 8, 75, 87, 96, 112, 113 data collection, 78, 115
colleges, 4, 9, 10, 13, 15, 17, 19, 20, 23, 25, 27, 28, database, 14, 18
39, 40, 41, 46, 96 dating, 96
communication, 10, 24, 43, 59, 78, 87, 105, 134, 145 decentralization, 6, 7, 111, 122
communication technologies, 105 decision-making process, 62
communities, 46, 62, 69, 97, 129 decisions, 8, 28, 50, 126
community, 1, 6, 8, 10, 47, 50, 53, 61, 68, 76, 79, 82, definition, 9
83, 85, 96, 97, 104, 111, 112, 113, 114, 115, 116, delivery, 101
119, 120 delusion, 70
community cooperation, 83 democratization, 43
competence, 2, 42, 43, 44, 85, 123, 139 Department of Education, 35, 75, 143, 144, 145, 146
competition, 5, 43, 62, 99, 114 deregulation, 7, 111
competitiveness, 2, 13, 24, 27, 30, 55, 68, 104 developed countries, 34, 138
complexity, 105, 124 deviation, 40
Index 149
funding, 4, 13, 14, 15, 19 higher education, 4, 5, 13, 14, 15, 16, 17, 18, 19, 22,
27, 35, 37, 39, 40, 43, 46, 47, 48, 49, 53, 55, 56,
G 57, 96, 118, 144
high-level, 20
gauge, 32, 136
high-stakes testing programs, 109
gender, 8, 36, 140
hiring, 1
gender differences, 8
holistic, 28, 101, 102, 107
General Certificate of Education (GCE), 25, 28, 64
Hong Kong, 1, 3, 6, 61, 62, 63, 64, 65, 67, 68, 69,
generation, 33, 47
70, 71, 75, 76, 79, 84, 85, 86, 87, 88, 90, 91, 94,
global competition, 1
107, 108, 130, 144
global economy, 31
House, 70, 145
globalization, 2, 4, 22, 24, 43, 55, 91, 144
hub, 100
goal setting, 78
human, 5, 23, 43, 55, 87, 89, 106
goals, 6, 9, 10, 15, 16, 21, 38, 54, 76, 77, 89, 90, 104,
human resource development, 55
105, 106, 112, 113, 126
human resources, 5, 23, 43, 87
goal-setting, 112
going to school, 7 I
governance, 2, 88
government, 1, 2, 4, 6, 10, 13, 14, 16, 18, 22, 23, 38, identification, 36, 78
42, 52, 55, 61, 66, 68, 86, 94, 96, 100, 111, 119, IEA, 2, 24, 26, 30, 34
129, 130, 140 immigrants, 66
government policy, 55 immigration, 134
GPA, 45, 56, 57, 58 implementation, 2, 3, 4, 5, 7, 8, 10, 17, 18, 19, 25,
grades, 10, 25, 53, 58, 62, 65, 96, 98, 99, 100, 104, 38, 43, 44, 49, 50, 67, 69, 76, 77, 86, 87, 88, 89,
130 90, 93, 94, 102, 103, 106, 126, 132
grading, 57, 99, 103 incentives, 67, 106
grants, 19 inclusion, 23, 135
Great Britain, 94 increased workload, 68
Greece, 145 independence, 15
group activities, 37 India, 144, 145
grouping, 134 Indian, 94
groups, 13, 45, 48, 56, 57, 58, 94, 97, 105, 113, 132 Indiana, 145
growth, 94, 98, 100, 126 indication, 32
guessing, 5, 32 indicators, 4, 16, 20, 45, 47, 48, 56, 57, 58, 76, 79,
guidance, 89, 111, 112, 114 82, 83, 85, 89, 113, 135, 137
guidelines, 2, 4, 7, 43, 46, 49, 66, 88, 103, 112 indices, 30
guiding principles, 55 indigenous, 5, 94
individual characteristics, 115
H individual differences, 1
Indonesia, 29
handling, 13, 130, 135
induction, 8, 121
hands, 130
industry, 19, 100
happiness, 104
infancy, 101
harm, 46
inferences, 136
harmony, 46
inflation, 57, 58, 100
hazards, 132
information and communication technology (ICT),
health, 46, 47, 112, 132
24, 33, 43, 100, 101, 134, 145
heart, 107
information technology, 50, 62
high school, 2, 5, 9, 13, 35, 36, 37, 38, 39, 40, 42, 94,
infrastructure, 130, 131
96, 97, 111, 115, 121, 129, 130
inherited, 94
high school grades, 9
initiation, 87
high scores, 10
innovation, 2, 5, 10, 31, 34, 48, 50, 55, 79, 125, 143
inspection, 6, 7, 62, 64, 65, 75, 76, 79, 111, 113
inspections, 63, 76, 87
Index 151
institutionalization, 18 land, 24
institutions, 1, 4, 13, 14, 15, 19, 21, 30, 46, 47, 48, language, 2, 27, 56, 62, 64, 66, 67
49, 50, 52, 53, 55, 57, 140 language proficiency, 62, 64
institutions of higher education, 13 large-scale, 63, 99, 105, 132, 144
instruction, 8, 10, 25, 52, 63, 69, 94, 101, 106, 114, later life, 97
129, 130, 134, 144 leadership, 6, 10, 48, 49, 65, 66, 77, 89, 90, 134, 135,
instructional activities, 50 144
instruments, 24, 33, 101, 103, 105, 130, 132 leakage, 99
integrity, 46, 51 leaks, 103, 108
intellect, 46 learners, 46, 47, 50, 51, 55, 58, 105, 108, 123, 125
intentions, 68 learning, 2, 4, 5, 7, 8, 9, 10, 16, 21, 23, 24, 26, 27,
interactions, 115 28, 29, 30, 31, 32, 33, 34, 38, 46, 47, 51, 54, 55,
interactivity, 105 58, 62, 63, 64, 65, 66, 67, 68, 69, 70, 78, 79, 82,
interdependence, 88 83, 85, 88, 89, 90, 94, 97, 98, 99, 100, 101, 103,
international standards, 23, 33 104, 105, 107, 108, 113, 115, 123, 125, 126, 134,
internationalization, 17, 22 135, 140
internet, 33, 66, 134 learning culture, 88, 94, 99
interview, 40, 41 learning difficulties, 97
interviews, 28, 69, 78 learning environment, 134, 135, 140
investigative, 58 learning outcomes, 54, 58
investment, 43 learning process, 100, 101
island, 23 learning society, 46, 47
Italy, 141 learning styles, 101
item response theory, 58 licenses, 49
lifelong learning, 43
J lifestyles, 114
lifetime, 101
jackknife, 135
linear, 58, 137
Japan, 3, 7, 9, 24, 34, 94, 96, 97, 107, 111, 112, 113,
linkage, 6, 48, 61, 67, 68
114, 116
links, 32, 33, 130, 132
Japanese, 109, 114
listening, 38
jobs, 99
literacy, 8, 71, 129, 130, 131, 132, 133, 134, 135,
judge, 130, 137, 140
136, 137, 138, 140
judges, 132
local community, 111, 112
judgment, 43, 55, 65, 117, 124, 126
local government, 96
junior high, 114
location, 115, 136
junior high school, 114
logistics, 30, 100
jurisdiction, 114
London, 10, 69, 91, 107, 109, 126
K Los Angeles, 143
M
K-12, 4
kindergarten, 8, 129, 130
Macao, 3, 8, 9, 94, 129, 130, 131, 132, 137, 140,
King, 52
141, 143
knowledge acquisition, 103
Macau, 8, 129, 132, 140, 141, 143, 144
knowledge economy, 13
machinery, 25
knowledge-based economy, 55, 104
Madison, 146
Korea, 5, 35, 36, 38, 41, 43, 97, 108, 143
Mainland China, 94
Korean, 38, 39, 40, 43, 143
maintenance, 140
L Malaysia, 3, 7, 93, 94, 96, 97, 99, 100, 101, 102,
103, 104, 106, 107, 108, 109, 145
L1, 28 management, 1, 3, 4, 6, 7, 17, 18, 35, 42, 47, 50, 54,
labor, 24 55, 62, 65, 67, 69, 76, 78, 79, 81, 82, 83, 86, 88,
lack of opportunities, 9 89, 91, 100, 101, 112, 113, 123, 131, 144
152 Index
training, 6, 15, 20, 33, 53, 55, 58, 68, 78, 81, 82, 85, validity, 27, 28, 29, 84, 105, 124, 126
88, 102, 103, 106, 112, 120, 121 values, 3, 6, 28, 62, 89, 90, 96, 136
training programs, 88 variability, 26, 27
transfer, 36 variables, 38, 39
transformation, 55, 85, 89, 108 variation, 135
transition, 7, 103 Victoria, 91, 108
transparency, 66 Vietnam, 144, 145
transparent, 20, 29 visible, 105
trees, 32 vision, 22, 51, 77, 88, 104
trial, 42, 113 vocational, 13, 25, 46, 95, 96, 99
triangulation, 85 vocational tracks, 95, 96, 99
trust, 42, 78, 103, 114, 115 voice, 32
TSA, 6, 61, 64, 66, 67, 68 volunteer work, 37
tuition, 93, 107
turbulent, 86, 87, 89
W
tutoring, 43, 93, 98, 107, 108
weakness, 7, 130
twinning, 96
wealth, 134
well-being, 28
U Western countries, 24
wind, 32
uncertainty, 68 winning, 17
undergraduate, 117, 120 Wisconsin, 146
UNESCO, 10, 69, 107, 145 wisdom, 10, 53, 58
UNICEF, 107 workers, 24
uniform, 113 workforce, 13, 24
unions, 5, 43 working hours, 31
United Kingdom (UK), 27, 96, 97, 145 workload, 7, 33, 61, 66, 100, 105, 114
United States, 9, 14, 17, 96, 97, 144 workplace, 106
universities, 4, 9, 13, 14, 15, 16, 17, 18, 19, 20, 21, World War, 111
22, 27, 37, 39, 40, 46, 55, 56, 63, 65, 93, 95, 96, World War I, 111
97, 106 World War II, 111
university collaboration, 96 writing, 6, 40, 41, 45, 53, 54, 55, 58, 65
university education, 13, 25, 26, 95
Y
V
yield, 30
validation, 32