Adams, R. J. & Khoo, S. T. (1996). Quest: The interactive test analysis system
version 2.1. Victoria: The Australian Council for Educational Research

Aiken, L. R. (1980). Three coefficients for analyzing the reliability and validity of
ratings. Educational and Psychological Measurement,
40, 955-967.

Allen & Yen. (1979). Introduction to measurement theory. California: Wadsworth


Ali, M., Abd-Talib, C., Ibrahim, N. H., Surif, J., & Abdullah, A. H. (2016). The
importance of monitoring skills in physics problem solving. European
Journal of Education Studies.
Amin, B. D., & Mahmud, A. (2016). The Development of Physics Learning
Instrument Based on Hypermedia and Its Influence on the Student
Problem Solving Skill. Journal of Education and Practice, 7(6), 22-28.

Ashcroft, K., & Palacio, D. (1996). Researching into Assessment and Evaluation
in Colleges and Universities. London: British Library Cataloguing in
Publishing Data.

Azwar, S. (2010). Penyusunan skala psikologi. Yogyakarta: Pustaka Pelajar.

Baker, F. B. (2001). The Basics of Item Response Theory (2nd ed.).
College Park, MD: ERIC Clearinghouse on Assessment and Evaluation.

Bonham, S. W., Deardorff, D. L., & Beichner, R. J. (2003). Comparison of
student performance using web and paper‐based homework in college‐level
physics. Journal of Research in Science Teaching, 40(10), 1050-

Bransford, J.D. & Stein, B.S. (1984) The ideal problem solver: A guide for
improving thinking, learning, and creativity. Freeman, New York.

Brookhart, S. M., & Nitko, A. J. (2011). Educational Assessment of Students
(6th ed.). Boston: Pearson Education, Inc.
Bunderson, C. V., Inouye, D. K., & Olsen, J. B. (1988). The Four Generations Of
Computerized Educational Measurement. ETS Research Report Series,

Butler, A. C., & Roediger, H. L. (2008). Feedback enhances the positive effects
and reduces the negative effects of multiple-choice testing. Memory &
Cognition, 36(3), 604-616.
Cangelosi, C. (1995). Merancang Tes untuk menilai Prestasi Siswa. Bandung:
Penerbit ITB.
Chen, C.C., Lin, H-S, Tse, T.H (2002). Developing a two-tier diagnostic
instrument to assess high school students’ understanding-the formation of
images by a plane mirror. Proceedings of the National Science Council.
12(3), 106-121.

Chen, T. Y., Kuo, F. C., Merkel, R. G., & Tse, T. H. (2010). Adaptive random
testing: The art of test case diversity. Journal of Systems and Software,
83(1), 60-66.
Cohen, R., & Swerdlik, M. (2010). Psychological Testing and Assessment: An
Introduction to Tests and Measurement (6th ed.). New York: McGraw-Hill
Company.
DeMars, C. (2010). Item Response Theory. New York: Oxford University Press.

Dancy, J. (2000). Practical reality. Clarendon Press. Online electronic version.

Dharma, S. (2009). Arah kebijakan peningkatan mutu pendidikan tenaga
kependidikan. Paper presented at the national seminar marking the
44th dies natalis of Universitas Negeri Semarang.

Dinica, M., Dinescu, L., Miron, C., & Barna, E. S. (2014). Formative values of
problem solving training in physics. Romanian Reports in Physics, 66(4),
Dongre, N. (2015). Development of Problem Solving Skill of Adolescents
through Teaching of Science for Sustainable Development. IOSR
Journal of Humanities and Social Science, 20(7), Ver. III,
46-52.
Economides & Roupas. (2007). Overexposure and Underexposure of Item In
Computerized Adaptive Testing. Measurement and Research
Department Report.

Edward, G., Lyon. (2013). “Assessment as Discourse”: A Pre-Service Physics
Teacher’s Evolving Capacity to Support an Equitable Pedagogy. Education
Sciences. Published 19 July 2013. ISSN 2227-7102.
DOI: 10.3390/educsci3030279
Ellison, G. J. (2009). Increasing Problem Solving Skills in Fifth Grade Advanced
Mathematics Students. Journal of Curriculum and Instruction, 3 (1),

Embretson, S. E., & Reise, S. P. (2013). Item response theory. Psychology Press.
Eraikhuemen, A., & Ogumogu, A.E. (2014). An Assessment of Secondary School
Physics Teachers Conceptual Understanding of Force and Motion In Edo
South Senatorial District. Academic Research International, 5, 253 –

Evangelos, T., Elissavet, G., Anastasio, A., Economides. (2008). The design and
evaluation of a computerized adaptive test on mobile devices. Computers
& Education DOI:10.1016/j.compedu.2008.12.005
Faize & Dahar. (2012). Engaging Secondary Grade Physics Students in
Developing Test Items. Journal of Turkish Science Education, 9, 3-11.
Feher, E., & Rice, K. (1988). Shadows and anti-images: Children’s conceptions of
light and vision. II. Science Education, 72(5), 637-649.

Gierl, M. J., & Lai, H. (2013). Instructional topics in educational measurement
(ITEMS) module: Using automated processes to generate test items.
Educational Measurement: Issues and Practice, 32(3), 36-50.

Guzel, H. (2011). Factors Affecting the Computer Usage of Physics Teachers
Working at Private Training Centers. The Turkish Online Journal of
Educational Technology, 10(2).

Gronlund, N. E. (1976). Measurement and evaluation in teaching. New York:
Macmillan Publishing Co.
Hadi, S. (2013). Pengembangan Computerized Adaptive Test Berbasis Web.
Yogyakarta: Aswaja Pressindo. Herbert (2009) & Chen et al (2010)
Hambleton, R. K., & Swaminathan, H. (1985). Item response theory. Boston, MA:
Kluwer Inc.
Hambleton, R. K., Swaminathan, H., & Rogers, H. J. (1991). Fundamentals of item
response theory. Newbury Park, CA: Sage Publications Inc.
Haryanto, H. (2010). Pengembangan Computerized Adaptive Testing (CAT)
dengan Algoritma Logika Fuzzy. Jurnal Penelitian dan Evaluasi
Pendidikan, 15(1), 47-70.
Helaiya, S. (2010). Development and Implementation of Life Skills Programme
for Student Teachers. Vadodara: Maharaja Sayaji Rao University of

Heller, P., Keith, R., & Anderson, S. (1992). Teaching problem solving through
cooperative grouping. Part 1: Group versus individual problem
solving. American Journal of Physics, 60(7), 627-636.
Heller, K., & Heller, P. (2010). Cooperative Problem Solving in Physics A User’s
Herbert, F., Janine, B., Otto B. Walter, & Rose, M. (2009). Evaluation of a
computer-adaptive test for the assessment of depression (D-CAT) in
clinical application. International Journal of Methods in Psychiatric
Research, 18(1), 23–36. DOI:
Ibrahim, B., & Rebello, N. S. (2012). Representational Task Formats And
Problem Solving Strategies In Kinematics And Work. Physical Review
Special Topics-Physics Education Research, 8(1), 010126.

Istiyono, E., Mardapi, D., & Suparno. (2014). Pengembangan Tes Kemampuan
Berfikir Tingkat Tinggi Fisika (PhyTHOTS) peserta didik SMA. Jurnal
Penelitian dan Evaluasi Pendidikan.

Jozwiak, J. (2004). Teaching problem-solving skills to adults. Journal of Adult
Education, 33(1), 19.

Kamaruddin, K., & Haryanto, H. (2014). Pengembangan Sistem Penilaian Hasil
Belajar Mata Pelajaran Menganalisis Rangkaian Listrik Berbasis
Computerized Adaptive Testing. Jurnal Pendidikan Vokasi, 4(1).

Kerlinger, F. (1998). Asas-asas Penelitian Behavioral, translated by Landung R.
Simatupang. Yogyakarta: Gadjah Mada University Press.

Kerry, E., & David, G. (2011). An-other Look at Assessment: Assessment in
Learning. New Zealand Journal of Teachers’ Work, 8(1), 11-
Keeves, J. P., & Alagumalai. (1999). New Approach to Measurement. In
G. N. Masters & J. P. Keeves (Eds.), Advances in Measurement in
Educational Research and Assessment (pp. 23-42). Amsterdam:
Pergamon, an imprint of Elsevier Science.
Kemdikbud. (2012). Dokumen Kurikulum 2013. Jakarta: Kemdikbud

Kirkley, J. (2003). Principles for teaching problem solving. USA: PLATO
Learning Inc.
Kowsalya, D. N., Lakshmi, H. V., & Suresh, K. P. (2012). Development and
Validation of a Scale to assess Emotional Maturity in Mild Intellectually
Disabled Children. Language in India, 12(6).

Kubiszyn, T., & Borich, G. D. (2013). Educational testing and measurement:
Classroom application and practice. Hoboken, NJ: Wiley.
Larson, J. W., & Madsen, H. S. (1985). Computerized Adaptive Language
Testing: Moving beyond Computer-Assisted Testing. CALICO Journal,
2(3), 32-36.

Lee, Y.J. (2015). Analyzing Log Files to Predict Students’ Problem Solving
Performance in a Computer-Based Physics Tutor. Educational
Technology & Society, 18 (2), 225–236.
Linacre, J. M. (2000). Computer-adaptive testing: A methodology whose time has
come. In S. Chae, U. Kang, E. Jeon & J. M. Linacre (Eds.),
Development of computerized middle school achievement test (in
Korean). Seoul, South Korea: Komesa Press.
Linden, W. J. (2005). Linear models for optimal test design. New York: Springer
Science+Business Media, Inc.
Luppicini, R. (2007). Review of computer mediated communication research for
education. Instructional Science, 35, 141-185. Springer.
DOI 10.1007/s11251-006-9001-6.
Madu, B. C., & Orji, E. (2015). Effects of Cognitive Conflict Instructional
Strategy on Students’ Conceptual Change in Temperature and Heat.
SAGE Open, 5(3), 1-15.
Mardapi, D. (2008). Teknik Penyusunan Instrumen Tes dan Nontes. Jogjakarta:
Mitra Cendikia Press.

____________ (2012). Pengukuran Penilaian & Evaluasi Pendidikan. Yogyakarta:
Nuha Medika.

Masters, G. N. (2010). The Partial Credit Model. In M. L. Nering & R. Ostini
(Eds.), Handbook of Item Response Theory Models. New York:

McBride, D. L., Zollman, D., & Rebello, N. S. (2010). Method for Analyzing
Students’ Utilization of Prior Physics Learning in New Contexts.
Physical Review Special Topics - Physics Education Research.
http://dx.doi.org/10.1103/PhysRevSTPER.6.020101

Mcdonald, F. (2010). An Investigation of Students’ Problem Solving Skills in an
Introductory Physics Class (Doctoral dissertation, University of Central
Florida, Orlando, Florida).
Meyer, J. P., & Zhu, S. (2013). Fair and equitable measurement of student
learning in MOOCs: An introduction to item response theory, scale
linking, and score equating. Research & Practice in Assessment, 8.
Miller, P.W. (2008). Measurement and teaching. Indiana: Patrick W. Miller &
Mourtos, N. J., Okamoto, D., & Rhee, J. (2004). Defining, Teaching, and Assessing
Problem Solving Skills. Proceedings of the UICEE Annual Conference on
Engineering Education. Mumbai, India, 9-13 February.
Mundilarto. (2010). Penilaian hasil belajar Fisik. Pusat Pengembangan
Instruksional Sains (P2IS) Jurdik Fisika FMIPA UNY.

Mumtaz, S. (2006). Factors affecting teachers’ use of information and
communications technology: a review of the literature. Journal of
Information Technology for Teacher Education.

Neo, M., Neo, K. T., Tan, H. Y. J., Kwok, W. J., & Lai, C. H. (2012). Problem-
solving in a Multimedia Learning Environment: The MILE@ HOME
Project. Procedia-Social and Behavioral Sciences, 64, 26-33.
Oriondo, L., & Antonio, E. M. (1998). Evaluating educational outcomes. Manila:
Rex Printing Company, Inc.
Presseisen, B. Z. (1985). Thinking skills: Meanings and models. In A. L.
Costa (Ed.), Developing minds: A resource book for teaching
(pp. 43-48). Alexandria, Virginia: ASCD.
Polya, G. (1957) How to solve it: A new aspect of mathematical method.
Doubleday, Garden City.

Piaget, J. (2005). The psychology of intelligence [electronic version]. Taylor
& Francis e-Library.

Purwanto, N. (2010). Prinsip-Prinsip dan Teknik Evaluasi Pengajaran. Bandung:
PT. Remaja Rosdakarya.
Patrick, G., Shelley, G., & Leanne, C. (2007). Standards-referenced assessment for
vocational education and training in schools. Australian Journal of
Education, 51(1), 19–38.
Retnawati, H. (2014). Teori Respon Butir dan Penerapannya. Yogyakarta: Nuha

Rezaie, M., & Golshan, M. (2015). Computer Adaptive Test (CAT): Advantages and
Limitations. International Journal of Educational Investigations, 2(5),
128-137. ISSN: 2410-3446

Reynolds, C. R., & Wilson, V. (2010). Measurement and assessment in education
(2nd ed.). Boston: Pearson.
Rusman. (2009). Teknologi Informasi Dan Komunikasi Dalam Pembelajaran Pedoman
Bagi Guru. Bandung: Universitas Pendidikan Indonesia

Sadiman, A. (2010). Media Pendidikan: Pengertian, Pengembangan dan
Pemanfaatannya. Jakarta: Rajawali Pers.
Santoso, A. (2010). Pengembangan Computerized Adaptive Testing untuk
Mengukur Hasil Belajar Mahasiswa Universitas Terbuka. Jurnal
Penelitian dan Evaluasi Pendidikan, 14(1).

Schittek, M., Mattheos, N., Lyon, H. C., & Attström, R. (2001). Computer
assisted learning. A review. European Journal of Dental Education, 5(3),
93-100. Lord, 1986: 23; Marzieh, 2015

Schuetzenhoefer, C., & Hopf, M. (2012). Testing Students’ Conceptual
Understanding in Geometrical Optics with a Two Tier Instrument.
Proceedings of The World Conference on Physics Education. Pegem
Akademi: Ankara.
Scott, T. F., & Schumayer, D. (2015). Students’ proficiency scores within
multitrait item response theory. Physical Review Special Topics-Physics
Education Research, 11(2), 020134.
Slameto. (2010). Belajar dan faktor-faktor yang mempengaruhi. Jakarta: Rineka

Stiggins, R. J., & Chappuis, J. (2012). An introduction to student-involved
assessment for learning (6th ed.). Boston: Pearson.
Subali, B. (2009). Pengembangan tes pengukur keterampilan proses sains pola
divergen mata pelajaran biologi SMA. Prosiding Seminar Nasional
Biologi, Lingkungan dan Pembelajaran, Jurdik Biologi, FMIPA UNY, 4
July 2009, 581-593.
Sudjana, N. (2009). Penilaian Proses Hasil Belajar Mengajar. Bandung: Remaja
Sukardjo. (2012). Buku pegangan kuliah evaluasi pembelajaran IPA untuk
mahasiswa S2 program studi sains. (Unpublished.) Yogyakarta:
Universitas Negeri Yogyakarta.
Sumintono & Warsito. W. (2015). Aplikasi pemodelan Rasch pada Pendekatan
Assessment Pendidikan. Cimahi: Trim Komunikata.
Sumardyono. (2009). Pengertian Dasar Problem Solving. Yogyakarta: P4TK

Taras, M. (2009). Summative assessment: the missing link for formative
assessment. Journal of Further and Higher Education, 33(1), 57-69.
ISSN: 0309-877X. DOI: 10.1080/03098770802638671
Thompson, N., & Weiss, D. J. (2011). A Framework for the Development of
Computerized Adaptive Tests. Practical Assessment, Research &
Evaluation, 16(1). ISSN 1531-7714.
Thorpe, L., McMillan, E., Sigmon, T., Owings, R., Dawson, R., & Bouman, P.
(2007). Latent trait modeling with the Common Beliefs Survey: Using
item response theory to evaluate an irrational beliefs inventory. Journal
of Rational-Emotive & Cognitive-Behavior Therapy, 25, 175-189. doi:
Thiagarajan, S., Semmel, D. S., & Semmel, M. I. (1974). Instructional Development for
Training Teachers of Exceptional Children.

TIMSS. (2016). International Mathematics Findings from IEA’s. Retrieved from
Tillery, B. W., Enger, E. D., & Ross, F. D. (2007). Integrated Science (3rd ed.). New
York: McGraw Hill Higher Education.
Tolga, G. (2010). The General Assessment of Problem Solving Processes and
Metacognition in Physics Education. Eurasian Journal of Physics and
Chemistry Education.
Tonidandel, S., Quiñones, M. A., & Adams, A. A. (2002). Computer-adaptive
testing: The impact of test characteristics on perceived performance and
test takers' reactions. Journal of Applied Psychology, 87(2), 320.

Trianto. (2009). Mendesain Model Pembelajaran Inovatif –Progresif. Jakarta:

Trilling, B., & Fadel, C. (2009). 21st century skills: Learning for life in our times.
John Wiley & Sons. (Wismatch et al, 2014.
Uno, H. B. (2007). Model pembelajaran menciptakan proses belajar mengajar
yang kreatif dan efektif. Jakarta: Bumi Aksara.

Wainer, H. (1990). Computer adaptive testing: A primer. Hillsdale, NJ: Lawrence
Erlbaum Associates, Publisher.

Walsh, L. N., Howard, R. G., & Bowe, B. (2007). Phenomenographic study of
students’ problem solving approaches in physics. Physical Review
Special Topics - Physics Education Research.

Weir, J.J. (1974) Problem solving is everybody’s problem. The Science Teacher,
4, 16-18
Weiss, D. J. (2004). Computerized adaptive testing for effective and efficient
measurement in counseling and education. Measurement and Evaluation
in Counseling and Development, 37, 70.

Wendler, C. L., & Walker, M. E. (2006). Practical issues in designing and
maintaining multiple test forms for large-scale programs. Handbook of
test development, 445-467.

Winarno. (2012). Testing Results Computerized Adaptive Testing (CAT)
Software Islamic Religion Education Subject In Making Rekam Medik
Pembelajaran (RMP) to diagnose student's ability At School. Conference
Proceeding. Annual International Conference on Islamic Studies (AICIS
Winkel & Westgaard, R. H. (1996). Editorial: A model for solving work related
musculoskeletal problems in a profitable way. Applied
Ergonomics, 27(2), 71-77.
Wu, M., & Adams, R. (2007). Applying the Rasch model to psychosocial
measurement: A practical approach. Melbourne: Educational
Measurement Solutions
Yerushalmi, E., & Magen, E. (2006). Same old problem, new name? Alerting
students to the nature of the problem-solving process. Physics Education,
41, 161-167
Yu, H., Yu, L., & Yi, C. (2008). A Practical Computer Adaptive Testing Model for
Small-Scale Scenarios. Educational Technology & Society, 11(3),
