
Assessment of Learning 1: Reliability

Reliability

It is the user who must take responsibility for determining whether or not scores are
sufficiently trustworthy to justify anticipated uses and interpretations (AERA et al., 1999).


• Reliability refers to the consistency or stability of scores.

• If a test taker were administered the test at a different time, would she
receive the same score?

• If the test contained a different selection of items, would the test taker get
the same score?

Types of Reliability

Test-Retest Reliability

• Test-Retest Reliability: reflects the temporal stability of a measure.

• Most applicable with tests that are administered more than once and/or with
constructs that are viewed as stable over time.

• It is important to consider the length of the interval between the two test
administrations.

• Subject to “carry-over effects” (e.g., practice or memory); most appropriate for
tests that are not appreciably affected by them. A short computational sketch follows.
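
To make this concrete, here is a minimal Python sketch (not from the original notes; the scores are hypothetical). The test-retest coefficient is simply the Pearson correlation between scores from the two sessions:

```python
import numpy as np

# Hypothetical scores for the same eight examinees at two sessions.
session_1 = np.array([85, 78, 92, 65, 74, 88, 70, 95])
session_2 = np.array([83, 80, 90, 68, 72, 85, 73, 94])

# The test-retest coefficient is the Pearson correlation between sessions.
r_tt = np.corrcoef(session_1, session_2)[0, 1]
print(f"Test-retest reliability: {r_tt:.2f}")
```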

Alternate Form Reliability

• Involves the administration of two “parallel” forms.

• Two approaches: simultaneous and delayed administration.

• Delayed Administration: reflects error due to both temporal instability (time
sampling) and content sampling.

• Simultaneous Administration: reflects error due to content sampling only (a
computational sketch follows the limitations below).


➢ Some limitations:

• Reduces, but may not eliminate, carry-over effects.

• Relatively few tests have alternate forms.
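
The alternate-form coefficient is computed the same way as the test-retest coefficient; a minimal sketch with hypothetical data (not from the original notes):

```python
import numpy as np

# Hypothetical scores for the same group on two parallel forms.
# For delayed administration, form_b would simply be scored at a later session.
form_a = np.array([85, 78, 92, 65, 74, 88, 70, 95])
form_b = np.array([82, 81, 89, 67, 70, 86, 74, 93])

r_af = np.corrcoef(form_a, form_b)[0, 1]
print(f"Alternate-form reliability: {r_af:.2f}")
```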

Internal Consistency Reliability

• Estimates of reliability that are based on the relationship between items within a
test and are derived from a single administration of a test.

Split-Half Reliability

• Divide the test into two equivalent halves, usually an odd/even split.

• Reflects error due to content sampling.

• Since this only provides an estimate of the reliability of a half-test, we use
the Spearman-Brown formula to estimate the reliability of the complete test, as
sketched below.
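
A minimal sketch of the full procedure, assuming a small hypothetical 0/1 item-response matrix (not from the original notes): correlate the odd and even half-test scores, then apply the Spearman-Brown correction r_full = 2 * r_half / (1 + r_half).

```python
import numpy as np

# Hypothetical 0/1 item responses: rows are examinees, columns are items.
responses = np.array([
    [1, 1, 0, 1, 1, 0, 1, 1],
    [1, 0, 1, 1, 0, 1, 0, 1],
    [0, 1, 1, 0, 1, 1, 1, 0],
    [1, 1, 1, 1, 1, 1, 0, 1],
    [0, 0, 1, 0, 0, 1, 1, 0],
])

odd_half = responses[:, 0::2].sum(axis=1)   # items 1, 3, 5, 7
even_half = responses[:, 1::2].sum(axis=1)  # items 2, 4, 6, 8

r_half = np.corrcoef(odd_half, even_half)[0, 1]  # reliability of a half-test
r_full = 2 * r_half / (1 + r_half)               # Spearman-Brown correction
print(f"Half-test r = {r_half:.2f}, corrected full-test r = {r_full:.2f}")
```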

Coefficient Alpha & Kuder-Richardson (KR-20)

• Reflects error due to content sampling.

• Also sensitive to the heterogeneity of the test content (i.e., these coefficients
serve as an index of item homogeneity).

• Mathematically, Coefficient Alpha is the average of all possible split-half coefficients.

• Because Coefficient Alpha is a general formula, it has gained in popularity; a
sketch follows this list.
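
A minimal implementation sketch (not from the original notes), using the standard formula alpha = k/(k-1) * (1 - sum of item variances / total-score variance); the response matrix is hypothetical:

```python
import numpy as np

def coefficient_alpha(responses: np.ndarray) -> float:
    """Coefficient Alpha for an examinees-by-items score matrix."""
    k = responses.shape[1]
    item_vars = responses.var(axis=0, ddof=1)      # variance of each item
    total_var = responses.sum(axis=1).var(ddof=1)  # variance of total scores
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

# Hypothetical dichotomous (0/1) responses; with 0/1 items Alpha reduces to KR-20.
responses = np.array([
    [1, 1, 0, 1, 1, 0],
    [1, 0, 1, 1, 0, 1],
    [0, 1, 1, 0, 1, 0],
    [1, 1, 1, 1, 1, 1],
    [0, 0, 1, 0, 0, 0],
])
print(f"Coefficient Alpha = {coefficient_alpha(responses):.2f}")
```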

Inter-Rater Reliability

• Reflects differences due to the individuals scoring the test.

• Important when scoring requires subjective judgment by the scorer (a sketch follows).
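
A minimal sketch with hypothetical essay ratings (not from the original notes): for continuous scores, one simple inter-rater estimate is the correlation between two raters' independent scores; agreement indices such as Cohen's kappa are alternatives for categorical ratings.

```python
import numpy as np

# Hypothetical scores from two raters scoring the same eight essays independently.
rater_a = np.array([4, 3, 5, 2, 4, 3, 5, 1])
rater_b = np.array([4, 2, 5, 3, 4, 3, 4, 2])

r_ir = np.corrcoef(rater_a, rater_b)[0, 1]
print(f"Inter-rater reliability: {r_ir:.2f}")
```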

Major Types of Reliability

• Test-Retest (one form, two sessions): administer the same test to the same
group at two different sessions.

• Alternate Forms, Simultaneous Administration (two forms, one session):
administer two forms of the test to the same group in the same session.

• Alternate Forms, Delayed Administration (two forms, two sessions): administer
two forms of the test to the same group at two different sessions.

• Split-Half (one form, one session): administer the test to a group one time,
then split the test into two equivalent halves.

• Coefficient Alpha or KR-20 (one form, one session): administer the test to a
group one time, then apply the appropriate formula.

• Inter-Rater (one form, one session): administer the test to a group one time;
two or more raters score the test independently.

Improving Reliability

• Increase the number of items (i.e., better domain sampling); the Spearman-Brown
prophecy formula sketched after this list predicts the effect.

• Use multiple measurements (i.e., composite scores).

• Use “Item Analysis” procedures to select the best items.

• Increase standardization of the test (e.g., administration and scoring).
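
The first point can be quantified with the Spearman-Brown prophecy formula, which predicts the new reliability when a test is lengthened by a factor n with comparable items. A minimal sketch (the 0.70 starting reliability is hypothetical):

```python
def prophecy(r_old: float, n: float) -> float:
    """Spearman-Brown prophecy: predicted reliability after lengthening by factor n."""
    return (n * r_old) / (1 + (n - 1) * r_old)

# Doubling a test whose current reliability is 0.70:
print(f"Predicted reliability: {prophecy(0.70, 2):.2f}")  # 0.82
```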

