Professional Documents
Culture Documents
How To Evaluate Training Kirkpatrick
How To Evaluate Training Kirkpatrick
like most about the course; and 2) What could be answer, and the distractors are correct answers
improved? to other questions. Here, again, proper grammar
makes a difference. If a question read like this:
Level 2: Learning An element whose oxidation number is 0 pre-
Level 2 of Kirkpatrick’s model is learning, that vents gas from forming compounds readily, is
is, how much of the content attendees learned as called an ________ gas:
Phil La Duke is an executive consultant, safety expert, speaker, blogger, trainer and a) inert
business author. He speaks on organizational change and worker safety, and serves b) low reaction
on the Wayne State Biomedical Safety Board. La Duke is the author of a weekly blog c) nonreactive
(www.philladuke.wordpress.com) and is also a guest blogger on an Australian blog d) I don’t know
(www.safetyrisk.au.com) and Monster’s company blog (www.monsterTHINKING.com).
20 ProfessionalSafety AUGUST 2017 www.asse.org
Since the word an is used directly before the knows nothing when they evaluate whether non-
blank, basic grammar tells us that the correct classroom training is effective. The cost-effective
answer begins with a vowel. Using common sense, way to deliver this training is through demonstra-
one can assume that d) I don’t know is incorrect. tion and practice. In this type of training, a skilled
By process of elimination, we can conclude that trainer or veteran employee uses a task list or a
a) inert must be the correct answer, as all other standard work instruction sheet to demonstrate
possibilities are grammatically incorrect. Never use the correct way to perform the task while the
answers such as “a) and b) only” or “all of/none of learner watches and asks questions until s/he is
the above” because you risk testing reading com- ready to practice the tasks under the watchful eye
prehension skills instead of knowledge acquisition. of the trainer.
Some critics think 20 questions is too many or In cases where the worker could injure him/her-
that they take too long to answer. But, 20 ques- self by operating heavy equipment or unfamiliar
tions is enough to be statistically valid (assuming a machinery, administering a pretest that allows the
couple of variables). Let’s assume 20 people are in learner to try to operate a machine or induction
the class and each person is taking a 20-question molder would be reckless and provide no value
test. For a confidence level of 95% and a confi- except to determine whether the worker already
dence interval of ± 5, one would need a popula- has the skills that the shadow training is meant
tion of 19. Once the test is validated, one can be to impart. Trainers must err on the side of caution
certain that the difference between the pre- and (not to mention avoiding legal liability and viola-
posttests is the result of the learning event. tion of the General Duty Clause) and provide the
When analyzing the test scores, understand training to the person.
they should be skewed to the right (i.e., the test Some trainers and especially veteran employ-
scores should be disproportionately high, indicat- ees tend to skip areas of training or skimp on
ing that most people mastered the content). One demonstration and practice because the worker
can further analyze the data using μ scores. A seems particularly adept at the skills. This makes
μ (pronounced moo) score is the average of the the Level 3 evaluations critical. I tend to wait a
averages; scores that are going up indicate that the month, then ask the worker to perform those
instructors are getting better at their jobs. Scores tasks on which s/he was trained to learn whether
that are going down indicate instructors are get- the worker is still able to perform the skills to
ting bored, taking shortcuts or for some other standard. During this evaluation, I ask specific
reason failing to present the full content. Assum- questions that align with the course objectives to
ing the content and the test have not changed, the determine what has been retained.
μ score is an accurate reflection of the instructor’s At this point, some may wonder whether it is
performance. necessary that the worker retain the information
The first two levels of Kirkpatrick’s model are presented in an orientation. The problem with an
how OSH trainers can validate whether their orientation is that learners frequently lack suffi-
training is effective. cient context to synthesize the concepts presented.
This may sound like psychobabble—after all, what
Level 3: Behavior & Level 4: Results sort of context does one need to understand that
Levels 3 (behavior) and 4 (results) of Kirkpat- fire can burn you and electricity can shock you?
rick’s model are a bit complex and in-depth dis- Unfortunately, after a month on the job a
cussion is of limited value since so few get the first worker often starts to feel invincible and the once-
two levels correct. Level 3 measures whether the terrifying becomes mundane or even acceptable.
learning is transferred into practice in the work- By conducting a Level 3 evaluation, one can both
place. This kind of measurement can be tricky, yet evaluate training effectiveness and provide context
many people skip the first two steps in favor of and much-needed reinforcement of critical safety
jumping to the third. Not only is this shortsighted, points that the training was intended to impart.
it might also be dangerous. With repeated evaluations and reinforcement, the
It is important to measure the first two levels. training can ultimately become internalized and
Once it is assured that the learners understand hardwired into the behavior.
the purpose and importance of the class they have There is a complex connection between evalu-
completed, and it is known whether they acquired ation and reinforcement. When trainers get to
skills as a result of the course, one must determine Level 3 evaluation, this intricate connection is es-
whether they apply those skills in the workplace. pecially strong. As for Level 4 (results), let’s leave
Correct application of skills is especially im- that to the training professionals to debate.
portant in safety because it can literally make the
difference between life and death. All training References
should be evaluated, and the concept of pre- and Kirkpatrick Partners. (2017). The Kirkpatrick model.
posttesting can be confusing to some. At times, Retrieved from www.kirkpatrickpartners.com/OurPhil
for safety’s sake, trainers must assume the worker osophy/TheKirkpatrickModel