Professional Documents
Culture Documents
Abstract
Kivanc Atesok, MD, MSc Simulation-based surgical skills training addresses several concerns
Richard M. Satava, MD associated with the traditional apprenticeship model, including patient
safety, efficient acquisition of complex skills, and cost. The surgical
J. Lawrence Marsh, MD
specialties already recognize the advantages of surgical training
Shepard R. Hurwitz, MD using simulation, and simulation-based methods are appearing in
surgical education and assessment for board certification. The
Downloaded from http://journals.lww.com/jaaos by BhDMf5ePHKav1zEoum1tQfN4a+kJLhEZgbsIHo4XMi0hCywCX1AWnYQp/IlQrHD3i3D0OdRyi7TvSFl4Cf3VC1y0abggQZXdgGj2MwlZLeI= on 07/02/2021
Copyright ª the American Academy of Orthopaedic Surgeons. Unauthorized reproduction of this article is prohibited.
Measuring Surgical Skills in Simulation-based Training
Table 1
Summary of Measurement Tools Currently Used for the Assessment of Surgical Skills
Measurement Tool Objectivity Validity Reliability Comments
C-SATS = Crowd-Sourced Assessment of Technical Skills, GRS = Global Rating Scale, OSATS = Objective Structured Assessment of Technical
Skills, YY = low,YYY = very low,[ = moderate to high,[[ = high,[[[ = very high
inherent shortcomings, including sub- Objective Structured train the observers who perform the
jectivity and unfeasibility in terms of Assessment of Technical scoring so that interrater (observer)
standardization. Furthermore, vali- Skills reliability is .0.80 (ie, has almost
dating questionnaires can be chal- perfect agreement between observers)
The Objective Structured Assessment
lenging, because they evaluate to achieve unbiased results.
of Technical Skills (OSATS) was the
subjective measures that can be biased The OSATS methodology was de-
first assessment tool that made possible
by many variables related to the sub- signed for performance evaluation after
the quantitative measurement of surgi-
jects’ self-assessments of qualitative cal skill or task performance in surgical the completion of a training session;
parameters. Most published studies simulation.6 The OSATS is done by however, it can be used during training
in which comfort or knowledge ques- independent observers who evaluate to standardize formative feedback.
tionnaires were used as proficiency the trainee’s performance using a When the trainer (faculty member) uses
measures of surgical procedures re- checklist consisting of a set of specific a checklist during the novice practice
port that such questionnaires are surgical maneuvers that have been sessions, the skills or tasks can be
not validated instruments.4,5 Thus, a deemed essential elements of the pro- monitored in real time. When a check-
questionnaire is not a suitable mea- cedure, such as appropriate placement list item is not performed by the trainee,
surement tool for validated, standard, of plate on bone using a C-arm and standardized, formative feedback can
and metric assessments of surgical securing proximal and distal fixation1 be given. Training and assessment are
competence. (Figure 1). Accordingly, it is critical to two sides of the same coin; thus,
Dr. Satava or an immediate family member serves as a paid consultant to Medtronic Minimally Invasive Therapies; has received research or
institutional support from Intuitive Surgical; and serves as a board member, owner, officer, or committee member of the American College of
Surgeons. Dr. Marsh or an immediate family member has received royalties from Zimmer Biomet and Wright Medical Group N.V.; has stock
or stock options held in FxRedux; and serves as a board member, owner, officer, or committee member of the American Board of
Orthopaedic Surgery and the National Board of Medical Examiners. Dr. Hurwitz or an immediate family member serves as a board member,
owner, officer, or committee member of the American Board of Orthopaedic Surgery, the Orthopaedic Research and Education Foundation,
and the Orthopaedic Trauma Association. Neither Dr. Atesok nor any immediate family member has received anything of value from or has
stock or stock options held in a commercial company or institution related directly or indirectly to the subject of this article.
Copyright ª the American Academy of Orthopaedic Surgeons. Unauthorized reproduction of this article is prohibited.
Kivanc Atesok, MD, MSc, et al
Figure 1
The Objective Structured Assessment of Technical Skills checklist for carpal tunnel release. (Adapted with permission from
Van Heest A, Kuzel B, Agel J, et al: Objective structured assessment of technical skill in upper extremity surgery. J Hand
Surg 2012;37[2]:332-337.)
training can be greatly enhanced by cedure was completed; the tool does skills assessment tool used to measure
formative feedback, especially in train- not measure quality or surgical finesse.7 characteristic surgical behaviors (ie,
ing to proficiency. surgical finesse) during the perfor-
It is important to note that an OSATS Global Rating Scale mance of any given procedure (eg,
checklist reports whether each and The Global Rating Scale (GRS) is respect for soft tissues, fluidity of
every essential step of a surgical pro- another commonly used surgical movements, familiarity with the
Copyright ª the American Academy of Orthopaedic Surgeons. Unauthorized reproduction of this article is prohibited.
Measuring Surgical Skills in Simulation-based Training
instruments)1,8 (Figure 2). Although used metric measurement methods They found significant differences
the GRS was developed to comple- include task completion time and between the performances of the
ment OSATS, some researchers accuracy. Task completion time and surgeon and nonsurgeon groups
include this assessment tool as a error can provide sufficient data to (P , 0.0001) and between senior and
component of OSATS. Because the differentiate between a novice and an junior surgeons (P , 0.05).
surgical skill parameters measured expert in the simulated performance Despite the increasing availability
using the GRS have characteristic of surgical skills. Detection of differ- of simulators that can track and
differences from items included in ences between the intermediate level analyze motion and the positive
the OSATS checklist, it would be of expertise and the novice or expert effects of this method on the objec-
wise to think of the GRS and OSATS level can be challenging, however, tive assessment of proficiency in
as separate measurement tools. because the latter two parameters do surgical skills, the effect of these
Typically, the GRS uses a rating scale not supply metric information about metrics on trainees’ skill transfer to
such as the Likert scale and measures the fluidity of hand movements when the operating room has yet to be
surgical behaviors in general. performing a task. Motion tracking proved.13 It seems unlikely that
Therefore, the GRS arguably pro- and analysis appears to be an objec- motion analysis will be used widely
vides a comprehensive assessment, tive and valid tool for assessing sur- in the actual operating room set-
which includes objective and sub- gical skills in terms of precision and ting. Motion tracking and analysis
jective criteria and measures non- economy of movement during the will likely remain a research tool for
technical cognitive skills (eg, decision performance of surgical proce- selected laboratory-based simula-
making, judgment). Adding sub- dures.9,10 Motion tracking systems tion studies.
jective criteria to any measurement can be mounted to surgical tools and
tool that uses rating scales but does attached to or worn on the hands as
not use well-defined yes-or-no sensors. The movements of these Video Recording
checklists, however, results in the sensors are recorded as three- Video recording of the operation for
limitations associated with sub- dimensional coordinates to measure later assessment of surgical skills has
jectivity, including ambiguity, poor a variety of motions, including the several advantages over the currently
interrater reliability, and bias. total path length traced by each used assessment methods, including
In a recent study, Bernard et al7 sensor and the number of trans- real-time OSATS testing, during
reported that OSATS checklist scores lational or rotational movements.11 which an observer must be ready in
showed strong interrater reliability The main disadvantage of these the operating room to rate a trainee’s
(.0.8) between the evaluators in systems is that they require attaching performance on a checklist. Video-
assessing technical skills pertaining extra devices to surgical tools or based feedback is a practical method
to shoulder surgical approaches. The wearing equipment on the arms or that enables the assessment of sur-
GRS scores were found to be mod- hands, which can be cumbersome gical performance using the same
erately reliable (0.75) between eval- and impractical for the assessment of measurement tools as the OSATS or
uators, however. The results of this surgical skills. This method does not the GRS at the time that is most
study also showed that the OSATS always require such cumbersome convenient for the rater14 (Figure 3).
checklist and GRS scores correlated implementations, especially in Multiple raters can examine the
with the trainees’ levels of experi- arthroscopic procedures. Motion same video recording and score the
ence. This finding supports the val- analysis systems can be built into a performance, which may be effective
idity of these measurement tools in simulator to track and analyze in reducing bias. Video recordings
differentiating between the skills of instrument tip trajectory data.12 are edited after the procedure, and
novice and experienced trainees in Howells et al9 showed the validity of unnecessary parts of the recording
performing surgical approaches to a motion analysis system in its ability are cut; thus, the evaluation of sur-
the shoulder. to differentiate between subjects gical performance using video
with different expertise levels in recordings also may reduce the time
arthroscopic skills. The authors re- needed to assess the complete pro-
Motion Tracking and corded the time taken, total path cedure. Although the process of ed-
Analysis length, and number of movements iting may take extra time,15 the
Objective assessment of performance used when performing simulated potential time saved by enabling
with simulators requires metrics to arthroscopic tasks using a shoulder multiple raters to assess the edited
provide accurate measurement of simulator equipped with an electro- recordings arguably far outweighs
surgical skills. The most commonly magnetic motion tracking system. the time spent on editing.
Copyright ª the American Academy of Orthopaedic Surgeons. Unauthorized reproduction of this article is prohibited.
Kivanc Atesok, MD, MSc, et al
Figure 2
The Global Rating Scale score sheet for carpal tunnel release. (Adapted with permission from Van Heest A, Kuzel B, Agel J,
et al: Objective structured assessment of technical skill in upper extremity surgery. J Hand Surg 2012;37[2]:332-337.)
Copyright ª the American Academy of Orthopaedic Surgeons. Unauthorized reproduction of this article is prohibited.
Measuring Surgical Skills in Simulation-based Training
Copyright ª the American Academy of Orthopaedic Surgeons. Unauthorized reproduction of this article is prohibited.
Kivanc Atesok, MD, MSc, et al
has demonstrated the value of using experts. Nevertheless, questions sures.24 These methods are used
these parameters in surgical skills remain regarding the ability of alone or in combination based on the
assessments when compared with simulators to discriminate between preferences of each research group
accustomed methods such as novice and intermediate proficiency or institution. Therefore, heteroge-
OSATS.20,21 levels. The main reason for the neity exists in the literature con-
In a simulated intra-articular frac- inability to discriminate between cerning the available evidence
ture reduction model, Anderson relatively close proficiency levels needed to draw conclusions. There is
et al20 showed that OSATS did not could be the use of nonstandardized a need to define a standard, full–life-
correlate well with the actual frac- measurement techniques that are cycle, simulation-based surgical
ture reduction measured using three- not sensitive enough to quantify education curriculum along with
dimensional digital models of the such differences. measurement protocols using reli-
final reductions of articular surfaces. As the use of simulators becomes able, valid, and objective metrics and
Similarly, in a distal radius fracture more commonplace, it will be critical to adopt a proficiency-based pro-
fixation model, Putnam et al21 to define a full–life-cycle, simulation- gression methodology.24
showed that the biomechanical based surgical training curriculum
strength of the fracture construct did using proficiency-based progression
not correlate with medical knowl- methodology (ie, training to a References
edge tests and OSATS scores. These benchmark that has been established
data strongly suggest that the pre- by expert performance)23 along with Evidence-based Medicine: Levels of
viously described direct and objec- objective, reliable, and valid mea- evidence are described in the table of
tive metric measurement parameters surement protocols that are stan- contents. In this article, references 13
are critically important adjuncts to dardized across all training programs and 23 are level I studies. References
more commonly used techniques, nationwide. Perhaps such measure- 4-9, 11, 14, 15, and 19-21 are level II
such as the OSATS and the GRS, in ment protocols will need to combine studies. References 10, 12, and 22
the assessment of surgical skills. more objective techniques such as are level III studies. Reference 16 is a
OSATS, motion analysis, and direct level IV study. References 1-3, 18,
metrics with video recording and and 24 are level V expert opinion.
Future Work C-SATS. Although the curriculum References printed in bold type are
may be subject to change based on those published within the past 5
Currently, the consensus in ortho- the country in which it is used, it will years.
paedic leadership—and in surgical be necessary to achieve consistency in
1. Atesok K, Mabrey JD, Jazrawi LM, Egol
subspecialties in general—is that a measurement protocols internation- KA: Surgical simulation in orthopaedic
paradigm shift that integrates ally to set a standard and ensure skills training. J Am Acad Orthop Surg
2012;20(7):410-422.
simulation-based surgical skills efficiency in communicating the level
training into curricula is necessary of surgical proficiency. 2. Atesok K, Satava RM, Van Heest A, et al:
Retention of skills after simulation-based
in surgical education. Furthermore, training in orthopaedic surgery. J Am Acad
simulation-based assessment in high- Orthop Surg 2016;24(8):505-514.
stakes surgery board examinations Summary 3. Gallagher AG: Metric-based simulation
across all surgical subspecialties is training to proficiency in medical
also foreseeable. The measurement Simulation-based skills training and education: What it is and how to do it.
Ulster Med J 2012;81(3):107-113.
techniques that will determine the assessment are increasingly incorpo-
level of proficiency are not yet well rated into surgical education and 4. Beth Grossman L, Komatsu DE,
Badalamente MA, Braunstein AM, Hurst
defined, however. In a systematic certification processes. Measurement LC: Microsurgical simulation exercise for
review, Slade Shantz et al22 studied techniques to quantify the level of surgical training. J Surg Educ 2016;73(1):
116-120.
the internal validity of arthroscopic proficiency in the performance of
simulators and aimed to document surgical procedures will be the key 5. Monod C, Voekt CA, Gisin M, Gisin S,
Hoesli IM: Optimization of competency in
whether any standard validation element in the success of this para-
obstetrical emergencies: A role for
protocols exist. The authors re- digm shift in surgical education. simulation training. Arch Gynecol Obstet
ported excessive heterogeneity in Some of the current measurement 2014;289(4):733-738.
the literature concerning perfor- methods used to assess surgical skills 6. Reznick R, Regehr G, MacRae H,
mance metrics used in assessments include the OSATS, the GRS, motion Martin J, McCulloch W: Testing technical
skill via an innovative “bench station”
and noted that the simulators can tracking, video recording, C-SATS, examination. Am J Surg 1997;173(3):
discriminate between novices and and direct or objective metric mea- 226-230.
Copyright ª the American Academy of Orthopaedic Surgeons. Unauthorized reproduction of this article is prohibited.
Measuring Surgical Skills in Simulation-based Training
7. Bernard JA, Dattilo JR, Srikumaran U, 13. Stefanidis D, Yonce TC, Korndorffer JR Jr, skills: Differentiating animate surgical skill
Zikria BA, Jain A, LaPorte DM: Reliability Phillips R, Coker A: Does the incorporation through the wisdom of crowds. J Endourol
and validity of 3 methods of assessing of motion metrics into the existing FLS 2015;29(10):1183-1188.
orthopedic resident skill in shoulder metrics lead to improved skill acquisition
surgery. J Surg Educ 2016;73(6): on simulators? A single blinded, 20. Anderson DD, Long S, Thomas GW,
1020-1025. randomized controlled trial. Ann Surg Putnam MD, Bechtold JE, Karam MD:
2013;258(1):46-52. Objective structured assessments of
8. Alvand A, Logishetty K, Middleton R, technical skills (OSATS) does not assess the
et al: Validating a global rating scale to 14. Dath D, Regehr G, Birch D, et al: Toward quality of the surgical result effectively.
monitor individual resident learning reliable operative assessment: The Clin Orthop Relat Res 2016;474(4):
curves during arthroscopic knee reliability and feasibility of videotaped 874-881.
meniscal repair. Arthroscopy 2013;29 assessment of laparoscopic technical skills.
(5):906-912. Surg Endosc 2004;18(12):1800-1804. 21. Putnam MD, Kinnucan E, Adams JE, Van
Heest AE, Nuckley DJ, Shanedling J: On
9. Howells NR, Brinsden MD, Gill RS, Carr 15. Karam MD, Thomas GW, Koehler DM, orthopedic surgical skill prediction: The
AJ, Rees JL: Motion analysis: A validated et al: Surgical coaching from head-mounted limited value of traditional testing. J Surg
method for showing skill levels in video in the training of fluoroscopically Educ 2015;72(3):458-470.
arthroscopy. Arthroscopy 2008;24(3): guided articular fracture surgery. J Bone
335-342. 22. Slade Shantz JA, Leiter JR, Gottschalk T,
Joint Surg Am 2015;97(12):1031-1039.
MacDonald PB: The internal validity of
10. Mason JD, Ansell J, Warren N, Torkington J: 16. Jabbour N, Sidman J: Assessing instrument arthroscopic simulators and their
Is motion analysis a valid tool for assessing handling and operative consequences effectiveness in arthroscopic education.
laparoscopic skill? Surg Endosc 2013;27(5): simultaneously: A simple method for creating Knee Surg Sports Traumatol Arthrosc
1468-1477. synced multicamera videos for endosurgical 2014;22(1):33-40.
or microsurgical skills assessments. Simul
11. Clinkard D, Holden M, Ungi T, et al: The 23. Angelo RL, Ryu RK, Pedowitz RA, et al: A
Healthc 2011;6(5):299-303.
development and validation of hand proficiency-based progression training
motion analysis to evaluate competency in 17. Oxford Living Dictionaries. Online volume curriculum coupled with a model simulator
central line catheterization. Acad Emerg 2017. results in the acquisition of a superior
Med 2015;22(2):212-218. arthroscopic Bankart skill set. Arthroscopy
18. Lendvay TS, White L, Kowalewski T: 2015;31(10):1854-1871.
12. Tashiro Y, Miura H, Nakanishi Y, Okazaki K, Crowdsourcing to assess surgical skill.
Iwamoto Y: Evaluation of skills in JAMA Surg 2015;150(11):1086-1087. 24. Atesok K, MacDonald P, Leiter J, et al:
arthroscopic training based on trajectory Orthopaedic education in the era of surgical
and force data. Clin Orthop Relat Res 19. Holst D, Kowalewski TM, White LW, et al: simulation: Still at the crawling stage.
2009;467(2):546-552. Crowd-sourced assessment of technical World J Orthop 2017;8(4):290-294.
Copyright ª the American Academy of Orthopaedic Surgeons. Unauthorized reproduction of this article is prohibited.