The Journal of Emergency Medicine, Vol. -, No. -, pp. 1–8, 2016
© 2016 Elsevier Inc. All rights reserved. 0736-4679/$ - see front matter
http://dx.doi.org/10.1016/j.jemermed.2016.12.002

Education

STANDARDIZED DIRECT OBSERVATION ASSESSMENT TOOL: USING A TRAINING VIDEO

Kathleen E. Kane, MD, Kevin R. Weaver, DO, Gavin C. Barr Jr., MD, Gary Bonfante, DO, Nicole L. Bendock, DO, Brian M. Berry, DO, Stephanie L. Goren-Garcia, DO, Marc B. Lewbart, DO, Allison L. Raines, DO, Gregory Smeriglio Jr., DO, and Bryan G. Kane, MD

Department of Emergency Medicine, Lehigh Valley Hospital and Health Network/USF MCOM, Allentown, Pennsylvania

Reprint Address: Kathleen E. Kane, MD, Lehigh Valley Hospital and Health Network, 5th Floor, Emergency Medicine Residency Suite, 2545 Schoenersville Road, Bethlehem, PA 18107

RECEIVED: 3 December 2015; FINAL SUBMISSION RECEIVED: 27 July 2016; ACCEPTED: 2 December 2016

Abstract—Background: We developed a DVD training tool to educate physicians who evaluate emergency residents on accurate application of the Standardized Direct Observation Assessment Tool (SDOT). Objective: Our goal was to assess whether this training video improved attendings' and senior residents' use of the SDOT. Methods: Participants voluntarily completed SDOT evaluations based on a scripted "test" video. They then viewed a DVD containing "positive" and "negative" scenarios of proper SDOT use, which included education on the appropriate recording of 26 behaviors. The test scenario was viewed again and follow-up SDOTs were submitted. Performances by attendings and residents on the pre- and post-test SDOTs were compared. Results: Twenty-six attendings and 26 senior residents participated. Prior SDOT experience was noted for 8 attendings and 11 residents. Across 20 anchors, participants recorded observed behaviors with a statistically significant difference on only one anchor each of the pretest (no. 20; p = 0.034) and post-test (no. 14; p = 0.041) SDOTs. On global competency assessments, only pretest medical knowledge (p = 0.016) differed significantly between groups. The training intervention changed one anchor (no. 5; p = 0.035) and one global assessment (systems-based practice; p = 0.031) more negatively for residents. Attendings recorded SDOTs with exact agreement 48.73% of the time on the pretest and 54.41% on the post-test; resident scores were 45.86% and 49.55%, respectively. DVD exposure slightly raised attending scores (p = 0.289) and significantly lowered resident scores (p = 0.046). Conclusions: Exposure to an independently developed SDOT training video tended to raise attending scores, though without statistical significance, while significantly lowering senior resident scores. Emergency attendings' and senior residents' SDOT scoring rarely differed with significance, and about half of anchor behaviors were recorded with exact agreement. This suggests that senior residents, with appropriate education, may participate in SDOT assessment. © 2016 Elsevier Inc. All rights reserved.

Keywords—SDOT; training video

The study was reviewed and approved as exempt by our network's Institutional Review Board before any study procedures took place. An abstract of this report was presented in poster format at the Council of Emergency Medicine Residency Directors Academic Assembly in 2012 in Atlanta, GA. The project was presented in an oral format at the 2012 Pennsylvania College of Emergency Physicians Scientific Assembly in Gettysburg, PA, where it received a Spivey Research Award.

INTRODUCTION

The Accreditation Council for Graduate Medical Education (ACGME) requires performance evaluation of all residents.
Previously, the evaluation process was designed to address six areas referred to as the "Core Competencies," which were central to the prior ACGME evaluation system (1). These competencies consist of the following: patient care (PC), medical knowledge (MK), practice-based learning and improvement (PBL), interpersonal and communication skills (ICS), professionalism (PROF), and systems-based practice (SBP) (1). The Standardized Direct Observation Tool (SDOT) was developed with these competencies in mind (2). The initial validation was performed with video scenarios, and the use of senior residents as evaluators was not studied in that cohort. Recently, the American Board of Emergency Medicine and the ACGME have moved to evaluating emergency residents using the Next Accreditation System (NAS) "milestones." Table 1 describes these milestones. The Council of Emergency Medicine Residency Directors (CORD) recommends direct observation as an evaluation tool for emergency residents for the following milestones: 1, 2, 3, 4, 5, 6, 7, 8, 10 (via checklist), 12, 13, 16, 17, 18, 19, 20, 21, 22, and 23 (3). Direct observation of some type is thus recommended for 19 of the 23 milestones. The previously described SDOT is specifically recommended for measurement of all of these but milestones 13, 18, and 23. The SDOT requires that an emergency physician observe all aspects of a resident–patient encounter, from initial contact to history and physical examination, reevaluation, and final disposition. The tool provides standard definitions of performance for each of the 26 evaluated areas, the six core competencies, a global assessment, and the ability to provide free-form feedback (4). These performance assessment definitions, or "anchors," as we denote them in this article, are translatable to measurable NAS behaviors. A previous study demonstrated that the SDOT instrument can be used with a high degree of inter-rater reliability by attendings in a summative fashion for five of the six competencies, with minimal training (5). The use of senior residents as SDOT evaluators has not, to date, been studied. We sought to develop a training tool that educated both attendings and senior residents on the application of the SDOT instrument while they viewed a resident during a performance encounter modeling positive behaviors (exceeds expectations), negative behaviors (below expectations), and mixed behaviors (a combination of exceeds, meets, and below expectations). Accuracy was defined as recording, on a study SDOT, behavior from a video encounter with exact agreement with the script. The behaviors were scripted according to the tool's definitions (4). This study's specific goal was to determine whether this brief training video could improve attendings', as well as postgraduate year (PGY) 3 and PGY4 residents', use of the SDOT, as measured by exact agreement.
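The milestone counts in the preceding paragraph can be cross-checked by encoding the recommendation lists directly. The following Python snippet is illustrative only (it is not part of the study's methods) and assumes the milestone numbering shown in Table 1 below:

```python
# Illustrative cross-check of the CORD recommendations described above.
# Milestone numbers follow Table 1; this script is not part of the study.
DIRECT_OBSERVATION = {1, 2, 3, 4, 5, 6, 7, 8, 10, 12, 13,
                      16, 17, 18, 19, 20, 21, 22, 23}
SDOT_EXCLUDED = {13, 18, 23}  # direct observation recommended, but not via SDOT

assert len(DIRECT_OBSERVATION) == 19   # 19 of the 23 milestones
sdot_milestones = DIRECT_OBSERVATION - SDOT_EXCLUDED
assert len(sdot_milestones) == 16      # SDOT specifically recommended for 16
print(sorted(sdot_milestones))
```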
Table 1. Emergency Medicine Milestones and Grading Rubric

No.  Category  Milestone
1    PC1       Emergency Stabilization
2    PC2       Performance of a Focused History and Physical Examination
3    PC3       Diagnostic Studies
4    PC4       Diagnosis
5    PC5       Pharmacotherapy
6    PC6       Observation and Reassessment
7    PC7       Disposition
8    PC8       Multi-tasking
9    PC9       General Approach to Procedures
10   PC10      Airway Management
11   PC11      Anesthesia and Acute Pain Management
12   PC12      Goal-directed Focused Ultrasound
13   PC13      Wound Management
14   PC14      Vascular Access
15   MK        Medical Knowledge
16   SBP1      Patient Safety
17   SBP2      Systems-based Management
18   SBP3      Technology
19   PBL1      Practice-based Performance Improvement
20   PROF1     Professional Values
21   PROF2     Accountability
22   ICS1      Patient Centered Communication
23   ICS2      Team Management

Grading rubric (applies independently to each milestone):
Level 1: The resident demonstrates milestones expected of an incoming resident.
Level 2: The resident is advancing and demonstrates additional milestones, but is not yet performing at a mid-residency level.
Level 3: The resident continues to advance and demonstrate additional milestones; the resident demonstrates the majority of milestones targeted for residency in this sub-competency.
Level 4: The resident has advanced so that he or she now substantially demonstrates the milestones targeted for residency. This level is designed as the graduation target.
Level 5: The resident has advanced beyond performance targets set for residency and is demonstrating "aspirational" goals, which might describe the performance of someone who has been in practice for several years. It is expected that only a few exceptional residents will reach this level.

ICS = interpersonal and communication skills; MK = medical knowledge; PBL = practice-based learning and improvement; PC = patient care; PROF = professionalism; SBP = systems-based practice.

MATERIALS AND METHODS

This was a pre-/post-education intervention study using the standards for performance evaluations defined by the ACGME and CORD for performing SDOTs (4). The study was reviewed and approved as exempt by our network's Institutional Review Board before any study procedures took place. The study was conducted at an independent academic medical center, not affiliated with a local medical school, hosting a dually approved PGY1 through PGY4 emergency medicine (EM) residency with 14 residents per year. Employed at the time of the study were 46 attendings, 19 of whom were designated as teaching faculty. Because all attendings evaluate residents, no differentiation was made as to whether they were designated as teaching faculty on the Program Information Form (PIF). All emergency attending physicians and PGY3/PGY4 residents were given the material for improving faculty and senior resident knowledge. All were deemed eligible and were asked to participate in this study. Core faculty who were members of the study team were excluded. The attendings and PGY3/PGY4 residents were each asked to watch "test" and "educational" DVDs, which took approximately 70 min. EM core faculty and senior residents developed the DVDs. Ideal scores for the test case were determined by design and intentionally modeled in the video seen on the DVDs. The research team provided the DVDs to all potential participants. Consent to participate was implied by the return of evaluation forms; there was no penalty for those who chose not to participate. Study participants viewed two DVDs of a resident–patient encounter.
The first DVD, considered the pretest DVD, incorporated mixed behaviors (a combination of exceeds, meets, and below expectations behaviors) in a resident performance surrounding a single mock resident–patient encounter and resident–attending interaction. The participants viewed the pretest DVD and evaluated this scenario by completing the SDOT evaluation tool before receiving any education. The second, educational, DVD was given to study participants to view in its entirety after they submitted their initial completed SDOT evaluation form. This educational DVD was developed as the primary training tool; it modeled positive behaviors (exceeds expectations), negative behaviors (below expectations), and mixed behaviors (a combination of exceeds, meets, and below expectations) in resident performances surrounding three versions of a mock single-patient encounter with a chief complaint of chest pain. The first two encounters on the educational DVD were informative and demonstrated a scripted positive and then a scripted negative resident performance. After viewing each of these scripted scenarios, the participants were asked to complete practice SDOT evaluation forms, but were not required to return these forms to the researchers, as the DVD provided immediate feedback and ideal answers for educational purposes. After the third encounter (a repeat of the scenario on the first DVD, containing a combination of predetermined mixed behaviors), another SDOT evaluation form was completed. This final SDOT evaluation was submitted to the researchers as post-education data and compared to the pre-education SDOT data provided before viewing the educational video.

All details of the mock patient's case and the roles actors played were consistent in the scripted positive and negative cases. The only elements that changed related to the resident actor's performance, that is, whether it was overwhelmingly positive or negative. For the third encounter, in both the pre- and post-test scenarios, details of the patient's history and examination remained the same, except that, instead of the patient needing a translator, the patient was hearing impaired. Here the resident actor's performance was intentionally mixed. The test-case role actors were changed to prevent any affective bias from the first two cases. The intended scripted resident performance is shown in Table 2. The 1 through 5 Likert global assessment scales for the six core competencies (PC, MK, PBL, ICS, PROF, and SBP) were neither scripted nor taught in the educational video, and, as such, their interpretation was left to the study participant. Also unscripted was the overall clinical competence score.

Pre- and post-education SDOT evaluation forms were matched by name by the Emergency Medicine Research Office. The study's research coordinator deidentified these forms, ultimately using a study identification number. Names were never released to study team members or the statistician. As part of the consent process, participants were informed that only the research coordinator was aware of their identity. At the conclusion of the final SDOT scenario, study participants were asked to complete a demographic form collecting general demographic information and written comments regarding the viewed encounter. Participant names were later removed and replaced by their study identification number by the study's research coordinator.
Table 2. Scripted Behaviors in Each Video

SDOT Anchor                           Good  Poor  Test  Examples
 1. Respectful of privacy              ME    ME    ME   Encounter kept separate
 2. Appears professional               AE    NI    ME   Appropriateness of attire
 3. Uses translation services          ME    NI    NI   Ask of services needed
 4. Efficient information gathering    ME    NI    AE   Organized and logical
 5. Complaint oriented examination     AE    NI    ME   Focused on key elements
 6. Explains pathophysiology           ME    NI    ME   Aware of disease mechanism
 7. Presents structured case           AE    NI    AE   Organized presentation
 8. Discusses differential             AE    NI    ME   Consider atypical presentation
 9. Risks/benefits/indications         AE    NI    ME   Discussion of side effects
10. Critical actions                   ME    NI    ME   Knows key steps
11. Procedural competency              AE    NI    AE   Preparation and completion
12. Clear communication                ME    NI    ME   Courteous and consistent
13. Conflict avoidance/resolution      NA    NI    ME   Awareness of sensitive topics
14. Discusses care plan                ME    NI    NA   Communicates course
15. Clinical charting                  ME    NA    ME   Timely/complete documentation
16. Patient prioritization             ME    NA    AE   Acuity takes priority
17. Contextual use of resources        AE    NI    AE   Calls in staff appropriately
18. Concern for social constraints     NA    NI    NA   Considers patient compliance
19. Controls distractions              AE    NI    ME   Excuses self when necessary
20. Informed decision making           ME    NI    ME   Respects patient wishes
21. Patient reevaluation               ME    NI    ME   Evaluates response to treatment
22. Documents reassessment             NI    NA    NA   Continued charting
23. Use of resources                   NA    NA    NA   Aware of supportive consultants
24. Discharge planning                 NA    NA    NA   Anticipates patient needs
25. Completes discharge plan           NA    NA    ME   Communicates with patient
26. Arranges follow-up                 NA    NA    NA   Ensures care after discharge

AE = above expectations; ME = meets expectations; NA = not applicable; NI = needs improvement; SDOT = Standardized Direct Observation Assessment Tool. Good = scripted positive encounter; Poor = scripted negative encounter; Test = scripted test case (ideal test-case responses appeared in bold type in the original).

Analysis included descriptive statistics (mean ± standard deviation) for continuous variables and n (%) for categorical variables for each of the evaluation cycles. The primary outcome was the comparison of individual participants' pre- and post-test scores. Secondary outcomes included comparisons of faculty and resident scores. Student's t-test, Pearson χ² (Fisher's exact test), Mann-Whitney U tests, and McNemar tests were used, as appropriate. Pre- and post-test scenario scores were compared using Wilcoxon two- and three-sample tests, as well as signed rank testing. "Not applicable" (NA) responses were excluded from some analyses; the remaining responses were coded as 1 = needs improvement, 2 = meets expectations, and 3 = above expectations. Significance was determined at p < 0.05.
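To make the score coding and the choice of tests concrete, here is a minimal Python sketch (illustrative only; the study does not publish analysis code, and the response lists below are invented) of an unpaired attending-versus-resident comparison and a paired pre/post comparison on a single anchor:

```python
# A minimal sketch of the comparisons described above, not the authors'
# actual analysis. NA responses are dropped; scores use the stated coding.
from scipy import stats

CODE = {"NI": 1, "ME": 2, "AE": 3}  # 1 = needs improvement, 2 = meets, 3 = above

def coded(responses):
    """Drop NA responses and map the remainder to 1/2/3."""
    return [CODE[r] for r in responses if r in CODE]

# Hypothetical pretest responses on one anchor, one entry per rater.
attending_pre = coded(["ME", "AE", "ME", "NA", "NI", "ME", "AE", "AE", "ME"])
resident_pre  = coded(["ME", "ME", "NI", "ME", "NI", "ME", "ME", "NA", "AE"])

# Attending vs. resident scores on the same anchor (unpaired groups).
print(stats.mannwhitneyu(attending_pre, resident_pre))

# One group's pre- vs. post-test scores on the same anchor (paired raters).
pre  = [2, 3, 2, 1, 2, 3, 2, 3, 2, 1]
post = [2, 2, 1, 1, 2, 2, 2, 2, 3, 2]
print(stats.wilcoxon(pre, post))  # signed-rank test; ties (zeros) are dropped
```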
RESULTS

Twenty-six attendings, with a mean of 12.2 ± 7.1 years of EM experience, and 26 senior residents agreed to participate in this study. Some familiarity with the SDOT was noted by 8 of the attendings and 11 of the residents.

Table 3 depicts the pre- and post-SDOT anchor evaluations submitted by the study participants. As shown in Table 2, the ideal scores (excluding those scripted NA) for each SDOT anchor are those listed in the Test column. It is notable how many residents and faculty provided answers for items that clearly could not be evaluated appropriately based on the content of the video (this applies to items 14, 18, 22, 23, 24, and 26, which were scripted as NA).

Table 3. Frequency Tabulation by Anchor, Respondent, Observation Period, and Category. [The body of this table reports, for each of the 26 anchors, the number (n) of Needs Improvement (score = 1), Meets Expectations (score = 2), and Above Expected (score = 3) responses, broken down by respondent group (faculty vs. resident) and observation period (pretest vs. post-test). The values could not be reliably recovered from the source text and are omitted here. In the original, ideal responses appear in bold type; item numbers 14, 18, 22, 23, 24, and 26 were not able to be assessed in the video.]

The differences in anchor scores as recorded by attendings and senior residents were rarely significant. Only on anchor 20 on the pretest (p = 0.034) and anchor 14 on the post-test (p = 0.041) did the responses of the two groups differ statistically. In both cases, the attendings' scores were significantly higher than the residents'. Exposure to the training video did not change this relationship between the groups' recorded anchors with statistical significance, except for anchor 5, where the training lowered resident scores (p = 0.035).

Excluding the NA-scripted responses, attendings recorded the intended scripted behavior with exact agreement, on average, 48.73% of the time for each anchor on the pretest. This improved to 54.41% on the post-test. Residents' exact agreement with the intended scripted behavior improved, on average, from 45.86% per anchor on the pretest to 49.55% on the post-test. Overall, after watching the instructional video, residents became more critical in their evaluations of the scripted video performance in 14 of the 26 questions, less critical in 6 of the 26, and showed no change in 6 of 26. Attendings were more critical after watching the instructional video in 10 of 26 questions, less critical in 12 of 26, and showed no change in 4 of 26. Exposure to the video raised faculty scores, although not in a statistically significant manner (p = 0.289), and lowered resident scores significantly (p = 0.046).

The recorded responses for the unscripted evaluation of global competency are provided in Table 4. The only significant difference between attending and resident assessments of competency was in the pretest scores for medical knowledge: attendings were more likely than residents to rate global medical knowledge lower (p = 0.016). Exposure to the training significantly lowered resident assessment scores as compared to attendings for SBP on the post-test (p = 0.031). This was the only significant change in the relationship between the two groups' scores as a result of the intervention.

Table 4. Pre- and Post-Test Frequency of Responses by Subject Type for Unscripted Global Competencies

Variable                      1    2    3    4    5
Patient care
  Faculty pretest             3   10   10    1    0
  Faculty post-test           0    3    5    5    0
  Resident pretest            4   10    4    4    0
  Resident post-test          7    9    4    4    0
Medical knowledge
  Faculty pretest             0    6   11    7    1
  Faculty post-test           0    0    6    7    0
  Resident pretest            0    0    7   11    5
  Resident post-test          0    2   10   11    1
Practice-based learning and improvement
  Faculty pretest             0    5   15    2    0
  Faculty post-test           0    0    7    6    0
  Resident pretest            0    7   11    5    0
  Resident post-test          1    4   13    6    0
Interpersonal and communication skills
  Faculty pretest             7   15    4    0    0
  Faculty post-test           1    9    3    0    0
  Resident pretest            6   14    2    0    1
  Resident post-test         10   10    3    1    0
Professionalism
  Faculty pretest             4   17    5    0    0
  Faculty post-test           1    5    7    0    0
  Resident pretest            7    9    5    0    1
  Resident post-test          6   12    5    1    0
Systems-based practice
  Faculty pretest             0    3   17    2    0
  Faculty post-test           1    0   11    1    0
  Resident pretest            0    4   17    2    0
  Resident post-test          2    8   13    1    0
Overall
  Faculty pretest             7   10    1    0    0
  Faculty post-test           1   10    0    0    0
  Resident pretest            8    8    1    0    0
  Resident post-test          4    6    0    0    0

Values are n.

DISCUSSION

We hypothesized that an SDOT training video would educate both attending and senior resident evaluators on the SDOT tool, improving their ability to accurately evaluate scripted behaviors. In this study, exact agreement of submitted SDOTs with the intended, scripted performance was approximately half for both groups, pretest and post-test. Exact agreement in a prior study averaged 65.99% per anchor (5). The video did improve exact agreement, but only by about 5%. Our agreement is lower than previously published, and may be a result of this study's smaller sample size.

In our pretest/post-test cohort, there were few significant differences between attending and resident scoring of the SDOT. This lack of statistical difference was found both for the majority of the specific anchors, which were scripted and addressed in the training video, and for the global assessments of competency. Training, in the form of a locally developed instructional video, does not significantly change the relationship between resident and attending SDOT scores.
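The exact-agreement figures reported in the Results and revisited above (for example, 48.73% pretest vs. 54.41% post-test for attendings) are per-anchor averages of the percentage of raters whose recorded score matched the scripted one. A minimal sketch of such a calculation, using invented responses (the function name and data are illustrative, not the study's code):

```python
# Sketch of the "exact agreement" metric: the mean, across non-NA anchors,
# of the percentage of raters whose score matches the scripted one.
def exact_agreement(scripted, recorded_by_anchor):
    """Mean per-anchor percentage of raters matching the scripted score."""
    rates = []
    for anchor, intended in scripted.items():
        if intended == "NA":
            continue  # anchors scripted as not applicable are excluded
        responses = recorded_by_anchor[anchor]
        rates.append(100.0 * sum(r == intended for r in responses) / len(responses))
    return sum(rates) / len(rates)

scripted = {5: "AE", 14: "NA", 20: "ME"}     # intended test-case behaviors
recorded = {5:  ["AE", "ME", "AE", "AE"],    # hypothetical rater responses
            14: ["ME", "NA", "ME", "NA"],
            20: ["ME", "ME", "NI", "ME"]}
print(f"{exact_agreement(scripted, recorded):.2f}%")  # -> 75.00%
```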
In general, residents tended to be "more critical" after watching the video, while the attendings were "less critical." The attendings' changes after the video, however, presented more of a mixed picture. It is interesting to note that after the educational video, the senior residents became significantly more negative in their SDOT evaluations. It may be inferred that after watching the video, residents were comfortable using the SDOT evaluation to provide helpful feedback and constructive criticism, rather than being concerned about critically evaluating their peers. The initial SDOT manuscript noted the challenge of providing negative feedback, so education around the SDOT tool may help residency programs identify specific areas of weakness for trainees (1). This may be especially true in programs where the faculty are known to be reluctant to provide negative feedback, as has been reported previously (6).

Internally, we performed SDOTs using faculty observers who were not providing clinical care, and we estimate that an SDOT of the initial history and physical examination, as well as a resident–attending presentation, can take up to an hour. Remaining in the department through closure of the case to disposition would, in our experience, require further faculty resources. This concern for faculty time, in conjunction with the broad recommendations for the use of direct observation in the Emergency Medicine Milestones document, led this group to suggest that senior residents could be utilized to perform this labor-intensive task (3). In the original SDOT study, the authors note that using the instrument while contemporaneously working clinically may be difficult (1). The use of senior residents to conduct SDOTs may free faculty resources for other programmatic needs.

Limitations

This study was completed within a single 4-year, dually approved residency program, which limits its external validity. Eight of the faculty and 11 of the residents who participated in the study had prior training in the evaluation of resident performance.
Specific training may have included encounters with feedback based on core competency performance. A comprehensive survey outlining the specifics of prior training was not obtained; it should be assumed that prior training may have biased participants' performance in this study and confounded the results. The small sample size precluded us from separating our analysis by PIF-designated teaching faculty and clinical faculty. Further, many of the study participants were lost to follow-up: the post-test cohort of 13 attendings was significantly smaller than the pretest cohort of 26. This substantial loss of study participants confounds the impact of the intervention and the interpretation of the postintervention relationship between the attendings and senior residents.

The largest limitation may be the internally developed video. While the study was a pilot, the quality of both the depiction of the scripted behaviors and the educational intervention has not been validated. This is likely the single largest reason why the exact agreement in this cohort differs from prior publications (5). Improvement would also be expected after watching the same video a second time, regardless of the educational impact of the training tool.

CONCLUSIONS

Exposure to an independently developed training video tended to raise attending scores of resident performance, without statistical significance, while statistically significantly lowering the scores provided by senior residents. This suggests that the senior residents may have been assigning more liberal pretraining scores to their peers. In our cohort, attending physicians and senior emergency residents rarely differed in their application of the SDOT with statistical significance, either before or after the training video. About half of scripted SDOT anchor behaviors were recorded with exact agreement for both groups, pre- and post-test. The lack of significant difference between the two groups was found for both scripted anchors and unscripted global assessments, and it suggests that senior residents may be able to function as SDOT assessors. If utilized as SDOT evaluators, senior residents may benefit from training in the use of the instrument.

Acknowledgments—The authors would like to acknowledge the research coordination and study supervision efforts of Kimberly M. Hamilton, BA, Valerie A. Rupp, RN, MSN, CRNP, Emese Futchko, MSN, RN, CCRC, Bernadette Glenn-Porter, BS, and Anita Kurt, PhD, RN. They would also like to thank Kristine Petre, MLS, CM, AHIP, for her senior librarian efforts, and Bruce Stouch, PhD, for his statistical consulting expertise. This study was funded in part by an unrestricted educational grant from PCOM MEDNet, Philadelphia, Pennsylvania.

REFERENCES

1. Chapman DM, Hayden S, Sanders AB, et al. Integrating the Accreditation Council for Graduate Medical Education core competencies into the model of the clinical practice of emergency medicine. Ann Emerg Med 2004;43:756–69.
2. Shayne P, Gallahue F, Rinnert S, et al. Reliability of a core competency checklist assessment in the emergency department: the Standardized Direct Observation Assessment Tool. Acad Emerg Med 2006;13:727–32.
3. Accreditation Council for Graduate Medical Education and the American Board of Emergency Medicine. Single accreditation system for AOA-approved programs. http://acgme.org/acgmeweb/Portals/0/PDFs/Milestones/EmergencyMedicineMilestones.pdf. Published December 2012. Accessed April 23, 2015.
4. Council of Emergency Medicine Residency Directors. CORDTest. http://www.cordtests.org/SDOT.htm. Published 2014. Accessed April 23, 2015.
5. LaMantia J, Kane B, Yarris L, et al.
Real-time inter-rater reliability of the Council of Emergency Medicine Residency Directors Standardized Direct Observation Assessment Tool. Acad Emerg Med 2009;16(suppl 2):S51–7.
6. Yarris LM, Linden JA, Gene Hern H, et al. Attending and resident satisfaction with feedback in the emergency department. Acad Emerg Med 2009;16(suppl 2):S76–81.

ARTICLE SUMMARY

1. Why is this topic important?
Emergency attendings' and senior residents' Standardized Direct Observation Assessment Tool (SDOT) scores rarely differed; about half of anchor behaviors were recorded with exact agreement. This suggests senior residents can participate in SDOT assessment.

2. What does this study attempt to show?
The study's aim was to assess whether a training video/DVD improved attendings' and senior residents' use of the SDOT.

3. What are the key findings?
Exposure to a training video significantly lowered resident scores, suggesting they may have been assigning more liberal pretraining scores to peers. Education to improve their objectivity may be useful.

4. How is patient care impacted?
The SDOT is a tool developed by the Council of Emergency Medicine Residency Directors designed specifically to assess residents based on the Accreditation Council for Graduate Medical Education's core competencies. Better training of residents in interpersonal and medical communication skills should translate into improved, more professional patient care.