Monitoring Competency-Based Medical Education Uptake: Analysis of Entrustable Professional Activity Submission, Expiration, and Assessment Scores
ABSTRACT
Background Program directors need concrete indicators to monitor uptake of competency-based medical education (CBME). Entrustable professional activity (EPA) observation completion rates offer practical measures of CBME adoption.
Objective In this study, we used residents’ EPA observation data in clinical departments, specifically the submission and expiration of EPA observation forms and assessment scores, to explore the uptake of CBME practices across departments. Our research question was: What are the patterns and contributing factors (department group, resident year, calendar year, program size) associated with EPA observation submission rates, expiration rates, and assessment scores?
Methods We conducted exploratory analysis of de-identified EPA observation data (n=233 176) from residents’ electronic portfolios (n=2110) across 45 programs in 12 departments at one Canadian institution from 2018 to 2023. Descriptive statistics summarized submission, expiration, and score distributions. Spearman correlations and logistic regression examined 4 predictors: department group, resident year, calendar year, and program size.
Results EPA submission rates (81.0%), expiration rates (7.7%), and assessment O-scores (M=4.4 out of 5) did not differ significantly by department group. Each additional calendar year increased the odds of an independent score (O-score of 5) by 26.3% (OR, 1.263; 95% CI, 1.259-1.267), while resident year (OR, 0.818; 95% CI, 0.813-0.825) and program size (OR, 0.995; 95% CI, 0.994-0.996) decreased those odds.
Conclusions EPA submission, expiration, and scoring patterns are consistent across departments and correlate with implementation year, resident training stage, and program size.
Introduction
Despite the 2017 national rollout of competency-based medical education (CBME) in Canada, there are few quantitative measures for evaluating the adoption of key aspects of CBME implementation, even though such data could inform continuous quality improvement (CQI). Completion rates of entrustable professional activity (EPA) observations serve as direct, actionable metrics for CQI.1
To address this gap, we examined 3 indicators of CBME uptake—EPA submission rates, expiration rates, and assessment scores—across 12 departments and explored how these metrics varied by department group, resident year, calendar year, and program size. These factors were chosen for their relevance to supervisory accountability structures and program resources. The research question guiding the study was: What are the patterns and contributing factors (department group, resident year, calendar year, and program size) associated with EPA observation submission rates, expiration rates, and assessment scores?
Methods
Participants
We used routinely collected resident assessment data for secondary analysis. EPA observations (n=233 176) from residents’ electronic portfolios (n=2110) were collected from 45 residency programs within 12 departments at one Canadian institution from 2018 to 2023. Data were de-identified by data stewards in the Office of Postgraduate Medical Education who were outside the study team. Each year there are approximately 875 residents, and the number of residents per program ranged from 2 to 436. Programs included in this study launched CBME between 2017 and 2023. Results are reported in 4 groups of related departments to ensure anonymity, as described in Table 1. While the initiation and completion of EPA observations involve participation of both the resident and the supervisor, supervisors are often the rate-limiting factor.2 Supervisors at this institution are accountable to their department chairs, rather than to residency program leadership, for their teaching engagement and quality. For these reasons, the supervisors’ department was chosen as the unit of analysis rather than the residents’ program. The number of residents (2018 to 2023) in each department group was as follows: acute care, 322; diagnostic, 94; surgical, 629; and medical, 1065.
Materials
The institution’s Office of Postgraduate Medical Education provided program-level details: program name, annual enrollment, CBME implementation start year, and stage duration benchmarks. Each EPA record included initiation and submission time stamps, expiration status, and an assessment score on a 1-5 entrustment scale (“O-score,” 1=fully supervised to 5=independent). In 2020, saved forms expired after 30 days; in 2021, this window was reduced to 14 days. Cancelled forms were excluded from analysis.
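For concreteness, a single de-identified EPA record might resemble the sketch below. The field names are hypothetical illustrations only (the portfolio’s actual export schema is not described in this report); they cover just the variables listed above.

```python
# Hypothetical illustration of one de-identified EPA observation record.
# Field names are assumptions, not the portfolio's actual export schema.
epa_record = {
    "program": "Program A",              # de-identified program label
    "dept_group": "medical",             # acute care / diagnostic / surgical / medical
    "resident_year": 2,                  # resident's year in training
    "calendar_year": 2021,               # calendar year of the observation
    "initiated_at": "2021-03-02T14:10",  # form initiation time stamp
    "submitted_at": "2021-03-09T08:45",  # submission time stamp (None if expired)
    "status": "submitted",               # submitted / expired / cancelled
    "o_score": 4,                        # 1 = fully supervised ... 5 = independent
}
```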
Data Analysis
We calculated the EPA submission rate as the number of submitted forms divided by the number of initiated forms, and the expiration rate as the number of expired forms divided by the sum of expired and submitted forms. Descriptive statistics, including 95% confidence intervals, summarized submission rates, expiration rates, and mean O-scores by department group. To examine factors associated with EPA scoring, we performed Spearman correlation analyses between mean O-scores and 4 predefined predictors: department group, resident year in training, calendar year of data collection, and program size (number of residents). We then used logistic regression to model the odds of achieving an independent score (O-score=5) from these same predictors.
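The sketch below illustrates how these quantities could be computed from a one-row-per-form table. It is a minimal illustration only, not the authors’ analysis code; the file name and column names (status, o_score, dept_group, resident_year, calendar_year, program_size) are assumptions matching the hypothetical record above.

```python
# Minimal sketch of the reported analyses; data layout and column names are assumed.
import numpy as np
import pandas as pd
from scipy.stats import spearmanr
import statsmodels.formula.api as smf

epa = pd.read_csv("epa_observations.csv")   # hypothetical de-identified export
epa = epa[epa["status"] != "cancelled"]     # cancelled forms were excluded

# Submission rate: submitted forms / initiated (non-cancelled) forms
n_initiated = len(epa)
n_submitted = int((epa["status"] == "submitted").sum())
submission_rate = n_submitted / n_initiated

# Expiration rate: expired forms / (expired + submitted forms)
n_expired = int((epa["status"] == "expired").sum())
expiration_rate = n_expired / (n_expired + n_submitted)

# Spearman correlations between O-scores and the continuous predictors
scored = epa.dropna(subset=["o_score"]).copy()
for predictor in ["calendar_year", "resident_year", "program_size"]:
    rho, p_value = spearmanr(scored["o_score"], scored[predictor])
    print(predictor, round(rho, 2), p_value)

# Logistic regression: odds of an "independent" score (O-score = 5)
scored["independent"] = (scored["o_score"] == 5).astype(int)
model = smf.logit(
    "independent ~ C(dept_group) + resident_year + calendar_year + program_size",
    data=scored,
).fit()
print(np.exp(model.params).round(3))        # odds ratios
print(np.exp(model.conf_int()).round(3))    # 95% CIs on the odds ratio scale
```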
This study was approved by the institutional research ethics board (#Pro00140071).
Results
Table 1 summarizes EPA submission rates, expiration rates, and mean assessment scores by department group. Overall, the submission rate was 81.0% (95% CI, 80.8-81.2), with departmental rates ranging from 77.6% to 83.3% (difference=5.7 pp; 95% CI, 5.5-5.9 pp). Expiration rates averaged 7.7% (95% CI, 7.6-7.8), ranging from 4.6% to 9.2% (difference=4.6 pp; 95% CI, 4.4-4.8 pp). Mean O-scores were 4.40 (95% CI, 4.39-4.41), with department means between 4.36 and 4.48 (difference=0.12; 95% CI, 0.11-0.13).
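The precision of these estimates reflects the large sample. The brief does not state the interval method, but a normal-approximation (Wald) interval, treating the full sample of 233 176 observations as the approximate denominator, reproduces the reported bounds for the overall submission rate:

```latex
% Assumed Wald 95% CI for the overall submission rate (method and denominator are assumptions)
\hat{p} \pm 1.96\sqrt{\frac{\hat{p}(1-\hat{p})}{n}}
  = 0.810 \pm 1.96\sqrt{\frac{0.810 \times 0.190}{233\,176}}
  \approx 0.810 \pm 0.0016
```

That is, approximately 80.8% to 81.2%, consistent with the interval reported above.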
Correlations
We examined 4 predictors—calendar year, resident year in training, program size, and department group—and their association with mean O-scores. As the Figure shows, Spearman correlations were strongest for calendar year (ρ=0.48; 95% CI, 0.47-0.49) and weaker for resident year (ρ=-0.13; 95% CI, -0.14 to -0.12) and program size (ρ=-0.10; 95% CI, -0.11 to -0.09).
Figure: Correlation Among Variables
Predictors of Independent Scores
Logistic regression (Table 2) indicated that each additional calendar year increased the odds of an independent O-score by 26.3% (OR, 1.263; 95% CI, 1.259-1.267), whereas each additional resident training year reduced those odds by 18.2% (OR, 0.818; 95% CI, 0.813-0.825) and each additional resident in a program reduced them by 0.5% (OR, 0.995; 95% CI, 0.994-0.996). Submission completion time showed a nonsignificant association (OR, 0.99; 95% CI, 0.99-1.00).
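For readers converting between the 2 formats, the percentage changes above follow directly from the odds ratios (shown here with the unrounded values reported in the abstract):

```latex
% Percent change in odds implied by each odds ratio
(1.263 - 1)\times 100\% = +26.3\% \qquad
(0.818 - 1)\times 100\% = -18.2\% \qquad
(0.995 - 1)\times 100\% = -0.5\%
```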
Discussion
While department grouping did not significantly affect EPA submission, expiration, or assessment score metrics, later implementation years were associated with higher submission rates and faster completion times. Additionally, residents received fewer “independent” EPA scores as they progressed, reflecting the increasing complexity of later-stage EPAs.
Several prior studies have debated whether procedural versus diagnostic specialties experience smoother CBME adoption.3,4 Our cross-departmental analysis extends this work by demonstrating minimal variation across specialty clusters, suggesting that program-level systems (eg, electronic portfolios, supervisory accountability) promote uniform adoption.
The staged Royal College of Physicians and Surgeons of Canada CBME framework provides a conceptual basis for our progression findings.1 Early stages emphasize simpler EPAs achieved rapidly, whereas the longer core stage contains complex activities requiring more time. Our regression results—showing each additional training year reduced odds of an independent score (OR, 0.82; 95% CI, 0.81-0.83)—support the need for stage-specific benchmarks in CQI dashboards.
Limitations include reliance on a single institution’s data and binary categorization of scores, which may oversimplify competence decisions. Future multicenter studies incorporating narrative feedback and committee deliberations would deepen understanding of CBME uptake.
Conclusions
Overall, our findings demonstrate that EPA submission rates, expiration rates, and assessment scores serve as consistent, department-agnostic metrics of CBME adoption and are significantly associated with implementation year, resident training stage, and program size.
