You rolled out the new-process training last quarter. The completion dashboard glowed green: 98% of the team finished. Satisfaction scores were high, and everyone aced the end-of-session quiz. Two months later, the team is still doing it the old way, or the tickets still come back wrong. Every metric said the training worked. The work says it didn’t.
That gap has a name. Training transfer is whether what people learned in training actually shows up in the work they do, and it is the one thing a completion dashboard cannot see.
Completion is attendance. Transfer is the work changing.
Why Traditional L&D Metrics Miss the Mark
#Most L&D metrics measure activity, not capability. They capture what happened during or right after training, not what the team can still do on the job weeks later. Three metrics leaders trust, and what each actually measures:
- Completion rates: attendance with a timestamp. It tells you someone clicked through the modules. It says nothing about whether they can do the work.
- Satisfaction scores (the “smile sheet”): whether people liked the training, not whether it stuck. A high score rates the experience in the room, not the behavior back at the desk.
- End-of-Session Quizzes: An immediate quiz mostly tests recognition in the one moment the material is freshest. People can pick the right answer without being able to produce it later on the job, which is the illusion of competence: fluency in the room mistaken for durable mastery. In a 2006 study, Roediger and Karpicke found that people who practiced retrieval remembered far more on a delayed test a week later than those who re-read, even though the re-readers felt more confident right after. The task was narrow (undergraduates recalling prose), but the pattern travels: the moment right after training is exactly when the weakest method looks strongest. This is also the natural home of an AI-generated quiz, which defaults to that same in-the-moment recognition.
Each of these metrics measures something upstream of actual behavioral change. They can all read “success” while nothing on the job has changed, leaving leaders with capability debt: a gap between what is documented as learned and what the team can actually still do.
What Training Transfer Actually Requires
#Training transfer is unaided performance on the real task, after a delay, under conditions that resemble the job. It runs on a few memory mechanisms: retrieval practice and spacing are what move knowledge from a passive “seen it” to an active “can do it.” Without them, knowledge decays by default, and the skill gaps stay invisible until an incident exposes them. The distance between “trained” and “can still do it” is a hidden liability, one that comes due at the worst possible time: an incident, an audit, a customer.
How to Measure Training Transfer Effectively
#Measuring transfer means watching the work, not the attendance. Four moves:
- Define the observable change before training. Name the specific behavior or output that should look different on the job. If you cannot name what the team should be doing differently, you cannot measure transfer. Not “understand the new software,” but “submit the expense report correctly in the new system within a day of the purchase.”
- Check at a delay, unaided, on real work. Skip the in-session quiz. Weeks or months later, when the material is no longer fresh, have them do a real task without the training materials or prompts open. That is the condition the job actually runs under.
- Watch the work, not the self-report. People overestimate their own readiness right after training, off the same fluency that fooled them. So sample real output, observe the task, or read the quality numbers tied to the trained behavior, instead of asking “do you feel confident?”
- Look for the behavior in the wild. Is the new process actually being followed? Are the errors the training targeted going down? A completion dashboard tells you nothing about that; a drop in a specific kind of mistake tells you everything.
A training program’s success is not an event on a dashboard the day it ends. It is a change in the work, weeks later. Before you count a program as done, decide what behavior would prove it worked, and go look for that behavior on the job. If you cannot name what should be different, you were never measuring whether it transferred, only whether it was consumed.
Capability debt is payable. The teams that stay ahead of it are not the ones with the busiest dashboards; they are the ones that decide what steady, on-the-job performance should look like and then go check for it. For the mechanism underneath all of this, how memory turns training into something that holds, start with how memory works in teams.
Sources
#- Roediger, H.L. III, and Karpicke, J.D. (2006). Test-Enhanced Learning: Taking Memory Tests Improves Long-Term Retention. Psychological Science, 17(3), 249-255.
