EDORA Learn â Methods
Longitudinal Tracking and Cohort Linkage
Following the same people or cases over time turns isolated data points into trajectories. Longitudinal tracking is what allows researchers to ask not just âwhat happened,â but âwhat happened next.â
What Longitudinal Tracking Does
- Links an individualâs or caseâs records across months, years, or programs.
- Aligns data around key milestonesâsuch as first intake, release, or program completionâto measure change over consistent follow-up windows.
- Enables calculation of transition probabilities, durations, and repeat events.
Designing Cohorts
Cohorts form the backbone of longitudinal analysis. A closed cohort starts at a fixed time and admits no new members, while an open cohort continually adds entrants. Clear start and end rules ensure each observation has equal opportunity for follow-up.
- Start rule: Define the initiating eventârelease, admission, or intake date.
- End rule: Define the observation cutoff (6, 12, 24 months or fixed calendar date).
- Inclusion/exclusion: Note how missing dates, transfers, or re-entries are handled.
Tracking Across Systems
Multi-year linkage requires persistent identifiers. When these are unavailable, researchers use probabilistic linkageâmatching names, birthdates, or demographic traits across years. Maintaining a crosswalk of IDs is essential for longitudinal continuity, but privacy controls must prevent re-identification.
- Unique IDs: Stable keys that follow individuals or cases across reporting cycles.
- Crosswalk tables: Secure mappings between old and new IDs when systems migrate.
- Audit logs: Record linkage decisions to ensure reproducibility and error checking.
Attrition and Right-Censoring
Over time, cases drop out because they age out, move, or stop reporting. Distinguishing attrition (true exit) from data loss is critical. Mark censored cases explicitly and estimate the effect on observed rates rather than ignoring them.
Privacy and Stability
Longitudinal linkage heightens privacy risk because re-identification becomes easier as more time-linked details accumulate. Balancing analytic power with confidentiality requires pseudonymization, strict governance, and occasional re-keying of identifiers.
Data & Methods
The research file emphasizes that the best longitudinal systems combine deterministic and probabilistic linkage, document cohort entry/exit rules, and routinely test for bias introduced by attrition. Metadata should record every cohortâs start rule, time window, and linkage procedure to maintain transparency across years.
Related
Transparency note: Every longitudinal dataset should include a cohort definition file, a linkage summary, and attrition counts. Without these, trend analysis risks mistaking missing data for real change.