Presentation Title:
From Field to Forecast: Building a Reproducible Analytical Pipeline for IOM DTM Data in R
This 45-minute presentation is structured to balance high-level humanitarian context with technical R implementation. It is designed to keep both “Domain Experts” and “R Developers” engaged by alternating between Why (the mission) and How (the code).
Total Duration: 45 Minutes
Format: 35 Min Presentation + 10 Min Q&A
I. Introduction: The Data-Driven Humanitarian (5 Minutes)
The Mission: Brief overview of IOM’s role in global migration.
The Problem: The “Data Silo” vs. “The Need for Speed.” Why manual spreadsheets fail in rapid-onset crises.
The Solution: An end-to-end R pipeline that ensures transparency, speed, and reproducibility.
II. Domain Architecture: What are we actually measuring? (7 Minutes)
- The DTM Framework: explaining the “Four Pillars” (Mobility Tracking, Flow Monitoring, Registration, and Surveys).
- Data Granularity: Understanding Admin Levels (0–2) and the concept of “Data Rounds.”
- P-Codes: The “Primary Key” of the humanitarian world that links IOM data to the rest of the UN system.
III. Technical Architecture: Connecting R to the IOM API (8 Minutes)
- The API Gateway: Introduction to DTM API v3.
- The Authentication Layer: Using
.Renvironto protect subscription keys. - The Ingestion Script: * Using
httr2for API requests. - Using
jsonliteto flatten nested JSON structures into Tibbles. - Live Snippet: Showing a clean 10-line block of code that pulls live IDP figures.
IV. The Tidyverse Engine: Wrangling Humanitarian Data (10 Minutes)
- Standardizing Chaos: Cleaning inconsistent
place-names and handling
NAvalues. - Longitudinal Logic: Using
group_by()andlag()to calculate the velocity of displacement between rounds. - Demographic Reshaping: Using
pivot_longer()to prepare Sex and Age Disaggregated Data (SADD) for population pyramids. - Vulnerability Indexing: Creating a weighted
“Severity Score” using
mutate()andcase_when().
V. Spatial Analysis & Visualization (7 Minutes)
- Mapping with
sf: Joining DTM metrics with geographic shapefiles. - Effective Visuals: * Choropleth Maps: Visualizing IDP density.
- Trend Lines: Tracking flow volumes over time.
- The “So What?”: Moving from “The chart looks nice” to “This is where the mobile clinic needs to go.”
VI. Data Ethics: Security & Disclosure Control (3 Minutes)
- The “Do No Harm” Principle: Why we never share raw microdata.
- Statistical Disclosure Control (SDC): * A brief
look at the
sdcMicropackage. - Techniques: K-anonymity and Local Suppression to protect vulnerable individuals.
VII. Conclusion & The Future of Analysis (5 Minutes)
- Summary: How R bridges the gap between field enumerators and strategic decision-makers.
- Looking Ahead: Toward Anticipatory Action—using R to link DTM data with climate or economic indicators for predictive modeling.
VIII. Q&A / Discussion (10 Minutes)
- Open floor for technical questions (API limits, R packages) and domain questions (Data reliability, IOM methodology).
Presentation Notes for the Speaker:
For the R Experts: Remind them that “Admin 2” is essentially a
JOINkey.For the DTM Experts: Remind them that “Tidyverse” is essentially a more powerful, transparent version of an Excel Pivot Table.
Visual Aid: Keep code snippets on the left and the resulting “Humanitarian Map” on the right to show the direct impact of the script.