HMIS Data Adjustment and Scenario Creation

Handling outliers and incompleteness through multi-scenario analysis

INPUTS

Raw HMIS Data Service volumes by facility, indicator, month
Outlier Flags From Module 1 (DQA)
Completeness Flags From Module 1 (DQA)
1

Data preparation

  • Merge datasets by facility, indicator, and month
  • Identify excluded indicators (e.g., deaths)
  • Check for low-volume indicators requiring filtering
  • Validate facility-indicator combinations
2

Calculate rolling averages

  • Centered 6-month average (3 months before + 3 months after)
  • Forward 6-month average (next 6 months)
  • Backward 6-month average (previous 6 months)
  • Facility-specific historical mean (across all available data)

Apply adjustment hierarchy

For outliers
  • Use centered average
  • Use forward average
  • Use backward average
  • Use same month last year
  • Use historical mean
For missing/incomplete
  • Use centered average
  • Use forward average
  • Use backward average
  • Use historical mean

Create four scenarios

Scenario 1: None Original data (no adjustments)
Scenario 2: Outliers Only Adjust flagged outliers only
Scenario 3: Completeness Only Adjust missing/incomplete data only
Scenario 4: Both Adjustments Apply both outlier and completeness adjustments

Geographic aggregation

Facility Level
Individual facilities
Subnational Level
Provinces, districts
National Level
Country total
Each geographic level maintains all 4 scenarios

OUTPUTS

  • M2_adjusted_data.csv Facility-level data with all 4 scenarios
  • M2_adjusted_data_admin_area.csv Subnational aggregated data with all 4 scenarios
  • M2_adjusted_data_national.csv National-level aggregated data with all 4 scenarios
  • M2_low_volume_exclusions.csv Metadata on excluded indicators and facilities