Create summary stats at the region level
- aggregate on GID_1 for DHS data and geolev1 for census
- make sure that both the region-level and the district-level summary stats have a unique identifier so that they can be merged (for easier comparison)
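
A rough sketch of the two steps above, in plain Python, in case it helps as a starting point. Only `GID_1` and `geolev1` come from the notes; the district column `GID_2`, the `wealth` variable, the sample rows, and the `summarize` helper are all made up for illustration. The census side would presumably work the same way with `geolev1` in place of `GID_1`.

```python
from collections import defaultdict
from statistics import mean


def summarize(rows, keys, value):
    """Group rows by the key columns and return the mean and count per group."""
    groups = defaultdict(list)
    for row in rows:
        groups[tuple(row[k] for k in keys)].append(row[value])
    return [
        {**dict(zip(keys, group)), "mean": mean(vals), "n": len(vals)}
        for group, vals in sorted(groups.items())
    ]


# Hypothetical DHS-style records (GID_2 and wealth are assumed names).
dhs = [
    {"GID_1": "R1", "GID_2": "R1.D1", "wealth": 2.0},
    {"GID_1": "R1", "GID_2": "R1.D1", "wealth": 4.0},
    {"GID_1": "R1", "GID_2": "R1.D2", "wealth": 6.0},
    {"GID_1": "R2", "GID_2": "R2.D1", "wealth": 1.0},
]

# Region level: aggregate on GID_1.
region_stats = summarize(dhs, ["GID_1"], "wealth")

# District level: keep GID_1 on every row so it can serve as the merge key.
district_stats = summarize(dhs, ["GID_1", "GID_2"], "wealth")

# The identifier must be unique at the region level before merging.
region_by_id = {r["GID_1"]: r for r in region_stats}
assert len(region_by_id) == len(region_stats), "GID_1 is not unique"

# Attach the region-level stats to each district row for comparison.
merged = [
    {
        **d,
        "region_mean": region_by_id[d["GID_1"]]["mean"],
        "region_n": region_by_id[d["GID_1"]]["n"],
    }
    for d in district_stats
]
```

Keeping the region ID on the district rows means the merge is many-to-one, so the uniqueness check only has to hold on the region side.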
@kdurizzo feel free to add any additional notes/to-dos