About Our Data Products
To access AidData's latest core research release, which contains a downloadable snapshot of AidData's entire database, as well as our country-level research datasets, please follow this link: Country-Level Research Datasets.
To access AidData's sub-national, geospatial research datasets, please follow this link: Sub-National, Geospatial Research Datasets.
To access AidData's research datasets on GCC and Chinese development finance, please follow this link: Donor Datasets.
Data Product Processing Levels
AidData's data products are built using hierarchical data processing levels, numbering 1-4 (with sub-processing Levels like Level 1a). Each processing level is built upon the previous level (meaning the Level 1 product is derived from the Level 0 product and so on). Table 1 below lists the processing levels used by AidData in data production, with their notional meanings and descriptions.
AidData Data Level
Minimally processed raw data to Level 0 field names and table structure per the product line, converted to UTF-8. This product is for internal use only. This data will never be part of a public release.
Data post processed to Level 1 field names; Data decomposed to proper table structure by the product definition; financials deflated to base years where possible. Sectors crosswalked to aiddata sectors. Geocoded data is spatially scrubbed (verified to be in the correct boundary). Fields checked for intra and inter field consistency. Level 1 stats generated. Quality assurance (QA) flags added. This Level may include ancillary data included in the release, but is not quality assured by AidData. This is the first data product which is available as a public release.
A single table, denormalized version of the Level 1 product, with provided assumptions. This is a ‘joined’ product of Level 1 constituent tables or data.
Level 1 + any quality assured ancillary data (e.g. evaluation data); Aggregates and rollups of data by a data dimension (e.g. by Sector or Donor)
Rasterizations - Continuous surface representations of our Aid Information. For geocoded data, a geospatial representation of (a) the total dollars of aid we estimate are at a given location, and (b) the number of projects. This product additionally provides at least one surface of the uncertainty in our continuous estimates.
Simulated Products - These products are further refined versions of the Level 2 product line. For example, a product might produce a continuous surface estimate of aid projects that is weighted to account for slope, population, or road networks. These products will also provide a surface of uncertainty.