GIS and Academic Portfolio
Replication of
Original study by Malcomb, D. W., E. A. Weaver, and A. R. Krakowka. 2014. Vulnerability modeling for sub-Saharan Africa: An operationalized approach in Malawi. Applied Geography 48:17–30. DOI:10.1016/j.apgeog.2014.01.004
Replication Authors: Jackson Mumper, Joseph Holler, Kufre Udoh, Open Source GIScience students of fall 2019 and spring 2021
Replication Materials Available at: RP-Malcomb
Created: 26 April 2021
Revised: 4 May 2021
The original study is a multi-criteria analysis of vulnerability to Climate Change in Malawi, and is one of the earliest sub-national geographic models of climate change vulnerability for an African country. The study aims to be replicable, and had 40 citations in Google Scholar as of April 8, 2021.
The study region is the country of Malawi. The spatial support of input data includes DHS survey points, Traditional Authority boundaries, and raster grids of flood risk (0.833 degree resolution) and drought exposure (0.416 degree resolution).
The original study was published without data or code, but has detailed narrative description of the methodology. The methods used are feasible for undergraduate students to implement following completion of one introductory GIS course. The study states that its data is available for replication in 23 African countries. Replication is important as a means of ensuring the efficacy of research findings. It is particularly important for this type of study, where the result is highly variable to a somewhat subject list of inputs.
The original study compiled data from three sources: Demographic and Health Survey data, Famine Early Warning Network, and UNEP/GRID-Europe. The former data source was used calculate an adaptive capacity score for each of the Traditional Authorities (TAs) in the country based on assets and access variables in the DHS data. Adaptive capacity was both an outcome variable as well as an input variable in the vulnerability calculations, alongside livelihood sensitivity (from FEWSNET data) and physical exposure (from UNEP/GRID data). The full metric used by Malcomb et al to calculate adaptive capacity and household resilience can be found in their original Table 2, listed as Figure 1 below.
Figure 1: Inputs of adaptive capacity (from Malcomb et al, Table 2)
While Malcomb does not explicitly state the exact data variables from which the indicators in Table 2 were found in each dataset, the tend to be easily identifiable for the DHS variables, with names that were often identical to names in the DHS.
The FEWSNET data used to calculate livelihood sensitivity was aggregated based on livelihood zones. This uses only data from the poor wealth group in their spreadsheets. The four variables used to calculate livelihood sensitivity are:
Finally, physical exposure was calculated based on a rasterized layer of estimated flood risk and exposure to drought events.
The original study was conducted using ArcGIS and STATA, but does not state which versions of these software were used. The replication study uses R 4.0.5 in R-Studio 1.2.5033.
The final workflow used in this replication is summarized below. While there is uncertainty as to how closely this matches the workflow of Malcomb et al, this is similar to the narrative workflow outlined in the paper.
Our plan for comparing our analysis results to that of Malcomb et al involved:
This replication sought to reconstruct two primary maps from the original paper by Malcomb et al. In Malcolmb et al these are labeled as Figure 4, a choropleth map of adaptive capacity aggregated by TA, and a Figure 5, a rasterized map of vulnerability across Malawi. Ultimately, while we were able to more or less recreate the adaptive capacity map, the final vulnerability map was far from that put forth by Malcomb et al. Our results are in figures 2 and 3 below.
Figure 2: Recreation of adaptive capacity map
Figure 3: Recreation of vulnerability maps
In the case of adaptive capacity, similar results were found as with Malcomb et al, with southern Malawi generally being shown to have lower adaptive capacity scores than more northern parts of the country. The quantile differences between are summarized below in the confusion matrix in Table 1 and the map in Figure 4.
Table 1: Adaptive Capacity Confusion Matrix | 1 | 2 | 3 | 4 |
1 | 33 | 5 | 0 | 0 |
2 | 26 | 24 | 0 | 0 |
3 | 5 | 43 | 19 | 3 |
4 | 0 | 6 | 30 | 5 |
Figure 4: Map of localized differences in adaptive capacity
Spearman’s rho coefficients were calculated for both adaptive capacity and vulnerability to determine the divergence of this reproduction from the original Malcomb et al paper. For adaptive capacity, spearman’s rho was 0.7870, and for vulnerability, spearman’s rho was 0.2194. This shows that while in both cases the results of this study are correlated better than random with Malcomb’s results, only in the case of adaptive capacity can one say the reproduction performed well.
Differences in quantile classifications of adaptive capacity remained small between the original study and this replication, and the differences that were found were not localized to any specific region in Malawi, indicating that they were likely the result of differences in methodological discrepancies rather than conceptual differences in problem conception.
This was not the case, however, for vulnerability. Replicated vulnerability maps upon initial visual inspection seemed to agree with the original findings that areas in southern Malawi were more vulnerable to climate change. However, the replication introduced a lot of noise in the data that’s not present in the original report. The difference map of vulnerability, as well as a scatterplot comparison of the two raster maps can be found in Figures 5 and 6.
Figure 5: Difference map of climate change vulnerability
Figure 6: Scatterplot of Malcomb et al and replication differences in vulnerability
The differences in vulnerability were widespread, also with little geographic clustering. However, their severity indicates a greater degree of methodological divergence with the original paper than for adaptive capacity. This is particularly worrisome when combined with the fact that this replication almost invariably underestimated vulnerability in the region. This makes sense given the relative ease with which our group was able to identify the DHS variables to use for adaptive capacity, and difficulty in identifying the paper’s methodology for ascertaining livelihood sensitivity in particular.
Due to the lack of replicability standards in publication, there were many areas wherein the methodology described above diverged from that of Malcomb et al, and there were also slight deviations in this analysis from that workflow. Potential methodological deviations included:
Meanwhile, practical deviations in execution included:
This report seeks to replicate Malcomb et al’s adaptive capacity and climate vulnerability models of Malawi. While we were able to mostly replicate their findings for adaptive capacity, methodological uncertainties made it so that we were unable to replicate their final model of vulnerability. This failure is unlikely to have been caused by mistakes made by Malcomb et al in their methods, but rather practical causes. These could include but are not limited to:
Malcomb et al do give an adequate description of the methods and data used to calculate adaptive capacity, and for this reason we were able to more or less accurately replicate and confirm their results. However, the limitations outlined above caused us to be unable to replicate their final vulnerability map.
The results of this replication confirm Malcomb et al’s findings in their adaptive capacity map of Malawi, but not in the results of their final vulnerability model. Further research can be done to examine the vulnerability in the context of Malawi, as well as the efficacy of such models as useful tools for conservation and development planners.
Examining this result through a replication and open source lens reveals much greater implications for for theory. Much geographic and GIS research is currently faced with issues of replicability and reproducibility. There are examples of GIS findings being published in scientific literature with significant errors as the closed nature of much GIS research allows them to go unnoticed (Singleton et al, 1508-1509). While Malcomb et al has clearly made an attempt at reproducibility through their narrative methodological descriptions and explanation of data sources, this attempt still falls short of the level of detail required for a third-party to complete a replication. This creates challenges for the peer review process, making it impossible to truly know the efficacy of these results.
Malcomb, D. W., E. A. Weaver, and A. R. Krakowka. 2014. Vulnerability modeling for sub-Saharan Africa: An operationalized approach in Malawi. Applied Geography 48:17–30. DOI:10.1016/j.apgeog.2014.01.004
Singleton, A. D., S. Spielman and C. Brunsdon (2016) Establishing a framework for Open Geographic Information science. International Journal of Geographical Information Science 30:8, 1507-1521. DOI: 10.1080/13658816.2015.1137579
Tate, E. 2013. Uncertainty Analysis for a Social Vulnerability Index. Annals of the Association of American Geographers 103 (3):526–543. doi:10.1080/00045608.2012.700616.
Thank you to Professor Joe Holler, Kufre Udoh, and the rest of the Geog323 class for the collaborative spirit and aid in this project. Particular thanks to, Arielle Landau, Evan Killion, Steven Montilla-Morantes, Sanjana Roy, and Maddie Tango.