Skip to main content

A novel adaptation of spatial interpolation methods to map health attitudes related to COVID-19

Abstract

Background

The COVID-19 pandemic presented substantial challenges to public health stakeholders working to vaccinate populations against the disease, particularly among vaccine hesitant individuals in low- and middle-income countries. Data on the determinants of vaccine hesitancy are scarce, and often available only at the national level. In this paper, our goal is to inform programmatic decision making in support of local vaccine uptake. Our analytical objectives to support this goal are to (1) reliably estimate attitudinal data at the hyperlocal level, and (2) estimate the loss of data heterogeneity among these attitudinal indicators at higher levels of aggregation. With hyperlocal attitudinal data on the determinants of vaccine hesitancy, public health stakeholders can better tailor interventions aimed at increasing uptake sub-nationally, and even down to the individual vaccination site or neighborhood.

Methods

We estimated attitudinal data on the determinants of vaccine hesitancy as framed by the WHO’s Confidence, Complacency, and Convenience (“3Cs”) Model of Vaccine Hesitancy using a nationally and regionally representative household survey of 4,922 adults aged 18 and above, collected in February 2022. This custom survey was designed to collect information on attitudes towards COVID-19 and concerns about the COVID-19 vaccine. A machine learning (ML) framework was used to spatially interpolate metrics representative of the 3Cs at a one square kilometer (1km2) resolution using approximately 130 spatial covariates from high-resolution satellite imagery, and 24 covariates from the 2018 Nigeria Demographic and Health Survey (DHS).

Results

Spatial interpolated hyperlocal estimates of the 3Cs captured significant information on attitudes towards COVID-19 and COVID-19 vaccines. The interpolated estimates held increased heterogeneity within each subsequent level of disaggregation, with most variation at the 1km2 level.

Conclusions

Our findings demonstrate that a) attitudinal data can be successfully estimated at the hyperlocal level, and b) the determinants of COVID-19 vaccine hesitancy have large spatial variance that cannot be captured through national surveys alone. Access to community level attitudes toward vaccine safety and efficacy; vaccination access, time, and financial burden; and COVID-19 beliefs and infection concerns presents novel implications for public health practitioners and policymakers seeking to increase COVID-19 vaccine uptake through more customized community-level interventions.

Background

As of late September 2022, over 6.5 million known individuals have died as a result of COVID-19 and over 12 billion COVID-19 vaccines have been administered [1]. During the first 12 months that the vaccine was available, vaccines yielded an estimated 14.4 million deaths averted, based on official COVID-19 mortality counts, or 19.8 million deaths averted using excess death figures across 185 countries globally [2]. Despite this, the majority of low-income countries have vaccination rates below 40% for target populations, compared to rates above 70% in most high-income countries [1].

Need for a hyperlocal response

Several challenges have hampered vaccine uptake in low- and middle-income countries (LMICs), particularly in sub-Saharan Africa, including vaccine supply, attitudes towards vaccination [3], and last mile logistics. According to the Strategic Advisory Group of Experts on Immunization (SAGE) - the primary advisory committee to the World Health Organization (WHO) for immunization and vaccination - vaccine uptake is a complex and dynamic global problem with determinants that vary a) depending on context, time, place, program, and specific vaccine, b) across and within countries, and c) between population subgroups based on geographic, cultural, socioeconomic, and other factors [4]. For government health agencies and implementing organizations to be able to conduct effective vaccination programs within their respective locations, developing a local understanding of barriers to vaccine uptake as well as determinants of hesitancy is critical.

Lack of sub-national data

Traditionally data on vaccine hesitancy is limited to available datasets that are typically representative of either the national, or at most the first administrative level [5]. National-level statistics regarding access and attitudes towards vaccines, while helpful, obscure local differences and are not informative in developing locally targeted vaccination programs, or risk communication and community engagement programs. However, identifying more granular variation in the determinants of vaccine hesitancy, which are driven by both population characteristics as well as attitudes, presents a challenge. Previous research has relied on non-representative samples supplemented by very large sample sizes collected through social media websites such as Facebook [6,7,8]. However, subsequent research has highlighted the limited utility of increasing sample size to overcome a non-representative sampling methodology [9].

Benefits of spatial interpolation

Spatial interpolation methods driven by machine learning algorithms provide an avenue for estimating hyperlocal data that would otherwise be cost prohibitive. Collecting data at the hyperlocal level is usually restricted to large scale surveys like national censuses due to the time and resource intensity of data collection. Spatial interpolation methods allow us to instead impute hyperlocal data at a fraction of the cost due to smaller sample size requirements since these methods only require demographically and geographically representative household surveys and high-resolution satellite imagery to estimate data across missing clusters in the survey data. Spatial interpolation has been used previously [10], including in a public health context to understand patterns of HIV risk among adolescent girls and young women [11] and to create spatial layers using the Demographic and Health Surveys (DHS) [12].

Spatial interpolation methods were used by this project to create highly detailed, hyperlocal data at a 1km2 level to aid in understanding the geographic distribution of factors that might help or hinder vaccination efforts. This level of granularity was chosen for three key reasons: (1) many of the inputs to the modeling process are derived from high resolution satellite imagery that is more readily available at the 1km2 level; (2) this is sufficiently granular to inform community level public health programming, such as immunization microplanning; and (3) estimates can easily be aggregated to any custom catchment area or administrative boundary larger than 1km2. More information on how 1km2 level estimates readily align with existing Nigeria health microplanning can be found in the discussion section below. Of particular interest was the development of metrics that speak to health attitudes and behaviors, as machine-learning driven methods of spatial interpolation have not yet been utilized to interpolate representative attitudinal data at a hyperlocal level. This novel application of spatial interpolation methods has potentially wide-spread utility for public health practitioners in understanding behavioral drivers and barriers to public health interventions around the globe for a plethora of disease states and health issues beyond COVID-19.

The 3Cs of vaccine hesitancy

The SAGE Working Group report highlights the utility of the Confidence, Complacency, and Convenience (3Cs) model of vaccine hesitancy, wherein the 3Cs are three primary determinants that either bolster or hinder vaccination against a given disease in a given location [4]. Confidence is an individual’s trust in the safety and efficacy of the vaccine, the system that delivers them, and the motivations of policymakers [4]. Complacency is an individual’s perceived risk to a disease and necessity of preventative action [4]. Convenience is the availability, affordability, accessibility, and appeal of immunization services [4]. These determinants, together, provided the necessary framework for understanding the underlying drivers and barriers to COVID-19 vaccine uptake, including attitudinal determinants. The 3Cs model also allows us to capture the heterogeneity of vaccine uptake determinants between subgroups across countries. For public health microplanners or decentralized health intervention resource management, hyperlocal data on attitudes towards the risk of COVID-19 and concerns about vaccination can support more efficient targeting of resources and more effective local program design.

The multi-country project

Johnson and Johnson’s Global Public Health (J&J GPH) team partnered with Fraym, a producer of geospatial population data, as part of an initiative to support the uptake of all types of COVID-19 vaccines in sub-Saharan Africa. Fraym’s contribution was to develop hyperlocal data on population characteristics and attitudes related to COVID-19 across Ethiopia, Ghana, Kenya, Malawi, Mali, Nigeria, Rwanda, South Africa, Uganda, and Zambia. For several of these countries our project team used existing and publicly available data to develop ‘proxy’ models, which estimated the determinants of vaccine hesitancy Since these data were primarily collected prior to the COVID-19 pandemic, they lacked direct questioning on perceptions of COVID-19 and concerns regarding COVID-19 vaccination. Due to this limitation, proxy indicators were used to capture trends in expected behavior.

Over the last two years, our team conducted custom national and regionally representative household surveys in select countries to fill this gap in information, which allowed us to develop ‘COVID-19’ models using more explicit data on population attitudes around COVID-19 and vaccination. The COVID-19 model was applied in Ghana, Kenya, Nigeria, and South Africa. A detailed view of which model and data were used across countries is available below in Table 1. For the purpose of this paper, we focus on the application of the COVID-19 model in Nigeria to highlight the use of attitudinal data for hyperlocal targeting and microplanning.

Table 1 Data source summarya

Methods

In the Methods section we begin with an overview of the multi-step process the project team developed to adapt the WHO’s 3Cs framework to the given country and data source, followed by an explanation of the spatial interpolation process adapted for attitudinal data. Finally, we cover the specifications of the COVID-19 model for Nigeria including selected indicators.

An overview: adapting the WHO’s 3Cs model of vaccine hesitancy

The WHO’s 3Cs framework defined by the SAGE Working Group informed the development of a multi-step model to map the determinants of vaccine hesitancy, namely confidence, complacency, and convenience. As mentioned above, two models, the proxy model and the COVID-19 model, were developed to track these determinants to respond to varying data availability across countries. Below, we provide an overview of the customizable process we developed to model and map each of the 3Cs using either data input.

The 3Cs model development has three broad steps. In the first step, indicators from nationally representative and geo-referenced household survey data are mapped to each of the determinants of vaccine hesitancy such that the drivers of confidence, complacency, and convenience are well represented. These indicators are selected from household survey data, and as such the unit of analysis is the individual. These are considered the inputs to the model.

In the second step, an index is created for each determinant using Multiple Correspondence Analysis (MCA) so that confidence, complacency, and convenience are each individually represented by a single metric instead of several variables. Widely considered a counterpart to Principal Component Analysis (PCA), MCA is similarly a multivariate statistical method to represent underlying characteristics, but of unilaterally categorical data [13]. Since the data identified in step one is largely categorical, any remaining mapped inputs that are continuous are transformed to categorical variables. We confirm the directionality of inputs to match the direction of the index such that, for example, all inputs in the convenience model increase in the direction of higher convenience. We subsequently created a single MCA index to represent each of the 3Cs. Like the inputs, the final indices are also created at the individual level in the survey data, so that we can track variation in confidence, complacency, and convenience across individual respondents.

The construction of the convenience index differs slightly from that of confidence and complacency. The confidence and complacency indices are each developed using all respective inputs in a single index. The convenience index on the other hand is developed as a composite index. Convenience is an amalgamation of several distinct barriers to vaccine uptake, since one may face either accessibility, time, or financial burdens in seeking health care. Respective to context, the latent drivers for each of these burdens may be unrelated and should therefore be indexed separately. As such, the convenience index is the summation of two separate indices: the accessibility index and the time and financial burden index. Each final 3Cs index was normalized so they could be represented as a score ranging between zero and one hundred for easier interpretation, which also allows comparison across the indices.

To enable as much comparability as possible across the project countries, the index inputs were kept consistent as much as possible subject to data availability, respective to the model type. As such, proxy models across countries share the same input variables as collected by the respective DHS and Afrobarometer, while the COVID-19 models share the same inputs as surveyed by the project team.

Spatial interpolation

In the final step of the process the 3Cs indices are spatially interpolated to produce hyperlocal estimates. Spatial interpolation is enabled by an ML algorithm that processes various data, such as cluster-level survey data, high-resolution satellite imagery, and other derived data products that include earth observation and human settlement data and applies a spatial ensemble model to predict the missing survey data across the non-enumerated areas [11]. The final product is a geospatial raster layer where each pixel, represented here as a 1km2 grid, has an associated estimated value.

The spatial interpolation process also includes steps to ensure the validity of the model estimation. The interpolation algorithm includes several stages of cross-validation of training and testing data across several statistical models, which validates grid level estimations, before selecting the final model(s) with the lowest root mean squared error (RMSE). Additionally, the RMSE of the null model, which includes all covariates, is confirmed to be higher than the final ML model RMSE to ensure efficient covariate selection. The next stage of validation confirms robustness of national or sub-national estimations and consists of two steps: calculating the absolute difference between the survey mean and the interpolated raster mean at the level of representativeness of the base survey and identifying the statistical significance of the absolute difference by referencing the confidence intervals of the survey data. Additional checks, such as mean comparison of lower levels of administrative divisions and checking survey cluster distributions are reserved if the layer does not meet the primary validation criteria. A more detailed explanation of the interpolation process is provided in Additional file 1: Appendix A.

To interpolate attitudinal data, updates were made to the spatial interpolation ML model to ensure model performance. Based on a literature review of the leading attitudinal indicator, in this example vaccine hesitancy in Nigeria, we identified additional features to be included as inputs in the ML model. We identified several demographic and socioeconomic indicators that were associated with attitudes around vaccination in Nigeria. Some examples include gender [14], religion [15], and wealth [16]. These indicators were pulled from the Nigeria Demographic and Health Survey 2018 [17] and were first interpolated using our base interpolation process. The output layers served as inputs in the interpolation of the 3Cs indices and any other indicator capturing an opinion or attitude.

Applying the 3Cs model to Nigeria

Household survey

To develop the COVID-19 model, a nationally representative and georeferenced survey was developed by Fraym and administered by an external firm, GeoPoll, in February 2022. The survey was administered as a Computer Assisted Telephonic Interview (CATI) across 4,922 respondents aged 18 and above. The interviews were conducted in English, Hausa, Yoruba, and Igbo.

The survey itself included questions on demographic and socioeconomic characteristics, media consumption, plans to receive the COVID-19 vaccine, and attitudes towards COVID-19 and the COVID-19 vaccine. The survey instrument contained separate sections that asked concerns about accessing the vaccine, perceived risk of the virus, and confidence in the safety and efficacy of the vaccine. These sections were used to develop the confidence, complacency, and convenience indices, respectively.

To improve representativeness of a phone survey, the sample design included both demographic and socioeconomic quotas for sample selection, and the final data were weighted post-processing. The demographic quotas were based on data from the U.S. Census Bureau’s PEPFAR program and were nested by geopolitical zone, age, and sex [18]. Whereas the socioeconomic quotas were based on national rates of bank account ownership and connection to the electrical grid as per the 2018 DHS due to the strength of their relationship with the DHS wealth index. The collected and processed data were finally weighted using an Iterative Proportional Fitting (IPF) algorithm that matched the frequencies of the survey data to reference data, which were nationally representative. The scaling factors used were urbanicity and gender from the United Nations 2022 Revision of World Population Prospects report, and the same socioeconomic asset distribution used for the sample quotas from the DHS [19]. The IPF process achieved convergence within three iterations and prevalence rates were confirmed at the first administrative level.

Selected indicators

To track confidence in the COVID-19 vaccine, we asked respondents about their attitudes toward the safety and efficacy of the vaccine, as well as the vaccination status of people they trust. Vaccine complacency was captured in terms of perceived personal and community risk from COVID-19, via questions regarding belief that COVID-19 is not real, or belief that being healthy precludes concerns about hospitalization. To measure vaccine convenience, we tracked concerns related to accessibility, time, and financial burdens that were associated with receiving the vaccine. To better capture accessibility burdens, we included external data on distance to medium and large health facilities. The location of these health facilities in Nigeria was accessed via a WHO list of health facilities sourced from several government and non-government sources from 50 countries in Sub-Saharan Africa [20]. Spatial friction layers accessed via the Malaria Atlas Project allowed us to estimate least cost walking and driving time to these health facilities which account for travel costs such as slope and terrain [21]. The complete list of inputs can be found in Table 2 below. Our rationale for choosing the indicators and questions below, as well as for aligning them to the 3Cs model, is detailed further in Additional file 2: Appendix B.

Table 2 Selected inputs per 3Cs index

3Cs index terciles

Due to the nature of their construction, the 3Cs indices each reflect a relative score of prevalent concerns. Although the score allows us to compare levels of an index across hyperlocal communities, it does not yield itself to actionable insights on specific populations. As such, prior to interpolation when the indices were constructed using individual level survey data, the indices underwent an additional step where they were spliced into separate quantiles each representing a different level of the determinant. In most cases, including Nigeria, we were able to achieve a maximum of three quantiles, also referred to as terciles. Although there were some cases in other project countries where lack of 3C variation only afforded us 2 quantiles. As such, in the survey data we classified not only the confidence index for an individual, but also whether they were relatively low, medium, or high confidence. These terciles were also interpolated to reveal the proportion of the population who are low confidence at the hyperlocal, state, or national level. Combining these data on population proportions with hyperlocal estimates of population density from WorldPop [22], we can also map the total number of people who are low confidence at various geographic levels. This allows implementers to design programs around the proportion as well as number of people in a community or LGA who have low confidence.

Additional data for hyperlocal targeting

In addition to interpolating the 3Cs indices and their respective terciles, we also interpolated several population characteristics from the household survey, which included demographics, socioeconomics, and media usage. We were also able to identify the locations of public and private COVID-19 vaccination sites using data from The National Primary Health Care Development Agency, a parastatal of the Federal Ministry of Health [23]. These geo-referenced vaccination sites were overlaid on mapped output, but not directly included in index construction or manipulated for analysis. This data can be used to design targeted interventions and media campaigns at the hyperlocal level. Examples of how to use these data in conjunction with the 3Cs are shared in the Discussion section.

Analysis plan

In support of our dual analytical objectives, our statistical analysis plan is disaggregated into the following steps. To verify the reliable estimation of attitudinal data at the hyperlocal level, we confirm the sufficient capture of attitudes in the 3Cs indices and validate the results of the spatial interpolation. We also test the operationalization of the interpolations, such as by identifying the prevalent determinant at the hyperlocal level. This analysis is shared in Results. To better understand the heterogeneity of population characteristics and indicators of interest at the hyperlocal level, and the potential loss of information from increasing levels of aggregation, we explore three examples of hyperlocal variance in population characteristics which can be used for more impactful decision making by local implementers. These examples are shared in the Discussion.

Results

In this section we prioritize our first analytical objective, which is to reliably estimate attitudinal data at the hyperlocal level. We first explore the MCA indices constructed for each of the 3Cs, and the primary drivers of each. This is followed by a summary of the performance and results of the spatial interpolation models of vaccination attitudes.

MCA performance

The Confidence, Complacency, and Convenience (3Cs) MCA indices were each constructed and verified across their inputs to ensure input value addition and expected direction of correlation between the inputs and the respective index. The primary dimensions of all the MCA indices of attitudinal inputs, barring one, surpassed a minimum of 90 percent explained variance. Only the accessibility index, which contained the walking and driving times to health facilities and scooter ownership, achieved a primary dimension that explained 74 percent of variation.

Drivers of the determinants

Since the 3Cs scores are constructed as MCA indices and are composites of survey inputs (Table 2), we can test the pairwise correlations between each index input and the index itself to determine the contribution of inputs to the larger whole (Table 3). Understanding the drivers of determinants can shed light on the most pressing concerns of the population, especially since they can vary across countries and across time as attitudes and priorities change. Identifying these drivers can inform program design by streamlining the focus of interventions.

Table 3 Correlations of 3Cs indices and their inputs

Spatial interpolation model performance

Upon interpolation, the 3Cs index layers underwent standard data quality checks (as described in the Methods section). The final ML models selected for spatial interpolation of the 3Cs indices and respective terciles yielded a root mean squared error below 0.05. Since the survey collected for this project was representative at the first administrative level, i.e., states, we compared the survey and raster means across the thirty-six states of Nigeria. Although the confidence and complacency index interpolations largely passed our validation criteria, the convenience index layer had lower model performance across six states: Adamawa, Akwa Ibom, Bayelsa, Enugu, Gombe, and Nasarawa. As such we advise that caution be applied to estimates from these states.

Interpolation results

Spatial layers for each of the 3Cs and their respective terciles were created at three geographic levels: the state and local government area (LGA) administrative divisions, as well as the hyperlocal (1km2) level. An example of the three views of the confidence index at various levels of granularity is shared in Fig. 1 below.

Fig. 1
figure 1

Mapping the confidence index

We also produced national averages of the 3Cs indices and terciles to compare our subnational data against. We found that these national averages obscure the significant subnational heterogeneity of the 3Cs. For example, at the national level 30% of individuals in Nigeria were “low confidence” with a national score of 59 on the confidence index. Across LGAs, the average proportion of individuals with “low confidence” ranged from 8% all the way up to 60% while the average scores of the confidence index ranged from 36 all the way to 80. This variance increases further at the hyperlocal level as Table 4 demonstrates below.

Table 4 Summary of interpolations: 3Cs indices and respective levels

In addition to subnational heterogeneity, we also investigated intra-state heterogeneity to better understand the level of variance within a given state, especially between LGAs and hyperlocal 1km2 grids. To simplify our analysis across the 3Cs indices, we created a Mean Cumulative Score (MCS) to represent the average range across the 3Cs. We can now compare the MCS across states, and between LGAs and hyperlocal grids. For example, we find that Katsina has the highest (30 MCS) and Bayelsa the lowest (10 MCS) mean heterogeneity at the LGA level. Despite this, Bayelsa has significant intra-LGA heterogeneity, i.e., the hyperlocal grid level (a 34-point range in confidence, 26-point range in complacency, 53-point range in convenience, or 38 MCS). Interestingly the average range in heterogeneity for Bayelsa at the hyperlocal level (38 MCS) is higher even than the state with the most heterogeneity at the LGA level (Katsina state, 30 MCS).

This suggests that the significant amount of hyperlocal heterogeneity found in each 3C index across Nigeria (as presented above in Table 4) is likely due to both inter- and intra-state heterogeneity. Table 5 below summarizes intra-state heterogeneity for both Katsina and Bayelsa states at the LGA and hyperlocal range.

Table 5 Highest and lowest intra-state heterogeneity across 3Cs indices

Prevalent determinant of vaccine hesitancy at the hyperlocal level

In addition to mapping each individual 3Cs determinant, we mapped a community’s prevalent 3Cs determinant, defined as the index with the highest score per grid cell., This required inverting the confidence and convenience indices so that higher scores reflect a decrease in confidence and convenience, respectively. These are paired with the complacency index, where higher scores reflect an increase in complacency attitudes. As seen in Fig. 2 below, the prevalent determinant of vaccine hesitancy for communities across Nigeria is low levels of convenience. The exception is that in urban centers, higher levels of complacency are more prevalent. This is likely related to urban centers typically having greater access to vaccination centers and therefore higher convenience scores. Low rates of confidence are also evident in Abuja and Lagos.

Fig. 2
figure 2

Heatmap of prevalent determinant of vaccine hesitancy at the hyperlocal level

Population characteristics for hyperlocal targeting

Sociodemographic characteristics and media-usage patterns were also interpolated to demonstrate the use of hyperlocal data to target populations according to the prevalent or significant determinants of vaccine hesitancy. A sample of these population characteristics are shared for Lagos in Example 2 in the Discussion section below.

Discussion

Our first analytical objective for this study was to reliably estimate attitudinal data, specifically concerning vaccine hesitancy, at the hyperlocal level. The previous section makes the case that we can successfully build informative metrics to capture vaccine hesitancy and spatially interpolate these data at the 1km2 level. In this section, we would like to address our second objective which is to explore the loss of heterogeneity in our indicators of interest at higher levels of aggregation and discuss the ways in which public health stakeholders can best utilize hyperlocal data for targeted program design and implementation.

Benefits of hyperlocal data

The merits of hyperlocal estimates are evident in the variance of estimates captured at lower geographic levels of data. As noted in Table 4 the 3Cs indices produced at the state level had a range of approximately 24 points on average, whereas our hyperlocal estimates provided a range of approximately 77 points. This can be interpreted as a 52-point loss in variance by using state level data instead of hyperlocal. The same is true when viewing the 3Cs index tercile estimates. Public health stakeholders using traditional models of tracking determinants of vaccine hesitancy at a national level, or even at the first administrative division, are making decisions with imprecise and imperfect information that disguises the true heterogeneity in population characteristics and attitudes.

Hyperlocal estimates, especially at the 1km2 level, are increasingly prevalent within existing public health planning and programming and are already being used to inform microplanning for immunization in many countries including Nigeria. Most notable has been a growing interest in microplanning at the health facility level to improve targeting and programming in respective catchment areas. Data at the 1km2 level is optimal for constructing custom catchment areas. As far back as 2009, the WHO has issued guidance outlining the utility of developing microplans at the health facility level using catchment area analysis to identify hard to reach locations and populations in efforts to increase immunization rates [24]. Since 2017 the WHO Reaching Every District (RED) strategy, though national in scope, has been implemented at the district and health facility levels [25]. And in 2021, the WHO published updated guidance on operational microplanning, specifically for COVID-19 vaccination [26]. In Nigeria specifically, a study of the utility of digital microplanning using geographic information systems (GIS) also used health clinic catchment areas to align with government requirements, which included service distances of 0-2 km, 2-5 km, and 5-10 km [27]. The 1km2 level data produced by this project can be used explicitly and directly to inform the recommended health facility catchment area analysis for effective microplanning, especially in Nigeria where the government’s service distances are factors of 1km2. Even beyond the 1km2, our hyperlocal data can be aggregated to the district level (LGA in Nigeria) which is still much more granular than standard survey estimates, which are representative only at the national or first administrative level, such as the Nigeria 2018 DHS.

Below we share three examples of how our hyperlocal estimates can be used to provide far more granular, nuanced insights than survey data alone.

Example 1: Merits of all 3 convenience indices

The availability of data on COVID-19 vaccination sites from the Federal Ministry of Health in conjunction with the convenience index allowed for an analysis of barriers faced by communities in proximity to vaccinations sites. Contrary to expectation, we found that proximity to a COVID-19 vaccination site was not always a strong predictor of a high convenience score within a community. This disparity was often due to the very different underlying drivers of convenience across accessibility, time, and financial burdens of receiving vaccination. Since the convenience index is a composite of two distinct indices, we can separate convenience concerns across physical accessibility and more subjective perceived time and financial burdens of seeking the vaccine. We found that communities with low travel times to a vaccination center could still have low convenience if they were more likely to be concerned about the time and cost of receiving the vaccine.

For example, although the towns of Iba, Ire, and Eripa in the state of Osun (Fig. 3) are each served by the three COVID-19 vaccination sites, none of the neighborhoods had a convenience index greater than 50, with most of them falling between 15–30 (out of 100). That is, despite high physical proximity to vaccination sites other factors were pushing down their convenience score, such as high scores on the time and financial burden index, with most neighborhoods falling between 70–85.

Fig. 3
figure 3

Heatmaps of convenience, accessibility, and time and financial burden indices in Osun State

The maps above highlight three important findings: (1) convenience can be a significant barrier to vaccination even in communities with COVID-19 vaccination sites, (2) there can be significant heterogeneity in convenience in neighborhoods served by the same vaccination site, and (3) the underlying components of convenience can point in very different directions, so should ideally be investigated concurrently.

Example 2: Profile of high complacency population

In the example below, we highlight population heterogeneity across Lagos state to emphasize (a) variation in complacency rates across adjacent neighborhoods, and (b) variation in characteristics across high complacency populations. We first selected two communities across adjacent LGAs in urban Lagos: Lagos Mainland and Lagos Island (Fig. 4). Although located right next to each other, just 29% of the population in the selected Lagos Mainland neighborhoods have high complacency, compared to 51% in Lagos Island. For reference, the average rate of high complacency individuals is 40% across Lagos state. As seen in Fig. 4, this variation is evident across selected neighborhoods within respective LGAs. This highlights how nearby neighborhoods can have very different opinions about their likelihood of catching, spreading, or experiencing negative health outcomes from COVID-19 and require different messaging campaigns.

Fig. 4
figure 4

Heatmap of high complacency population (%) in Lagos, Nigeria

Further study of each neighborhood’s population characteristics revealed other significant differences. Compared to Lagos Mainland, the more densely populated Lagos Island neighborhoods were less likely to use Facebook or own a radio and more likely to be Catholic and watch television programming on Arewa 24. These and additional characteristics on media usage can be viewed below in Table 6 which contains sample averages of neighborhoods in Lagos Island and Lagos Mainland, as well as the respective difference in means.

Table 6 Summary statistics of population characteristics in Lagos, Nigeria

These differences have significant implications for tailoring both the message and the medium of public health communications or behavior change interventions. For example, messaging aimed at reducing vaccine complacency via television programming on Arewa 24 would likely be more effective in the Lagos Island neighborhoods than the Lagos mainland neighborhoods.

Example 3: Designing a media campaign targeting low confidence populations

One of the more common tools for social behavior change to improve health outcomes is tailored media programming delivered via social media [28], television [29], radio [30], newspaper [31], and even community theater [32]. However, it would be impractical and inefficient to deploy a national level campaign across all major media channels, with media content simultaneously addressing vaccine confidence, complacency, and convenience related issues. Tailored content that is designed to address the determinants of vaccine hesitancy within specific subgroups [33], delivered via the channels they are most likely to receive, and provided in the language they speak at home, will likely utilize resources more efficiently and be more effective at increasing vaccine uptake.

For this reason, we have also provided hyperlocal estimates on several complementary indicators, including the proportion of users among fifteen different social media channels and applications, thirteen different television channels, twelve different radio stations, and eight different newspapers across Nigeria. Paired with the proportion of individuals that speak four different languages at home, the medium of messaging can now be tailored to an unprecedented level.

Using the hyperlocal data on the 3Cs determinants of vaccine hesitancy, language, and media consumption, a public health stakeholder can design a custom media campaign to address low confidence among Hausa speakers across target LGAs. For example, they can devise media content to address medical misinformation regarding vaccine safety, deploy the media campaign in Hausa, and explicitly target smartphone users via Facebook Ads. To do so, we can isolate LGAs with at least 50% native Hausa speakers, that also have majority proportionsFootnote 1 of low confidence individualsFootnote 2 smartphone ownersFootnote 3 and Facebook usersFootnote 4 (Fig. 5).

Fig. 5
figure 5

LGAs best suited for counter-misinformation content in Hausa and Igbo

With these parameters, of the 774 LGAs the media campaign would be deployed in 17 of them across central Nigeria. If we wanted to run the same campaign but target Igbo speakers, we could refine the parameters to include only LGAs with at least 50% native Igbo speakers highlighting 45 separate LGAs across southern Nigeria to deploy the media campaign (Fig. 5). The value of hyperlocal data is evident in the ability to adjust resource allocation and targeting parameters rapidly and carefully for interventions aimed at increasing vaccine uptake.

Study limitations

This section describes potential study and analytical limitations that we would like to note with respect to representativeness and generalizability of both the data we used as well as the data we estimated. We also comment on the limitations of our analytical objectives, which precluded more expansive analyses, but which could serve as avenues for future research. For each challenge, we also describe steps we have taken to mitigate it.

Representativeness of a phone survey

Data for this study was collected by means of phone surveys, which restricted the sampling frame to phone owning populations. However, we do not think this compromised the generalizability of our survey results due the prevalence of phone ownership in Nigeria as well as the survey specifications. As per the DHS, the rate of mobile phone ownership across households in 2018 was roughly 90%, and although male, urban, and richer households were more likely to have a mobile phone, this increased likelihood was marginal and unlikely to skew the distribution.

Moreover, to mitigate any loss of representativeness, the project (a) administered sample quotas for socioeconomic status and nested quotas for age, gender, and state and (b) implemented post-survey reweighting, specifically of primary factors such as age, gender, socioeconomic factors, and urbanicity. These steps are elaborated upon in the Methods section above. As such we believe that sufficient steps were taken to be able to interpolate primary indicators of this research related to COVID-19 vaccine hesitancy and maintain relevance for the general population.

Unrecognized data on vaccination sites

Lack of exhaustive data on COVID-19 vaccination sites, which may have been more plentiful than captured, can contribute to an underestimation of the accessibility index, and therefore also the convenience index. It can also confound the relationship with the perceived time and financial burden index since we currently observe populations that report low levels of physical proximity, but also high levels of perceived convenience. However, data on pop up clinics, roaming community vaccination drives, or other temporary or uncaptured vaccination options were unavailable for our analysis. Moreover, due to their temporary nature, knowledge of them can be less widespread than permanent health facilities and therefore have a less powerful relationship with perceived and overall convenience.

The uniform 3Cs model approach

The multi-national scope of this project led to prioritizing comparability of the 3Cs model across countries rather than customizing each to the local or national context. As discussed in Methods, index inputs varied between the input data sources i.e., DHS and the COVID-19 survey, and resulted in two different models, the Proxy model and COVID-19 Model, respectively. Although, within each model, the inputs were consistent across countries. In this manner, we could compare relative 3Cs indices scores between Nigeria and Kenya.

On the other hand, a limitation is that a country model was not customized or tested for internal validity. This leads to certain challenges, such as the use of scooter ownership as an input in the accessibility index, and thereby the convenience Index. This indicator was included across project countries as a measure of mobility or travel accessibility to a vaccination site even though, in Nigeria scooters are not typically used for private transportation. We can note though that this input was not a statistically significant driver of the final convenience index.

A cost–benefit analysis of hyperlocal targeting

A comparative analysis of the costs and benefits of using standard aggregated household data as opposed to hyperlocal data as put forth by this paper would be incredibly informative. However, this analysis was out of scope for this paper where our objectives were to test spatial interpolation of attitudinal data, and better understand the benefit of hyperlocal heterogeneity for programmatic decision making. We do believe that it would be a fascinating avenue for future research.

Prior to this study, we have observed that the cost–benefit trade-off in collecting and generating hyperlocal data is becoming increasingly favorable for large-scale, well-funded public health programs. This is primarily because data collection and generation costs using spatial interpolation and machine learning methods are not particularly sensitive to changes in health program scale or structure but remain fairly static. Many multi-year, multi-million USD donor-funded public health programs are already contributing significant resources to baseline, midline, and endline evaluations. In addition, as outlined in our research, these hyperlocal estimates provide community level insights across the entire country. With great inter- and intra-donor funded project coordination, nationally representative and georeferenced survey data with sufficient sample sizes to inform hyperlocal estimates could be afforded with a negligible impact on program cost. There is also an opportunity to use hyperlocal estimates to more efficiently and effectively allocate public health resources. Thus, the cost–benefit ratio derived from increasingly precise health programming and resource allocation will become increasingly beneficial for larger scale public health programs.

Heterogeneity within survey clusters

There are expected cases where an indicator of interest may vary within a single high-density urban cluster. For example, a high-density urban cluster may include some rich households and some poor living alongside each other. In such cases, the effect of the respective data points (or households) within this cluster may average into one another. The statistical contribution of that cluster overall will be lowered since its predictive power of the indicator of interest, such as socioeconomic status, decreases.

However, this ‘averaged’ cluster affects all grid estimates equally since the spatial interpolation model does not estimate a grid’s values based on its nearest neighbor but rather is a function of model covariates from the full sample, i.e., all clusters. We expect that on average a sufficient number of remaining urban clusters capture exclusive levels of the indicator of interest, especially since areas with higher heterogeneity (or diversity) such as cities, are more likely to be represented by a higher number of clusters. The extent of the ‘averaging’ is finally also tested via the cross-validation process. As such we can be assured that any model that passes the interpolation checks, produces final estimates that minimize the effects of averaged grids.

Despite the limitation of the ‘averaging’ effect, it is important to make the distinction that some ‘averaging’ at the 1km2 level is certainly no more erroneous, any probably less so, than greater ‘averaging’ at the national or first administrative level.

Conclusions

The primary aim of our research has been in support of increasing vaccine uptake. To this end, we focused our work on developing hyperlocal estimates which could better inform local implementers and increase their ability to respond in a more targeted and impactful manner to community concerns and barriers. Our analytical objectives, as shared previously, were to (1) demonstrate the reliable estimation of attitudinal data at the hyperlocal level, and (2) better understand the loss of heterogeneity at higher levels of aggregation that are currently more common in making programmatic decisions.

The potential for mapping attitudinal data at the hyperlocal level via spatial interpolation is broad. Similar techniques could be used to understand health attitudes across many different public health concerns, particularly in low- and middle-income countries where reliable case data is scarcer. Metrics of health attitudes and behaviors that measure disease-related stigma, attitudes toward routine immunization and reproductive care, or perceived risk of illness at the hyperlocal level can provide insights that will fundamentally change how program planning decisions are made in a public health context.

Our research demonstrates that there is significant geographic heterogeneity across the determinants of COVID-19 vaccine hesitancy that is not captured by nationally representative survey data. By building on previous research producing hyperlocal estimates of demographic and socioeconomic indicators, we demonstrate the ability to also map attitudinal data down to one square kilometer. Using the WHO endorsed 3Cs framework of vaccine hesitancy, these attitudinal indicators allow us to better understand the unique determinants of vaccine hesitancy across an entire country at the neighborhood or community level. Given the difficulties of increasing COVID-19 vaccine uptake among hesitant populations, a better understanding of this variability will allow for more targeted communication and tailored interventions aimed at changing health attitudes and behaviors.

We see the strongest use cases for these data in local decision-making. National- or regional-level data could be used to make decisions regarding, for example, vaccine allocation at a state level. However, regional-level data does not help local program managers make decisions within their respective areas. As our data show, there are appreciable differences in each of the “3Cs” within many if not most states.

Our study contributes to existing research that finds recurring large, nationally representative, geotagged household surveys, such as the Nigeria DHS, can be used to reliably produce hyperlocal estimates of attitudinal data, which—up to this point, has not been done. We recognize the prevailing lack of available attitudinal data in such surveys and advocate for the inclusion of standardized survey questions pertaining to vaccine hesitancy in future rounds. Recognizing practicalities around survey length and country context, the most recently published WHO Behavioral and Social Drivers of Vaccination (BeSD) guidance has standardized a smaller number of priority questions and indicators that would serve this purpose well [34]. This BeSD guidance already delineates between priority, main, and optional survey questions for both childhood and COVID-19 vaccination so afford both flexibility and comparability across geographies and timelines. At the time of our research, the BeSD guidance was still being finalized.

Finally, we see opportunity for the use of hyperlocal attitudinal data in other public health contexts as well, aside from vaccine uptake support during the COVID-19 pandemic. Perceptions of access to medicine, as well as acceptance of ever-evolving medical advice and practices, may contribute significantly to the success or failure of other public health programs supporting populations with other communicable and non-communicable diseases. These hyperlocal data can support programmatic decision-making at a local level - where public health practitioners often work - not just national or regional levels, so that health program managers can be increasingly precise in their program implementations. We hope that improved availability of more granular data can be used to support public health programming worldwide.

Availability of data and materials

The 2018 Nigeria Demographic and Health Survey is available via the DHS Program website, https://dhsprogram.com/data/dataset/Nigeria_Standard-DHS_2018.cfm. The market research survey data collected in February 2022 are not publicly available but are available from the corresponding author on reasonable request. The hyperlocal estimates produced across the ten different countries are not publicly available but are being made available to authorized users through a password protected web-based data visualization dashboard. To receive access, users must be, or be directly supporting, verified public health stakeholders working towards increasing COVID-19 vaccine uptake in one of the ten countries.

Notes

  1. A majority proportion is defined here as the top 3 quintiles, or top 60%, of data.

  2. The minimum threshold for a majority proportion of low confidence at the LGA level was identified as 27 percent.

  3. The minimum threshold for a majority proportion of smartphone owners at the LGA level was identified as 85 percent.

  4. The minimum threshold for a majority proportion of Facebook usership at the LGA level was identified as 60 percent.

Abbreviations

1km2 :

One square kilometer

BeSD:

Behavioral and Social Drivers of Vaccination

3Cs:

Confidence, complacency, convenience

COVID-19:

Coronavirus disease 2019

CATI:

Computer assisted telephone interviewing

DHS:

Demographic and Health Survey

IPF:

Iterative proportional fitting

J&J GPH:

Johnson and Johnson’s Global Public Health

LMICs:

Low- and middle-income Countries

MoH:

Ministry of Health

MCA:

Multiple correspondence analysis

ML:

Machine learning

PCA:

Principal component analysis

PEPFAR:

The United States President’s Emergency Plan for AIDS Relief

SAGE:

Strategic Advisory Group of Experts on Immunization

WHO:

World Health Organization

References

  1. John Hopkins University. Johns Hopkins Coronavirus Resource Center. Johns Hopkins Coronavirus Resource Center. Johns Hopkins University & Medicine; 2022. Available from: https://coronavirus.jhu.edu/map.html.

  2. Watson OJ, Barnsley G, Toor J, Hogan AB, Winskill P, Ghani AC. Global impact of the first year of COVID-19 vaccination: a mathematical modelling study. Lancet Infect Dis. 2022;22(9):1293–302.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Machingaidze S, Wiysonge CS. Understanding COVID-19 vaccine hesitancy. Nat Med. 2021;27(8):1338–9.

    Article  CAS  PubMed  Google Scholar 

  4. MacDonald NE. Vaccine hesitancy: definition, scope and determinants. Vaccine. 2015;33(34):4161–4.

    Article  PubMed  Google Scholar 

  5. Gerretsen P, Kim J, Caravaggio F, Quilty L, Sanches M, Wells S, et al. Individual determinants of COVID-19 vaccine hesitancy. Inbaraj LR, editor. PLoS One. 2021;16(11):e0258462.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Barkay N, Cobb C, Eilat R, Galili T, Haimovich D, LaRocca S, et al. Weights and Methodology Brief for the COVID-19 Symptom Survey by University of Maryland and Carnegie Mellon University, in Partnership with Facebook. arXiv:200914675. 2020. Cited 2022 Oct 31. Available from: https://arxiv.org/abs/2009.14675.

  7. Group D. Delphi’s COVID-19 Trends and Impact Surveys (CTIS). delphi.cmu.edu; 2020. Updated 2022. Cited 2022 Oct 28. Available from: https://delphi.cmu.edu/covid19/ctis/.

  8. Kreuter F, Barkay N, Bilinski A, Bradford A, Chiu S, Eliat R, et al. Partnering with a global platform to inform research and public policy making. Surv Res Methods. 2020. Available from: https://madoc.bib.uni-mannheim.de/58686/1/7761-Article%20Text-25251-3-10-20200603.pdf.

  9. Bradley VC, Kuriwaki S, Isakov M, Sejdinovic D, Meng X-L, Flaxman S. Unrepresentative big surveys significantly overestimated US vaccine uptake. Nature. 2021;600(7890):695–700. Cited 2022 Jan 3. Available from: https://www.nature.com/articles/s41586-021-04198-4.

  10. Davies MM, Van Der Laan MJ. Optimal spatial prediction using ensemble machine learning. Int J Biostat. 2016;12(1):179–201.

    Article  PubMed  Google Scholar 

  11. Brugh KN, Lewis Q, Haddad C, Kumaresan J, Essam T, Li MS. Characterizing and mapping the spatial variability of HIV risk among adolescent girls and young women: a cross-county analysis of population-based surveys in Eswatini, Haiti, and Mozambique. PLoS One. 2021;16(12):e0261520.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Gething PW, Tatem AJ, Bird T, Burgert C. Creating spatial interpolation surfaces with DHS data: DHS spatial analysis. Rockville (MD): ICF International; 2015. 86 p. Report No.:11.

  13. Kempf-Leonard K. Encyclopedia of social measurement, vol. 1. Amsterdam: Elsevier; 2005.

    Google Scholar 

  14. Brackstone K, Atengble K, Head MG, Boateng LA. COVID-19 Vaccine Hesitancy in Ghana: The Roles of Political Allegiance, Misinformation Beliefs, and Sociodemographic Factors. 2022.

  15. Archibong B, Annan F. What do Pfizer’s 1996 drug trials in Nigeria teach us about vaccine hesitancy?. Brookings. Brookings; 2021. Cited 2022 Jan 20. Available from: https://www.brookings.edu/blog/africa-in-focus/2021/12/03/what-do-pfizers-1996-drug-trials-in-nigeria-teach-us-about-vaccine-hesitancy/.

  16. Taylor S, Khan M, Muhammad A, Akpala O, van Strien M, Morry C, et al. Understanding vaccine hesitancy in polio eradication in northern Nigeria. Vaccine. 2017;35(47):6438–43.

    Article  PubMed  Google Scholar 

  17. The DHS Program - Nigeria: Standard DHS, 2018. dhsprogram.com. Available from: https://dhsprogram.com/methodology/survey/survey-display-528.cfm.

  18. Bureau UC. Subnational Population Data by Geographic Area. Census.gov. Cited 2023 Jan 26. Available from: https://www.census.gov/geographies/mapping-files/time-series/demo/international-programs/subnationalpopulation.html.

  19. World Population Prospects 2022, Online Edition. United Nations, Department of Economic and Social Affairs, Population Division; 2022. Available from: https://population.un.org/wpp/Download/Standard/Population/.

  20. Humanitarian Data Exchange. Health Facilities in sub-Saharan Africa [data files]. data.humdata.org. 2020. Updated 2022. Cited 2022 Oct 1. Available from: https://data.humdata.org/dataset/health-facilities-in-sub-saharan-africa.

  21. The Malaria Atlas Project. Global Motorized Friction Surface and Global Walking Only Friction Surface [data files]. MAP. 2019. Cited 2022 Oct 28. Available from: https://data.malariaatlas.org/.

  22. Worldpop. The spatial distribution of population density in 2020, Nigeria [data file]. Center for International Earth Science Information Network (CIESIN), Columbia University: New York, NY; 2020. Cited 2022 Oct 28. Available from: https://hub.worldpop.org/geodata/summary?id=42923.

  23. Vaccination.gov.ng. 2021. Cited 2022 Oct 28. Available from: https://nphcda.vaccination.gov.ng/vaccination-sites.

  24. World Health Organization. Microplanning for immunization service delivery using the Reaching Every District (RED) strategy. apps.who.int. 2009. Cited 2023 Jan 25. Available from: https://apps.who.int/iris/handle/10665/70450.

  25. Reaching Every District (RED) - A guide to increasing coverage and equity in all communities in the African Region. WHO Regional Office for Africa. 2017. Available from: https://www.afro.who.int/publications/reaching-every-district-red-guide-increasing-coverage-and-equity-all-communities.

  26. Guidance on operational microplanning for COVID-19 vaccination. www.who.int. 2021. Cited 2023 Jan 25. Available from: https://www.who.int/publications/i/item/WHO-2019-nCoV-vaccination-microplanning-2021.1.

  27. Dougherty L, Abdulkarim M, Mikailu F, Tijani U, Owolabi K, Gilroy K, et al. From paper maps to digital maps: enhancing routine immunisation microplanning in Northern Nigeria. BMJ Glob Health. 2019;4(Suppl 5):e001606.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Evans WD, Abroms LC, Broniatowski D, Napolitano M, Arnold J, Ichimiya M, et al. Digital Media for behavior change: review of an emerging field of study. Int J Environ Res Public Health. 2022;19(15):9129. Cited 2022 Oct 1. Available from: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9331057/.

  29. Ahmed I. Television and women’s reproductive behavior: evidence from Uganda. J Media Econ. 2022;34(3):135–51.

    Article  Google Scholar 

  30. Diddi P, Kumble S, Shen F. Efficacy of radio entertainment education in promoting health behavior: a meta-analysis. J Radio Audio Media. 2023;30(1):387–407.

  31. Noar SM, Benac CN, Harris MS. Does tailoring matter? Meta-analytic review of tailored print health behavior change interventions. Psychol Bull. 2007;133(4):673–93.

    Article  PubMed  Google Scholar 

  32. Pleasant A, de Quadros A, Pereira-León M, Cabe J. A qualitative first look at the arts for behavior change program: theater for health. Arts Health. 2014;7(1):54–64.

    Article  Google Scholar 

  33. Butler R, MacDonald NE. Diagnosing the determinants of vaccine hesitancy in specific subgroups: the guide to Tailoring Immunization Programmes (TIP). Vaccine. 2015;33(34):4176–9.

    Article  PubMed  Google Scholar 

  34. World Health Organization. Behavioural and social drivers of vaccination: tools and practical guidance for achieving high uptake. apps.who.int. World Health Organization; 2022. Available from: https://apps.who.int/iris/handle/10665/354459.

  35. Hagen-Zanker A. A computational framework for generalized moving windows and its application to landscape pattern analysis. Int J Appl Earth Obs Geoinf. 2016;44:205–16.

    Google Scholar 

  36. Ghojogh B, Crowley M. The Theory Behind Overfitting, Cross Validation, Regularization, Bagging, and Boosting: Tutorial. arXiv:190512787. 2019. Available from: https://arxiv.org/abs/1905.12787.

  37. Department of Immunization, Vaccines and Biologicals (IVB) SAGE meeting Strategic Advisory Group of Experts on Immunization 4-8 October 2021 Virtual meeting, WHO: Geneva, Switzerland. Available from: https://terrance.who.int/mediacentre/data/sage/SAGE_eYB_Oct2021.pdf.

Download references

Acknowledgements

The authors would like to thank several members of the Fraym Analytics team for their work on adapting the 3Cs model of vaccine hesitancy across ten countries using several data streams: Ilse Paniagua for efficient management of a multi-year multi-country project; Quinn Lewis for instrumental technical guidance; Alyssa Staats, Carlos Chua, Dominic Palazzolo, Sean Walsh, Varsha Sivaram, and Yulia Korableva for their careful work on model development and data analysis; and Summer Wilson for data dashboard development. Additionally, Marissa Block, Sean Walsh, and Quinn Lewis for proofreading and diligent review.

We would also like to thank Lauren Marks and Caroline Baratz for their essential role in facilitating partnerships, onboarding users, and ensuring the data this research is based on was shared with key public health stakeholders around the world; and Janice April and Jason Betik for their technical guidance and overall support for this research.

Additionally, we want to acknowledge OpenStreetMap (OSM) for providing access to the map data shown in Figures 1, 3, 4, and 5. OSM is open data, licensed under the Open Data Commons Open Database License (ODbL) by the OpenStreetMap Foundation (OSMF). More information on OSM can be found at https://www.openstreetmap.org/copyright and more information on the ODbL can be found at https://opendatacommons.org/.

About this supplement

This article has been published as part of BMC Proceedings, Volume 17 Supplement 7, 2023: VARN2022: Shaping Global Vaccine Acceptance with Localized Knowledge. The full contents of the supplement are available at https://bmcproc.biomedcentral.com/articles/supplements/volume-17-supplement-7.

Additional information

Fraym is a small business and technology company headquartered in Arlington, VA. Fraym generates granular data about population characteristics, attitudes, and behaviors at any geographic level, including down to the community or neighborhood level. Organizations around the world use Fraym’s data to help solve the challenges of climate change, inequality and insecurity, global health, energy access, and more.

Fraym remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Funding

The project this research is based on was funded by Johnson and Johnson’s Global Public Health team as part of an initiative to support COVID-19 vaccination in sub-Saharan Africa across ten countries: Ethiopia, Ghana, Kenya, Malawi, Mali, Nigeria, Rwanda, South Africa, Uganda, and Zambia. The research regarding attitudes toward vaccination described here are not specific to any one brand of COVID-19 vaccine. The publication cost of this article was funded by the Sabin Vaccine Institute.

The funders had no role in the final design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, and approval of the manuscript; or decision to submit the manuscript for publication – with the sole exception of article author J.D., an employee of Johnson and Johnson, who independently contributed as described under the Author Information section below.

Author information

Authors and Affiliations

Authors

Contributions

R.B. led the design and conduct of the study and management, analysis, and interpretation of the data. K.D. contributed to the design of the study and interpretation of the data. R.B. and K.D. jointly led the preparation, writing, and review of all sections within the manuscript. J.D. developed the initial research goal and objectives as well as provided ongoing feedback for the project this research was based on. After the project and the subsequent research study were complete, J.D. contributed substantially to the preparation, writing, and review of all sections within the manuscript. All authors have read and approved the final manuscript. The Analytics unit at Fraym designed the study, developed the vaccine hesitancy model, and analyzed the data. GeoPoll, a remote mobile-based survey implementer administered the household survey.

Corresponding author

Correspondence to Kenneth Davis.

Ethics declarations

Ethics approval and consent to participate

Ethics approval was not acquired for this research because this was a retrospective secondary analysis of an anonymized market research survey procured through a third-party firm, GeoPoll, for purposes outside the scope of this research. The researchers were never in possession of any personally identifiable information (PII) and did not deploy any interventions through or based on this research. Our primary research question was whether we could produce reliable hyperlocal estimates of attitudinal data, and the answers provided by the survey respondents were not relevant to answering this research question.

The survey was administered by GeoPoll to adults over the age of 18 in English, Hausa, Yoruba, and Igbo. Participants were requested to select their preferred language, prior to asking for their consent to participate. Personal identifiable information (PII) data was neither collected, nor retained. The survey did not ask any sensitive questions about existing health conditions. GeoPoll follows industry guidelines surrounding research ethics, including those set by the European Society of Opinion and Marketing Research (ESOMAR), of which they are an active member.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publishers Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

A detailed explanation of the spatial interpolation process. An explanation of Fraym’s interpolation process is detailed further in Appendix A. A summary explanation of the interpolation process used by Fraym can also be found within the article by Brugh et al. coauthored by individuals also employed by Fraym. Some of these authors supported this research and served as reviewers to this article and are highlighted under Acknowledgements. The authors felt the interpolation process is a critical piece of context for this study, but not the subject of the study, so included this explanation as additional data file 1 rather than include it within the body of the article or rely solely on a reference to the previous publication. Sources [35] and [36] are only referred to in Appendix A, not within the manuscript.

Additional file 2:

Benefits of the 3Cs model and indicator selection rationale. Our rationale for choosing the indicators and questions included in this study, as well as for aligning them to the 3Cs model, is detailed further in Appendix B. The authors believe this rationale is important to include, but not essential for and potentially distracting from understanding the study Methods section, so included this rationale as additional data file 2. Source [37] is only referred to in Appendix B, not within the manuscript.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Behal, R., Davis, K. & Doering, J. A novel adaptation of spatial interpolation methods to map health attitudes related to COVID-19. BMC Proc 17 (Suppl 7), 17 (2023). https://doi.org/10.1186/s12919-023-00264-z

Download citation

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12919-023-00264-z

Keywords